Query lcl|NC_019514.1_cdsid_YP_007005821.1 [gene=F410_gp085] [protein=major capsid protein] [protein_id=YP_007005821.1] [location=complement(54393..55592)]
Match_columns 399
No_of_seqs 30 out of 33
Neff 4.4
Searched_HMMs 1612
Date Thu Nov 7 16:32:01 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_85 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_85_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:95875 Length: 401 100.0 1E-176 9E-180 984.7 29.4 395 5-399 1-401 (401)
2 protein:vir:105334 Length: 276 99.7 1.8E-18 1.1E-21 117.9 16.2 270 1-399 1-271 (276)
3 protein:vir:95898 Length: 274 99.6 3.3E-17 2E-20 110.9 16.0 270 1-399 1-271 (274)
4 protein:vir:96262 Length: 274 99.6 3.3E-17 2E-20 110.9 16.0 270 1-399 1-271 (274)
5 protein:vir:3613 Length: 272 # 99.6 3.8E-17 2.4E-20 110.6 16.2 271 1-398 1-272 (272)
6 protein:vir:96123 Length: 274 99.6 1.1E-16 7E-20 108.0 16.8 270 1-399 1-271 (274)
7 protein:vir:1239 Length: 274 # 99.6 1.2E-16 7.5E-20 107.8 16.4 270 1-399 1-271 (274)
8 protein:vir:97433 Length: 274 99.6 2E-16 1.2E-19 106.7 17.5 270 1-399 1-272 (274)
9 protein:vir:94494 Length: 274 99.6 2E-16 1.2E-19 106.7 17.5 270 1-399 1-272 (274)
10 protein:vir:93696 Length: 364 99.6 6.7E-16 4.2E-19 103.8 20.1 313 1-399 1-362 (364)
11 protein:vir:80930 Length: 278 99.6 2.4E-16 1.5E-19 106.3 16.5 277 1-399 1-278 (278)
12 protein:vir:96833 Length: 275 99.6 2.1E-16 1.3E-19 106.5 15.8 271 1-399 1-273 (275)
13 protein:vir:94622 Length: 341 99.5 1E-15 6.4E-19 102.8 19.1 324 6-399 1-340 (341)
14 protein:vir:93742 Length: 274 99.5 3.9E-16 2.4E-19 105.1 16.7 270 1-399 1-272 (274)
15 protein:vir:9820 Length: 272 # 99.5 2E-15 1.2E-18 101.2 16.3 269 1-399 1-270 (272)
16 protein:vir:3033 Length: 272 # 99.5 2E-15 1.2E-18 101.2 16.3 269 1-399 1-270 (272)
17 protein:vir:739 Length: 231 # 99.4 2E-14 1.3E-17 95.7 13.2 231 53-398 1-231 (231)
18 protein:vir:105822 Length: 273 99.3 2.5E-13 1.5E-16 89.7 15.0 269 6-398 1-273 (273)
19 protein:vir:102605 Length: 273 99.3 2.5E-13 1.5E-16 89.7 15.0 269 6-398 1-273 (273)
20 protein:vir:95107 Length: 270 99.2 3.2E-13 2E-16 89.1 14.2 264 1-399 1-265 (270)
21 protein:vir:7990 Length: 273 # 99.2 4.6E-13 2.9E-16 88.2 14.6 271 6-398 1-273 (273)
22 protein:vir:78739 Length: 332 99.2 6E-12 3.7E-15 82.1 20.2 318 1-396 1-332 (332)
23 protein:vir:95763 Length: 297 99.2 9.3E-13 5.8E-16 86.5 15.4 290 1-399 1-297 (297)
24 protein:vir:80213 Length: 334 99.2 1.2E-11 7.5E-15 80.4 21.4 307 8-399 1-333 (334)
25 protein:vir:80180 Length: 381 99.2 3.9E-12 2.4E-15 83.1 17.0 314 1-399 1-343 (381)
26 protein:vir:105610 Length: 430 99.2 4.6E-11 2.8E-14 77.3 21.8 345 12-399 1-425 (430)
27 protein:vir:10450 Length: 344 99.1 6E-11 3.7E-14 76.6 21.5 322 1-398 1-344 (344)
28 protein:vir:41 Length: 299 # N 99.1 9.6E-12 5.9E-15 81.0 16.2 291 6-399 1-299 (299)
29 protein:vir:105905 Length: 304 99.1 6.8E-12 4.2E-15 81.8 15.0 289 1-397 1-304 (304)
30 protein:vir:94142 Length: 304 99.1 6.8E-12 4.2E-15 81.8 15.0 289 1-397 1-304 (304)
31 protein:vir:10123 Length: 404 99.1 3.1E-10 1.9E-13 72.7 23.4 337 1-399 1-404 (404)
32 protein:vir:104439 Length: 404 99.1 3.1E-10 1.9E-13 72.7 23.4 337 1-399 1-404 (404)
33 protein:vir:3298 Length: 404 # 99.1 3.1E-10 1.9E-13 72.7 23.4 337 1-399 1-404 (404)
34 protein:vir:819 Length: 404 # 99.1 3.1E-10 1.9E-13 72.7 23.4 337 1-399 1-404 (404)
35 protein:vir:2201 Length: 345 # 99.1 1.9E-10 1.2E-13 73.8 21.3 320 1-398 1-345 (345)
36 protein:vir:8885 Length: 347 # 99.0 1.2E-10 7.5E-14 75.0 19.6 321 1-399 1-347 (347)
37 protein:vir:7771 Length: 330 # 99.0 3.3E-11 2.1E-14 78.0 16.4 306 1-399 1-324 (330)
38 protein:vir:103323 Length: 364 99.0 8.4E-11 5.2E-14 75.8 18.0 315 6-399 1-340 (364)
39 protein:vir:108303 Length: 418 99.0 5.8E-11 3.6E-14 76.7 16.6 302 1-399 1-418 (418)
40 protein:vir:96223 Length: 324 99.0 2.8E-11 1.7E-14 78.5 14.3 293 1-399 1-316 (324)
41 protein:vir:9309 Length: 324 # 99.0 5.8E-11 3.6E-14 76.7 15.3 292 1-399 17-316 (324)
42 protein:vir:1541 Length: 347 # 99.0 2.7E-10 1.7E-13 73.1 18.9 321 1-399 1-345 (347)
43 protein:vir:94576 Length: 347 99.0 2.6E-10 1.6E-13 73.1 18.8 320 1-398 1-347 (347)
44 protein:vir:100247 Length: 425 99.0 3.9E-11 2.4E-14 77.6 13.6 292 1-399 120-425 (425)
45 protein:vir:8187 Length: 311 # 98.9 2.6E-10 1.6E-13 73.1 17.6 299 1-399 1-311 (311)
46 protein:vir:485 Length: 407 # 98.9 8.3E-11 5.1E-14 75.9 14.1 290 1-399 90-401 (407)
47 protein:vir:99749 Length: 324 98.9 1.6E-10 1E-13 74.2 15.6 292 1-399 1-316 (324)
48 protein:vir:97148 Length: 324 98.9 1.7E-10 1E-13 74.2 15.3 293 1-399 1-316 (324)
49 protein:vir:3364 Length: 347 # 98.9 2.4E-09 1.5E-12 67.9 21.2 320 1-399 1-345 (347)
50 protein:vir:4339 Length: 395 # 98.9 3.2E-10 2E-13 72.6 15.9 283 1-398 101-395 (395)
51 protein:vir:6324 Length: 335 # 98.9 4.2E-09 2.6E-12 66.5 21.5 317 1-399 1-328 (335)
52 protein:vir:103955 Length: 324 98.9 4.6E-10 2.8E-13 71.8 16.2 292 1-399 1-316 (324)
53 protein:vir:104085 Length: 320 98.9 3.5E-10 2.2E-13 72.4 15.3 302 1-399 1-319 (320)
54 protein:vir:2344 Length: 397 # 98.8 6.6E-10 4.1E-13 70.9 16.2 297 6-399 1-307 (397)
55 protein:vir:2770 Length: 318 # 98.8 1.1E-09 6.8E-13 69.7 17.3 262 1-324 1-318 (318)
56 protein:vir:3136 Length: 322 # 98.8 5.7E-10 3.5E-13 71.3 15.6 304 6-399 1-320 (322)
57 protein:vir:81100 Length: 415 98.8 1.8E-09 1.1E-12 68.5 18.2 290 1-399 113-405 (415)
58 protein:vir:79987 Length: 415 98.8 1.8E-09 1.1E-12 68.5 18.2 290 1-399 113-405 (415)
59 protein:vir:98339 Length: 415 98.8 1.8E-09 1.1E-12 68.5 18.2 290 1-399 113-405 (415)
60 protein:vir:104256 Length: 458 98.8 1.2E-09 7.6E-13 69.4 17.1 299 1-398 143-458 (458)
61 protein:vir:94711 Length: 347 98.8 4.1E-09 2.5E-12 66.6 19.9 319 1-399 1-347 (347)
62 protein:vir:4226 Length: 326 # 98.8 7.1E-10 4.4E-13 70.7 15.5 305 1-399 1-324 (326)
63 protein:vir:94771 Length: 298 98.8 1.3E-09 7.9E-13 69.3 16.8 284 16-397 1-298 (298)
64 protein:vir:100057 Length: 375 98.8 1.8E-09 1.1E-12 68.6 17.4 325 1-399 8-371 (375)
65 protein:vir:4456 Length: 401 # 98.8 2.2E-10 1.3E-13 73.6 12.0 289 1-398 91-401 (401)
66 protein:vir:80684 Length: 315 98.8 3.6E-09 2.2E-12 66.9 18.1 299 1-399 1-308 (315)
67 protein:vir:99675 Length: 324 98.8 3.9E-09 2.4E-12 66.7 18.2 273 50-399 1-297 (324)
68 protein:vir:78935 Length: 335 98.8 1.3E-08 8.3E-12 63.8 21.1 315 1-399 1-329 (335)
69 protein:vir:4511 Length: 409 # 98.8 8E-10 4.9E-13 70.5 14.3 294 1-399 96-407 (409)
70 protein:vir:102655 Length: 322 98.8 1.2E-08 7.2E-12 64.1 20.7 315 6-399 1-322 (322)
71 protein:vir:78830 Length: 324 98.7 1.2E-09 7.3E-13 69.6 15.0 293 1-399 1-316 (324)
72 protein:vir:96392 Length: 324 98.7 1.2E-09 7.3E-13 69.6 15.0 293 1-399 1-316 (324)
73 protein:vir:97053 Length: 390 98.7 1.1E-09 7.1E-13 69.6 14.5 285 1-396 95-390 (390)
74 protein:vir:9410 Length: 415 # 98.7 1.4E-09 8.9E-13 69.1 14.7 293 1-399 110-405 (415)
75 protein:vir:4997 Length: 397 # 98.7 4.5E-09 2.8E-12 66.3 17.3 285 1-399 98-386 (397)
76 protein:vir:78523 Length: 338 98.7 4E-09 2.5E-12 66.6 16.9 310 1-399 1-336 (338)
77 protein:vir:9759 Length: 303 # 98.7 4.8E-09 3E-12 66.2 17.3 292 12-399 1-303 (303)
78 protein:vir:99920 Length: 311 98.7 1.7E-09 1E-12 68.7 14.6 296 1-397 1-311 (311)
79 protein:vir:4830 Length: 397 # 98.7 6.8E-09 4.2E-12 65.4 17.9 285 1-399 98-386 (397)
80 protein:vir:102119 Length: 404 98.7 3.9E-09 2.4E-12 66.7 16.4 297 1-399 92-401 (404)
81 protein:vir:2430 Length: 318 # 98.7 2E-09 1.2E-12 68.3 14.7 301 1-399 1-314 (318)
82 protein:vir:9574 Length: 300 # 98.7 8.9E-09 5.5E-12 64.7 18.1 289 1-398 1-300 (300)
83 protein:vir:4700 Length: 415 # 98.7 4.3E-09 2.7E-12 66.5 15.8 295 1-399 109-405 (415)
84 protein:vir:4600 Length: 415 # 98.7 4.3E-09 2.7E-12 66.5 15.8 295 1-399 109-405 (415)
85 protein:vir:100135 Length: 418 98.7 3.5E-09 2.2E-12 67.0 15.3 289 1-399 116-416 (418)
86 protein:vir:1583 Length: 351 # 98.7 1.6E-08 9.9E-12 63.3 18.7 293 1-399 1-308 (351)
87 protein:vir:78223 Length: 333 98.6 1.1E-08 7E-12 64.1 17.6 311 1-398 1-333 (333)
88 protein:vir:1886 Length: 385 # 98.6 4.4E-09 2.8E-12 66.4 15.2 286 1-399 91-385 (385)
89 protein:vir:191 Length: 385 # 98.6 4.4E-09 2.8E-12 66.4 15.2 286 1-399 91-385 (385)
90 protein:vir:4953 Length: 397 # 98.6 1.5E-08 9.1E-12 63.5 17.9 286 1-399 98-386 (397)
91 protein:vir:81070 Length: 390 98.6 4.8E-09 3E-12 66.2 14.6 285 1-396 95-390 (390)
92 protein:vir:1638 Length: 298 # 98.6 1.1E-08 6.6E-12 64.3 16.5 284 1-397 1-298 (298)
93 protein:vir:99075 Length: 392 98.6 6E-09 3.7E-12 65.7 14.7 295 1-399 1-311 (392)
94 protein:vir:1268 Length: 397 # 98.6 1.2E-08 7.6E-12 63.9 16.3 282 1-398 102-397 (397)
95 protein:vir:3845 Length: 395 # 98.6 1.3E-08 8.3E-12 63.8 16.5 284 1-399 93-384 (395)
96 protein:vir:8102 Length: 543 # 98.6 1.4E-08 8.4E-12 63.7 16.1 300 1-399 237-543 (543)
97 protein:vir:4856 Length: 293 # 98.5 4.9E-08 3.1E-11 60.6 18.3 278 7-399 1-282 (293)
98 protein:vir:5974 Length: 324 # 98.5 8.5E-08 5.3E-11 59.3 19.1 268 1-399 1-279 (324)
99 protein:vir:105522 Length: 423 98.5 2E-08 1.3E-11 62.7 15.2 295 1-398 1-423 (423)
100 protein:vir:81227 Length: 413 98.5 3.6E-08 2.2E-11 61.4 16.4 298 1-399 106-411 (413)
101 protein:vir:105374 Length: 423 98.5 6.8E-08 4.2E-11 59.9 17.7 299 1-399 1-339 (423)
102 protein:vir:97031 Length: 402 98.4 6.3E-07 3.9E-10 54.6 22.2 314 6-399 1-336 (402)
103 protein:vir:3525 Length: 423 # 98.4 8.9E-08 5.5E-11 59.2 16.8 296 1-399 1-339 (423)
104 protein:vir:10364 Length: 390 98.4 4E-08 2.5E-11 61.1 14.8 285 1-396 101-390 (390)
105 protein:vir:105645 Length: 400 98.4 4.7E-07 2.9E-10 55.3 20.6 314 6-399 1-334 (400)
106 protein:vir:6242 Length: 390 # 98.4 3.6E-08 2.2E-11 61.4 14.2 285 1-399 97-390 (390)
107 protein:vir:8420 Length: 477 # 98.4 8.5E-08 5.3E-11 59.3 15.8 311 1-399 144-472 (477)
108 protein:vir:81160 Length: 371 98.4 9.4E-08 5.8E-11 59.1 16.0 286 1-398 69-371 (371)
109 protein:vir:101607 Length: 379 98.4 5.8E-08 3.6E-11 60.3 14.6 278 1-398 98-379 (379)
110 protein:vir:105038 Length: 428 98.4 2.1E-07 1.3E-10 57.1 17.6 305 1-398 113-428 (428)
111 protein:vir:107593 Length: 392 98.3 1.2E-07 7.7E-11 58.5 16.0 285 1-399 84-385 (392)
112 protein:vir:102082 Length: 392 98.3 1.2E-07 7.7E-11 58.5 16.0 285 1-399 84-385 (392)
113 protein:vir:102873 Length: 392 98.3 1.2E-07 7.7E-11 58.5 16.0 285 1-399 84-385 (392)
114 protein:vir:105004 Length: 392 98.3 1.2E-07 7.7E-11 58.5 16.0 285 1-399 84-385 (392)
115 protein:vir:1025 Length: 408 # 98.3 1.9E-07 1.2E-10 57.5 16.2 284 1-399 101-394 (408)
116 protein:vir:1328 Length: 392 # 98.2 2.1E-07 1.3E-10 57.1 15.2 287 1-399 97-392 (392)
117 protein:vir:3991 Length: 404 # 98.2 6.2E-07 3.9E-10 54.6 17.7 285 1-399 101-394 (404)
118 protein:vir:94673 Length: 419 98.2 3.1E-07 1.9E-10 56.3 15.9 297 1-399 112-418 (419)
119 protein:vir:2504 Length: 305 # 98.2 2.2E-07 1.4E-10 57.1 14.6 290 8-399 1-301 (305)
120 protein:vir:6212 Length: 434 # 98.2 3.9E-07 2.4E-10 55.7 15.9 295 1-399 131-430 (434)
121 protein:vir:1433 Length: 435 # 98.2 5.8E-07 3.6E-10 54.8 16.2 302 1-399 119-433 (435)
122 protein:vir:7409 Length: 408 # 98.2 8.4E-07 5.2E-10 53.9 17.0 286 1-399 101-394 (408)
123 protein:vir:80376 Length: 435 98.1 3.9E-07 2.4E-10 55.7 14.4 300 1-399 119-433 (435)
124 protein:vir:100172 Length: 394 98.1 5.8E-07 3.6E-10 54.8 15.2 284 1-399 100-385 (394)
125 protein:vir:102944 Length: 330 98.1 3.6E-06 2.2E-09 50.5 18.9 284 1-399 1-310 (330)
126 protein:vir:174 Length: 423 # 98.1 5E-07 3.1E-10 55.1 13.8 281 1-399 1-311 (423)
127 protein:vir:9704 Length: 394 # 98.0 5.7E-07 3.5E-10 54.8 13.4 272 1-399 118-391 (394)
128 protein:vir:3870 Length: 400 # 98.0 6.2E-07 3.8E-10 54.6 13.6 277 1-399 122-400 (400)
129 protein:vir:1383 Length: 421 # 98.0 9.5E-07 5.9E-10 53.6 14.1 278 1-399 104-387 (421)
130 protein:vir:100884 Length: 389 97.9 2.7E-06 1.7E-09 51.1 16.1 282 1-399 99-383 (389)
131 protein:vir:96762 Length: 632 97.9 1.5E-06 9.1E-10 52.6 14.0 279 1-397 335-632 (632)
132 protein:vir:7019 Length: 401 # 97.9 1.2E-05 7.2E-09 47.6 20.1 313 6-399 1-334 (401)
133 protein:vir:7855 Length: 497 # 97.9 1.7E-06 1E-09 52.3 13.6 313 1-399 142-494 (497)
134 protein:vir:101650 Length: 497 97.9 1.7E-06 1E-09 52.3 13.6 313 1-399 142-494 (497)
135 protein:vir:5739 Length: 366 # 97.8 6.6E-06 4.1E-09 49.0 16.0 300 1-398 52-366 (366)
136 protein:vir:93616 Length: 645 97.5 2E-05 1.2E-08 46.3 14.8 306 1-399 315-639 (645)
137 protein:vir:962 Length: 397 # 97.5 1.4E-05 8.8E-09 47.2 13.8 274 1-398 124-397 (397)
138 protein:vir:79008 Length: 299 97.3 0.00012 7.1E-08 42.2 17.1 287 1-398 1-299 (299)
139 protein:vir:1781 Length: 221 # 97.2 8.1E-05 5E-08 43.0 15.3 208 100-365 1-221 (221)
140 protein:vir:95376 Length: 425 97.1 9E-05 5.6E-08 42.8 14.3 282 1-399 118-422 (425)
141 protein:vir:1084 Length: 437 # 97.0 8.7E-05 5.4E-08 42.9 13.5 278 1-399 148-431 (437)
142 protein:vir:2685 Length: 387 # 97.0 2.4E-05 1.5E-08 45.9 10.2 273 1-399 99-382 (387)
143 protein:vir:96978 Length: 387 97.0 2.4E-05 1.5E-08 45.9 10.2 273 1-399 99-382 (387)
144 protein:vir:94424 Length: 387 97.0 2.4E-05 1.5E-08 45.9 10.2 273 1-399 99-382 (387)
145 protein:vir:93881 Length: 387 96.7 0.0001 6.3E-08 42.5 11.9 274 1-399 100-382 (387)
146 protein:vir:107120 Length: 329 96.6 0.00037 2.3E-07 39.4 14.0 279 1-399 12-309 (329)
147 protein:vir:9361 Length: 402 # 96.5 0.00011 6.6E-08 42.4 10.6 274 1-399 114-397 (402)
148 protein:vir:2106 Length: 430 # 96.5 0.00038 2.4E-07 39.3 13.4 299 1-399 1-430 (430)
149 protein:vir:97255 Length: 310 96.1 0.00017 1.1E-07 41.2 9.6 291 16-398 1-310 (310)
150 protein:vir:97331 Length: 319 96.0 0.0011 6.8E-07 36.8 15.3 278 1-399 1-297 (319)
151 protein:vir:94800 Length: 319 96.0 0.0011 6.8E-07 36.8 15.3 278 1-399 1-297 (319)
152 protein:vir:78640 Length: 352 95.9 0.00054 3.4E-07 38.5 11.3 273 1-399 64-347 (352)
153 protein:vir:80446 Length: 367 95.8 0.0014 8.8E-07 36.2 16.3 303 1-399 1-349 (367)
154 protein:vir:100939 Length: 430 95.5 0.001 6.2E-07 37.0 11.4 298 1-399 1-430 (430)
155 protein:vir:9265 Length: 430 # 95.5 0.001 6.2E-07 37.0 11.4 298 1-399 1-430 (430)
156 protein:vir:3158 Length: 321 # 94.9 0.0033 2E-06 34.2 13.3 293 1-399 1-313 (321)
157 protein:vir:105464 Length: 346 94.2 0.0051 3.1E-06 33.2 13.2 300 6-394 1-346 (346)
158 protein:vir:95131 Length: 325 93.9 0.0061 3.8E-06 32.7 12.1 286 1-399 1-307 (325)
159 protein:vir:108211 Length: 318 93.2 0.0084 5.2E-06 32.0 12.5 297 8-399 1-312 (318)
160 protein:vir:4092 Length: 390 # 92.6 0.011 6.6E-06 31.4 15.8 284 1-366 64-390 (390)
161 protein:vir:78920 Length: 290 87.3 0.039 2.4E-05 28.3 13.7 272 26-395 1-290 (290)
162 protein:vir:9509 Length: 381 # 86.1 0.048 3E-05 27.8 12.3 288 1-399 57-373 (381)
163 protein:vir:101291 Length: 381 86.1 0.048 3E-05 27.8 12.3 288 1-399 57-373 (381)
164 protein:vir:4159 Length: 315 # 85.5 0.052 3.2E-05 27.6 16.9 288 1-355 1-315 (315)
165 protein:vir:95963 Length: 395 83.3 0.069 4.3E-05 26.9 10.0 292 1-399 67-377 (395)
166 protein:vir:78387 Length: 349 77.1 0.13 8E-05 25.5 19.7 298 1-399 1-329 (349)
167 protein:vir:103886 Length: 302 75.3 0.15 9.2E-05 25.1 17.5 280 6-397 1-302 (302)
168 protein:vir:98635 Length: 377 72.9 0.18 0.00011 24.7 12.6 298 1-398 62-377 (377)
169 protein:vir:78350 Length: 383 70.5 0.21 0.00013 24.3 12.0 293 1-364 64-383 (383)
170 protein:vir:4197 Length: 314 # 61.6 0.35 0.00022 23.1 16.9 282 1-365 1-314 (314)
171 protein:vir:94989 Length: 349 61.1 0.36 0.00022 23.0 19.0 302 1-399 1-329 (349)
172 protein:vir:94933 Length: 330 61.0 0.36 0.00022 23.0 15.2 298 1-399 1-330 (330)
173 protein:vir:8324 Length: 410 # 57.2 0.44 0.00027 22.5 9.9 262 1-344 89-410 (410)
174 protein:vir:79928 Length: 393 46.5 0.73 0.00045 21.3 14.0 290 1-375 59-393 (393)
175 protein:vir:80128 Length: 466 42.7 0.87 0.00054 20.9 11.9 285 1-367 131-466 (466)
176 protein:vir:95512 Length: 693 42.6 0.88 0.00055 20.9 14.1 288 1-359 376-693 (693)
177 protein:vir:106647 Length: 303 29.3 1.7 0.001 19.4 17.8 286 1-399 1-301 (303)
178 protein:vir:9643 Length: 377 # 26.3
2 0.0012 19.0 12.2 290 1-398 59-377 (377) No 1 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=100.00 E-value=1.4e-176 Score=984.72 Aligned_cols=395 Identities=68% Similarity=1.130 Sum_probs=387.5 Q ss_pred CeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 5 GMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 5 ~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) -+.||.| +.+.++++++.||||++|||.+|+|+||+|+|||++|||++|||||+|||||||||.||++++|||+|||+| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 5789999 678899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHH Q lcl|NC_019514. 
84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMN 163 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~ 163 (399) +|+.+++|++||+|||++||+.++|+|||.||||||+||+|.|++++|||||+|+||||+++|||+|++|.+|++++|++ T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCc Q lcl|NC_019514. 164 GAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISA 243 (399) Q Consensus 164 ~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~ 243 (399) ++++++||++++|+||+|++|+|||+++++++++++++++++||+++||+|+++|++|||||||+||+||+|+|||+|++ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~ 240 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGA 240 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC-----ccccc Q lcl|NC_019514. 244 GRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN-----PGYRE 318 (399) Q Consensus 244 ~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~-----~~~~~ 318 (399) |||+||||||+|||++|.|.|.+|+||||||||++++||+||||+++|||||++|||++|+|+||+.+.. 
..+++ T Consensus 241 s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~ 320 (401) T protein:vir:95 241 TRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVS 320 (401) T ss_pred ceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999976553 25678 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) ++++|||||+||||+|||++|+|+|+|+..||++|||+||+++||++||||||||+||||||++++||++||+|||+||| T Consensus 321 ~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 321 GQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred CCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecC Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) | T Consensus 401 ~ 401 (401) T protein:vir:95 401 L 401 (401) T ss_pred C Confidence 9 No 2 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.68 E-value=1.8e-18 Score=117.91 Aligned_cols=270 Identities=16% Similarity=0.121 Sum_probs=178.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+. +.|+. +.-+.+.+|..-.++..+..++|.+++.. ..+.-+.|++|++-+|..+.+.. T Consensus 1 Ma~----------~~T~l----~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~----- 61 (276) T protein:vir:10 1 MAQ----------GTTTK----STQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDAT----- 61 (276) T ss_pred CCc----------ceeeh----hhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccc----- Confidence 431 12333 33344557887788888899999999984 55777789999999997763321 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .+.+| . .|+... .+....++.++|+|.-++++|+.......+ .+.+..+ T Consensus 62 -------~~~eg----~-----~i~~~~--------------lt~~~~~a~i~~~~k~~~~tD~a~~~~~~d-p~~~~~~ 110 (276) T protein:vir:10 62 -------VVPEG----Q-----KIPVDK--------------IETNRREAKIHKIGKGTDITDEALLSGYGD-PQGEAVR 110 (276) T ss_pred -------cccCC----C-----ccCccc--------------cccceeeEEeehccccccccHHHHHhhccc-hHHHHHH Confidence 12332 1 122222 222346788999999999999988887665 4455566 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+.-.+..+...+..+.. +. ....++++.+.+|...|..+... 
T Consensus 111 ~~~~~~a~~~d~~~~~~l~~~~~------------~~-----~~~~~t~d~i~~A~~~lgd~~~~--------------- 158 (276) T protein:vir:10 111 QHGLAIANKVDNDVLEALRGTKL------------TV-----SADIGTLAGLEAAIDTFDDEDLE--------------- 158 (276) T ss_pred HHHHHHHHHHHHHHHHHHhcccc------------cc-----cccccCHHHHHHHHHHhccccCc--------------- Confidence 66655433332222222222111 01 11346899999999888765432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++++|||+....||.+. .+.|+..+++|+. .+.+|+||++-|+|+|+++.+ T Consensus 159 ----~~~ivv~p~~~~~L~k~~----~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~------------------- 210 (276) T protein:vir:10 159 ----PMVLFINPKDAGKLRSSA----SDNFTRATELGDN-IIVKGAFGEALGAVIVRSKKL------------------- 210 (276) T ss_pred ----ccEEEEcHHHHHHHHHhc----ccccccccccccc-ceeccccceecceeEEEcCCC------------------- Confidence 268999999999998653 4789999999977 579999999999999999874 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +.|-.+++|++|++... +. + +. 
.- .|| |+.-++-.+.-...|++.+.+++..+.+.-+.-- T Consensus 211 ----p~~t~~l~~~gAi~~~~-~~-~----~~---vE-----~dR-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 211 ----DEGEAILAKRGAVKLIT-KR-D----FF---LE-----TDR-DPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred ----CcceEEEEeccceeeee-cC-C----ce---ee-----ccc-chhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 13455699999998752 11 1 11 11 122 5655555555557789999999999988766644 No 3 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.61 E-value=3.3e-17 Score=110.94 Aligned_cols=270 Identities=16% Similarity=0.119 Sum_probs=174.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcc-cccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD-VVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~-~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+. . .|+ ++..+.+..|..-+++.-.+.++|.+++. ...+.-..|+||++.+|..+.+.... .+ T Consensus 1 m~~---------~-~T~----l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~-~~ 65 (274) T protein:vir:95 1 MAQ---------G-MTK----LTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVV-AE 65 (274) T ss_pred CCc---------c-eee----hhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccc-cC Confidence 332 1 233 33345555788788888889999999986 55677678999999999765332211 11 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) |..++. . 
..+....++.|+|+|.-.+++|+.......+ ++.+..+ T Consensus 66 -----g~~i~~---------------~--------------~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:95 66 -----GEKIPT---------------D--------------ILETKKREAKIRKIAKGTSISDEALLSGYGD-PQGEQVR 110 (274) T ss_pred -----CCccch---------------h--------------hcccceeEEEeeeeecceeehHHHHhhccch-HHHHHHH Confidence 122211 1 1123346788999999999999987777665 4444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++.... ++.. . ....++++.+.+|...|..+... T Consensus 111 ~~~~~~a----~~vd~~i~~~l~~------a~~~--~-----~~~~~~~d~i~~A~~~lgd~~~~--------------- 158 (274) T protein:vir:95 111 QHGLAHA----NKVDDDVLEALKS------AKLT--V-----EADITKLTGLQTAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHH----HHHHHHHHHHHhc------cccc--c-----cccccCHHHHHHHHHHhcccccc--------------- Confidence 5554433 3334444433211 1110 0 11346899999999888764331 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||+....|+.. ....|++.+++|+ ..+.+|+||++-|+|+|+++.+ T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~------------------- 210 (274) T protein:vir:95 159 ----PMVLFISPLDAGKLRGD----ATTNFTRATELGD-DVIVKGAFGEALGAVIVRSNKL------------------- 210 (274) T ss_pred ----ccEEEeCHHHHHHHHhh----ccccccccccccc-cceeccccceecCeEEEEeCCC------------------- Confidence 27899999999999752 1137999999997 5789999999999999998753 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ++|-..++|++|++... +.+ +. . | .+| ||.-+.=.+--...|++.++|++..+.+..++-- T Consensus 211 ----~~~t~~l~~~gA~~~~~--~~~----~~---v---E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 211 ----EAGTAILAKKGAVKLIT--KRD----FF---L---E--TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ----CCceEEEEeccceeeee--cCC----cc---c---c--ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 25566899999999752 111 11 1 1 112 5554443444457799999999999999877654 No 4 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.61 E-value=3.3e-17 Score=110.94 Aligned_cols=270 Identities=16% Similarity=0.119 Sum_probs=174.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcc-cccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD-VVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~-~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+. . .|+ ++..+.+..|..-+++.-.+.++|.+++. ...+.-..|+||++.+|..+.+.... .+ T Consensus 1 m~~---------~-~T~----l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~-~~ 65 (274) T protein:vir:96 1 MAQ---------G-MTK----LTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVV-AE 65 (274) T ss_pred CCc---------c-eee----hhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccc-cC Confidence 332 1 233 33345555788788888889999999986 55677678999999999765332211 11 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 
80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) |..++. . ..+....++.|+|+|.-.+++|+.......+ ++.+..+ T Consensus 66 -----g~~i~~---------------~--------------~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:96 66 -----GEKIPT---------------D--------------ILETKKREAKIRKIAKGTSISDEALLSGYGD-PQGEQVR 110 (274) T ss_pred -----CCccch---------------h--------------hcccceeEEEeeeeecceeehHHHHhhccch-HHHHHHH Confidence 122211 1 1123346788999999999999987777665 4444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++.... ++.. . ....++++.+.+|...|..+... T Consensus 111 ~~~~~~a----~~vd~~i~~~l~~------a~~~--~-----~~~~~~~d~i~~A~~~lgd~~~~--------------- 158 (274) T protein:vir:96 111 QHGLAHA----NKVDDDVLEALKS------AKLT--V-----EADITKLTGLQTAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHH----HHHHHHHHHHHhc------cccc--c-----cccccCHHHHHHHHHHhcccccc--------------- Confidence 5554433 3334444433211 1110 0 11346899999999888764331 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||+....|+.. 
....|++.+++|+ ..+.+|+||++-|+|+|+++.+ T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~------------------- 210 (274) T protein:vir:96 159 ----PMVLFISPLDAGKLRGD----ATTNFTRATELGD-DVIVKGAFGEALGAVIVRSNKL------------------- 210 (274) T ss_pred ----ccEEEeCHHHHHHHHhh----ccccccccccccc-cceeccccceecCeEEEEeCCC------------------- Confidence 27899999999999752 1137999999997 5789999999999999998753 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ++|-..++|++|++... +.+ +. . | .+| ||.-+.=.+--...|++.++|++..+.+..++-- T Consensus 211 ----~~~t~~l~~~gA~~~~~--~~~----~~---v---E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 211 ----EAGTAILAKKGAVKLIT--KRD----FF---L---E--TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ----CCceEEEEeccceeeee--cCC----cc---c---c--ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 25566899999999752 111 11 1 1 112 5554443444457799999999999999877654 No 5 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.60 E-value=3.8e-17 Score=110.61 Aligned_cols=271 Identities=14% Similarity=0.148 Sum_probs=174.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+. +.|+...-|-|| .|..-.++.....+++.+++.. .++..+.|+||++.+|..+.+. 
T Consensus 1 ma~----------~~T~~~d~iiPe----v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda------ 60 (272) T protein:vir:36 1 MSK----------QKTTLADLVNPE----VLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDA------ 60 (272) T ss_pred CCC----------cceehhhhhchH----HHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccc------ Confidence 431 124444444455 6676677777889999999974 5677788999999999765332 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) ..+.+| . .|+... .+....+++++|+|...++||+.......+ ++.+..+ T Consensus 61 ------~~~~eg----~-----~i~~~~--------------lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d-~~~~~~~ 110 (272) T protein:vir:36 61 ------ADVAEG----G-----EISLDK--------------IGTTTKSVTIKKAAKGTEITDEAALSGYGD-PIGESNK 110 (272) T ss_pred ------cccCCC----C-----ccChhh--------------cCCcceeEeeehhhccccccHHHHhhccch-HHHHHHH Confidence 123333 1 122222 223346788999999999999876665444 4444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++... | +..++...++++.+.+|...|.....+ T Consensus 111 ~~a~~~a----~~~d~~i~~~l~-----~---------~~~~~~~~~~~d~i~~A~~~lgd~~~~--------------- 157 (272) T protein:vir:36 111 QLGLSLA----NKVDDDLLSAAK-----T---------TSQTVSTKANVDGVQAALDIFNDEDAQ--------------- 157 (272) T ss_pred HHHHHHH----HHHHHHHHHHhc-----c---------ccccccccccHHHHHHHHHHhhhcCCC--------------- Confidence 5544332 444555554331 1 112334567899999999888765543 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 
240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) .++.+|||.....||. ++.|..+..++....+.+|+||++-|+|+|+++.+-. + + T Consensus 158 ----~~~ivv~p~~~~~L~k------~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~--------~-------~ 212 (272) T protein:vir:36 158 ----AYVLIVNPKDAAKIRK------DANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAE--------G-------S 212 (272) T ss_pred ----ceEEEEcHHHHHHHhc------ccccccccccccccceeeeccceecCeeEEEeCCCCC--------C-------c Confidence 2789999999999974 4778888878777889999999999999999998521 1 1 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) .+|..+++|++|++...-+ + +. .-..+ |+.-+.=..--...|++.+++++-.+.+...-- T Consensus 213 ----~~~~~~~~~~gA~~~~~~~--~----~~---vE~~R------~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 213 ----ALMFKIVSNSPALKLVLKR--G----VQ---VETDR------DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ----eeEEEEEecccceeeeecC--C----cc---ccccc------chhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 1788889999999975222 1 11 11111 333332222234668999999998877643322 No 6 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.58 E-value=1.1e-16 Score=108.02 Aligned_cols=270 Identities=16% Similarity=0.121 Sum_probs=172.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ +. .|+...=| .+..|..-+++...+.++|.+++.. ..++-..|++|++.+|....+..+ T Consensus 1 ma---------~~-~T~~~d~i----~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~---- 62 (274) T protein:vir:96 1 MA---------QG-TTKVSNLI----VPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQV---- 62 (274) T ss_pred CC---------cc-ccchhhhh----hhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccc---- Confidence 33 11 23333334 4447887888888899999999974 467777899999999965422211 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+|. .|+.. .++....++.|+|+|...+++|+.......+ ++.+..+ T Consensus 63 --------~~~g~---------~i~~~--------------~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:96 63 --------IAEGE---------KIPVD--------------QIGTSKREAKVRKIGKGTELTDEAVLSGFGD-PQGEAVR 110 (274) T ss_pred --------cCCCC---------cCchh--------------hcccceeEEEEEeeeceeeecHHHHHhhcch-HHHHHHH Confidence 22210 01111 1223346778999999999999887665554 4555555 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++... ++ +. ......++++.+..|...|..+... 
T Consensus 111 ~~~~~~a----~~~d~~i~~~l~-----~a-~~-------~~~~~~~~~d~i~dA~~~l~d~~~~--------------- 158 (274) T protein:vir:96 111 QHGLAIA----NKVDNDVLEALK-----GA-TL-------TVEADITKLDGLQTAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHH----HHHHHHHHHHHh-----cC-CC-------CcCcccccHHHHHHHHHHhcccCCC--------------- Confidence 5554433 344444443321 11 10 0112456899999999888765432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||.....|+.+. ...|++.+++|+. .+.+|.||++-|+|+|+++.+- T Consensus 159 ----~~~ivv~p~~~~~L~k~~----~~~f~~~~~~g~~-~~~~g~ig~~~G~~Vi~s~~~p------------------ 211 (274) T protein:vir:96 159 ----PMVLFVNPLDAGGLRTSA----SDNFTRPTQLGDN-IIVKGAFGEALGAVIVRSNKLN------------------ 211 (274) T ss_pred ----ceEEEeCHHHHHHHHhcc----ccccccccccccc-ceeecccceecCeeEEEcCCCC------------------ Confidence 278999999999997642 2579999999874 6789999999999999998751 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ++-.+++|++|++... +.+ +. 
+ | .+| ||.-+.=..--...|++.++|++..+.+..++-= T Consensus 212 -----~~t~~l~~~gA~~~~~--~~~----~~--v----E--~~R-d~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 212 -----KGEALLAKKGAVKLIT--KRD----FF--L----E--KDR-DASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) T ss_pred -----cceEEEEeCcceeeee--cCC----cc--c----c--ccc-chhhcccEEEEeeEEEEEEEcCccEEEEEcCccc Confidence 2235699999999752 111 11 1 1 111 4443322233336799999999999999877654 No 7 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.58 E-value=1.2e-16 Score=107.84 Aligned_cols=270 Identities=14% Similarity=0.109 Sum_probs=171.8 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ +. .|+ ++..+.+.+|..-.++.....++|.+++.. ..+.-+.|+||++.+|..+.+..+ T Consensus 1 ma---------~~-~T~----l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~---- 62 (274) T protein:vir:12 1 MA---------QG-LTK----TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV---- 62 (274) T ss_pred CC---------cc-eee----hhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCcccc---- Confidence 32 11 233 334455557887788878889999999984 667778899999999976533222 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+|. .|+... 
.+.....+.|+|+|.-.+++|+.......+ .+.+..+ T Consensus 63 --------~~~g~---------~i~~~~--------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:12 63 --------VAEGE---------KIPTDI--------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred --------ccCCC---------ccchhh--------------cccceeeEEeeeecceeeecHHHHHhcccc-hHHHHHH Confidence 22210 011111 222345678999999999999877666554 3444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++.... ++. + .....++++.+.+|...|..+... T Consensus 111 q~~~~~a----~~vd~~~l~~~~~------a~~----~---~~~~a~~~d~i~dA~~~lgd~~~~--------------- 158 (274) T protein:vir:12 111 QHGLAHA----NKVDNDVLEALMG------AKL----T---VNADITKLNGLQSAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHH----HHHHHHHHHHHhc------ccc----c---ccccccCHHHHHHHHHHhcccccc--------------- Confidence 5554433 3333444433210 111 0 112356899999999888764321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||.....|+.. ....|++.++||+ ..+.+|+||++-|+|+|+++.+ T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~fv~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~------------------- 210 (274) T protein:vir:12 159 ----PMVLFINPLDAGKLRGD----ASTNFTRATELGD-DIIVKGAFGEALGAIIVRSNKL------------------- 210 (274) T ss_pred ----ccEEEeCHHHHHHHHhh----hhhhccccccccc-cceecccceeecCeeEEEeCCC------------------- Confidence 26899999999999752 1136999999998 4579999999999999999764 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+|-..++|++|++... +.+ +. .- .+| ||.-+.=.+--...|++.++|+...+.+..++-= T Consensus 211 ----p~~t~~l~~~gA~~~~~--~~~----~~---vE-----~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 211 ----EAGTAILAKKGAVKLIL--KRD----FF---LE-----VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ----CcceEEEEeccceeeee--cCC----ce---ec-----ccc-chhhcccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 13456799999999752 111 11 11 111 4444333333446689999999998888755433 No 8 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.57 E-value=2e-16 Score=106.69 Aligned_cols=270 Identities=14% Similarity=0.123 Sum_probs=172.5 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ + +.|+ ++..+.+..|..-.++...+.++|.+++.. ..++-+.|++|++.+|..+.+... T Consensus 1 ma---------~-~~T~----~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~---- 62 (274) T protein:vir:97 1 MP---------Q-GLTK----TSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV---- 62 (274) T ss_pred CC---------c-ccee----hhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc---- Confidence 33 2 1233 334455558888888888999999999985 567877899999999976533221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 
80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+|. .|+... .+....++.++|+|.-.+++|+.......+ .+.+..+ T Consensus 63 --------~~~g~---------~i~~~~--------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-p~~~~~~ 110 (274) T protein:vir:97 63 --------VAEGE---------KIPTDI--------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred --------ccCCC---------cccccc--------------cccceeEEEeeeecceecccHHHHHhccch-HHHHHHH Confidence 22221 111111 223346788999999999999877666554 3444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+. .+..++++.. .++ +. + .....++++.+.+|...|..+... T Consensus 111 ~~a~a~a~----~vd~~~~~~l-----~~a-~~----~---~~~~~~~~d~i~dA~~~l~d~~~~--------------- 158 (274) T protein:vir:97 111 QHGLAHAN----KVDNDVLEAL-----MGA-KL----T---VNADITKLNGLQSAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHHH----HHHHHHHHHH-----hcc-Cc----c---ccccccCHHHHHHHHHHhhccCCC--------------- Confidence 55544333 3333333321 011 11 0 112346899999999888764432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||.....|+.. ....|++.+++|+. 
.+.+|.||++-|+|+|+++.+- T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p------------------ 211 (274) T protein:vir:97 159 ----PMVLFVNPLDAGKLRGD----ASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLE------------------ 211 (274) T ss_pred ----ceEEEeCHHHHHHHHhh----hhhhccccCccccc-ceeccccceecCeeEEEcCCCC------------------ Confidence 27899999999999742 11379999999984 6789999999999999998751 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe-cc Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTV-AP 398 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~-a~ 398 (399) +|-..++|++|++... +.+ +. .- .+| ||.-+.-...-...|++.++|+.-.+.+.-+ +- T Consensus 212 -----~~t~~l~~~gA~~~~~--~~~----~~---vE-----~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 212 -----AGTAILAKKGAVKLIL--KRD----FF---LE-----VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred -----cceEEEEeCcceEeee--cCC----ce---ec-----ccc-chhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 3556799999999752 111 11 11 112 4554444444446789999999888876533 33 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) + T Consensus 272 ~ 272 (274) T protein:vir:97 272 L 272 (274) T ss_pred c Confidence 3 No 9 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.57 E-value=2e-16 Score=106.69 Aligned_cols=270 Identities=14% Similarity=0.123 Sum_probs=172.5 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ + +.|+ ++..+.+..|..-.++...+.++|.+++.. ..++-+.|++|++.+|..+.+... 
T Consensus 1 ma---------~-~~T~----~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~---- 62 (274) T protein:vir:94 1 MP---------Q-GLTK----TSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV---- 62 (274) T ss_pred CC---------c-ccee----hhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc---- Confidence 33 2 1233 334455558888888888999999999985 567877899999999976533221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+|. .|+... .+....++.++|+|.-.+++|+.......+ .+.+..+ T Consensus 63 --------~~~g~---------~i~~~~--------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-p~~~~~~ 110 (274) T protein:vir:94 63 --------VAEGE---------KIPTDI--------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred --------ccCCC---------cccccc--------------cccceeEEEeeeecceecccHHHHHhccch-HHHHHHH Confidence 22221 111111 223346788999999999999877666554 3444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+. .+..++++.. .++ +. + .....++++.+.+|...|..+... T Consensus 111 ~~a~a~a~----~vd~~~~~~l-----~~a-~~----~---~~~~~~~~d~i~dA~~~l~d~~~~--------------- 158 (274) T protein:vir:94 111 QHGLAHAN----KVDNDVLEAL-----MGA-KL----T---VNADITKLNGLQSAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHHH----HHHHHHHHHH-----hcc-Cc----c---ccccccCHHHHHHHHHHhhccCCC--------------- Confidence 55544333 3333333321 011 11 0 112346899999999888764432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 
240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||.....|+.. ....|++.+++|+. .+.+|.||++-|+|+|+++.+- T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p------------------ 211 (274) T protein:vir:94 159 ----PMVLFVNPLDAGKLRGD----ASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLE------------------ 211 (274) T ss_pred ----ceEEEeCHHHHHHHHhh----hhhhccccCccccc-ceeccccceecCeeEEEcCCCC------------------ Confidence 27899999999999742 11379999999984 6789999999999999998751 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe-cc Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTV-AP 398 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~-a~ 398 (399) +|-..++|++|++... +.+ +. .- .+| ||.-+.-...-...|++.++|+.-.+.+.-+ +- T Consensus 212 -----~~t~~l~~~gA~~~~~--~~~----~~---vE-----~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 212 -----AGTAILAKKGAVKLIL--KRD----FF---LE-----VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred -----cceEEEEeCcceEeee--cCC----ce---ec-----ccc-chhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 3556799999999752 111 11 11 112 4554444444446789999999888876533 33 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) + T Consensus 272 ~ 272 (274) T protein:vir:94 272 L 272 (274) T ss_pred c Confidence 3 No 10 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=99.57 E-value=6.7e-16 Score=103.77 Aligned_cols=313 Identities=16% Similarity=0.206 Sum_probs=185.1 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhh-hcc---------cccccccCCCEEEEEEcccc Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMP-LAD---------VVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~-fA~---------~~~mPkN~GktIk~rry~pl 70 (399) |+.-.+-+|+ |+... .|++++...+...-.|.. |-. ..++-|+.|.+|.|.--.+| T Consensus 1 Ma~T~~~~~~-------------p~a~~-~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) T protein:vir:93 1 MSQTVIPFGD-------------PKAVK-RWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL 66 (364) T ss_pred CceeccCcCC-------------HHHHH-HHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec Confidence 6554444444 45343 477766666766665554 432 34688999999998888777 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) .|-+=.|-.-.||+-+.-+ +....+ ++|+....- +..+++. ++-.-+ T Consensus 67 --------~g~gv~Gd~~leGnee~L~-----~~~~~i-------~idq~r~~V-~~~g~ms---------~qRt~~--- 113 (364) T protein:vir:93 67 --------RGKPTYGDARVEGKEESLR-----FYQDEV-------RIDQVRHSV-SAGGRMS---------RKRTVH--- 113 (364) T ss_pred --------ccCCcccCceeecccccee-----EEeeEE-------EEeeccccc-cccCchh---------hhhhHH--- Confidence 2223333334444332111 111111 222221110 0011111 110000 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhc--------------------------CCeEEecCCCcccccccccccCCc Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAG--------------------------AGTIVYTGAATQDSEITGEGATPS 204 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag--------------------------~~~v~yag~ats~~~~t~~~~~~~ 204 (399) .|.++-...|..--.... 
|++.-..|+| .+-++|+|.+++++++++ .+ T Consensus 114 -dlr~~ar~~L~~w~~~~~-d~~~f~~laGarg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~s----tD 187 (364) T protein:vir:93 114 -NIRRIARDRLGDYFYKFT-DELLFIYLSGARGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAA----TD 187 (364) T ss_pred -HHHHHHHHHHHHHHHHHH-HHHHHHHhhcccccccccccccCcccccccccCCCCCCcEEeccccCchhhccc----cc Confidence 233433444444444333 4444444444 256888889999888875 58 Q ss_pred eecHHHHHHHHHHHHhccCc------cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCC- Q lcl|NC_019514. 205 VVDYDDLMRLSITLDENRTP------KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYAD- 277 (399) Q Consensus 205 ~vt~~~lr~a~~~L~~nrap------~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~- 277 (399) .+|++.|+++...++..+++ .+.-.+.|. .+||+|+||....|||-- .++.|...++++. T Consensus 188 ~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~---------~~yV~~l~p~q~~~Lr~~----t~~~w~d~qk~A~~ 254 (364) T protein:vir:93 188 IMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGD---------DHYVCVMSEYQATDMRTA----AGGTWIDFQKAAAA 254 (364) T ss_pred cccHHHHHHHHHHHHHhCCCCCCCcccceeEecCc---------ceeEEEEcchhhhhhhhc----CCHHHHHHHHHhhh Confidence 89999999999999887542 222222322 489999999999999732 2578999999863 Q ss_pred ----ccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE Q lcl|NC_019514. 
278 ----AGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT 353 (399) Q Consensus 278 ----~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i 353 (399) ..|||.||+|.++||-+++-+++-.+-+.| +++.+.|-.-|.+|..|.+..=-+++| ..+.-+ T Consensus 255 ~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~-----------~~~~v~~~ralllGaQA~~~a~g~~~g--~~~~w~ 321 (364) T protein:vir:93 255 AEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYG-----------AGANVEAARALFMGRQAGVIAYGTANG--LRFDWE 321 (364) T ss_pred cccccCCceecCeeeEcCeEEeccCCcccccccc-----------cCccccchhhheecceeeEEEeecCCC--CCceee Confidence 456999999999999999988875553333 233455677899999997766444444 336666 Q ss_pred EecCCCCCCCCCCc-cchhhHHHHH-HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 354 TKMPGEATADRNDP-YGEMGFSSIK-WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 354 vk~pG~~~ad~~DP-lgQrg~~gwK-~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+.-.+ ++.+ ++....+||| .=|-. .|-=.+.|-++|++ T Consensus 322 Ee~~D~----gn~~~i~~~~i~G~kK~rF~~---~DfGvi~idtaa~~ 362 (364) T protein:vir:93 322 ETVKDY----GNEPAIAAGFIAGMKKARFNN---KDFGVISIDTAAKK 362 (364) T ss_pred ecccCC----CCchhhhhhhHhhhhhcccCC---ccceEEEecccccc Confidence 565443 2333 6666677776 22221 23345678999999 No 11 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.56 E-value=2.4e-16 Score=106.25 Aligned_cols=277 Identities=15% Similarity=0.107 Sum_probs=169.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcc-cccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD-VVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~-~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+. ..|+. +..+.+..|..-.++..++.+++.+++. ...++-..|.+|++.+|..+.+. 
T Consensus 1 Ma~----------~~T~~----~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a------ 60 (278) T protein:vir:80 1 MAD----------LTTKL----ANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDA------ 60 (278) T ss_pred CCC----------cceeh----hheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcc------ Confidence 442 12332 3345555788888888999999999996 55677778999999999765221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) ..+++|. .|+... .+....++.|+|+|.-.+++|+.......+ ++.++.+ T Consensus 61 ------~~~~~g~---------~i~~~~--------------lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d-~~~~~~~ 110 (278) T protein:vir:80 61 ------QDVAEGA---------AIDYSA--------------LETESVKHGIKKAGKGVKLTDESVLSGYGD-PVEEAQK 110 (278) T ss_pred ------eeecCCC---------cCcccc--------------cccceeeEeeehhhccccccHHHHhhcccc-HHHHHHH Confidence 1233331 122222 223346778999999999999877776665 5555555 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-++-.+..+. ..++++.... .++. +.. .. --.++.+-.+...|...+.| T Consensus 111 ~~a~~~a~~~d~~l~-~~l~~a~~~~-~~~~------t~~-~~--~~~~~~~~da~~~l~~~~~~--------------- 164 (278) T protein:vir:80 111 QIRMAIASKVDNDIL-EEALTTTLEV-KGAI------NIG-LI--DKIENTFTDAPDAIEDESIT--------------- 164 (278) T ss_pred HHHHHHHHHHHHHHH-HHHhcccccc-cccc------ccc-hh--hhHHHHHHHHHHhhcccCCC--------------- Confidence 555544443332222 2333332211 1110 000 00 01244444444445444333 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 
240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) ..++++|||.....|+.. ....|++..+||+. .+.+|+||++.|||+|+++.+- T Consensus 165 ---~~~~ivv~p~~~~~L~k~----~~~~~~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p------------------ 218 (278) T protein:vir:80 165 ---TTGVLFLNYKDTAKLREE----AAGSWTKASQLGDD-LLVKGAFGELLGWEIVRTKKLA------------------ 218 (278) T ss_pred ---cccEEEECHHHHHHHHhh----hhhhcccccccccc-ceeeccceeecceeEEEcCCCC------------------ Confidence 335789999999999753 23579999999986 5789999999999999999861 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+-..+++++|++...-+. +. . | .+| ||.-+.-.+--...|++.++|++..+.|-.+|-= T Consensus 219 -----~~t~~l~~~gAi~~~~~~~------~~---v---E--~~R-d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 219 -----DGNALAVKAGALKTFLKRN------LL---A---E--SGR-DMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred -----cceEEEEeccceeeeecCC------cc---c---c--ccc-chhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 1234688999998752111 11 1 1 121 4544333333346789999999999888766655 No 12 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.55 E-value=2.1e-16 Score=106.52 Aligned_cols=271 Identities=14% Similarity=0.107 Sum_probs=172.2 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcc-cccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD-VVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~-~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ +++.|+...-|-|| +|..-.++.....++|.+++. ...+.-..|++|++.+|..+.+... T Consensus 1 ~~---------~~~~T~l~d~i~PE----v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~---- 63 (275) T protein:vir:96 1 MA---------LENMTKLANMVNPE----VLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKV---- 63 (275) T ss_pred CC---------CcccchhhhhhchH----HHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCcccc---- Confidence 32 22334444444455 777778888889999999997 4456667799999999976533221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+| . .|+... .+....++.++|+|.-++++|+.......+ .+.+..+ T Consensus 64 --------~~~g----~-----~i~~~~--------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~ 111 (275) T protein:vir:96 64 --------VPEG----E-----EIPIDL--------------IETKKRQATIRKIGKGTVLTDEALLSGYGD-PKGEAVR 111 (275) T ss_pred --------ccCC----C-----Ccchhh--------------cccceeeEEeehhcccccccHHHHHhhccc-hHHHHHH Confidence 2222 0 111111 223346688999999999999987776555 3444455 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+ +.+..++++-.. .++. + .....++++.+-+|...|..+... 
T Consensus 112 ~~a~~~a----~~~d~~ll~~l~------~a~~----~---~~~~~~~~d~i~dA~~~lgd~~~~--------------- 159 (275) T protein:vir:96 112 QHGLAIA----NKVDNDVLEALQ------GATL----K---VEADITKLAGLQTAIDKFNDEDLE--------------- 159 (275) T ss_pred HHHHHHH----HHHHHHHHHHHh------cccc----c---ccccccCHHHHHHHHHHhccccCC--------------- Confidence 5554333 233333333221 0111 0 112446899999999888654321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||+....||.+. ...|++.+++|+. .+.+|+||++-|+|+|+++.+. T Consensus 160 ----~~~ivv~p~~~~~L~k~~----~~~f~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p------------------ 212 (275) T protein:vir:96 160 ----PMVLFVNPLDAGKLRASA----TDNFTRATLLGDN-VIVKGAFGEALGAIIVRSNKIK------------------ 212 (275) T ss_pred ----ccEEEeCHHHHHHHHhcc----ccccccccccccc-ceeccccceecCeeEEEeCCCC------------------ Confidence 278999999999998753 3579999999976 5789999999999999998741 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEE-Eecc Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVK-TVAP 398 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie-~~a~ 398 (399) ++-.+++|++|++... +. + +. . | .+| |+.-+.-.+--...|++.+++++-.+.++ +.+. T Consensus 213 -----~~t~~i~~~gA~~~~~-~~-~----~~---v---E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 213 -----EGEAILAKRGAVKLIT-KR-D----FF---L---E--TER-HASHKSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred -----cceEEEEeccceeeee-cC-C----cc---c---c--ccc-chhhcCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 3445789999999752 21 1 11 1 1 112 55544444444567899999998888764 4555 Q ss_pred C Q lcl|NC_019514. 
399 L 399 (399) Q Consensus 399 ~ 399 (399) | T Consensus 273 ~ 273 (275) T protein:vir:96 273 L 273 (275) T ss_pred c Confidence 6 No 13 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.55 E-value=1e-15 Score=102.75 Aligned_cols=324 Identities=11% Similarity=0.055 Sum_probs=163.9 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |.+=| ..|..........++.+..|.+.+|+.-++.++|..+...++.--.+|+||++.+..... . T Consensus 1 ~~~~~-~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~----~--------- 66 (341) T protein:vir:94 1 MALGN-TITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELG----V--------- 66 (341) T ss_pred Ccchh-hhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcce----e--------- Confidence 33322 111111111212223333589999988889999999887666544569999999863320 0 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHHHHHHHHHHh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFSHISTELMNG 164 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~~~~~~lg~~ 164 (399) .-.+.| . .|..+. .+..+++.+|.|+ .+=..++|+-....+. ++..++.++.++. 
T Consensus 67 ~d~~~~----~-----~i~~~~--------------~~~~~~~itiD~~~~~~~~i~d~d~~~~~~-d~~~~~~~~~~~a 122 (341) T protein:vir:94 67 EDKATD----V-----PVGVQP--------------VNDTDFVITVDTDRTTAVALDDLLEIQASY-DLRAPYLEAMGYA 122 (341) T ss_pred eeecCC----C-----cccccc--------------ccCceEEEEEeeeeecceeechHHHHhhcc-chHHHHHHHHHHH Confidence 011111 0 111111 1122345556453 2223344432212222 3555555666655 Q ss_pred hhHHHHHHHHHHHHhcCCeEEecC-CCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCc Q lcl|NC_019514. 165 AVQLTEAVLQKDLLAGAGTIVYTG-AATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISA 243 (399) Q Consensus 165 a~~~~e~~l~~~~lag~~~v~yag-~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~ 243 (399) -++-++.-+. .+++++....-.+ ..+..+.. ......++++.+..+.+.|.++.+|. . T Consensus 123 LA~~~D~~i~-~~~a~~~~~~~~~~~~~~~~~~---t~~~~~~~~~~i~~a~~~Lde~~VP~-----------------~ 181 (341) T protein:vir:94 123 LAKDMTGSIL-GLRAAVQNTASQNVFSSSNGAI---TGNGQAFSFAVFLAARRLLLEADVPE-----------------E 181 (341) T ss_pred HHHHHHHHHH-HHhhhccccccCccccCccccc---cCchhhhhHHHHHHHHHHHhhcCCCc-----------------c Confidence 5555543332 3333332111110 11111111 12234578999999999999999985 2 Q ss_pred eeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc--------- Q lcl|NC_019514. 244 GRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP--------- 314 (399) Q Consensus 244 ~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~--------- 314 (399) -++++|+|+...+|+. ++.|......+ ...+.+|+||++.||.+++++++-.-.+.+...+... 
T Consensus 182 gR~lvv~P~~~~~Ll~------~~~~~~~~~~g-~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~ 254 (341) T protein:vir:94 182 KIVLLISPGQESALFT------IPQFISKDFIN-NAPIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPG 254 (341) T ss_pred CCEEEeCHHHHHHHhh------chhhhhhhccc-cchhheeeeeeEeceEEEEeccccccccccccccccceeccccccc Confidence 3788999999999964 47898885544 4568999999999999999998743222111111000 Q ss_pred -----cccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 315 -----GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 315 -----~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~ 389 (399) .+....+-+|..--|++=+.|=+.+-+.- +.-+..++.++-. +-..=++--|--++==|..|++.+||++. T Consensus 255 i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~---~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~G~~~lrp~~ 330 (341) T protein:vir:94 255 FTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCH---MDWAAAVVSKAPR-VTQSFENREQVWLMVGRQAYGARLYRPLH 330 (341) T ss_pred ccccccccccccccccEEEEEEecccccceeeec---chhhhcccccccc-ccccchhhhhhhhhhhhhhhcccccCcce Confidence 00001111122222222222222211100 0011111222111 11111222333333358899999999999 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) .+.|++.+.- T Consensus 331 ~v~~~~~~~~ 340 (341) T protein:vir:94 331 AVNIHTTGDT 340 (341) T ss_pred eEEEecCcCC Confidence 9999987654 No 14 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.55 E-value=3.9e-16 Score=105.08 Aligned_cols=270 Identities=14% Similarity=0.127 Sum_probs=171.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ + +.|. ++..+.+..|..-.++...+.++|.+++.. ..++-+.|++|++.++..+.+.. T Consensus 1 ma---------~-~~T~----~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~----- 61 (274) T protein:vir:93 1 MP---------Q-GITK----TSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ----- 61 (274) T ss_pred CC---------c-ccee----hhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcc----- Confidence 43 1 1233 333455557888888888999999999985 57888889999999986653321 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .+.+|. .|+... .+....+++++|+|.-++++|+.......+ ++.+..+ T Consensus 62 -------~~~eg~---------~i~~~~--------------it~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:93 62 -------VVAEGE---------KIPTDI--------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred -------cccCCC---------cccccc--------------cccceeEEEeeeecccccccHHHHHhhccc-hHHHHHH Confidence 122221 112222 223456788999999999999877666554 4555555 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+. .+..++++.. .+ ++. + .....++++.+.+|...|..+... 
T Consensus 111 ~~~~~~a~----~~d~~~~~~~-----~~-a~~----~---~~~~~~~~d~i~dA~~~l~d~~~~--------------- 158 (274) T protein:vir:93 111 QHGLAHAN----KVDNDVLEAL-----MG-AKL----T---VNADITKLNGLQSAIDKFNDEDLE--------------- 158 (274) T ss_pred HHHHHHHH----HHHHHHHHHH-----hc-ccc----c---ccccccCHHHHHHHHHHhhhccCC--------------- Confidence 55544333 3334444332 11 111 0 112346899999999888764321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -++.+|||+....|+.- ....|++.+++|+. .+.+|.||++-|||+|+++.+- T Consensus 159 ----~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p------------------ 211 (274) T protein:vir:93 159 ----PMVLFINPLDAGKLRGD----ASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLE------------------ 211 (274) T ss_pred ----ccEEEeCHHHHHHHHhh----hhhccccccccccc-ceeecccceecCeeEEEcCCCC------------------ Confidence 26899999999999641 11379999999985 5789999999999999998751 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec-c Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA-P 398 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a-~ 398 (399) +|-.+++|++|++... +.+ +. .- .++ |+.-+.=.+--..+|++.+++++-.+.+.-++ - T Consensus 212 -----~~t~~l~~~gai~~~~--~~~----~~---vE-----~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s 271 (274) T protein:vir:93 212 -----AGTAILAKKGAVKLIL--KRD----FF---LE-----VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred -----cceEEEEeCCeEEEEe--cCC----cc---cc-----ccc-chhhcccEEEEEEEEEEEEEcCCceEEEeeCccc Confidence 3445799999999762 111 11 11 111 34333322333367899999998888765443 3 Q ss_pred C Q lcl|NC_019514. 
399 L 399 (399) Q Consensus 399 ~ 399 (399) + T Consensus 272 ~ 272 (274) T protein:vir:93 272 L 272 (274) T ss_pred c Confidence 3 No 15 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.49 E-value=2e-15 Score=101.22 Aligned_cols=269 Identities=12% Similarity=0.120 Sum_probs=170.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ + +.|+...-+-|+ .|...+++.-...+++.+++.. ..++...|++|++.++.... T Consensus 1 MA---------~-~~T~~~~~~iPe----v~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~-------- 58 (272) T protein:vir:98 1 MA---------V-GTTKMAQMLDPE----VLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIG-------- 58 (272) T ss_pred CC---------C-ccccchheechH----HHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCC-------- Confidence 43 1 124444445554 6777666667788899898874 45667789999999985431 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) ...++.||. .|+... ++...++.++++++.+.++||+.... 
+.+.++.++.+ T Consensus 59 ----~a~~v~eg~---------~i~~~~--------------~~~~~~~~~~~~~~~~~~itd~~~~~-s~~d~~~~~~~ 110 (272) T protein:vir:98 59 ----DAEDVAEGE---------AIPMTQ--------------LGFKKTTMTIKKAGKGVEITDEAILS-GYGDPVGQAAK 110 (272) T ss_pred ----CcccccCCC---------cccccc--------------cccceEEEEeeeeeeeeeecHHHHhh-ccccHHHHHHH Confidence 223445542 122222 23445778899999999999987644 34447777777 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+.-++ .++++.. .++ ..++....+++.+.++...|..+..+ T Consensus 111 ~~~~~~a~~~d----~~i~~~~-----~~a---------~~~~~~~~t~d~i~da~~~l~~~~~~--------------- 157 (272) T protein:vir:98 111 QIVEAIDHKVD----ADVLDAL-----SKS---------TQTVEATATVDGVSKALDIFNDEDDA--------------- 157 (272) T ss_pred HHHHHHHHHHH----HHHHHHh-----ccc---------ccccccccCHHHHHHHHHHHhccCCC--------------- Confidence 77766554443 3343321 110 11223345789999998888765432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -.+.+|||.....|+... .+.|...++++.. .+.+|.||++.|+|+|+++.+. T Consensus 158 ----~~~~vv~p~~~~~L~k~~----~~~~~~~~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p------------------ 210 (272) T protein:vir:98 158 ----ETVIVMNPADASTLRLDA----AKEWLGATEVGAN-RVVSGVYGEVLGVQIVRSRKCP------------------ 210 (272) T ss_pred ----ccEEEEcHHHHHHHHHhc----ccccccccccccc-ccccccchhhcCeeEEEcCCCC------------------ Confidence 157899999999997642 3578888888876 4789999999999999999862 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+-.++++++|++...-++ +. + - .+ -|+..++-.+--..+|++.+++++.++.+..++-= T Consensus 211 -----~~t~~~~~~~a~~~~~~~~------~~-v-e------~~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 211 -----KGTAYMVRKGALRIMLKRN------TM-V-E------TD-RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -----cceEEEEcCCeEEEEecCC------ce-e-e------ec-cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 1235679999998864211 11 1 0 11 13433332222335678889999988887665444 No 16 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.49 E-value=2e-15 Score=101.22 Aligned_cols=269 Identities=12% Similarity=0.120 Sum_probs=170.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ + +.|+...-+-|+ .|...+++.-...+++.+++.. ..++...|++|++.++.... T Consensus 1 MA---------~-~~T~~~~~~iPe----v~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~-------- 58 (272) T protein:vir:30 1 MA---------V-GTTKMAQMLDPE----VLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIG-------- 58 (272) T ss_pred CC---------C-ccccchheechH----HHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCC-------- Confidence 43 1 124444445554 6777666667788899898874 45667789999999985431 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 
80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) ...++.||. .|+... ++...++.++++++.+.++||+.... +.+.++.++.+ T Consensus 59 ----~a~~v~eg~---------~i~~~~--------------~~~~~~~~~~~~~~~~~~itd~~~~~-s~~d~~~~~~~ 110 (272) T protein:vir:30 59 ----DAEDVAEGE---------AIPMTQ--------------LGFKKTTMTIKKAGKGVEITDEAILS-GYGDPVGQAAK 110 (272) T ss_pred ----CcccccCCC---------cccccc--------------cccceEEEEeeeeeeeeeecHHHHhh-ccccHHHHHHH Confidence 223445542 122222 23445778899999999999987644 34447777777 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++..-+.-++ .++++.. .++ ..++....+++.+.++...|..+..+ T Consensus 111 ~~~~~~a~~~d----~~i~~~~-----~~a---------~~~~~~~~t~d~i~da~~~l~~~~~~--------------- 157 (272) T protein:vir:30 111 QIVEAIDHKVD----ADVLDAL-----SKS---------TQTVEATATVDGVSKALDIFNDEDDA--------------- 157 (272) T ss_pred HHHHHHHHHHH----HHHHHHh-----ccc---------ccccccccCHHHHHHHHHHHhccCCC--------------- Confidence 77766554443 3343321 110 11223345789999998888765432 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) -.+.+|||.....|+... .+.|...++++.. .+.+|.||++.|+|+|+++.+. 
T Consensus 158 ----~~~~vv~p~~~~~L~k~~----~~~~~~~~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p------------------ 210 (272) T protein:vir:30 158 ----ETVIVMNPADASTLRLDA----AKEWLGATEVGAN-RVVSGVYGEVLGVQIVRSRKCP------------------ 210 (272) T ss_pred ----ccEEEEcHHHHHHHHHhc----ccccccccccccc-ccccccchhhcCeeEEEcCCCC------------------ Confidence 157899999999997642 3578888888876 4789999999999999999862 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+-.++++++|++...-++ +. + - .+ -|+..++-.+--..+|++.+++++.++.+..++-= T Consensus 211 -----~~t~~~~~~~a~~~~~~~~------~~-v-e------~~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 211 -----KGTAYMVRKGALRIMLKRN------TM-V-E------TD-RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -----cceEEEEcCCeEEEEecCC------ce-e-e------ec-cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 1235679999998864211 11 1 0 11 13433332222335678889999988887665444 No 17 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.35 E-value=2e-14 Score=95.66 Aligned_cols=231 Identities=14% Similarity=0.110 Sum_probs=150.9 Q ss_pred cccccCCCEEEEEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeee Q lcl|NC_019514. 53 SMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQ 132 (399) Q Consensus 53 ~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~ 132 (399) +=--|.|+||.|-.| +.+. ..+.+| .+ |+... 
++....+++|+ T Consensus 1 ~~~~~~Gdtit~P~~--iGda------------~~v~eG----~~-----i~~~~--------------l~~t~~~atIk 43 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDA------------ADVAEG----GE-----ISLDK--------------IGTTTKSVTIK 43 (231) T ss_pred CccccCCceEEeccc--ccch------------hhhcCC----Cc-----CChhh--------------ccccceeeeEe Confidence 667789999999988 3221 234444 22 22222 22345788999 Q ss_pred eecceeehhhhhhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHH Q lcl|NC_019514. 133 KFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLM 212 (399) Q Consensus 133 qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr 212 (399) |+|.-++++|+..+....+++ .+ ..+|.+.-+-+.+..|+++... ++..++.+.+|++.|. T Consensus 44 ~~gk~~~itD~a~l~~~gDp~-~e----a~~Q~~~~iA~kvD~di~~~~~--------------~a~l~~~~~~t~d~i~ 104 (231) T protein:vir:73 44 KAAKGTEITDEAALSGYGDPI-GE----SNKQLGLSLANKVDDDLLKAAK--------------TTSQTVSTKANVDGVQ 104 (231) T ss_pred eeccceeeeHHHHhhccCchH-HH----HHHHHHHHHHHhhhHHHHHhhc--------------cccccccccccHHHHH Confidence 999999999998877665533 22 3333333444666667775442 1223345678999999 Q ss_pred HHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCe Q lcl|NC_019514. 213 RLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQF 292 (399) Q Consensus 213 ~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~v 292 (399) +|.-.|..+... -++++|||.....||. ++.|.....++...-+++|+||++-|+ T Consensus 105 ~A~~~fgde~~~-------------------~~vivv~p~~~~~Lrk------~~~~~~~~~~~g~~i~~~G~iG~i~G~ 159 (231) T protein:vir:73 105 AALDIFNDEDAQ-------------------AYVLIVNPKDAAKIRK------DANAKNIGSEVGANALINGTYADVLGA 159 (231) T ss_pred HHHHHhcccccc-------------------ceEEEEcchHHHhhhh------ccchhhhhhhhccceeeecccceEcce Confidence 999888664332 3899999998888875 355666555555556899999999999 Q ss_pred EEEecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhh Q lcl|NC_019514. 
293 RLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMG 372 (399) Q Consensus 293 RfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg 372 (399) |+|.+..+.. + ++ .+..+++.++|.+...=++ +. . -.|| ||.-+.= T Consensus 160 ~Vi~S~~~~~--------~-------~~----~~~~~i~~~gAl~~~~k~~------~~---v-----EtdR-d~~~k~~ 205 (231) T protein:vir:73 160 QIVRSKKLAE--------G-------SA----LMFKIVSNSPALKLVLKRG------VQ---V-----ETDR-DIVTKTT 205 (231) T ss_pred EEEEcCCCCC--------C-------ce----eeeeEEeeccceeeeeccc------ce---e-----eccc-ccccccc Confidence 9999988521 1 01 3445566777777652222 11 1 1233 6766666 Q ss_pred HHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 373 FSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 373 ~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) .+--..+|++.+.||.-.+.|-..-- T Consensus 206 ~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 206 VITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEEEeEEEEEEEEcCccEEEEEeecC Confidence 66666789999999998888754433 No 18 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.27 E-value=2.5e-13 Score=89.68 Aligned_cols=269 Identities=17% Similarity=0.180 Sum_probs=151.7 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-cc-ccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VS-MPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~-mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) |..|++ + |+ .|...+++.-++.+++.++... .+ -+ +.|+||++++....... 
+ .++ T Consensus 1 MA~~~~----------~-pe----~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~~~~----d--~~~ 58 (273) T protein:vir:10 1 MAFNNF----------I-PE----LWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAPTVK----D--YKA 58 (273) T ss_pred Ccchhh----------h-HH----HHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeeccccccc----c--ccc Confidence 222332 1 22 4777788788899999998853 22 24 45999999997553211 0 011 Q ss_pred CCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHHHHHHHHH Q lcl|NC_019514. 84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFSHISTELM 162 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~~~~~~lg 162 (399) .+.++. ....+..+++.+|.|+ +.=..++|+-.....++ +.+ +.+..+ T Consensus 59 ~~~~~~-----------------------------~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~ 107 (273) T protein:vir:10 59 AGRQTS-----------------------------ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGA 107 (273) T ss_pred CCCccC-----------------------------ccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHH Confidence 111111 1111223455667553 34445676544444444 433 445445 Q ss_pred HhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccC Q lcl|NC_019514. 163 NGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTIS 242 (399) Q Consensus 163 ~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~ 242 (399) +.-++-++ .-...+++++.... .+. . .....-.++.|..+...|.++++|. T Consensus 108 ~alA~~vD-~~i~~~~~~a~~~~-~~~----~------~~~~~~~~~~i~~a~~~ld~~~vP~----------------- 158 (273) T protein:vir:10 108 TALATDTD-KFIADMLVDNGTAL-TGS----A------PTDADDAFDLIAKALKELTKANVPN----------------- 158 (273) T ss_pred HHHHHHHH-HHHHHHHhcccccc-ccc----c------ccchhHHHHHHHHHHHHhhhcCCCc----------------- Confidence 44343332 22223333332111 110 1 1111225789999999999999996 Q ss_pred ceeEEEeCCCchHHHHHhhccCCCcc-ceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCc Q lcl|NC_019514. 
243 AGRVLYIGSELIPLIRKLVDPFGNAA-FVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNG 321 (399) Q Consensus 243 ~~yv~~~h~d~~~dirdl~d~~~~p~-fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~ 321 (399) ..++++|+|+....|+.. +. |.....+++...+-+|+||++.||.++++.++-. + ++ T Consensus 159 ~~R~lvv~p~~~~~L~~~------~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~----~-----------~~- 216 (273) T protein:vir:10 159 VGRVVVVNAEMAFWLRSS------GSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD----T-----------DD- 216 (273) T ss_pred CCCEEEECHHHHHHHhcc------hhhhhhhhccccccceeeeeeeEEeceEEEEeccccc----C-----------Cc- Confidence 237889999999999753 45 4456677777777899999999999999988621 0 00 Q ss_pred cceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 322 KYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 322 ~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +..+.+-++|++.+- ++...... -||-.+.=.+=-+++|++.+|+++-+++++.... T Consensus 217 ----~~~~~~~~~A~~~a~----------q~~~~e~~------r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 ----EQFVAFHPSAAAYVS----------QIDTVEAL------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ----cEEEEEeccceeeee----------eeehhhcc------cCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 112444566665431 11111111 1332222122235889999999999999997777 No 19 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.27 E-value=2.5e-13 Score=89.68 Aligned_cols=269 Identities=17% Similarity=0.180 Sum_probs=151.7 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-cc-ccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 
6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VS-MPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~-mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) |..|++ + |+ .|...+++.-++.+++.++... .+ -+ +.|+||++++....... + .++ T Consensus 1 MA~~~~----------~-pe----~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~~~~----d--~~~ 58 (273) T protein:vir:10 1 MAFNNF----------I-PE----LWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAPTVK----D--YKA 58 (273) T ss_pred Ccchhh----------h-HH----HHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeeccccccc----c--ccc Confidence 222332 1 22 4777788788899999998853 22 24 45999999997553211 0 011 Q ss_pred CCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHHHHHHHHH Q lcl|NC_019514. 84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFSHISTELM 162 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~~~~~~lg 162 (399) .+.++. ....+..+++.+|.|+ +.=..++|+-.....++ +.+ +.+..+ T Consensus 59 ~~~~~~-----------------------------~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~ 107 (273) T protein:vir:10 59 AGRQTS-----------------------------ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGA 107 (273) T ss_pred CCCccC-----------------------------ccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHH Confidence 111111 1111223455667553 34445676544444444 433 445445 Q ss_pred HhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccC Q lcl|NC_019514. 163 NGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTIS 242 (399) Q Consensus 163 ~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~ 242 (399) +.-++-++ .-...+++++.... .+. . .....-.++.|..+...|.++++|. 
T Consensus 108 ~alA~~vD-~~i~~~~~~a~~~~-~~~----~------~~~~~~~~~~i~~a~~~ld~~~vP~----------------- 158 (273) T protein:vir:10 108 TALATDTD-KFIADMLVDNGTAL-TGS----A------PTDADDAFDLIAKALKELTKANVPN----------------- 158 (273) T ss_pred HHHHHHHH-HHHHHHHhcccccc-ccc----c------ccchhHHHHHHHHHHHHhhhcCCCc----------------- Confidence 44343332 22223333332111 110 1 1111225789999999999999996 Q ss_pred ceeEEEeCCCchHHHHHhhccCCCcc-ceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCc Q lcl|NC_019514. 243 AGRVLYIGSELIPLIRKLVDPFGNAA-FVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNG 321 (399) Q Consensus 243 ~~yv~~~h~d~~~dirdl~d~~~~p~-fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~ 321 (399) ..++++|+|+....|+.. +. |.....+++...+-+|+||++.||.++++.++-. + ++ T Consensus 159 ~~R~lvv~p~~~~~L~~~------~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~----~-----------~~- 216 (273) T protein:vir:10 159 VGRVVVVNAEMAFWLRSS------GSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD----T-----------DD- 216 (273) T ss_pred CCCEEEECHHHHHHHhcc------hhhhhhhhccccccceeeeeeeEEeceEEEEeccccc----C-----------Cc- Confidence 237889999999999753 45 4456677777777899999999999999988621 0 00 Q ss_pred cceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 322 KYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 322 ~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +..+.+-++|++.+- ++...... -||-.+.=.+=-+++|++.+|+++-+++++.... 
T Consensus 217 ----~~~~~~~~~A~~~a~----------q~~~~e~~------r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 217 ----EQFVAFHPSAAAYVS----------QIDTVEAL------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ----cEEEEEeccceeeee----------eeehhhcc------cCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 112444566665431 11111111 1332222122235889999999999999997777 No 20 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.25 E-value=3.2e-13 Score=89.12 Aligned_cols=264 Identities=13% Similarity=0.097 Sum_probs=163.4 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-ccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+ -|+...-|-|+ .|..-..++.+..++|.++|.. ..|+-..|++|++-+|..+ .+... T Consensus 1 Ma------------~T~~~d~I~Pe----v~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~i-gdae~--- 60 (270) T protein:vir:95 1 MT------------QTKKANLINPE----VLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYI-GAAED--- 60 (270) T ss_pred CC------------ceehhhhcchH----HHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCC-Ccccc--- Confidence 32 23333334565 5555567777788999999985 6677778999999999743 22222 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +.+| .. |+... 
.+.....+.++|+|.-.++||+.....-.+++-+ ..+ T Consensus 61 --------~~eg----~~-----i~~~~--------------lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp~~~-~~~ 108 (270) T protein:vir:95 61 --------LQEG----VA-----MDTTQ--------------MSMTTTKVTVKETGKAVEVTQTAIITNVNGTLQE-ASR 108 (270) T ss_pred --------ccCC----Cc-----cchhh--------------cccchheeeeehhhCcceecHHHHhhhccchHHH-HHH Confidence 2222 11 11111 2233467889999999999999887775443443 333 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .++ .-+-+.++.++++...-. ..++....+++.+-++...|-... T Consensus 109 q~a----~~~a~~~d~~li~~l~~a--------------~~~~~~~~t~~~~~dA~~~lgd~~----------------- 153 (270) T protein:vir:95 109 QLA----MSLADKVEIDYIAELNKS--------------KQTATVSADATGILDAIEVFNSEN----------------- 153 (270) T ss_pred HHH----HHHHHHHHHHHHHHhccc--------------ccccccccCHHHHHHHHHHhcccc----------------- Confidence 333 233345556666544211 112234568888888886664321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) ..-++.+|||.+...||.. .|+.-.+|++.. +.+|+||.+-|+|+|+.+.. T Consensus 154 --~~~~~i~vhs~~~~~Lrk~-------~~~~~~~~~~~~-~~~G~ig~~~G~~Viv~s~~------------------- 204 (270) T protein:vir:95 154 --DEDYVLYVNPKDYNKLVKS-------LFKVGGNVQDRA-ISKGDLVEIVGVSDIVKSKR------------------- 204 (270) T ss_pred --CCCcEEEEcHHHHHHHHhh-------hcccccccccch-hcccccceecceeEEEeCCC------------------- Confidence 1236899999999999852 477778898864 78999999999998876542 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ++.|-..+++++|.+...-++ +. .-. || ||+-+.=.+--...|++.+.++.-.+.|.. +|- T Consensus 205 ---~~~~~~~l~~~gAi~~~~~~~------~~---vEt-----dR-d~~~~~d~i~~~~~y~v~~~~~skvv~~t~-~~a 265 (270) T protein:vir:95 205 ---VSENTAFLQRYGAMEIVNKKK------PE---AYT-----DF-DILKRTHLLSTNYHYSVNLKDETGVVKVTF-KPS 265 (270) T ss_pred ---CCceeEEEEeccceeeeecCC------ce---eee-----cc-chhhcccEEEeeeEEEEEEEccceEEEEEe-cCC Confidence 123456789999988653222 11 111 22 555555444555678999999887776643 222 No 21 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.24 E-value=4.6e-13 Score=88.20 Aligned_cols=271 Identities=16% Similarity=0.165 Sum_probs=150.0 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccc-cccccCCCEEEEEEccccccccccccCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVV-SMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAA 84 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~-~mPkN~GktIk~rry~pl~~~~~~~~~gi~aa 84 (399) |..|++ -|+ .|...+++.-++.+++.+++... +.=.+.|+||.+++....... + .++. T Consensus 1 MA~~~~-----------~pe----i~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~----d--~~~~ 59 (273) T protein:vir:79 1 MAFNNF-----------IPE----LWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVK----D--YKAA 59 (273) T ss_pred Ccchhh-----------hHH----HHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCccccc----c--cccC Confidence 222321 123 57888888888999999998532 222457999999997543110 0 0111 Q ss_pred CceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHHHHHHHHHH Q lcl|NC_019514. 
85 GATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFSHISTELMN 163 (399) Q Consensus 85 ga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~~~~~~lg~ 163 (399) |.++. .+ ..+..+++.+|.|+ +.=..++|+-......+ +.+ +.+.+++ T Consensus 60 ~~~~~---------------~~--------------~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~ 108 (273) T protein:vir:79 60 GRQTS---------------AD--------------AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGAT 108 (273) T ss_pred CCccC---------------cc--------------ccccceEEEEEeeecccceeeccHHHHhhccc-HHH-HHHHHHH Confidence 11111 11 11223566778664 33345565433333333 433 4444444 Q ss_pred hhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCc Q lcl|NC_019514. 164 GAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISA 243 (399) Q Consensus 164 ~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~ 243 (399) .-++-++ .-...+++++.... +++ ......-.++.+..+.+.|.++++|. . T Consensus 109 ala~~vD-~~i~~~~~~a~~~~-~~~----------~~~~~~~~~~~i~~a~~~ld~~~vP~-----------------~ 159 (273) T protein:vir:79 109 ALATDTD-KFIADMLVDNGTAL-TGS----------APSDADDAFDLIASALKELTKANVPN-----------------V 159 (273) T ss_pred HHHHHHH-HHHHHHHhhccccc-ccc----------cccchhhHHHHHHHHHHHhhhccCCc-----------------c Confidence 3333332 22223343332111 110 01111225788999999999999986 2 Q ss_pred eeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccc Q lcl|NC_019514. 244 GRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKY 323 (399) Q Consensus 244 ~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~ 323 (399) .++++|+|+...+|+.. +..|......++...+.+|+||++.||.|++++.+-.. 
++ T Consensus 160 ~R~lvv~p~~~~~Ll~~-----~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~---------------~~--- 216 (273) T protein:vir:79 160 GRVVVVNAEMAFWLRSS-----GSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT---------------DD--- 216 (273) T ss_pred CcEEEECHHHHHHHhhc-----hhhhhhhhhcccccceeeeEeeEEeceEEEeccccccc---------------Cc--- Confidence 37899999999998642 12366666667777788999999999999999886221 00 Q ss_pred eEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 324 DIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 324 DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +-.+.+=+.|++.. + ++....++ -||-.+.=.+--.++|++.+|+++-+++|+.... T Consensus 217 --~~~~a~~~~A~~~a--~--------~~~~~e~~------r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 217 --EQFVAFHPSAAAYV--S--------QIDTVEAL------RDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred --eEEEEEeccceeee--e--------ehhhhhcc------cCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 11233334555432 1 11111111 1332222222235789999999999999997666 No 22 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.23 E-value=6e-12 Score=82.12 Aligned_cols=318 Identities=12% Similarity=0.074 Sum_probs=179.0 Q ss_pred CCcCCeeecCC-CCccc----cccccccc-ceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccc Q lcl|NC_019514. 1 MASKGMLYNDP-NTTPS----GIDAPDGK-QMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDR 74 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~t----T~~~~i~p-~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~ 74 (399) |- .+||. +.++. ...+.... ++...-|+.+.++.=+..-+|..+-..+++ ..|++++|.|.... 
T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i--~~G~tv~i~~ig~~---- 70 (332) T protein:vir:78 1 MT----TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDL--RGGKSKQFMFTGKL---- 70 (332) T ss_pred Cc----ccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccc--cccceEEEEeccce---- Confidence 32 22322 11111 11111111 333344777777765566666666666665 36999999998543 Q ss_pred ccccCCCCCCCceeccCccccccccccccccc-cccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 75 NVNDQGIDAAGATIVNGNLYGSSKDIGTIVGK-IPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 75 ~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~-~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l 153 (399) ..++ .+.| .+ |.+. +| +-.+++.+|-|.=-|-.+=|.+.+...+-.+ T Consensus 71 -------~~~~--~~~g----~~-----l~~~~~~--------------~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl 118 (332) T protein:vir:78 71 -------SAGY--HTPG----TP-----IVGDAGI--------------KANEKTLVMDDLLVSSQFVYSLDEIFSQYST 118 (332) T ss_pred -------eEee--ecCC----CC-----CCCCCCC--------------CCceEEEEEehhhhhHHHHHhHHHHhcCcch Confidence 1111 1111 11 0110 11 1123445566655555555666666666568 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCC--CcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGA--ATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~--ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) ..+++++.|..-++.++.-+-..+..+++...-+++ ..+...+++..+.+-.--++.|+.+...|.+++.|. 
T Consensus 119 ~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~------ 192 (332) T protein:vir:78 119 RAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ------ 192 (332) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCc------ Confidence 888888888887777744343333333321111110 111222333223333335688999999999999985 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccc-cceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNG-EIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~g-EIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) .-+++++.|+.-..|..- .++.|+.....+..+.+.+| +||++.||++++++++-.=.+..... T Consensus 193 -----------~gR~~vv~P~~y~~Ll~~----~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~ 257 (332) T protein:vir:78 193 -----------EGRVAVLSPRQYYSLISS----VDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSS 257 (332) T ss_pred -----------cCCEEEeCHHHHHHHHhh----cCceeeeeeccccccceecceeeeEEeeeEEEecCccccCccccccc Confidence 238999999999999542 24678777556667778888 49999999999999983211111111 Q ss_pred cCCccccccCccc----eEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc Q lcl|NC_019514. 311 GTNPGYRETNGKY----DIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR 386 (399) Q Consensus 311 ~~~~~~~~t~~~~----DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn 386 (399) + ..++.++.+ +-..-|+|.++|.+.+-+.+ +++-+- -+--|+--|.-.+=-+..|++.+|| T Consensus 258 ~---~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~------~~~~~t------~~~~~~~~~~d~i~~~~~~G~~v~r 322 (332) T protein:vir:78 258 A---AVTGENNDYQVDASALAGLIFHREAAGCIQSVA------PTIQTT------SGDFNVQYQGDLIVGKLAMGCGSLR 322 (332) T ss_pred c---cccccccccccccccceEEeecccceeeeeeec------cchhhh------hcccchhhhHhhhhhhhhhcCceec Confidence 1 111112222 23446889999988885554 222111 0111333454555556789999999 Q ss_pred ccceEEEEEe Q lcl|NC_019514. 
387 PERLALVKTV 396 (399) Q Consensus 387 ~~~m~~ie~~ 396 (399) ++..+.|+++ T Consensus 323 Pe~~v~l~~a 332 (332) T protein:vir:78 323 TSVAGSFQAA 332 (332) T ss_pred ccceEEEeeC Confidence 9999999999 No 23 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.22 E-value=9.3e-13 Score=86.55 Aligned_cols=290 Identities=14% Similarity=0.137 Sum_probs=173.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |.-+.+ |..+.+.++..+.+-|+ .+.++.++.+.+.-.+.+++...+++.+.+.++....- T Consensus 1 m~~~~~--~~~~~~~t~~~~~lvP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~------------- 61 (297) T protein:vir:95 1 MTVQTF--NPENVLVSQKKDGTLHK----EFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTD------------- 61 (297) T ss_pred CCcccc--ccccccccCCCcceech----hHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcC------------- Confidence 654433 43344444444443343 34677888888888999999999987665544432211 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+...+..||- .+... 
..+...++...++++.++.+|+++++ +++.++.+.+..+ T Consensus 62 -~~~a~~v~Eg~---------~~~~~--------------~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~l~~~i~~~ 116 (297) T protein:vir:95 62 -GISAYWVNETE---------KIKTD--------------KPEVVPVTLKAHKLGIILVTSREALN-YTWKKFFEDMKPQ 116 (297) T ss_pred -CceeEEeecCc---------ccccc--------------ccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHH Confidence 12234566652 12222 23345678899999999999999776 4445588888788 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) +.+..+.-.+.. +++|.+...-.|..+... .........+++++|.++...|..+.... T Consensus 117 la~ai~~~~d~a----~l~G~g~~~~~gi~~~~~--~~~~~~~~~~t~~~i~~~~~~l~~~~~~~--------------- 175 (297) T protein:vir:95 117 IVEAFYKKIDEA----GLLGHDTPFANSVAKAAK--DANKVIGGPINYDNILKLQDALYDADVEP--------------- 175 (297) T ss_pred HHHHHHHHHHHH----HhcccCCccccccccccc--ccceecccccCHHHHHHHHHHhhhccCCc--------------- Confidence 877766555443 445543322122211111 11223345689999999998887754321 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccC Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETN 320 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~ 320 (399) + +.++||.....|+.|+|.-+ .+++.+..|.+.|+..+.++... .. 
T Consensus 176 --~--~~v~~~~~~~~L~~l~d~~G-------------~~i~~~~~~~l~G~Pv~~~~~~~--------~~--------- 221 (297) T protein:vir:95 176 --N--AFVSKIQNRSALREARDGNK-------------VSIYDKAANTIDGITTVDLKSAR--------FE--------- 221 (297) T ss_pred --C--EEEEcHHHHHHHHHhhccCC-------------ceeecCCCCcccceeeEeecCCC--------CC--------- Confidence 2 35789999999998876333 34566666777788777554310 00 Q ss_pred ccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc--chhhHHHHH--HHHHHhhccccceEEE Q lcl|NC_019514. 321 GKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY--GEMGFSSIK--WYYGTLILRPERLALV 393 (399) Q Consensus 321 ~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl--gQrg~~gwK--~~~~~~iLn~~~m~~i 393 (399) +- .+++|.-+...++..++ ..+++. .+..+. ...+.++ -|++.+.+| +++.+.+++++-.++| T Consensus 222 -~~----~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l 291 (297) T protein:vir:95 222 -KG----DLLAGDFDNLIYGVPYN---ITYKISEEGQISTIT--NADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKL 291 (297) T ss_pred -Cc----eEEEEecccEEEEEecC---eEEEEeecccccccc--ccCccchhhhhcCcEEEEEEEEeccEeecccceEEE Confidence 11 14577766665655442 222222 111111 1122333 467777777 7889999999999999 Q ss_pred EEeccC Q lcl|NC_019514. 394 KTVAPL 399 (399) Q Consensus 394 e~~a~~ 399 (399) +.+.|| T Consensus 292 ~~at~~ 297 (297) T protein:vir:95 292 TPAERV 297 (297) T ss_pred eecCCC Confidence 999999 No 24 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.22 E-value=1.2e-11 Score=80.45 Aligned_cols=307 Identities=11% Similarity=0.069 Sum_probs=177.3 Q ss_pred ecCCC--Cccccccccccc--ceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 8 YNDPN--TTPSGIDAPDGK--QMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 8 ~n~~~--~t~tT~~~~i~p--~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) |++|. 
..+-...+.-.+ .+....|+-+.+..=+-..+|..+-..|.+ ..|++++|-|-... T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i--~~G~s~~~~~iG~~------------- 65 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSL--RGTNQLRVDRVGAS------------- 65 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeec--cccceEEEeeecce------------- Confidence 55552 211111111122 233334788888776667888899999988 55999999987553 Q ss_pred CCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHH Q lcl|NC_019514. 84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMN 163 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~ 163 (399) -...-.+|.+.|.+.+.....+| +|-+.=-+-.+=|.+.+....-.+..++++++|. T Consensus 66 ----~~~~~~~g~~l~~~~~~~~~~~l-------------------~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~ 122 (334) T protein:vir:80 66 ----TIAGRKAGEELVVQKNVSDKLNL-------------------TVDTVLYARHFFDKFDEWTSNLDVRKETAREDGI 122 (334) T ss_pred ----eeeeecCCCCCCCCCcccCceEE-------------------EEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHH Confidence 12233334444444444443333 3433211222223333334432377777777777 Q ss_pred hhhHHHHHHHHHHHHhcCCeE--------EecCCCcccccccccccCCceecH----HHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 164 GAVQLTEAVLQKDLLAGAGTI--------VYTGAATQDSEITGEGATPSVVDY----DDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 164 ~a~~~~e~~l~~~~lag~~~v--------~yag~ats~~~~t~~~~~~~~vt~----~~lr~a~~~L~~nrap~~t~~i~ 231 (399) .-++.++.-+.+.++.++... .-+|.. +...+++. +.....+. +-++.|...|.++..|.. 
T Consensus 123 aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~-~~~~~~g~-~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~----- 195 (334) T protein:vir:80 123 ALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL-LPSTISGL-AADAAADADVLVAAHRQGVEAMVFRDLGDQ----- 195 (334) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccccccCCcc-eeeccccc-ccchhhhHHHHHHHHHHHHHHHHhcCCCCC----- Confidence 766666433333333333211 111211 11122111 11222333 344567778888888741 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCC---ccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYAD---AGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~---~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) + ..-+++++.|..-..|.. ++.|+.+ .|+. ...+-.|+|+++.||++++++++-.-...+ T Consensus 196 ~---------~~~R~~vv~P~~y~~Ll~------~~r~~n~-d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~- 258 (334) T protein:vir:80 196 L---------MSEGVTLLDPVIFSFLLE------HDRLMNV-EFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITA- 258 (334) T ss_pred c---------CCceEEEeChHHHHHHhc------ccccccc-eeccccccccccceeEEEEeceEEEeecCCCCccccc- Confidence 0 124999999999999964 4788888 5543 346788899999999999999973221111 Q ss_pred CccCCccccccCccceEE-------EEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHH Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIY-------PMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYG 381 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVy-------p~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~ 381 (399) ...++.+.+| ..+++...|-+++.+..- . .+ ---|+--|..++==|..|+ T Consensus 259 --------~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~---------~---~e---~~~~~~~~~d~i~~~~a~G 315 (334) T protein:vir:80 259 --------NALGADFNVTDAEVRRKMITFIPSMALISAQVHPV---------S---AQ---FWEEKKDFGHYLDTFQSYN 315 (334) T ss_pred --------cccccccccccccccceEEEEEeCceEEEEEEeec---------c---ee---eeechhhHHHHHHHHHHcC Confidence 1223334444 568888888887744431 0 11 1226667888888899999 Q ss_pred HhhccccceEEEEEeccC Q lcl|NC_019514. 
382 TLILRPERLALVKTVAPL 399 (399) Q Consensus 382 ~~iLn~~~m~~ie~~a~~ 399 (399) +.+||++..+.+|.-.+= T Consensus 316 ~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 316 IGQRRPDAVAVHDITVTN 333 (334) T ss_pred CceeccceEEEEEEeeec Confidence 999999999999876665 No 25 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.18 E-value=3.9e-12 Score=83.12 Aligned_cols=314 Identities=12% Similarity=0.081 Sum_probs=152.3 Q ss_pred CCcCCeeecCCC--CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+.-- -=|..- .-.+|.... +.+..|.+.+++.-++.++|..+....+.....|+||++.+.... . T Consensus 1 ~~~~~-~~~~~~~~~~~~t~~~~----fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~-~------ 68 (381) T protein:vir:80 1 MATIQ-GTGGYKGSAVDLSNVQV----FIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRA-A------ 68 (381) T ss_pred Cceec-ccccccCcccchhhHHh----hhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcc-e------ Confidence 65321 112211 111111122 222368888888888999999998887887788999999886432 0 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecce-eehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFF-TEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~-~e~Td~~~d~~~D~~l~~~~ 157 (399) -.-.++| . .|+.+. .+-.+++.+|.|+=.| ..++|. 
......-.+..++ T Consensus 69 ------a~d~~~g----~-----~i~~~~--------------~~~~~~~itID~~~~~~~~Idd~-D~~~~~~D~~~~~ 118 (381) T protein:vir:80 69 ------VYDKQPQ----T-----PVNLQA--------------RTDSEFTFTVTKYKESSFMIEDI-VNTQASYTLRQYY 118 (381) T ss_pred ------eeeecCC----C-----cccccc--------------cCCceEEEEEeeeeecceeechH-HHHhhccChHHHH Confidence 0011111 0 111111 1112345667555333 344442 2222221245555 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccc-----ccccc-ccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDS-----EITGE-GATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~-----~~t~~-~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) .+.++..-++.++.. ...+++...........+... ..... .+....++++.|..|.+.|++++.|. T Consensus 119 ~~~~~~aLA~~~D~~-i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~------ 191 (381) T protein:vir:80 119 TKEAGYALARDMDNF-ALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQ------ 191 (381) T ss_pred HHHHHHHHHHHHHHH-HHHHHhhcccccccccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCc------ Confidence 566665555555332 222333222221111111111 11111 12234578999999999999999985 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchh------cc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHW------AG 305 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~------~~ 305 (399) .-++++|+|+...+|+. ++.|..+ +|++.+.+.+|+||++.||+|++++++-.- .. 
T Consensus 192 -----------egR~lvv~P~~~~~Ll~------~~~~~~a-d~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~ 253 (381) T protein:vir:80 192 -----------EGRIVMVSPAQYIDLLS------INQFISV-DFSQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNG 253 (381) T ss_pred -----------CCcEEEeCHHHHHHHhh------chhhhhh-hhccchhhhceeeeEEcceEEEeecccccccccceeee Confidence 22788999999999964 4778887 588888899999999999999999887431 12 Q ss_pred cCCCccCCccccccCccceEEEEEEEcccceeeecccc---CC--C--CccceEEEecCCCCCCCCCCccchh------- Q lcl|NC_019514. 306 AGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQT---DG--K--TLKFKVTTKMPGEATADRNDPYGEM------- 371 (399) Q Consensus 306 aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g---~g--~--~~~~~~ivk~pG~~~ad~~DPlgQr------- 371 (399) +|+.....+.. +++.+ -|+.+|....+.. .+ . ..+...+...- ++++=..- T Consensus 254 agap~~~~~~~--~~~~~-------~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~------~~~~~~~~~~~~~~~ 318 (381) T protein:vir:80 254 QGAPTQPTPGV--LGSPY-------LPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGA------GATAADGGQTLGSFG 318 (381) T ss_pred ccccccccccc--ccccc-------ccccccceeeeeeeeeeceeeeeeeccceeeecc------eeeecCCCceeeeeh Confidence 22222222111 11111 1222221111111 11 0 00011111101 11222222 Q ss_pred hHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 372 GFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 372 g~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++-|+ ++.+--..|-+.-=..+.- T Consensus 319 ~~~~~~---~~~~~~~~~~~~~~~~~~~ 343 (381) T protein:vir:80 319 GANRWA---TAVVCHPDWLAVGVQQNVK 343 (381) T ss_pred hhhhhh---hhcccccccccccceeEee Confidence 233333 5555455554432111111 No 26 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.15 E-value=4.6e-11 Score=77.27 Aligned_cols=345 Identities=14% Similarity=0.131 Sum_probs=176.6 Q ss_pred CCcccccccccccceehhhhhHHHHHHHHHHHHh-hhhcc-------------------------cccccccCCCEEEEE Q lcl|NC_019514. 
12 NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYF-MPLAD-------------------------VVSMPKNYGKEIRVY 65 (399) Q Consensus 12 ~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~-~~fA~-------------------------~~~mPkN~GktIk~r 65 (399) -++..|.-+..+|+ .-..|++.+...+...-.| .+|.- ..+|-|+.|.+|.|- T Consensus 1 ~~~a~T~~~~~~p~-a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~ 79 (430) T protein:vir:10 1 MTASKTTMRYGDPN-AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFH 79 (430) T ss_pred CcceeeecccCChh-HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEe Confidence 23334444444566 3334665555555443332 33322 456789999999998 Q ss_pred EccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeec---ceeehh- Q lcl|NC_019514. 66 HYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFG---FFTEFS- 141 (399) Q Consensus 66 ry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG---~~~e~T- 141 (399) --.+| .|-+=.|-...||+-+.-+-. ...-++|..... .+..+++.|== +|.+.- T Consensus 80 L~~~L--------~g~gv~Gd~~lEGnee~L~~~------------~d~l~IDq~R~~-V~~gg~msqQRt~~dlR~~ar 138 (430) T protein:vir:10 80 FVQPA--------NAFPIMGSEYAEGKGTGLKIG------------SDQLRVNQARFP-VDLGDVMSQIRNPYDLRRLGR 138 (430) T ss_pred Eeecc--------ccCceecCceeeccccceEEE------------eeEEEEeeeccc-cccCCchhhhhhhhHHHHHHH Confidence 88777 222223333444443322211 111122222111 01112222200 000000 Q ss_pred hhhhhh---hcchHHHHHHHHHHH-----------HhhhHHHHHHHHHHHHhcCCe-EEe-cCCCcccccc---cccccC Q lcl|NC_019514. 142 QESLDF---DSDSELFSHISTELM-----------NGAVQLTEAVLQKDLLAGAGT-IVY-TGAATQDSEI---TGEGAT 202 (399) Q Consensus 142 d~~~d~---~~D~~l~~~~~~~lg-----------~~a~~~~e~~l~~~~lag~~~-v~y-ag~ats~~~~---t~~~~~ 202 (399) +.+.+. ..|.-++-|++.--| ....+.+ +....+|.+-..+ +++ +|.+++.... ...-.. 
T Consensus 139 ~~L~~w~~~~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~-~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~s 217 (430) T protein:vir:10 139 PKAKWFMDAYLDQSMLVHLAGARGNHYNKEWCLPLETHPKLA-DMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITT 217 (430) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccccccccccccCCcchh-hhhccccCCCCCceeEeecccccccccccccccchhh Confidence 000000 001111111111000 0001111 2233445553432 555 4544443321 111244 Q ss_pred CceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceeh-------hhc Q lcl|NC_019514. 203 PSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPV-------HQY 275 (399) Q Consensus 203 ~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v-------~~Y 275 (399) .++.+++.|+++...++..+-|.+.-.|+|..+-+.+|+ ||+|+||....|||. ++.|..- ... T Consensus 218 tD~~s~~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~---yV~~~~p~q~~~Lr~------dt~~~~wq~~~~a~a~~ 288 (430) T protein:vir:10 218 ADVLDVDVVDSIATYMDQIELPPPPVKFEGDEAAEDSPI---RVLLCSPAQYNSFAK------QEKFRSWQAAALARASN 288 (430) T ss_pred hcccCHHHHHHHHHHHHhhCCCCcceEeecccccCCccE---EEEEechHHHHHHhh------CcchHHHHHHHHHhhcc Confidence 588999999999999999998888888899888886644 999999999999984 6787632 234 Q ss_pred CCccccccccceeEcCeEEEecCccchhc-----ccCCCccCC----ccc-cccCccceEEEEEEEcccceeeecc--cc Q lcl|NC_019514. 276 ADAGTILNGEIGTVDQFRLVVVPEMLHWA-----GAGATVGTN----PGY-RETNGKYDIYPMLCVGAESFTTIGF--QT 343 (399) Q Consensus 276 a~~~~i~~gEIG~i~~vRfV~~~~~~~~~-----~aGa~~~~~----~~~-~~t~~~~DVyp~lV~G~~Afg~v~l--~g 343 (399) |+.-|||.||+|.++||=+.+-+..-.|- .-|++...+ ... 
..-++++.|=.-|.+|..|-..--= .+ T Consensus 289 g~~nPlF~G~~gm~ngvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~ 368 (430) T protein:vir:10 289 AKQHPIFRVDAGLWSNTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEH 368 (430) T ss_pred cccCCceecceeeecCeEEecCCceeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCC Confidence 66799999999999999888766443332 111111111 000 0113456677888999885422211 12 Q ss_pred CCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccc------------cceEEEEEeccC Q lcl|NC_019514. 344 DGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRP------------ERLALVKTVAPL 399 (399) Q Consensus 344 ~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~------------~~m~~ie~~a~~ 399 (399) +|. .|.-.- ....||.+=-++.++.++.+-.+- .=.+.|-++|++ T Consensus 369 ~g~--~f~w~E---------e~~D~g~~~~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa~~ 425 (430) T protein:vir:10 369 SGM--PFFWSE---------KDMDHGDKLELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAVKI 425 (430) T ss_pred CCc--ceeeee---------eccccCchhhhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhhhhh Confidence 332 233221 223466666677777777665432 234568888888 No 27 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.13 E-value=6e-11 Score=76.61 Aligned_cols=322 Identities=16% Similarity=0.059 Sum_probs=172.5 Q ss_pred CCcC--CeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASK--GMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~--~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+.- ++..|+ ..-+....+..---+...-|+.+.++.=+..-+|..+-..+++= .||+++|-|.... 
T Consensus 1 ma~~~~~~~~n~-~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~--~g~s~~~~~iG~~-------- 69 (344) T protein:vir:10 1 MANMTGGQQLGT-NQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRT-------- 69 (344) T ss_pred CccccccccCCc-ccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeec--ccceEEEEeecee-------- Confidence 7632 222233 11111111111111223337777777766667777777777654 5999999988553 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) -..+..+|.+.|. +.+++.- .+.+.+|-|.=-|-.+=|.+.+...+-.+..+++ T Consensus 70 ---------~~~~~~~G~~l~~---t~~~~~~--------------~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~ 123 (344) T protein:vir:10 70 ---------QAAYLAPGENLDD---IRKDIKH--------------TEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYT 123 (344) T ss_pred ---------EEEeeecCCCCCC---CCCCccc--------------ceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHH Confidence 1112222222221 0111111 2234455554334444445555555545777788 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCe----EEecCCCcccccccc---cccCCce-----ecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGT----IVYTGAATQDSEITG---EGATPSV-----VDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~----v~yag~ats~~~~t~---~~~~~~~-----vt~~~lr~a~~~L~~nrap~~ 226 (399) ++.|..-++.++..+-+.+..++.. ..+.++..+...+.. ....... .=++.|+.+...|.++..|. 
T Consensus 124 ~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~- 202 (344) T protein:vir:10 124 SQLGESLAMAADGAVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPS- 202 (344) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc- Confidence 8888777777644443444332211 111111111111100 0001111 12567899999999999985 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .-+++++.|+.-..|.+ ++.|... .|+....+-+|.||++.||++++++++-.-..+ T Consensus 203 ----------------~gR~~vv~P~~y~~Ll~------~~~~~~~-~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~ 259 (344) T protein:vir:10 203 ----------------SDRVFYCDPDSYSAILA------ALMPNAA-NYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAG 259 (344) T ss_pred ----------------cCCEEEeChHHHHHHhh------ccccccc-ccccccceeeeEEEEEeceEEEeccccccccCC Confidence 23889999999988854 4567665 588888899999999999999999997431111 Q ss_pred CCCccCCc----cccccCccc----eEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH Q lcl|NC_019514. 307 GATVGTNP----GYRETNGKY----DIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW 378 (399) Q Consensus 307 Ga~~~~~~----~~~~t~~~~----DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~ 378 (399) +...+.++ ...+.+..+ +--.-|||=++|-+++-+.. +.. -.-.|+--|.-++==|+ T Consensus 260 ~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~---------~~~------e~~r~~~~~~d~i~g~~ 324 (344) T protein:vir:10 260 TSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRD---------LAL------ERARRANFQADQIIAKY 324 (344) T ss_pred cccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhcc---------cee------ecccchhHHHHHHHHHh Confidence 11100000 000111111 11123455555555553222 100 01125555666666689 Q ss_pred HHHHhhccccceEEEEEecc Q lcl|NC_019514. 
379 YYGTLILRPERLALVKTVAP 398 (399) Q Consensus 379 ~~~~~iLn~~~m~~ie~~a~ 398 (399) .|++.+||++..+.||-..+ T Consensus 325 ~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 325 AMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hcccceecccceEEEEeecC Confidence 99999999999999999999 No 28 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.11 E-value=9.6e-12 Score=80.99 Aligned_cols=291 Identities=15% Similarity=0.145 Sum_probs=175.5 Q ss_pred eeecCCCCcccccc-cccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGID-APDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAA 84 (399) Q Consensus 6 ~~~n~~~~t~tT~~-~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aa 84 (399) |=+|. ++..++.. +.+=|+ .+.++.++...+..++.+++...||+.+ +.++.+.. .+. T Consensus 1 ~g~~a-~~~~~~~~~~~~iP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~~~~~-------------~~~ 59 (299) T protein:vir:41 1 MGFNP-DTTTMQSAKTGSIPI----NISEQIITGVKNGSAAMKLAKAVPMTKP---EEEFTFMS-------------GVG 59 (299) T ss_pred CCcCC-CcccccCCCceecch----hHHHHHHHHHHhcchhhhhceeeecCCC---cEEEEEEc-------------CCc Confidence 66665 44333333 332233 2356777778899999999988887643 33332211 122 Q ss_pred CceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHh Q lcl|NC_019514. 85 GATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNG 164 (399) Q Consensus 85 ga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~ 164 (399) ..+..|| +.......+...++...++++.+..+|+++++ +++.++.+.+...+.+. 
T Consensus 60 a~~v~E~-----------------------~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~~a 115 (299) T protein:vir:41 60 AFWVDEA-----------------------ERIQTSKPTFTKAKMRSKKMGVIIPTTKENLN-YSVTNFFSLMQAEIVEA 115 (299) T ss_pred eeeeecC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHH Confidence 3455554 22223334556788999999999999999776 55556888777777776 Q ss_pred hhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCce Q lcl|NC_019514. 165 AVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAG 244 (399) Q Consensus 165 a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~ 244 (399) .+...+ ..+++|.+.-.=.|..+. +............++++|.++.-.|..+..+. T Consensus 116 ~~~~~d----~a~l~G~g~~~~~gil~~-~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~------------------- 171 (299) T protein:vir:41 116 FYKKFD----QAVFTGVESPYNWNILKS-ATDASNLVEETANKYDDLNEAIGLIEAEDLEP------------------- 171 (299) T ss_pred HHHHHH----HHHhhcccCccccccccc-ccccceeeccccccHHHHHHHHHhhhcccCCc------------------- Confidence 655444 344555432111111111 11111122334578999999998887655432 Q ss_pred eEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccce Q lcl|NC_019514. 245 RVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYD 324 (399) Q Consensus 245 yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~D 324 (399) -..+|||.+...|+.|+|--+.|-|.+. ..+..+.+-|+.++.++.|. +|. ++ T Consensus 172 ~~~v~n~~~~~~L~~lkd~~G~~l~~~~---------~~~~~~~l~G~PV~~~~~~~----~~~------------~~-- 224 (299) T protein:vir:41 172 NGIATIRKQRVKYRSTKDGNGMPIFNTA---------TSNGVDDVLGLPIAYTPKYT----FGD------------KD-- 224 (299) T ss_pred CEEEEcHHHHHHHHHhhccCCceeecCC---------cCCCCceecceeeEEecccC----CCC------------Cc-- Confidence 2358999999999999887767777643 23345688899999998872 111 11 Q ss_pred EEEEEEEcccceeeeccccCCCCccceEEE---ecCCCCCCCCCCc--cchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_019514. 
325 IYPMLCVGAESFTTIGFQTDGKTLKFKVTT---KMPGEATADRNDP--YGEMGFSSIK--WYYGTLILRPERLALVKTVA 397 (399) Q Consensus 325 Vyp~lV~G~~Afg~v~l~g~g~~~~~~~iv---k~pG~~~ad~~DP--lgQrg~~gwK--~~~~~~iLn~~~m~~ie~~a 397 (399) +.+++|.-++..+++.++ ..++... +..+. ...+.| +-|++.+.+| +++++.+++++-+++|+..+ T Consensus 225 --~~~~~gdfs~~~i~~~~~---~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 225 --ISELVGDWNQAYYGILRG---VEYEILTEATLTTVA--DETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA 297 (299) T ss_pred --eEEEEEecccEEEEEecC---cEEEEeecccccccc--cccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 246778777666666543 2233321 11221 111222 3477778877 57789999999999887765 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) -= T Consensus 298 a~ 299 (299) T protein:vir:41 298 GN 299 (299) T ss_pred CC Confidence 55 No 29 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.10 E-value=6.8e-12 Score=81.79 Aligned_cols=289 Identities=13% Similarity=0.125 Sum_probs=171.3 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+-... 
|.-++++|+..+.+-|+ .+.++.++...+..++.+++...+|+.+ ..++-++.- T Consensus 1 ma~~~~--~~~~~~~t~~gg~lip~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~----------- 60 (304) T protein:vir:10 1 MATPTY--TPGNVILSDFKNGVIPA----EQGTLIMKDIMANSAIMKLAKNEPMTAQ---KKKFTYLAK----------- 60 (304) T ss_pred Cccccc--ccccccccCCCceecch----hHHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeC----------- Confidence 876543 44334433333333333 2356777778888999999998887642 233333311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+...|..|+ +.......+...++.++++++.++.+|+++++ ++..++...+.+. T Consensus 61 -~~~a~~v~E~-----------------------~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~ 115 (304) T protein:vir:10 61 -GVGAYWVSET-----------------------ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK-WTAKDFFNEVKPL 115 (304) T ss_pred -CcceEEeecC-----------------------cccccccceeeEEEEEEEEEEEeehhhHHHHh-cchHHHHHHHHHH Confidence 1223455554 22223344566788999999999999999766 4444587878777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCC----CcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGA----ATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~----ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) |.+..+.-.+ ..+++|.+...-.|. ....+............++++|.++...|+.+.... 
T Consensus 116 l~~~ia~~~d----~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----------- 180 (304) T protein:vir:10 116 IAEAFYKAFD----QAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----------- 180 (304) T ss_pred HHHHHHHHHH----hhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----------- Confidence 7776554444 344555432211111 001111112223456678999999988887644321 Q ss_pred CccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccc Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGY 316 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~ 316 (399) + ..+|||.+...|+.++|-.+ .+++....|++-|+.++.++.+.. . + T Consensus 181 ------~--~~v~~~~~~~~L~~lkd~~G-------------~~l~~~~~~~l~G~PV~~~~~~~~------~-~----- 227 (304) T protein:vir:10 181 ------N--GVLTTRSFRSKMRNALDAND-------------RPLFDANGNEIMGLPLSYTGADVY------D-K----- 227 (304) T ss_pred ------C--EEEEcHHHHHHHHHhhccCC-------------cEeecCCCccccceeeEEeccccc------C-C----- Confidence 2 34789999999998865433 445555668899999988887621 0 0 Q ss_pred cccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc------chhhHHHHH--HHHHHhhc Q lcl|NC_019514. 317 RETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY------GEMGFSSIK--WYYGTLIL 385 (399) Q Consensus 317 ~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl------gQrg~~gwK--~~~~~~iL 385 (399) ++. .+++|.-+.-.+++.++ ..+++. ....+ ...|+. -|+..+.|+ +++++.++ T Consensus 228 ----~~~----~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~----~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~ 292 (304) T protein:vir:10 228 ----KKS----LALMGDWDYARYGILQG---IEYAISEDATLTTL----QASDASGQPVSLFERDMFALRATMHIAYMNV 292 (304) T ss_pred ----CCc----EEEEEehhhEEEEEecc---eEEEEeecceeeee----cccccCccchhhhhcCcEEEEEEEEeccEee Confidence 111 25567655555555442 112221 11112 123443 467778887 68899999 Q ss_pred cccceEEEEEec Q lcl|NC_019514. 
386 RPERLALVKTVA 397 (399) Q Consensus 386 n~~~m~~ie~~a 397 (399) +++-+++|+.+= T Consensus 293 ~~~a~~~l~~a~ 304 (304) T protein:vir:10 293 KPEAFATLKPTE 304 (304) T ss_pred cccceEEEEecC Confidence 999999999988 No 30 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.10 E-value=6.8e-12 Score=81.79 Aligned_cols=289 Identities=13% Similarity=0.125 Sum_probs=171.3 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+-... |.-++++|+..+.+-|+ .+.++.++...+..++.+++...+|+.+ ..++-++.- T Consensus 1 ma~~~~--~~~~~~~t~~gg~lip~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~----------- 60 (304) T protein:vir:94 1 MATPTY--TPGNVILSDFKNGVIPA----EQGTLIMKDIMANSAIMKLAKNEPMTAQ---KKKFTYLAK----------- 60 (304) T ss_pred Cccccc--ccccccccCCCceecch----hHHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeC----------- Confidence 876543 44334433333333333 2356777778888999999998887642 233333311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+...|..|+ +.......+...++.++++++.++.+|+++++ ++..++...+.+. 
T Consensus 61 -~~~a~~v~E~-----------------------~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~ 115 (304) T protein:vir:94 61 -GVGAYWVSET-----------------------ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK-WTAKDFFNEVKPL 115 (304) T ss_pred -CcceEEeecC-----------------------cccccccceeeEEEEEEEEEEEeehhhHHHHh-cchHHHHHHHHHH Confidence 1223455554 22223344566788999999999999999766 4444587878777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCC----CcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGA----ATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~----ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) |.+..+.-.+ ..+++|.+...-.|. ....+............++++|.++...|+.+.... T Consensus 116 l~~~ia~~~d----~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----------- 180 (304) T protein:vir:94 116 IAEAFYKAFD----QAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----------- 180 (304) T ss_pred HHHHHHHHHH----hhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----------- Confidence 7776554444 344555432211111 001111112223456678999999988887644321 Q ss_pred CccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccc Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGY 316 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~ 316 (399) + ..+|||.+...|+.++|-.+ .+++....|++-|+.++.++.+.. . + T Consensus 181 ------~--~~v~~~~~~~~L~~lkd~~G-------------~~l~~~~~~~l~G~PV~~~~~~~~------~-~----- 227 (304) T protein:vir:94 181 ------N--GVLTTRSFRSKMRNALDAND-------------RPLFDANGNEIMGLPLSYTGADVY------D-K----- 227 (304) T ss_pred ------C--EEEEcHHHHHHHHHhhccCC-------------cEeecCCCccccceeeEEeccccc------C-C----- Confidence 2 34789999999998865433 445555668899999988887621 0 0 Q ss_pred cccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc------chhhHHHHH--HHHHHhhc Q lcl|NC_019514. 
317 RETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY------GEMGFSSIK--WYYGTLIL 385 (399) Q Consensus 317 ~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl------gQrg~~gwK--~~~~~~iL 385 (399) ++. .+++|.-+.-.+++.++ ..+++. ....+ ...|+. -|+..+.|+ +++++.++ T Consensus 228 ----~~~----~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~----~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~ 292 (304) T protein:vir:94 228 ----KKS----LALMGDWDYARYGILQG---IEYAISEDATLTTL----QASDASGQPVSLFERDMFALRATMHIAYMNV 292 (304) T ss_pred ----CCc----EEEEEehhhEEEEEecc---eEEEEeecceeeee----cccccCccchhhhhcCcEEEEEEEEeccEee Confidence 111 25567655555555442 112221 11112 123443 467778887 68899999 Q ss_pred cccceEEEEEec Q lcl|NC_019514. 386 RPERLALVKTVA 397 (399) Q Consensus 386 n~~~m~~ie~~a 397 (399) +++-+++|+.+= T Consensus 293 ~~~a~~~l~~a~ 304 (304) T protein:vir:94 293 KPEAFATLKPTE 304 (304) T ss_pred cccceEEEEecC Confidence 999999999988 No 31 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=99.09 E-value=3.1e-10 Score=72.72 Aligned_cols=337 Identities=16% Similarity=0.132 Sum_probs=172.8 Q ss_pred CCcCCeeecCCC------CcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcc--------cccccccCCCEEEEE Q lcl|NC_019514. 1 MASKGMLYNDPN------TTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLAD--------VVSMPKNYGKEIRVY 65 (399) Q Consensus 1 ~~~~~~~~n~~~------~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~--------~~~mPkN~GktIk~r 65 (399) |- .|-.|. +...|..+..+|+++. |.+++..... ..-.+..+.. ..++-|+.|.+|.|. 
T Consensus 1 ~~----~~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~ 74 (404) T protein:vir:10 1 MT----TVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) T ss_pred CC----CcCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEe Confidence 21 122221 1112333444555443 3444444422 1122322333 356679999999988 Q ss_pred EccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhh Q lcl|NC_019514. 66 HYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESL 145 (399) Q Consensus 66 ry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~ 145 (399) --.+| .|-+=.|-.-.||+-+.-+- .... -++|+....- +..+++.|= .... T Consensus 75 L~~~L--------~g~gv~Gd~~lEGnee~L~~-----~s~~-------i~Idq~r~~V-~~~g~msqQ-------Rt~~ 126 (404) T protein:vir:10 75 IMHKL--------SKRPTMGDERVEGRGEDLSH-----ADFS-------LKINQGRHLV-DAGGRMSQQ-------RTKF 126 (404) T ss_pred Eeeec--------ccCCcccCceeeccccceeE-----EeeE-------EEEeeecccc-cccCchhhh-------hhHH Confidence 88777 22233333344444321111 1111 1222211110 011111110 0001 Q ss_pred hhhcchHHHHHHHHHHHHhhhHHHHHHH-------------------------------HHHHHhcC-CeEEecCCCccc Q lcl|NC_019514. 146 DFDSDSELFSHISTELMNGAVQLTEAVL-------------------------------QKDLLAGA-GTIVYTGAATQD 193 (399) Q Consensus 146 d~~~D~~l~~~~~~~lg~~a~~~~e~~l-------------------------------~~~~lag~-~~v~yag~ats~ 193 (399) | |.+.-...|..=-+... |++ ..+|.|-. +-++|+|.++++ T Consensus 127 d------lr~~ar~~L~~w~~~~~-d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~ 199 (404) T protein:vir:10 127 N------LASSARTLLGTYFNDLQ-DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF 199 (404) T ss_pred H------HHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccch Confidence 1 22211122221111111 111 12233322 237788889998 Q ss_pred ccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehh Q lcl|NC_019514. 
194 SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVH 273 (399) Q Consensus 194 ~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~ 273 (399) ..+++ .+.++++.|.++.+.++...-|.+.-.+.|-.+-+. .+-||+|+||....|||. ...-+.|.... T Consensus 200 ~~l~s----tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d~q 269 (404) T protein:vir:10 200 EQIEA----ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQMM 269 (404) T ss_pred hhhhh----cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHHHH Confidence 88875 499999999999999999777765555555444443 345999999999988874 11113477776 Q ss_pred hc------CCccccccccceeEcCeEEEecCccc--hhcccCCCccCCcccccc---CccceEEEEEEEcccceeeeccc Q lcl|NC_019514. 274 QY------ADAGTILNGEIGTVDQFRLVVVPEML--HWAGAGATVGTNPGYRET---NGKYDIYPMLCVGAESFTTIGFQ 342 (399) Q Consensus 274 ~Y------a~~~~i~~gEIG~i~~vRfV~~~~~~--~~~~aGa~~~~~~~~~~t---~~~~DVyp~lV~G~~Afg~v~l~ 342 (399) ++ |..-|||.||.|.++||=+.+-+.+. ...+.....+.|...-++ ..+..|=.-|.+|..|-+..-=+ T Consensus 270 ~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~ 349 (404) T protein:vir:10 270 VRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQ 349 (404) T ss_pred HHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeec Confidence 65 46789999999999998877765431 112222111111111011 11234556688999764333112 Q ss_pred cCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEeccC Q lcl|NC_019514. 343 TDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR---------PERLALVKTVAPL 399 (399) Q Consensus 343 g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~ie~~a~~ 399 (399) .+| ..|+-+.+. 
=.||.+--++.+++++.+-.+ |.=.+.|-++|++ T Consensus 350 ~~g--~~~~w~Ee~---------~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 350 KAG--GHFNMVEKK---------TDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred cCC--CCceeEeec---------cccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 222 235544331 136667778888888887766 4445679999999 No 32 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=99.09 E-value=3.1e-10 Score=72.72 Aligned_cols=337 Identities=16% Similarity=0.132 Sum_probs=172.8 Q ss_pred CCcCCeeecCCC------CcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcc--------cccccccCCCEEEEE Q lcl|NC_019514. 1 MASKGMLYNDPN------TTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLAD--------VVSMPKNYGKEIRVY 65 (399) Q Consensus 1 ~~~~~~~~n~~~------~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~--------~~~mPkN~GktIk~r 65 (399) |- .|-.|. +...|..+..+|+++. |.+++..... ..-.+..+.. ..++-|+.|.+|.|. T Consensus 1 ~~----~~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~ 74 (404) T protein:vir:10 1 MT----TVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) T ss_pred CC----CcCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEe Confidence 21 122221 1112333444555443 3444444422 1122322333 356679999999988 Q ss_pred EccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhh Q lcl|NC_019514. 66 HYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESL 145 (399) Q Consensus 66 ry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~ 145 (399) --.+| .|-+=.|-.-.||+-+.-+- .... -++|+....- +..+++.|= .... 
T Consensus 75 L~~~L--------~g~gv~Gd~~lEGnee~L~~-----~s~~-------i~Idq~r~~V-~~~g~msqQ-------Rt~~ 126 (404) T protein:vir:10 75 IMHKL--------SKRPTMGDERVEGRGEDLSH-----ADFS-------LKINQGRHLV-DAGGRMSQQ-------RTKF 126 (404) T ss_pred Eeeec--------ccCCcccCceeeccccceeE-----EeeE-------EEEeeecccc-cccCchhhh-------hhHH Confidence 88777 22233333344444321111 1111 1222211110 011111110 0001 Q ss_pred hhhcchHHHHHHHHHHHHhhhHHHHHHH-------------------------------HHHHHhcC-CeEEecCCCccc Q lcl|NC_019514. 146 DFDSDSELFSHISTELMNGAVQLTEAVL-------------------------------QKDLLAGA-GTIVYTGAATQD 193 (399) Q Consensus 146 d~~~D~~l~~~~~~~lg~~a~~~~e~~l-------------------------------~~~~lag~-~~v~yag~ats~ 193 (399) | |.+.-...|..=-+... |++ ..+|.|-. +-++|+|.++++ T Consensus 127 d------lr~~ar~~L~~w~~~~~-d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~ 199 (404) T protein:vir:10 127 N------LASSARTLLGTYFNDLQ-DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF 199 (404) T ss_pred H------HHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccch Confidence 1 22211122221111111 111 12233322 237788889998 Q ss_pred ccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehh Q lcl|NC_019514. 194 SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVH 273 (399) Q Consensus 194 ~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~ 273 (399) ..+++ .+.++++.|.++.+.++...-|.+.-.+.|-.+-+. .+-||+|+||....|||. ...-+.|.... T Consensus 200 ~~l~s----tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d~q 269 (404) T protein:vir:10 200 EQIEA----ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQMM 269 (404) T ss_pred hhhhh----cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHHHH Confidence 88875 499999999999999999777765555555444443 345999999999988874 11113477776 Q ss_pred hc------CCccccccccceeEcCeEEEecCccc--hhcccCCCccCCcccccc---CccceEEEEEEEcccceeeeccc Q lcl|NC_019514. 
274 QY------ADAGTILNGEIGTVDQFRLVVVPEML--HWAGAGATVGTNPGYRET---NGKYDIYPMLCVGAESFTTIGFQ 342 (399) Q Consensus 274 ~Y------a~~~~i~~gEIG~i~~vRfV~~~~~~--~~~~aGa~~~~~~~~~~t---~~~~DVyp~lV~G~~Afg~v~l~ 342 (399) ++ |..-|||.||.|.++||=+.+-+.+. ...+.....+.|...-++ ..+..|=.-|.+|..|-+..-=+ T Consensus 270 ~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~ 349 (404) T protein:vir:10 270 VRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQ 349 (404) T ss_pred HHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeec Confidence 65 46789999999999998877765431 112222111111111011 11234556688999764333112 Q ss_pred cCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEeccC Q lcl|NC_019514. 343 TDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR---------PERLALVKTVAPL 399 (399) Q Consensus 343 g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~ie~~a~~ 399 (399) .+| ..|+-+.+. =.||.+--++.+++++.+-.+ |.=.+.|-++|++ T Consensus 350 ~~g--~~~~w~Ee~---------~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 350 KAG--GHFNMVEKK---------TDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred cCC--CCceeEeec---------cccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 222 235544331 136667778888888887766 4445679999999 No 33 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=99.09 E-value=3.1e-10 Score=72.72 Aligned_cols=337 Identities=16% Similarity=0.132 Sum_probs=172.8 Q ss_pred CCcCCeeecCCC------CcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcc--------cccccccCCCEEEEE Q lcl|NC_019514. 1 MASKGMLYNDPN------TTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLAD--------VVSMPKNYGKEIRVY 65 (399) Q Consensus 1 ~~~~~~~~n~~~------~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~--------~~~mPkN~GktIk~r 65 (399) |- .|-.|. +...|..+..+|+++. |.+++..... ..-.+..+.. 
..++-|+.|.+|.|. T Consensus 1 ~~----~~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~ 74 (404) T protein:vir:32 1 MT----TVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) T ss_pred CC----CcCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEe Confidence 21 122221 1112333444555443 3444444422 1122322333 356679999999988 Q ss_pred EccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhh Q lcl|NC_019514. 66 HYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESL 145 (399) Q Consensus 66 ry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~ 145 (399) --.+| .|-+=.|-.-.||+-+.-+- .... -++|+....- +..+++.|= .... T Consensus 75 L~~~L--------~g~gv~Gd~~lEGnee~L~~-----~s~~-------i~Idq~r~~V-~~~g~msqQ-------Rt~~ 126 (404) T protein:vir:32 75 IMHKL--------SKRPTMGDERVEGRGEDLSH-----ADFS-------LKINQGRHLV-DAGGRMSQQ-------RTKF 126 (404) T ss_pred Eeeec--------ccCCcccCceeeccccceeE-----EeeE-------EEEeeecccc-cccCchhhh-------hhHH Confidence 88777 22233333344444321111 1111 1222211110 011111110 0001 Q ss_pred hhhcchHHHHHHHHHHHHhhhHHHHHHH-------------------------------HHHHHhcC-CeEEecCCCccc Q lcl|NC_019514. 146 DFDSDSELFSHISTELMNGAVQLTEAVL-------------------------------QKDLLAGA-GTIVYTGAATQD 193 (399) Q Consensus 146 d~~~D~~l~~~~~~~lg~~a~~~~e~~l-------------------------------~~~~lag~-~~v~yag~ats~ 193 (399) | |.+.-...|..=-+... |++ ..+|.|-. +-++|+|.++++ T Consensus 127 d------lr~~ar~~L~~w~~~~~-d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~ 199 (404) T protein:vir:32 127 N------LASSARTLLGTYFNDLQ-DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF 199 (404) T ss_pred H------HHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccch Confidence 1 22211122221111111 111 12233322 237788889998 Q ss_pred ccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehh Q lcl|NC_019514. 
194 SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVH 273 (399) Q Consensus 194 ~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~ 273 (399) ..+++ .+.++++.|.++.+.++...-|.+.-.+.|-.+-+. .+-||+|+||....|||. ...-+.|.... T Consensus 200 ~~l~s----tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d~q 269 (404) T protein:vir:32 200 EQIEA----ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQMM 269 (404) T ss_pred hhhhh----cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHHHH Confidence 88875 499999999999999999777765555555444443 345999999999988874 11113477776 Q ss_pred hc------CCccccccccceeEcCeEEEecCccc--hhcccCCCccCCcccccc---CccceEEEEEEEcccceeeeccc Q lcl|NC_019514. 274 QY------ADAGTILNGEIGTVDQFRLVVVPEML--HWAGAGATVGTNPGYRET---NGKYDIYPMLCVGAESFTTIGFQ 342 (399) Q Consensus 274 ~Y------a~~~~i~~gEIG~i~~vRfV~~~~~~--~~~~aGa~~~~~~~~~~t---~~~~DVyp~lV~G~~Afg~v~l~ 342 (399) ++ |..-|||.||.|.++||=+.+-+.+. ...+.....+.|...-++ ..+..|=.-|.+|..|-+..-=+ T Consensus 270 ~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~ 349 (404) T protein:vir:32 270 VRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQ 349 (404) T ss_pred HHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeec Confidence 65 46789999999999998877765431 112222111111111011 11234556688999764333112 Q ss_pred cCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEeccC Q lcl|NC_019514. 343 TDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR---------PERLALVKTVAPL 399 (399) Q Consensus 343 g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~ie~~a~~ 399 (399) .+| ..|+-+.+. 
=.||.+--++.+++++.+-.+ |.=.+.|-++|++ T Consensus 350 ~~g--~~~~w~Ee~---------~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 350 KAG--GHFNMVEKK---------TDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred cCC--CCceeEeec---------cccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 222 235544331 136667778888888887766 4445679999999 No 34 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=99.09 E-value=3.1e-10 Score=72.72 Aligned_cols=337 Identities=16% Similarity=0.132 Sum_probs=172.8 Q ss_pred CCcCCeeecCCC------CcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcc--------cccccccCCCEEEEE Q lcl|NC_019514. 1 MASKGMLYNDPN------TTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLAD--------VVSMPKNYGKEIRVY 65 (399) Q Consensus 1 ~~~~~~~~n~~~------~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~--------~~~mPkN~GktIk~r 65 (399) |- .|-.|. +...|..+..+|+++. |.+++..... ..-.+..+.. ..++-|+.|.+|.|. T Consensus 1 ~~----~~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~ 74 (404) T protein:vir:81 1 MT----TVTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (404) T ss_pred CC----CcCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEe Confidence 21 122221 1112333444555443 3444444422 1122322333 356679999999988 Q ss_pred EccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhh Q lcl|NC_019514. 66 HYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESL 145 (399) Q Consensus 66 ry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~ 145 (399) --.+| .|-+=.|-.-.||+-+.-+- .... -++|+....- +..+++.|= .... 
T Consensus 75 L~~~L--------~g~gv~Gd~~lEGnee~L~~-----~s~~-------i~Idq~r~~V-~~~g~msqQ-------Rt~~ 126 (404) T protein:vir:81 75 IMHKL--------SKRPTMGDERVEGRGEDLSH-----ADFS-------LKINQGRHLV-DAGGRMSQQ-------RTKF 126 (404) T ss_pred Eeeec--------ccCCcccCceeeccccceeE-----EeeE-------EEEeeecccc-cccCchhhh-------hhHH Confidence 88777 22233333344444321111 1111 1222211110 011111110 0001 Q ss_pred hhhcchHHHHHHHHHHHHhhhHHHHHHH-------------------------------HHHHHhcC-CeEEecCCCccc Q lcl|NC_019514. 146 DFDSDSELFSHISTELMNGAVQLTEAVL-------------------------------QKDLLAGA-GTIVYTGAATQD 193 (399) Q Consensus 146 d~~~D~~l~~~~~~~lg~~a~~~~e~~l-------------------------------~~~~lag~-~~v~yag~ats~ 193 (399) | |.+.-...|..=-+... |++ ..+|.|-. +-++|+|.++++ T Consensus 127 d------lr~~ar~~L~~w~~~~~-d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~ 199 (404) T protein:vir:81 127 N------LASSARTLLGTYFNDLQ-DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSF 199 (404) T ss_pred H------HHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccch Confidence 1 22211122221111111 111 12233322 237788889998 Q ss_pred ccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehh Q lcl|NC_019514. 194 SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVH 273 (399) Q Consensus 194 ~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~ 273 (399) ..+++ .+.++++.|.++.+.++...-|.+.-.+.|-.+-+. .+-||+|+||....|||. ...-+.|.... T Consensus 200 ~~l~s----tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d~q 269 (404) T protein:vir:81 200 EQIEA----ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQMM 269 (404) T ss_pred hhhhh----cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHHHH Confidence 88875 499999999999999999777765555555444443 345999999999988874 11113477776 Q ss_pred hc------CCccccccccceeEcCeEEEecCccc--hhcccCCCccCCcccccc---CccceEEEEEEEcccceeeeccc Q lcl|NC_019514. 
274 QY------ADAGTILNGEIGTVDQFRLVVVPEML--HWAGAGATVGTNPGYRET---NGKYDIYPMLCVGAESFTTIGFQ 342 (399) Q Consensus 274 ~Y------a~~~~i~~gEIG~i~~vRfV~~~~~~--~~~~aGa~~~~~~~~~~t---~~~~DVyp~lV~G~~Afg~v~l~ 342 (399) ++ |..-|||.||.|.++||=+.+-+.+. ...+.....+.|...-++ ..+..|=.-|.+|..|-+..-=+ T Consensus 270 ~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~ 349 (404) T protein:vir:81 270 VRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQ 349 (404) T ss_pred HHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEEEeec Confidence 65 46789999999999998877765431 112222111111111011 11234556688999764333112 Q ss_pred cCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEeccC Q lcl|NC_019514. 343 TDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR---------PERLALVKTVAPL 399 (399) Q Consensus 343 g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~ie~~a~~ 399 (399) .+| ..|+-+.+. =.||.+--++.+++++.+-.+ |.=.+.|-++|++ T Consensus 350 ~~g--~~~~w~Ee~---------~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 350 KAG--GHFNMVEKK---------TDMDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred cCC--CCceeEeec---------cccCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 222 235544331 136667778888888887766 4445679999999 No 35 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.06 E-value=1.9e-10 Score=73.83 Aligned_cols=320 Identities=17% Similarity=0.054 Sum_probs=175.5 Q ss_pred CCcCC--eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKG--MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~--~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+.-- ...|. 
++.+-...++.---+...-|+.+.++.=+..-+|..+=..+++= .||+++|-|.... T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~--~gks~~~~~iG~~-------- 69 (345) T protein:vir:22 1 MASMTGGQQMGT-NQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRT-------- 69 (345) T ss_pred Ccccccchhccc-ccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeecc--ccceEEEeeecce-------- Confidence 65421 22222 11111111121112233336777766644555555555566553 6999999988553 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) -+.....|.+.|. +.+++.-+ +...+|-|.=-|-.+=|.+.+....-.+..+++ T Consensus 70 ---------~~~~~~~G~~l~~---~~~~~~~~--------------e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s 123 (345) T protein:vir:22 70 ---------QAAYLAPGENLDD---KRKDIKHT--------------EKVITIDGLLTADVLIYDIEDAMNHYDVRSEYT 123 (345) T ss_pred ---------EEEeeecCCCCCC---CCCCcccc--------------eEEEEecchhhhhhhHhhHHHHhcCchhHHHHH Confidence 1222222333322 11111111 223445454444444455555555545777788 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCe----EEecCCCcccc--cccccc--c----CCceecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGT----IVYTGAATQDS--EITGEG--A----TPSVVDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~----v~yag~ats~~--~~t~~~--~----~~~~vt~~~lr~a~~~L~~nrap~~ 226 (399) ++.|..-++.++..+-+.+..++.. .-+.++-.... ..++.+ . .....-++.|+.|...|.++..|. 
T Consensus 124 ~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~- 202 (345) T protein:vir:22 124 SQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPA- 202 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCc- Confidence 8888877777755454444433321 11111100000 011111 0 111123788999999999999996 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .-++++|.|+.-..|.+ ++.|... .|+.....-+|.||++.|||+++++++-. ..+ T Consensus 203 ----------------~~R~~vv~P~~y~~Ll~------~~~~~~~-~~~~~~~~~~G~V~~i~G~~V~~sn~lp~-~~~ 258 (345) T protein:vir:22 203 ----------------ADRVFYCDPDSYSAILA------ALMPNAA-NYAALIDPEKGSIRNVMGFEVVEVPHLTA-GGA 258 (345) T ss_pred ----------------cCCEEEeChHHHHHHhc------ccccccc-ccccccccccceEEEEeceEEEecccccc-ccc Confidence 23899999999998854 4667665 58888878899999999999999998642 111 Q ss_pred CCCcc-----CCccccccCccceEE------EEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHH Q lcl|NC_019514. 307 GATVG-----TNPGYRETNGKYDIY------PMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSS 375 (399) Q Consensus 307 Ga~~~-----~~~~~~~t~~~~DVy------p~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~g 375 (399) +-... ++...+.+++.+ ++ ..|+|-+.|-+++.+..-. .| .--|+--|.-++= T Consensus 259 ~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~l~~h~~A~~~v~~~~~~------------~e---~~r~~~~~~d~I~ 322 (345) T protein:vir:22 259 GTAREGTTGQKHVFPANKGEGN-VKVAKDNVIGLFMHRSAVGTVKLRDLA------------LE---RARRANFQADQII 322 (345) T ss_pred CccccCccccccccccccccee-eeeccCceEEEEEehhheeeeeeecce------------ee---eeechhHHHHHHH Confidence 11111 111111111111 11 3577777777776433310 01 1226667777777 Q ss_pred HHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 
376 IKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 376 wK~~~~~~iLn~~~m~~ie~~a~ 398 (399) =|..|++.+||++..+.|+.=.. T Consensus 323 ~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 323 AKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHhcCCcccccceeEEEEEeeC Confidence 78999999999999998877666 No 36 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.05 E-value=1.2e-10 Score=74.96 Aligned_cols=321 Identities=15% Similarity=0.072 Sum_probs=172.8 Q ss_pred CCcCCeeecCCCCcccc-cccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSG-IDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT-~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+.-.---|. .+.+-+ ..+..---+...-|..+.+..=+..-+|..+-..+++ ..|++++|.|...... T Consensus 1 ~a~~~~~~~~-~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i--~~G~sv~~~~iG~~~~------- 70 (347) T protein:vir:88 1 MANATGGQQI-GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTI--QNGKSASFPVMGRTKG------- 70 (347) T ss_pred CCCcccchhh-hccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccc--cCcceEEEeeecceee------- Confidence 7755433332 111111 1112111223333677776654455566666666654 5799999999976411 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .-.+.| -+.|. 
-+|- ....+++.+|-|+=-|-.+=|.+.+....-.+..++++ T Consensus 71 ------~~~~~g----~~l~~---------------~~~~--~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~ 123 (347) T protein:vir:88 71 ------YYLAPG----ENLDD---------------KRKD--IKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSA 123 (347) T ss_pred ------eeeccc----cCCCC---------------CCCC--CccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHH Confidence 111222 11110 0000 11123555666653344333333333333236666777 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCe-----EEecCCCccc-ccccccc-----cCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGT-----IVYTGAATQD-SEITGEG-----ATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~-----v~yag~ats~-~~~t~~~-----~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) +.|+.-++.++..+.+.+..++.. -..+|.-++. ..+++.+ ..+...-++.|+.+.+.|++++.|. T Consensus 124 ~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~--- 200 (347) T protein:vir:88 124 QLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA--- 200 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC--- Confidence 777776777754444444333221 1122211111 1111111 1112223788999999999999996 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) .-+++++.|+.-.+|.+ ++.|.. ..|.+...+-+|.||++.+|++++++++- ....|. 
T Consensus 201 --------------~gR~~vv~P~~y~~Ll~------~~~~~~-~~~~~~~~~~~G~vg~i~G~~V~~s~nlp-~~~~~~ 258 (347) T protein:vir:88 201 --------------GDRRFYCAPEDYSAILS------ALMPNA-ANYAALIDPETGNIRNVMGFEVIEVPHLT-VGGAGD 258 (347) T ss_pred --------------CCCEEEeCHHHHHHHhc------chhhhh-hhhccccchhcceeeeeccceEEEeeccc-cccccc Confidence 23889999999888853 355654 48888778889999999999999999983 211111 Q ss_pred CccCCccc----------cccCccceE----EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHH Q lcl|NC_019514. 309 TVGTNPGY----------RETNGKYDI----YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFS 374 (399) Q Consensus 309 ~~~~~~~~----------~~t~~~~DV----yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~ 374 (399) .+-. ..+ +++.+.+.. ---||+-..|-|++-+..-. .| . --||--|.-.+ T Consensus 259 ~~~~-~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~------------~e-~--~r~~~~~~d~i 322 (347) T protein:vir:88 259 NNPA-DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA------------LE-R--ARRPEFQADQI 322 (347) T ss_pred cccc-ccccccccccccccccccccccccCcEEEEEechhhhhheecccce------------ee-e--eechhhHHHHh Confidence 1100 000 111111211 11255556666665333211 01 1 12666677777 Q ss_pred HHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 375 SIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 375 gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ==|..|++.+||++..+.|++-+.- T Consensus 323 ~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 323 IGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhhhhcCceeccceEEEEEeCCCC Confidence 7789999999999999999987666 No 37 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.04 E-value=3.3e-11 Score=78.03 Aligned_cols=306 Identities=12% Similarity=0.031 Sum_probs=179.9 Q ss_pred CCcCCeeecCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+...+.=.. ++.+...+. +-|+ + ..+.++..++..++.+++...+|+.+. +++-+.. . T Consensus 1 m~~~~~~a~~--~~~t~~~g~~i~~~----~-~~~ii~~~~~~s~l~~~~~~~~~~~~~---~~~p~~~----------~ 60 (330) T protein:vir:77 1 MAGSTVPSTQ--VALTGDFSAFLTPE----Q-SQDYFAEIEKTSIVQRIARKVPMGPTG---ISIPHWT----------G 60 (330) T ss_pred Ccccccchhh--ccccCCCcceechh----H-HHHHHHHHHhccchhhhcceeeccCCc---eEEEEEc----------C Confidence 8776633221 222222222 3333 3 357888888999999999988877533 3333331 1 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .+...|..||. .+. ....+...++.+.++++.++++|+++++ +++.++...+.+ T Consensus 61 --~~~a~~v~Eg~---------~~~--------------~~~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~ 114 (330) T protein:vir:77 61 --AVSASWTGEAE---------RKP--------------ITKGSFGKQELEPVKITTIFAESAEVVR-LNPLNYLNTMRT 114 (330) T ss_pred --CcceeEecCCC---------ccc--------------cccceeeEEEEeEEEEEEeehhhHHHHh-cchHHHHHHHHH Confidence 12334666652 122 2233455688899999999999999665 455567777777 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeE-----EecCCCccc---ccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTI-----VYTGAATQD---SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v-----~yag~ats~---~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) .|.+..+.-.+ .-+++|.+.- +.++..... .......+......+++|.++...|..+.... 
T Consensus 115 ~l~~ai~~~~~----~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~------ 184 (330) T protein:vir:77 115 KIAEAIALKFD----AAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW------ 184 (330) T ss_pred HHHHHHHHHHH----HHhhcccCCCCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCc------ Confidence 77776654443 4555655421 111110000 00011223344556788888887777655432 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) -..+|||.+...|+.|+|--+.+-|.+..+-++.. ..+-+.+-|+.++.++.|.. | T Consensus 185 -------------~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~---~~~~~~l~G~PV~~~~~~p~----~---- 240 (330) T protein:vir:77 185 -------------TGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVG---AIREGRILGRPTYVADNVVN----G---- 240 (330) T ss_pred -------------cEEEEcHHHHHHHHHHhccCCceeecCcccccccc---ccCCceecceeeEEeccccC----C---- Confidence 23579999999999999988888888765555443 33567888999999988731 1 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCC----CCCCCCCccchhhHHHHH--HHHHH Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGE----ATADRNDPYGEMGFSSIK--WYYGT 382 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~----~~ad~~DPlgQrg~~gwK--~~~~~ 382 (399) +++++ +.+++|.-+...++..++= .+++. ...-|. ......--+-|++...|| +++.+ T Consensus 241 ------~~~~~----~~~~~gd~s~~~i~~~~~~---~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~ 307 (330) T protein:vir:77 241 ------TVGNR----VVGVMGDFSQVIWGQIGGL---SFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAF 307 (330) T ss_pred ------CCCCc----cEEEEEecceEEEEEecCc---EEEEeecceeeecccccccccccccchhhcCcEEEEEEEEecc Confidence 11222 2366777666666665431 12221 111110 000011123567778888 58899 Q ss_pred hhccccceEEEEEeccC Q lcl|NC_019514. 
383 LILRPERLALVKTVAPL 399 (399) Q Consensus 383 ~iLn~~~m~~ie~~a~~ 399 (399) .+.+++-+++|+.++.- T Consensus 308 ~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 308 MVNDKDAFVKLTDQVAG 324 (330) T ss_pred EEecccceEEEEeccCC Confidence 99999999999887766 No 38 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.03 E-value=8.4e-11 Score=75.82 Aligned_cols=315 Identities=10% Similarity=-0.002 Sum_probs=170.8 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |.|=|+-+.+.-.....--.+...-|..+.++.=+-.-+|..+=..+.+ ..||+.+|.|-... T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti--~~gkS~q~~~iG~~--------------- 63 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEV--VGTNSVSNKYIGET--------------- 63 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEeeeeeee--------------- Confidence 4444443333332222112222223566666653334445555556654 48999999888543 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH-HHHHHHHHHHHh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE-LFSHISTELMNG 164 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~-l~~~~~~~lg~~ 164 (399) -.....+|..+|-..+.....+++ |-+.=-+-.+-+.+.+...+=. |-.++++++|+. 
T Consensus 64 --~~~~~~~G~~ld~~~~~~~k~~it-------------------ID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~A 122 (364) T protein:vir:10 64 --ELQVLSPGKSPDASPTEFDKNRLV-------------------VDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKK 122 (364) T ss_pred --EEeeeccCcccCCCCcccCcEEEE-------------------ecceeeechhhhhHHHHhcCccchhHHHHHHHHHH Confidence 123333444444333333333333 2221112222333344444433 566777777777 Q ss_pred hhHHHHHHHHHHHHhcC--CeEEec-----CCCcccccccccccCCceec----HHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 165 AVQLTEAVLQKDLLAGA--GTIVYT-----GAATQDSEITGEGATPSVVD----YDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 165 a~~~~e~~l~~~~lag~--~~v~ya-----g~ats~~~~t~~~~~~~~vt----~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) -++.++..+.+.+++++ +..-+. .++.....+. -.+.....+ ++.+..+...|.++..|. T Consensus 123 LA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~i~~~-~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~-------- 193 (364) T protein:vir:10 123 LKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFSIHIV-GLASSFLTSPQYMMAAIEMAMEQQTEQEVDT-------- 193 (364) T ss_pred HHHHHHHHHHHHHHhhhhhcccccccCCcccCCcceeeec-ccCcchhhhHHHHHHHHHHHHHHHhhcCCCc-------- Confidence 77776555555554442 111110 0001011111 001112233 333446778899988885 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcC--CccccccccceeEcCeEEEecCccchhc-ccC--- Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYA--DAGTILNGEIGTVDQFRLVVVPEMLHWA-GAG--- 307 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya--~~~~i~~gEIG~i~~vRfV~~~~~~~~~-~aG--- 307 (399) .-+++++.|..-+.|.+ ++.|++. +|+ ......+|+|+++.|||+++++++ |+. 
+.+ T Consensus 194 ---------~~R~~vv~P~~y~~Ll~------~~~lvn~-d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~l-P~~~~~~~~t 256 (364) T protein:vir:10 194 ---------SELCGLMPWTAFNCLRD------ADRIVDK-SYTIAASDNTVDGFVLKSWNTPIVPSNRF-PKLSDNTEGT 256 (364) T ss_pred ---------cccEEEeChHHHHHHhc------CCccccc-cccccCCCccccceeEEEeceEEEecccc-cccccccccc Confidence 23899999999988864 4678877 454 445578999999999999999998 553 211 Q ss_pred CCccCCcc-ccccCccce------EEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHH Q lcl|NC_019514. 308 ATVGTNPG-YRETNGKYD------IYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYY 380 (399) Q Consensus 308 a~~~~~~~-~~~t~~~~D------Vyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~ 380 (399) +..+..+. ...+++.|+ -...++|=++|-+++.+..-- ..- -.|+--|..++=-|..| T Consensus 257 ~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t---------~e~------~~~~~~~~~~ida~~a~ 321 (364) T protein:vir:10 257 GNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISIT---------GDI------FYEKKEKTWYIDTFLAE 321 (364) T ss_pred ccccccccccccCCcccccccccceeEEEEEecceEEEEEEecce---------eee------eeccceeeeeeeeehcc Confidence 11111111 122344544 455788888888887555411 100 11334455555558999 Q ss_pred HHhhccccceEEEEEeccC Q lcl|NC_019514. 381 GTLILRPERLALVKTVAPL 399 (399) Q Consensus 381 ~~~iLn~~~m~~ie~~a~~ 399 (399) ++.+||++.-+.|.++++= T Consensus 322 G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 322 GAIPDRWEAVAVVTAADTA 340 (364) T ss_pred cCcccCccceEEEEecCCC Confidence 9999999999999998877 No 39 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=99.01 E-value=5.8e-11 Score=76.72 Aligned_cols=302 Identities=14% Similarity=0.080 Sum_probs=149.2 Q ss_pred CCcCCeeecCCCCccccccccc-ccceehhhhhHHHHHHHHHHHHhhhhccc--ccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPD-GKQMNTFFWWKKALIEARKDQYFMPLADV--VSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i-~p~m~~~y~~kk~L~~A~p~lv~~~fA~~--~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+ ++++.| .|| -|.+++|+.-+..+||.++... ++-.++.|+||++++-..+. .+ T Consensus 1 m~--------------~~~N~~ltp~----iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~----v~ 58 (418) T protein:vir:10 1 MA--------------VQDNNLLTDD----VIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK----SA 58 (418) T ss_pred CC--------------ccccccccHH----HHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee----ec Confidence 22 222222 244 5889999999999999887764 23336789999998754431 11 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeee-eecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQ-KFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~-qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) + |. +|+.++++ ...++.+|- +-.+-.+++|+-.....+ .+... T Consensus 59 d------g~---------------~~~~~~~t--------------e~~v~l~id~~k~~~~~itD~e~a~~~~-d~~~~ 102 (418) T protein:vir:10 59 S------GR---------------TLVKQPMV--------------DQTIPFKIAYQEHVGLEYTVKDKTLDIM-QFSER 102 (418) T ss_pred c------cC---------------Cccccccc--------------cceEEEEEecccccceeechHHHhhhhh-HHHHH Confidence 1 11 11222211 122344552 333445667653322222 34433 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) +.+..+..-++-+|.-+ ..+++++.. .. +..++.. 
-.++++..+.+.|.++++|+- T Consensus 103 ~l~~A~~aLA~~vD~~i-a~l~~~a~~--------~~---gt~gt~~--~~~~~i~~a~~~Ld~~~VP~~---------- 158 (418) T protein:vir:10 103 YLKSGMVQIANQIDRSL-ALTLKKAFH--------SS---GTPGVRP--GAFIDFANAGAKQTTYAVPQD---------- 158 (418) T ss_pred HHHHHHHHHHHHHHHHH-HHHHhhccc--------cc---ccCCcCc--chHHHHHHHHHHHHhcCCCCC---------- Confidence 33333333333332222 122333321 11 1111111 258999999999999999961 Q ss_pred CccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc-cc--------C Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA-GA--------G 307 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~-~a--------G 307 (399) .-+++++.|+....|.+ ++.|.. .+++..+.+-+|+||++.||+++++.+.-.-. +. | T Consensus 159 ------G~R~lVv~P~~~~~L~~------~~~~~~-~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~g 225 (418) T protein:vir:10 159 ------GMRHAVLDPFTCASLSD------EVTKLF-KESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNG 225 (418) T ss_pred ------CceEEEeCHHHHHHHhh------hccccc-cccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeec Confidence 12788899999888853 244543 57788888889999999999999988865322 10 0 Q ss_pred CCccCCcc------cc--cc-----------------------------------------CccceEEEEEEEccc---- Q lcl|NC_019514. 308 ATVGTNPG------YR--ET-----------------------------------------NGKYDIYPMLCVGAE---- 334 (399) Q Consensus 308 a~~~~~~~------~~--~t-----------------------------------------~~~~DVyp~lV~G~~---- 334 (399) +...+... .+ .+ ++.+.+||-|+-+.. T Consensus 226 a~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~ 305 (418) T protein:vir:10 226 TVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINN 305 (418) T ss_pred ccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccccc Confidence 10000000 00 00 111222332210000 Q ss_pred ---------ceeee----------------------ccccCCCCccceEEEe----cCCCCCCCC-CCcc---------- Q lcl|NC_019514. 
335 ---------SFTTI----------------------GFQTDGKTLKFKVTTK----MPGEATADR-NDPY---------- 368 (399) Q Consensus 335 ---------Afg~v----------------------~l~g~g~~~~~~~ivk----~pG~~~ad~-~DPl---------- 368 (399) +|..+ +|.. .+..|..+.. +.|...... .+|+ T Consensus 306 ~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f--~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~ 383 (418) T protein:vir:10 306 ENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLF--HRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGA 383 (418) T ss_pred cccccccccCCCcccccccCcceeeeecccccceeeeeee--ecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEc Confidence 00000 0000 0011222222 223211111 1333 Q ss_pred ----chhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 369 ----GEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 369 ----gQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+--.+.|=.+|++..|++||.+||==.|-- T Consensus 384 ~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 384 YDINEQSEIHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred ccccccceEEEEEeecCceeecccceEEEEeecCC Confidence 2223334445999999999999887644444 No 40 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.00 E-value=2.8e-11 Score=78.48 Aligned_cols=293 Identities=14% Similarity=0.142 Sum_probs=167.8 Q ss_pred CCc----------------CCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEE Q lcl|NC_019514. 1 MAS----------------KGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRV 64 (399) Q Consensus 1 ~~~----------------~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~ 64 (399) |++ ++..+|.-..+.++..+.+-|+ . +..+.++.++..-++.+++...+|+. 
.++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~---~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLN---D-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceech---h-HHHHHHHHHHhhchhhhhcceeeccC---CceEE Confidence 332 2333333222322323333333 2 35677778888889999999988874 34555 Q ss_pred EEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhh Q lcl|NC_019514. 65 YHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQES 144 (399) Q Consensus 65 rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~ 144 (399) -++.. .+...|..||- .+... ..+...++...++++.++.+|+++ T Consensus 74 p~~~~------------~~~a~~v~Eg~---------~~~~~--------------~~~f~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:96 74 TFWAD------------KPGAYWVGEGQ---------KIETS--------------KATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred EEEec------------CcceeeecCCc---------ccccc--------------ccceeEEEEEeEEEEEeehhhHHH Confidence 44421 12234566652 22222 234456888999999999999997 Q ss_pred hhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 145 LDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 145 ~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) ++ +++.++...+.+++.+..+.-+|. .+++|.+.--...+-.... ......+...+++++|.++.-.|..+... 
T Consensus 119 l~-ds~~~l~~~i~~~l~~aia~~~d~----~~l~G~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~~~~~~i~~~~~~ 192 (324) T protein:vir:96 119 LN-YTYSQFFEEMKPMIAEAFYKKFDE----AGILNQGNNPFGKSIAQSI-KKTNKVIKGDFTQDNIIDLEALLEDDELE 192 (324) T ss_pred Hh-cchHHHHHHHHHHHHHHHHHHHHH----HhhhcCCCCCcCccccccc-cccceecccccchHHHHHHHHhhhhccCC Confidence 76 455568888888888876665544 3344433211111111100 11122334557899999998887664332 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) . + ..+|||.....|+.++|.-+.+.|. .+--+++-|+.++.++... T Consensus 193 ~-----------------~--~~i~n~~~~~~L~~lkd~~G~~~~~------------~~~~~~l~G~PV~~~~~~~--- 238 (324) T protein:vir:96 193 A-----------------N--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN--- 238 (324) T ss_pred C-----------------C--EEEEcHHHHHHHHHhhCCCCCeeec------------CCCCCcccceeeEeecCCC--- Confidence 1 1 2579999999999998765554442 1234567788877654310 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCc--cchhhHHHHH-- Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDP--YGEMGFSSIK-- 377 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DP--lgQrg~~gwK-- 377 (399) . ++. .+++|.-+.-.+++.++ ..++.. ....+. ..+.-+ |-|+..+.|+ T Consensus 239 -------------~--~~~----~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~--~~~~~~~~~~~~n~v~~r~~ 294 (324) T protein:vir:96 239 -------------L--KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTVK--NEDGTPVNLFEQDMVALRAT 294 (324) T ss_pred -------------C--Ccc----eEEEEecceEEEEEecC---cEEEEeecccccccc--cccccchhhhhcCcEEEEEE Confidence 0 011 25677665555554442 122222 111111 111112 3466777777 Q ss_pred HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
378 WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++.+.+++++-+++|+.+... T Consensus 295 ~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 295 MHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred EEeccEEecccceEEEeccccc Confidence 7789999999999999987777 No 41 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.98 E-value=5.8e-11 Score=76.70 Aligned_cols=292 Identities=13% Similarity=0.131 Sum_probs=167.7 Q ss_pred CCcCCeeecCCCCccccc-ccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGI-DAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~-~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) +..++..+|. +++.++. .+.+-|+ .+..+.++.+...-++.+++.+.+|+.+ ++++-+... T Consensus 17 ~~~~~~~~~a-~~~~~~~~~~~liP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~---------- 78 (324) T protein:vir:93 17 NNVKPQVFNP-DNVMMHEKKDGTLLN----DFTTPILQEVMENSKIMQLGKYEPMEGT---EKKFTFWAD---------- 78 (324) T ss_pred hhhhhhhccc-ccccccCCCcceech----hHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEec---------- Confidence 4445566654 4333333 2323333 2456777778888889999998888743 344433311 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .+...++.||- .+. ....+...++...++++.++.+|+++++ +++..+...+.. 
T Consensus 79 --~~~a~~v~Eg~---------~~~--------------~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 132 (324) T protein:vir:93 79 --KPGAYWVGEGQ---------KIE--------------TSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKP 132 (324) T ss_pred --CcceeeecCCc---------ccc--------------ccccceeEEEEEeEEEEEeehhhHHHHh-cchHHHHHHHHH Confidence 12234566552 122 2233455688899999999999998776 455568888888 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) ++.+..+.-.+.. ++.|.+.--...+-.. ........+...+++++|.++.-.|..+.... T Consensus 133 ~l~~aia~~~d~a----~l~G~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-------------- 193 (324) T protein:vir:93 133 MIAEAFYKKFDEA----GILNQGNNPFGKSIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-------------- 193 (324) T ss_pred HHHHHHHHHHHHH----HhcCCCCCCcCccccc-cccccceeccccccHHHHHHHHHhhhhccCCC-------------- Confidence 8887766655443 3444332111110000 00111223445678999999998887754321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) + ..+|||.....|+.|+|.-+.+-|. .+--+++-|+.++.++... . T Consensus 194 ----~-~~v~n~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PVv~~~~~~---------~-------- 239 (324) T protein:vir:93 194 ----N-AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN---------L-------- 239 (324) T ss_pred ----C-EEEEcHHHHHHHHHhhCCCCCeeec------------CCCCCcccceeeEeecCCC---------C-------- Confidence 1 3578999999999998766555442 1234566777777654310 0 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEE---ecCCCCCCCCCCc--cchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTT---KMPGEATADRNDP--YGEMGFSSIK--WYYGTLILRPERLAL 392 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~iv---k~pG~~~ad~~DP--lgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) ++. .+++|.-+...+++.++ ..+++.. ...+. ..+.-+ +-|+..+.++ +++++.+++++-+++ T Consensus 240 -~~~----~i~~gdfs~~~~~~~~~---~~i~~~~~~~~~~~~--~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~ 309 (324) T protein:vir:93 240 -KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTVK--NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) T ss_pred -Ccc----eEEEEecceEEEEEecC---cEEEEeecccccccc--cccccchhhhhcCcEEEEEEEEeccEEecccceEE Confidence 111 24577655554544432 2233221 11111 111112 3456667766 678999999999999 Q ss_pred EEEeccC Q lcl|NC_019514. 393 VKTVAPL 399 (399) Q Consensus 393 ie~~a~~ 399 (399) |..|... T Consensus 310 l~~a~~~ 316 (324) T protein:vir:93 310 LVPADKR 316 (324) T ss_pred Eeccccc Confidence 9877666 No 42 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.97 E-value=2.7e-10 Score=73.06 Aligned_cols=321 Identities=15% Similarity=0.091 Sum_probs=171.1 Q ss_pred CCcCCeeecCCCCcccccccccccceehh--hhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTF--FWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~--y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+.-.- =+..++.+-- .+..+....+| -|+.+.+..=+..-+|..+-..+++ ..|++++|-|..... 
T Consensus 1 ma~~~~-~~~~~t~~~~-~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~--~~G~sv~i~~ig~~t------- 69 (347) T protein:vir:15 1 MANIQG-GQQIGTNQGK-GQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTK------- 69 (347) T ss_pred CCcccc-CCcccccccc-CCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccc--cccceeEeeecccee------- Confidence 654321 0111111111 11222222232 2566666554455566666666543 469999999986631 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) ..-.+.| .+.|. |+...+..+.+.+|-|+=-|-.+=|.+.+....-.+..+++ T Consensus 70 ------~~~~~~g----~~l~~-----------------~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~ 122 (347) T protein:vir:15 70 ------AAYLKPG----ENLDD-----------------KRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYT 122 (347) T ss_pred ------eeeeccC----CCCCC-----------------CCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHH Confidence 1112222 11111 01111122344555565444444455554444434677777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCe-------EEecCCCccccccccc--ccCCceec----HHHHHHHHHHHHhccCcc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGT-------IVYTGAATQDSEITGE--GATPSVVD----YDDLMRLSITLDENRTPK 225 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~-------v~yag~ats~~~~t~~--~~~~~~vt----~~~lr~a~~~L~~nrap~ 225 (399) ++.+..-++..+..+...+..++.. .-.+|..+.....+.. ...+.... ++.++.+.+.|.++..|. 
T Consensus 123 ~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~ 202 (347) T protein:vir:15 123 AQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPA 202 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCc Confidence 7777776666654444444322211 1111111111111110 01111111 778888999999999985 Q ss_pred ccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 226 QTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 226 ~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) ..+++++.|+.-.+|.. ++.|+.+ .|+....+.+|.||++.||++++++++-.... T Consensus 203 -----------------~gR~~vv~P~~y~~LL~------~~~~~~~-d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~ 258 (347) T protein:vir:15 203 -----------------ADRTFYTTPDNYSAILA------ALMPNAA-NYQALIDHERGTIRNVMGFEVVEVPHLTAGGA 258 (347) T ss_pred -----------------cCCEEEeCHHHHHHHhc------ccccccc-cccccccccceEEEEEeceEEEeccccccccc Confidence 23889999999999943 4678766 57877888999999999999999999753322 Q ss_pred cCCCccC----Cccc-cc----cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHH Q lcl|NC_019514. 306 AGATVGT----NPGY-RE----TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSI 376 (399) Q Consensus 306 aGa~~~~----~~~~-~~----t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gw 376 (399) ++...++ .-.+ .. .....+....|++-+.|.|++-++.- ++- .--|+--|.-.+-- T Consensus 259 t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~------~~e---------~~~~~~~~~d~i~~ 323 (347) T protein:vir:15 259 GDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDL------ALE---------RARRANYQADQIIA 323 (347) T ss_pred ccccccccccccccccccccceeeeccccceeeeeccceeeeeEeece------eee---------ecccchhhhhhheh Confidence 2111100 0000 00 01223455678999999998855441 111 11266666666666 Q ss_pred HHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
377 KWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 377 K~~~~~~iLn~~~m~~ie~~a~~ 399 (399) |..|++.+||++..+.|+-- .+ T Consensus 324 ~~~~G~~vlrP~~av~~~~~-~~ 345 (347) T protein:vir:15 324 KYAMGHGGLRPEAAGAIVLP-KV 345 (347) T ss_pred hhhcCCceeccccEEEEecC-CC Confidence 78899999999997777421 11 No 43 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.97 E-value=2.6e-10 Score=73.14 Aligned_cols=320 Identities=14% Similarity=0.069 Sum_probs=183.5 Q ss_pred CCc--CCeeec-CCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MAS--KGMLYN-DPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~--~~~~~n-~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+. -++.+| +|..-..+ +.--.+...-|+.+.++.=+..-+|..+-..+.+ ..|++++|.|...... T Consensus 1 ma~~~~~~~~~t~~g~~~~~---~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti--~~G~sv~~~~iG~~~~----- 70 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSA---GDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSI--QSGKSAQFPVLGRTKA----- 70 (347) T ss_pred CCccccccccccccccCCcc---cchHHHHHHHHhHHHHHHHHHHHhhhhhhhheec--cccceEEeeeccceeE----- Confidence 663 233343 22211111 1112244445788888776677788888888876 3599999999866411 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) .-.+. |-+.|. +.+++. 
..+.+.+|-++--|-.+-|.+.+....-.+..++ T Consensus 71 --------~~~~~----G~~l~~---~~~~~~--------------~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~ 121 (347) T protein:vir:94 71 --------AYLQP----GENLDD---KRKDMK--------------HTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEY 121 (347) T ss_pred --------eeeec----CcCCCC---CcCCcc--------------ccceEEEEcchhhhhhhhhhHHHHhcCcchHHHH Confidence 11222 222211 001111 1234455666544444444445555443477777 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEE-----ecC-CCccccccccc------ccCCceecHHHHHHHHHHHHhccCcc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIV-----YTG-AATQDSEITGE------GATPSVVDYDDLMRLSITLDENRTPK 225 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~-----yag-~ats~~~~t~~------~~~~~~vt~~~lr~a~~~L~~nrap~ 225 (399) +++.|..-++.++..+.+.++.++...- ..| ...+...+... .+.+...-++.|+++...|+++..|. T Consensus 122 ~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~ 201 (347) T protein:vir:94 122 TAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPS 201 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCC Confidence 8888877777775555555544433110 111 11111111111 01111223788999999999999985 Q ss_pred ccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 226 QTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 226 ~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) .-+++++.|..-..|.... .+.. ..|..-..+-+|.||++.||++++++++-.|.. 
T Consensus 202 -----------------~~R~~vv~P~~y~~LLk~~------~~~~-~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~ 257 (347) T protein:vir:94 202 -----------------SDRVFYTTPDNYSAILAAL------MPNA-ANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGA 257 (347) T ss_pred -----------------CCCEEEeChHHHHHHHHhh------cccc-cccccccccccceeEEeeceEEEEcCccccccC Confidence 2389999999998885422 2332 366666667789999999999999999977753 Q ss_pred cCCCccCC----cc----ccccCccceE----EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhH Q lcl|NC_019514. 306 AGATVGTN----PG----YRETNGKYDI----YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGF 373 (399) Q Consensus 306 aGa~~~~~----~~----~~~t~~~~DV----yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~ 373 (399) .....++. +. .+.++++|++ -.-||+-.+|-+++-+...- ++. --|+--|..+ T Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~----~e~-----------~~~~~~~~~~ 322 (347) T protein:vir:94 258 GDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMA----LER-----------ARRANFQADQ 322 (347) T ss_pred cccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccc----eee-----------eechhhhhhh Confidence 22221111 00 0122223321 12477778887777554421 111 1377888888 Q ss_pred HHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 374 SSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 374 ~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) .=-|..|++.+||+|.-+.|+.-+- T Consensus 323 i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 323 IIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhcCcccccceeEEEEecCC Confidence 8889999999999999987776655 No 44 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.95 E-value=3.9e-11 Score=77.63 Aligned_cols=292 Identities=18% Similarity=0.150 Sum_probs=165.5 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) +-.++..-+.-...+++..+-+-|+ -+..+.++..++.-++.+++...+++.+.++-.+ .. .+ T Consensus 120 ~l~~~e~~~al~~~t~~~gG~lvP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~--~~-----------~~ 182 (425) T protein:vir:10 120 HVKRGDVQAALNKGEDSEGGYLTPI----EWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLF--NM-----------GG 182 (425) T ss_pred HhhhhhhHHHhhcCcCCCCceeccH----hHHHHHHHHHHhhhhhhhhceeeeccCCceEEEE--Ec-----------CC Confidence 1111111111111112222223343 2456677778888899999999888765432221 11 11 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) +...|..||-.. ++ .-..+...++.+.++++.|+.+|+++++ ++...+.+.+... T Consensus 183 --~~a~wv~E~~~~---~~-------------------~~~~~f~~v~~~~~k~~~~i~iS~ell~-ds~~~l~~~i~~~ 237 (425) T protein:vir:10 183 --TTSGWVGEASQR---PQ-------------------TNAATFQPLSFASGEIYANPAATQQILD-DAEIDLESWLATE 237 (425) T ss_pred --cceeeecccccc---cc-------------------ccccccceeeeeheeeEeehHhHHHHHh-cchhHHHHHHHHH Confidence 122455554211 11 0012345678889999999999999776 4445588877778 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCcccc------cccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIV------YTGAATQDS------EITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~------yag~ats~~------~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) |.+.-+...+ ..+++|.++-. +..+.+... ...........+++++|.++...|+..-... 
T Consensus 238 la~ai~~~~d----~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~--- 310 (425) T protein:vir:10 238 VQTEFAKQEG----KAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGN--- 310 (425) T ss_pred HHHHHHHHHH----hhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccC--- Confidence 8777665443 34666644311 111111000 0001112345688999988887776532211 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) -+-++||.+...|+.|+|--+.|-|.|-. -.|.-+++-|..++.++.|.. .++| T Consensus 311 ----------------a~~vmn~~~~~~L~~lkD~~G~~l~~~~~--------~~g~~~~l~G~PV~~~~~~p~-~~~~- 364 (425) T protein:vir:10 311 ----------------ARFAMNRNTQRQVRKLKDGQGNYLWQPSY--------VAGQPATLAGYPVTEVPDMPD-VAAN- 364 (425) T ss_pred ----------------CEEEEchHHHHHHHHhhcCCCceeeccCc--------cCCCCceecceeeEEecCcCC-ccCC- Confidence 13479999999999999888888786532 234556788889998887621 1110 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH--HHHHhhcc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW--YYGTLILR 386 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~--~~~~~iLn 386 (399) .. .++||.-+.+..-....+ +++ ..|||.+++.++|++ ++.+.+++ T Consensus 365 ----------------~~-~i~~Gd~~~~~~i~~~~~----~~v-----------~~d~~~~~~~~~~~~~~r~d~~v~~ 412 (425) T protein:vir:10 365 ----------------ST-PILFGDFQQTYLIIDRIG----VRV-----------LRDPYTAKPYVLFYTTKRVGGGLLN 412 (425) T ss_pred ----------------cc-EEEEEehhccEEEEEecc----eEE-----------EecccccCCcEEEEEEEEeccEeec Confidence 11 245675432222121122 221 137888888888884 48899999 Q ss_pred ccceEEEEEeccC Q lcl|NC_019514. 
387 PERLALVKTVAPL 399 (399) Q Consensus 387 ~~~m~~ie~~a~~ 399 (399) ++-++.|+.+|.= T Consensus 413 ~~A~~~l~~~as~ 425 (425) T protein:vir:10 413 PEPMRAMKVAASE 425 (425) T ss_pred ccceEEEEeeccC Confidence 9999999988877 No 45 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.94 E-value=2.6e-10 Score=73.11 Aligned_cols=299 Identities=11% Similarity=0.021 Sum_probs=171.3 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+ +.++. +-+-|+ .+..+.++.+++.-++.+++...+|+.+. +++-++.. T Consensus 1 ma-----------t~~~g-g~lvP~----~~~~~ii~~~~~~s~i~~~~~~i~~~~~~---~~~p~~~~----------- 50 (311) T protein:vir:81 1 MV-----------ALATG-TFQLPK----HLVPGVWQKAQGQSVLARLSMAEPQEFGE---QQYMTLTA----------- 50 (311) T ss_pred Cc-----------eecCC-ceEcch----hHHHHHHHHHHhcchhhhhcceeecCCCc---eEEEEEeC----------- Confidence 22 22222 223333 23577888899999999999998887653 34333311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch--HHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS--ELFSHIS 158 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~--~l~~~~~ 158 (399) .+...|..|| +.......+...++...++++.++.+|++++....|+ .+.+.+. 
T Consensus 51 -~~~a~wv~Eg-----------------------~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~ 106 (311) T protein:vir:81 51 -PPRGEVVGEG-----------------------AQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMA 106 (311) T ss_pred -CceeEEeecC-----------------------cccccccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHH Confidence 1223566665 2222333455678888999999999999988655444 4666666 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCC----eE---EecCCCcccccccccccCCce-ecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAG----TI---VYTGAATQDSEITGEGATPSV-VDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~----~v---~yag~ats~~~~t~~~~~~~~-vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) ..|.+.-+.-.+ ..+++|.+ .. +-++.......++ .+..+. ....++..+...+..++... T Consensus 107 ~~la~ai~~~~d----~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~~~~~~~~~----- 175 (311) T protein:vir:81 107 DLSGVALGRALD----LIGIHGINPLTGAALSGSPAKILDTTNIVE--LTTGTSATPDLAVEAAVGLVLGDNLSP----- 175 (311) T ss_pred HHHHHHHHHHHH----HhhhccccCCCCcccccccccccccceeee--ecccccchHHHHHHHHHHHhhhcCCCc----- Confidence 666665444433 33445421 10 1111100001111 111222 23345666665554433321 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) -..++||.+...|+.|+|--+.+-|.+.. ..+.-|++-|+.++.+..+..=...+.. T Consensus 176 --------------~~~vmn~~~~~~l~~lkd~~G~~l~~~~~--------~~~~~~tl~G~Pv~~~~~i~~~~~~~~~- 232 (311) T protein:vir:81 176 --------------DGVALDNTFSFMLATQRDSQGRKLYPELG--------FGTDVASFAGLNAAVSDTVRGGPEAVTA- 232 (311) T ss_pred --------------eEEEEcHHHHHHHHhhhccCCCeeecCcc--------ccCCCceecceeEEeccccccccccccc- Confidence 23578999999999999888888887541 3346688889998887765321111111 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhcccc Q lcl|NC_019514. 
311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPE 388 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~ 388 (399) ....+....++. .+++|.-+.-.+++.++ +.+-+..-+ .++....|-|++.+.|+ +.+++.+++++ T Consensus 233 -~~~~~~~~~~~~----~~~~gDfs~~~i~~~~~-----~~~~~~~~~--~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~ 300 (311) T protein:vir:81 233 -STGVYRTTNPNV----KAIAGDFSAFRWGVQVS-----IPLELIEFG--DPDGLGDLKRQNQIAIRAEVVYGIGIMSTD 300 (311) T ss_pred -ccchhcccCCcc----EEEEEecccEEEEEecc-----ceEEEeccC--CCCcchhhhhcCcEEEEEEEEeccEeeccc Confidence 111222222233 25678776655655442 222222221 23444467889999998 68899999999 Q ss_pred ceEEEEEeccC Q lcl|NC_019514. 389 RLALVKTVAPL 399 (399) Q Consensus 389 ~m~~ie~~a~~ 399 (399) -+++|+-+..- T Consensus 301 a~~~l~~a~~~ 311 (311) T protein:vir:81 301 AFAVVRDADES 311 (311) T ss_pred ceEEEEeeccC Confidence 99999888777 No 46 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.92 E-value=8.3e-11 Score=75.86 Aligned_cols=290 Identities=16% Similarity=0.138 Sum_probs=166.0 Q ss_pred CCc-CCeeecCCC----Ccccccc-cccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccc Q lcl|NC_019514. 1 MAS-KGMLYNDPN----TTPSGID-APDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDR 74 (399) Q Consensus 1 ~~~-~~~~~n~~~----~t~tT~~-~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~ 74 (399) |+. ...-++... .+.++.. +-+-|+ -++.+.+...+...++.+++...++... .+++.+. 
T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~----~~~~~I~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~------- 155 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPE----ELDRTILTLLKDEVVMRQEATVITLGGS---DYKKLVN------- 155 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccH----hHHHHHHHHHHhhhhhhhhceeeecCCC---ceEEEEe------- Confidence 221 111111111 1112222 223343 3467777778888888889887776532 2222221 Q ss_pred ccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHH Q lcl|NC_019514. 75 NVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELF 154 (399) Q Consensus 75 ~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~ 154 (399) ..+ +...|..|+... ++. ...+...++..+++++.|+.+|+++++ +++..+. T Consensus 156 ---~~~--~~a~~v~E~~~~---~~~-------------------~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~ 207 (407) T protein:vir:48 156 ---LGG--TTSGWVGETDAR---PET-------------------ATSKLGLIEPFMGEIYGNPQATQKMLD-DAFFNVE 207 (407) T ss_pred ---cCC--cceeeecccccc---ccc-------------------ccccceeEEeeeeeeEeehhhHHHHHh-cchHHHH Confidence 111 122456665321 110 012345678889999999999999776 4544577 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCccc------ccccccccCCceecHHHHHHHHHHHHhcc Q lcl|NC_019514. 155 SHISTELMNGAVQLTEAVLQKDLLAGAGTIV------YTGAATQD------SEITGEGATPSVVDYDDLMRLSITLDENR 222 (399) Q Consensus 155 ~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~------yag~ats~------~~~t~~~~~~~~vt~~~lr~a~~~L~~nr 222 (399) ..+..+|.+..+...+. -+++|.+.-. ++...... 
........+...+++++|.++...|+..- T Consensus 208 ~~i~~~l~~~i~~~~~~----a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~ 283 (407) T protein:vir:48 208 DWINSELALEFAEQEEI----AFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAH 283 (407) T ss_pred HHHHHHHHHHHHHHHHh----hhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhh Confidence 77777777766544433 3556543311 11111100 00011123446689999999998886532 Q ss_pred CccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch Q lcl|NC_019514. 223 TPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH 302 (399) Q Consensus 223 ap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~ 302 (399) .. .++| ++|+.+..-|+.|+|--+.|-|.|--+ .|..+++-|..++.++.|.. T Consensus 284 ~~-----------------~a~~--v~n~~~~~~L~~lkD~~Gr~l~~~~~~--------~g~~~~l~G~PV~~~~~~p~ 336 (407) T protein:vir:48 284 RS-----------------GAKF--MMNNSSLFAIRLLKDNDGNYLWRPGIE--------LGQPSSLAGYGIVENEQMPD 336 (407) T ss_pred hc-----------------CCEE--EEcHHHHHHHHHhhccCCceeeccCcC--------CCCCceecceeeEEecCcCC Confidence 21 1234 589999999999998877788876422 34556778889888887621 Q ss_pred hcccCCCccCCccccccCccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH-- Q lcl|NC_019514. 303 WAGAGATVGTNPGYRETNGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW-- 378 (399) Q Consensus 303 ~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~-- 378 (399) . ++ ++ ..++||.= +|-...- .| +++ . .|||.+++.++|++ T Consensus 337 ---~----~~-------~~-----~~i~~Gd~~~~~~i~~~--~~----~~i--~---------~d~~~~~~~~~~~~~~ 380 (407) T protein:vir:48 337 ---I----AA-------DA-----KAIAFGNFKRGYTIVDR--IG----TRI--L---------RDPYTNKPFVGFYTTK 380 (407) T ss_pred ---c----cC-------Cc-----cEEEEEeccccEEEEEe--ec----eEE--E---------eeccccCCcEEEEEEE Confidence 1 11 11 12456753 2322211 11 222 1 27898999999996 Q ss_pred HHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
379 YYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 379 ~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+.+.+++++-++.++.++.- T Consensus 381 r~d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 381 RTGGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred EeccEEecccceEEEEeeccC Confidence 589999999999999998887 No 47 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.91 E-value=1.6e-10 Score=74.23 Aligned_cols=292 Identities=13% Similarity=0.135 Sum_probs=169.9 Q ss_pred CCcC----------------CeeecCCCCcccccc-cccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEE Q lcl|NC_019514. 1 MASK----------------GMLYNDPNTTPSGID-APDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIR 63 (399) Q Consensus 1 ~~~~----------------~~~~n~~~~t~tT~~-~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk 63 (399) |+++ +..++. +++.++.. +.+-|+ .+..+.++.++..-.+.+++...+++. .+++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a-~~~~~~~~~~~lip~----~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ 72 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNP-DNVMMHEKKDGTLLN----DFTTPILQEVMENSKIMRLGKYEPMEG---TEKK 72 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccc-cceeccCCCcceech----hHHHHHHHHHHhhchhhhhcceeeccC---CceE Confidence 5443 333333 22222222 222232 245677777888888999998888773 3455 Q ss_pred EEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhh Q lcl|NC_019514. 64 VYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQE 143 (399) Q Consensus 64 ~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~ 143 (399) +.++.. .+...|..||- .+... 
..+...++.+.+++|.++.+|++ T Consensus 73 ~p~~~~------------~~~a~~v~Eg~---------~~~~~--------------~~~~~~v~~~~~k~~~~~~iS~e 117 (324) T protein:vir:99 73 FTFWAD------------KPGAYWVGEGQ---------KIETS--------------KATWVNATMRAFKLGVILPVTKE 117 (324) T ss_pred EEEEec------------CcceeEeccCc---------ccccc--------------ccceeEEEEeeEEEEEeehhhHH Confidence 555421 23345666662 22222 23445678889999999999999 Q ss_pred hhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccC Q lcl|NC_019514. 144 SLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRT 223 (399) Q Consensus 144 ~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nra 223 (399) +++.. ...+...+..+|.+..+.-.+. .+++|.+.--..++-... .......+...+++++|.++.-.|+.+.. T Consensus 118 ll~ds-~~~l~~~i~~~l~~ai~~~~d~----~~l~G~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~ 191 (324) T protein:vir:99 118 FLNYT-YSQFFEEMKPMIAEAFYKKFDE----AGILNQGNNPFGKSIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDEL 191 (324) T ss_pred HHhcc-hHHHHHHHHHHHHHHHHHHHHH----HhhhcCCCCccCcccccc-ccccceeccccCCHHHHHHHHHhhhhccC Confidence 77644 3458888888888776654443 445554432211111111 11122344567899999999988877543 Q ss_pred ccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchh Q lcl|NC_019514. 224 PKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHW 303 (399) Q Consensus 224 p~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~ 303 (399) .. + +.++||.....|+.|+|.-+.+-|.. +.-+++-|+.++.++.+. 
T Consensus 192 ~~-----------------~--~~v~n~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G~PVv~~~~~~-- 238 (324) T protein:vir:99 192 EA-----------------N--AFISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDGLPVVNLKSSN-- 238 (324) T ss_pred CC-----------------C--EEEEcHHHHHHHHHhhcCCCceeecC------------CCCccccceeEEeecCCC-- Confidence 21 2 24689999999999987665554421 223567777777665321 Q ss_pred cccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc--chhhHHHHH- Q lcl|NC_019514. 304 AGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY--GEMGFSSIK- 377 (399) Q Consensus 304 ~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl--gQrg~~gwK- 377 (399) . ++. .+++|.-+.-.+++.++ ..+++. ..... ...+..++ -|++.+.|+ T Consensus 239 -------~---------~~~----~~i~gd~~~~~~~~~~~---~~i~~~~~~~~~~~--~~~~~~~~~~f~~~~~~~r~ 293 (324) T protein:vir:99 239 -------L---------KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTV--KNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred -------C---------Ccc----eEEEEecccEEEEEecC---cEEEEeeccccccc--ccccccchhhhhcCcEEEEE Confidence 0 111 25667666555555442 223322 11111 11222333 467788887 Q ss_pred -HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 378 -WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 -~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++++.+++++-++.|..+... T Consensus 294 ~~r~d~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 294 TMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEEEccEEecccceEEEEeccCC Confidence 6789999999999999877666 No 48 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.90 E-value=1.7e-10 Score=74.21 Aligned_cols=293 Identities=14% Similarity=0.156 Sum_probs=170.9 Q ss_pred CCcC----------------CeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEE Q lcl|NC_019514. 
1 MASK----------------GMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRV 64 (399) Q Consensus 1 ~~~~----------------~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~ 64 (399) |+++ +..++.-..+.++..+.+-|+ .+..+.++.+++..++.+++...+++. .++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~i 73 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN----EFTTPILQEVMENSKIMQLGKYEPMEG---TEKKF 73 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceech----hHHHHHHHHHHhhcchhhhcceeeccC---CceEE Confidence 5443 222332122222223333343 235677777889999999999888873 34555 Q ss_pred EEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhh Q lcl|NC_019514. 65 YHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQES 144 (399) Q Consensus 65 rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~ 144 (399) .++.- .+...|..||-. + .....+...++.+.++++.++.+|+++ T Consensus 74 p~~~~------------~~~a~~v~Eg~~---------~--------------~~~~~~f~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:97 74 TFWAD------------KPGAYWVGEGQK---------I--------------ETSKATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred EEEec------------CcceeEeccCcc---------c--------------cccccceeEEEEeeEEEEEeehhhHHH Confidence 55421 123356666522 1 122344567888999999999999997 Q ss_pred hhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 145 LDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 145 ~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) ++.. ..++...+...|.+..+.-.+ ..+++|.+.--.+++-.+... .....+.+.+++++|.++.-.|+.+... 
T Consensus 119 l~ds-~~~l~~~i~~~l~~aia~~~d----~a~l~G~g~~~~~~gi~~~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~ 192 (324) T protein:vir:97 119 LNYT-YSQFFEEMKPMIAEAFYKKFD----EAGILNQGNNPFGKSIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELE 192 (324) T ss_pred Hhcc-hHHHHHHHHHHHHHHHHHHHH----HHhhccCCCCccCcccccccc-ccceeccccCCHHHHHHHHHhhhhccCC Confidence 7633 345777777777776554443 455666543322222111111 1223455778999999999888775432 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) . ++ .+|||.....|+.|+|.-+.+.|. .+--|.+-|+.++.++... T Consensus 193 ~-----------------~~--~v~n~~~~~~L~~lkd~~g~~~~~------------~~~~~tl~G~PV~~~~~~~--- 238 (324) T protein:vir:97 193 A-----------------NA--FISKTQNRSLLRKIVDPETKERIY------------DRNSDTLDGLPVVNLKSSN--- 238 (324) T ss_pred C-----------------CE--EEEcHHHHHHHHHhhcCCCceeec------------CCCCccccceeeEeecCCC--- Confidence 1 22 468999999999998866655553 1234567788877664310 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCc--cchhhHHHHH-- Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDP--YGEMGFSSIK-- 377 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DP--lgQrg~~gwK-- 377 (399) . ++. .+++|.-+...++..++ ..+++. .+..+. ..+.-+ +-|+..+.++ T Consensus 239 ------~---------~~~----~~~~gd~~~~~i~~~~~---~~i~~~~~~~~~~~~--~~~~~~~~~f~~d~~~~r~~ 294 (324) T protein:vir:97 239 ------L---------KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTVK--NEDGTPVNLFEQDMVALRAT 294 (324) T ss_pred ------C---------Ccc----eEEEEecccEEEEEecC---cEEEEeecccccccc--cccccchhhhhcCcEEEEEE Confidence 0 011 25677666555655543 122222 122221 111122 2456667766 Q ss_pred HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
378 WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++.+.+++++-++.|+-+.+. T Consensus 295 ~r~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 295 MHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEeccEEecccceEEEEeccCC Confidence 6788999999999999887776 No 49 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.89 E-value=2.4e-09 Score=67.88 Aligned_cols=320 Identities=16% Similarity=0.075 Sum_probs=167.4 Q ss_pred CCcCC--eee-cCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKG--MLY-NDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~--~~~-n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+.-. +.. ..|..-..+.+ --.+...-|+.+.|..=+..-+|..+-..+++ ..|++++|-|...... T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~---~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~--~~G~sv~i~~iG~~t~----- 70 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAAD---KLALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTKA----- 70 (347) T ss_pred CCCCccCcccccccccCCcccc---hHHHHHHHHHHHHHHHHHHHHhhhhhhccccc--cccceeEeeeccceee----- Confidence 66322 112 22211111111 11123334777777765566677777777755 4699999999966411 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) .-.+.| -+.|.. 
.++ ....+.+.+|-|+=-|-.+=|.+.+....-.+..++ T Consensus 71 --------~~~~~g----~~l~~~---~~~--------------~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~ 121 (347) T protein:vir:33 71 --------AYLKPG----ENLDDK---RKD--------------IKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEY 121 (347) T ss_pred --------eeecCC----CCCCCC---CCC--------------CccceEEEEechhhhhhHHHhhHHHHhcCCchhHHH Confidence 111211 111110 000 111223344444322222222223333322356666 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcC-----CeE--EecCCCccc-ccccccccCCcee-----cHHHHHHHHHHHHhccCc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGA-----GTI--VYTGAATQD-SEITGEGATPSVV-----DYDDLMRLSITLDENRTP 224 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~-----~~v--~yag~ats~-~~~t~~~~~~~~v-----t~~~lr~a~~~L~~nrap 224 (399) +++.+..-++.++..+.+.+...+ .+. --.|..+.. ...++.+.+.+.. -++.|+.+...|.++..| T Consensus 122 ~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP 201 (347) T protein:vir:33 122 TAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVP 201 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCC Confidence 666666666666444433322110 000 000111111 1111111111111 267889999999999998 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) . ..+++++.|+.-..|.. ++.|+.. .|+..+.+.+|.||++.||++++++++-.-. 
T Consensus 202 ~-----------------~gR~~vv~P~~y~~Ll~------~~~~~~~-d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~ 257 (347) T protein:vir:33 202 A-----------------ADRTFYTTPDNYSAILA------ALMPNAA-NYQALLDPERGTIRNVMGFEVVEVPHLTAGG 257 (347) T ss_pred c-----------------cCcEEEeCHHHHHHHhc------ccccccc-ccccccccccceeEEEeceeEEEecccccCc Confidence 5 23889999999999853 4677765 6887788999999999999999999974432 Q ss_pred ccCCC----ccCCcccc---ccCccc--eEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHH Q lcl|NC_019514. 305 GAGAT----VGTNPGYR---ETNGKY--DIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSS 375 (399) Q Consensus 305 ~aGa~----~~~~~~~~---~t~~~~--DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~g 375 (399) ..+-. .+..-.+. +..... +--.-|++-++|-|++-++.- ++- ..-|+--|.-.+- T Consensus 258 ~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~------~~e---------~~r~~~~~~d~i~ 322 (347) T protein:vir:33 258 AGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDL------ALE---------RARRANYQADQII 322 (347) T ss_pred cccccccccccccccccCCcccceeccccceeeeeecchhheeeeeece------eee---------eccchhhhhHhhh Confidence 11100 01000111 111111 222347888888888754431 111 1226777777788 Q ss_pred HHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 376 IKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 376 wK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) -|..|++.+||++..+.|+-- .+ T Consensus 323 ~~~~~G~~vlrP~~av~i~~~-~~ 345 (347) T protein:vir:33 323 AKYAMGHGGLRPEAAGAIVLP-KV 345 (347) T ss_pred hhhhcCCceecccceEEEecC-CC Confidence 889999999999998777421 11 No 50 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.87 E-value=3.2e-10 Score=72.64 Aligned_cols=283 Identities=16% Similarity=0.118 Sum_probs=165.4 Q ss_pred CCcCCeee--cCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 
1 MASKGMLY--NDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~--n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+.+.... +.-.++.++..+. +.|+ +..+.+....+...+.+++...+++.+ ++++-+.. T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~g~~vp~~-----~~~~ii~~~~~~~~l~~l~~~~~~~~~---~~~~~~~~--------- 163 (395) T protein:vir:43 101 LRGSHRVSMPRSAITSIDGSGGALVAPD-----RRPGVVAAPQRRLTIRDLVAPGTTESN---SVEYVRET--------- 163 (395) T ss_pred hhhhhhhhhhhhhhcccCCCCccccchh-----hHHHHHHHHHhhhhHHhhccceecCCC---ceEEEEEe--------- Confidence 22222211 1111122222222 2232 356666667788889999999998743 45555541 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) +.++...|..||- .+. ....+...++.++++++.++.+|+++++ +++ .+...+ T Consensus 164 --~~~~~a~~v~E~~---------~~~--------------~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~v 216 (395) T protein:vir:43 164 --GFVNNAAPVSEGT---------QKP--------------YSDLTFELENAPVRTIAHLFKASRQILD-DAS-ALQSYI 216 (395) T ss_pred --cCCCceeeecCCc---------ccc--------------ccccceeEEEEeeeeEEEeehhhHHHHH-hHH-HHHHHH Confidence 1122334555541 111 2234456688999999999999999876 454 477777 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCe-------EEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGT-------IVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~-------v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) ...|.+..+...+. .+++|.++ .-..+.. ..+...++.....++++.++.-.|+.+..+. 
T Consensus 217 ~~~la~a~~~~~d~----~~l~G~g~~~~~~Gi~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~----- 283 (395) T protein:vir:43 217 DARARYGLMLVEEC----QLLYGNGTGANLHGIIPQAQAY----APPSGVVVTAEQRIDRIRLAILQAQLAEFPA----- 283 (395) T ss_pred HHHHHHHHHHHHHH----HHHhccCCCCcccccccccccc----ccccccccccchhHHHHHHHHHhhccccCCC----- Confidence 77777776655543 44555432 1111111 1122223445567888888887776644432 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) =+.++||.+...|+.|+|..+.+-|.+ ...+.-+.+-|+++|+++.+.. T Consensus 284 --------------~~~vmn~~~~~~l~~lkd~~G~~i~~~---------~~~~~~~~l~G~pVv~~~~~~~-------- 332 (395) T protein:vir:43 284 --------------SGIVLNPIDWALIELNKDAENRYIIGS---------PQNGTTPTLWRLPVVETQAITQ-------- 332 (395) T ss_pred --------------cEEEEcHHHHHHHHHhhccCCceeccc---------cccCCCceecceeeEEcCCCCC-------- Confidence 135799999999999987666555532 2455667888999999987521 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhcccc Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPE 388 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~ 388 (399) + . +++|.-......+...+ +.+-+- +-.+.+-+++.+.|+ +++.+.+++++ T Consensus 333 --~--------~------~~~gd~~~~~~~~~~~~----~~i~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 385 (395) T protein:vir:43 333 --D--------E------FLTGAFSLGAQIFDRMD----IEVLVS-------TENDKDFENNMVTIRAEERLAFAVYRPE 385 (395) T ss_pred --C--------c------EEEEeccceEEEEEecc----eEEEEe-------ccccchhhcCcEEEEEEEeeccEEeccc Confidence 1 0 34555332222121122 111111 122445678888888 57899999999 Q ss_pred ceEEEEEecc Q lcl|NC_019514. 389 RLALVKTVAP 398 (399) Q Consensus 389 ~m~~ie~~a~ 398 (399) -.+++++.+. 
T Consensus 386 a~~~~~~taa 395 (395) T protein:vir:43 386 AFVTGSLTAS 395 (395) T ss_pred ceEEEEeccC Confidence 9999998888 No 51 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.86 E-value=4.2e-09 Score=66.51 Aligned_cols=317 Identities=10% Similarity=0.067 Sum_probs=177.2 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |- -+--+=.|..-.+.. +. .+...-|+.+.+..=+-..+|..+-..+.+ ..|++..|-|-... T Consensus 1 ms-~~~~~tr~~~~~s~~--d~--al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~---------- 63 (335) T protein:vir:63 1 MS-FLNDLTRPNYAGKNA--DV--DIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNV---------- 63 (335) T ss_pred CC-Ccccchhhhcccccc--hh--heehhhhhhhHHHHHHhhhhhccccceeee--ccceeEEEeeeeee---------- Confidence 21 110000121111221 21 244455788888886668888889999988 55999999988654 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 
81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -..+..+|..+|.+.+.....+++ |-+.=-.-.+=+.+.|....-.+..+++++ T Consensus 64 -------~~~~~~pG~~l~~~~~~~~k~~it-------------------VD~ll~a~~~I~dlDe~~~~yDvRse~s~e 117 (335) T protein:vir:63 64 -------EAKGRRAGEELERSRVVNDKWNLT-------------------VDTLLYLRHQFDHQDEWTQSFDMRKEVAEL 117 (335) T ss_pred -------eeecccCCcCcCCCCccccceEEE-------------------ecceeechhhhhhHHHHhcCchhHHHHHHH Confidence 233334445555443333333333 221101111223344444553477778888 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCe----EEecC---CCccccccccccc-CCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGT----IVYTG---AATQDSEITGEGA-TPSVVDYDDLMRLSITLDENRTPKQTKVITG 232 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~----v~yag---~ats~~~~t~~~~-~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~ 232 (399) +|..=++.++.-.-+.++.++.. .+.+| +-+....+++... .+-.--.+.++.|...|.++..|.. T Consensus 118 ~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~------ 191 (335) T protein:vir:63 118 DGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDA------ 191 (335) T ss_pred HHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCc------ Confidence 88777776654444444444432 22222 2223333332211 1111112445677778888887740 Q ss_pred ccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCc---cccccccceeEcCeEEEecCccchhcccCCC Q lcl|NC_019514. 233 SRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADA---GTILNGEIGTVDQFRLVVVPEMLHWAGAGAT 309 (399) Q Consensus 233 s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~---~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~ 309 (399) .-.-+++++.|..-+.|.+ ++.|+.. .|+.. 
...-+|+|+++.|||+++++++-.-...+.+ T Consensus 192 --------~~~dr~~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~ 256 (335) T protein:vir:63 192 --------VYSEGLTPMSPRVFSLLLE------HDKLMNV-EYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHP 256 (335) T ss_pred --------ccCceEEEeChHHHHHHhc------ccccccc-ccccccccccccCceeEEeeceEEEeeccCCCCCccccc Confidence 0012899999999999965 4678887 66643 3467889999999999999988432222222 Q ss_pred ccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 310 VGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 310 ~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~ 389 (399) .+. ....++....=...+++-+.|-+++-+..-. ..--.|+--|..++=-|..|++.+||++. T Consensus 257 lg~--a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt---------------~e~~~~~~~~~~~i~~~~a~G~g~lRPe~ 319 (335) T protein:vir:63 257 LGR--HFNVSAEESERQIALFLPSKTLITAQVAPVQ---------------AKLWEDNEKFSWVLDTFQMYNIGARRPDT 319 (335) T ss_pred ccc--cCCccccccceeEEEEEecceEEEEEEeecc---------------cceeeccchhhHHhHHHHHcCCcccccce Confidence 111 1112222222356788888888877444311 00112444477777788999999999999 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) -+.||+ --+ T Consensus 320 a~~i~~-tg~ 328 (335) T protein:vir:63 320 AGAIEL-KGI 328 (335) T ss_pred EEEEEE-cCC Confidence 999997 333 No 52 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.86 E-value=4.6e-10 Score=71.80 Aligned_cols=292 Identities=14% Similarity=0.145 Sum_probs=166.8 Q ss_pred CCc----------------CCeeecCCCCcccccc-cccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEE Q lcl|NC_019514. 
1 MAS----------------KGMLYNDPNTTPSGID-APDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIR 63 (399) Q Consensus 1 ~~~----------------~~~~~n~~~~t~tT~~-~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk 63 (399) |++ +++.+|. +++.++.. +.+-|. -+..+.+...+..-.+.+++...+++. .+++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a-~~~~~~~~~~~liP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ 72 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNP-DNVMMHEKKDGTLLN----DFTTPILQEVMENSKIMQLGKYEPMEG---TEKK 72 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecc-cceeccCCCcceech----hHHHHHHHHHHhhchhhhhcceeeccC---CceE Confidence 443 3344443 22222222 222232 235677777888888999998888773 3455 Q ss_pred EEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhh Q lcl|NC_019514. 64 VYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQE 143 (399) Q Consensus 64 ~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~ 143 (399) +-+... .+...|..||.. +.. ...+...++...++++.++.+|++ T Consensus 73 ~p~~~~------------~~~a~~v~Eg~~---------~~~--------------~~~~~~~v~~~~~k~~~~~~iS~e 117 (324) T protein:vir:10 73 FTFWAD------------KPGAYWVGEGQK---------IET--------------SKATWVNATMRAFKLGVILPVTKE 117 (324) T ss_pred EEEEeC------------CcceeEeccCcc---------ccc--------------cccceeEEEEeeEEEEEeehhhHH Confidence 544421 123456666522 222 223455678889999999999999 Q ss_pred hhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccC Q lcl|NC_019514. 144 SLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRT 223 (399) Q Consensus 144 ~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nra 223 (399) +++- +...+...+.++|.+..+.-++. .+++|.+.--...+-.. ........+...+++++|.++.-.|+.+.. 
T Consensus 118 ll~d-s~~~l~~~i~~~l~~ai~~~~d~----a~l~G~g~~~~~~~i~~-~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~ 191 (324) T protein:vir:10 118 FLNY-TYSQFFEEMKPMIAEAFYKKFDE----AGILNQGNNPFGKSIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDEL 191 (324) T ss_pred HHhc-chHHHHHHHHHHHHHHHHHHHHH----HhhhcCCCCccCccccc-cccccceeccccCCHHHHHHHHHhhhhccC Confidence 7763 33458888888887776654443 34555433211111111 111122344567899999999988877543 Q ss_pred ccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchh Q lcl|NC_019514. 224 PKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHW 303 (399) Q Consensus 224 p~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~ 303 (399) .. ++ .++||.....|+.|+|.-+.+-|.+ +.-+++-|+.++.++.+. T Consensus 192 ~~-----------------~~--~v~n~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G~PV~~~~~~~-- 238 (324) T protein:vir:10 192 EA-----------------NA--FISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDGLPVVNLKSSN-- 238 (324) T ss_pred CC-----------------CE--EEEcHHHHHHHHHhhccCCceeecC------------CCCccccceeEEeecCCC-- Confidence 21 22 4689999999999988666655532 233567777777654320 Q ss_pred cccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCc--cchhhHHHHH- Q lcl|NC_019514. 304 AGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDP--YGEMGFSSIK- 377 (399) Q Consensus 304 ~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DP--lgQrg~~gwK- 377 (399) . ++. .+++|.-+.-.+++.++ ..++.. ..... ...++.+ +-|++.+.|+ T Consensus 239 --------~--------~~~----~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~r~ 293 (324) T protein:vir:10 239 --------L--------KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTV--KNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred --------C--------Ccc----eEEEEecccEEEEEecC---cEEEEeeccccccc--ccccccchhhhhcCcEEEEE Confidence 0 111 24566555544544432 223322 11111 1122222 3567788887 Q ss_pred -HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
378 -WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 -~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++++.+++++-+++|.-+... T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 294 TMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEEEccEEecccceEEEEeccCC Confidence 6788899999999999877666 No 53 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.85 E-value=3.5e-10 Score=72.43 Aligned_cols=302 Identities=15% Similarity=0.098 Sum_probs=167.2 Q ss_pred CCcCCeeecCCC----C-ccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN----T-TPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDR 74 (399) Q Consensus 1 ~~~~~~~~n~~~----~-t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~ 74 (399) |++.- .+| ++ . +.++..+ .|-++ +..+.++.+.....+.+++...+|+. .++++-++.. T Consensus 1 ~~~~~-~~~-~~~~~~~~t~~~~~~~~ip~~-----~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~----- 65 (320) T protein:vir:10 1 MAAGT-AFQ-VDHAQIAQTGDTMFKGYLEPE-----QAKDYFAEAEKTSIVQQFAQKVPMGT---TGQKIPHWIG----- 65 (320) T ss_pred CCCCc-cCC-HHHHHhhccccccccccccHH-----HHHHHHHHHHhccchhhhcceeeccC---CceEEEEEeC----- Confidence 66543 232 22 1 2222222 23333 45778888888888999999988874 3344444321 Q ss_pred ccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHH Q lcl|NC_019514. 75 NVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELF 154 (399) Q Consensus 75 ~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~ 154 (399) .+...|..|+- .+. ....+...++...+++|.+..+|+++++ ++...+. 
T Consensus 66 -------~~~a~~v~E~~---------~~~--------------~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~ 114 (320) T protein:vir:10 66 -------DVSAQWIGEGD---------MKP--------------ITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYL 114 (320) T ss_pred -------CcceEEecCCc---------ccc--------------ccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHH Confidence 12224566552 122 2233455688899999999999999776 4455688 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcc---cccccc-cccCCceecHH-HHHHHHHHHHhccCccccce Q lcl|NC_019514. 155 SHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQ---DSEITG-EGATPSVVDYD-DLMRLSITLDENRTPKQTKV 229 (399) Q Consensus 155 ~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats---~~~~t~-~~~~~~~vt~~-~lr~a~~~L~~nrap~~t~~ 229 (399) ..+.+++.+..+..++.. +++|.+.-.-.+.... ...... ..+.......+ ++.++.-.|+.+.... T Consensus 115 ~~i~~~l~~a~a~~~d~a----~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 186 (320) T protein:vir:10 115 GTMRTKVATAFAMAFDSA----ALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKW---- 186 (320) T ss_pred HHHHHHHHHHHHHHHHHH----hhcccCCCCCcccccccccccceecccccccccccHHHHHHHHHhhhhcccCCC---- Confidence 888888887766655433 5666553222111111 000011 11112222222 3444444444332221 Q ss_pred eccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCC Q lcl|NC_019514. 230 ITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGAT 309 (399) Q Consensus 230 i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~ 309 (399) -+.+|||.....|+.|+|-.+.+-|.+...-+....+. -+.+-++.++.++.+.. T Consensus 187 ---------------~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~---~~~i~g~pv~~~~~~~~------- 241 (320) T protein:vir:10 187 ---------------THTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFR---AGRIVSRPTILSDHVAD------- 241 (320) T ss_pred ---------------cEEEEcHHHHHHHHHhhccCCceeeccccccCcccccc---CceeeeeeeEecCCCCC------- Confidence 24578999999999999887777777665555554332 34677888888876411 Q ss_pred ccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCccchhhHHHHH--HHHHHhh Q lcl|NC_019514. 
310 VGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPYGEMGFSSIK--WYYGTLI 384 (399) Q Consensus 310 ~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~i 384 (399) + + ..+++|.=+...+++.++ ..+++. .+.-|.......--+-|++.+.|+ +++++.+ T Consensus 242 ---~--------~----~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v 303 (320) T protein:vir:10 242 ---G--------T----TVGYMGDFRNVIWGQVGG---LSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHN 303 (320) T ss_pred ---C--------c----eEEEEeecceEEEEEecC---eEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEE Confidence 0 1 124566655555555443 122221 121111001111123577778888 7899999 Q ss_pred ccccceEEEE-EeccC Q lcl|NC_019514. 385 LRPERLALVK-TVAPL 399 (399) Q Consensus 385 Ln~~~m~~ie-~~a~~ 399 (399) ++++-.++|. .+||= T Consensus 304 ~~~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 304 NDKDAFVKLTNVVTPD 319 (320) T ss_pred ecccceEEEEeccCCC Confidence 9999999998 44444 No 54 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.83 E-value=6.6e-10 Score=70.93 Aligned_cols=297 Identities=13% Similarity=0.055 Sum_probs=172.1 Q ss_pred eeecCCC-----CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 6 MLYNDPN-----TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 6 ~~~n~~~-----~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |=||.-. 
++.++..+-|-|++ ..+.+++++..-.+.+++...+|+.+ ++++.+..- T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~-----~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~----------- 61 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQ-----AKDYFAEAEKTSIVQRVAQKIPMGAT---GIVIPHWTG----------- 61 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhH-----HHHHHHHHHhccchhhhcceeeccCC---ceEEEEEcC----------- Confidence 5555432 12222333455552 35677888888888999999888743 344444311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+...|..||.. ......+...++.++++|+.++.+|+++++ +++.++...+.+. T Consensus 62 -~~~a~wv~Eg~~-----------------------~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~ 116 (397) T protein:vir:23 62 -DVSAQWIGEGDM-----------------------KPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTK 116 (397) T ss_pred -CcceEEecCCcc-----------------------ccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHH Confidence 123345555521 222334456788999999999999999776 4455688888888 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) +.+..+...+.. +++|.+.-.-.+...... .....+....+++++..+...|..+..+. 
T Consensus 117 l~~aia~~~d~a----~l~G~gt~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--------------- 175 (397) T protein:vir:23 117 VATAIAMAFDNA----ALHGTNAPSAFQGYLDQS--NKTQSISPNAYQGLGVSGLTKLVTDGKKW--------------- 175 (397) T ss_pred HHHHHHHHHHHH----HhhcccCCcccccccccc--cceeeecccchhHHHHHHHHhhhhcccCC--------------- Confidence 887766555443 444443211111000000 11112334456777777776676643321 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccC Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETN 320 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~ 320 (399) =..++|+.....|+.++|--+.+-|.+..+-+... ..-.|++-|+..+.++.|.. | T Consensus 176 ----a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~---~~~~~tl~G~Pv~~s~~~~~----g------------- 231 (397) T protein:vir:23 176 ----THTLLDDTVEPVLNGSVDANGRPLFVESTYESLTT---PFREGRILGRPTILSDHVAE----G------------- 231 (397) T ss_pred ----CEEEEcHHHHHHHHHhhccCCceeecccccccccc---cccCceeeeeeEEEeCCCCC----C------------- Confidence 23589999999999999988888888765554443 23447888999999887621 1 Q ss_pred ccceEEEEEEEcccceeeeccccCCCCcc-ceEEEecCCCCCCCCCCcc--chhhHHHHH--HHHHHhhccccceEEEEE Q lcl|NC_019514. 321 GKYDIYPMLCVGAESFTTIGFQTDGKTLK-FKVTTKMPGEATADRNDPY--GEMGFSSIK--WYYGTLILRPERLALVKT 395 (399) Q Consensus 321 ~~~DVyp~lV~G~~Afg~v~l~g~g~~~~-~~~ivk~pG~~~ad~~DPl--gQrg~~gwK--~~~~~~iLn~~~m~~ie~ 395 (399) + ..+++|.=+...++..++= ..+ .+.....=|. ....+|+ -|+..+.|+ +++.+.+++++-+++++. T Consensus 232 -~----~~~~~gDfs~~~i~~~~~i-~i~~~~e~~~~~~~--~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~ 303 (397) T protein:vir:23 232 -D----VVGYAGDFSQIIWGQVGGL-SFDVTDQATLNLGS--QESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTF 303 (397) T ss_pred -c----eEEEEeecceEEEEEEece-EEEEeeeeeeeecc--ccccceeeeeeccceeEEEEeeeccceecccceEEEee Confidence 1 1345565444444443321 100 1111222121 1223443 477778887 678899999999998886 Q ss_pred eccC Q lcl|NC_019514. 
396 VAPL 399 (399) Q Consensus 396 ~a~~ 399 (399) ...- T Consensus 304 ~~~~ 307 (397) T protein:vir:23 304 DPVL 307 (397) T ss_pred cccc Confidence 4332 No 55 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.83 E-value=1.1e-09 Score=69.69 Aligned_cols=262 Identities=15% Similarity=0.104 Sum_probs=139.0 Q ss_pred CCcCCeeecCCCCcc--------cccccccccceehhhhhHHHHHHHHHHHHhhhhcc---------cccccccCCCEEE Q lcl|NC_019514. 1 MASKGMLYNDPNTTP--------SGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD---------VVSMPKNYGKEIR 63 (399) Q Consensus 1 ~~~~~~~~n~~~~t~--------tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~---------~~~mPkN~GktIk 63 (399) |. |-+..+| -|..+...|.++ .|.+++-..+.....+..|-+ ..++-|+.|.+|. T Consensus 1 mt------~~~~~~~~~~~~~~~ft~~~~~~~~vk--~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vt 72 (318) T protein:vir:27 1 MT------TVTSAQANKLFQVALFTAANRNRSMVN--ILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVT 72 (318) T ss_pred CC------ccCCCChHHHHHHHHHHHHhcCChHHH--HHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEE Confidence 21 2222111 122223344444 488877666665544555544 2357799999999 Q ss_pred EEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhh Q lcl|NC_019514. 64 VYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQE 143 (399) Q Consensus 64 ~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~ 143 (399) |.--.+| .|-+=.|-.-.||+-+.-+-. . ..-++|.....- +..+++.|== . 
T Consensus 73 f~L~~~L--------~g~gv~Gd~~lEGnee~L~~~-----~-------d~l~IDq~r~~V-~~gg~msqqR-------t 124 (318) T protein:vir:27 73 FSIMHKL--------SKRPTMGDERVEGRGEDLSHA-----D-------FSLKINQGRHLV-DAGGRMSQQR-------T 124 (318) T ss_pred EeEeecc--------ccCccccCceeeccccceEEE-----e-------eEEEEeeecccc-ccccchhhhh-------h Confidence 9988887 222223333444443322211 1 111122211110 0111111100 0 Q ss_pred hhhhhcchHHHHHHHHHHHHhhhHHHHHHH-------------------------------HHHHHhc-CCeEEecCCCc Q lcl|NC_019514. 144 SLDFDSDSELFSHISTELMNGAVQLTEAVL-------------------------------QKDLLAG-AGTIVYTGAAT 191 (399) Q Consensus 144 ~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l-------------------------------~~~~lag-~~~v~yag~at 191 (399) .-| |.+.-...|..=-++.. |++ ..++.+- .+-++|+|.+| T Consensus 125 ~~d------lR~~ar~~L~~w~~~~~-Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at 197 (318) T protein:vir:27 125 KFN------LASSARTLLGTYFNDLQ-DQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDAT 197 (318) T ss_pred hHH------HHHHHHHHHHHHHHHHH-HHHHHHHHhhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCcc Confidence 011 11111111211111111 111 2223332 23488889999 Q ss_pred ccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCcccee Q lcl|NC_019514. 192 QDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVP 271 (399) Q Consensus 192 s~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~ 271 (399) ++..+++ .+.++++.|.++...++...-|.+.-.+.|..+-+. .+-||+|+||....|||. ....+.|.. T Consensus 198 ~~~~l~s----tD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~---~~~yV~~~~p~q~~~Lrt---dt~~~~w~d 267 (318) T protein:vir:27 198 SFEQIEA----ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQ 267 (318) T ss_pred chhhhhh----cccccHHHHHHHHHHHHHhCCCCcceeeccccccCC---cceEEEEechHHHHHHhh---cCCCHHHHH Confidence 9888875 489999999999999998777765555555444443 345999999999988863 122235777 Q ss_pred hhhc------CCccccccccceeEcCeEEEecCcc-chhcccCCCccCCccccccCccce Q lcl|NC_019514. 
272 VHQY------ADAGTILNGEIGTVDQFRLVVVPEM-LHWAGAGATVGTNPGYRETNGKYD 324 (399) Q Consensus 272 v~~Y------a~~~~i~~gEIG~i~~vRfV~~~~~-~~~~~aGa~~~~~~~~~~t~~~~D 324 (399) ..++ |+.-|||.||+|.++||=+.+-+.+ -.| -+|..+... .+. T Consensus 268 ~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf-~~G~~v~~~--------~~~ 318 (318) T protein:vir:27 268 MMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF-YQGQRFWYQ--------RIT 318 (318) T ss_pred HHHHHHhcccccCCCceecceeeecCEEEeecCCccEEE-cCCCeeeee--------ecC Confidence 7665 4567899999999999988888864 333 244433210 111 No 56 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.82 E-value=5.7e-10 Score=71.27 Aligned_cols=304 Identities=16% Similarity=0.133 Sum_probs=166.9 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |..-|| ++.+++-|-|| .|+.+.|.--...|+...++-...- .+|.||++.+-...... T Consensus 1 ~~~~n~---ts~~qafi~~E----iWsa~il~~l~~~Lv~~~~~~~~d~--g~GDtV~InsIg~~tV~------------ 59 (322) T protein:vir:31 1 MSTGNN---TSNTQALIVSE----IWADEIEDILHEKLLDVNIARVVDF--PDGDKLTIPSVGTPVVR------------ 59 (322) T ss_pred CCCCCC---cccceEEeehh----hhHHHHHHHhhhhhhhhhhhccccc--CCCCeEEeccccccccc------------ Confidence 333332 56666776555 6888888877788888888775444 46999999888654111 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGA 165 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a 165 (399) -++++ +.|+.+.+.-+ ..++..+=+||=.|. ++|...+-.. .|.....++.+.+. 
T Consensus 60 -dY~~~---------~~i~~d~ltt~------------~~~l~IDq~KYfaf~-VdDD~~Qa~~--dl~~~~~~~aa~al 114 (322) T protein:vir:31 60 -SRPEQ---------GDFTFDNLDTG------------EISIILRDEVYAGNA-ISKKLRQDSR--WISNVGAMLPAEQA 114 (322) T ss_pred -cccCC---------CCcccccCCCc------------eEEEEEehhhhhccc-cchhHHHhhh--hHHHHHHHHHHHHH Confidence 01111 11222222111 112223334464443 4443222222 35666667777777 Q ss_pred hHHHHHHHHHHHHh-cCC--------eEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 166 VQLTEAVLQKDLLA-GAG--------TIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 166 ~~~~e~~l~~~~la-g~~--------~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) +... |....++|+ |+. .+.. |. ++.- -..+++....++.|+++...|.+++.|+ T Consensus 115 a~~~-D~fva~lL~~gA~~~~~~~~p~vin-~~-~~~i---v~~gt~~~~ay~~lv~l~~kLdkanVP~----------- 177 (322) T protein:vir:31 115 RAIM-ERYQTDLLALGNAQFAGQNDPNVIN-GV-PHRF---VGTGTDQTMDVTDFSRVNYVMTQSKMPM----------- 177 (322) T ss_pred HHHH-HHHHHHHHHHHhhhhhccCCcceec-CC-ccce---eccCCCchhhHHHHHHHHHHhccccCCC----------- Confidence 7666 444555433 331 1111 10 0000 0113356678999999999999999996 Q ss_pred CccccCceeEEEeCCCchHHHHHhhcc---CCCccceehhhcCCccccccccceeEcCeEEEecCccch--h-cccCCCc Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDP---FGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH--W-AGAGATV 310 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~---~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~--~-~~aGa~~ 310 (399) ..++++|.|.....|+.+..+ ..|+-|..+.+.|..+.+. =||++.||++++|.++-. . .-+|.+. 
T Consensus 178 ------~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~ 249 (322) T protein:vir:31 178 ------GGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRSVYGIDLFVSNLLADANETINAGGDA 249 (322) T ss_pred ------CCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHHHhceeeeeeccccccccccccCccc Confidence 348999999998877554332 4678999888888755432 399999999999998721 0 0111111 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccce Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERL 390 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m 390 (399) . .+.+++|-.|+.+ .+ +++.++-+..+. +-+.=+. ..+..+-.+-||. +.|++.++++|=+ T Consensus 250 ~-----~t~ag~~n~f~~~---~~-~~~~~~~~~~~~-----l~~~e~~-r~~~~~~d~~~~~----~~~g~g~~r~e~l 310 (322) T protein:vir:31 250 R-----STTAGKCNMFMNV---SD-MGLLPFVVAWKE-----MPTTKSF-IDDYNDDLNTATT----ARWGNGLVRDENL 310 (322) T ss_pred c-----cccceeecccccc---cc-hhhhhhhhHhhh-----hhhhhcc-cCccccccceeee----eeecceeecccce Confidence 1 1223344333221 11 222233222111 1111111 1223344555555 4689999999999 Q ss_pred EEEEEec-cC Q lcl|NC_019514. 391 ALVKTVA-PL 399 (399) Q Consensus 391 ~~ie~~a-~~ 399 (399) +.+++-+ ++ T Consensus 311 ~~~~a~~~~~ 320 (322) T protein:vir:31 311 VCVLANADKV 320 (322) T ss_pred EEEEeccccc Confidence 8887644 44 No 57 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.82 E-value=1.8e-09 Score=68.47 Aligned_cols=290 Identities=9% Similarity=0.036 Sum_probs=163.4 Q ss_pred CCcCCeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) +..+ .... ..++++..+.+-|+ .+..+.+....+...+.+++...+|+-+.|+... .+...- T Consensus 113 ~~~~---~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~--------- 175 (415) T protein:vir:81 113 LETR---NDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEV--------- 175 (415) T ss_pred Hhhh---hhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCC--------- Confidence 1110 0000 11222233334454 4466676667888889999999999988875332 333221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +...|+.||... ++ ....+...++.++++++.++.+|+++++ +++..+...+.. T Consensus 176 ---~~~~~v~E~~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 229 (415) T protein:vir:81 176 ---AALEKVEELEEN---PE-------------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELKL 229 (415) T ss_pred ---ccceeecccccc---Cc-------------------ccccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHHH Confidence 233466665221 11 1112355688889999999999999776 455557777778 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .|.+..+...+..+....-+|.+...-.+. +..+...++...+++++|.++.-.|....... 
T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 291 (415) T protein:vir:81 230 WMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-------------- 291 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccc----cccccccccccccchhHHHHHHHhhhhhccCC-------------- Confidence 888776655543333222222221111111 11122334556789999999887776532211 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) + ..+|||.+...|+.|+|-.+.|-|.|- +..|-.+++-|+.++.++.+. . |.+ T Consensus 292 ---~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~--~--~~~---------- 344 (415) T protein:vir:81 292 ---N--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEV--L--GQK---------- 344 (415) T ss_pred ---C--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEecccc--c--CCC---------- Confidence 1 247899999999999887777777652 234556788999998888752 1 111 Q ss_pred CccceEEEEEEEc--ccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_019514. 320 NGKYDIYPMLCVG--AESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G--~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a 397 (399) +. . .++|| +++|... ...+ +..-+- . +-..++++.+. +++.+.+++++=++.++... T Consensus 345 -~~---~-~~~~Gd~~~~~~~~--~~~~----~~v~~~------~---~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 345 -GN---N-TLIIGNLKDAIVLF--DRSQ----YQASWT------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred -Cc---c-EEEEEehhccEEEE--eecc----eEEEEe------c---cccCceEEEEE-EEeccEEeccccEEEEEEec Confidence 01 1 26777 3444322 2222 111111 1 11223333222 46778888888888888887 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) +. 
T Consensus 404 ~~ 405 (415) T protein:vir:81 404 SE 405 (415) T ss_pred cC Confidence 77 No 58 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.82 E-value=1.8e-09 Score=68.47 Aligned_cols=290 Identities=9% Similarity=0.036 Sum_probs=163.4 Q ss_pred CCcCCeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) +..+ .... ..++++..+.+-|+ .+..+.+....+...+.+++...+|+-+.|+... .+...- T Consensus 113 ~~~~---~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~--------- 175 (415) T protein:vir:79 113 LETR---NDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEV--------- 175 (415) T ss_pred Hhhh---hhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCC--------- Confidence 1110 0000 11222233334454 4466676667888889999999999988875332 333221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +...|+.||... ++ ....+...++.++++++.++.+|+++++ +++..+...+.. 
T Consensus 176 ---~~~~~v~E~~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 229 (415) T protein:vir:79 176 ---AALEKVEELEEN---PE-------------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELKL 229 (415) T ss_pred ---ccceeecccccc---Cc-------------------ccccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHHH Confidence 233466665221 11 1112355688889999999999999776 455557777778 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .|.+..+...+..+....-+|.+...-.+. +..+...++...+++++|.++.-.|....... T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 291 (415) T protein:vir:79 230 WMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-------------- 291 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccc----cccccccccccccchhHHHHHHHhhhhhccCC-------------- Confidence 888776655543333222222221111111 11122334556789999999887776532211 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) + ..+|||.+...|+.|+|-.+.|-|.|- +..|-.+++-|+.++.++.+. . |.+ T Consensus 292 ---~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~--~--~~~---------- 344 (415) T protein:vir:79 292 ---N--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEV--L--GQK---------- 344 (415) T ss_pred ---C--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEecccc--c--CCC---------- Confidence 1 247899999999999887777777652 234556788999998888752 1 111 Q ss_pred CccceEEEEEEEc--ccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_019514. 
320 NGKYDIYPMLCVG--AESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G--~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a 397 (399) +. . .++|| +++|... ...+ +..-+- . +-..++++.+. +++.+.+++++=++.++... T Consensus 345 -~~---~-~~~~Gd~~~~~~~~--~~~~----~~v~~~------~---~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 345 -GN---N-TLIIGNLKDAIVLF--DRSQ----YQASWT------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred -Cc---c-EEEEEehhccEEEE--eecc----eEEEEe------c---cccCceEEEEE-EEeccEEeccccEEEEEEec Confidence 01 1 26777 3444322 2222 111111 1 11223333222 46778888888888888887 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) +. T Consensus 404 ~~ 405 (415) T protein:vir:79 404 SE 405 (415) T ss_pred cC Confidence 77 No 59 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.82 E-value=1.8e-09 Score=68.47 Aligned_cols=290 Identities=9% Similarity=0.036 Sum_probs=163.4 Q ss_pred CCcCCeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) +..+ .... ..++++..+.+-|+ .+..+.+....+...+.+++...+|+-+.|+... 
.+...- T Consensus 113 ~~~~---~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~--------- 175 (415) T protein:vir:98 113 LETR---NDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEV--------- 175 (415) T ss_pred Hhhh---hhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCC--------- Confidence 1110 0000 11222233334454 4466676667888889999999999988875332 333221 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) +...|+.||... ++ ....+...++.++++++.++.+|+++++ +++..+...+.. T Consensus 176 ---~~~~~v~E~~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 229 (415) T protein:vir:98 176 ---AALEKVEELEEN---PE-------------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELKL 229 (415) T ss_pred ---ccceeecccccc---Cc-------------------ccccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHHH Confidence 233466665221 11 1112355688889999999999999776 455557777778 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .|.+..+...+..+....-+|.+...-.+. +..+...++...+++++|.++.-.|....... 
T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 291 (415) T protein:vir:98 230 WMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-------------- 291 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccc----cccccccccccccchhHHHHHHHhhhhhccCC-------------- Confidence 888776655543333222222221111111 11122334556789999999887776532211 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) + ..+|||.+...|+.|+|-.+.|-|.|- +..|-.+++-|+.++.++.+. . |.+ T Consensus 292 ---~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~--~--~~~---------- 344 (415) T protein:vir:98 292 ---N--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEV--L--GQK---------- 344 (415) T ss_pred ---C--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEecccc--c--CCC---------- Confidence 1 247899999999999887777777652 234556788999998888752 1 111 Q ss_pred CccceEEEEEEEc--ccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_019514. 320 NGKYDIYPMLCVG--AESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G--~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a 397 (399) +. . .++|| +++|... ...+ +..-+- . +-..++++.+. +++.+.+++++=++.++... T Consensus 345 -~~---~-~~~~Gd~~~~~~~~--~~~~----~~v~~~------~---~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 345 -GN---N-TLIIGNLKDAIVLF--DRSQ----YQASWT------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred -Cc---c-EEEEEehhccEEEE--eecc----eEEEEe------c---cccCceEEEEE-EEeccEEeccccEEEEEEec Confidence 01 1 26777 3444322 2222 111111 1 11223333222 46778888888888888887 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) +. 
T Consensus 404 ~~ 405 (415) T protein:vir:98 404 SE 405 (415) T ss_pred cC Confidence 77 No 60 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.81 E-value=1.2e-09 Score=69.43 Aligned_cols=299 Identities=11% Similarity=0.064 Sum_probs=163.2 Q ss_pred CCcCCee-----ec-C---CCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MASKGML-----YN-D---PNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~~-----~n-~---~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) |.++... .. . ..+++.+..+..-| + -+....+..+.+...+.++++..||+-+ ..+..+.. T Consensus 143 ~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip---~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~---- 212 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAVNQSSSVEVSSESYE---T-IFSQRIIRDLQKELVVGALFEELPMSSK--ILTMLVEP---- 212 (458) T ss_pred HHhhccchhhhhhhhhhhhhhcccCccccceeh---h-hHhHHHHHHHHhhhhHHhhcceeecCCc--ceEEEEec---- Confidence 2211111 00 0 01111111222222 2 2466777788888899999998887643 22222221 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) + .+...|..++..- ++... ......+...++.+.++++.|+.+|+++++ +++. 
T Consensus 213 --------~-~~~a~~v~e~~~~---~~~~~--------------~~~~~~~~~~i~~~~~k~~~~v~is~ell~-ds~~ 265 (458) T protein:vir:10 213 --------D-AGKATWVAASTYG---TDTTT--------------GEEVKGALKEIHFSTYKLAAKSFITDETEE-DAIF 265 (458) T ss_pred --------C-Ccceeeccccccc---ccccc--------------cccccccceeeEeeeeeEEeeehhhHHHHh-cchH Confidence 1 1223445554211 10000 011123345577889999999999999654 4555 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeE------EecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcc Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTI------VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPK 225 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v------~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~ 225 (399) .+.+.+...|.+..+... -..+++|.++- -+++..+............+.+++++|.++...|+.+... T Consensus 266 ~~~~~i~~~l~~~i~~~~----d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~- 340 (458) T protein:vir:10 266 SLLPLLRKRLIEAHAVSI----EEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK- 340 (458) T ss_pred HHHHHHHHHHHHHHHHHH----HHHhhcCCCCCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC- Confidence 577777777777655444 34556665431 1222111111112233455678999999998877653321 Q ss_pred ccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 
226 QTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 226 ~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) .++ -+|||.+...|+.|+|-.+.|-|.+- .......|..+++-|++++.++.|- T Consensus 341 ----------------~~~--~v~~~~~~~~l~~lkd~~G~~i~~~~----~~~~~~~~~~~~l~G~pv~~~~~~p---- 394 (458) T protein:vir:10 341 ----------------LSK--LVLIVSMDAYYDLLEDEEWQDVAQVG----NDSVKLQGQVGRIYGLPVVVSEYFP---- 394 (458) T ss_pred ----------------CCE--EEEcHHHHHHHHhhcccCCceeeccc----cccccccCcCceecceeeEEccccc---- Confidence 122 37899999999999876555555432 2223456677889999999988761 Q ss_pred cCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 306 AGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 306 aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) +++ +..+|+ +..|+. .|-... ..| +++ . .|||.+.+.+++. ..++.. T Consensus 395 ~~~------------~~~~~~-~~~f~~-~~~~~~--~~~----~~v--~---------~d~~~~~~~~~~~~~~r~~~~ 443 (458) T protein:vir:10 395 AKA------------NSAEFA-VIVYKD-NFVMPR--QRA----VTV--E---------RERQAGKQRDAYYVTQRVNLQ 443 (458) T ss_pred ccc------------CCcceE-EEEecc-cEEEEE--eec----eEE--E---------eecccCCCceEEEEEEEecce Confidence 111 112232 334432 232221 122 221 1 2788877777765 345677 Q ss_pred hccccceEEEEEecc Q lcl|NC_019514. 
384 ILRPERLALVKTVAP 398 (399) Q Consensus 384 iLn~~~m~~ie~~a~ 398 (399) ++++.-++.+..+|- T Consensus 444 v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 444 RYFANGVVSGTYAAS 458 (458) T ss_pred EecccceEEEeeccC Confidence 888888899888888 No 61 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.81 E-value=4.1e-09 Score=66.56 Aligned_cols=319 Identities=16% Similarity=0.102 Sum_probs=158.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhh--hhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFF--WWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y--~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+--+. +-.++.+-+..+ -+...-.|. |.-+.+..=+..-+|..+-..+++ ..|++++|-|...... T Consensus 1 m~~~~~--~~~~t~~g~~~~-~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i--~~G~sv~i~~iG~~tv------ 69 (347) T protein:vir:94 1 MANVPG--QKIGTDQGKGKS-SSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTI--QNGKSAQFPVMGRTSG------ 69 (347) T ss_pred CCCCCc--cccccccccCCc-cccHHHHHHHHHhHHHHHHHHHHHhhhcccccccc--cccceEEEecccceee------ Confidence 654432 222222222211 111112221 333333221122334444445544 4699999988865311 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .-.+.| .+.+. 
|+...+-.+.+.+|-|+=-|-.+-|.+.+....-.+..+++ T Consensus 70 -------~~~t~G----~~l~~-----------------~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~ 121 (347) T protein:vir:94 70 -------VYLAPG----ERLSD-----------------KRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYS 121 (347) T ss_pred -------eeecCC----CCcCC-----------------CCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHH Confidence 111211 11100 11122233444566666444444444444444434666777 Q ss_pred HHHHHhhhHHHHHHHHHHH--HhcC---CeEEecCCCcccccccc--cccCCc-----eecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDL--LAGA---GTIVYTGAATQDSEITG--EGATPS-----VVDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~--lag~---~~v~yag~ats~~~~t~--~~~~~~-----~vt~~~lr~a~~~L~~nrap~~ 226 (399) ++.|..-++.++..+-+.+ +++. ....-+|. ...+.++. .+..++ .--++.|+.+.+.|++++.|. T Consensus 122 ~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~-~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~- 199 (347) T protein:vir:94 122 NQLGEALAIAADGAVLAEMAILCNLPAASNENIAGL-GTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPA- 199 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccCCC-cccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC- Confidence 7777766666643332222 2211 00111110 00000000 000000 111577889999999999986 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .-+++++.|+....|.+ ++.|... .|+....+.+|.||++.||++++++++-.. +. 
T Consensus 200 ----------------~~R~~vv~P~~~~~Ll~------~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~-~~ 255 (347) T protein:vir:94 200 ----------------GDRYFYTTPDNYSAILA------ALMPNAA-NYAALIDPETGNIRNVMGFVVVEVPHLVQG-GA 255 (347) T ss_pred ----------------CCcEEEeCHHHHHHHhc------cchhhhh-hccccccccccceEEEeceEEEecCccccc-cc Confidence 23899999999988842 4556654 688878889999999999999999997432 22 Q ss_pred CCCccCCc--cc--------cccCccc----eEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhh Q lcl|NC_019514. 307 GATVGTNP--GY--------RETNGKY----DIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMG 372 (399) Q Consensus 307 Ga~~~~~~--~~--------~~t~~~~----DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg 372 (399) +..+...+ .. +++.+++ +--..|+|=+.|-+++-...-. . | .--|+--|.- T Consensus 256 t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~---------~---e---~~r~~~~~~d 320 (347) T protein:vir:94 256 GETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLA---------L---E---RDRDVDAQGD 320 (347) T ss_pred ccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccccc---------c---c---chhchhhHHH Confidence 21111100 00 0111111 1123456656665555322210 0 0 1225555666 Q ss_pred HHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 373 FSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 373 ~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .+==|..|++.+||++..+.||..+-= T Consensus 321 ~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 321 LIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred HhhhhhhhcCcccccceeEEEEecCCC Confidence 666689999999999999999876434 No 62 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.80 E-value=7.1e-10 Score=70.75 Aligned_cols=305 Identities=13% Similarity=0.080 Sum_probs=165.3 Q ss_pred CCcCC------eeecCCC--CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 
1 MASKG------MLYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~~------~~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) |.=-+ +.-|.-. ++.++..+.. +... +..+.++..++.-.+.+++...+|+.+ +.++-+... T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~---ip~~-~~~~ii~~~~~~s~i~~~~~~~~~~~~---~~~~p~~~~--- 70 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGY---LEPE-QAQDYFAEAEKISIVQQFAQKIPMGTT---GQKIPHWTG--- 70 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcce---echh-hHHHHHHHHHhcchhhhhcceeeccCC---ceEEEEEeC--- Confidence 21111 0011111 1222222332 3333 356777778888889999999988854 344333211 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) .+...+..|| +.......+...++...+++|.++.+|+++++ +++.. T Consensus 71 ---------~~~a~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~-~s~~~ 117 (326) T protein:vir:42 71 ---------DVSASWIGEG-----------------------DMKPITKGNMTSQTIAPHKIATIFVASAETVR-ANPAN 117 (326) T ss_pred ---------CcceEEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHH Confidence 1223455554 22222334556688899999999999999766 45556 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcc----cccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQ----DSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats----~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) +...+.+.+.+..+.-.+. .+++|.+.-.-.|.... .........+...++.+++..+.......+... 
T Consensus 118 ~~~~i~~~l~~a~~~~~d~----a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 190 (326) T protein:vir:42 118 YLGTMRTKVATAFAMAFDN----AAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGK--- 190 (326) T ss_pred HHHHHHHHHHHHHHHHHHH----HhhcccCCCccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhcc--- Confidence 8888888888776655443 44555543211111100 011111112223344444433221111111111 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) ..-+-++||.+...|+.|+|..+.|-|.+..+-+...++ ..|.+-++.++.++.+. +| T Consensus 191 --------------~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~---~~~~l~G~pv~~~~~~~----~~- 248 (326) T protein:vir:42 191 --------------KWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPF---RLGRIVARPTILSDHVA----SG- 248 (326) T ss_pred --------------CccEEEEeHHHHHHHHHhhccCCceeeccccccCccccc---cCceeeeeeEEEcCCCC----CC- Confidence 012346899999999999998888888876555554432 35788999999887642 11 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceE---EEecCCCCCCCCCCcc--chhhHHHHH--HHHH Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKV---TTKMPGEATADRNDPY--GEMGFSSIK--WYYG 381 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~---ivk~pG~~~ad~~DPl--gQrg~~gwK--~~~~ 381 (399) + .++++|.=+...++..++ ..++. ..+.-| .....+|+ -|+...+|| +++. T Consensus 249 -------------~----~~~~~Gd~s~~~~~~~~~---~~v~~~~e~~~~~~--~~~~~~~~~~~~~d~~~~r~~~~~d 306 (326) T protein:vir:42 249 -------------T----VVGYQGDFRQLVWGQVGG---LSFDVTDQATLNLG--TPQAPNFVSLWQHNLVAVRVEAEYA 306 (326) T ss_pred -------------c----eEEEEeecceEEEEEecc---eEEEEeecceeeec--ccccccchhhhhcCcEEEEEEEEec Confidence 1 133455544334433321 11111 112212 12333443 466778887 7889 Q ss_pred HhhccccceEEEEEeccC Q lcl|NC_019514. 
382 TLILRPERLALVKTVAPL 399 (399) Q Consensus 382 ~~iLn~~~m~~ie~~a~~ 399 (399) +.+.+++-+++|+.++.- T Consensus 307 ~~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 307 FHCNDKDAFVKLTNVDAT 324 (326) T ss_pred cEEecccceEEEeecccc Confidence 999999999999887777 No 63 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.80 E-value=1.3e-09 Score=69.35 Aligned_cols=284 Identities=11% Similarity=0.062 Sum_probs=164.8 Q ss_pred cccc-cccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCCceeccCccc Q lcl|NC_019514. 16 SGID-APDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNGNLY 94 (399) Q Consensus 16 tT~~-~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g~~~ 94 (399) +++. +.+-|+ .+..+.++..++.-++.+++...+++.+. +++-+... .+...|..||- T Consensus 1 ma~~gG~lip~----~~~~~ii~~~~~~s~i~~~~~~~~~~~~~---~~~p~~~~------------~~~a~~v~Eg~-- 59 (298) T protein:vir:94 1 MVLNKGTLFDP----ELVTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTM------------DSEIDVVAESG-- 59 (298) T ss_pred CeeccccccCh----hHHHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEec------------CcceEEeeCCc-- Confidence 2222 233333 13566777788899999999998887743 33333311 12335666652 Q ss_pred cccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch--HHHHHHHHHHHHhhhHHHHHH Q lcl|NC_019514. 95 GSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS--ELFSHISTELMNGAVQLTEAV 172 (399) Q Consensus 95 G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~--~l~~~~~~~lg~~a~~~~e~~ 172 (399) .+. ....+...++...+|++.+..+|++++....|+ ++.+.+...|.+.-+.-.|. 
T Consensus 60 -------~~~--------------~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~- 117 (298) T protein:vir:94 60 -------KKT--------------HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDL- 117 (298) T ss_pred -------ccc--------------ccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHH- Confidence 112 223345678889999999999999987544443 56677777777665544433 Q ss_pred HHHHHHhcCC--------eEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCce Q lcl|NC_019514. 173 LQKDLLAGAG--------TIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAG 244 (399) Q Consensus 173 l~~~~lag~~--------~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~ 244 (399) .+++|.+ ..-..+........ ........-.++++.++...|..+..+. T Consensus 118 ---~~l~G~~~~~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------------- 174 (298) T protein:vir:94 118 ---MAFHGVNPRLGTASAVIGTNHFDSKVTQK-VEAPRGIADPNGAIENAVELLTGVDADV------------------- 174 (298) T ss_pred ---HhhcccccCCCcccccccccccccccccc-cccccccccHHHHHHHHHHhhhhcCCCc------------------- Confidence 3344421 01111100000000 0111223334778888888887755432 Q ss_pred eEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccce Q lcl|NC_019514. 245 RVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYD 324 (399) Q Consensus 245 yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~D 324 (399) -+.++||.....|+.|+|--+.|-|.+. ...+.-|++-|+.++.++.+.. +.. .++. T Consensus 175 ~~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~tl~G~PV~~~~~v~~----~~~----------~~~~- 231 (298) T protein:vir:94 175 TGIAINPSFRSALAKQKDLQGNALFPEL--------KWGATPDTINGLPVDVNKTVSD----MSL----------TQRD- 231 (298) T ss_pred cEEEEcHHHHHHHHHhhccCCCeeecCc--------ccCCCCceecceeeEEeccccc----ccC----------CCcc- Confidence 2578999999999999887777777654 2344557889999998886521 100 0111 Q ss_pred EEEEEEEcccceee-eccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_019514. 
325 IYPMLCVGAESFTT-IGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTVA 397 (399) Q Consensus 325 Vyp~lV~G~~Afg~-v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~a 397 (399) .+++|.-+-+. ++..++ +..-+..-+. ..+....|-|++.++|+ +++++.+++++-+++|+-+= T Consensus 232 ---~~~~Gdfs~~~~~~~~~~-----~~~~~~~~~~-~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 232 ---RAIIGDFANGFKWGYAKE-----VPLEVIQYGD-PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ---EEEEeeccceEEEEEecC-----ceEEEeecCC-CcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 35777654332 333332 2333333321 11222346788888887 57899999999999997665 No 64 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.79 E-value=1.8e-09 Score=68.59 Aligned_cols=325 Identities=10% Similarity=0.038 Sum_probs=176.4 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) +-..+..-|+|....+... --+...-|+.+.+..=+..-++..+-..+.+= .|++++|-|-... T Consensus 8 ~~~~~n~~t~~~~~~~~~~----~al~le~f~geV~~~f~~~si~~~~~~~rti~--~Gksv~f~~iG~~---------- 71 (375) T protein:vir:10 8 ALGRSNLSTGTGYGGATDK----YALYLKLFSGEMFKGFQHETIARDLVTKRTLK--NGKSLQFIYTGRM---------- 71 (375) T ss_pred ccCccccCCccccccccch----HHHHHHHHhHHHHHHHHHHHhhhccccccccc--cCceEEEEeeeee---------- Confidence 2222333344332222111 11333346777776655556666666666443 5999999988553 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 
81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -...-..|.+.+... .+++..| +.+.+|-|+=-|-.+=|.+.+....-.+..+++++ T Consensus 72 -------t~~~~t~G~~i~~~~--~~d~~~t--------------e~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~ 128 (375) T protein:vir:10 72 -------TSSFHTPGTPILGNA--DKAPPVA--------------EKTIVMDDLLISSAFVYDLDETLAHYELRGEISKK 128 (375) T ss_pred -------EEeeecCCcCcCCcc--ccCCCCC--------------ceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHH Confidence 111111122211000 0011122 12233444333333334445555554577778888 Q ss_pred HHHhhhHHHHHHHHHHHHhcC--------CeEEecCCCcccccccccccCC----ceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGA--------GTIVYTGAATQDSEITGEGATP----SVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~--------~~v~yag~ats~~~~t~~~~~~----~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) .|..-++.++..+-+.+..++ ......|+ ++-... +-.+.+ -..-++.|+.+...|.++..|. T Consensus 129 ~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~Gg-~~i~~~-sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~--- 203 (375) T protein:vir:10 129 IGYALAEKYDRLIFRSITRGARSASPVSATNFVEPGG-TQIRVG-SGTNESDAFTASALVNAFYDAAAAMDEKGVSS--- 203 (375) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccccccccccCc-ceeeec-cccccccccCHHHHHHHHHHHHHHHhhcCCCC--- Confidence 887777776444444443322 11222222 111111 111112 2234678889999999999985 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccC- Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAG- 307 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aG- 307 (399) .-+++++.|+.-.-|..-+ +.+.|+.. 
.|+.....-+|.+|++.||+++++.++=.-.+.+ T Consensus 204 --------------~~R~~vv~P~~y~~Ll~~~---d~~~~~n~-d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~ 265 (375) T protein:vir:10 204 --------------QGRCAVLNPRQYYALIQDI---GSNGLVNR-DVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGV 265 (375) T ss_pred --------------CCCEEEeChHHHHHHHhcC---Cccceeee-cccccceeccceEEEEeceEEEEeccccccccccc Confidence 1378999999887774311 13457766 5776666778899999999999998875444321 Q ss_pred -----CCccCCc--------------cccccCccceEEE-------EEEEcccceeeeccccCCCCccceEEEecCCCCC Q lcl|NC_019514. 308 -----ATVGTNP--------------GYRETNGKYDIYP-------MLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEAT 361 (399) Q Consensus 308 -----a~~~~~~--------------~~~~t~~~~DVyp-------~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ 361 (399) ..+++.. ...+.+++|++=. =|+|=++|-|++.+.+-- .=+ . T Consensus 266 ~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~------~~~--~---- 333 (375) T protein:vir:10 266 KYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQ------VQV--T---- 333 (375) T ss_pred cccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccc------ccc--c---- Confidence 1111100 0011223343221 477778888887555521 111 0 Q ss_pred CCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
362 ADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 362 ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) -..-++.-|..+.=-|+.+++.+||++.-+.|.+.++- T Consensus 334 ~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 334 NGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred cchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 01237777888888899999999999999999998766 No 65 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.78 E-value=2.2e-10 Score=73.56 Aligned_cols=289 Identities=16% Similarity=0.152 Sum_probs=161.8 Q ss_pred CCcC-CeeecCCC----Ccccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccc Q lcl|NC_019514. 1 MASK-GMLYNDPN----TTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDR 74 (399) Q Consensus 1 ~~~~-~~~~n~~~----~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~ 74 (399) +++. ..-++... .+.+...++ .-|+ -+..+.++......++.+++...++..+. ++..+. T Consensus 91 lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~------- 156 (401) T protein:vir:44 91 LRKGREDGLRDLERKALQVGTDEDGGYAVPE----ELDRSILSLLKDEVVMRQEATVITVGGSD---YKKLVN------- 156 (401) T ss_pred HhhhhhhhhHHHHHHHhhcCCCCCCceeccH----hHHHHHHHHHHhhhhhhhhceeeecCCCc---eEEEEe------- Confidence 1100 00011110 111112222 2232 23556666677777888899888775432 222111 Q ss_pred ccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHH Q lcl|NC_019514. 75 NVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELF 154 (399) Q Consensus 75 ~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~ 154 (399) ..+ +...|..||... + .....+...++.+.++++.|+.+|+++++ +++.++. 
T Consensus 157 ---~~~--~~a~wv~E~~~~---~-------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~ 208 (401) T protein:vir:44 157 ---LGG--TASGWVGETDTR---S-------------------QTATSRLGLIEPFMGEIYGNPQATQKMLD-DAFFNVE 208 (401) T ss_pred ---cCC--ccceeecccccc---C-------------------ccccccceeeeeehhheeeehhhhHHHHh-cchHHHH Confidence 011 112355555211 1 11113455678889999999999999776 4555577 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC------CCccc-----ccc-cccccCCceecHHHHHHHHHHHHhcc Q lcl|NC_019514. 155 SHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG------AATQD-----SEI-TGEGATPSVVDYDDLMRLSITLDENR 222 (399) Q Consensus 155 ~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag------~ats~-----~~~-t~~~~~~~~vt~~~lr~a~~~L~~nr 222 (399) +.+.++|.+.-+... -..+++|.++-.=.| ..+.. ..+ .........+++++|.++.-.|+... T Consensus 209 ~~i~~~la~ai~~~~----~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~ 284 (401) T protein:vir:44 209 AWINSELATEFAEQE----EIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAH 284 (401) T ss_pred HHHHHHHHHHHHHHH----HhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhh Confidence 777777776655433 344555543311000 00000 000 00112346688999999988886532 Q ss_pred CccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch Q lcl|NC_019514. 223 TPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH 302 (399) Q Consensus 223 ap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~ 302 (399) ... =+.++|+.+-.-|+.|+|--+.|-|.|--+. |.-+++-|..+|.++.+.. T Consensus 285 ~~~-------------------a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~--------g~~~~l~G~PVv~~~~~p~ 337 (401) T protein:vir:44 285 RTG-------------------AKFMMNNNSLFAIRLLKDTEGNYLWRPGLEL--------GQPSSLAGYGIAENEQMPD 337 (401) T ss_pred hcC-------------------CEEEEcHHHHHHHHHhhccCCceeecCCcCC--------CCCceecceeeEEecCcCC Confidence 211 1357999999999999987777777654333 3446788899988877521 Q ss_pred hcccCCCccCCccccccCccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH-- Q lcl|NC_019514. 
303 WAGAGATVGTNPGYRETNGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW-- 378 (399) Q Consensus 303 ~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~-- 378 (399) .++| .. .++||.= +|.... ..| +++ ..|||.+++.++|++ T Consensus 338 -~~~~-------------~~-----~i~~Gd~~~~~~i~~--~~~----~~~-----------~~~~~~~~~~v~~~a~~ 381 (401) T protein:vir:44 338 -IAAD-------------AK-----AIAFGNFKRGYTIVD--RIG----TRI-----------LRDPYTNKPFVGFYTTK 381 (401) T ss_pred -ccCC-------------cc-----EEEEeehhccEEEEE--ecc----eEE-----------eeeccccCCcEEEEEEE Confidence 1111 11 2455653 343222 222 222 137889999999996 Q ss_pred HHHHhhccccceEEEEEecc Q lcl|NC_019514. 379 YYGTLILRPERLALVKTVAP 398 (399) Q Consensus 379 ~~~~~iLn~~~m~~ie~~a~ 398 (399) .+.+.+++++-++.|+.++- T Consensus 382 r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 382 RTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EeccEEecccceEEEEeecC Confidence 59999999999999999888 No 66 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.76 E-value=3.6e-09 Score=66.88 Aligned_cols=299 Identities=12% Similarity=-0.022 Sum_probs=162.2 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+ .+++++....+.++ +..+.++..++..++.+++...+|+.+ .+++-+... 
T Consensus 1 Ma---------~~~~~~gg~~vP~~-----~~~~ii~~l~~~s~i~~l~~~i~~~~~---~~~ip~~~~----------- 52 (315) T protein:vir:80 1 MA---------DDFLSAGKLELPGS-----MIGAVRDRAIDSGVLAKLSPEQPTIFG---PVKGAVFSG----------- 52 (315) T ss_pred CC---------CCcCCcCceEcchH-----HHHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeC----------- Confidence 33 22233333444333 246777778899999999999988754 234333311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc---hHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD---SELFSHI 157 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D---~~l~~~~ 157 (399) .+...|+.||- .+.. ...+...++...++++.++.+|++++.-..+ .+|...+ T Consensus 53 -~~~a~wv~Eg~---------~~~~--------------s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i 108 (315) T protein:vir:80 53 -VPRAKIVGEGE---------VKPS--------------ASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLI 108 (315) T ss_pred -CcceEEeeCCc---------cccc--------------cccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHH Confidence 12335777762 2222 2344556888899999999999997733222 2244444 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEE---ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIV---YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSR 234 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~---yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~ 234 (399) .+.|.+.-+ ..+-..+++|.+..- -.|..+. ...+.........+++++.++.-.|..+.... T Consensus 109 ~~~la~ai~----~~~d~a~~~G~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~--------- 174 (315) T protein:vir:80 109 SPALGASIG----RAVDLIAFHGIDPATGKAASAVHTS-LNKTKNIVDATDSATADLVKAVGLIAGAGLQV--------- 174 (315) T ss_pred HHHHHHHHH----HHHhhheeeccCCCCCccccccccc-cccccceeeccccchHHHHHHHHHHhhccCcc--------- Confidence 444443322 222334555532110 0000000 00111111112234677777776654432221 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 
235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) . -.-++||.+...||.|+|.-..+.|-.- --+ .+..|.-|++-|..++.++.|..-.+. T Consensus 175 -------~--~~~imn~~~~~~L~~l~~~~g~~~~g~~---~~~-~~~~g~~~tl~G~PV~~~~~~~~~~~~-------- 233 (315) T protein:vir:80 175 -------P--NGVALDPAFSFALSTEVYPKGSPLAGQP---MYP-AAGFAGLDNWRGLNVGASSTVSGAPEM-------- 233 (315) T ss_pred -------c--eEEEEcHHHHHHHHHHhhccCCcccccc---ccc-ccccCCCceecceeeEecCcCCccccc-------- Confidence 1 2356899999999999765444444311 111 234556689999999988887321111 Q ss_pred cccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_019514. 315 GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLAL 392 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) ...+++ .+++|.-+...+++.++ ..+++. +..........|-|++.+.|| +.+++.+.+++-+++ T Consensus 234 ---~~~~~~----~~~~GDfs~~~~g~~~~---~~i~i~---~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~ 300 (315) T protein:vir:80 234 ---SPASGV----KAIVGDFSRVHWGFQRN---FPIELI---EYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAV 300 (315) T ss_pred ---cccccc----EEEEeecccEEEEEecC---eeEEEe---ccccccCcccchhhcCcEEEEEEEEecceeecccceEE Confidence 111222 35678766655655542 123322 111112223457788888888 678999999999999 Q ss_pred EEEec-cC Q lcl|NC_019514. 
393 VKTVA-PL 399 (399) Q Consensus 393 ie~~a-~~ 399 (399) |+.++ +- T Consensus 301 l~~~~a~~ 308 (315) T protein:vir:80 301 VKEKAAPK 308 (315) T ss_pred EeeccCCC Confidence 99655 44 No 67 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.76 E-value=3.9e-09 Score=66.70 Aligned_cols=273 Identities=17% Similarity=0.071 Sum_probs=149.1 Q ss_pred ccccccccCCCEEEEEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEe Q lcl|NC_019514. 50 DVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVG 129 (399) Q Consensus 50 ~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~ 129 (399) .++.+ ..||+.+|-|-... -.....+|.+.|. .++++.-+ +... T Consensus 1 ~vr~i--~~g~s~~~~~iG~~-----------------~~~~~~~G~~l~~---~~~~~~~~--------------e~~i 44 (324) T protein:vir:99 1 MTRTI--TSGKSAQFPVMGRT-----------------KARYLKQGQSLDD---GREDIKHT--------------EKVI 44 (324) T ss_pred Ceeee--ecCceEEEeeeeee-----------------EeccccCCCCcCC---CcCCcCcc--------------cEEE Confidence 45554 44888888877442 1222222222221 01111111 2223 Q ss_pred eeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcC--------CeEEecCCCccccccccccc Q lcl|NC_019514. 130 RIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGA--------GTIVYTGAATQDSEITGEGA 201 (399) Q Consensus 130 ~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~--------~~v~yag~ats~~~~t~~~~ 201 (399) +|-|+=-|-.+=|.+.+....-.+..+.+++.|..-++.++..+-+.+..++ ....-.|+..+.. 
.+.++ T Consensus 45 tID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~--~~~~~ 122 (324) T protein:vir:99 45 TIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVK--ITGKK 122 (324) T ss_pred EecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceec--ccccc Confidence 3444433333333444444443477777777777777666444333322111 1111112111111 11111 Q ss_pred CCceec----HHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCC Q lcl|NC_019514. 202 TPSVVD----YDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYAD 277 (399) Q Consensus 202 ~~~~vt----~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~ 277 (399) .+...+ ++.|+.+...|.++..|. .-+++++.|+.-..|.+ ..+.-...|+. T Consensus 123 ~~~~~~~~~~~dai~~a~~~Lde~~VP~-----------------~gR~~vv~P~~y~~Ll~-------~~~~~~~~~~~ 178 (324) T protein:vir:99 123 EDPAKYGTQVIQALTYARAAFAKKYIPA-----------------GDRTFYTDPDTYSAILA-------ALMPNAANYAA 178 (324) T ss_pred cccccCHHHHHHHHHHHHHHHhhcCCCC-----------------CCCEEEeChHHHHHHhh-------ccccccccccc Confidence 112222 678889999999999985 23889999999888854 33444568888 Q ss_pred ccccccccceeEcCeEEEecCccchhcccCCCccCCc--------cccccCccceE----EEEEEEcccceeeeccccCC Q lcl|NC_019514. 278 AGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP--------GYRETNGKYDI----YPMLCVGAESFTTIGFQTDG 345 (399) Q Consensus 278 ~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~--------~~~~t~~~~DV----yp~lV~G~~Afg~v~l~g~g 345 (399) .+.+-+|.||++.||++++++++-.-.+.......+. ..+.+..+|.+ =.-|+|=.+|-+++-+..- T Consensus 179 ~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~- 257 (324) T protein:vir:99 179 LIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDM- 257 (324) T ss_pred ccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeeecc- Confidence 8889999999999999999999865322211111110 00111112211 1236777777777643331 Q ss_pred CCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
346 KTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 346 ~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .. ..--|+--|..++==|..|++.+||++..+.+|.-+.. T Consensus 258 --------~~------e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 258 --------AL------ERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred --------ee------cceechhhHHHhhhhhhhhcCcccccceEEEEEEccCc Confidence 11 11226666777777789999999999999888854443 No 68 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.75 E-value=1.3e-08 Score=63.76 Aligned_cols=315 Identities=10% Similarity=0.075 Sum_probs=175.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |- -|--+=+|.--.+.... .+...-|+-+.|+.=+-..+|..+-..+.+ ..|+++.|-|-... T Consensus 1 ms-~~~~~t~~~~~~s~~d~----al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~---------- 63 (335) T protein:vir:78 1 MS-FLNDLTRPNYAGKNADV----DIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNV---------- 63 (335) T ss_pred CC-ccccccccccccccchh----hhhhhhhhhHHHHHHHHhhhhccccceeee--ccceeEEEeeeeee---------- Confidence 21 11000022222222221 245555888888887778888899999988 55999999887553 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 
81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -..+..+|..+|.+.+.....+++ |-+.=-.-.+=|.+.+....=.+..+++++ T Consensus 64 -------~~~~~~pG~~l~~~~~~~~k~~it-------------------ID~ll~a~~~VddlDe~~~~yDvR~e~s~~ 117 (335) T protein:vir:78 64 -------EAKGRRAGEELERSRVVNDKWNLT-------------------VDTLLYLRHQFDHQDEWTQSFDMRKEVAEL 117 (335) T ss_pred -------eecccccCcccCCCCcccCCeEEE-------------------ecceeechhhHhhHHHhhcCchhHHHHHHH Confidence 234555666666655544444443 222111112233344444553477778888 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCe---EEe-----cCCCcccccccc-cccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGT---IVY-----TGAATQDSEITG-EGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~---v~y-----ag~ats~~~~t~-~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) +|+.=++.++....+-++.++.. +-+ .|. +....+++ ....+-..=.+.++.+...|.++..|.. T Consensus 118 ~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~-~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~----- 191 (335) T protein:vir:78 118 DGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV-LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDA----- 191 (335) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc-ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCC----- Confidence 88777777755444444444421 111 121 11111221 1111111223455566666888777741 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCC---ccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYAD---AGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~---~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) ...-+++++.|..-+.|.+ ++.|+.. .|+. ....-+|+|+++.|||+++++++-.-...+. 
T Consensus 192 ---------~~~~rv~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~ 255 (335) T protein:vir:78 192 ---------VYSEGLTPMSPRVFSLLLE------HDKLMSV-EYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAH 255 (335) T ss_pred ---------CCCccEEEeChHHHHHHhc------ccccccc-cccccccccccccceeEEeeceEEEeeccCCCCCCccc Confidence 1123899999999999965 4678887 5653 2346788999999999999998843222211 Q ss_pred CccCCccccccCccceE--EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDI--YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILR 386 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DV--yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn 386 (399) +-+.. .+..++|. -..+++=+.|-+++-+..-. + .--.|+--|..++=-|..|++.+|| T Consensus 256 ~lg~a----~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~-----------~----e~~~~~~~~~~~i~~~~a~G~g~lR 316 (335) T protein:vir:78 256 PLGRH----FNVSAEEAERQIALFLPSKTLITAQVAPVQ-----------A----KLWEDHDQFSWVLDTFQMYNIGARR 316 (335) T ss_pred ccccc----CCcccccccceEEEEEecceEEEEEEEecc-----------c----ceeeccchhhHhhhHHHHcCCcccC Confidence 11110 11222232 24566667776666433311 0 0122444477777778999999999 Q ss_pred ccceEEEEEeccC Q lcl|NC_019514. 387 PERLALVKTVAPL 399 (399) Q Consensus 387 ~~~m~~ie~~a~~ 399 (399) ++.-+.||+--.= T Consensus 317 Pe~a~~i~~tg~~ 329 (335) T protein:vir:78 317 PDTAGAIELKGIE 329 (335) T ss_pred cceEEEEEecCCC Confidence 9999999853222 No 69 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.75 E-value=8e-10 Score=70.46 Aligned_cols=294 Identities=12% Similarity=0.090 Sum_probs=164.4 Q ss_pred CCcC--------CeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 
1 MASK--------GMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~--------~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) |+.. ...++...+..+++..+=+.-+-+. +..+.+....+...+.+++.+.++. .+..+.+.+..- T Consensus 96 l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~-~~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~--- 169 (409) T protein:vir:45 96 MRHGASELTSEERKALRELRAQGVAQDEKGGYTVPET-FLAKVVEKMKSYGGIASVAQILTTS--DGRTMEWATADG--- 169 (409) T ss_pred HHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHh-HHHHHHHHHHhhhhhhhhceeeecC--CCceEEEEeecc--- Confidence 1100 0011111121122211101111122 3455666666777788888887764 344555555422 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcch Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDS 151 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~ 151 (399) ....+.+..||-. +.. ..++...++.+.+++ +.|+.+|+++++- ++. T Consensus 170 --------~~~~~~~v~E~~~---------~~~--------------~~~~f~~~~l~~~k~~~~~i~is~ell~d-s~~ 217 (409) T protein:vir:45 170 --------TSEVGVLLGENEE---------AGE--------------EDTDFGMGSLGALKMTSKIIRVSNELLQD-SAI 217 (409) T ss_pred --------Ccccccccccccc---------ccc--------------cccccceeeeeeeeeeeeehhhhHHHHhc-cHH Confidence 1223345555421 111 112233455555564 7899999997754 444 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeE-------EecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTI-------VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v-------~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) .+.+.+...+.+..+.. +-..+++|.++- +.++.. ......+...+++++|.++...|+..... 
T Consensus 218 ~l~~~i~~~la~a~~~~----~~~a~l~G~G~~~~~~p~Gil~~~~-----~~~~~~~~~~~~~d~i~~l~~~l~~~~~~ 288 (409) T protein:vir:45 218 DMEAYLARRIAERIGRG----EARYLIQGTGAGTPKQPKGLAASVT-----GTTQTAAANAVKWQEILALKHSIDPAYRR 288 (409) T ss_pred HHHHHHHHHHHHHHHHH----HHHHhhccCCCCCccccceeeeccc-----cccccccccccchHHHHHHHHhhhhhhcc Confidence 57777777776654433 334466766532 111110 01122344568999999998888764332 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) .+.++.++|+.+...|+.|+|.-+.+-|.+- +..|.-+++-|.+++.++.|.. T Consensus 289 -----------------~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~p~-- 341 (409) T protein:vir:45 289 -----------------GPKFRLAFNDNTLKLISEMEDGQGRPLWLPD--------IVGVAPASVLNVPYVIDQEIDD-- 341 (409) T ss_pred -----------------CCeEEEEECHHHHHHHHHhhcCCCceeeccC--------cCCCCCceecceeeEEecCcCC-- Confidence 1347889999999999999887777766542 2334456788999998888621 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH--HHHH Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW--YYGT 382 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~--~~~~ 382 (399) . ++ + .+ .++||.=+...+...++- .+. ...|||.+.+.+++++ .+.+ T Consensus 342 -~----~~-------~----~~-~i~~Gd~~~~~i~~~~~~-------~~~-------~~~d~~~~~~~~~~~~~~r~d~ 390 (409) T protein:vir:45 342 -I----GA-------G----KK-FMFCGDFDRFIIRRVRYM-------ILK-------RLVERYAEYDQTGFLAFHRFDC 390 (409) T ss_pred -c----cC-------C----cc-EEEEeehhhhheeeccce-------EEE-------EeecccccCCcEEEEEEEEecc Confidence 1 11 0 12 255676333334333211 111 1238888888888874 7899 Q ss_pred hhccccceEEEEEeccC Q lcl|NC_019514. 
383 LILRPERLALVKTVAPL 399 (399) Q Consensus 383 ~iLn~~~m~~ie~~a~~ 399 (399) .+.+++-++.++..+.. T Consensus 391 ~~~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 391 ILEDTSAIKALVGKGSV 407 (409) T ss_pred EeechhheEEEEeccCC Confidence 99999999999998888 No 70 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.75 E-value=1.2e-08 Score=64.08 Aligned_cols=315 Identities=14% Similarity=0.070 Sum_probs=157.6 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHH--HHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEAR--KDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~--p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) |++|+- .|.---|+-++...| .++...+.+ -++.-++|-..--++-+.++--++-.+... ..+.-. T Consensus 1 ~~~~~~----~~~~~~Ms~~i~~~f-v~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~-------~~~~~~ 68 (322) T protein:vir:10 1 MKLNAI----MSMLPLIAGDIDQAF-VQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASM-------DPDAVK 68 (322) T ss_pred Ccccce----eeeeeeeechhhhHH-HHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccc-------cccccc Confidence 777763 221111222333322 233333322 333445555555566666665444444321 000000 Q ss_pred CCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhh-hhhhcchHHHHHHHHHHH Q lcl|NC_019514. 84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQES-LDFDSDSELFSHISTELM 162 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~-~d~~~D~~l~~~~~~~lg 162 (399) .+. ..+ -++|. +.. 
| .+|.+ ....+.+.+++|..+..+.|.- ...+.|+ .....++.+ T Consensus 69 ~~~-~~~-----~~~d~---~~d----t----p~~~~--~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~--~~~~~~~~a 127 (322) T protein:vir:10 69 RKR-SRQ-----QSADG---TYP----T----PVNNK--PFAKRRTNVDTYDTGHVVEQEDISQMLLDP--NSALITSQA 127 (322) T ss_pred ccc-ccc-----cccCc---ccC----C----Ccccc--ccceEEEeecccccceecchHHHHHhhcCc--hHHHHHHHH Confidence 000 000 00000 000 0 00111 1345668899997766554431 1244444 222223333 Q ss_pred HhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccc---cccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 163 NGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITG---EGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 163 ~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~---~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) -.-++-..+.+..-++.+++ + |..+......+ -...+.-++++.|+.|.+.|+++.+|... T Consensus 128 ~AL~R~~D~~I~~a~~g~a~-~---~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~------------ 191 (322) T protein:vir:10 128 YAMARKTDDLIIAGAWKPAS-I---KGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEV------------ 191 (322) T ss_pred HHhhhHHHHHHHhhhhcccc-c---cccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCC------------ Confidence 33233343444333333222 1 11111111111 11235678999999999999999998521 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccc-cccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTIL-NGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~-~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) -+++++.|....+|- .++.|+.+ .|.+.+.+. 
+|.||++-||.|+.+..+-.-...+-.-+.+++ T Consensus 192 ----~R~~vv~p~~~~~LL------~d~~~ts~-D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~--- 257 (322) T protein:vir:10 192 ----SKVIVIGPTQARKLL------QITEATSA-DYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDG--- 257 (322) T ss_pred ----CeEEEeCHHHHHHHh------cchhhhhh-hcccchhhhhcCeeeeeeeEEEEEeccCCccccccccccccCC--- Confidence 267889999988883 36899985 787777786 599999999999999876432221111111111 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) ...++.+.++.=+.|.+..-- +..+.++ ...|+ + ..-..+.-++.|+|.+++|+..+.|+|-=- T Consensus 258 --~~~~~~~~~a~~k~Av~~a~~----~dv~~~i-~~~~~-----~----~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 258 --PQGDEIWCIAMTDMALGYHSC----KDIWTKV-AEDPS-----A----SFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred --CCccceeEEEEecCceeEEEe----eeeeEEe-eccCC-----c----chhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 133566655555555444311 1111111 22232 2 112335567889999999999999999777 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) | T Consensus 322 ~ 322 (322) T protein:vir:10 322 L 322 (322) T ss_pred C Confidence 7 No 71 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.75 E-value=1.2e-09 Score=69.55 Aligned_cols=293 Identities=13% Similarity=0.144 Sum_probs=165.0 Q ss_pred CCcCC----------------eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEE Q lcl|NC_019514. 1 MASKG----------------MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRV 64 (399) Q Consensus 1 ~~~~~----------------~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~ 64 (399) |+++. 
..+|.-..+.++..+.+-|+ . +..+.++.++..-.+.+++...+++. .++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~---~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~ 73 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN---E-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccch---h-HHHHHHHHHHhhchhhhhcceeeccC---CceEE Confidence 55431 12222112222222233333 2 35677888888888999988888773 34444 Q ss_pred EEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhh Q lcl|NC_019514. 65 YHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQES 144 (399) Q Consensus 65 rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~ 144 (399) .+... .+...|..|| +.......+...++.+.++++.+..+|+++ T Consensus 74 p~~~~------------~~~a~~v~Eg-----------------------~~~~~~~~~~~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:78 74 TFWAD------------KPGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred EEEec------------CcceeEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHH Confidence 43311 1233455554 222233344566888999999999999997 Q ss_pred hhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 145 LDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 145 ~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) ++ +++..+...+...|++..+..++.. +++|.+.--.+++-.. ........+....++++|.++.-.|..+... 
T Consensus 119 l~-ds~~~l~~~i~~~la~ai~~~~d~a----~l~G~g~~~~~~gi~~-~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~ 192 (324) T protein:vir:78 119 LN-YTYSQFFEEMKPMIAEAFYKKFDEA----GILNQGNNPFGKSIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELE 192 (324) T ss_pred Hh-cchHHHHHHHHHHHHHHHHHHHHHH----HhccCCCCCcCccccc-cccccceeccccccHHHHHHHHHhhhhccCC Confidence 76 4444588888888887766555443 3445432221111111 0111122344567899999999888765432 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) . + +.++||.....|+.++|.-+.+.|. .+.-+++-|+.++.++... T Consensus 193 ~-----------------~--~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PV~~~~~~~--- 238 (324) T protein:vir:78 193 A-----------------N--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN--- 238 (324) T ss_pred C-----------------C--EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccceeeEeeCCCC--- Confidence 1 1 3578999999999998765554442 2334567777777654310 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc--chhhHHHHH-- Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY--GEMGFSSIK-- 377 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl--gQrg~~gwK-- 377 (399) . ++. .+++|.-+...++..++ ..+++. ....+ ...+..|+ -|+....|+ T Consensus 239 -------~--------~~~----~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~--~~~~~~~~~~f~~d~~~~r~~ 294 (324) T protein:vir:78 239 -------L--------KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTV--KNEDGTPVNLFEQDMVALRAT 294 (324) T ss_pred -------C--------Ccc----eEEEEecceEEEEEecC---cEEEEeeccccccc--ccccccchhhhhcCcEEEEEE Confidence 0 111 24566655554544432 122222 11111 11122333 455667766 Q ss_pred HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
378 WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++.+.+++++-+++|..+-.. T Consensus 295 ~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 295 MHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred EEEccEEecccceEEEeccccc Confidence 6789999999999998876555 No 72 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.75 E-value=1.2e-09 Score=69.55 Aligned_cols=293 Identities=13% Similarity=0.144 Sum_probs=165.0 Q ss_pred CCcCC----------------eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEE Q lcl|NC_019514. 1 MASKG----------------MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRV 64 (399) Q Consensus 1 ~~~~~----------------~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~ 64 (399) |+++. ..+|.-..+.++..+.+-|+ . +..+.++.++..-.+.+++...+++. .++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~---~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN---E-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccch---h-HHHHHHHHHHhhchhhhhcceeeccC---CceEE Confidence 55431 12222112222222233333 2 35677888888888999988888773 34444 Q ss_pred EEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhh Q lcl|NC_019514. 65 YHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQES 144 (399) Q Consensus 65 rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~ 144 (399) .+... 
.+...|..|| +.......+...++.+.++++.+..+|+++ T Consensus 74 p~~~~------------~~~a~~v~Eg-----------------------~~~~~~~~~~~~v~~~~~k~~~~~~is~el 118 (324) T protein:vir:96 74 TFWAD------------KPGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEF 118 (324) T ss_pred EEEec------------CcceeEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHH Confidence 43311 1233455554 222233344566888999999999999997 Q ss_pred hhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 145 LDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 145 ~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) ++ +++..+...+...|++..+..++.. +++|.+.--.+++-.. ........+....++++|.++.-.|..+... T Consensus 119 l~-ds~~~l~~~i~~~la~ai~~~~d~a----~l~G~g~~~~~~gi~~-~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~ 192 (324) T protein:vir:96 119 LN-YTYSQFFEEMKPMIAEAFYKKFDEA----GILNQGNNPFGKSIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELE 192 (324) T ss_pred Hh-cchHHHHHHHHHHHHHHHHHHHHHH----HhccCCCCCcCccccc-cccccceeccccccHHHHHHHHHhhhhccCC Confidence 76 4444588888888887766555443 3445432221111111 0111122344567899999999888765432 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) . + +.++||.....|+.++|.-+.+.|. .+.-+++-|+.++.++... 
T Consensus 193 ~-----------------~--~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PV~~~~~~~--- 238 (324) T protein:vir:96 193 A-----------------N--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN--- 238 (324) T ss_pred C-----------------C--EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccceeeEeeCCCC--- Confidence 1 1 3578999999999998765554442 2334567777777654310 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc--chhhHHHHH-- Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY--GEMGFSSIK-- 377 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl--gQrg~~gwK-- 377 (399) . ++. .+++|.-+...++..++ ..+++. ....+ ...+..|+ -|+....|+ T Consensus 239 -------~--------~~~----~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~--~~~~~~~~~~f~~d~~~~r~~ 294 (324) T protein:vir:96 239 -------L--------KRG----ELITGDFDKLIYGIPQL---IEYKIDETAQLSTV--KNEDGTPVNLFEQDMVALRAT 294 (324) T ss_pred -------C--------Ccc----eEEEEecceEEEEEecC---cEEEEeeccccccc--ccccccchhhhhcCcEEEEEE Confidence 0 111 24566655554544432 122222 11111 11122333 455667766 Q ss_pred HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 378 WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++.+.+++++-+++|..+-.. T Consensus 295 ~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 295 MHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred EEEccEEecccceEEEeccccc Confidence 6789999999999998876555 No 73 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.73 E-value=1.1e-09 Score=69.60 Aligned_cols=285 Identities=13% Similarity=0.056 Sum_probs=161.6 Q ss_pred CCc--------CCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 
1 MAS--------KGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~--------~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) ++. .....|...+++++..+.+-|. .+....++...+...+.+++...+++.+ ++++.+... T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~----~~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~--- 164 (390) T protein:vir:97 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTP----NRLPGFITPPDARLTVRDLIGSGRTDSA---LIEYVQETG--- 164 (390) T ss_pred hhhhhhhhhhHHHHHHHhhhcccccccccccch----hhhHHHHHHHhhhhhhHhhcceeeccCC---ceEEEEEec--- Confidence 111 1111222212223333333333 2346677767788888889988888633 344444321 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) .+....+..||. .......+...++.++++++.++.+|+++++ +++ . T Consensus 165 --------~~~~a~~v~Eg~-----------------------~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-ds~-~ 211 (390) T protein:vir:97 165 --------FVNNAAIVAEGA-----------------------LKPESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-Q 211 (390) T ss_pred --------CCcceeeecCCc-----------------------cccccccceeEEEEeeeeEEEeehhhHHHHH-hHH-H Confidence 112334555552 1222234456788999999999999999775 454 4 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE-ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV-YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~-yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) +...+...|.+..+...+ ..+++|.+.-- -.|-.+.....+...+.....+++++..+.-.|+.+..+. 
T Consensus 212 l~~~i~~~la~a~~~~~d----~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~------ 281 (390) T protein:vir:97 212 LASYMNNRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPA------ 281 (390) T ss_pred HHHHHHHHHHHHHHHHHH----HHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCC------ Confidence 777777777776655443 34555542210 0010000001111122334567888888887776654432 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) =+.++||.....|+.|+|--+.|-|-+. ..+.-+++-|+.+++++.+.. T Consensus 282 -------------~~~v~n~~~~~~L~~lkd~~G~~l~~~~---------~~~~~~~l~G~pV~~~~~~~~--------- 330 (390) T protein:vir:97 282 -------------SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWGLPVVATQAMAP--------- 330 (390) T ss_pred -------------CEEEEcHHHHHHHHHhhcCCCceeecCc---------cCCCCceecceeeEEcCCCCC--------- Confidence 1357899999999999876666555432 234456888999999887521 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPER 389 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) + .+++|.-+.+..-+...+ +.+-+ ...+++-+++.+.|+ +++...+++++- T Consensus 331 -~--------------~~~~gd~~~~~~~~~~~~----~~i~~--------~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 383 (390) T protein:vir:97 331 -G--------------EFLVGAFDLAAQIFDQWD----ARVEI--------GYVNDDFQRNMVTVLAEERLALVVYRPEA 383 (390) T ss_pred -C--------------cEEEEeccceEEEEEecc----eEEEE--------eecccccccCcEEEEEEEeeccEEecccc Confidence 1 135665433222122222 11111 122456788888888 689999999999 Q ss_pred eEEEEEe Q lcl|NC_019514. 
390 LALVKTV 396 (399) Q Consensus 390 m~~ie~~ 396 (399) ++.++.+ T Consensus 384 ~v~~~~a 390 (390) T protein:vir:97 384 LITGSFA 390 (390) T ss_pred EEEEEeC Confidence 9999999 No 74 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.72 E-value=1.4e-09 Score=69.08 Aligned_cols=293 Identities=10% Similarity=0.023 Sum_probs=163.4 Q ss_pred CCcCCeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) +..-...-+.. ..++++..+..-|+ .+..+.+..+.+...+.+++...+|+-+.++. .+.++.. T Consensus 110 ~~~~~~~~~~~~~~~~~~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~---------- 174 (415) T protein:vir:94 110 TEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY-PVVRQSE---------- 174 (415) T ss_pred HHHhhhhhhhhhhccccccccccCcH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeE-EEEeecC---------- Confidence 00000000111 11222233333343 34566666688999999999999998766552 2223322 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .+...+..||... .. ....+...++..+++++.|+.+|+++++ +++..+.+.+.. 
T Consensus 175 --~~~~~~v~Eg~~~---------~~-------------~~~~~~~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~ 229 (415) T protein:vir:94 175 --VAALEKVEELEEN---------PE-------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELKL 229 (415) T ss_pred --Cccceeccccccc---------cc-------------cccccceeeEeeheeeeeechhhHHHHh-hchHHHHHHHHH Confidence 1233456665221 10 0112345678889999999999999666 455567777778 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .|.+..+...+..+....-+|.+...-.+. +......++....++++|.++.-.|....... T Consensus 230 ~l~~~~~~~~~~~il~g~g~g~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 291 (415) T protein:vir:94 230 WMARTIAATRNKAIIDVITKGSTGSTSSGF----EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-------------- 291 (415) T ss_pred HHHHHHHHHHHHHHhhccccCccccccccc----cccccccccccccchHHHHHHHHhhhhhccCC-------------- Confidence 888776655544333332222222111111 11122234456678999998887765533221 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) + ..+|||.+...|+.|+|-.+.|-|.+- +..|-.+++-|+.++.++.+.. |.. T Consensus 292 ----~-~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~~----~~~---------- 344 (415) T protein:vir:94 292 ----N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEVL----GQK---------- 344 (415) T ss_pred ----C-EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCceecceeeEEeccccc----CCC---------- Confidence 1 347899999999999987777777542 2345567889999988887521 111 Q ss_pred CccceEEEEEEEc--ccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_019514. 
320 NGKYDIYPMLCVG--AESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G--~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a 397 (399) +. . .+++| +++|.. +...+ +.+-+- . +-..|+++.++ +++.+.+++++-++.++... T Consensus 345 -~~---~-~i~~gd~~~~~~~--~~~~~----~~v~~~------~---~~~~~~~~r~~-~r~d~~~~~~~a~~~~~~~~ 403 (415) T protein:vir:94 345 -GN---N-TLIIGNLKDAIVL--FDRSQ----YQASWT------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred -Cc---c-EEEEEehhccEEE--Eeecc----eEEEEe------c---cccCceEEEEE-EEeccEEeccccEEEEEEec Confidence 11 1 25677 343332 22222 111111 1 22334444333 46788888888888888777 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) +. T Consensus 404 ~~ 405 (415) T protein:vir:94 404 SE 405 (415) T ss_pred cC Confidence 77 No 75 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.71 E-value=4.5e-09 Score=66.33 Aligned_cols=285 Identities=13% Similarity=0.102 Sum_probs=163.1 Q ss_pred CCcCCeee-cCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLY-NDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~-n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) ++.++... +. 
.+..++..++ +-|+ .+..+.+....+..++.+++.+.+|+-+.|+....++- T Consensus 98 l~~~~~~~~~~-~~~~t~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----------- 161 (397) T protein:vir:49 98 VRGRYQNLLDS-KTDGSGSDAGLTIPQ----DIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWA----------- 161 (397) T ss_pred hhcchhhHHHh-hhccCCccCcceecH----HHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeec----------- Confidence 33322211 11 1122222222 2233 24566777788888999999999999888874433322 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) +..+.+.+..||... ++ ....+...++.+.++++.++.+|++++.- ++.++...+. T Consensus 162 -~~~~~a~~v~E~~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~d-s~~~l~~~i~ 217 (397) T protein:vir:49 162 -DITGLAKLDDEGGQI---GQ-------------------NDDPKLSLIRYAIKRYAGISTVTNSLLAD-SAENILAWLS 217 (397) T ss_pred -cCCcceeeecccccc---cc-------------------ccccceeeeEeeeeeeEeehhhHHHHHhh-hhHHHHHHHH Confidence 112334456665221 11 11133456788899999999999987753 3444777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) .++.+..+...+ ..+++|.+. .......+++++|.++.-.|+.+-.+. 
T Consensus 218 ~~l~~~~~~~~d----~ail~G~g~---------------~~~~~~~~~~d~i~~~~~~l~~~~~~~------------- 265 (397) T protein:vir:49 218 GWIAKKVVVTRN----KAILEAIGT---------------LPNKPTLAKWDDIIDLQAKVDPAIKQT------------- 265 (397) T ss_pred HHHHHHHHHHHH----HHHHhcccc---------------ccccccccCHHHHHHHHHhhhhhhcCC------------- Confidence 777777655443 345666532 112345678999999988887543321 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) -+-++||.....|+.|+|--+.|-|.|- +..|--+++-|+.++.++.... .. . T Consensus 266 ------a~~v~n~~~~~~l~~lkd~~g~~l~~~~--------~~~g~~~~l~G~pV~~~~~~~~--~~-----------~ 318 (397) T protein:vir:49 266 ------SLFLTNTSGFTALKKVKNAMGDYLMERD--------VKSPTGYSIDGFVVKEISDRFL--PN-----------G 318 (397) T ss_pred ------CEEEEcHHHHHHHHHhhccCCceeeccc--------ccCCCCceecceeeEEeccccc--cc-----------c Confidence 2457999999999999987777666542 1233446788877665443210 00 1 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEe Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTV 396 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~ 396 (399) +++..+ ++||.-.-+..-+...+ +.+-+- +-.+.+-+++.+.++ +++.+.+++++-++.++.. T Consensus 319 ~~~~~~----~~~gd~~~~~~~~~~~~----~~i~~~-------~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 319 TGGAMP----LYFGDLKQAVTLFDRQH----LSLLST-------NIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred cCCcee----EEEeeccceEEEEeecc----cEEEEe-------ccccchhhcCeeeEEEEEeeccEEecccceEEEEec Confidence 111222 45775332221111112 111111 112344566666666 6788899999999999888 Q ss_pred ccC Q lcl|NC_019514. 397 APL 399 (399) Q Consensus 397 a~~ 399 (399) ++. 
T Consensus 384 ~~~ 386 (397) T protein:vir:49 384 AIA 386 (397) T ss_pred ccc Confidence 777 No 76 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.71 E-value=4e-09 Score=66.63 Aligned_cols=310 Identities=13% Similarity=0.061 Sum_probs=168.6 Q ss_pred CC------cCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEc--ccccc Q lcl|NC_019514. 1 MA------SKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHY--IPLLD 72 (399) Q Consensus 1 ~~------~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry--~pl~~ 72 (399) |+ ....-= ++....+++.+.+-|+ .+..+.++..++.-.+.+++...+|+.+ .+++.++ .|- T Consensus 1 ~~~~~e~~~~~~~~-~~~~~~~~~~~~liP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~~~~-- 70 (338) T protein:vir:78 1 MATLNELAPNTAGS-NHQGRLAHVPSDLLPK----EIVGPIFDKAQESSLVLRLGENIPISYG---ETIIPTTVKRPE-- 70 (338) T ss_pred CcchHHhhhhhccc-ccccceecccccccch----HHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecCcc-- Confidence 32 222221 2222223333444444 3467788888899999999999988754 3333333 221 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) ..+..++... ....|+.......+...++...++++.+..+|+++++. +... 
T Consensus 71 ------------a~~v~~~~~~---------------~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d-s~~~ 122 (338) T protein:vir:78 71 ------------VGQVGVGTSN---------------EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM-NPSG 122 (338) T ss_pred ------------ceeecccccc---------------cccccccccccccceeEEEEEEEEEEEeehhhHHHHhc-CHHH Confidence 1111111100 01113333344455677888999999999999997764 3344 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC-------CCcccccccccccCCceecHHHHHHHHHHHHhccCcc Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG-------AATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPK 225 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag-------~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~ 225 (399) +.+.+.+++.+..+.-.+ ..+++|.+..--.+ +.............+....++++.++...+..|.... T Consensus 123 ~~~~i~~~la~a~~~~~d----~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (338) T protein:vir:78 123 LYTKLQADLAYAIGRGID----LAVFHGKSPLTGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVD 198 (338) T ss_pred HHHHHHHHHHHHHHHHHH----HHhhcccCCCccccccccccccccccccccccccccchhhHHHHHHHHHHhhhhcccc Confidence 777777777776554443 34566554211110 0000000111113334566788888877665544321 Q ss_pred ccceeccccccCccccCceeEEEeCCCchHHHHH---hhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch Q lcl|NC_019514. 226 QTKVITGSRMIDTRTISAGRVLYIGSELIPLIRK---LVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH 302 (399) Q Consensus 226 ~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dird---l~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~ 302 (399) .=+.++||.+...|+. ++|-.+.|-|.+. .+.|.-+++-|+.++.++.+.. T Consensus 199 ------------------~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~--------~~~~~~~~l~G~PV~~~~~ip~ 252 (338) T protein:vir:78 199 ------------------FNGWAADPRYRARLLRSQAYRDANGNVDPTRI--------NLAASAGDLLGLPVQFGKAVGG 252 (338) T ss_pred ------------------ceEEEEchHHHHHHHHHhhhccCCCceeeccc--------ccCCCCceeeeeeEEEccccCc Confidence 1246779988877754 4444444555432 3556678899999999887632 Q ss_pred hcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCcc------chhhHHHH Q lcl|NC_019514. 
303 WAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPY------GEMGFSSI 376 (399) Q Consensus 303 ~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPl------gQrg~~gw 376 (399) -.+ ..+++++ .+++|.-+.-.++..++ ..+++. -..+ --+..||- -|+...+| T Consensus 253 ~~~-----------~~~~~~~----~~~~gdfs~~~~~~~~~---~~i~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~ 311 (338) T protein:vir:78 253 DLG-----------AATDSKV----RVVGGDFSQLKYGFADE---IRVKMS-DTAT--LTDNTSPTPQTVSMWQTNQIAI 311 (338) T ss_pred ccc-----------ccCCccc----EEEEEecceEEEEeecc---cEEEEe-eccc--ccccccccccchhhhhcCcEEE Confidence 111 1122233 34577766655554442 112221 0111 12344554 45566777 Q ss_pred H--HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 377 K--WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 377 K--~~~~~~iLn~~~m~~ie~~a~~ 399 (399) | +++.+.+++++-+++|.-+.-= T Consensus 312 r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 312 LIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred EEEEEeccEeecccceEEEecccCC Confidence 6 6789999999999888765444 No 77 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.71 E-value=4.8e-09 Score=66.20 Aligned_cols=292 Identities=11% Similarity=0.071 Sum_probs=161.7 Q ss_pred CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCCceeccC Q lcl|NC_019514. 12 NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNG 91 (399) Q Consensus 12 ~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g 91 (399) =.|.++. +.+-|. . +..+.++.+++.-.+.+++...+|+.+ ++++-++.. 
.+...|..|| T Consensus 1 m~t~t~g-g~liP~---~-~~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~------------~~~a~wv~E~ 60 (303) T protein:vir:97 1 MGTETSK-ASLFDK---H-LVSDLINKVKGHSSLAKLSSQKPIPFN---GSKEFTFTL------------DSDIDVVAEN 60 (303) T ss_pred CcccCCC-CeEcch---h-HHHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEec------------CcceEEeecC Confidence 2222333 333333 2 356787788899999999999988743 344444321 1234667665 Q ss_pred ccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch--HHHHHHHHHHHHhhhHHH Q lcl|NC_019514. 92 NLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS--ELFSHISTELMNGAVQLT 169 (399) Q Consensus 92 ~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~--~l~~~~~~~lg~~a~~~~ 169 (399) - .+. ...++...++...++++.++.+|++++...+|+ ++.+.+...|.+..+..+ T Consensus 61 ~---------~~~--------------~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~l 117 (303) T protein:vir:97 61 G---------KKT--------------HGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGI 117 (303) T ss_pred c---------ccc--------------ccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHH Confidence 2 222 223445568888999999999999987544443 566666666666544433 Q ss_pred HHHHHHHHHhcCCeEEecCCCccc-----cccc-ccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCc Q lcl|NC_019514. 170 EAVLQKDLLAGAGTIVYTGAATQD-----SEIT-GEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISA 243 (399) Q Consensus 170 e~~l~~~~lag~~~v~yag~ats~-----~~~t-~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~ 243 (399) -..+++|.+..--.++.... ...+ .........++++|.++...|..+.... . T Consensus 118 ----d~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-----------------~ 176 (303) T protein:vir:97 118 ----DLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVV-----------------T 176 (303) T ss_pred ----HhhhhcccccCCccccccccccccccccccccccccccchHHHHHHHHHHHhhcCCCc-----------------c Confidence 34455553111111110000 0000 0111123457888888887776543221 1 Q ss_pred eeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccc Q lcl|NC_019514. 
244 GRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKY 323 (399) Q Consensus 244 ~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~ 323 (399) ..++||.+...|+.|+|--+.+-|.|-.+ ..+..|++-|++++.++.+....+ ++... T Consensus 177 --~~vmn~~~~~~L~~lkd~~g~~~~~~~~~-------~~~~~~~l~G~Pv~~s~~v~~~~~-------------~~~~~ 234 (303) T protein:vir:97 177 --GLAMDTEFSTALAKVTNGEMGPKMYPELA-------WGANPDSINGLKSSVNTTVGAGAD-------------EAESK 234 (303) T ss_pred --EEEEcHHHHHHHHHhhccCCCeEEecCcc-------CCCCCceecceeeEEecccCCccc-------------cCCCc Confidence 26679999999999987666555643211 233557899999999988632110 01111 Q ss_pred eEEEEEEEccccee-eeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 324 DIYPMLCVGAESFT-TIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 324 DVyp~lV~G~~Afg-~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~a~~ 399 (399) ..+++|.-+.. .++... + ..++++ .-+. ......-|-|+..+.++ .++++.+++++-+++|+=+ ++ T Consensus 235 ---~~~~~Gdf~~~~~~~~~~-~--~~~~~~--~~~~-~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~-~~ 303 (303) T protein:vir:97 235 ---DLVIIGDFESMFKWGYAK-Q--IPMEII--KYGD-PDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKG-EV 303 (303) T ss_pred ---cEEEEeeccccEEEEEec-C--cEEEEe--eccC-CCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCC-CC Confidence 23678875332 344433 1 122222 2221 11112235677777775 5788999999999888643 34 No 78 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.70 E-value=1.7e-09 Score=68.70 Aligned_cols=296 Identities=13% Similarity=0.047 Sum_probs=160.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+ |.+|..+..-|+ . +..+.++.+++..++.+++...+++.+. +++-+.. . T Consensus 1 Ma-----------t~tt~~g~~vP~---~-~~~~ii~~~~~~s~l~~~~~~i~~~~~~---~~~p~~~----------~- 51 (311) T protein:vir:99 1 MA-----------TFGTGNLKNLPR---N-IADGMVKDVVQGSTVAVLSARKPQRFGN---EDIITFN----------G- 51 (311) T ss_pred Cc-----------eecCCCceeccH---H-HHHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEe----------C- Confidence 32 223333333343 2 3567888899999999999998888533 2333321 1 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc--hHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD--SELFSHIS 158 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D--~~l~~~~~ 158 (399) .+...|..||- .......+...++...+|++.+..+|++++...+| .+|.+.+. T Consensus 52 -~~~a~wv~Eg~-----------------------~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~ 107 (311) T protein:vir:99 52 -RPKAEFVGEGQ-----------------------QKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLS 107 (311) T ss_pred -CceeEEeecCc-----------------------ccccccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHH Confidence 12335666652 22233345567888899999999999998765444 35777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEE---------ecCCCcccccccccccCCcee-cHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIV---------YTGAATQDSEITGEGATPSVV-DYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~---------yag~ats~~~~t~~~~~~~~v-t~~~lr~a~~~L~~nrap~~t~ 228 (399) .+|.+.-+.-.|.. +++|.+.-. +.+..+...+.+ ..... ...++..+...+..+++.. 
T Consensus 108 ~~la~ai~~~~d~~----~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~i~~~~~~~~~~~~~~--- 176 (311) T protein:vir:99 108 EAGAEALARALDLG----LYHRINPLTGTVIPGWSNYLGAASKRVELT----ADTIANPDLAIEAAVGLLVANGHPT--- 176 (311) T ss_pred HHHHHHHHHHHHHH----hhcccCcccCccccccccccccccceeecc----ccccchhHHHHHHHHHHHhhhccCC--- Confidence 77776655554433 444432111 111111111111 12222 2345556665555554432 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) ..+ .-++||.+...|+.|+|--+.|-|.+. ...++.|++.+++++.++.+..-..... T Consensus 177 -------------~~~-~~vmn~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~Pv~~s~~i~~~~~~~~ 234 (311) T protein:vir:99 177 -------------PVN-GLALHPSIAWGLSTARYTDGRKKFPEL--------GLGIGVSSFEGIDASVSDTVNGGDEADP 234 (311) T ss_pred -------------Ccc-EEEEcHHHHHHHHhhhccCCCeeecCc--------ccCCCCceecceeeEeeccccccccccc Confidence 011 247899999999999887777777543 2334668999999998887632111111 Q ss_pred CccCCccccccCccceEEEEEEEcccce-eeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESF-TTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLIL 385 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Af-g~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) .. ... ..++ +..+++|.=+= -.++... ...++.... + .++..=.|-|+...++| +++++.++ T Consensus 235 ~~---~~~-~~~~----~~~~~~Gdf~~~~~~~~~~---~~~~~~~~~--~--~~~~~~~~~~~d~~~~r~~~r~d~~v~ 299 (311) T protein:vir:99 235 DD---EDL-DAAR----AVRGIVGDFANGIHWGVQR---DIPVELIKY--G--DPDGQGDLKRHNQIALRLEIVYGWYVF 299 (311) T ss_pred cc---chh-hccC----cceEEEeeccccEEEEEec---CceEEEeec--C--CCCcchhhhhcCcEEEEEEEeecceec Confidence 10 000 0011 12244554210 1121111 111222211 1 11211244667777776 78888999 Q ss_pred cccceEEEEEec Q lcl|NC_019514. 
386 RPERLALVKTVA 397 (399) Q Consensus 386 n~~~m~~ie~~a 397 (399) ++++.+....+| T Consensus 300 ~~~~v~~~~~~A 311 (311) T protein:vir:99 300 TDRFVVIENAVA 311 (311) T ss_pred ChhHeeeecccC Confidence 999998888888 No 79 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.70 E-value=6.8e-09 Score=65.36 Aligned_cols=285 Identities=13% Similarity=0.098 Sum_probs=165.2 Q ss_pred CCcCCee-ecCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGML-YNDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~-~n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) ++.+... .+.. ++.+++.+ .+-|+ .+..+.+....+...+.+++...+|+-+.|+...+++-.- T Consensus 98 ~~~~~~~~~~~~-~~~t~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--------- 163 (397) T protein:vir:48 98 VRGRYQNLLDSK-TDASGSDAGLTIPQ----DIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADI--------- 163 (397) T ss_pred HhhhhhHHHHHh-hccCCccccccccH----HHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCC--------- Confidence 3222211 0111 12222222 22333 3466777777888999999999999988887665544311 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .+...+..||... + .....+...++.+.++++.++.+|+++++ +++..+...+. 
T Consensus 164 ---~~~a~~v~E~~~~---~-------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~v~ 217 (397) T protein:vir:48 164 ---TGLAKLDDEAGSI---G-------------------TNDDPKLYPIRYAIKRYAGISTVTNSLLA-DSAENILAWLS 217 (397) T ss_pred ---Ccceeeecccccc---c-------------------cccccceeeEEeeheeeeeehhhHHHHHh-hchHHHHHHHH Confidence 1223455554211 1 11123455678889999999999999765 44555777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) ..+.+..+...+ ..+++|.+.. ......+++++|.++...|+..-.+. T Consensus 218 ~~l~~~~~~~~d----~~il~G~g~~---------------~~~~~~~~~d~i~~~~~~l~~~~~~~------------- 265 (397) T protein:vir:48 218 GWIAKKVVVTRN----KAILEAIATL---------------PTKPTLTKWDDIIDLQAKVDPAIKQT------------- 265 (397) T ss_pred HHHHHHHHHHHH----HHHhhccccc---------------ccccccccHHHHHHHHHHhhhhhcCC------------- Confidence 777777655443 3456665321 12345678999999988887643221 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) -+-+|||.+...|+.|+|..+.|-|.+- +..|.-+.+-|+.++.++.... ..++. T Consensus 266 ------a~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~~~--~~~~~--------- 320 (397) T protein:vir:48 266 ------SFFLTNTSGFTALKKVKNAFGDYLMERD--------VKSPTGYSIDGFAVKEVADRWL--ANASS--------- 320 (397) T ss_pred ------CEEEECHHHHHHHHHhhcCCCceeeccC--------cCCCCCceeccceeEEeccccc--CCcCC--------- Confidence 2446899999999999987777777542 2345667888888776543211 01110 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEe Q lcl|NC_019514. 
319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTV 396 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~ 396 (399) +.. .+++|.-+-...-+..++ +.+.+. +-.+.+-+++.+.|+ +++.+.+++++-++.++.. T Consensus 321 -----~~~-~~~~gd~~~~~~~~~~~~----~~i~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 321 -----GAM-PLYFGDLKQAVTLFDRQQ----MSLLST-------NIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFK 383 (397) T ss_pred -----Cce-EEEEEeccceEEEEeecc----eEEEEe-------ccchhhhhcCceeEEEEeeeccEEecccceEEEEec Confidence 011 245664331111111122 111111 112345667777776 5578899999988888876 Q ss_pred ccC Q lcl|NC_019514. 397 APL 399 (399) Q Consensus 397 a~~ 399 (399) +.- T Consensus 384 ~~~ 386 (397) T protein:vir:48 384 AIA 386 (397) T ss_pred ccc Confidence 665 No 80 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.69 E-value=3.9e-09 Score=66.71 Aligned_cols=297 Identities=11% Similarity=0.014 Sum_probs=163.7 Q ss_pred CCcCCeee---c----CCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 1 MASKGMLY---N----DPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~~~~~---n----~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) ++.+.... . .--+..++..+. +-|. .+..+.+..+....++.+++...+||...|+....++... 
T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~--- 164 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPE----DIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQ--- 164 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeech----hHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCC--- Confidence 22111110 0 000112222222 2232 2355666667888899999999999998887554444321 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) +...|..+|-. +... ....+...++.+.++++.|+.+|+++++ +++.. T Consensus 165 ----------~~~~~v~e~~~---------~~~~------------~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~ 212 (404) T protein:vir:10 165 ----------KPMKPLSENQQ---------IPTN------------GDNGKLERFNFKLKDLADFMSIPNDLLK-FADKS 212 (404) T ss_pred ----------cceeecccccc---------cccc------------ccccceeeeEeeheeeEeeehhhHHHHh-hcHHH Confidence 23345555411 0000 0113345678889999999999998765 45556 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceec Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVIT 231 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~ 231 (399) +...+..++.+..+...+. -+++|.+.-.-.++-..... ....++.+..+++++..+.. 
.|+..-.+ T Consensus 213 l~~~i~~~la~~~~~~~~~----~il~G~g~~~~~~gi~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~------- 280 (404) T protein:vir:10 213 LEDWIINWFVDKVRITRNA----EILYGAGGDEHATGIMTANK-FKKITLPKSPALKDFKKCKNVELLNVFKA------- 280 (404) T ss_pred HHHHHHHHHHHHHHHHHHH----HHhhcCCCCCcccceeeccc-cceeeccccccHHHHHHHHHhhhhccccC------- Confidence 8888888888887765544 44565442111111000000 11123345567888887663 34332111 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) .-+.+|||.....|+.|+|-.+.+-|.|-. -.+..+++-|..+++.+..++ +. T Consensus 281 ------------~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~--------~~~~~~~l~G~PV~~~~~~~~----~~--- 333 (404) T protein:vir:10 281 ------------TSSWIVNQDGFNYLDSLEDKTGRPYLQPDP--------KDPTQYRFLGLPVIELPNDLL----LS--- 333 (404) T ss_pred ------------CCEEEEcHHHHHHHHHhhccCCceeeccCc--------CCCCCccccceeeEEeccccc----CC--- Confidence 123579999999999999877777776532 233446777888776554322 00 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCcc--chhhHHHHH--HHHHHhhccc Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPY--GEMGFSSIK--WYYGTLILRP 387 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPl--gQrg~~gwK--~~~~~~iLn~ 387 (399) +.+.. .+++|.-+-+.+-+..++ +.+.+- .+++ -+++...|+ +++.+.++++ T Consensus 334 -------~~~~~----~~~~gd~s~~~~~~~~~~----~~i~~~---------~~~~~~~~~~~~~~~~~~r~d~~v~~~ 389 (404) T protein:vir:10 334 -------TESAI----PVLLGDTKEAYKYVSDGA----YELATT---------NIGAGAFETNTTKARIIMRIDGNVKDS 389 (404) T ss_pred -------CCCcc----EEEEEeccccEEEEEecc----eEEEEe---------ccccchhhcCceEEEEEEeeccEEecc Confidence 01111 246786543222222222 222211 1232 346666665 6788899999 Q ss_pred cceEEEEEeccC Q lcl|NC_019514. 
388 ERLALVKTVAPL 399 (399) Q Consensus 388 ~~m~~ie~~a~~ 399 (399) +-++.++..+.. T Consensus 390 ~a~~~~~~~~aa 401 (404) T protein:vir:10 390 EALLIAEIPVES 401 (404) T ss_pred cceEEEEeeccc Confidence 999999888777 No 81 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.69 E-value=2e-09 Score=68.32 Aligned_cols=301 Identities=15% Similarity=0.088 Sum_probs=165.6 Q ss_pred CCcCCeeecCCC----CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN----TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~----~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+ ++..||.-. .+.++..+..-|+ . +..+.++...+.-++.++++..+|+.+ ++++-+.. T Consensus 1 ~~-~~~~~~~e~~~~~~~~~~~~~~~ip~---~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~-------- 64 (318) T protein:vir:24 1 MA-AGTAFAVDHAQIAQTGDTMFKGYLEP---E-QAKDYFAEAEKTSIVQQFAQKVPMGTT---GQKIPHWV-------- 64 (318) T ss_pred CC-CCCCCCHHHHHhhcccCcccceeech---h-HHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEe-------- Confidence 33 344454321 1222222332233 2 356777778888899999999888743 34433331 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) . .+...|..||. .|.. ...+...++.+.++++.++.+|+++++ +++..+.+. 
T Consensus 65 --~--~~~a~~v~Eg~---------~~~~--------------~~~~f~~i~~~~~k~~~~~~iS~e~l~-ds~~~~~~~ 116 (318) T protein:vir:24 65 --G--DVSAQWIGEGD---------MKPI--------------TKGNMTSQTIAPHKIATIFVASAETVR-ANPANYLGT 116 (318) T ss_pred --C--CcceEEecCCc---------cccc--------------cccceeEEEEeeEEEEEeehhhHHHhh-cChHHHHHH Confidence 1 12335666652 2222 233455688899999999999999766 344458888 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCC--cccccccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAA--TQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSR 234 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~a--ts~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~ 234 (399) +.+.+.+..+.-.+ ..+++|.+.-.-.|-. +.....+ ..........+++..+...+...... T Consensus 117 i~~~l~~~~~~~~d----~a~l~G~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 181 (318) T protein:vir:24 117 MRTKVATAFAMAFD----GAAMHGTDSPFPTYIGQTTKAISIA-DTTGATTVYDQVAVNGLSLLVNDGKK---------- 181 (318) T ss_pred HHHHHHHHHHHHHH----HhhhcccCCCCCccccccccccccc-ccccccchHHHHHHHHHHhhccccCC---------- Confidence 87888777665444 3446655432111111 1111111 11222223334455555444332211 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) . =+.++||.....|+.|+|..+.+-|.+...-+...+ .+.+.+-++.++.++.+. 
.| T Consensus 182 --------~-~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~---~~~~~i~g~pv~~~~~~~----~~------- 238 (318) T protein:vir:24 182 --------W-THTLLDDITEPILNGAKDQNGRPLFIESTYGEAASP---FRSGRIVARPTILSDHVV----EG------- 238 (318) T ss_pred --------C-CEEEEcHHHHHHHHHhhccCCceeecCccccCcccc---ccCceEEEEeeEEeCCCC----CC------- Confidence 1 134899999999999999888888877544444432 234566677777666531 01 Q ss_pred cccccCccceEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCcc--chhhHHHHH--HHHHHhhccc Q lcl|NC_019514. 315 GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPY--GEMGFSSIK--WYYGTLILRP 387 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPl--gQrg~~gwK--~~~~~~iLn~ 387 (399) + .++++|.=+...++..++ ..+++. .+.=+ +.+.++|. -|++.+.|| +++.+.++++ T Consensus 239 -------~----~~~~~gdfs~~~~~~~~~---l~i~~~~~~~~~~~--~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 302 (318) T protein:vir:24 239 -------T----TVGFMGDFSQLIWGQIGG---LSFDVTDQATLNLG--TVESPNFVSLWQHNLVAVRVEAEYAFHCNDA 302 (318) T ss_pred -------c----cEEEEeecceEEEEEecC---eEEEEeeccceecc--ccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 1 134566655555544432 112211 11111 22333443 466777776 7889999999 Q ss_pred cceEEEEEeccC Q lcl|NC_019514. 388 ERLALVKTVAPL 399 (399) Q Consensus 388 ~~m~~ie~~a~~ 399 (399) +-.+.|..++-= T Consensus 303 ~a~~~i~~~~a~ 314 (318) T protein:vir:24 303 EAFVALTNVVSG 314 (318) T ss_pred cceEEEEeeccC Confidence 999888876555 No 82 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.68 E-value=8.9e-09 Score=64.72 Aligned_cols=289 Identities=11% Similarity=0.079 Sum_probs=164.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+ ++++..+.+-|+ . +..+.++.+.+.-++.+++...+||.+. +++.++. . T Consensus 1 ma-----------~~t~~~G~lip~---~-~~~~ii~~l~~~s~i~~l~~~~~~~~~~---~~~p~~~----------~- 51 (300) T protein:vir:95 1 MS-----------EAQLSKGNLFNP---E-LVTKVINKVKGHSSIAKLSPQKPIPFNG---QREFVFD----------F- 51 (300) T ss_pred Cc-----------ccccCCcceech---h-hHHHHHHHHHhhhhhhhhcceeeccCCc---eEEEEEe----------c- Confidence 43 234444444444 2 3577888899999999999999988853 2333321 1 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc--hHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD--SELFSHIS 158 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D--~~l~~~~~ 158 (399) .+...|+.||.. + -....+...++...++++.++.+|++++...+| .+|.+.+. T Consensus 52 -~~~a~wv~Eg~~---------~--------------~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~ 107 (300) T protein:vir:95 52 -DSDIDIVAENGK---------K--------------THGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFV 107 (300) T ss_pred -CcceEEeeCCcc---------c--------------ccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHH Confidence 123356666621 1 122344556788899999999999997754333 35777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCC------eEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAG------TIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITG 232 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~------~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~ 232 (399) +.|.+..+.-.+ ..+++|.+ .-.. |.....+..+...+.....++++|.++...|...+... 
T Consensus 108 ~~l~~aia~~~d----~~~l~G~~~~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------- 175 (300) T protein:vir:95 108 EGFSKKLARGLD----IMSIHGINPRTKQASTII-GDNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSERDI------- 175 (300) T ss_pred HHHHHHHHHHHH----HhhhhcccCCCCCCcccc-cccccccccceeecccccchHHHHHHHHHHhhhcCCCc------- Confidence 777766554443 33444421 1111 11111111122222334567889999988887654321 Q ss_pred ccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccC Q lcl|NC_019514. 233 SRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGT 312 (399) Q Consensus 233 s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~ 312 (399) + +.++||.....|+.|+|-.+.|-|.+.. ..+.-|++-|++++.++.+.. +.. T Consensus 176 ----------~--~~vmn~~~~~~L~~lkd~~G~~i~~~~~--------~~~~~~~l~G~Pv~~s~~v~~----~~~--- 228 (300) T protein:vir:95 176 ----------T--GAILDPIFTTALSKMKNAEGGKLYPELA--------WGGVPDAINGLAVDKNRTVSY----SQT--- 228 (300) T ss_pred ----------c--EEEECHHHHHHHHHhhccCCCeeccCcc--------ccCCCceecceeeEEecCCCC----CCC--- Confidence 1 3578999999999998877766664331 234567899999998887621 111 Q ss_pred CccccccCccceEEEEEEEccccee-eeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_019514. 313 NPGYRETNGKYDIYPMLCVGAESFT-TIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPER 389 (399) Q Consensus 313 ~~~~~~t~~~~DVyp~lV~G~~Afg-~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) .++ ..+++|.=+-+ .+++..+ ..++.. .-+. ..++.--|-|+.-++++ +.+++.+++++. T Consensus 229 -------~~~----~~~~~GDf~~~~~~~~~~~---~~~~v~--~~~~-~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a 291 (300) T protein:vir:95 229 -------DPK----NTAIVGDFETMFKWGYAKE---VPMEII--KYGD-PDNSGRDLKGYNQIYIRCEAYIGWGIMDAAS 291 (300) T ss_pred -------CCc----cEEEEeeccceEEEEEecc---cEEEEe--eccC-CCCcchhhhhcCcEEEEEEEeecceeecccc Confidence 111 13455642111 1333221 123322 1111 11111125566667777 367888999999 Q ss_pred eEEEEEecc Q lcl|NC_019514. 
390 LALVKTVAP 398 (399) Q Consensus 390 m~~ie~~a~ 398 (399) +++|.-++= T Consensus 292 ~~~l~~~~g 300 (300) T protein:vir:95 292 FARIVKTGG 300 (300) T ss_pred eEEEecCCC Confidence 999887777 No 83 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.66 E-value=4.3e-09 Score=66.46 Aligned_cols=295 Identities=11% Similarity=0.002 Sum_probs=164.2 Q ss_pred CCcCCeeecCC--CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDP--NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~--~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) +.+....-+.- ..++++..+.+-|+ .+....+..+.+...+.+++...+|+.+.++-... +..+ T Consensus 109 ~~~~~~~~~~~~~~~~~t~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~--------- 174 (415) T protein:vir:47 109 FTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVV-RQSE--------- 174 (415) T ss_pred HHHHHhhhhhhhhccccccCCcccccH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEE-EecC--------- Confidence 00000000000 01222233334443 34566666688999999999999999887653222 2211 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .+...+..||... ++ ....+...++...++++.++.+|+++++ +++..+...+. 
T Consensus 175 ---~~~~~~v~Eg~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 228 (415) T protein:vir:47 175 ---VAALEKVEELEEN---PE-------------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELK 228 (415) T ss_pred ---Ccceeeccccccc---cc-------------------ccccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHH Confidence 1222455555221 11 1113455678889999999999999775 45555888888 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) .+|.+..+...+..+....-+|.+...- ..........++.+.+++++|.++.-.|....... T Consensus 229 ~~l~~~i~~~~d~~il~g~g~g~~~~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------- 291 (415) T protein:vir:47 229 LWMARTIAATRNKAIIDVITKGSTGSTS----SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH------------- 291 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCCccccc----cccccccceeccccccchHHHHHHHHhhhhhccCC------------- Confidence 8888776655544433322222221111 11111122234556689999999987776543221 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) + ..++||.+...|+.|+|-.+.|-|.+- +..+-.+++-|+.++.++.+.. +.+ T Consensus 292 ----~--~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pV~~~~~~~~----~~~--------- 344 (415) T protein:vir:47 292 ----N--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEVL----GQK--------- 344 (415) T ss_pred ----C--EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCccccceeeEEeccccc----cCC--------- Confidence 1 346999999999999887777667542 2344557889999988876521 111 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 
319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +.. .++||.=+-..+.+...+ ..++.. . +-..|+++.++ +++.+.+++++-++.++..++ T Consensus 345 --~~~----~~~~gd~~~~~~~~~~~~--~~v~~~--------~---~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 345 --GNN----TLIIGNLKDAIVLFDRSQ--YQASWT--------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred --Ccc----EEEEEehhccEEEEeecc--eEEEee--------c---cccCceEEEEE-EEeccEEeccccEEEEEeecc Confidence 011 267774332222222222 111111 1 22334444433 567888999999999988888 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) . T Consensus 405 ~ 405 (415) T protein:vir:47 405 E 405 (415) T ss_pred C Confidence 7 No 84 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.66 E-value=4.3e-09 Score=66.46 Aligned_cols=295 Identities=11% Similarity=0.002 Sum_probs=164.2 Q ss_pred CCcCCeeecCC--CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDP--NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~--~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) +.+....-+.- ..++++..+.+-|+ .+....+..+.+...+.+++...+|+.+.++-... +..+ T Consensus 109 ~~~~~~~~~~~~~~~~~t~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~--------- 174 (415) T protein:vir:46 109 FTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVV-RQSE--------- 174 (415) T ss_pred HHHHHhhhhhhhhccccccCCcccccH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEE-EecC--------- Confidence 00000000000 01222233334443 34566666688999999999999999887653222 2211 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 
79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .+...+..||... ++ ....+...++...++++.++.+|+++++ +++..+...+. T Consensus 175 ---~~~~~~v~Eg~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 228 (415) T protein:vir:46 175 ---VAALEKVEELEEN---PE-------------------LAVKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELK 228 (415) T ss_pred ---Ccceeeccccccc---cc-------------------ccccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHH Confidence 1222455555221 11 1113455678889999999999999775 45555888888 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) .+|.+..+...+..+....-+|.+...- ..........++.+.+++++|.++.-.|....... T Consensus 229 ~~l~~~i~~~~d~~il~g~g~g~~~~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------- 291 (415) T protein:vir:46 229 LWMARTIAATRNKAIIDVITKGSTGSTS----SGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH------------- 291 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCCccccc----cccccccceeccccccchHHHHHHHHhhhhhccCC------------- Confidence 8888776655544433322222221111 11111122234556689999999987776543221 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) + ..++||.+...|+.|+|-.+.|-|.+- +..+-.+++-|+.++.++.+.. 
+.+ T Consensus 292 ----~--~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pV~~~~~~~~----~~~--------- 344 (415) T protein:vir:46 292 ----N--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEVL----GQK--------- 344 (415) T ss_pred ----C--EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCccccceeeEEeccccc----cCC--------- Confidence 1 346999999999999887777667542 2344557889999988876521 111 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +.. .++||.=+-..+.+...+ ..++.. . +-..|+++.++ +++.+.+++++-++.++..++ T Consensus 345 --~~~----~~~~gd~~~~~~~~~~~~--~~v~~~--------~---~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 345 --GNN----TLIIGNLKDAIVLFDRSQ--YQASWT--------D---YMHFGECLMIA-VRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred --Ccc----EEEEEehhccEEEEeecc--eEEEee--------c---cccCceEEEEE-EEeccEEeccccEEEEEeecc Confidence 011 267774332222222222 111111 1 22334444433 567888999999999988888 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) . T Consensus 405 ~ 405 (415) T protein:vir:46 405 E 405 (415) T ss_pred C Confidence 7 No 85 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.66 E-value=3.5e-09 Score=66.96 Aligned_cols=289 Identities=16% Similarity=0.079 Sum_probs=164.3 Q ss_pred CC-------cCCeeecCCC--CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MA-------SKGMLYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~-------~~~~~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) +. ++....+... ++.++..+.+-|. .+....+....+...+.+++...+++.+. +++-+.. 
T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~--- 185 (418) T protein:vir:10 116 ARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVA----DRQAGIIAPPQRKMTIRDLLMPGQTSSSS---IEYTVET--- 185 (418) T ss_pred HhhhhhhhhHHHHHHHhhhhccCCCCCCccccch----hHHHHHHHHHhhhhhHHhhcceeeccCCc---eeEEEEe--- Confidence 11 0111111111 1222222333333 23456666677888888999888886543 3333321 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) +.++...|..||- .+.. ...+...++...++++.++.+|+++++ +++ T Consensus 186 --------~~~~~a~~v~E~~---------~~~~--------------~~~~f~~v~~~~~k~~~~~~is~ell~-ds~- 232 (418) T protein:vir:10 186 --------GFTNNAAAVAEGA---------QKPT--------------SDLKFNLKNQPVRTIAHLFKASRQILD-DAP- 232 (418) T ss_pred --------cCCCceeeeccCc---------cccc--------------cccceeeEEEeeeeEEEeehhhHHHHH-hHH- Confidence 1122234555541 1222 223455688899999999999999886 455 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE-ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV-YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~-yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) .+...+...+.+..+...+. -+++|.+.-- -.|-.+.....+...++.+..++++|..+.-.+.....+. 
T Consensus 233 ~l~~~i~~~l~~a~~~~~d~----a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~----- 303 (418) T protein:vir:10 233 ALQSYIDGRARYGLQLTEEG----QILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPA----- 303 (418) T ss_pred HHHHHHHHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccccccccccHHHHHHHHHhhccccCCC----- Confidence 48888888888776655543 4455543210 0011111111122234445567888888887665533321 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) =..+|||.+...|+.|+|-.+.+-|-+ ...+..|.+-|+.+|.++.|.. T Consensus 304 --------------~~~v~n~~~~~~L~~lkd~~G~~i~~~---------~~~~~~~~l~G~pV~~~~~~p~-------- 352 (418) T protein:vir:10 304 --------------TGIVLNPIDWASIELTKDSQGRYIVGN---------PVNGTTPRLWNLPVVETQAMTA-------- 352 (418) T ss_pred --------------CEEEEcHHHHHHHHHhhcCCCceeccc---------cccCCCceecceeeEEcCCCCC-------- Confidence 135689999999999988766665632 2355668889999999887621 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhcccc Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPE 388 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~ 388 (399) + .+++|.-+....-....+ +.+-+- +-...+-+++.+.|+ +++.+.++++. T Consensus 353 --~--------------~~~~gd~s~~~~~~~~~~----~~i~~~-------~~~~~~f~~~~~~~r~~~~~d~~~~~~~ 405 (418) T protein:vir:10 353 --N--------------EFLVGAFSMAAQIFDRME----IEVLLS-------TENVDDFEKNMVSIRAEERLALAVYRPE 405 (418) T ss_pred --C--------------cEEEeeccceEEEEEecc----eEEEEe-------cccchhhhcCceEEEEEEeeccEEeccc Confidence 0 145665433222222222 222111 112345678888887 56889999999 Q ss_pred ceEEEEEeccC Q lcl|NC_019514. 389 RLALVKTVAPL 399 (399) Q Consensus 389 ~m~~ie~~a~~ 399 (399) -+++++..++. 
T Consensus 406 a~~~~~~~~~~ 416 (418) T protein:vir:10 406 SFVTGALVEQA 416 (418) T ss_pred ceEEEEeccCC Confidence 99999999999 No 86 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.65 E-value=1.6e-08 Score=63.32 Aligned_cols=293 Identities=10% Similarity=-0.000 Sum_probs=142.4 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHh---hhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYF---MPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~---~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+ .|...--|-|++=.-|-..+..+. ...+.. ...++...+-...|.+|++=.|.+|.-+.... T Consensus 1 MA------------~T~lsd~i~PEvf~~yv~~~~~~~-~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~ 67 (351) T protein:vir:15 1 MA------------ETHLSDLIVPEVFGNYVVNQIIKT-NRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNW 67 (351) T ss_pred CC------------ceeeeeeechhHHHHHHhhhhHHh-hhHhhcccccccHHHHHHhhcCCCEEEecccccCCCccccc Confidence 43 123223344553333322211111 111111 22333333344679999999998883332222 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 
78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) +++-+-....| +-....+.+++.|.=++.+|.+.+..-.+ .++++ T Consensus 68 ~~~~~i~~~ki----------------------------------tt~~~~a~i~~~~kg~~~tD~a~~~sg~d-p~~~i 112 (351) T protein:vir:15 68 TDSDDIDVNNL----------------------------------TSGKQQGIKFYQTKAYGYTDLGTMISGAP-VQETI 112 (351) T ss_pred CCCcccchhee----------------------------------cccceeEEEEeeccceehhhhhHhhccch-HHHHH Confidence 22222111112 22335577888888889999877765554 44445 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEEe----cCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIVY----TGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~y----ag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) ...++. -..+..|.+|++-..-++. ++..+.+. +........++.+.|-+|...|-..+-.+ T Consensus 113 ~~q~a~----~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~--t~~~~~~~~is~~~l~~A~~~~GD~~~~~-------- 178 (351) T protein:vir:15 113 GNRFAA----FWQRADQKTLLSVLKGVMGVTKIANSKVYDQ--TKVSPSEPMFGAKGFTGAIGLMGDLQDTA-------- 178 (351) T ss_pred HHHHHH----HHHHHHHHHHHHHHHHHhhchhhcccceecc--ccccccccccCHHHHHHHHHHhccccccc-------- Confidence 555544 3444555555543211111 11112222 22223455689999999998875522111 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN 313 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~ 313 (399) -.+.+|||....+|+++ +++...+|.+. 
...||.+.|.|+|+++-+- T Consensus 179 ----------~~~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~t~~G~~VivdD~~p------------ 225 (351) T protein:vir:15 179 ----------FGAIAVNSATYSLMKVQ-------GLIETIQPQNG----ATPFEAYNGLRIVLDDDIE------------ 225 (351) T ss_pred ----------eEEEEEChHHHHHHHhh-------hhhhhcccccc----CcccceecceEEEEcCCCc------------ Confidence 26788999999999864 46666678765 3479999999999998651 Q ss_pred ccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhH-----HHHHHHH---HHhhc Q lcl|NC_019514. 314 PGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGF-----SSIKWYY---GTLIL 385 (399) Q Consensus 314 ~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~-----~gwK~~~---~~~iL 385 (399) +..+++..++|-..+||++|++... +.+ ...+...|.. -.+.|=|=+|.. .|+||-- ....- T Consensus 226 --~~~~~~~~~~ytsyl~~~GAi~~~~----~~~--~ve~~rd~~~--~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~ 295 (351) T protein:vir:15 226 --IDLTDKTKPVSTSYIFAPGAVRYST----NMR--STETKYDPLI--NGGQDVIVQKRVGTIHVAGTSIKASFSPSKAS 295 (351) T ss_pred --cccCCCCCceeEEEEEecceeeeec----CCc--CcceeecccC--CCCceEEEEeeeeeeeeeeeeecccccccCcC Confidence 1122334569999999999998531 111 1122222210 011122222211 0111110 00000 Q ss_pred cccceEEEEEeccC Q lcl|NC_019514. 386 RPERLALVKTVAPL 399 (399) Q Consensus 386 n~~~m~~ie~~a~~ 399 (399) ++- .+-|+.++.- T Consensus 296 sPt-~~~L~~~~NW 308 (351) T protein:vir:15 296 FPT-IDELAKSSTW 308 (351) T ss_pred CcC-hHHhcCCccc Confidence 000 0000011100 No 87 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.64 E-value=1.1e-08 Score=64.14 Aligned_cols=311 Identities=13% Similarity=0.077 Sum_probs=166.6 Q ss_pred CCcC-Ceeec----CCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 
1 MASK-GMLYN----DPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~-~~~~n----~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) |+.- ....| +++...++..+.+-|+ .+..+.++...+..++.+++...+|+- .++++.+..- T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~----~~~~~ii~~l~~~s~l~~~~~~~~~~~---~~~~~p~~~~------ 67 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPK----EIVGPIFDKAQESSLVLRMGEQIPISY---GETIIPTTVK------ 67 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccch----hHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEeC------ Confidence 4431 11122 2222222222223333 235677788889999999999988873 2333333311 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) .+...|..||.... +. .++.......+...++...+|++.+..+|+++++. +...+.+ T Consensus 68 ------~~~a~~v~eg~~~~--------------~~-e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~-s~~~~~~ 125 (333) T protein:vir:78 68 ------RPEVGQVGVGTSNE--------------QR-EGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM-NPSGLYT 125 (333) T ss_pred ------CceeEeecCccccc--------------cc-ccccccccccceeEEEEeeEEEEEeehhhHHHHhc-CHHHHHH Confidence 12333555553220 11 12222334455667888999999999999997763 3344777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEec---CCCc----ccccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYT---GAAT----QDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~ya---g~at----s~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) .+...|.+..+.-.+ ..+++|.+...-. |..+ ...+.....+.....++++|.++...+..|.... 
T Consensus 126 ~i~~~la~ai~~~~d----~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--- 198 (333) T protein:vir:78 126 KLQGDLAYAIGRGID----LAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVE--- 198 (333) T ss_pred HHHHHHHHHHHHHHH----HHHhcccCCCCCcccccccccccccccccccccccccchhHHHHHHHHHhhccccccC--- Confidence 777777766554443 3445555432111 1000 0001111123344567888888887766543321 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHH---hhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRK---LVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dird---l~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) + =+.++||.+...|+. ++|-.+.+-|.+ ....+..|++-|+.+++++.+..= T Consensus 199 --------------~-~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~--------~~~~~~~~~l~G~Pv~~~~~i~~~-- 253 (333) T protein:vir:78 199 --------------F-NGWAVDPRFRAHLLRAQAYRDANGNVDPSR--------INLAAQTGDVLGLPAQFGRAVGGD-- 253 (333) T ss_pred --------------c-eEEEEcchHHHHHHHHhhhcCCCCceeecC--------ccccCCCceeeceeeEEccccCCC-- Confidence 1 135669998877765 444334444432 245567799999999999886321 Q ss_pred cCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCC----ccchhhHHHHH--HH Q lcl|NC_019514. 306 AGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRND----PYGEMGFSSIK--WY 379 (399) Q Consensus 306 aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~D----PlgQrg~~gwK--~~ 379 (399) .+ ...+++. .+++|.-..-.+++.++ +.+-+-.= ++....| .+-|++.+.++ .+ T Consensus 254 ~~---------~~~~~~~----~~~~gD~~~~~~g~~~~-----~~i~~~~~--~~~~~~~~~~~~~~~~~~v~~r~~~r 313 (333) T protein:vir:78 254 LG---------AAVDSKT----RIIGGDFSQLKFGFADE-----IRIKMSDT--ATLTDSGSATVSMWQTNQIAILIEVT 313 (333) T ss_pred cc---------ccCCCcc----EEEEEecccEEEEEeec-----cEEEEecc--ccccccccceeehhhcCcEEEEEEEE Confidence 01 1112233 24566555544554432 22222111 1122222 23466666666 57 Q ss_pred HHHhhccccceEEEE-Eecc Q lcl|NC_019514. 
380 YGTLILRPERLALVK-TVAP 398 (399) Q Consensus 380 ~~~~iLn~~~m~~ie-~~a~ 398 (399) +.+.+++++-+++|+ ..+| T Consensus 314 ~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 314 FGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EccEEecccceEEEeccCCC Confidence 888999999998887 5566 No 88 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.64 E-value=4.4e-09 Score=66.38 Aligned_cols=286 Identities=12% Similarity=0.015 Sum_probs=163.3 Q ss_pred CCcCCe---eecCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGM---LYNDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~---~~n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) ...+.. .++.--.+.++..+. +.|+ +....+....+...+.+++...++..+ .+++.+.. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-----~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~-------- 154 (385) T protein:vir:18 91 GKQGTFGAKTFNKSLGSDADSAGSLIQPM-----QIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREE-------- 154 (385) T ss_pred HhhccchhhHHHhhhccccccCCceecch-----hhhHHHHHhhhccchhhhcceecccCc---ceEEEEEe-------- Confidence 111111 111111122222233 3333 245666777888889999988887643 45555542 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) +.++...|..|| +.+. ....+...++.++++++.++.+|+++++ +++ .+.+. 
T Consensus 155 ---~~~~~a~~v~E~---------~~~~--------------~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~~ 206 (385) T protein:vir:18 155 ---VFTNNADVVAEK---------ALKP--------------ESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQSY 206 (385) T ss_pred ---cCCcceeeeccC---------cccc--------------ccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHHH Confidence 122333455554 1222 2334456688999999999999999877 454 48877 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEec-CCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYT-GAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRM 235 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~ya-g~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~ 235 (399) +...+.+..+...+ ..+++|.+.---. |-.+.....+...++....++++|.++.-.|+.+.... T Consensus 207 i~~~la~a~~~~~d----~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~---------- 272 (385) T protein:vir:18 207 INNRLMYGLALKEE----GQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSA---------- 272 (385) T ss_pred HHHHHHHHHHHHHH----HHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCC---------- Confidence 77777777665544 3456654221110 00000000111123345568999999998887654332 Q ss_pred cCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcc Q lcl|NC_019514. 236 IDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPG 315 (399) Q Consensus 236 ~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~ 315 (399) =+.+|||.+...|+.|+|..+.+-|-+ ...+..+.+-|+++++++.+. ++ T Consensus 273 ---------~~~~~~~~~~~~l~~lkd~~G~~l~~~---------~~~~~~~~l~G~pV~~~~~~p----------~~-- 322 (385) T protein:vir:18 273 ---------SGIVLNPRDWHNIALLKDNEGRYIFGG---------PQAFTSNIMWGLPVVPTKAQA----------AG-- 322 (385) T ss_pred ---------CEEEEcHHHHHHHHHhhcCCCceeccC---------cccCCCceecceeeEEcCcCC----------CC-- Confidence 146889999999999998766666632 235566888899999988762 01 Q ss_pred ccccCccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_019514. 
316 YRETNGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLA 391 (399) Q Consensus 316 ~~~t~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .+++|.- +|.... ..+ +. +.... ...| +-+++.+.|+ +.+++.++++.-++ T Consensus 323 ------------~~~~gd~~~~~~~~~--~~~----~~---v~~~~---~~~~-~~~~~~~~~~~~~r~~~~v~~~~a~~ 377 (385) T protein:vir:18 323 ------------TFTVGGFDMASQVWD--RMD----AT---VEVSR---EDRD-NFVKNMLTILCEERLALAHYRPTAII 377 (385) T ss_pred ------------cEEEeecccEEEEEE--ecc----eE---EEEec---cccc-hhhcCcEEEEEEEeeccEEecccceE Confidence 1445542 333221 111 11 11111 1223 3467888877 47889999999999 Q ss_pred EEEEeccC Q lcl|NC_019514. 392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) +++..+-= T Consensus 378 ~~~~~aa~ 385 (385) T protein:vir:18 378 KGTFSSGS 385 (385) T ss_pred EEEeccCC Confidence 98876555 No 89 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.64 E-value=4.4e-09 Score=66.38 Aligned_cols=286 Identities=12% Similarity=0.015 Sum_probs=163.3 Q ss_pred CCcCCe---eecCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGM---LYNDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~---~~n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) ...+.. .++.--.+.++..+. +.|+ +....+....+...+.+++...++..+ .+++.+.. 
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-----~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~-------- 154 (385) T protein:vir:19 91 GKQGTFGAKTFNKSLGSDADSAGSLIQPM-----QIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREE-------- 154 (385) T ss_pred HhhccchhhHHHhhhccccccCCceecch-----hhhHHHHHhhhccchhhhcceecccCc---ceEEEEEe-------- Confidence 111111 111111122222233 3333 245666777888889999988887643 45555542 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) +.++...|..|| +.+. ....+...++.++++++.++.+|+++++ +++ .+.+. T Consensus 155 ---~~~~~a~~v~E~---------~~~~--------------~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~~ 206 (385) T protein:vir:19 155 ---VFTNNADVVAEK---------ALKP--------------ESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQSY 206 (385) T ss_pred ---cCCcceeeeccC---------cccc--------------ccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHHH Confidence 122333455554 1222 2334456688999999999999999877 454 48877 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEec-CCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYT-GAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRM 235 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~ya-g~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~ 235 (399) +...+.+..+...+ ..+++|.+.---. |-.+.....+...++....++++|.++.-.|+.+.... 
T Consensus 207 i~~~la~a~~~~~d----~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~---------- 272 (385) T protein:vir:19 207 INNRLMYGLALKEE----GQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSA---------- 272 (385) T ss_pred HHHHHHHHHHHHHH----HHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCC---------- Confidence 77777777665544 3456654221110 00000000111123345568999999998887654332 Q ss_pred cCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcc Q lcl|NC_019514. 236 IDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPG 315 (399) Q Consensus 236 ~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~ 315 (399) =+.+|||.+...|+.|+|..+.+-|-+ ...+..+.+-|+++++++.+. ++ T Consensus 273 ---------~~~~~~~~~~~~l~~lkd~~G~~l~~~---------~~~~~~~~l~G~pV~~~~~~p----------~~-- 322 (385) T protein:vir:19 273 ---------SGIVLNPRDWHNIALLKDNEGRYIFGG---------PQAFTSNIMWGLPVVPTKAQA----------AG-- 322 (385) T ss_pred ---------CEEEEcHHHHHHHHHhhcCCCceeccC---------cccCCCceecceeeEEcCcCC----------CC-- Confidence 146889999999999998766666632 235566888899999988762 01 Q ss_pred ccccCccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_019514. 316 YRETNGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLA 391 (399) Q Consensus 316 ~~~t~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .+++|.- +|.... ..+ +. +.... ...| +-+++.+.|+ +.+++.++++.-++ T Consensus 323 ------------~~~~gd~~~~~~~~~--~~~----~~---v~~~~---~~~~-~~~~~~~~~~~~~r~~~~v~~~~a~~ 377 (385) T protein:vir:19 323 ------------TFTVGGFDMASQVWD--RMD----AT---VEVSR---EDRD-NFVKNMLTILCEERLALAHYRPTAII 377 (385) T ss_pred ------------cEEEeecccEEEEEE--ecc----eE---EEEec---cccc-hhhcCcEEEEEEEeeccEEecccceE Confidence 1445542 333221 111 11 11111 1223 3467888877 47889999999999 Q ss_pred EEEEeccC Q lcl|NC_019514. 
392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) +++..+-= T Consensus 378 ~~~~~aa~ 385 (385) T protein:vir:19 378 KGTFSSGS 385 (385) T ss_pred EEEeccCC Confidence 98876555 No 90 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.63 E-value=1.5e-08 Score=63.54 Aligned_cols=286 Identities=13% Similarity=0.125 Sum_probs=162.8 Q ss_pred CCcCCeeecCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) ++++....-.--++.++..++ +-|+ .+..+.+....+.-.+.+++...+|+.+.|+....+.- . T Consensus 98 l~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~---------- 162 (397) T protein:vir:49 98 VRGRYQNLLDSKTDASGSDAGLTIPQ----DIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWT-D---------- 162 (397) T ss_pred HhcchhHHHHHhhccccccCcccccH----hHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeec-c---------- Confidence 222211110001222333332 2344 34566666677888899999999999988875433322 1 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) ..+...|..||-. +.. ....+...++.++++++.++.+|+++++ +++..+...+.. 
T Consensus 163 -~~~~a~~v~E~~~---------~~~-------------~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 218 (397) T protein:vir:49 163 -ITGLANIDDEAGK---------IAD-------------VDDPKLSLIKYTIKRYAGISTVTNSLLA-DSAENILAWLSG 218 (397) T ss_pred -CCcceeeecCccc---------ccc-------------ccccceeeEEeeeeeEEeeehhHHHHHh-hhHHHHHHHHHH Confidence 1122345555521 111 1123455688899999999999999775 455557777777 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~ 239 (399) .+.+..+...+ ..+++|.+.. .......++++|.++...|+.+-.+. T Consensus 219 ~l~~~~~~~~d----~ai~~G~g~~---------------~~~~~~~~~d~i~~~~~~l~~~~~~~-------------- 265 (397) T protein:vir:49 219 WIAKKVVVTRN----KAILEAIAAL---------------PTKPTLTKWDDIIDLEAKVDPAIKQT-------------- 265 (397) T ss_pred HHHHHHHHHHH----HHHHhhcccc---------------ccccccccHHHHHHHHHhhhhhhcCC-------------- Confidence 77766654443 3456664321 11234468999999988887644321 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) =+.++||.+...|+.|+|--+.|-|.+- +..|.-+++-|+.++.++.. |...++. T Consensus 266 -----a~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~PV~~~~~~--~~~~~~~---------- 320 (397) T protein:vir:49 266 -----SFFLTNTSGFTALKKVKNALGDYLMERD--------VKSPTGYSIDGFAVKEVADR--WLANGTG---------- 320 (397) T ss_pred -----CEEEEcHHHHHHHHHhhcCCCceeeccC--------cCCCCCceecceeeEEeccc--ccccccC---------- Confidence 1357899999999999987777776542 34556678889888765532 1111111 Q ss_pred CccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_019514. 
320 NGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~a 397 (399) + -. .++||.=.-+..-+...| +.+.+- +-.+-+-+.+.+.++ ..+.+.++++.-++.++..+ T Consensus 321 -~---~~-~i~~gd~~~~~~~~~~~~----~~i~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 321 -G---AM-PLYFGDLKQAVTLFDRQH----MSLLST-------NIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred -C---ce-eEEEeeccceEEEEeecc----eEEEEe-------ccccchhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 1 11 245674332111111112 111111 111233445555555 57788999999999988766 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) .- T Consensus 385 ~~ 386 (397) T protein:vir:49 385 IA 386 (397) T ss_pred cc Confidence 55 No 91 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.61 E-value=4.8e-09 Score=66.19 Aligned_cols=285 Identities=12% Similarity=0.050 Sum_probs=158.5 Q ss_pred CCcCC--e------eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 1 MASKG--M------LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~~--~------~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) +..+. . ..|.-.++.++..+.+-|. . +.+..++...+...+.+++...+++.+ ++++.+... 
T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~---~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~--- 164 (390) T protein:vir:81 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTP---N-RLPGFITPPDARLTVRDLIGSGRTDSA---LIEYVQETG--- 164 (390) T ss_pred HhhhhhhhhhHHHHHHHhhccccccCCcceech---h-hhHHHHHHHhhhhhhhhhcceeeccCC---ceEEEEEec--- Confidence 00000 0 0111011112222222222 1 245666667788888899988887643 344444311 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) ......|..||. ++.. ...+...++.++++++.++.+|+++++ ++. . T Consensus 165 --------~~~~a~~v~Eg~---------~~~~--------------~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~ 211 (390) T protein:vir:81 165 --------FVNNAAIVAEGA---------LKPE--------------SSLKFAKKTDTTHVIAHTMKATRQILS-DAP-Q 211 (390) T ss_pred --------CCcceeeecCCc---------cccc--------------ccceeeEEEEeeeEEEEeehhhHHHHH-hHH-H Confidence 112234555552 2222 234456788999999999999999876 344 4 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE-ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV-YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~-yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) +...+...|.+..+...+ .-+++|.+.-- ..|--+.....+...+.....++++|..+.-.|.....+. 
T Consensus 212 ~~~~i~~~l~~~~~~~~d----~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 281 (390) T protein:vir:81 212 LASYMNNRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNP------ 281 (390) T ss_pred HHHHHHHHHHHHHHHHHH----HHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCC------ Confidence 777777777766655443 34566543211 0111111111111223445567888888887776654331 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) . .-++||.....|+.|+|-.+.+-|-+. ..+..+++-|+.+++++.+. T Consensus 282 -----------~--~~v~~~~~~~~l~~lkd~~G~~l~~~~---------~~~~~~~l~G~pv~~~~~~p---------- 329 (390) T protein:vir:81 282 -----------S--GIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWGLPVVATQAMA---------- 329 (390) T ss_pred -----------C--EEEEcHHHHHHHHHhhcCCCceeecCc---------ccccCceecceeeEEcCCCC---------- Confidence 1 347899999999999887666656432 23445678899999988752 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPER 389 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) .+ .+++|.-+.+..-+...+ +.+-+ +..+.+-+++.+.|+ +++.+.+++++- T Consensus 330 ~~--------------~~~~gd~~~~~~~~~~~~----~~v~~--------~~~~~~~~~~~v~~r~~~r~d~~v~~~~a 383 (390) T protein:vir:81 330 PG--------------EFLVGAFDLAAQIFDQWD----ARVEI--------GYVGEDFQRNMITVLAEERLALVVYRPEA 383 (390) T ss_pred CC--------------cEEEEehhceEEEEEecc----eEEEE--------ecccchhhcCcEEEEEEEeeccEEecccc Confidence 01 135565443222121122 11111 122345667777766 678889999999 Q ss_pred eEEEEEe Q lcl|NC_019514. 
390 LALVKTV 396 (399) Q Consensus 390 m~~ie~~ 396 (399) ++++..+ T Consensus 384 ~v~~t~a 390 (390) T protein:vir:81 384 LISGSFA 390 (390) T ss_pred eEEEEeC Confidence 9999999 No 92 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.61 E-value=1.1e-08 Score=64.30 Aligned_cols=284 Identities=11% Similarity=0.052 Sum_probs=158.7 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |+ +..+.+-|. . +..+.++.+++.-.+.+++...+|+.+. +++-+..- T Consensus 1 ma--------------~~gG~lvp~---~-~~~~ii~~~~~~s~i~~l~~~~~~~~~~---~~ip~~~~----------- 48 (298) T protein:vir:16 1 MV--------------LNKGTLFDP---T-LVTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTM----------- 48 (298) T ss_pred Cc--------------ccCcceech---h-HHHHHHHHHHhhhhhhhhcceeeccCCc---eEEEEEec----------- Confidence 22 222233333 1 2467777788999999999988887533 33333311 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch--HHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS--ELFSHIS 158 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~--~l~~~~~ 158 (399) .+...|..||.. +...+ .+...++...++++.+..+|++++....|+ ++.+.+. 
T Consensus 49 -~~~a~~v~E~~~---------~~~~~--------------~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~ 104 (298) T protein:vir:16 49 -DSEIDVVAESGK---------KTHGG--------------VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFN 104 (298) T ss_pred -CcceEEecCCcc---------ccccc--------------cceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHH Confidence 233456776622 22222 334568889999999999999987544332 4666666 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCC--------eEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAG--------TIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~--------~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) ..|.+.-+.-.+ ..+++|.+ ..-.++....... .........-.+++|.++...|..++.+. T Consensus 105 ~~la~ai~~~~d----~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~----- 174 (298) T protein:vir:16 105 DGFAKKVARGID----LMAFHGVNPRLGTASAVIGTNHFDSKVTQ-KVEAPRGIADPNGAIENAVELLTGVDADV----- 174 (298) T ss_pred HHHHHHHHHHHH----HHhhccccCCCCccccccccccccccccc-ccccccccccHHHHHHHHHHHhhhcCCCc----- Confidence 666665544433 34445421 1111111000000 00111112223678888887777765432 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) =..++||.+...|+.|+|--+.|-|.+. ...+.-|++-|+.++.++.+.. +. T Consensus 175 --------------~~~vmn~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~v~~----~~-- 226 (298) T protein:vir:16 175 --------------TGIAINPSFRSALAKQKDLQDNALFPEL--------KWGATPDTINGLPVDVNKTVSD----MS-- 226 (298) T ss_pred --------------cEEEEcHHHHHHHHHhhccCCCeeecCc--------ccCCCCceecceeeEEeccccc----cc-- Confidence 1356799999999999887777777543 2344557889999998887532 00 Q ss_pred cCCccccccCccceEEEEEEEccccee-eeccccCCCCccceEEEecCCCCCCCC-CCccchhhHHHHH--HHHHHhhcc Q lcl|NC_019514. 
311 GTNPGYRETNGKYDIYPMLCVGAESFT-TIGFQTDGKTLKFKVTTKMPGEATADR-NDPYGEMGFSSIK--WYYGTLILR 386 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg-~v~l~g~g~~~~~~~ivk~pG~~~ad~-~DPlgQrg~~gwK--~~~~~~iLn 386 (399) . .++. .+++|.-+-+ .++...+ .++++. .-+ .+|+ .-=|-|++.++|+ +++.+.+++ T Consensus 227 -------~-~~~~----~~~~GDfs~~~~~~~~~~---~~~~~~--~~~--~~~~~~~~~f~~~~v~~ra~~r~d~~v~~ 287 (298) T protein:vir:16 227 -------L-TQRD----RAIIGDFANGFKWGYAKE---VPLEVI--QYG--DPDNSGLDLKGYNQVYIRAELFLGWGILD 287 (298) T ss_pred -------C-CCcc----EEEEeeccceEEEEEecC---ceEEEe--ecc--CCcCcchhhhhcCcEEEEEEEEEccEeec Confidence 0 1121 3667764322 2443332 123322 111 0111 0113466777777 478899999 Q ss_pred ccceEEEEEec Q lcl|NC_019514. 387 PERLALVKTVA 397 (399) Q Consensus 387 ~~~m~~ie~~a 397 (399) ++-+++|+-+= T Consensus 288 ~~a~~~l~~at 298 (298) T protein:vir:16 288 ATKFARVTEAN 298 (298) T ss_pred ccceEEEeecC Confidence 99999997665 No 93 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.59 E-value=6e-09 Score=65.66 Aligned_cols=295 Identities=15% Similarity=0.116 Sum_probs=127.0 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccc---cccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVV---SMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~---~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+ .++ +.|| .|.+++|..-++.|||.++.+.. ++--..|.||+++++.++... 
T Consensus 1 Ma-----~~~-----------~~p~----~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~---- 56 (392) T protein:vir:99 1 MA-----NAF-----------SKPT----AVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGH---- 56 (392) T ss_pred Cc-----ccc-----------ccHH----HHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccce---- Confidence 22 111 2344 68899999999999999887532 332246999999987654110 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~~ 156 (399) + .+..+. +.. +.|..+++. -..++.+|.|+ .+=.+++|+-......+ +... T Consensus 57 ~--~~~~~~--------~~~---~~~~~~~~~--------------~~~~~~~id~~k~~~~~i~d~e~~~~~~~-~~~~ 108 (392) T protein:vir:99 57 T--RKLRGA--------GAE---RNLTVSDFT--------------EDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQ 108 (392) T ss_pred e--eecccc--------ccC---Ccccccccc--------------cceEEEEEeeeeecceeechHHHhhhhhh-hHHH Confidence 0 000000 000 012222211 12344455332 23344666533333333 3322 Q ss_pred HHHHHHHhhhHHHHHHHHHHH---HhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDL---LAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~---lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) +.+..++. +. +.+-.++ ++++ .+.... ........-.++.+..+.+.|.++++|. T Consensus 109 ~~~~a~~a---la-~~vd~~i~~~~~~a--~~~~~~--------~~~~~~~~~~~~~i~~a~~~L~~~~vP~-------- 166 (392) T protein:vir:99 109 ILPRQVRG---VA-DILEEGVRDMIVGA--PYEAAG--------AVHEVAPDEFFKGVNGARRALNELYIPQ-------- 166 (392) T ss_pred HHHHHHHH---HH-HHHHHHHHHHHhcc--cccccc--------cccccChhhhHHHHHHHHHHHhhcCCCC-------- Confidence 22222322 22 2332333 2222 112111 1112223346889999999999999984 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCc--cccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 
234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADA--GTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~--~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) .|++++.|+....|.. ++.|+..+.+|+. +.+-+|+||++.+|.+++++....=......++ T Consensus 167 ----------~R~~vv~p~~~~~l~~------~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~ 230 (392) T protein:vir:99 167 ----------GRVLVVGTAVTEQILN------DDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) T ss_pred ----------CCEEEEcHHHHHHHhc------ccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeecc Confidence 2678889999988853 5889999888765 457789999999999999887643221111111 Q ss_pred CCccccccC---ccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcccc Q lcl|NC_019514. 312 TNPGYRETN---GKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPE 388 (399) Q Consensus 312 ~~~~~~~t~---~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~ 388 (399) +.. ..+.. .....+...+-|..++..--+... ....+.... ..+.+.-.+.+. ...+ .-+... T Consensus 231 a~~-~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~------~~t~~s~~~----~v~~~~g~~~v~--~~~~-~~~~~~ 296 (392) T protein:vir:99 231 AFI-MATRAPAPPMGAVRSTAISGDQRIAMRWLVDY------DSTITSNRS----LIDTYFGLKVVE--DPNG-VGFVRA 296 (392) T ss_pred ccc-cccccccccccccceeEEecccceecceeecc------cceeecccc----ccceeEEEEEEe--eccc-cceeee Confidence 000 00000 000011111222221111000000 000000000 001111000000 0000 000000 Q ss_pred ceEEE-EEe---ccC Q lcl|NC_019514. 389 RLALV-KTV---APL 399 (399) Q Consensus 389 ~m~~i-e~~---a~~ 399 (399) ....+ ... 
.++ T Consensus 297 ~~~~~~~~~v~v~~v 311 (392) T protein:vir:99 297 RKIHLIPGSIEVAPE 311 (392) T ss_pred eeeeeecceeeeeee Confidence 00000 000 000 No 94 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.59 E-value=1.2e-08 Score=63.94 Aligned_cols=282 Identities=12% Similarity=0.058 Sum_probs=160.3 Q ss_pred CCcCCe------ee-----cCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccc Q lcl|NC_019514. 1 MASKGM------LY-----NDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIP 69 (399) Q Consensus 1 ~~~~~~------~~-----n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~p 69 (399) |..+.. .. +.-.+++++..+.+-|+ .+....++...+..++.+++...+++...|+....+.- . T Consensus 102 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~ 176 (397) T protein:vir:12 102 LRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPE----DIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA-D 176 (397) T ss_pred HhccCCcHHHHHHHhhhhhhhccccccccCcccCch----hHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEec-C Confidence 111110 00 11112222333333344 33555555577888899999999999887753222211 1 Q ss_pred cccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 70 LLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 70 l~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) .+..+|..||-.. 
++ ....+...++.+.++++.++.+|+++++ ++ T Consensus 177 ------------~~~a~~v~Eg~~~---~~-------------------~~~~~~~~v~~~~~k~~~~~~is~e~l~-ds 221 (397) T protein:vir:12 177 ------------MVPFSPVEELGNL---PE-------------------IDQPRFTKVSYSIIDYGGIMTLSNSMLN-DS 221 (397) T ss_pred ------------Ccceeeecccccc---cc-------------------cccccceeEEeeheeeEeeehhhHHHHh-hc Confidence 1233566666221 11 1112345677889999999999999775 44 Q ss_pred chHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccc Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTK 228 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~ 228 (399) +..+.+.+...|.+.-+...+ .-+++|.+.. .....+++++|..+.- .|+..-.+ T Consensus 222 ~~~l~~~i~~~l~~~~~~~~d----~~il~G~g~~----------------~~~g~~~~~~i~~~~~~~l~~~~~~---- 277 (397) T protein:vir:12 222 DQAIMTYVAKWFAKKSVVTRN----NLILAAIASL----------------KKVDIDGLDGIKKALNVTLDPMVAP---- 277 (397) T ss_pred hHHHHHHHHHHHHHHHHHHHH----HHHHhccccc----------------cccccccHHHHHHHHhhccchhhhC---- Confidence 445777788888877665443 3466665321 2234567888877653 44432211 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) .-+-++||.....|+.|+|--+.+-|.|- +..|--+++-|..++.++...+ +. 
T Consensus 278 ---------------~a~~~~n~~~~~~L~~lkd~~G~~l~~~~--------~~~g~~~~l~G~pv~~~~~~~~----~~ 330 (397) T protein:vir:12 278 ---------------GSIVLTNQDGYDWLDTLKDGTGRYLLQPD--------PTNPTKKLLDGRPVVPFTNRVL----KT 330 (397) T ss_pred ---------------CCEEEEcHHHHHHHHHhhccCCceeeccc--------ccCCCCccccceeeEEeccccc----cc Confidence 12357999999999999887776666542 2344556788999987776433 11 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhcc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILR 386 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn 386 (399) + .+. . .++||.-+-+.+.+...+ ..++. .+..+.+-+++...|+ +++.+.+++ T Consensus 331 ~----------~~~---~-~~~~gd~~~~~~~~~~~~--~~i~~---------~~~~~~~f~~~~~~~r~~~r~d~~~~~ 385 (397) T protein:vir:12 331 Q----------KGK---A-PLIIGNLKEAIVLFDREQ--QSIAS---------TDTGAGAFETNSTKVRGIEREDVRKWD 385 (397) T ss_pred C----------CCc---c-EEEEEehhceEEEEeecc--eEEEE---------eccccchhhcCceEEEEEEeeccEEec Confidence 1 011 1 146775321111111111 11111 1122444567777777 458999999 Q ss_pred ccceEEEEEecc Q lcl|NC_019514. 387 PERLALVKTVAP 398 (399) Q Consensus 387 ~~~m~~ie~~a~ 398 (399) +.-++.+...+. T Consensus 386 ~~a~~~~~~t~~ 397 (397) T protein:vir:12 386 EDAVVFGQITVE 397 (397) T ss_pred ccceEEEEEeeC Confidence 999999999999 No 95 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.59 E-value=1.3e-08 Score=63.76 Aligned_cols=284 Identities=13% Similarity=0.070 Sum_probs=160.8 Q ss_pred CCcCCee--ecCC--CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 
1 MASKGML--YNDP--NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~--~n~~--~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+...+. -+.- .+++++..+.+-|+ .+..+.+....+...+.+++...+|+.+.|+-....+... T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~------- 161 (395) T protein:vir:38 93 MKNQFVKDFKNLVTSGTTGTGNAGLTIPE----DIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADI------- 161 (395) T ss_pred HHHHHHHHHHHHHhhccCccCCCceecch----hHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccC------- Confidence 0000000 0000 12222222223333 3456677778888899999999999999987555444321 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) .+...|..||-.. ++ ....+...++.+.++++.++.+|+++++ +++.+|.+. T Consensus 162 -----~~~a~~v~E~~~~---~~-------------------~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~ 213 (395) T protein:vir:38 162 -----TPLKDLDDESALI---GD-------------------NDDPELTVVKYLIHRYAGITTVTNTLLK-DTVDNIIQW 213 (395) T ss_pred -----Ccccccccccccc---cc-------------------ccccceeeEEeeeeeeEeehhhHHHHHh-hhHHHHHHH Confidence 2233455554211 10 1123455678889999999999999775 566668888 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceeccccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVITGSRM 235 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~~s~~ 235 (399) +..+|.+..+...+ ..+++|.+.- .......++++|..+.- .|+..-. 
T Consensus 214 i~~~la~~~~~~~~----~~il~g~g~~---------------~~~~~~~~~~~i~~~~~~~l~~~~~------------ 262 (395) T protein:vir:38 214 LVNWAAKKDVVTRN----AKILEVMGKA---------------PKKPTISQFDNIKDLENNTLDPAIE------------ 262 (395) T ss_pred HHHHHHHHHHHHHH----HHHhhccccc---------------ccccccccHHHHHHHHHHhhhhhhc------------ Confidence 88888877665443 3455654321 11223446777766542 3322111 Q ss_pred cCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcc Q lcl|NC_019514. 236 IDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPG 315 (399) Q Consensus 236 ~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~ 315 (399) +.-+-+|||.+...|+.|+|--+.|-|.+- +..|.-+++-|+.+++++.+.. +.+.+ T Consensus 263 -------~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~~----~~~~~---- 319 (395) T protein:vir:38 263 -------STSSFITNQSGYNILSKVKDADGRYLMQPD--------VTSPDKYLIDGKPVIRIADKWL----PDVSG---- 319 (395) T ss_pred -------CCCEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceeccceeEEeccccc----CcCCC---- Confidence 112357999999999999887777777542 2345556788888888775311 11100 Q ss_pred ccccCccceEEEEEEEccccee-eeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH--HHHHhhccccceEE Q lcl|NC_019514. 316 YRETNGKYDIYPMLCVGAESFT-TIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW--YYGTLILRPERLAL 392 (399) Q Consensus 316 ~~~t~~~~DVyp~lV~G~~Afg-~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ 392 (399) .. .++||.-+-+ .+.... | +.+-+ .+-.+.+-+++.++|++ ++.+.++++.-++. T Consensus 320 ------~~----~i~~gd~~~~~~i~~~~-~----~~i~~-------~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 377 (395) T protein:vir:38 320 ------SH----PLYFGDLKQGITLFDRQ-Q----MQIDT-------TNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAA 377 (395) T ss_pred ------cc----eEEEEeccccEEEEEec-c----eEEEE-------eccccchhhcCceEEEEEEeeccEEecccceEE Confidence 11 2467753322 222211 1 11111 11223456777777774 48999999999999 Q ss_pred EEEeccC Q lcl|NC_019514. 393 VKTVAPL 399 (399) Q Consensus 393 ie~~a~~ 399 (399) |+..+.. 
T Consensus 378 ~~~~~~~ 384 (395) T protein:vir:38 378 ASFKTVA 384 (395) T ss_pred EEeeccc Confidence 9977666 No 96 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.57 E-value=1.4e-08 Score=63.72 Aligned_cols=300 Identities=14% Similarity=0.044 Sum_probs=158.5 Q ss_pred CCc-CCeeecCCC--CcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MAS-KGMLYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~-~~~~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) +.+ .-..++... +++++..+.+-|. . +....+..+. +.-.+.+++.+.++ .|+.... +.. T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg~lip~---~-~~~~ii~~~~~~~~~l~~~~~~~~~---~g~~~~~-~~~-------- 300 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGGYLVPF---Q-LDPTVIITSNGSLNDIRRFARQVVA---TGDVWHG-VSS-------- 300 (543) T ss_pred hhhhhhhhhhhhhhcccccccCcccCch---h-hhhHHHHHHHhhhchhhhhcccccC---CcceEEE-Eec-------- Confidence 110 001111111 1222222232332 1 2334455544 33446666654333 3543221 111 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) . .+...|..||.. +.. ..++...++.+.++++.|+.||+++++ ++ +.+... 
T Consensus 301 --~--~~~a~~v~Eg~~---------~~~--------------~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~~~~~ 351 (543) T protein:vir:81 301 --A--AVQWSWDAEFEE---------VSD--------------DSPEFGQPEIPVKKAQGFVPISIEALQ-DE-ANVTET 351 (543) T ss_pred --C--CcceeecccCcc---------ccc--------------cccccceeeeeeeeeEeeehhhHHHHh-cc-HHHHHH Confidence 1 122345555521 222 223445678889999999999999886 34 458888 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeE-EecCCCccc--ccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTI-VYTGAATQD--SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v-~yag~ats~--~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) +...|.+..+...+ .-+++|.++- ...|-.+.. ...+....+...++++++..+...|+.+..+. T Consensus 352 i~~~l~~~~~~~~d----~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-------- 419 (543) T protein:vir:81 352 VALLFAEGKDELEA----VTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQ-------- 419 (543) T ss_pred HHHHHHHHHHHHHH----HHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCC-------- Confidence 87777777654443 3455654321 111110000 00111123445688999999998887654321 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN 313 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~ 313 (399) -+-++||.+...|+.|+|.-+.|-|-+. 
..|.-+++-|..++.++.|-.-...+.+ T Consensus 420 -----------~~~v~n~~~~~~l~~lkd~~G~~l~~~~---------~~g~~~~l~G~pv~~~~~~~~~~~~~~~---- 475 (543) T protein:vir:81 420 -----------GAWLANNLIYNKIRQFDTQGGAGLWTTI---------GNGEPSQLLGRPVGEAEAMDANWNTSAS---- 475 (543) T ss_pred -----------cEEEEcHHHHHHHHHhhcCCCceeccCc---------CCCCCccccceeeEEecccccccccccc---- Confidence 2457999999999999987777766543 3445578899999999987432221111 Q ss_pred ccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEE Q lcl|NC_019514. 314 PGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALV 393 (399) Q Consensus 314 ~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~i 393 (399) .+.+ .++||.=+.-.++..++ +.+.+-.=+ ..+.....|++++..| +++++.++++.-++.+ T Consensus 476 ---------~~~~-~i~~gd~~~~~i~~~~~-----~~i~~~~~~--~~~~~~~~~~~~~~~~-~r~d~~v~~~~A~~~l 537 (543) T protein:vir:81 476 ---------ADNF-VLLYGNFQNYVIADRIG-----MTVEFIPHL--FGTNRRPNGSRGWFAY-YRMGADVVNPNAFRLL 537 (543) T ss_pred ---------CCcc-eEEEeeccceeEEeecc-----cEEEEeccc--cccchhhcCceEEEEE-EeeccEeecccceEEE Confidence 1122 35667766555555442 222222111 1112223445444432 4678899999998888 Q ss_pred EEeccC Q lcl|NC_019514. 394 KTVAPL 399 (399) Q Consensus 394 e~~a~~ 399 (399) +..+.- T Consensus 538 ~~~~~a 543 (543) T protein:vir:81 538 NVETAS 543 (543) T ss_pred EecccC Confidence 877666 No 97 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.54 E-value=4.9e-08 Score=60.64 Aligned_cols=278 Identities=11% Similarity=0.100 Sum_probs=158.6 Q ss_pred eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCCc Q lcl|NC_019514. 
7 LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGA 86 (399) Q Consensus 7 ~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga 86 (399) +++.-...+++..+-+-|+ .+..+.++...+...+.+++...+|+.+.|+....++- +..+... T Consensus 1 ~l~~~~~~t~~~gg~liP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~------------~~~~~a~ 64 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQ----DIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWT------------DITGLAN 64 (293) T ss_pred CceeecccccCcCceEech----hHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeec------------CCCccee Confidence 5555333323322222233 34566766678899999999999999998864443332 1122334 Q ss_pred eeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhh Q lcl|NC_019514. 87 TIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAV 166 (399) Q Consensus 87 ~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~ 166 (399) +..||-. + ......+...++.+.++++.++.+|+++++ +++.++.+.+..++.+..+ T Consensus 65 ~v~Eg~~---------~-------------~~~~~~~~~~i~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~ 121 (293) T protein:vir:48 65 IDDEAGK---------I-------------ADIDDPKLSLIKYTIKRYAGISTVTNSLLA-DSAENILAWLSGWIAKKVV 121 (293) T ss_pred eecCCcc---------c-------------ccccccceeEEEEeeeEEEEeehhhHHHHh-hhhHHHHHHHHHHHHHHHH Confidence 5666521 1 111123456678889999999999999775 3334477777677666554 Q ss_pred HHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeE Q lcl|NC_019514. 167 QLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRV 246 (399) Q Consensus 167 ~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv 246 (399) ... -..+++|.+. ..+....+++++|.++.-.|+..-.+. 
+ + T Consensus 122 ~~~----~~~i~~g~~~---------------~~~~~~~~~~d~i~~~~~~l~~~~~~~-----------------a--~ 163 (293) T protein:vir:48 122 VTR----NKAILGVVDK---------------LPTKPTLTKWDDIIDLEAKVDPAIKQT-----------------S--F 163 (293) T ss_pred HHH----HhHHhhcccc---------------ccccccccCHHHHHHHHHhhhhhhcCC-----------------C--E Confidence 433 2344544321 112345678999999988776422111 1 2 Q ss_pred EEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccceEE Q lcl|NC_019514. 247 LYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIY 326 (399) Q Consensus 247 ~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVy 326 (399) -+||+.+...|+.|+|--+.|-|.+- +..|--+++-|..++.++.. .+.+. ..+.+ T Consensus 164 ~vmn~~~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G~Pv~~~~~~-~~~~~---------------~~~~~ 219 (293) T protein:vir:48 164 FLTNTSGFTALKKVKNALGDYLMERD--------VKSPTGYSIAGFAVKEISDR-WLPNA---------------SSGVM 219 (293) T ss_pred EEEcHHHHHHHHHhhccCCceEeecC--------cCCCCCceecceeeEEeccc-ccCCc---------------cCCce Confidence 35799999999999987666666542 23445567888777654432 11000 01123 Q ss_pred EEEEEcc--cceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 327 PMLCVGA--ESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 327 p~lV~G~--~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~a~~ 399 (399) + ++||. ++|-....++ +.+.+. +-.+-+-+++..+++ +.+.+.+.+++-++.++..+.. 
T Consensus 220 ~-~~~gd~~~~~~~~~~~~------~~i~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 282 (293) T protein:vir:48 220 P-LYFGDLKQAVTLFDRQQ------MSLLST-------NIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 282 (293) T ss_pred E-EEEEeccceEEEEEecc------eEEEEe-------cccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccc Confidence 3 35663 3343222222 222211 111233445555555 5678889999999999866655 No 98 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.52 E-value=8.5e-08 Score=59.35 Aligned_cols=268 Identities=13% Similarity=0.074 Sum_probs=139.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcccccccc--------cCCCEEEEEEcccc- Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLADVVSMPK--------NYGKEIRVYHYIPL- 70 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~~~~mPk--------N~GktIk~rry~pl- 70 (399) |+ .|...--|-||+=.- ++.... ....|.|-+-..+.|. -.|.+|.+=.|..| T Consensus 1 MA------------~T~lsd~i~peVf~~-----yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~ 63 (324) T protein:vir:59 1 MA------------YTKISDVIVPELFNP-----YVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD 63 (324) T ss_pred CC------------ceeeeceechhHHHH-----HHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC Confidence 43 122222344553333 333322 2223433333333332 25899998888887 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) .+.++.. ++-+-....++ -....+.+++.|.=++.+|.+.+.--. 
T Consensus 64 Gd~~~v~-~~~~i~~~~l~----------------------------------t~~~~a~i~~~~k~~~~tD~a~~~sg~ 108 (324) T protein:vir:59 64 GDSQVLN-DTDDLVPQKIN----------------------------------AGQDKAVLILRGNAWSSHDLAATLSGS 108 (324) T ss_pred CcccccC-CCcccchhhcc----------------------------------cceeeEEEEeecCceeehhhhhhhccc Confidence 2332222 22111111122 223456677777767899987665444 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEec-CCCcccccccccccCCceecHHHHHHHHHHHHhccCccccce Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYT-GAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKV 229 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~ya-g~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~ 229 (399) + -++++.+.++ .-..+..+.++++...-++.. ..++.. .+-..+.+..+|.+.|-+|...|-.+.. T Consensus 109 d-p~~~i~~q~a----~~~~~~~~~~lia~l~g~~~~~~~~~~~--~dvsa~~~~~~s~~~l~~A~~~~GD~~~------ 175 (324) T protein:vir:59 109 D-PMQAIGSRVA----AYWAREMQKIVFAELAGVFSNDDMKDNK--LDISGTADGIYSAETFVDASYKLGDHES------ 175 (324) T ss_pred h-HHHHHHHHHH----HHHHHHHHHHHHHHHHHhhhccccccce--eeeeccccceecHHHHHHHHHHhCCccc------ Confidence 4 3444444444 444456666666544322222 122221 2222344567999999999988766433 Q ss_pred eccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCC Q lcl|NC_019514. 230 ITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGAT 309 (399) Q Consensus 230 i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~ 309 (399) .-.+.++||....+||++ +++.-.+|.+.. ++||.+-|.|+|++.-+-. T Consensus 176 -------------~~~~ivmhS~v~~~L~~~-------~li~~~~~s~~~----~~i~~~~G~~VivdD~~p~------- 224 (324) T protein:vir:59 176 -------------LLTAIGMHSATMASAVKQ-------DLIEFVKDSQSG----IRFPTYMNKRVIVDDSMPV------- 224 (324) T ss_pred -------------CcEEEEEchHHHHHHHHh-------hhhhhccccccC----ceeeeecccEEEEeCCCCc------- Confidence 237899999999999874 345445666653 5899999999999876521 Q ss_pred ccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 
310 VGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 310 ~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~ 389 (399) ....+..++|-.++||++|++...-+- . +.. | + ..||.+-. ..|.-+| T Consensus 225 -------~~~~~~~~~y~s~l~~~GAi~~~~~~~------~--v~v---E-~--dRd~~~g~-----------~~l~~r~ 272 (324) T protein:vir:59 225 -------ETLEDGTKVFTSYLFGAGALGYAEGQP------E--VPT---E-T--ARNALGSQ-----------DILINRK 272 (324) T ss_pred -------cccCCCCceEEEEEEecCeEEEeecCC------C--cce---e-c--ccCccccc-----------eEEEEee Confidence 111234569999999999998864221 0 111 1 1 11443211 1111111 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) . .+..+. T Consensus 273 ~---~~~~p~ 279 (324) T protein:vir:59 273 H---FVLHPR 279 (324) T ss_pred E---EEeEee Confidence 1 112221 No 99 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=98.49 E-value=2e-08 Score=62.74 Aligned_cols=295 Identities=11% Similarity=0.060 Sum_probs=143.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccccccc----ccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMP----KNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mP----kN~GktIk~rry~pl~~~~~~ 76 (399) |+ |.. ++ +.|| -|.+++|+.-+.+||+.++....-.. .++|.||+.|+-..+...+.. 
T Consensus 1 MA------Nsl----~~----l~p~----iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~ 62 (423) T protein:vir:10 1 MA------NNL----DA----NVSQ----IVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTM 62 (423) T ss_pred Cc------ccc----cc----ccHH----HHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeeccc Confidence 33 222 11 2344 58899999999999999887643222 458999999887665222111 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeee-ecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQK-FGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~q-YG~~~e~Td~~~d~~~D~~l~~ 155 (399) ... + ++ . +.+.++... +...|-| ..+-++++|+=...+.++ T Consensus 63 ~~~-~--------t~----~--~~~~l~e~~-------------------v~l~id~~k~~a~~v~d~E~~l~i~~---- 104 (423) T protein:vir:10 63 DGD-I--------TG----K--SKNSLISAK-------------------ATGEVGNYITVAVEYRQIEEALKLNQ---- 104 (423) T ss_pred Ccc-c--------Cc----c--cccccccce-------------------EEEEecceeeeeeeeChHHHhcChhH---- Confidence 110 0 00 0 011111111 1222322 234566777633323332 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRM 235 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~ 235 (399) +.+.|..++..+. +.+-.+|...... ..++. ++..++... .++++..+.+.|.++++|+ T Consensus 105 -~~~~l~~A~~aLA-~~vd~~ia~~~~~-~~~~~------vgt~~t~~~--a~~~~a~a~~~L~~~~vP~---------- 163 (423) T protein:vir:10 105 -LDQILVPINERMV-TDLETELALFMMK-HGALS------LGSPNTPIK--KWSDVAQTASFLKDLGINS---------- 163 (423) T ss_pred -HHHHHHHHHHHHH-HHHHHHHHHHhhh-ccccc------ccccccccc--cHHHHHHHHHHHhhccCCc---------- Confidence 2344444444344 3444444321100 00010 111111111 3789999999999999997 Q ss_pred cCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccc-eeEcCeEEEecCccchhc-c-cCC---- Q lcl|NC_019514. 
236 IDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEI-GTVDQFRLVVVPEMLHWA-G-AGA---- 308 (399) Q Consensus 236 ~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEI-G~i~~vRfV~~~~~~~~~-~-aGa---- 308 (399) .-+++++.|+....|.. +..+..+.+-+..+.+-+|+| |++.+|++.++...-.-. + .|+ T Consensus 164 -------~~R~~Vv~p~~~a~Ll~------~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~ 230 (423) T protein:vir:10 164 -------GENYAVMDPWAAQRLAD------AQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTV 230 (423) T ss_pred -------CCCEEEeCHHHHHHHhh------hhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccccceeee Confidence 22788999999988732 123444456667777888877 999999999988775332 1 111 Q ss_pred ---------Ccc---------------CCccc------------------------------------------cccCcc Q lcl|NC_019514. 309 ---------TVG---------------TNPGY------------------------------------------RETNGK 322 (399) Q Consensus 309 ---------~~~---------------~~~~~------------------------------------------~~t~~~ 322 (399) ++. .++.+ ...+.. T Consensus 231 ~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~t 310 (423) T protein:vir:10 231 KGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVT 310 (423) T ss_pred eeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCceE Confidence 110 00000 011122 Q ss_pred ceEEEEE------------------------------------EEcccceeeeccccCCCCccceEEEecCCCCCCCCCC Q lcl|NC_019514. 323 YDIYPML------------------------------------CVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRND 366 (399) Q Consensus 323 ~DVyp~l------------------------------------V~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~D 366 (399) +.+||.+ ++.++||+.. ..++. +||... 
-..+ T Consensus 311 v~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~----------~~pl~-~~~~~~-~~~~ 378 (423) T protein:vir:10 311 VKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLG----------TIPLP-KLHSID-SAVA 378 (423) T ss_pred EEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEE----------EEccc-CCCccc-eeec Confidence 3333322 3333333221 00010 111100 0011 Q ss_pred cc--------------chhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 367 PY--------------GEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 367 Pl--------------gQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) ++ ..--.+.|=.+|++..|++||..|+= +.| T Consensus 379 ~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~-g~~ 423 (423) T protein:vir:10 379 TYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFF-GNP 423 (423) T ss_pred ccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEE-ecC Confidence 11 11111234466899999999887763 444 No 100 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.49 E-value=3.6e-08 Score=61.42 Aligned_cols=298 Identities=17% Similarity=0.111 Sum_probs=163.7 Q ss_pred CCcCCeeecCCCC--cccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNT--TPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~~~--t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) ...+-...++... .+++..+.+-|+ .+..+.++...+...+.+++...+++.+ ++++.+..+. 
T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~vp~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~-------- 170 (413) T protein:vir:81 106 VAPRVKAASDPASTATLTDEFQGGYGT----TWNRNIIYRRREKLVVADLMDNLTMTNT---TIKYLMEKAN-------- 170 (413) T ss_pred hhhHHHhhhhhhhhcccccccccccch----hhHHHHHHHHhhhhhHHhhcceeeccCC---ceeEEEeccc-------- Confidence 0000011122211 122222332233 3467778888889999999998888754 3444444322 Q ss_pred CCCCCCCceeccCcccccccccccccccccccccccccccccc-ceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVG-FSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~-~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) ........|..||-. ..... .++..++...++|+.++.+|+++++ +++ .|...+ T Consensus 171 ~~~~~~a~~v~Eg~~-----------------------~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~-~l~~~i 225 (413) T protein:vir:81 171 RVVEGGFKTVAEGGK-----------------------KPYMRFADFDIVTESLSKIAGLTKITDEMIE-DYD-FLVSYI 225 (413) T ss_pred cccccccceecCccc-----------------------ccccCcccceeeEeeeeeEEEeehhhHHHHH-HHH-HHHHHH Confidence 001122345555411 11111 1345678889999999999999776 555 488888 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEEe-cCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIVY-TGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~y-ag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) ...|.+..+...+. .+++|.+.-.- .|-.+.....+. ......-.++++.++...+..++... 
T Consensus 226 ~~~la~~~~~~~d~----~~l~G~G~~~~~~Gi~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~----------- 289 (413) T protein:vir:81 226 NARLLEELAIEEER----QLLLGDGTGNNLTGLLKRDGIQTL-AVSNKDELADSIYKAMTNISLATPFQ----------- 289 (413) T ss_pred HHHHHHHHHHHHHH----HHhccCCCCCcccccccccccccc-cccccchhHHHHHHHHHHhhhhccCC----------- Confidence 88888776655443 45666432110 011100000011 11112234666666665544332211 Q ss_pred CccccCceeEEEeCCCchHHHHHhhccCCCccceehhh--cCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQ--YADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~--Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) ++ ..++||.+...|+.|+|-.+.|-|.+..+ +++. ...-.+++-|.+++.++.+.. + T Consensus 290 ------~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~---~~~~~~~l~G~pv~~s~~~~~----------~- 348 (413) T protein:vir:81 290 ------AD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSG---GIMLDPAPWGLRTVQSQVVPV----------G- 348 (413) T ss_pred ------Cc-EEEEcHHHHHHHHHhhccCCceecccccccccccc---ccccCceecceeeEEcCCCCc----------c- Confidence 11 24789999999999998777777765422 2222 222346788999998887521 1 Q ss_pred cccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_019514. 315 GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLAL 392 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) .++||.-+.+...+...+ +..-+ .+-.+++-+++.++|+ +++.+.+.+++-+++ T Consensus 349 -------------~~~~gd~~~~~~~~~~~~----~~v~~-------~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 404 (413) T protein:vir:81 349 -------------KPVVGAFRSAASVLRKGG----VRIDS-------TNTNVDDFENNLITVRAEERVGLMVTFPEAIVQ 404 (413) T ss_pred -------------cEEEEecccEEEEEEecc----eEEEE-------eccccchhhcCcEEEEEEEeeccEEecccceEE Confidence 145676554433333322 22111 1223456778888888 578999999999999 Q ss_pred EEEeccC Q lcl|NC_019514. 
393 VKTVAPL 399 (399) Q Consensus 393 ie~~a~~ 399 (399) ++.+.++ T Consensus 405 l~~~~~~ 411 (413) T protein:vir:81 405 LDVAEVV 411 (413) T ss_pred EEecCCC Confidence 9988888 No 101 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=98.48 E-value=6.8e-08 Score=59.87 Aligned_cols=299 Identities=12% Similarity=0.038 Sum_probs=134.2 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc----ccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV----VSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~----~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+ |.+ .| .-|| -|.+++|..-++.||+.++... +....++|.||++|+-.++...+.. T Consensus 1 Ma------N~l----lT----~~p~----iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~ 62 (423) T protein:vir:10 1 MP------NNL----DS----NVSQ----IVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTP 62 (423) T ss_pred Cc------cch----hh----hhHH----HHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccC Confidence 11 211 11 1133 4788999999999999888753 2224568999999988665221111 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSH 156 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~ 156 (399) ... 
+.+.+.+.++...+.| ..+=+|| +-++++|+=...+.+ + T Consensus 63 ~~~---------------~~~~~~~dl~e~~v~l-----------------~id~~k~-va~~v~d~E~~~~i~-----~ 104 (423) T protein:vir:10 63 TGD---------------ISGQNKNNLISGKATG-----------------RVGNYIT-VAVEYQQLEEAIKLN-----Q 104 (423) T ss_pred Ccc---------------ccccccCccccceeEE-----------------Eeeceee-eeeeechHHHhcChh-----h Confidence 100 0111111111111112 1222233 345566653222222 1 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMI 236 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~ 236 (399) +.+.|..++..+. +.+-.+|++-....-+. ...+. ++... .++++..+.+.|.++++|+ T Consensus 105 ~~~~l~~A~~aLA-~~vd~~ia~~~~~~~~~------~~gt~-~t~~~--a~~~i~~a~~~Ld~~~vP~----------- 163 (423) T protein:vir:10 105 LEEILAPVRQRIV-TDLETELAHFMMNNGAL------SLGSP-NTPIT--KWSDVAQTASFLKDLGVNE----------- 163 (423) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhhcccc------ccccC-Ccccc--hHHHHHHHHHHHHhccCCc----------- Confidence 3344444443343 44444444321111111 11111 11111 4899999999999999997 Q ss_pred CccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccc-eeEcCeEEEecCccchhcccCCCcc---- Q lcl|NC_019514. 237 DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEI-GTVDQFRLVVVPEMLHWAGAGATVG---- 311 (399) Q Consensus 237 ~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEI-G~i~~vRfV~~~~~~~~~~aGa~~~---- 311 (399) .-+++++.|+....|.. 
+..+....+-+..+.+-+|+| |++.+|++.++.+.-.-....+..+ T Consensus 164 ------~~R~~Vv~p~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~ 231 (423) T protein:vir:10 164 ------GENYAVMDPWSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVK 231 (423) T ss_pred ------CCCEEEeChHHHHHHhc------cccceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeee Confidence 22788999999877742 233444445566677888877 9999999999988765422111100 Q ss_pred CCccc------ccc-------CccceEEEEEEEcccceeeeccccC-----------CCCccceEEEecCCCCCCCC-CC Q lcl|NC_019514. 312 TNPGY------RET-------NGKYDIYPMLCVGAESFTTIGFQTD-----------GKTLKFKVTTKMPGEATADR-ND 366 (399) Q Consensus 312 ~~~~~------~~t-------~~~~DVyp~lV~G~~Afg~v~l~g~-----------g~~~~~~~ivk~pG~~~ad~-~D 366 (399) +.+.. +.+ ......|..|..|. .|...|++.= .-+..++-+|..-. +++. +| T Consensus 232 ~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD-~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~--~~~~~g~ 308 (423) T protein:vir:10 232 TQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGD-QVKFTNTYWLQQQTKQALYNGATPISFTATVTADA--NSDSGGD 308 (423) T ss_pred ecceeccccccccceeeeeeeeccccccCceeecc-eEEecceeeecccccccccccccCcceEEEEEeee--eeccCCc Confidence 01111 000 01234567777776 5555554331 00011122222110 0000 00 Q ss_pred ccchhhHHHHHHHHHHhh---ccccceE---EEEEeccC Q lcl|NC_019514. 367 PYGEMGFSSIKWYYGTLI---LRPERLA---LVKTVAPL 399 (399) Q Consensus 367 PlgQrg~~gwK~~~~~~i---Ln~~~m~---~ie~~a~~ 399 (399) ...|.+ .+.| -+..+-. 
.....+.+ T Consensus 309 -------~tv~i~-p~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:10 309 -------VTVTLS-GVPIYDTTNPQYNSVSRQVEAGDAV 339 (423) T ss_pred -------eeeecc-CccccccCCcccccccccccCCcee Confidence 000000 0000 0000000 00000001 No 102 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.44 E-value=6.3e-07 Score=54.57 Aligned_cols=314 Identities=11% Similarity=0.022 Sum_probs=162.5 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |.|=|+-+.+.-.....--.+...-|..+.++.=+-.-+|..+=..+.+ ..|++.+|.|-.-. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti--~~GkS~qf~~iG~~--------------- 63 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGET--------------- 63 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEEEEEeee--------------- Confidence 4444433333332222111222223556666553334445555556654 48999999887442 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH-HHHHHHHHHHHh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE-LFSHISTELMNG 164 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~-l~~~~~~~lg~~ 164 (399) -..+..+|..+|-..+......++-.+- .| +-.+-+.+.+...+=. |-.++++++|+. 
T Consensus 64 --~a~y~~~G~~ldg~~~~~~k~~ItID~l-----------------L~--a~~~V~diDeaq~~yD~vRse~s~e~G~A 122 (402) T protein:vir:97 64 --ELQVLAPGQSPNATPTQADKNQLVIDTT-----------------VI--ARNTVAHIHDVQGDIDSLKPKLAMNQAKQ 122 (402) T ss_pred --EEeeeccccccCCCCcccccEEEEeCce-----------------ee--chhhhhhHHHHHhcccchhHHHHHHHHHH Confidence 1233334444543333333333331111 11 1111222222222222 556667777777 Q ss_pred hhHHHHHHHHHHHHh-cCCeEE----ecCCCcccccccc-cccCCceecHHH----HHHHHHHHHhccCccccceecccc Q lcl|NC_019514. 165 AVQLTEAVLQKDLLA-GAGTIV----YTGAATQDSEITG-EGATPSVVDYDD----LMRLSITLDENRTPKQTKVITGSR 234 (399) Q Consensus 165 a~~~~e~~l~~~~la-g~~~v~----yag~ats~~~~t~-~~~~~~~vt~~~----lr~a~~~L~~nrap~~t~~i~~s~ 234 (399) -++.++..+.+.+++ +..... ..++..-...+.. .+..+...+... +..+...|.++..|. T Consensus 123 LA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~--------- 193 (402) T protein:vir:97 123 LKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI--------- 193 (402) T ss_pred HHHHHHHHHHHHHHHhhccccccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCc--------- Confidence 666665444444433 321110 1111100000000 001112334444 446777888888885 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcC--CccccccccceeEcCeEEEecCccchhcccCCCccC Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYA--DAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGT 312 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya--~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~ 312 (399) .-+++++.|..-.-|.+ ++.|++. +|+ ....+.+|+|+++.|||+++++++--..+.+..... 
T Consensus 194 --------~dRv~vv~P~~y~~Ll~------~~rl~n~-d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~l 258 (402) T protein:vir:97 194 --------SDVAIMMPWKFFNALRD------ADRIVDK-TYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) T ss_pred --------cccEEEeChHHHHHHhh------cccccch-hhccccCCccccceeEEEeceEEEecCcccccccccccccc Confidence 23899999999988865 4778887 663 555688999999999999999997432211111000 Q ss_pred CccccccCccceE------EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCC-CCCccchhhHHHHHHHHHHhhc Q lcl|NC_019514. 313 NPGYRETNGKYDI------YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATAD-RNDPYGEMGFSSIKWYYGTLIL 385 (399) Q Consensus 313 ~~~~~~t~~~~DV------yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad-~~DPlgQrg~~gwK~~~~~~iL 385 (399) ....+++.+|| -..++|=..|-+++.+.. + +.+ --|+=-|..++=-|+.|+...+ T Consensus 259 --s~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~---------v-------T~~~~~d~r~~~~~id~~~a~G~g~~ 320 (402) T protein:vir:97 259 --SNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE---------V-------TGDIFYEKKEKTYYIDTFMAEGAIPD 320 (402) T ss_pred --ccCCCCccCCcCcccceeEEEEEecceEEEEEeec---------c-------ccchhhchhHHHHHHHHHHHhCCccc Confidence 11122344442 134555556666653222 1 111 1277778889999999999999 Q ss_pred cccceEEEEEec--cC Q lcl|NC_019514. 386 RPERLALVKTVA--PL 399 (399) Q Consensus 386 n~~~m~~ie~~a--~~ 399 (399) |++.-..+++.- |- T Consensus 321 RPeaa~vv~~~~~~t~ 336 (402) T protein:vir:97 321 RWEAVSVVTTKRDATT 336 (402) T ss_pred CccceEEEEEeccccc Confidence 999998886643 11 No 103 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.41 E-value=8.9e-08 Score=59.23 Aligned_cols=296 Identities=14% Similarity=0.118 Sum_probs=133.4 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccccc----ccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVS----MPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~----mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+ |.. .| .-|| -|.+++|+.-+.+|||.++....- .-++.|.||++|+-.++...+.. T Consensus 1 MA------N~l----lT----~iP~----iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~ 62 (423) T protein:vir:35 1 MA------NNL----ES----NISQ----IVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTE 62 (423) T ss_pred Cc------cch----hh----hhHH----HHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeeccc Confidence 33 222 11 1133 478999999999999999865321 12477999999988665221110 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeee-ecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQK-FGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~q-YG~~~e~Td~~~d~~~D~~l~~ 155 (399) . + +.+.|..++++ ...++..|-| ..+-.+++|+=.-.+.++ + T Consensus 63 ~-------------~-------~~~~~~~~~~~--------------e~~v~l~id~~k~~a~~v~d~e~~l~i~~-~-- 105 (423) T protein:vir:35 63 T-------------G-------DITGKDKNGLF--------------SAKATGKVGKYITVAVEWTQIEEALKLNQ-L-- 105 (423) T ss_pred C-------------c-------CCCCccccccc--------------cceeeEEeccceeccceeCHHHHHhhHHH-H-- Confidence 0 0 00111111111 1112223322 224456676533222222 3 Q ss_pred HHHHHHHHhhhHHHH---HHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_019514. 156 HISTELMNGAVQLTE---AVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITG 232 (399) Q Consensus 156 ~~~~~lg~~a~~~~e---~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~ 232 (399) .+.+..++.++.. ..+-..+.+++.+. .|. .++. 
.-.++++..+.+.|.++++|+ T Consensus 106 --~~~l~~a~~ala~~vd~~l~~~l~~~a~~~--vgt---------~~t~--~~~~~~i~~a~~~Ld~~~vP~------- 163 (423) T protein:vir:35 106 --DQILSPIHERMVTDLETELAHFMMNNGALS--LGS---------PNTA--IKKWADVAQTASFIKDIGIKT------- 163 (423) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHhhccccc--ccc---------ccCC--cchHHHHHHHHHHHHHhcCCc------- Confidence 2334444433332 22323233333211 121 1111 124899999999999999997 Q ss_pred ccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccc-eeEcCeEEEecCccchhc-ccC-CC Q lcl|NC_019514. 233 SRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEI-GTVDQFRLVVVPEMLHWA-GAG-AT 309 (399) Q Consensus 233 s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEI-G~i~~vRfV~~~~~~~~~-~aG-a~ 309 (399) .-+++++.|+....|.+ .+..|... +-+..+.+-+|+| |++.+|++.++.+.-.-. +.. +. T Consensus 164 ----------~~R~~Vv~p~~~a~Ll~-----~~~~~~~~-~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~ 227 (423) T protein:vir:35 164 ----------GENYAIMDPWSAQRLAD-----AQSGLHAA-DQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGA 227 (423) T ss_pred ----------CCCEEEeCHHHHHHHhc-----cccceecc-ccchhHHHhhccceeeecceEEEEcCCCccccccccccc Confidence 22899999999877742 12333333 4445566788876 999999999988876432 221 11 Q ss_pred ccCCc-------ccccc--------CccceEEEEEEEcccceeeeccccC-----------CCCccceEEEecCCCCCCC Q lcl|NC_019514. 310 VGTNP-------GYRET--------NGKYDIYPMLCVGAESFTTIGFQTD-----------GKTLKFKVTTKMPGEATAD 363 (399) Q Consensus 310 ~~~~~-------~~~~t--------~~~~DVyp~lV~G~~Afg~v~l~g~-----------g~~~~~~~ivk~pG~~~ad 363 (399) +..+. ....+ +-....|..|..|. .|+.-|++.= .-+..++-+|..-....+. T Consensus 228 ~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD-~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 228 ITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGD-QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred eeeccccccccccccccccceeeeeeeeeccCCcEEecc-eEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 11000 00000 01123466777776 5655554431 0112233333211000000 Q ss_pred CCCccchhhHHHHHHHHHHhhc---cccceE---EEEEeccC Q lcl|NC_019514. 
364 RNDPYGEMGFSSIKWYYGTLIL---RPERLA---LVKTVAPL 399 (399) Q Consensus 364 ~~DPlgQrg~~gwK~~~~~~iL---n~~~m~---~ie~~a~~ 399 (399) ++.= .|.+ .+.+. +..+-. .....+.+ T Consensus 307 g~~~--------v~i~-p~~~~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:35 307 GDVT--------VKLS-GVPIYDEKNSQYNAVDAKVKAGDAV 339 (423) T ss_pred Ccee--------EEcc-ccccccCCCcccccccccccCCcee Confidence 0000 0000 00010 111100 00000011 No 104 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.40 E-value=4e-08 Score=61.14 Aligned_cols=285 Identities=13% Similarity=0.063 Sum_probs=158.3 Q ss_pred CCcCCeee--cCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLY--NDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~--n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) ........ +.-.+..++..+.+.|. .+..+.+....+...+.+++...+++.+ ++++.+... T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~g~~~~~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~--------- 164 (390) T protein:vir:10 101 RATMNIKAALNTASTDAAGSAGALTTP----NRLPGFITQPDARLTVRDLIGSGRTDSA---LIEYVQETG--------- 164 (390) T ss_pred hhhhHHHHHHHhhhcccccccccccch----hHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEec--------- Confidence 00000000 11111222222333332 1235666667777788888888887644 344444321 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .+....|..||- ......++...++..+++|+.++.+|+++++ ++. .+...+. 
T Consensus 165 --~~~~a~~v~Eg~-----------------------~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~i~ 217 (390) T protein:vir:10 165 --FVNNAAIVAEGA-----------------------LKPESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QLASYMN 217 (390) T ss_pred --CCcceeeecCCc-----------------------cccccccceeEEEEeeEEEEEeehhhHHHHH-hHH-HHHHHHH Confidence 122334555551 1222334456688899999999999999776 454 4777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEE-ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIV-YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMID 237 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~-yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~ 237 (399) ..|.+..+...+ ..+++|.+.-. -.|--+.........++....+++++..+.-.|+....+. T Consensus 218 ~~l~~~~~~~~~----~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~------------ 281 (390) T protein:vir:10 218 NRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPA------------ 281 (390) T ss_pred HHHHHHHHHHHH----HHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCC------------ Confidence 777776655443 34556543211 1111111011111122334456788888887777654432 Q ss_pred ccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccc Q lcl|NC_019514. 238 TRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYR 317 (399) Q Consensus 238 T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~ 317 (399) -+.++||.....|+.|+|.-+.+-|.+. ..+.-+++-|++++.++.+-. + T Consensus 282 -------~~~v~n~~~~~~L~~lkd~~g~~l~~~~---------~~~~~~~l~G~pv~~~~~~p~----------~---- 331 (390) T protein:vir:10 282 -------SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWGLPVVATQAMAP----------G---- 331 (390) T ss_pred -------CEEEEcHHHHHHHHHhhcCCCceeecCC---------cCcCCceecceeeEEcCCCCC----------C---- Confidence 2346899999999999987766666442 233456788999999887620 1 Q ss_pred ccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEE Q lcl|NC_019514. 
318 ETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKT 395 (399) Q Consensus 318 ~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~ 395 (399) .+++|.-+.+..-....+ +.+-+- ..+.+-+++.+.++ .++.+.+++++-++.+.. T Consensus 332 ----------~~~~gdf~~~~~~~~~~~----~~i~~~--------~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~ 389 (390) T protein:vir:10 332 ----------EFLVGAFDLAAQIFDQWD----ARVEIG--------YVNDDFQRNMVTVLAEERLALVVYRPEALISGSF 389 (390) T ss_pred ----------cEEEEeccceEEEEEecc----eEEEEe--------ecccccccCcEEEEEEEeeccEEeccccEEEEEe Confidence 135665433222111111 111111 11234466777766 578999999999999999 Q ss_pred e Q lcl|NC_019514. 396 V 396 (399) Q Consensus 396 ~ 396 (399) | T Consensus 390 a 390 (390) T protein:vir:10 390 A 390 (390) T ss_pred C Confidence 9 No 105 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.40 E-value=4.7e-07 Score=55.26 Aligned_cols=314 Identities=12% Similarity=0.034 Sum_probs=160.2 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |.+=|+.+.+-...+..---+...-|..+.+..=+-.-+|..+=.++. -..||+..|-|-.-. 
T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRt--I~~gkS~qf~~lG~s--------------- 63 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGET--------------- 63 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeee--------------- Confidence 444343343333332322223333356666665444445555556664 577889888887332 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGA 165 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a 165 (399) -..+..+|.++|...+......+|-.+-.+=+. +++.|-|-..++++ +-.++++++|+.= T Consensus 64 --~a~y~~pG~~ldg~~~~~dk~~ItIDtLL~a~~---------------~V~dlDd~q~~yD~---vRse~s~e~G~AL 123 (400) T protein:vir:10 64 --ELQVLAPGQSPAATSTQADKNQLVIDATVIARN---------------TVAHLHDVQGDIDS---LKPKLATNQAKQL 123 (400) T ss_pred --EEeeecCCCCcCCCCcccCcEEEEeCceeeecc---------------hhhhHHHHhhcccc---ccHHHHHHHHHHH Confidence 233444455555433444444343222211111 12223332233332 3344455555555 Q ss_pred hHHHHH-HHHHHHHhcCCeEEe----cCCCc--ccccccccccCCceecHHHH----HHHHHHHHhccCccccceecccc Q lcl|NC_019514. 166 VQLTEA-VLQKDLLAGAGTIVY----TGAAT--QDSEITGEGATPSVVDYDDL----MRLSITLDENRTPKQTKVITGSR 234 (399) Q Consensus 166 ~~~~e~-~l~~~~lag~~~v~y----ag~at--s~~~~t~~~~~~~~vt~~~l----r~a~~~L~~nrap~~t~~i~~s~ 234 (399) ++.++. ++|-.+++++...-. .|+.. ....++ .++....++...| +.|...|.++..|. 
T Consensus 124 A~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~~v~-~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~--------- 193 (400) T protein:vir:10 124 KKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSVNVE-VNEGEALVNPQYVMAAVEFALEQQLEQEVDI--------- 193 (400) T ss_pred HHHHHHHHHHHHHHhcccccccccccCCccccccceeec-ccccccccCHHHHHHHHHHHHHHHHhcCCCc--------- Confidence 555533 445555665422111 11110 011111 1122233344444 45666677766652 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcC--CccccccccceeEcCeEEEecCccchhcccCCCccC Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYA--DAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGT 312 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya--~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~ 312 (399) + -++.++.|+--.-|++ .+.+++. .|+ .....-.|+|.++.|||+|+++++--..+....... T Consensus 194 -------~-d~vvl~pp~~Ys~Ll~------~dkLvnr-df~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~l 258 (400) T protein:vir:10 194 -------S-DVAILMPWRYFNVLRD------ADRIVDK-SYTISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLL 258 (400) T ss_pred -------c-ceEEEcCHHHHHHHHh------CCcccch-hccccCCCccccceEEEEeceEEEeeCcCCcccCccccccc Confidence 1 1677777766666654 2345555 354 335568899999999999999997432211111000 Q ss_pred CccccccCccceE------EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCC-CCCccchhhHHHHHHHHHHhhc Q lcl|NC_019514. 313 NPGYRETNGKYDI------YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATAD-RNDPYGEMGFSSIKWYYGTLIL 385 (399) Q Consensus 313 ~~~~~~t~~~~DV------yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad-~~DPlgQrg~~gwK~~~~~~iL 385 (399) ....+++.||| -..|+|=.+|-+++.+.. + +.+ --|+=-|..++=-|+.|+...+ T Consensus 259 --S~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~---------l-------t~~~~~d~r~~~~~id~~~a~G~g~~ 320 (400) T protein:vir:10 259 --SNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSID---------V-------IGDIFYEKKEKTYYIDTFMSEGAIPD 320 (400) T ss_pred --ccCCCCccCCccccccceeEEEEehhheEEEEeec---------c-------ccccccchhhHHHHHHHHHHhCCccc Confidence 00122333432 245666666666653222 1 111 2378889999999999999999 Q ss_pred cccceEEEEEeccC Q lcl|NC_019514. 
386 RPERLALVKTVAPL 399 (399) Q Consensus 386 n~~~m~~ie~~a~~ 399 (399) |++.-..++++=+- T Consensus 321 RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 321 RWEAVSVVTTKRQS 334 (400) T ss_pred chhheEEEEecCCc Confidence 99999999998665 No 106 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.39 E-value=3.6e-08 Score=61.40 Aligned_cols=285 Identities=11% Similarity=0.070 Sum_probs=157.8 Q ss_pred CC--cCCe--eecCCCCcccccccccccceehhhhhHHHHHHHH-HHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MA--SKGM--LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEAR-KDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~--~~~~--~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~-p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) +. .|.. .--....+.+...+-+.|.+ ..+.+++.. ...++.+++...++. .|..+++.+..- T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~-----~~~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~p~~~~------ 163 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTL-----YGQLIAQAVERSAIMRGGATTFTTS--DANPLDFTVITG------ 163 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCccccccc-----hHHHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcC------ Confidence 10 0100 00001122222222333331 245666655 444466788776653 344444444311 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) .+...|..|+- .+.....+...++.+.++++.++.+|+++++ +++.++.. 
T Consensus 164 ------~~~a~wv~E~~-----------------------~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 213 (390) T protein:vir:62 164 ------RSSASIVGETA-----------------------EIPESYPATAQRSMGGFKYGFASVVSYEFAT-DQVLDLVG 213 (390) T ss_pred ------Ccceeeecccc-----------------------cccccccceeeeEeeeeeEEeehHHHHHHHh-hhhHHHHH Confidence 12234566542 2222334455688899999999999999886 45545777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeE--EecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTI--VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v--~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) .+...+.+.-+...+. .+++|.+.- +.+..+...+.. ..+..+.+++++|.++...|+..-... T Consensus 214 ~i~~~l~~~i~~~~d~----~~l~G~G~p~Gi~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~l~~~~~~~-------- 279 (390) T protein:vir:62 214 FLVSDAGPAIGDAMGR----HFITGTGQPRGILTDASPATATF--LATDTDSKVSDALIDLFHEVPSAYRAN-------- 279 (390) T ss_pred HHHHHHHHHHHHHHHh----hhhccCCccccccccccccccce--ecccccccchHHHHHHHHhhhhhhhcC-------- Confidence 7777777665544433 466665420 111111111111 113345689999988887775432211 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN 313 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~ 313 (399) + +-++|+.+...|+.|+|--+.+-|.|-- -.|..+.+-|..++.++.+- ++ T Consensus 280 ---------a--~~vmn~~~~~~L~~lkd~~g~~l~~~~~--------~~g~~~~l~G~Pv~~~~~~p----------~~ 330 (390) T protein:vir:62 280 ---------A--KYVVNDLRAAQMRKLKDANGQYLWQSGL--------TVGAPSLFNGKVVETDDGMP----------AD 330 (390) T ss_pred ---------C--EEEEchHHHHHHHHhhccCCCeeecCCc--------CCCccceecccceEEecCCC----------Cc Confidence 1 3477999999999998866666665432 23444567777777776541 11 Q ss_pred ccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHH--HHHhhccccceE Q lcl|NC_019514. 
314 PGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWY--YGTLILRPERLA 391 (399) Q Consensus 314 ~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~--~~~~iLn~~~m~ 391 (399) .++||.=++..++..++ + .+. ...|++-+++.+.++++ +.+.+++++-++ T Consensus 331 --------------~i~~gd~s~~~i~~~~~-----~--~v~-------~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~ 382 (390) T protein:vir:62 331 --------------KILFADLSKYRVRFAGS-----L--RVD-------RSVDAKFSTDQIVYRFLQRADGLLVDARGAK 382 (390) T ss_pred --------------cEEEeeccceeEEeecc-----e--EEE-------eeccccccCCcEEEEEEEEeCcEeechhheE Confidence 14577755555555442 1 122 12367777777776644 899999999998 Q ss_pred EEEEeccC Q lcl|NC_019514. 392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) .|+..+.- T Consensus 383 ~l~~~~~a 390 (390) T protein:vir:62 383 VLTVTPGA 390 (390) T ss_pred EEEeecCC Confidence 88766555 No 107 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.37 E-value=8.5e-08 Score=59.34 Aligned_cols=311 Identities=10% Similarity=0.091 Sum_probs=148.0 Q ss_pred CCcCCeeecCC-C-Ccccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDP-N-TTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~-~-~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) +.......... . +++++..+. +-|+ +...+.++..++..++.+++...+||-..|. +++-+... 
T Consensus 144 ~~~~~~~~~~~~~~~~~~~~gg~lv~~~----~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~-~~ip~~~~-------- 210 (477) T protein:vir:84 144 IRKIAKVGEEYRDLDRNGGTGGYAVPPL----WMMNRFIELARAGRTYANLCPTEPLPGGTSS-INIPKILT-------- 210 (477) T ss_pred HHHHHHhhhhhccccccCCCcceeeccc----hhHHHHHHHhhhcchHHHhhceeeecCCcce-eEEEEEec-------- Confidence 00000000011 1 111222222 2233 2234455557788888889999999987765 22222211 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) -...+.|..||-. ++ .+......++...++.+.++++.|+.+|+++++ ++.+.+...+ T Consensus 211 ---~~~~a~~~~Eg~~---------~~---------~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i 268 (477) T protein:vir:84 211 ---GTSTAIQAADNAA---------LT---------APSAHEVDLTDGFVQANVKTIAGQQGIAIQLLD-QAAVSVDEFV 268 (477) T ss_pred ---CcceeeeeccCcc---------cc---------cccccccccceeeEEEeeeeEEeeeHHHHHHHh-ccchhHHHHH Confidence 0112233444311 00 011122334566788999999999999999665 4444588888 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCe-------EEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGT-------IVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~-------v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) ...|.+..+...+ .-+++|.++ .-++|.. .++. .+...+...+......|........+. 
T Consensus 269 ~~~l~~~~~~~~d----~~~l~G~Gt~~~p~Gi~~~~~~~----~~~~---~~~~~t~~~~~~~~~~i~~~~~~~~~~-- 335 (477) T protein:vir:84 269 FRDLAADYANKLN----VQVISGTGSNNQVVGVRATAGIT----QVTA---TSAGSALEKHQIIYQKIADAIQRVHTS-- 335 (477) T ss_pred HHHHHHHHHHHHH----HHHhccCCCCCccceeeeccccc----cccc---cccccchhhHHHHHHHHHHHHhhcccc-- Confidence 8888877665554 345666542 1122110 1111 111223333322222221111100000 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccc-----cccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGT-----ILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~-----i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) ..-..-+.++||.....|+.|+|--+.|-|.|.....+... +.++-.|.+-|+++|+++.|-. + T Consensus 336 ---------~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~--~ 404 (477) T protein:vir:84 336 ---------RFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPT--T 404 (477) T ss_pred ---------ccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccc--c Confidence 00112355889999999999999888888876543333322 3344557899999999987621 1 Q ss_pred cCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 306 AGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 306 aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) .|+. .| ...++||.-+...+.-.| +.. ...+ +.+.-.+.+.++ .++.+. T Consensus 405 ~~~~-------------~d-~~~i~~gd~~~~~i~~~~------~~~-~~~~--------~~~~~~~~~~~~v~~~~~~~ 455 (477) T protein:vir:84 405 LGTG-------------TD-QDVIHVLRASDLALFESS------VRM-RALQ--------ETRAENLSVLLQVYGYLAFT 455 (477) T ss_pred cccc-------------CC-cceEEEEEeceEEEEeec------eeE-Eecc--------ccccccceeeeeehhhhhhh Confidence 1111 11 123566766655553221 111 1111 111111222332 234554 Q ss_pred hcc-ccceEEEEEeccC Q lcl|NC_019514. 
384 ILR-PERLALVKTVAPL 399 (399) Q Consensus 384 iLn-~~~m~~ie~~a~~ 399 (399) ..+ ++-.++|=-.+.- T Consensus 456 ~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 456 AARFPQSVVEIGGTALT 472 (477) T ss_pred hhccccceEEeeccccc Confidence 444 6666555333322 No 108 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.37 E-value=9.4e-08 Score=59.11 Aligned_cols=286 Identities=12% Similarity=0.039 Sum_probs=159.8 Q ss_pred CCcCCee------------ecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 1 MASKGML------------YNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~~~~~------------~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) +..+... -+.....+++..+.+-|+ .+..+.++...+..++.+++...+||.+.|+-. +.+.. T Consensus 69 ~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~-~~~~~ 143 (371) T protein:vir:81 69 VQVKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQ----DIQTRINELRESKDALQNLITVEPVTTLSGSRV-FKKRS 143 (371) T ss_pred hhhHHHHHHHHHHHHHHHHHHhhccCCCccCceeecH----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEE-EEeec Confidence 0000000 011111112222222333 235666777889999999999999987765532 22221 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) . .+...|..||... 
......+...++.+.++++.++.+|+++++ + T Consensus 144 ~------------~~~a~~v~Eg~~~----------------------~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-d 188 (371) T protein:vir:81 144 Q------------QTGFVEVAEGAAI----------------------GEKATPQFTLLQYQVKKYAGFFRVTNELLN-D 188 (371) T ss_pred C------------Ccceeeecccccc----------------------ccccccceeeEEeeeeEEEEeehhhHHHHh-h Confidence 1 1233456665221 111224556788899999999999999765 4 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCcccc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQT 227 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t 227 (399) ++.+|...+...|.+..+...+ ..+++|.+.. +.....+++++..+.. .|+..-... T Consensus 189 s~~~l~~~i~~~l~~a~~~~~~----~~i~~g~g~~----------------~~~~~~~~~~i~~~~~~~l~~~~~~~-- 246 (371) T protein:vir:81 189 STEAIVNTLVRWIGDESRVTRN----GLIINVLNTK----------------AKTAIADLDGLKQIINVQLDPVFRST-- 246 (371) T ss_pred hhHHHHHHHHHHHHHHHHHHHH----HHHHhhcccc----------------cccccccHHHHHHHHHhhcchhhhcC-- Confidence 4556888888888877655443 4456665321 1234457788777653 343321110 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccC Q lcl|NC_019514. 
228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAG 307 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aG 307 (399) + +.++||.....|+.|+|--+.+-|.+- +-.|.-|++-|..++.++.| ++...+ T Consensus 247 ---------------a--~~vmn~~~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~ 300 (371) T protein:vir:81 247 ---------------S--SVIVNQDAFNWLDTLKDQNGQYLLQPS--------ISSPTGRQLLGLPVVIVSNK-VLANRV 300 (371) T ss_pred ---------------C--EEEEcHHHHHHHHHhhccCCCeeeecc--------cCCCCCceecceeEEEeccc-ccCccc Confidence 1 357899999999999887777777542 23456688899999988875 221111 Q ss_pred CCccCCccccccCccceEEEEEEEcccc--eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 308 ATVGTNPGYRETNGKYDIYPMLCVGAES--FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 308 a~~~~~~~~~~t~~~~DVyp~lV~G~~A--fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) .. +. .. ....++||.=+ |....-.+ +.+-+- +-.+.+-+++.+.|+ .++.+. T Consensus 301 ~~-------~~---~~-~~~~i~~Gd~~~~~~~~~~~~------~~i~~~-------~~~~~~f~~~~v~~~~~~r~d~~ 356 (371) T protein:vir:81 301 DG-------GT---GA-QFAPIIVGDLKEAVVMFDRQR------TEIMSS-------NVAMDAFETDATLWRAIERMDVK 356 (371) T ss_pred cc-------cc---cC-CcceEEEEehhceEEEEeecc------eEEEEe-------ccccchhhcCceEEEEEEeeccE Confidence 11 00 01 12336777422 22211111 222111 111233456777777 456889 Q ss_pred hccccceEEEEEecc Q lcl|NC_019514. 
384 ILRPERLALVKTVAP 398 (399) Q Consensus 384 iLn~~~m~~ie~~a~ 398 (399) ++++.-++.++..+- T Consensus 357 ~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 357 MRDDEAFVFGEVQLA 371 (371) T ss_pred EecccceEEEEEecC Confidence 999999998887766 No 109 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.36 E-value=5.8e-08 Score=60.25 Aligned_cols=278 Identities=11% Similarity=-0.010 Sum_probs=151.8 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) ++...+..-.--+++++..+.+ |+ .|....+........+.+++...++.-+ ++++-+..-. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~i-p~----~~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~---------- 159 (379) T protein:vir:10 98 GKSIQVKAVGDMTLPVNLTGAQ-PK----DYNFDVVLNPSQMLNVSDIVGAVSISGG---TYTFVRENGA---------- 159 (379) T ss_pred hhhhhhhhhcccccCCCCcccc-ch----hhhhHHHHhHHhhhhHHhhceeeeccCC---ceEEEEeecC---------- Confidence 2211111111112223333332 22 2355666666777778888887776432 3333332100 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -.++..|..|| +.......+...++..+++|+.++.+|+++++ ++. 
.+...+..+ T Consensus 160 ~~~~~~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-D~~-~l~~~i~~~ 214 (379) T protein:vir:10 160 GEGAIGAQVEG-----------------------ATKGQKDYDISMIDVNTDFIAGFTRYSKKMAN-NLP-FLTSFIPNA 214 (379) T ss_pred CCcccccccCC-----------------------ccccccccceeeeEeeeeeEEeeehhhHHHHh-hHH-HHHHHHHHH Confidence 01222344444 22233345566788999999999999999876 443 466666666 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) |.+..+...+ .. ++.|.+.- |. .+..++....++++|.++.-.|..+..+. T Consensus 215 la~~~~~~~~-~~---~~~g~~~~---~~-------~~~~~~~~~~~~d~i~~~~~~~~~~~~~~--------------- 265 (379) T protein:vir:10 215 LRRDYAKAEN-AA---FNAVLAAN---AT-------ASTEIITNKNKVEMLINEIAKQENLDFPV--------------- 265 (379) T ss_pred HHHHHHHHHH-HH---Hhcccccc---cc-------cccccccCcccHHHHHHHHHhhhhccCCC--------------- Confidence 6665543332 22 22222110 00 01112233446788888876665533221 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehh--hcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVH--QYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~--~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) -+.++||.+...|+-|+|-.+.+-|.|-. +.+.+ -++-|+++|+++.|. +| T Consensus 266 ----~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~--------~~l~G~pvv~s~~~~----ag----------- 318 (379) T protein:vir:10 266 ----TAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGV--------LRINGIPLFRATWLA----AN----------- 318 (379) T ss_pred ----CEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCc--------ceecceeeEecCCCC----CC----------- Confidence 24668999999999999877766665321 12222 256689999987762 11 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEEe Q lcl|NC_019514. 
319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKW--YYGTLILRPERLALVKTV 396 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ie~~ 396 (399) + ++||.-+...+.+.-+. .++.. .+-.| +-++..+.|++ -+++.+++++-++.++.+ T Consensus 319 -----~----~~~gdf~~~~~~~~~~~---~i~~~--------~~~~~-~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~ 377 (379) T protein:vir:10 319 -----K----YYVGDWTRVTKVTTEGL---SLEFS--------EVEGT-NFVKNNITARIEAQVALAVEQPAALIFGDFT 377 (379) T ss_pred -----c----eEEeecccEEEEEEece---EEEEe--------ecccc-cccCCcEEEEEEEEeccEEecCccEEEEEec Confidence 1 35777666555443221 12111 11123 34577788773 678999999999998887 Q ss_pred cc Q lcl|NC_019514. 397 AP 398 (399) Q Consensus 397 a~ 398 (399) += T Consensus 378 ~~ 379 (379) T protein:vir:10 378 AV 379 (379) T ss_pred CC Confidence 76 No 110 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.35 E-value=2.1e-07 Score=57.14 Aligned_cols=305 Identities=12% Similarity=0.033 Sum_probs=150.4 Q ss_pred CCcCCe-eecCCC--CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGM-LYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~-~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |....+ ..+... .+.++..+-+-|+ . +..+.++...+..++.+++. +.+|...|. +++-|.. 
T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~---~-~~~~ii~~l~~~~~l~~~~~-~~~~~~~g~-~~~p~~~--------- 177 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQ---N-IHSEVIELLRDRTIVRKLGA-RSIPLPNGN-MSLPRLA--------- 177 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccch---h-HHHHHHHHHhhhchhhhhcc-eeeecCCcc-eEEEEEe--------- Confidence 111111 001100 1111112222243 1 24566666778888888833 346655555 3333331 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) . .+...|..||- ......++...++.+.++|+.++.+|+++++ ++++++.+.+ T Consensus 178 -~--~~~a~~v~Eg~-----------------------~~~~~~~~f~~i~~~~~k~~~~v~is~ell~-ds~~~l~~~i 230 (428) T protein:vir:10 178 -G--GATASYTGENQ-----------------------DAKVSEARFDDVKLTAKTMIAMVPISNALIG-RAGFNVEQLV 230 (428) T ss_pred -C--CcceeeeccCc-----------------------cccccccceeeEEeeeEEEEEeehhhHHHHh-hhhHHHHHHH Confidence 1 12334565552 2222334455688899999999999999765 5566788888 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeE-EecCCCc---ccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTI-VYTGAAT---QDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v-~yag~at---s~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) .+++.+..+...+. .+++|.++- .--|-.+ ....+. ..++....+++.+......|.......... 
T Consensus 231 ~~~l~~ai~~~~d~----~~l~G~G~~~~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 300 (428) T protein:vir:10 231 LQDILTAISVREDK----AFMRDDGTGDTPIGMKARATQWNRLL-PWAADAAVNLDTIDTYLDSIILMSMDGNSN----- 300 (428) T ss_pred HHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccc-cccccccccHHHHHHHHHHHHHhhhccccc----- Confidence 88888876655543 445665421 0001100 000000 111234455665555444433211110000 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN 313 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~ 313 (399) . ..-+-++|+.+...|+.|+|--+.|-|.+. .-|++-|++++.++.+..-.+.+. T Consensus 301 -------~-~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~------------~~g~l~G~pv~~~~~~p~~~~~~~----- 355 (428) T protein:vir:10 301 -------M-ISSGWGMSNRTYMKLFGLRDGNGNKVYPEM------------AQGMLKGYPIQRTSAIPANLGEGG----- 355 (428) T ss_pred -------c-ccCEEEEcHHHHHHHHHhhccCCceeccCC------------CCCeeeceeeEEeccccccccCCC----- Confidence 0 012346799999999999887777777433 125789999998887532111110 Q ss_pred ccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEec-CCCCCCC-CCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_019514. 314 PGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKM-PGEATAD-RNDPYGEMGFSSIK--WYYGTLILRPER 389 (399) Q Consensus 314 ~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~-pG~~~ad-~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) + . +.++||.=++-.+++.++ +.+-+-. ......+ ..-.+-|+..+.|+ +.+.+.+.+++- T Consensus 356 -------~---~-~~i~~gd~s~~~i~~~~~-----i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a 419 (428) T protein:vir:10 356 -------K---E-SEIYFADFNDVVIGEDGN-----MKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEG 419 (428) T ss_pred -------c---c-ceEEEEecceEEEEEecc-----eEEEeecccccccccccccchhhcchhheeeeeeeCceeeccce Confidence 1 1 224567666666655542 1111110 0000000 01135566777777 445556666666 Q ss_pred eEEEEEecc Q lcl|NC_019514. 
390 LALVKTVAP 398 (399) Q Consensus 390 m~~ie~~a~ 398 (399) .+++.-+.= T Consensus 420 ~~~~t~~~~ 428 (428) T protein:vir:10 420 LVLGTGVLF 428 (428) T ss_pred EEEEeccCC Confidence 655544444 No 111 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.34 E-value=1.2e-07 Score=58.46 Aligned_cols=285 Identities=13% Similarity=0.038 Sum_probs=151.8 Q ss_pred CCcCCe------ee-----cCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 1 MASKGM------LY-----NDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~~~~------~~-----n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) |..+.. .. ...-+..++..+ .+-|+ .+..+.+......-++.+++...+|+-+.|+...++.- T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~- 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ----DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS- 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec- Confidence 111110 00 000111222222 22333 23455555567788888999999999887764433222 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) . .+...|..||-.. ++. 
...+...++.+.++++.++.+|+++++ + T Consensus 159 ~------------~~~a~~v~E~~~~---~~~-------------------~~~~~~~v~l~~~k~~~~~~iS~ell~-d 203 (392) T protein:vir:10 159 D------------MIPFAEITEMGEI---PET-------------------DNPKFSNVQYAVKDRAGILPLSRSLLQ-D 203 (392) T ss_pred C------------Cccceeecccccc---ccc-------------------ccccceeEEeeeeeEEEeehhhHHHHh-h Confidence 1 1223466665221 111 113445677889999999999999765 4 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCcccc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQT 227 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t 227 (399) ++.+|...+...|.+.-+...+ .-+++|.+. ++..+..++++|..+. ..|+..-.+. T Consensus 204 s~~~l~~~i~~~l~~~i~~~~d----~~~~~g~g~----------------~~~~~~~~~d~i~~~~~~~l~~~~~~~-- 261 (392) T protein:vir:10 204 SDQNILKYVTKWLGKKSKVTRN----VLILGVIEK----------------LTKQAIKSLDDIKDVLNVKLDPAISPN-- 261 (392) T ss_pred hHHHHHHHHHHHHHHHHHHHHH----HHHhhcccc----------------ccccCccCHHHHHHHHHHhhhhhhccC-- Confidence 5666888887887766554443 334444422 1223456788887765 3444422211 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCc-cchhccc Q lcl|NC_019514. 228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPE-MLHWAGA 306 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~-~~~~~~a 306 (399) + +-++||.+...|+.|+|--+.|-|.+ .+ -.|.-+++-|.|.|.+.. +.+ ... 
T Consensus 262 ---------------a--~~vm~~~~~~~L~~lkd~~G~~l~~~--~~------~~~~~~tllG~~~v~~~~~~~~-~~~ 315 (392) T protein:vir:10 262 ---------------A--ILLTNQDGFNYLDKLKDKDGKYILQS--DP------TQKNKKLFAGTNPVVVVSNRFL-KSK 315 (392) T ss_pred ---------------C--EEEEcHHHHHHHHHhhccCCCeEeec--Cc------cCCccccccCcccEEEeccccc-CCC Confidence 1 24789999999999988777777754 11 234556788888775432 211 001 Q ss_pred CCCccCCccccccCccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) |. ..+.++ +++|.=+ |-.+.... + +..-+ .+-.+.+-+++.+.++ +++.+. T Consensus 316 ~~-------------~~~~~~-~~~gdfs~~~~i~~~~-~----~~~~~-------~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 316 GT-------------TAKKAP-LIIGDLKEAIVLFKRE-D----MELAS-------TDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred cc-------------cCCceE-EEEEehhceEEEEeec-c----eEEEE-------eccccchhhcCceEEEEEEeeccE Confidence 11 111222 4566422 11222111 1 11111 1122345566666666 557889 Q ss_pred hccccceEEEEEeccC Q lcl|NC_019514. 384 ILRPERLALVKTVAPL 399 (399) Q Consensus 384 iLn~~~m~~ie~~a~~ 399 (399) +++++-++.+.....- T Consensus 370 v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSA 385 (392) T ss_pred EecccceEEEEecccc Confidence 9999999886654333 No 112 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.34 E-value=1.2e-07 Score=58.46 Aligned_cols=285 Identities=13% Similarity=0.038 Sum_probs=151.8 Q ss_pred CCcCCe------ee-----cCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 
1 MASKGM------LY-----NDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~~~~------~~-----n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) |..+.. .. ...-+..++..+ .+-|+ .+..+.+......-++.+++...+|+-+.|+...++.- T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~- 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ----DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS- 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec- Confidence 111110 00 000111222222 22333 23455555567788888999999999887764433222 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) . .+...|..||-.. ++. ...+...++.+.++++.++.+|+++++ + T Consensus 159 ~------------~~~a~~v~E~~~~---~~~-------------------~~~~~~~v~l~~~k~~~~~~iS~ell~-d 203 (392) T protein:vir:10 159 D------------MIPFAEITEMGEI---PET-------------------DNPKFSNVQYAVKDRAGILPLSRSLLQ-D 203 (392) T ss_pred C------------Cccceeecccccc---ccc-------------------ccccceeEEeeeeeEEEeehhhHHHHh-h Confidence 1 1223466665221 111 113445677889999999999999765 4 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCcccc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQT 227 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t 227 (399) ++.+|...+...|.+.-+...+ .-+++|.+. ++..+..++++|..+. ..|+..-.+. 
T Consensus 204 s~~~l~~~i~~~l~~~i~~~~d----~~~~~g~g~----------------~~~~~~~~~d~i~~~~~~~l~~~~~~~-- 261 (392) T protein:vir:10 204 SDQNILKYVTKWLGKKSKVTRN----VLILGVIEK----------------LTKQAIKSLDDIKDVLNVKLDPAISPN-- 261 (392) T ss_pred hHHHHHHHHHHHHHHHHHHHHH----HHHhhcccc----------------ccccCccCHHHHHHHHHHhhhhhhccC-- Confidence 5666888887887766554443 334444422 1223456788887765 3444422211 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCc-cchhccc Q lcl|NC_019514. 228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPE-MLHWAGA 306 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~-~~~~~~a 306 (399) + +-++||.+...|+.|+|--+.|-|.+ .+ -.|.-+++-|.|.|.+.. +.+ ... T Consensus 262 ---------------a--~~vm~~~~~~~L~~lkd~~G~~l~~~--~~------~~~~~~tllG~~~v~~~~~~~~-~~~ 315 (392) T protein:vir:10 262 ---------------A--ILLTNQDGFNYLDKLKDKDGKYILQS--DP------TQKNKKLFAGTNPVVVVSNRFL-KSK 315 (392) T ss_pred ---------------C--EEEEcHHHHHHHHHhhccCCCeEeec--Cc------cCCccccccCcccEEEeccccc-CCC Confidence 1 24789999999999988777777754 11 234556788888775432 211 001 Q ss_pred CCCccCCccccccCccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) |. ..+.++ +++|.=+ |-.+.... + +..-+ .+-.+.+-+++.+.++ +++.+. T Consensus 316 ~~-------------~~~~~~-~~~gdfs~~~~i~~~~-~----~~~~~-------~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 316 GT-------------TAKKAP-LIIGDLKEAIVLFKRE-D----MELAS-------TDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred cc-------------cCCceE-EEEEehhceEEEEeec-c----eEEEE-------eccccchhhcCceEEEEEEeeccE Confidence 11 111222 4566422 11222111 1 11111 1122345566666666 557889 Q ss_pred hccccceEEEEEeccC Q lcl|NC_019514. 
384 ILRPERLALVKTVAPL 399 (399) Q Consensus 384 iLn~~~m~~ie~~a~~ 399 (399) +++++-++.+.....- T Consensus 370 v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSA 385 (392) T ss_pred EecccceEEEEecccc Confidence 9999999886654333 No 113 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.34 E-value=1.2e-07 Score=58.46 Aligned_cols=285 Identities=13% Similarity=0.038 Sum_probs=151.8 Q ss_pred CCcCCe------ee-----cCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 1 MASKGM------LY-----NDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~~~~------~~-----n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) |..+.. .. ...-+..++..+ .+-|+ .+..+.+......-++.+++...+|+-+.|+...++.- T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~- 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ----DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS- 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec- Confidence 111110 00 000111222222 22333 23455555567788888999999999887764433222 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) . .+...|..||-.. ++. 
...+...++.+.++++.++.+|+++++ + T Consensus 159 ~------------~~~a~~v~E~~~~---~~~-------------------~~~~~~~v~l~~~k~~~~~~iS~ell~-d 203 (392) T protein:vir:10 159 D------------MIPFAEITEMGEI---PET-------------------DNPKFSNVQYAVKDRAGILPLSRSLLQ-D 203 (392) T ss_pred C------------Cccceeecccccc---ccc-------------------ccccceeEEeeeeeEEEeehhhHHHHh-h Confidence 1 1223466665221 111 113445677889999999999999765 4 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCcccc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQT 227 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t 227 (399) ++.+|...+...|.+.-+...+ .-+++|.+. ++..+..++++|..+. ..|+..-.+. T Consensus 204 s~~~l~~~i~~~l~~~i~~~~d----~~~~~g~g~----------------~~~~~~~~~d~i~~~~~~~l~~~~~~~-- 261 (392) T protein:vir:10 204 SDQNILKYVTKWLGKKSKVTRN----VLILGVIEK----------------LTKQAIKSLDDIKDVLNVKLDPAISPN-- 261 (392) T ss_pred hHHHHHHHHHHHHHHHHHHHHH----HHHhhcccc----------------ccccCccCHHHHHHHHHHhhhhhhccC-- Confidence 5666888887887766554443 334444422 1223456788887765 3444422211 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCc-cchhccc Q lcl|NC_019514. 228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPE-MLHWAGA 306 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~-~~~~~~a 306 (399) + +-++||.+...|+.|+|--+.|-|.+ .+ -.|.-+++-|.|.|.+.. +.+ ... 
T Consensus 262 ---------------a--~~vm~~~~~~~L~~lkd~~G~~l~~~--~~------~~~~~~tllG~~~v~~~~~~~~-~~~ 315 (392) T protein:vir:10 262 ---------------A--ILLTNQDGFNYLDKLKDKDGKYILQS--DP------TQKNKKLFAGTNPVVVVSNRFL-KSK 315 (392) T ss_pred ---------------C--EEEEcHHHHHHHHHhhccCCCeEeec--Cc------cCCccccccCcccEEEeccccc-CCC Confidence 1 24789999999999988777777754 11 234556788888775432 211 001 Q ss_pred CCCccCCccccccCccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) |. ..+.++ +++|.=+ |-.+.... + +..-+ .+-.+.+-+++.+.++ +++.+. T Consensus 316 ~~-------------~~~~~~-~~~gdfs~~~~i~~~~-~----~~~~~-------~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 316 GT-------------TAKKAP-LIIGDLKEAIVLFKRE-D----MELAS-------TDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred cc-------------cCCceE-EEEEehhceEEEEeec-c----eEEEE-------eccccchhhcCceEEEEEEeeccE Confidence 11 111222 4566422 11222111 1 11111 1122345566666666 557889 Q ss_pred hccccceEEEEEeccC Q lcl|NC_019514. 384 ILRPERLALVKTVAPL 399 (399) Q Consensus 384 iLn~~~m~~ie~~a~~ 399 (399) +++++-++.+.....- T Consensus 370 v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSA 385 (392) T ss_pred EecccceEEEEecccc Confidence 9999999886654333 No 114 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.34 E-value=1.2e-07 Score=58.46 Aligned_cols=285 Identities=13% Similarity=0.038 Sum_probs=151.8 Q ss_pred CCcCCe------ee-----cCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 
1 MASKGM------LY-----NDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~~~~------~~-----n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) |..+.. .. ...-+..++..+ .+-|+ .+..+.+......-++.+++...+|+-+.|+...++.- T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~- 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ----DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNS- 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeec- Confidence 111110 00 000111222222 22333 23455555567788888999999999887764433222 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) . .+...|..||-.. ++. ...+...++.+.++++.++.+|+++++ + T Consensus 159 ~------------~~~a~~v~E~~~~---~~~-------------------~~~~~~~v~l~~~k~~~~~~iS~ell~-d 203 (392) T protein:vir:10 159 D------------MIPFAEITEMGEI---PET-------------------DNPKFSNVQYAVKDRAGILPLSRSLLQ-D 203 (392) T ss_pred C------------Cccceeecccccc---ccc-------------------ccccceeEEeeeeeEEEeehhhHHHHh-h Confidence 1 1223466665221 111 113445677889999999999999765 4 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCcccc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQT 227 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t 227 (399) ++.+|...+...|.+.-+...+ .-+++|.+. ++..+..++++|..+. ..|+..-.+. 
T Consensus 204 s~~~l~~~i~~~l~~~i~~~~d----~~~~~g~g~----------------~~~~~~~~~d~i~~~~~~~l~~~~~~~-- 261 (392) T protein:vir:10 204 SDQNILKYVTKWLGKKSKVTRN----VLILGVIEK----------------LTKQAIKSLDDIKDVLNVKLDPAISPN-- 261 (392) T ss_pred hHHHHHHHHHHHHHHHHHHHHH----HHHhhcccc----------------ccccCccCHHHHHHHHHHhhhhhhccC-- Confidence 5666888887887766554443 334444422 1223456788887765 3444422211 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCc-cchhccc Q lcl|NC_019514. 228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPE-MLHWAGA 306 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~-~~~~~~a 306 (399) + +-++||.+...|+.|+|--+.|-|.+ .+ -.|.-+++-|.|.|.+.. +.+ ... T Consensus 262 ---------------a--~~vm~~~~~~~L~~lkd~~G~~l~~~--~~------~~~~~~tllG~~~v~~~~~~~~-~~~ 315 (392) T protein:vir:10 262 ---------------A--ILLTNQDGFNYLDKLKDKDGKYILQS--DP------TQKNKKLFAGTNPVVVVSNRFL-KSK 315 (392) T ss_pred ---------------C--EEEEcHHHHHHHHHhhccCCCeEeec--Cc------cCCccccccCcccEEEeccccc-CCC Confidence 1 24789999999999988777777754 11 234556788888775432 211 001 Q ss_pred CCCccCCccccccCccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) |. ..+.++ +++|.=+ |-.+.... + +..-+ .+-.+.+-+++.+.++ +++.+. T Consensus 316 ~~-------------~~~~~~-~~~gdfs~~~~i~~~~-~----~~~~~-------~~~~~~~f~~~~~~~r~~~r~d~~ 369 (392) T protein:vir:10 316 GT-------------TAKKAP-LIIGDLKEAIVLFKRE-D----MELAS-------TDVGGKAFTRNTLDLRAIQRDDVQ 369 (392) T ss_pred cc-------------cCCceE-EEEEehhceEEEEeec-c----eEEEE-------eccccchhhcCceEEEEEEeeccE Confidence 11 111222 4566422 11222111 1 11111 1122345566666666 557889 Q ss_pred hccccceEEEEEeccC Q lcl|NC_019514. 
384 ILRPERLALVKTVAPL 399 (399) Q Consensus 384 iLn~~~m~~ie~~a~~ 399 (399) +++++-++.+.....- T Consensus 370 v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 370 MWDNEAAVYGEIDLSA 385 (392) T ss_pred EecccceEEEEecccc Confidence 9999999886654333 No 115 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.30 E-value=1.9e-07 Score=57.48 Aligned_cols=284 Identities=12% Similarity=0.072 Sum_probs=158.5 Q ss_pred CCcCCeeecCCC--C--ccccc-ccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN--T--TPSGI-DAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~~--~--t~tT~-~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) ++......+... . ..++. .+-+-|+ -+..+.+..+.....+.+++...+++.+.|+-...++- T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-------- 168 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT-------- 168 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCceeccH----hHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc-------- Confidence 222211222211 1 11111 1122233 24566777788888899999999999999885433322 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) +..+...|..||-. .+.....+...++.+.++++.+..+|+++++ ++..+|.. 
T Consensus 169 ----~~~~~a~~v~E~~~----------------------~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 221 (408) T protein:vir:10 169 ----DVTPLTVMDAEDGK----------------------IPDLDNPQLTIIKYLIKRYAGIITATNTSLK-DTAENILA 221 (408) T ss_pred ----ccccceeeecCccc----------------------cccccCcceeeEEeeeeeEEeeehhHHHHHh-hchHHHHH Confidence 11122344555421 1111123455688889999999999999776 34545777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCccccceecccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQTKVITGSR 234 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t~~i~~s~ 234 (399) .+..+|.+..+...+ ..+++|.+.. ....+..++++|..+. ..|+..-.+ T Consensus 222 ~i~~~l~~~~~~~~~----~~il~g~g~~---------------~~~~~~~~~~~l~~~~~~~~~~~~~~---------- 272 (408) T protein:vir:10 222 WLSSWIAKKVVVTRN----QAIIEVMKAA---------------PKKPTIAKFDDVITMINTAVDPAIIA---------- 272 (408) T ss_pred HHHHHHHHHHHHHHH----HHHhhccccc---------------ccccccccHHHHHHHHHHhhhhhhcc---------- Confidence 777777776665443 3466665421 1123456788887765 344332111 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) . =+-+||+.+...|+.|+|--+.+-|.+- +-.|..+++-|+.++.++.. +..+.++ T Consensus 273 --------~-a~~v~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~--~~~~~~~----- 328 (408) T protein:vir:10 273 --------T-SSLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKQVIVVADR--WLPNTGS----- 328 (408) T ss_pred --------C-CEEEEcHHHHHHHHHhhccCCceEeccC--------cCCCCCceecceeeEEeccc--ccCccCC----- Confidence 0 1357999999999999887777777542 12345678889888776542 1111111 Q ss_pred cccccCccceEEEEEEEcccc--eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccce Q lcl|NC_019514. 
315 GYRETNGKYDIYPMLCVGAES--FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERL 390 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~A--fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m 390 (399) + .+ .+++|.=+ |....-++ +. +... +-....-+++...++ +.+.+.+++++-+ T Consensus 329 ------~---~~-~i~~gd~~~~~~~~~~~~------~~--v~~~-----~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~ 385 (408) T protein:vir:10 329 ------T---VY-PLYYGDMSQAITLFDREN------MS--LLPT-----NIGAGAFETDTTKIRVIDRFDVKATDSEAL 385 (408) T ss_pred ------C---ce-EEEEEehhccEEEEEecc------eE--EEEc-----ccccchhhcCceEEEEEEeeccEEeccccE Confidence 1 22 25677533 32222222 11 1100 101122356777777 4589999999999 Q ss_pred EEEEEeccC Q lcl|NC_019514. 391 ALVKTVAPL 399 (399) Q Consensus 391 ~~ie~~a~~ 399 (399) +.++..++- T Consensus 386 ~~~~~~~~~ 394 (408) T protein:vir:10 386 VAGSFSAIA 394 (408) T ss_pred EEEEeeccc Confidence 998866643 No 116 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.23 E-value=2.1e-07 Score=57.14 Aligned_cols=287 Identities=13% Similarity=0.061 Sum_probs=154.8 Q ss_pred CCcC-CeeecCC---CCcccccccccccceehhhhhHHHHHHHHHHH-HhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MASK-GMLYNDP---NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQ-YFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~-~~~~n~~---~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~l-v~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) +... ...-+.. ..+++...+-+.|. + ..+.+.+..+.. 
++.+++.+.+ -+.|..+.+.+..- T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~----~-~~~~i~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~------ 163 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRT----L-YGQLIAQAVERSAIMRGGASTFT--TSDANPMDFTVITG------ 163 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCcccccc----c-hHHHHHHHHhhhhhhhhcceeee--cCCCceeEEEEEcC------ Confidence 1000 0000111 12222222223333 1 245566655443 4666776544 34555666655422 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) .+...|+.||- .......+...++.+.++++.++.+|+++++ +++.++.. T Consensus 164 ------~~~a~~v~E~~-----------------------~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~ 213 (392) T protein:vir:13 164 ------RATAGIVGETA-----------------------EIPESYPATTQRSMGGFKYGFASVVSYEFAT-DQVLDLVG 213 (392) T ss_pred ------Ccceeeecccc-----------------------cccccccceeeEEeeeeeEEeeehhHHHHHh-cchHHHHH Confidence 12224555551 1222334455678889999999999999776 45555777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCccc--ccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQD--SEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~--~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) .+...|.+.-+...+ ..+++|.++-.=.|--+.. +......+....+++++|.++...|+..-.. 
T Consensus 214 ~i~~~l~~~i~~~~d----~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~--------- 280 (392) T protein:vir:13 214 FLVSDAGPAIGDAMG----RHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRK--------- 280 (392) T ss_pred HHHHHHHHHHHHHHH----HHHhcccCCccccccccccccccccccccccccccHHHHHHHHHhhhhhhhc--------- Confidence 777777776554443 3466665431100110000 0011112334668899998888777543111 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCC Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTN 313 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~ 313 (399) .++| ++|+.+...|+.|+|.-+.|-|.+-- -.|.-+.+-|+.++.++.+- ++ T Consensus 281 --------~a~~--v~n~~~~~~l~~lkd~~G~~l~~~~~--------~~g~~~~l~G~Pv~~~~~~~----------~~ 332 (392) T protein:vir:13 281 --------NAKF--VVNDLRAAQMRKLKDANGQYLWQSAL--------TVGAPDTFNGKVVETDDGMP----------AD 332 (392) T ss_pred --------CCEE--EEcHHHHHHHHHhhccCCceeecCCc--------CCCCCceecceeeEEcCCCC----------CC Confidence 1223 67999999999999877777776532 23444577788888887751 11 Q ss_pred ccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_019514. 314 PGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLA 391 (399) Q Consensus 314 ~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .++||.=+.-.++..++ + .+. ...|++-+++..+++ .++.+.+.+++-++ T Consensus 333 --------------~i~~Gdf~~~~i~~~~~-----~--~i~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~ 384 (392) T protein:vir:13 333 --------------KVLFADLSKYRVRFAGS-----L--RVD-------RSVDAKFSTDQIVYRFLQRADGLLVDARGAK 384 (392) T ss_pred --------------cEEEeeccceeEEeecc-----e--EEE-------eeccccccCCcEEEEEEEEeccEEecccceE Confidence 14566643333433331 1 122 123677777666666 56788899998888 Q ss_pred EEEEeccC Q lcl|NC_019514. 
392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) .+++.+-- T Consensus 385 ~~~~~~aa 392 (392) T protein:vir:13 385 VLTVTPAA 392 (392) T ss_pred EEEeeccC Confidence 65543333 No 117 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.23 E-value=6.2e-07 Score=54.60 Aligned_cols=285 Identities=12% Similarity=0.083 Sum_probs=159.9 Q ss_pred CCcCCeeecCCC-----CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN-----TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~~-----~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) ++......+... ..+++..+-+-|+ .+..+.+....+...+.+++...+|+...|+...++... T Consensus 101 ~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~------- 169 (404) T protein:vir:39 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTD------- 169 (404) T ss_pred HhcchhhhhhhhhhhhhcccccCCceeccH----HHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecC------- Confidence 222111222111 1111111222233 345566667888889999999999999888876554431 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) ..+...|..||-.. ++ ....+...++.++++++.++.+|+++++ +++..+.. 
T Consensus 170 -----~~~~a~~v~Eg~~~---~~-------------------~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 221 (404) T protein:vir:39 170 -----VTPLTVMDAEDGKI---PD-------------------LDNPRLTIIKYLIKRYAGIITATNTLLK-DTAENILA 221 (404) T ss_pred -----CccceeeecCcccc---cc-------------------ccccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHH Confidence 11223455555211 11 1123456688889999999999999776 45556888 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceecccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVITGSR 234 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~~s~ 234 (399) .+..+|.+..+...+. .+++|.+.. .......+++++..+.. .++..-.+ T Consensus 222 ~i~~~l~~~~~~~~d~----~il~g~g~~---------------~~~~~~~~~~~i~~~~~~~~~~~~~~---------- 272 (404) T protein:vir:39 222 WLSSWIAKKVVVTRNQ----AIIAAMGTV---------------PKKPTIAKFDDVITMINTSVDPAIIA---------- 272 (404) T ss_pred HHHHHHHHHHHHHHHH----HHHhccccc---------------ccccccccHHHHHHHHHHhhhhhhcc---------- Confidence 8888888777655543 456665321 12234457788777653 23221111 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) .=+-+|||.+...|+.|+|-.+.|-|.+- +..+..+++-|..++.+..+ +...++ T Consensus 273 ---------~a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~--~~~~~~------ 327 (404) T protein:vir:39 273 ---------TSSLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKKVIVVADR--WLPNSG------ 327 (404) T ss_pred ---------CCEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceecceeEEEeccc--ccCccC------ Confidence 01468999999999999887776666532 23445568888887766542 111111 Q ss_pred cccccCccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_019514. 
315 GYRETNGKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLA 391 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .+. + .+++|.-. +-.+... .+ +.+-+- +-.+.+.+.+...++ +.+++.+++++-++ T Consensus 328 -----~~~---~-~~~~gd~~~~~~~~~~-~~----~~i~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~ 386 (404) T protein:vir:39 328 -----STV---Y-PLYYGDMSQAITLFDR-EN----MSLLPT-------NIGAGAFETDTTKIRVIDRFDVKTTDSEALV 386 (404) T ss_pred -----CCc---c-EEEEEeccccEEEEee-cc----eEEEEe-------ccchhhhhhceeeEEEEeeeccEEecccceE Confidence 111 1 24566432 1122111 11 111111 122345566777766 67889999999988 Q ss_pred EEEEeccC Q lcl|NC_019514. 392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) .++..+.= T Consensus 387 ~~~~~~~a 394 (404) T protein:vir:39 387 AGSFTAIA 394 (404) T ss_pred EEEeeccc Confidence 88854433 No 118 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.23 E-value=3.1e-07 Score=56.29 Aligned_cols=297 Identities=13% Similarity=0.074 Sum_probs=155.8 Q ss_pred CCcCCe--eecCCCCcccccccccccceehhhhhHHHHHH-HHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGM--LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIE-ARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~--~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~-A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) ..+..- .-+.+..+.++....+.|+.. .+.++. .....++.+++...++..+ ++++-+.... ..+. 
T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-----~~~i~~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~---~~~~ 180 (419) T protein:vir:94 112 DIDPNRLLSRDAPAGTITNPNVPHLPQLV-----PGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSG---TAGA 180 (419) T ss_pred HHHHHHhhccccccccccCCcccccchhh-----hHHHHHHHhhhhhhhhcceeeeccCC---ceeeeeeccc---cccc Confidence 000000 011122222333333444422 233333 3344566777776666433 3444333211 0000 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) ..+ ....+|..|| +.......+...++.++++++.+..+|+++++ ++. .+...+ T Consensus 181 ~~~-~~~a~~v~Eg-----------------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~i 234 (419) T protein:vir:94 181 GST-WNKAAVVPEG-----------------------TAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DNS-QLMGYI 234 (419) T ss_pred ccc-CcccceecCC-----------------------ccccccccceeeEEeeeeeEEEeehhhHHHHH-hHH-HHHHHH Confidence 111 1122345554 22223334556788999999999999999887 444 366666 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcc-----cccccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQ-----DSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITG 232 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats-----~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~ 232 (399) ...|.+..+... -..+++|.+.-.-.|..+. .+.............+++|.++.-.|.....+. 
T Consensus 235 ~~~la~a~~~~~----d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~------- 303 (419) T protein:vir:94 235 QGRLTYGLRFLR----DRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP------- 303 (419) T ss_pred HHHHHHHHHHHH----HHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCC------- Confidence 666665554444 3445666554332222111 111111122334456888888887776544331 Q ss_pred ccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccC Q lcl|NC_019514. 233 SRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGT 312 (399) Q Consensus 233 s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~ 312 (399) =+.+|||.+...|+.++|.-+.+-|... . .-.+-.+++-|+.++.++.+.. T Consensus 304 ------------~~~v~n~~~~~~l~~~k~~~~~~~~~~~--~-----~~~~~~~~l~G~pV~~~~~~~~---------- 354 (419) T protein:vir:94 304 ------------DGVVVHPQDWESIELDQAPGSGVFRVIA--N-----VQGEATPRIWGLNVVSTVAIAQ---------- 354 (419) T ss_pred ------------CEEEEcHHHHHHHHHHhhcCCCceeecC--C-----cccCCCccccceeeEEcCCCCC---------- Confidence 1468999999999998875444333211 1 1234456788999999887621 Q ss_pred CccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccce Q lcl|NC_019514. 313 NPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERL 390 (399) Q Consensus 313 ~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m 390 (399) + .++||.-......+...+ +. +. . .+-.+.+-+++...|+ +++.+.+.+++-+ T Consensus 355 ~--------------~~~~gd~~~~~~~~~~~~----~~--v~-~----~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~ 409 (419) T protein:vir:94 355 G--------------TALVGGFRQGATLWSRQG----IT--VL-M----TDSHADFFTANTLVILAEFRANLAVYQPKAF 409 (419) T ss_pred c--------------cEEEeeccceEEEEEecc----eE--EE-E----eccccchhhcCcEEEEEEEeeccEEeccccE Confidence 1 145666554433333333 11 11 1 1112334455666666 6788999999999 Q ss_pred EEEEEeccC Q lcl|NC_019514. 391 ALVKTVAPL 399 (399) Q Consensus 391 ~~ie~~a~~ 399 (399) ++++.++.. 
T Consensus 410 ~~~~~~aa~ 418 (419) T protein:vir:94 410 VRVTFAAAT 418 (419) T ss_pred EEEEeccCC Confidence 999988888 No 119 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.20 E-value=2.2e-07 Score=57.07 Aligned_cols=290 Identities=11% Similarity=0.093 Sum_probs=147.9 Q ss_pred ecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCCce Q lcl|NC_019514. 8 YNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGAT 87 (399) Q Consensus 8 ~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~ 87 (399) ++. .+ ++..+.+-|+ . +..+.++.+++.-++.+++...+++.+ ++++-+..- .+...| T Consensus 1 ma~--~t-~~~gg~liP~---~-~~~~Ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~------------~~~a~w 58 (305) T protein:vir:25 1 MAD--IS-RAEVASLIQE---A-YSDTLLAAAKQGSTVLSAFQNVNMGTK---TTHLPVLAT------------LPEADW 58 (305) T ss_pred CCC--cc-CCccceecCH---H-HHHHHHHHHHhhchhhhhcceeeccCC---cEEEEEEeC------------CcceEE Confidence 222 22 2222332233 2 357788888999999999998888643 344333211 123345 Q ss_pred eccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhhH Q lcl|NC_019514. 88 IVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQ 167 (399) Q Consensus 88 lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~ 167 (399) ..||-. ..-+++.. ...+...++...+|++.++.+|+++++ +++..+...+...|.+.-+. 
T Consensus 59 v~E~~~----~~~~~~~~--------------s~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~~~~a~ 119 (305) T protein:vir:25 59 VGESAT----DPKGVKPT--------------SKVTWANRTLVAEEIAVIIPVHENVID-DATVAVLTEVAELGGQAIGK 119 (305) T ss_pred eecccc----cccccccc--------------cccceeeEEeeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHH Confidence 555511 00011111 124456688889999999999999775 44455777777777777665 Q ss_pred HHHHHHHHHHHhcCCe----EEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCc Q lcl|NC_019514. 168 LTEAVLQKDLLAGAGT----IVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISA 243 (399) Q Consensus 168 ~~e~~l~~~~lag~~~----v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~ 243 (399) ..+.. +++|.+. .-.............-.+.......+++..+...+....... .. ... T Consensus 120 ~~d~a----~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~------~~~ 182 (305) T protein:vir:25 120 KLDQA----VIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASA-------GW------APD 182 (305) T ss_pred HHhhh----heeccCCCCCccccccccccccccccccccccchhhhHHHHHHHHHHHhhhhc-------cc------ccc Confidence 55443 4444431 111000000000001112233344455544444443322211 00 001 Q ss_pred eeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccCccc Q lcl|NC_019514. 244 GRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKY 323 (399) Q Consensus 244 ~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~ 323 (399) ..++||.....|+.|+|--+.+-| .. +.+-|...+.++.+ ++ ..++. T Consensus 183 --~~v~~~~~~~~l~~lkd~~G~~i~-------------~~--~~l~G~Pv~~~~~~-~~---------------~~~~~ 229 (305) T protein:vir:25 183 --TLLSSLALRYEVANIRDANGNPVF-------------RD--DSFAGFRTFFNRNG-AW---------------DADAA 229 (305) T ss_pred --eeEecHHHHHHHHHhhccCCceee-------------cC--CcccccceEEcCcc-CC---------------CCCcc Confidence 257799999999988764444433 22 46788888877654 10 01122 Q ss_pred eEEEEEEEcccceeeeccccCCCCccceEE---EecCCCCCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEEe-- Q lcl|NC_019514. 
324 DIYPMLCVGAESFTTIGFQTDGKTLKFKVT---TKMPGEATADRNDPYGEMGFSSIKW--YYGTLILRPERLALVKTV-- 396 (399) Q Consensus 324 DVyp~lV~G~~Afg~v~l~g~g~~~~~~~i---vk~pG~~~ad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ie~~-- 396 (399) .+++|.=+.-.++..++ ..+++. ....+ +..-.+-|+....|++ .++..++|++-.+.+.-+ T Consensus 230 ----~~~~gd~s~~~i~~~~~---~~i~~~~~~~~~~~----~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 230 ----IEVIADSSRVKIGVRQD---ITVKFLDQATLGTG----ENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred ----EEEEEecceEEEEEecC---eEEEEeeeeeeecC----CceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 24567655555555442 223222 11111 1112245777888873 477788898877766543 Q ss_pred ccC Q lcl|NC_019514. 397 APL 399 (399) Q Consensus 397 a~~ 399 (399) +.+ T Consensus 299 ~~~ 301 (305) T protein:vir:25 299 AVV 301 (305) T ss_pred ccc Confidence 223 No 120 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.20 E-value=3.9e-07 Score=55.70 Aligned_cols=295 Identities=14% Similarity=0.094 Sum_probs=154.1 Q ss_pred CCcCCeeecCCC--CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN--TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~n~~~--~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |. ....-..-. ++.++..+-+-|+ .+..+.+....+..++.+++...++. | .+++-++.. 
T Consensus 131 l~-~~~~~~e~~a~~~~t~~GG~lvP~----~~~~~Ii~~l~~~~~i~~~~~~~~~~---~-~~~~p~~~~--------- 192 (434) T protein:vir:62 131 IV-GNIDEKEARALGLVTGNGSVTIPD----FLSKEIITYAQEENFLRRLGTGVKTK---E-NIKYPVLVK--------- 192 (434) T ss_pred hc-cccchhhhhhhcccccccceecch----hhHHHHHHhhhhhhhhhhhcceeccC---C-ceEEEEEec--------- Confidence 10 000000000 1112222223343 24566676677888888999876543 3 244433311 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) + +...+...+ +.|+......+++..++.+.++++.++.+|+++++ ++..+|.+.+. T Consensus 193 -~--~~a~~~~~~--------------------~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 248 (434) T protein:vir:62 193 -K--AEAQGHKNE--------------------RTNNEMPETDIEFDEIELSPTEFDALATVTKKLLA-RTGLPIEQIVM 248 (434) T ss_pred -C--Ccccceecc--------------------cccccccccccceeeEEeeheeeEeehhhHHHHHh-cchHHHHHHHH Confidence 1 111222211 11222223334566788999999999999999776 44445777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 
159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) +.|.+..+...+ ..+++|.++--..++....+.+ ..++...+++++|.++.-.|+..-.+ T Consensus 249 ~~la~~~~~~~d----~~~l~G~G~~~~~~g~~~~~~~--~~~~~~~~~~d~l~~l~~~l~~~~~~-------------- 308 (434) T protein:vir:62 249 DELKKAYVRKET----QYMVNGDEANNINDGALAKKAV--EFKTDEKNLYDALVKMKNTPVKEVRK-------------- 308 (434) T ss_pred HHHHHHHHHHHH----HHHhccCCCCccccceeecccc--cccccccchhhHHHHHHhhcchhhhc-------------- Confidence 888777664443 4466766543333322222222 12344557889998888777553221 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) .+ +-++||.+...|+.|+|--+.|-|.|..+ + -.|--..+-|.+++.++.|. .+.+ T Consensus 309 ----~a-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-~-----~~g~~~tl~G~pV~~~~~~~----~~~~--------- 364 (434) T protein:vir:62 309 ----KA-RWVLNTAALTKIETMKTDDGFPLLRPFNQ-A-----EGGIGYTLLGFPVEEEDAID----IPDS--------- 364 (434) T ss_pred ----CC-EEEEcHHHHHHHHHhhccCCCEeeccCCC-c-----cCCCCceecceeeEEecCcc----CccC--------- Confidence 12 23689999999999998777777775321 1 12233468899999887762 1111 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHH--HHh-hccccceEEEEE Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYY--GTL-ILRPERLALVKT 395 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~--~~~-iLn~~~m~~ie~ 395 (399) -+. +.+.||.-+...+.-..+. +. +. ...++|-.++.++++++. .++ |++++=.+.+.. 
T Consensus 365 ----~~~-~~i~~Gdfs~~~i~~~~g~----~~--i~-------~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~ 426 (434) T protein:vir:62 365 ----PDT-PVFYFGDFSKFYIQDVIGS----LE--VQ-------KLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKY 426 (434) T ss_pred ----CCc-eEEEEeeccceEEEEeece----eE--EE-------eehhhhcccCceEEEEEeeecceeecCcccceEEEE Confidence 011 4566787666555433221 11 11 123556555555555332 344 333554444433 Q ss_pred eccC Q lcl|NC_019514. 396 VAPL 399 (399) Q Consensus 396 ~a~~ 399 (399) ..+. T Consensus 427 ~~~~ 430 (434) T protein:vir:62 427 VLKA 430 (434) T ss_pred Eecc Confidence 3333 No 121 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.17 E-value=5.8e-07 Score=54.79 Aligned_cols=302 Identities=12% Similarity=0.044 Sum_probs=150.4 Q ss_pred CCcCC---eeecCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKG---MLYNDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~---~~~n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~ 76 (399) ...+. ..-|.- ++.++..+ -+-|+ .+..+.++...+...+.++. .+.+|-..|. +++-+..- T Consensus 119 ~~~~~~~~~~~~~~-~~~t~~~gg~~vP~----~~~~~ii~~l~~~~~i~~~~-~~~~~~~~~~-~~~p~~~~------- 184 (435) T protein:vir:14 119 AIERGFGEEVAMSL-NTLSPGAGGVLVPE----NLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-ITIPRLKG------- 184 (435) T ss_pred HHhhhhhhhhhhhc-ccCCcCCCccccch----hHHHHHHHHHhhhchhhhhc-ceeeecCCCc-eEEEEEeC------- Confidence 01100 000111 11222222 23344 23456666677777777763 3445655553 44433311 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh-cchHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD-SDSELFS 155 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~-~D~~l~~ 155 (399) .+...|..||. .++.. 
..+...++...++++.++.+|+++++.. .|+.+.+ T Consensus 185 -----~~~a~~v~E~~---------~~~~~--------------~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~ 236 (435) T protein:vir:14 185 -----GAIVGYIGADT---------DIPTT--------------QQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQ 236 (435) T ss_pred -----CcceeeeccCc---------ccccc--------------ccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHH Confidence 12234555542 12222 2345567889999999999999877543 4666777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeE-EecCCC---cccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTI-VYTGAA---TQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v-~yag~a---ts~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) .+..+|.+..+...+ ..+++|.++- .-.|-. ......+............++.+++..|+.+.+-. T Consensus 237 ~i~~~l~~ai~~~~d----~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------ 306 (435) T protein:vir:14 237 IVVGDLTAAIGARED----KAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANL------ 306 (435) T ss_pred HHHHHHHHHHHHHHH----HHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccc------ Confidence 777777766554443 3445654321 001100 00000111111222223456777776666643321 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) . . -+-++||.....|+.|+|--+.+-|-.. 
--|++-|+.+++++.+-.-.+.++ T Consensus 307 -----~----~--~~~v~n~~~~~~L~~lkd~~G~~l~~~~------------~~g~l~G~Pv~~~~~~p~~~~~~~--- 360 (435) T protein:vir:14 307 -----T----Q--PGWIMAPRTFRFLEGLRDGNGNKVYPEL------------ANGMLKGYPVGKTTQVPINLGETG--- 360 (435) T ss_pred -----c----C--CEEEEcHHHHHHHHHhhccCCceeccCC------------CCCeeecceeEeeccccccccCCC--- Confidence 0 1 2347899999999999887776666321 125788999999887622111111 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCC--CCCCCCccchhhHHHHH--HHHHHhhccc Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEA--TADRNDPYGEMGFSSIK--WYYGTLILRP 387 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~--~ad~~DPlgQrg~~gwK--~~~~~~iLn~ 387 (399) ... .+++|.=+.-.++..++ +.+.+..=+.. ....--.|-|++.+.++ +++.+.+.++ T Consensus 361 ---------~~~----~i~~gd~s~~~i~~~~~-----~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~ 422 (435) T protein:vir:14 361 ---------KES----EIYFTDFGDVFIGEEET-----LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHV 422 (435) T ss_pred ---------ccc----eEEEeecccEEEEEecc-----cEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecc Confidence 111 25567655544544432 22222111100 00011245677777777 5677778887 Q ss_pred cceEEEEEeccC Q lcl|NC_019514. 388 ERLALVKTVAPL 399 (399) Q Consensus 388 ~~m~~ie~~a~~ 399 (399) +-++.|.-+ +. T Consensus 423 ~a~~~l~~~-~~ 433 (435) T protein:vir:14 423 ESIAVLAGV-AW 433 (435) T ss_pred cceEEEecC-CC Confidence 766665432 22 No 122 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.16 E-value=8.4e-07 Score=53.88 Aligned_cols=286 Identities=13% Similarity=0.096 Sum_probs=154.6 Q ss_pred CCcCCeeecCCC----Cccccc-ccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 
1 MASKGMLYNDPN----TTPSGI-DAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~~----~t~tT~-~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) +.......+... +..++. .+-+-|+ .+....+....+...+.+++...+|+.+.|+-...+.. T Consensus 101 ~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-------- 168 (408) T protein:vir:74 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWT-------- 168 (408) T ss_pred HhcchhhhhhhhhhhhcccccCCCceeech----hHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeec-------- Confidence 111111111111 111221 1222233 44566666678888899999999999887764433322 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) +..+.+.+..||-. .......+...++.++++++.++.+|+++++ +++..|.. T Consensus 169 ----~~~~~~~~v~E~~~----------------------~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 221 (408) T protein:vir:74 169 ----DVTPLKAMDEEDGK----------------------IPDLDNPRLTIIKYLIKRYAGIITATNTLLK-DTAENILA 221 (408) T ss_pred ----CCcccccccccccc----------------------cccccccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHH Confidence 11223344444411 1111223455678889999999999999776 45545888 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHH-HHHHhccCccccceecccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLS-ITLDENRTPKQTKVITGSR 234 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~-~~L~~nrap~~t~~i~~s~ 234 (399) .+...|.+..+...+ ..+++|.++-. .....+++++|..+. ..|+.+-... 
T Consensus 222 ~i~~~l~~~~~~~~d----~~il~G~G~~~---------------~~~~~~~~~~i~~~~~~~l~~~~~~~--------- 273 (408) T protein:vir:74 222 WLSSWIAKKVVVTRN----QAIIAAMGTVP---------------KKPTIANFDDVITMINTSVDPAIIAT--------- 273 (408) T ss_pred HHHHHHHHHHHHHHH----HHHhhcccccc---------------cccccccHHHHHHHHHHhhhhhhcCC--------- Confidence 888888877665543 35667654321 223456788887764 3454322110 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) + +-+|||.+...|+.|+|--+.|-|.+- +..|--+++-|..++.++.. +.... T Consensus 274 --------a--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~--~~~~~------- 326 (408) T protein:vir:74 274 --------S--SLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKQVIVVADR--WLPNS------- 326 (408) T ss_pred --------C--EEEEcHHHHHHHHHhhcCCCceEeccC--------cCCCCCceecceeeEEecCc--ccccc------- Confidence 1 346899999999999876666665431 12334467888887765531 10000 Q ss_pred cccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_019514. 315 GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLAL 392 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) .++.. .+++|.-+-+..-+...+ +.+-+- +-.+..-+++.+.|+ +.+.+.+++++-++. T Consensus 327 ----~~~~~----~i~~gd~~~~~~~~~~~~----~~i~~~-------~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~ 387 (408) T protein:vir:74 327 ----GSTVY----PLYYGDMSQAITLFDREN----MSLLPT-------NIGAGAFETDTTKIRVIDRFDVKATDSEALVA 387 (408) T ss_pred ----cCCcc----eEEEEehhccEEEEEecc----eEEEEe-------ccccchhhcceeeEEEEEeeCcEEecccceEE Confidence 01112 256775432222111111 111111 111223345556655 567889999999988 Q ss_pred EEEeccC Q lcl|NC_019514. 
393 VKTVAPL 399 (399) Q Consensus 393 ie~~a~~ 399 (399) ++..+.- T Consensus 388 ~~~~~~~ 394 (408) T protein:vir:74 388 GSFTAIA 394 (408) T ss_pred EEeeccc Confidence 8864433 No 123 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.12 E-value=3.9e-07 Score=55.73 Aligned_cols=300 Identities=11% Similarity=0.052 Sum_probs=148.8 Q ss_pred CCcCCeeecCC--C--Ccccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MASKGMLYNDP--N--TTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~--~--~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) +..+. ++.. . ++.++..++ +-|+ .+.++.++...+..++.++. .+.+|-..|. +++-+..- T Consensus 119 ~~~~~--~~~~~~~~~~~~~~~~gg~lvP~----~~~~~ii~~l~~~~~i~~~~-~~~v~~~~~~-~~~p~~~~------ 184 (435) T protein:vir:80 119 AIERG--FGEEVAMSLNTLSPGAGGVLVPE----NLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-ITIPRLKG------ 184 (435) T ss_pred HHhhh--hhhhhhhhhcccCCCCCccccch----hHHHHHHHHHhhhchhhhcc-ceeeecCCCc-eEEEEEeC------ Confidence 00000 0000 0 111222222 3343 24566666677777777773 2345555553 44433311 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhh-hcchHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDF-DSDSELF 154 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~-~~D~~l~ 154 (399) .+...|..||. .+. ....+...++...++++.++.+|+++++. ..++.+. 
T Consensus 185 ------~~~a~~v~E~~---------~~~--------------~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~ 235 (435) T protein:vir:80 185 ------GAIVGYIGADT---------DIP--------------TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVD 235 (435) T ss_pred ------CcceeeeccCc---------ccc--------------ccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHH Confidence 12234555542 112 22234556888999999999999997754 3455677 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhcCCeE-----EecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccce Q lcl|NC_019514. 155 SHISTELMNGAVQLTEAVLQKDLLAGAGTI-----VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKV 229 (399) Q Consensus 155 ~~~~~~lg~~a~~~~e~~l~~~~lag~~~v-----~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~ 229 (399) +.+..++.+..+.-.+. .+++|.++- +.+....... .++...........++.+++..|+.+.... T Consensus 236 ~~i~~~l~~a~~~~~d~----a~l~G~G~~~~p~Gi~~~~~~~~~-~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~---- 306 (435) T protein:vir:80 236 QIVVGDLTAAIGAREDK----AFIRDDGTANTPKGLRFWALPGNV-ITASDGSTLQKIETDLGKAILALENADANL---- 306 (435) T ss_pred HHHHHHHHHHHHHHHHH----HhhccCCCCCcccceeecccccce-eecccccchhhHHHHHHHHHHHhhcccccc---- Confidence 77777777665544433 455664321 1111111100 111111112233456777777776654321 Q ss_pred eccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCC Q lcl|NC_019514. 230 ITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGAT 309 (399) Q Consensus 230 i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~ 309 (399) -.++| ++||.....|+.|+|--+.+-|-.. --|++-|+.+++++.+-.-.+.+ T Consensus 307 -----------~~~~~--vmn~~~~~~L~~lkd~~G~~l~~~~------------~~~~l~G~pv~~~~~~p~~~~~~-- 359 (435) T protein:vir:80 307 -----------TQPGW--IMAPRTFRFLEGLRDGNGNKVYPEL------------ANGMLKGYPVGKTTQVPINLGEA-- 359 (435) T ss_pred -----------ccCEE--EEcHHHHHHHHhhhccCCceeccCC------------CCCeEeeeeeEEeccccccccCC-- Confidence 01233 6899999999999887666666321 12578899999988863211110 Q ss_pred ccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEec-CCCC-CCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_019514. 
310 VGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKM-PGEA-TADRNDPYGEMGFSSIK--WYYGTLIL 385 (399) Q Consensus 310 ~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~-pG~~-~ad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ++.. .+++|.=++-.++..++ +.+-+.. .+.. ....--.+-|+..+.|+ +.+.+.+. T Consensus 360 ----------~~~~----~i~~gd~s~~~i~~~~~-----~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~ 420 (435) T protein:vir:80 360 ----------GKES----EIYFTDFGDVFIGEEET-----LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPR 420 (435) T ss_pred ----------CCcc----eEEEEEcccEEEEeecc-----eEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEee Confidence 1111 35667655555544332 2221111 1100 00011244566677777 45666777 Q ss_pred cccceEEEEEeccC Q lcl|NC_019514. 386 RPERLALVKTVAPL 399 (399) Q Consensus 386 n~~~m~~ie~~a~~ 399 (399) +++-++.|.- +.. T Consensus 421 ~~~a~~~l~~-~~~ 433 (435) T protein:vir:80 421 HVESIAVLSG-VAW 433 (435) T ss_pred cccceEEEec-cCC Confidence 7766555532 222 No 124 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.11 E-value=5.8e-07 Score=54.77 Aligned_cols=284 Identities=13% Similarity=0.031 Sum_probs=152.0 Q ss_pred CCcCCeeecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccC Q lcl|NC_019514. 
1 MASKGMLYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQ 79 (399) Q Consensus 1 ~~~~~~~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~ 79 (399) |+.....-.+- ...+++..+-+-|+ .|..+.+....+...+.+++...+++.+.++-.....- T Consensus 100 l~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------------ 163 (394) T protein:vir:10 100 IHSHGKVIDNAAGHVTSTEAGVLIPE----EIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRA------------ 163 (394) T ss_pred HhccchhhhhhhcccccccCceeccH----HHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecC------------ Confidence 22211111110 11112222222333 34566666677888899999999998775443222211 Q ss_pred CCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHH Q lcl|NC_019514. 80 GIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIST 159 (399) Q Consensus 80 gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~ 159 (399) .....+..|+- .......++...++.++++|+.|+.+|+++++. ++.++...+.+ T Consensus 164 --~~~~~~~~E~~----------------------~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~d-s~~~l~~~i~~ 218 (394) T protein:vir:10 164 --TDRFSSVAELA----------------------ENPALAEPEFEQVDWSVSTYRGAIPLSEEAIAD-SAVDLTSLVGQ 218 (394) T ss_pred --CCccccccccc----------------------cccccccccceeEEeeeeeeEeeehhHHHHHhh-hhHHHHHHHHH Confidence 12234555542 122222344566788899999999999997764 55568888878 Q ss_pred HHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCc Q lcl|NC_019514. 160 ELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 160 ~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~~s~~~~T 238 (399) .|.+.-+...+ ..+++|.+.- + ..++.+..++++|..+.. .|+... 
T Consensus 219 ~la~~~~~~~~----~~il~g~g~~------~-------~~~~~~~~~~d~l~~~~~~~~~~~~---------------- 265 (394) T protein:vir:10 219 SINEKSVNTYN----AMIAPVLQSF------T-------AKATTTDTLVDSLKHILNVDLDPAY---------------- 265 (394) T ss_pred HHHHHHHHHHH----HHHhhccccc------c-------cccccccccHHHHHHHHHhhhhhhc---------------- Confidence 88776554443 3455555321 0 112234457777776553 222211 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) .+ +-++||.+...|+.|+|-.+.|-|.+.-. + ..-.+.-+++-|++++.++.... +.+ T Consensus 266 ---~a--~~vmn~~~~~~l~~lkd~~G~~i~~~~~~--~--~~~~~~~~~L~G~PV~~~~~~~~----~~~--------- 323 (394) T protein:vir:10 266 ---SR--ALVVTQSLFNTLDTLKDKNGRYLLHDASD--S--ITDGTAKGTVLGVPVYVVGDALL----GSA--------- 323 (394) T ss_pred ---cC--EEEecHHHHHHHHHhhccCCCeeeecccc--c--cccCCcccccccceeEEeccccc----CCC--------- Confidence 11 35799999999999998877777765321 1 11123456889999987765311 110 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) .+. ..++||.-+-+.+.+...+ +.+-+-. +....+++..+ +.+.+.+.++.=++.|+...+ T Consensus 324 -~~~----~~i~~gd~s~~~~~~~~~~----~~v~~~~---------~~~~~~~~~~~-~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 324 -AGD----QKAFVGDLKRGVLFADRQQ----VTLAWED---------SKIYGRYLGAA-FRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred -CCc----eEEEEeeccccEEEEeecc----eEEEEec---------ccccceeEEEE-EEeccEEeccccEEEEEeecc Confidence 011 2456775332222222222 1111111 11222333322 467788888888888877666 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) . 
T Consensus 385 ~ 385 (394) T protein:vir:10 385 A 385 (394) T ss_pred c Confidence 6 No 125 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.07 E-value=3.6e-06 Score=50.45 Aligned_cols=284 Identities=14% Similarity=0.064 Sum_probs=139.3 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHH-Hhh------hhcccccccccCCCEEEEEEccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQ-YFM------PLADVVSMPKNYGKEIRVYHYIPLLDD 73 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~l-v~~------~fA~~~~mPkN~GktIk~rry~pl~~~ 73 (399) |+. +.|...--|-|++=. .++..-.|+. .|- ..++...+=...|.+|++=.|.+|..+ T Consensus 1 Ma~----------~~T~l~d~i~pevf~-----~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~ 65 (330) T protein:vir:10 1 MAN----------ELTKILDTITPQQYN-----AYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGD 65 (330) T ss_pred CCC----------CceEeeeeechhHHH-----HHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCc Confidence 543 224444444555333 2333322322 132 222222222346999999999888433 Q ss_pred cccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 74 RNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 74 ~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l 153 (399) ...+.+|-. . |++. 
..+-....+.+++.|.=.+++|.+..+.-.+ - T Consensus 66 ~~~~~dg~~----~---------------i~~~--------------ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~d-p 111 (330) T protein:vir:10 66 SEVLGNGDK----A---------------LETG--------------KITAGADIACVLYRGRGWAANELTGVVAGSD-P 111 (330) T ss_pred ccccCCCcc----c---------------cchh--------------hcccceeEEEEEeecceeeehhhhhhhcchh-H Confidence 333322210 0 1111 1122335677888898899999887665554 4 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC----CCccccc-ccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG----AATQDSE-ITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag----~ats~~~-~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) ++++.+.+++-- .+..|.+|++-..-++-.. .+.-... ..........++++.|-+|...|-.+... T Consensus 112 ~~~i~~q~a~~w----~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~---- 183 (330) T protein:vir:10 112 VRAILNRIGAYW----LREDQKALIATLNGIFATGTAGEKGALEETHVSDQSKASTGIDAGMVLDAKQLLGDSADQ---- 183 (330) T ss_pred HHHHHHHHHHHh----hhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccccccccCHHHHHHHHHHhcccccc---- Confidence 555555555443 3444444444332222111 1111010 01111234458888999988777664332 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) -.+.++||....+||+. +++...+|.+. .+.||.+-|.|+|++.-+-. 
T Consensus 184 ---------------~~~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~~~~G~~VivdD~~p~------ 231 (330) T protein:vir:10 184 ---------------VTAIAMHSAVYTKLQKD-------NLIQYIQPTTA----TINIPTYLGYRVIIDDGIAP------ 231 (330) T ss_pred ---------------ceEEEEcHHHHHHHHHh-------hhhhhhccccc----CcccccccceEEEEeCCCCC------ Confidence 37899999999999863 56777777765 35899999999999987621 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccc------hhh-----HHHHH Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYG------EMG-----FSSIK 377 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlg------Qrg-----~~gwK 377 (399) . ..+|-.++||.+|++... +.+..+ +...+ . .||.+ .|- --|+| T Consensus 232 --~-----------~~~yt~yl~~~GAi~~~~----~~~~~~--v~~Et----d--Rd~~~g~~~l~~r~~~~~hp~G~s 286 (330) T protein:vir:10 232 --T-----------GDIYTSYLFRTGSIGLNT----GNPSGL--TTFET----S--REAAKGNDMIYTRRALVMHPYGVK 286 (330) T ss_pred --C-----------CCceeEEEEecCceeeec----ccCCcc--ccccc----c--CCccccceEEEEeeEEEeeeeeee Confidence 1 137888899999997753 111001 11111 1 13432 110 00111 Q ss_pred HHHHH---hhccccceEEEEEeccC Q lcl|NC_019514. 378 WYYGT---LILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~---~iLn~~~m~~ie~~a~~ 399 (399) |--.+ .-..+-+ +-|++++-- T Consensus 287 ~~~~~~~~~~~sPt~-~~L~~~~NW 310 (330) T protein:vir:10 287 WTGAEVDAGNITPSN-ADLAKFKNW 310 (330) T ss_pred ecccccccCcCCcCh-HHhcCCcCc Confidence 11000 0000000 000000000 No 126 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=98.05 E-value=5e-07 Score=55.11 Aligned_cols=281 Identities=11% Similarity=0.073 Sum_probs=126.4 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccc----cccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVV----SMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~----~mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+ |.+ .| .-|| -|.+++|+.-+++|||.++.... ..-.++|.||++++..++...+.. T Consensus 1 Ma------N~l----lT----~ip~----iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~ 62 (423) T protein:vir:17 1 MP------NNL----DS----NVSQ----IVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTP 62 (423) T ss_pred Cc------cch----hh----hhHH----HHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeeccc Confidence 21 221 11 1133 47889999999999998876532 222568999999986554222111 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeee-cceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKF-GFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qY-G~~~e~Td~~~d~~~D~~l~~ 155 (399) ... ..+ -+.+.++... +...|-|. .+-++++|+=.-.+.++ T Consensus 63 ~~~---~~~------------~~~~~l~e~~-------------------v~l~id~~k~va~~v~d~E~~~~i~~---- 104 (423) T protein:vir:17 63 TGD---ISG------------QNKNNLISGK-------------------ATGRVGNYITVAVEYQQLEEAIKLNQ---- 104 (423) T ss_pred Ccc---cCC------------cccCccccce-------------------eEEEeeceeeeeeeecHHHHhcChhH---- Confidence 110 000 0111122111 12223322 23446777632222222 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRM 235 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~ 235 (399) +.+.+..++..+. +.+-.+|++-....-+. . +...++... 
.++++..+.+.|.++++|+ T Consensus 105 -~~~~l~~A~~aLA-~~vd~~ia~~~~~~a~~------~-~gt~~t~~~--a~~~i~~a~~~Ld~~~vP~---------- 163 (423) T protein:vir:17 105 -LEEILAPVRQRIV-TDLETELAHFMMNNGAL------S-LGSPNTPIT--KWSDVAQTASFLKDLGVNE---------- 163 (423) T ss_pred -HHHHHHHHHHHHH-HHHHHHHHHHHhhcccc------c-cccCCcccc--cHHHHHHHHHHHHhccCCc---------- Confidence 3344444433333 33434443321110010 0 011111111 4899999999999999997 Q ss_pred cCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccc-eeEcCeEEEecCccchhcccCCCcc--- Q lcl|NC_019514. 236 IDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEI-GTVDQFRLVVVPEMLHWAGAGATVG--- 311 (399) Q Consensus 236 ~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEI-G~i~~vRfV~~~~~~~~~~aGa~~~--- 311 (399) .-+++++.|+....|.. ++.+....+-+..+.+-+|+| |++.+|++.++.+.-.-....+..+ T Consensus 164 -------~~R~~Vv~p~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~ 230 (423) T protein:vir:17 164 -------GENYAVMDPWSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTV 230 (423) T ss_pred -------CCCEEEeChHHHHHHhc------cccceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeee Confidence 22788999999887742 234444556666777888887 9999999999988764322111100 Q ss_pred -CCccc-----cccC--------ccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH Q lcl|NC_019514. 312 -TNPGY-----RETN--------GKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK 377 (399) Q Consensus 312 -~~~~~-----~~t~--------~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK 377 (399) ..+.. ..++ .....|..|..|. .|+..|++. +.++.|+=- -+.++|..|. T Consensus 231 ~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD-~~t~aGv~~------v~~~tk~v~---~~~~t~~~~~------ 294 (423) T protein:vir:17 231 KTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGD-QVKFTNTYW------LQQQTKQAL---YNGATPISFT------ 294 (423) T ss_pred cccccccccccccccceeeeeeeeeeeccCceeecc-eEEecceee------ecccccccc---cccccccceE------ Confidence 00000 0000 0001122233332 222222211 111111000 0111222221 Q ss_pred HHHHHhhccccceEEEEEecc-------C Q lcl|NC_019514. 
378 WYYGTLILRPERLALVKTVAP-------L 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~-------~ 399 (399) |.+. +.+.. | T Consensus 295 -----------~~v~-~~~~~~a~~~~tv 311 (423) T protein:vir:17 295 -----------ATVT-ADANSDSSGDVTV 311 (423) T ss_pred -----------EEEE-ecccccccCceEE Confidence 1111 11110 1 No 127 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.01 E-value=5.7e-07 Score=54.82 Aligned_cols=272 Identities=13% Similarity=0.032 Sum_probs=144.7 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) .+......+.....++...+.+-|+ .+..+.+....+.-.+.+++...+++.+.++-..+..- T Consensus 118 ~~~~~~~~~~~~~~t~~~gg~liP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------------- 180 (394) T protein:vir:97 118 INETTPVEPQKDGIKKENAKPVSSE----EILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRA------------- 180 (394) T ss_pred HHhhhhhhhhccccccccccccChH----HHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecC------------- Confidence 0000011111112222222333444 23555555577888899999999988876643322211 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) +...+|..||-.. ++ ....+...++...++++.++.+|+++++ +++..+...+... 
T Consensus 181 -~~~~~~v~E~~~~---~~-------------------~~~~~~~~v~l~~~k~~~~i~is~ell~-ds~~~~~~~i~~~ 236 (394) T protein:vir:97 181 -TTKMVTVAELEKN---PA-------------------LAKPDFKDVAWNIDTYRGAIPLSQESID-DADVDLVGIVSES 236 (394) T ss_pred -CCccceecccccc---cc-------------------cccccceeEEeehhheeeehhhHHHHHh-hhhHHHHHHHHHH Confidence 1233566665221 11 1113345677889999999999999776 3334477777777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) +.+..+...+. -++.|.+. .+..+..++++|..+...+..... T Consensus 237 la~~~~~~~~~----~i~~g~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~----------------- 279 (394) T protein:vir:97 237 ISQIKVNTTND----AIAKVLKS----------------FTTKTVKNLDEIKALLNGGFDPAY----------------- 279 (394) T ss_pred HHHHHHHHHHH----HHhhcccc----------------ccccccccHHHHHHHHHhhhhhhh----------------- Confidence 77665544332 23444321 112244577887776643322111 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccC Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETN 320 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~ 320 (399) .++| ++||.+...|+.|+|--+.|-|.|- +-.|--+.+-|+.+++++.. +. +.+ T Consensus 280 -~a~~--v~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pv~~~~~~------~~--~~~------- 333 (394) T protein:vir:97 280 -NVSL--IVSQSFYQTLDTLKDGNGRYLLQDD--------ITAVSGKVLLGKPVFVLSDE------VL--GAN------- 333 (394) T ss_pred -CCEE--EEcHHHHHHHHHhhccCCCeeeecC--------cCCCCCceeccceeEEeccc------cc--CCc------- Confidence 1234 5899999999999987777767542 12334568888888876532 00 100 Q ss_pred ccceEEEEEEEcc--cceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 
321 GKYDIYPMLCVGA--ESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 321 ~~~DVyp~lV~G~--~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) .++||. ..|.....++. .++. ..+.+.+.++.++ +.+.+.++++.-++.|+.-.+ T Consensus 334 -------~~~~gd~~~~~~~~~~~~~----~~~~-----------~~~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 334 -------KAFIGDFKRGVLFADRKDL----GLRW-----------ADNEIYGQYLQAV-LRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred -------cEEEeeccccEEEEEecce----EEEE-----------ecccccceeEEEE-EEEccEEecccceEEEEeccc Confidence 135665 22332222221 1111 1133334344333 577888888888888875433 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) . T Consensus 391 ~ 391 (394) T protein:vir:97 391 P 391 (394) T ss_pred c Confidence 3 No 128 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.01 E-value=6.2e-07 Score=54.63 Aligned_cols=277 Identities=12% Similarity=0.030 Sum_probs=147.5 Q ss_pred CCcCCeee-cCCCC-cccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGMLY-NDPNT-TPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~~-n~~~~-t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) .....+.. +.-.. ..++..+-+-|+ .+....+....+.-.+.+++...+++.+.++ +.+.. 
T Consensus 122 ~~~~~~~~~~~~~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~---~~~~~---------- 184 (400) T protein:vir:38 122 LRAVPTDASDAVNAGVKAADAASTIPE----TISNTPQRELQTVVDLKPFTNVFQASTQKGT---YPTVA---------- 184 (400) T ss_pred hhhhhHHHHHHHhhcccccCCcccccH----HHHHHHHHHHHhhhhhhhcceeEeccCcceE---EEEEe---------- Confidence 00000111 11111 112222223343 2355555556777788999999998876542 22221 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) .+ ....++..||- ........+...++...++|+.++.+|+++++ +++..+...+. T Consensus 185 ~~-~~~~~~~~E~~----------------------~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~ 240 (400) T protein:vir:38 185 NA-TTKMVTVAELE----------------------KNPAMAKPEFKPVNWSVETYRQALPVSQESID-DSAIDLVGLIA 240 (400) T ss_pred cC-CCccccccccc----------------------cccccccccceeeEeehhheeeehhhHHHHHh-hhHHHHHHHHH Confidence 00 12234555542 11111223445677889999999999999665 45555777777 Q ss_pred HHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_019514. 159 TELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 159 ~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T 238 (399) +.+.+..+... -..++.|.+. .+.....++++|..+......... T Consensus 241 ~~l~~~~~~~~----~~~i~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~--------------- 285 (400) T protein:vir:38 241 QNGQQIKVNTT----NGAVATLLKG----------------FTAKTISSVDDLKHINNVDLDPAY--------------- 285 (400) T ss_pred HHHHHHHHHHH----HHhhhhcccc----------------ccccccccHHHHHHHHHhhhhhhh--------------- Confidence 77766544333 2334444421 122344577777666432211100 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 
239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) . -+-++||.+...|+.|+|--+.|-|.| .+. .+--+.+-|..++.++.+ ++..+| T Consensus 286 ---~--a~~v~~~~~~~~l~~lkd~~G~~i~~~--~~~------~~~~~~l~G~pv~~~~~~-~~~~~g----------- 340 (400) T protein:vir:38 286 ---S--RVIIASQSFYNFLDTVKDGNGRYLLQD--SIL------TPSGKSVLGMPIAVVSDD-TLGAAG----------- 340 (400) T ss_pred ---C--cEEEEcHHHHHHHHHhhccCCCeeeec--CcC------CCCccccccceeEEeccc-ccCCCC----------- Confidence 1 245689999999999998777777754 222 233467889999988865 221111 Q ss_pred cCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 319 t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) |+ .++||.-+-+.+.+...+. .++.. .+.+.+.++.++ +.+.+.+++++-++.|+..+. T Consensus 341 -----~~--~~~~gd~s~~~~~~~~~~~--~~~~~-----------~~~~~~~~~~~~-~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 341 -----EA--HAFLGDIKRAILFANRADF--MVRWV-----------DDQIYGQFLQAG-MRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred -----ce--EEEEEeccccEEEEeecce--EEEEe-----------cccccceeEEEE-EEeccEEecccceEEEEeecC Confidence 11 3567775433333333221 11111 122333444333 678888999998888877555 Q ss_pred C Q lcl|NC_019514. 399 L 399 (399) Q Consensus 399 ~ 399 (399) - T Consensus 400 a 400 (400) T protein:vir:38 400 A 400 (400) T ss_pred C Confidence 5 No 129 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.98 E-value=9.5e-07 Score=53.60 Aligned_cols=278 Identities=12% Similarity=0.075 Sum_probs=145.8 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) |..+.+.-..-...+++..+.+-|+ .+..+.+....+...+.+++...+|+.+.++--. ..-.+ T Consensus 104 ~~~~~~~~~~ra~~t~~~gg~liP~----~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~-~~~~~----------- 167 (421) T protein:vir:13 104 IRGIQLSEEERDIMSSTNNGAVIPQ----EFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPV-RAGAS----------- 167 (421) T ss_pred hhccchhHHHhhccccCCcceecch----hhHHHHHHHHHhhhhhhhhceeeeccCCceEEEE-eecCC----------- Confidence 2221111111012222222333343 2345566667788888999999888876553221 11111 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+...++.||- ......++...++.++++|+.|+.+|+++++ +++.+|...+.++ T Consensus 168 -~~~~~~~~E~~-----------------------~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~ 222 (421) T protein:vir:13 168 -VDKLANLAKDT-----------------------ELVKAMLKTQPMAYDIDDYGLLAPIDNSLLE-DSEINFLEFVNEE 222 (421) T ss_pred -ccceeeccccc-----------------------cccccccceeEEEeeeeeeEeehhhhHHHHh-hhHHHHHHHHHHH Confidence 11122344431 1222234456688899999999999999765 4555688888888 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) +.+......+..+. ..+.|. .++.+..++++|.++.-.|+..-.+. 
T Consensus 223 la~~~~~~~~~~i~-~~~~g~------------------~~~~~~~~~d~i~~~~~~l~~~~~~~--------------- 268 (421) T protein:vir:13 223 FAEFAVNTENAEIV-KQAKAV------------------LAEETINDYAGLVKTINSLVPNARKR--------------- 268 (421) T ss_pred HHHHHHHHhhhhHh-hhhhhc------------------cccccccchHHHHHHHHHhhhhhcCC--------------- Confidence 88776544332221 112221 11223457899999998887643221 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccC Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETN 320 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~ 320 (399) =+.++||.....|+.|+|-.+.|-|.+.. .|.-+++-|..+++++.|.. +++ T Consensus 269 ----a~~v~n~~~~~~l~~lkd~~G~~i~~~~~---------~~~~~tl~G~pV~~~~~~~~--~~~------------- 320 (421) T protein:vir:13 269 ----AIIVTNSDGRAYLDGLMDKQGRPLLKELS---------DGGDLVFKGRPVIELEESIF--DVG------------- 320 (421) T ss_pred ----CEEEEcHHHHHHHHHhhcCCCceeecCcC---------CCCCceecceeeEEeccccc--cCC------------- Confidence 23478999999999999888888887642 34456888999998887521 111 Q ss_pred ccceEEEEEEEcccc-eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEe- Q lcl|NC_019514. 321 GKYDIYPMLCVGAES-FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPERLALVKTV- 396 (399) Q Consensus 321 ~~~DVyp~lV~G~~A-fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie~~- 396 (399) +.+ .+++|.-+ +-.+...+ + +.+-+ ..+++-+++...++ +.+.+.+.+++....+... T Consensus 321 ---~~~-~~~~gd~~~~~~~~~~~-~----~~v~~---------~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 321 ---DET-KFIVSDFKTLIKFMDRK-Q----YLIDQ---------SKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred ---Cce-EEEEEeccccEEEEEec-c----eEEEe---------ecccccccCeeEEEEEeeecceeecchhhheeeecc Confidence 012 34566533 22232222 1 22111 11344455555544 2333444444433222222 Q ss_pred --ccC Q lcl|NC_019514. 
397 --APL 399 (399) Q Consensus 397 --a~~ 399 (399) +.+ T Consensus 383 ~~a~v 387 (421) T protein:vir:13 383 FGVIV 387 (421) T ss_pred cceee Confidence 111 No 130 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.95 E-value=2.7e-06 Score=51.07 Aligned_cols=282 Identities=13% Similarity=0.066 Sum_probs=145.7 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) ++.+...-+.-...+++..+-+-|+ .+....+....+...+.+++...+|+.+.++-. ..+.. T Consensus 99 lr~~~~~~~~~~~~t~~~gg~~vP~----~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~~~~------------ 161 (389) T protein:vir:10 99 IHSHGKVIDATSKVTSTEAGVLIPE----EIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYP-ILKRA------------ 161 (389) T ss_pred hhcchhhhhhhcccccCCcceeehH----HHHHHHHHHHHhhhhHHhhcceeeccCCeeEEE-EEecC------------ Confidence 3333322222112222222223343 345566666778888899999999987655422 11110 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 
81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) ....++..|+ |.+......+...++.++++++.|+.+|+++++ +++.+|...+..+ T Consensus 162 -~~~~~~~~E~----------------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~ 217 (389) T protein:vir:10 162 -TDRFSSVAEL----------------------AENPKLAEPEFNKVDWSVATYRGAIPLSEEAIA-DSAVDLTALVGQS 217 (389) T ss_pred -CCcccccccc----------------------ccccccccccceeeeeeheeeEeeehhhHHHHh-hhhHHHHHHHHHH Confidence 1122344443 222222234455678889999999999999765 4555677777777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCcc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~~s~~~~T~ 239 (399) |.+..+...+ ..+++|.+.. ...++.+..++++|..+.. .|+.. + T Consensus 218 la~~~~~~~~----~~i~~g~~~~-------------~~~~~~~~~~~d~l~~~~~~~~~~~--------------~--- 263 (389) T protein:vir:10 218 IKEKSVNTYN----AMIAPVLQSF-------------TAKKTTTDTLVDSLKHILNVDLDPA--------------Y--- 263 (389) T ss_pred HHHHHHHHHH----HHHhhhhccc-------------ccccccccccHHHHHHHHHhhhhhh--------------h--- Confidence 7766554432 2234443211 0112234457787776552 22210 1 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) .+ +-++|+.....|+.|+|-.+.|-|.+- +.+.. -.+..+++-|+.++.++...+ +.. 
T Consensus 264 --~a--~~~~n~~~~~~L~~lkd~~G~~i~~~~--~~~~~--~~~~~~~l~G~pV~~~~~~~~----~~~---------- 321 (389) T protein:vir:10 264 --SR--ALVVTQSLFNTLDTLKDKNGRYLLHDA--SDSIT--DGTAKGTILGVPVYVVGDTLL----GSL---------- 321 (389) T ss_pred --Cc--EEEecHHHHHHHHHhhccCCCeeeecC--ccccc--ccccccccccceeEEeccccc----CCC---------- Confidence 01 357999999999999987777777643 22211 123456788998876654322 000 Q ss_pred CccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_019514. 320 NGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVA 397 (399) Q Consensus 320 ~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a 397 (399) ++. ..++||.= +|-...-++ +.+-+- ..+.+.+ ++.+. +.+.+.+++++=++.++... T Consensus 322 ~~~----~~~~~gd~~~~~~~~~~~~------~~i~~~--------~~~~~~~-~~~~~-~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 322 AGD----QKAFVGDLKRGVLFTDRQQ------VTLAWE--------DSKIYGK-YLGAA-FRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred CCc----eEEEEeeccccEEEEeecc------eEEEee--------ccccccc-eEEEE-EEeccEEecccceEEEEeec Confidence 111 13677753 332221122 111111 1122322 22211 45666677777766666544 Q ss_pred cC Q lcl|NC_019514. 398 PL 399 (399) Q Consensus 398 ~~ 399 (399) .- T Consensus 382 ~~ 383 (389) T protein:vir:10 382 VP 383 (389) T ss_pred cC Confidence 44 No 131 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.91 E-value=1.5e-06 Score=52.56 Aligned_cols=279 Identities=11% Similarity=0.076 Sum_probs=148.2 Q ss_pred CCc-----CC-------eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcc Q lcl|NC_019514. 1 MAS-----KG-------MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYI 68 (399) Q Consensus 1 ~~~-----~~-------~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~ 68 (399) ++. ++ ..-+.-.+++++..+.+-|. 
-++..+.++...|..++.+++ .+.+|-..|+ +++=+. T Consensus 335 a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~---~~~~~~iie~lr~~s~i~~l~-~~~~~~~~g~-~~ip~~- 408 (632) T protein:vir:96 335 ADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVAT---ELLSEEFIDILRNKAIIGQMG-ARMLPGLVGD-VDIPKK- 408 (632) T ss_pred HHhhhhhhhhhhhhHHHHHHhhhhccccccccccccc---ccchHHHHHHHhhcchhhhhc-ceEeecCCcc-eEEEEE- Confidence 000 00 00111112222222223332 122344555566778888883 3456766665 333222 Q ss_pred ccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh Q lcl|NC_019514. 69 PLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD 148 (399) Q Consensus 69 pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~ 148 (399) +. .+...|..|| +.+....+++..++.+.++|+.++.+|+++++. T Consensus 409 ---------~~--~~~a~wv~E~-----------------------~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d- 453 (632) T protein:vir:96 409 ---------TS--GANFYWIGED-----------------------EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ- 453 (632) T ss_pred ---------eC--CceeEeecCC-----------------------ccccccccceeeEEeeeeEEEEehhhHHHHHhc- Confidence 01 1233455554 223334456667888899999999999997653 Q ss_pred cchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCe-------EEecCCCcccccccccccCCceecHHHHHHHHHHHHhc Q lcl|NC_019514. 149 SDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGT-------IVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDEN 221 (399) Q Consensus 149 ~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~-------v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~n 221 (399) +++.+.+.+...|.+..+...+. .+++|.+. ...+|. .+-......++++++..+...|... 
T Consensus 454 s~~~~~~~i~~~l~~a~~~~~d~----a~l~G~G~~~~p~Gi~~~~~~-------~~~~~~~~~~~~~~i~~~~~~i~~~ 522 (632) T protein:vir:96 454 SSIHVENLIREDLIEGIGVALDL----AMLTGTGLANDPVGLLNMTGV-------PALTYPAGGVDWASVVDMETKISTF 522 (632) T ss_pred cchHHHHHHHHHHHHHHHHHHHH----HhhcccCCCCccceeeecccc-------cceecccccCCHHHHHHHHHHHhhc Confidence 45568888888888776655543 34555432 111111 1111122346888888888777665 Q ss_pred cCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccc Q lcl|NC_019514. 222 RTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEML 301 (399) Q Consensus 222 rap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~ 301 (399) ++.. + . -+-++||.....++.++- +-+...+|+.+ |.+.|.+++.++.+. T Consensus 523 ~~~~-----------~----~--~~~~~~~~~~~~l~~~~l-----------~d~~G~~i~~~--~~l~G~pv~~s~~ip 572 (632) T protein:vir:96 523 NADA-----------G----R--LAYLTSVTQRGAAKKAQV-----------FDNTGERIWQN--NEVNGYRAEASNQIP 572 (632) T ss_pred cccc-----------C----c--cEEEEchhHHHHHHHHhc-----------cCCCCceeecC--CeecccceEeccccc Confidence 5421 0 1 223578887777765320 11223344542 678899988887752 Q ss_pred hhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHH Q lcl|NC_019514. 302 HWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYG 381 (399) Q Consensus 302 ~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~ 381 (399) . + .+++|.-+.-.++..++ +.+.+- | ...---|+..+..| +.+. T Consensus 573 ~----------~--------------~~~~gd~s~~~i~~~~~-----~~i~~~-~-----~~~~~~~~v~~~~~-~~~d 616 (632) T protein:vir:96 573 A----------D--------------TWIFGDWSQIVIAMWGV-----LDLKVD-P-----YTKAASDGLVLRVF-QDVD 616 (632) T ss_pred c----------C--------------cEEEeecceEEEEEecc-----eEEEEc-c-----ccccccCceEEEEE-eecC Confidence 1 0 15677766666665542 333321 1 01111233333333 4577 Q ss_pred HhhccccceEEEEEec Q lcl|NC_019514. 
382 TLILRPERLALVKTVA 397 (399) Q Consensus 382 ~~iLn~~~m~~ie~~a 397 (399) +.+.+++.++.++.+| T Consensus 617 ~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 617 AGVRRKEAFCIAKKGA 632 (632) T ss_pred ceeechhhhhheeecC Confidence 8899999999999999 No 132 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.91 E-value=1.2e-05 Score=47.64 Aligned_cols=313 Identities=12% Similarity=0.035 Sum_probs=155.3 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAG 85 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aag 85 (399) |.+=|+.+.+-...+..---+...-|..+.+..=+-.-+|..+=.++. -..|++..|-|-.-. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRt--i~~gkS~qf~~~G~s--------------- 63 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGET--------------- 63 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeee--------------- Confidence 444343343333332222223333355566655444445555556664 567888888887432 Q ss_pred ceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH-HHHHHHHHHHHh Q lcl|NC_019514. 86 ATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE-LFSHISTELMNG 164 (399) Q Consensus 86 a~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~-l~~~~~~~lg~~ 164 (399) -..+..+|.++|...+......+|-.+-.+=+. +.+. +.|...+=. +-.++++++|+. 
T Consensus 64 --~~~~~~pG~~ld~~~~~~dK~~ItID~lL~a~~---------------~V~d----lDe~q~~yD~vRse~s~e~G~A 122 (401) T protein:vir:70 64 --ELQVLAPGQSPAATSTQADKNQLVIDATVIARN---------------TVAH----LHDVQGDIDSLKPKLATNQAKQ 122 (401) T ss_pred --EeeeecCCCCcCCCCcccccEEEEeCceeehhh---------------hhhh----HHHHHhcccccchHHHHHHHHH Confidence 223333344444333333333333222111100 0111 222222211 344556666666 Q ss_pred hhHHHHHHHHHH-HHhcCCeEEecCCCc------ccccccccccCCceecHHH----HHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 165 AVQLTEAVLQKD-LLAGAGTIVYTGAAT------QDSEITGEGATPSVVDYDD----LMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 165 a~~~~e~~l~~~-~lag~~~v~yag~at------s~~~~t~~~~~~~~vt~~~----lr~a~~~L~~nrap~~t~~i~~s 233 (399) -++.++..+.+. .++|..+........ ....++ ....+..++... ++.+...|.++..|. T Consensus 123 LA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~i~v~-~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~-------- 193 (401) T protein:vir:70 123 LKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFSINVE-VAEGEALVNPQYVMAAVEFALEQQLEQEVDI-------- 193 (401) T ss_pred HHHHHHHHHHHHHHHhccccccccccCCCcCCCceEEecc-ccccccccCHHHHHHHHHHHHHHHHhcCCCc-------- Confidence 666654433333 345532221110000 011111 112223344333 456666777777762 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcC--CccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYA--DAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya--~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) + =|+.++.|.--.-|.+ .+..++. .|+ .....-+|+|+++.|||+|+++++--..+.-.. 
T Consensus 194 --------~-r~vvl~pp~~Ys~Ll~------~d~L~nr-d~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~-- 255 (401) T protein:vir:70 194 --------S-DVAILMPWRYFNVLRD------ADRIVDK-TYTISQSGATIQGFTLSSYNCPVIPSNRFPKYSQGQTH-- 255 (401) T ss_pred --------c-ceEEEcCHHHHHHHHh------cCcccch-hhccccCCccccceEEEEeceEEEeecccccccccccc-- Confidence 1 1777777766656654 2345554 443 446678899999999999999997432211000 Q ss_pred CCccccccCccceE------EEEEEEcccceeeeccccCCCCccceEEEecCCCCCCC-CCCccchhhHHHHHHHHHHhh Q lcl|NC_019514. 312 TNPGYRETNGKYDI------YPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATAD-RNDPYGEMGFSSIKWYYGTLI 384 (399) Q Consensus 312 ~~~~~~~t~~~~DV------yp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad-~~DPlgQrg~~gwK~~~~~~i 384 (399) ..-....+++.+|| -..|+|=.+|-+++.+.. + +.+ =-|+=-|..++=-|+.|+... T Consensus 256 ~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~---------l-------t~~~~~d~r~~~~~id~~~a~g~g~ 319 (401) T protein:vir:70 256 HLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSID---------V-------TGDIFYEKKEKTYYIDTFMAEGAIP 319 (401) T ss_pred ccccccCCCccCCCCccccceeEEEEehhheEEEEeec---------c-------ccchhhhhhhhHHHHHHHHHhCCcc Confidence 00001222334442 234566566555542221 1 111 126677888888899999999 Q ss_pred ccccceEEEEEeccC Q lcl|NC_019514. 385 LRPERLALVKTVAPL 399 (399) Q Consensus 385 Ln~~~m~~ie~~a~~ 399 (399) +|++.-+.+++.-+- T Consensus 320 ~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 320 DRWEAVSVVTTKRNT 334 (401) T ss_pred cchhheEEEeecCcc Confidence 999999998876653 No 133 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.86 E-value=1.7e-06 Score=52.26 Aligned_cols=313 Identities=14% Similarity=0.113 Sum_probs=150.1 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) ...+...-+..-+++++....+.++ +..+.++...+...+.+++...+++.+ .+.+-+- -+ T Consensus 142 ~~~~~~~~~~~~~~~~~gg~~vp~~-----~~~~ii~~~~~~~~i~~l~~~~~~~~~---~~~~~~~-----------~~ 202 (497) T protein:vir:78 142 ETAPAAIGQNPFGSTGTFAPGILPT-----FLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTE-----------SA 202 (497) T ss_pred hhhHHHHHhhhcccCcccccccchh-----hhHHHHHHHHhhhhHHhhccccccCCC---ceEEEEE-----------cC Confidence 1111111111111222222223333 345667778888899999998887653 3444332 11 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -++...|+.|| +......++...++...++++.|+.+|+++++ ++. .+...+... T Consensus 203 ~~~~a~wv~E~-----------------------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~-d~~-~l~~~i~~~ 257 (497) T protein:vir:78 203 AHNNAAAVAEA-----------------------GTYPFSSEEFARVYEQVGKVANALTITDEGLR-DAP-ELFNFVQGR 257 (497) T ss_pred CCCcceeeccC-----------------------cccccccccceeeEeeeeeeEeecHhHHHHHH-hHH-HHHHHHHHH Confidence 12233466665 22223344566788899999999999999876 443 366666666 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCc-ccccccc--------------cccCCceecHHHHHHHHHHHH Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIV------YTGAAT-QDSEITG--------------EGATPSVVDYDDLMRLSITLD 219 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~------yag~at-s~~~~t~--------------~~~~~~~vt~~~lr~a~~~L~ 219 (399) |.+.-+. .+-..+++|.++-. .+++.+ .....+. ..+....+..+.+..+..... 
T Consensus 258 l~~~i~~----~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (497) T protein:vir:78 258 LLEGIQR----KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV 333 (497) T ss_pred HHHHHHH----HHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHh Confidence 6554332 33345677755321 111000 0000000 000001111111111111111 Q ss_pred hccCccccceeccccccC----------cccc------CceeEEEeCCCchHHHHHhhccCCCccceehhhcCCcccccc Q lcl|NC_019514. 220 ENRTPKQTKVITGSRMID----------TRTI------SAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILN 283 (399) Q Consensus 220 ~nrap~~t~~i~~s~~~~----------T~~I------~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~ 283 (399) .+-.+.....+.+....+ ...+ +++ +.++||....-|+.|+|-.+.+-|.+.-.-.... .. T Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~ 410 (497) T protein:vir:78 334 VTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PV 410 (497) T ss_pred hhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cc Confidence 111100000000000000 0000 111 4568999999999999988888786542222221 33 Q ss_pred ccceeEcCeEEEecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeecc-ccCCCCccceEEEecCCCCCC Q lcl|NC_019514. 284 GEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGF-QTDGKTLKFKVTTKMPGEATA 362 (399) Q Consensus 284 gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l-~g~g~~~~~~~ivk~pG~~~a 362 (399) +.-+++-|+|+|+++.|.. | . .+||.=..+.+.+ ...+ +.+.+- T Consensus 411 ~~~~~l~G~pV~~t~~~~~----~--------------~------~~~Gd~~~~~~~i~~r~~----~~v~~~------- 455 (497) T protein:vir:78 411 NGGKNIWGVPVVTTPLIPL----G--------------T------ILVGHFAPSVIQTARREG----VTMQMT------- 455 (497) T ss_pred cCCceeeceeeEecCCCCC----C--------------c------eEEeecccceEEEEEecc----cEEEee------- Confidence 3455888999999988721 0 1 1334322111111 1111 212111 Q ss_pred CCCCccchhhHHHHHH--HHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
363 DRNDPYGEMGFSSIKW--YYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 363 d~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ie~~a~~ 399 (399) +-..++-++..+++++ .+.+.+++++-+++++..+.. T Consensus 456 ~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred cccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 1112345677777774 478899999999999999888 No 134 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.86 E-value=1.7e-06 Score=52.26 Aligned_cols=313 Identities=14% Similarity=0.113 Sum_probs=150.1 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) ...+...-+..-+++++....+.++ +..+.++...+...+.+++...+++.+ .+.+-+- -+ T Consensus 142 ~~~~~~~~~~~~~~~~~gg~~vp~~-----~~~~ii~~~~~~~~i~~l~~~~~~~~~---~~~~~~~-----------~~ 202 (497) T protein:vir:10 142 ETAPAAIGQNPFGSTGTFAPGILPT-----FLPGIVEQLFYELSLADLISSRPVTSP---NLSYLTE-----------SA 202 (497) T ss_pred hhhHHHHHhhhcccCcccccccchh-----hhHHHHHHHHhhhhHHhhccccccCCC---ceEEEEE-----------cC Confidence 1111111111111222222223333 345667778888899999998887653 3444332 11 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) -++...|+.|| +......++...++...++++.|+.+|+++++ ++. .+...+... 
T Consensus 203 ~~~~a~wv~E~-----------------------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~-d~~-~l~~~i~~~ 257 (497) T protein:vir:10 203 AHNNAAAVAEA-----------------------GTYPFSSEEFARVYEQVGKVANALTITDEGLR-DAP-ELFNFVQGR 257 (497) T ss_pred CCCcceeeccC-----------------------cccccccccceeeEeeeeeeEeecHhHHHHHH-hHH-HHHHHHHHH Confidence 12233466665 22223344566788899999999999999876 443 366666666 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCc-ccccccc--------------cccCCceecHHHHHHHHHHHH Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIV------YTGAAT-QDSEITG--------------EGATPSVVDYDDLMRLSITLD 219 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~------yag~at-s~~~~t~--------------~~~~~~~vt~~~lr~a~~~L~ 219 (399) |.+.-+. .+-..+++|.++-. .+++.+ .....+. ..+....+..+.+..+..... T Consensus 258 l~~~i~~----~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (497) T protein:vir:10 258 LLEGIQR----KEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRV 333 (497) T ss_pred HHHHHHH----HHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHh Confidence 6554332 33345677755321 111000 0000000 000001111111111111111 Q ss_pred hccCccccceeccccccC----------cccc------CceeEEEeCCCchHHHHHhhccCCCccceehhhcCCcccccc Q lcl|NC_019514. 220 ENRTPKQTKVITGSRMID----------TRTI------SAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILN 283 (399) Q Consensus 220 ~nrap~~t~~i~~s~~~~----------T~~I------~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~ 283 (399) .+-.+.....+.+....+ ...+ +++ +.++||....-|+.|+|-.+.+-|.+.-.-.... .. 
T Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~ 410 (497) T protein:vir:10 334 VTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PV 410 (497) T ss_pred hhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cc Confidence 111100000000000000 0000 111 4568999999999999988888786542222221 33 Q ss_pred ccceeEcCeEEEecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeecc-ccCCCCccceEEEecCCCCCC Q lcl|NC_019514. 284 GEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGF-QTDGKTLKFKVTTKMPGEATA 362 (399) Q Consensus 284 gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l-~g~g~~~~~~~ivk~pG~~~a 362 (399) +.-+++-|+|+|+++.|.. | . .+||.=..+.+.+ ...+ +.+.+- T Consensus 411 ~~~~~l~G~pV~~t~~~~~----~--------------~------~~~Gd~~~~~~~i~~r~~----~~v~~~------- 455 (497) T protein:vir:10 411 NGGKNIWGVPVVTTPLIPL----G--------------T------ILVGHFAPSVIQTARREG----VTMQMT------- 455 (497) T ss_pred cCCceeeceeeEecCCCCC----C--------------c------eEEeecccceEEEEEecc----cEEEee------- Confidence 3455888999999988721 0 1 1334322111111 1111 212111 Q ss_pred CCCCccchhhHHHHHH--HHHHhhccccceEEEEEeccC Q lcl|NC_019514. 363 DRNDPYGEMGFSSIKW--YYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 363 d~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ie~~a~~ 399 (399) +-..++-++..+++++ .+.+.+++++-+++++..+.. 
T Consensus 456 ~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 456 NSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred cccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 1112345677777774 478899999999999999888 No 135 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.81 E-value=6.6e-06 Score=48.99 Aligned_cols=300 Identities=14% Similarity=0.058 Sum_probs=143.5 Q ss_pred CCcCCeeecCCC-----CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPN-----TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~~-----~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~ 75 (399) |+.+ .|+... +++++..+.+-|+ . +..+.++...+..++.+++ .+.+|-..|+ +++-+... T Consensus 52 ~a~~--~~~~~~~~~a~~~~~~~Gg~lvP~---~-~~~~ii~~l~~~s~l~~lg-~~~v~~~~g~-~~~p~~t~------ 117 (366) T protein:vir:57 52 FAAT--ELGDTGLSMAISTAAGSGGALIPQ---N-MQNEVIELLRDRTVVRILG-ARSIPLPNGN-LSMPRLSG------ 117 (366) T ss_pred HHHH--hhcchhhhhhccccccCCccccch---h-HHHHHHHHHhhhcchhhhc-eeeeecCCCc-eEEEEEeC------ Confidence 2111 122221 1112222222243 1 2455666677777787874 3345555554 44444421 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 
76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) .+...|..||- ......++...++...++++.++.+|+++++ ++++++.+ T Consensus 118 ------~~~a~wv~E~~-----------------------~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~~~~ 167 (366) T protein:vir:57 118 ------GATAGYVGEGK-----------------------DVVATGATFDDVKLSAKTMIALVPVSNQLIG-RAGFNVEQ 167 (366) T ss_pred ------CcceeeeccCc-----------------------cccccccceeEEEEeeEEEEEeehhhHHHHh-hhhHHHHH Confidence 12234555552 1222334456688899999999999999776 45556777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCe-------EEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGT-------IVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~-------v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) .+.+.|.+.-+...+ .-++.|.+. .-+++..+... +. ..+..+.+.+......|....... .. T Consensus 168 ~i~~~l~~a~~~~~d----~a~l~G~G~~~~p~Gi~~~~~~~~~~~--~~---~~t~~~~~~~~~~~~~~~~~~~~~-~~ 237 (366) T protein:vir:57 168 LLLGDILSAIATRED----KAFLRDDGTGDTPKGMKAVATAANRLV--AW---TGTAINLTTIDEYLDSLILKHMDS-NS 237 (366) T ss_pred HHHHHHHHHHHHHHH----HHhhccCCCCccccceeecccccccee--ec---cccccchhhHHHHHHHHHHhhhcc-cc Confidence 777777776554433 345555432 22222211111 11 122344444443333332211110 00 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) . .. .-.-++|+.....|+.|+|.-+.+-|.+. .-|.+-|+++++++.+..-. |. 
T Consensus 238 -----~------~~-~a~~vmn~~~~~~L~~lkd~~G~~l~~~~------------~~g~l~G~Pvv~s~~ip~~~--~~ 291 (366) T protein:vir:57 238 -----N------MI-RCGWGLSNRTYMTLFGLRDGNGNKVYPEM------------SQGILKGYPIQRTSAIPANL--GD 291 (366) T ss_pred -----c------cc-cCEEEecHHHHHHHHhhhccCCceeccCC------------CCCeecceeeEEcccccccc--cc Confidence 0 00 12236999999999999887777777432 22678999999988763211 11 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCC-CccchhhHHHHH--HHHHHhhc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRN-DPYGEMGFSSIK--WYYGTLIL 385 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~-DPlgQrg~~gwK--~~~~~~iL 385 (399) . ++.. .++||.=+.-.++..++ ..+++. ..+....+++. -.+-|+..+.+| +.+.+.+. T Consensus 292 ~----------~~~~----~i~~gdfs~~~i~~~~~---i~i~~~-~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~ 353 (366) T protein:vir:57 292 D----------GNES----EIYFCDFNDVVIGEDGM---MKVDFS-TEATYKDADGQLVSAFARNQSLIRVVTEHDIGFR 353 (366) T ss_pred C----------CCcc----EEEEEecceEEEEEecc---eEEEEe-eccccccccccchhhhhcCceeEEeeeeeCcEee Confidence 1 1111 24456655544544442 111111 11111000000 012244455555 34555666 Q ss_pred cccceEEEEEecc Q lcl|NC_019514. 386 RPERLALVKTVAP 398 (399) Q Consensus 386 n~~~m~~ie~~a~ 398 (399) +++-.+.+.-+-= T Consensus 354 ~~~a~~~lt~~~~ 366 (366) T protein:vir:57 354 HPEGLVLGTGVIW 366 (366) T ss_pred ccccEEEEecccC Confidence 6665555543333 No 136 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.51 E-value=2e-05 Score=46.33 Aligned_cols=306 Identities=12% Similarity=0.032 Sum_probs=146.1 Q ss_pred CCcC-----CeeecCC------C-Cccccccccc-ccceehhhhhHHHHHHHHHHHHhhhhcccccccccCC-CEEEEEE Q lcl|NC_019514. 
1 MASK-----GMLYNDP------N-TTPSGIDAPD-GKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYG-KEIRVYH 66 (399) Q Consensus 1 ~~~~-----~~~~n~~------~-~t~tT~~~~i-~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~G-ktIk~rr 66 (399) ++++ ....+.+ . ++.++..+++ -|+ .+..+.++...|..++.+++.....+-+.- -.++.-+ T Consensus 315 ~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~----~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~ 390 (645) T protein:vir:93 315 VARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQ----EYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHA 390 (645) T ss_pred HHHhhcccchhhhhhhhhhhhccccccccccCCccCch----hhHHHHHHhhhhhhhHHhhccccccccccccCceeeee Confidence 1111 1111000 0 1111111221 222 234566666779999999986532222110 0111111 Q ss_pred ccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhh Q lcl|NC_019514. 67 YIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLD 146 (399) Q Consensus 67 y~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d 146 (399) . .. .+...|..|| +.......+...++.+.++++.++.+|+++++ T Consensus 391 ~----------t~--~~~a~wv~Eg-----------------------~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ 435 (645) T protein:vir:93 391 Q----------VS--GGAAGWVGEG-----------------------KTKPLTKFDFESITFSHAKVSAIAVLTEELIR 435 (645) T ss_pred e----------ec--CcceEEeccC-----------------------ccccccccceeEEEEeeEEEEEeehhHHHHHh Confidence 1 01 1233455554 33334445667788999999999999999766 Q ss_pred hhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 147 FDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 147 ~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~ 226 (399) ++.+.+.+.+...|.+..+..++. .+++|.+.-.. +........+.........+..++..+...|..++.... 
T Consensus 436 -ds~~~~~~~i~~~l~~aia~~~d~----a~l~g~g~~~~-~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~~ 509 (645) T protein:vir:93 436 -FSSPAADALVRNALAEAVVARLDT----DFVDPKKAAVA-DVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQPT 509 (645) T ss_pred -hchHHHHHHHHHHHHHHHHHHHHH----HhhcCCCcccC-CccccceeccccccccccchHHHHHHHHHHHHhcCCCcc Confidence 444557777777777776655543 34444322110 000000000011111122345677777777766554320 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .+ +-++||.+...|+.|+|.-+.+-|..+ +. .=|++-|.+++.+..+-. T Consensus 510 ---------------~a--~~vmn~~~~~~L~~lkd~~G~~~~~~~---~~-------~~~tL~G~PV~~s~~vp~---- 558 (645) T protein:vir:93 510 ---------------GA--VWLMSSTNALALSMRKNALGQKEYPDM---TL-------LGGSFQGLPVIVSQYVGD---- 558 (645) T ss_pred ---------------cc--EEEEcHHHHHHHHhccccCCceeecCC---CC-------CCceeeceeeEEeccCCc---- Confidence 11 345699999999999887666666322 11 125788999998877510 Q ss_pred CCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCC---ccchhhHHHHH--HHHH Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRND---PYGEMGFSSIK--WYYG 381 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~D---PlgQrg~~gwK--~~~~ 381 (399) + ..-...+ ++ ++|.. +.+-+.-+ ....+++..-+-|-.+.+... -|-|+..+++| +.+. T Consensus 559 ------~-~~~gd~s--~~----~ig~~--~~v~i~~s-~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d 622 (645) T protein:vir:93 559 ------Q-LVLVNAP--DI----YLADD--GGVAVDMS-REASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWIN 622 (645) T ss_pred ------c-eeEeccc--cE----EEEEe--cceEEEee-cceeEEEeecccccccccccccchhHhhcCceEEEEEEEEc Confidence 0 0000000 11 12221 11111111 111222221111100011111 14578888888 5667 Q ss_pred HhhccccceEEEEEeccC Q lcl|NC_019514. 
382 TLILRPERLALVKTVAPL 399 (399) Q Consensus 382 ~~iLn~~~m~~ie~~a~~ 399 (399) +.+.+++-.++|. +++. T Consensus 623 ~~~~~p~a~~~lt-~~~~ 639 (645) T protein:vir:93 623 WRRRRTAAVAVIT-GVNY 639 (645) T ss_pred ceeeCccceEEEe-cccC Confidence 7777777777665 5555 No 137 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.49 E-value=1.4e-05 Score=47.17 Aligned_cols=274 Identities=17% Similarity=0.076 Sum_probs=141.3 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) ++++...-.. ..+.++....+.++ ++.+.. +......+.+++...+++.+.++-- .... + T Consensus 124 ~~~~~~~~~~-~~~~~~~~~~vp~~-----~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~---~~~~----------~ 183 (397) T protein:vir:96 124 VKSKGAEKRD-GFTSVEGGALIPQE-----LLQPQL-EPKDIVDLSKYVRSVPVNSASGKFP---VISK----------S 183 (397) T ss_pred HHhhhhhhhh-cccccccccchhHH-----HHHHHH-HhhhhhhHHHhhhhccccccceeEE---EEec----------c Confidence 2221111111 11222222222222 233332 2333344566777777766654322 1100 0 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 
81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) ...-++..++ |.++.....+...++.++++++.++.+|+++++- ++.++...+..+ T Consensus 184 -~~~~~~~~E~----------------------~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~d-s~~~l~~~i~~~ 239 (397) T protein:vir:96 184 -GSKMATVQQL----------------------EKNPQLANPKMVEIDYSVATRRGYIPISQEMIDD-ASYDVTGLIADE 239 (397) T ss_pred -CCcccccccc----------------------ccccccccccccceeecHhHhhcchhhHHHHHhh-hHHHHHHHHHHH Confidence 1112344444 2222222334556778899999999999997764 444577777777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRT 240 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~ 240 (399) +.+..+... -..+++|.+.. +....+++++|..+.-..... +. T Consensus 240 l~~~~~~~~----~~~i~~g~g~~----------------~~~~~~~~d~~~~~~~~~~~~-~~---------------- 282 (397) T protein:vir:96 240 IQDQSLNTK----NADIAAVLKTA----------------TAKSVVGVDGLKDLINKEIKK-VY---------------- 282 (397) T ss_pred HHHHHHHHH----HHHHhhccccc----------------ccccccchHHHHHHHHHhhhh-hc---------------- Confidence 776655443 33455665421 223456888888776432221 10 Q ss_pred cCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccccC Q lcl|NC_019514. 241 ISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRETN 320 (399) Q Consensus 241 I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t~ 320 (399) .+ +-++||.+...|+.|+|-.+.|-|.|- +-.+..+.+-|..++.++.+.+ +.+. 
T Consensus 283 -~a--~~v~n~~~~~~l~~lkd~~G~~~~~~~--------~~~~~~~~l~G~pv~~~~~~~~----~~~~---------- 337 (397) T protein:vir:96 283 -DV--KLFISASMYSELDKLKDKNGRYLLQDS--------ITAASGKQLLGKEVVVLDDDVI----GKSV---------- 337 (397) T ss_pred -Cc--EEEEcHHHHHHHHHhhccCCCeEeccC--------ccCCCcccccccceEEeccccc----CCCC---------- Confidence 11 458999999999999987777777642 2334557889999988776433 1110 Q ss_pred ccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 321 GKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 321 ~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) + -+ .++||.-+-...-+..++ +.+-+. .+.+.++++.++ +.+.+.+++++-.+.++.-+- T Consensus 338 ~---~~-~~~~gd~~~~~~~~~~~~----~~~~~~---------~~~~~~~~~~~~-~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 338 G---NV-VGFIGDAKAFASFFDRKQ----VSVSWV---------DNNIYGQLLAGI-IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred C---ce-EEEEeehhcceEeEeecc----eEEEEe---------cccccceeEEEE-EEEccEEecccceEEEEeecC Confidence 1 11 245675442121121222 111111 122334444333 578888888988888873333 No 138 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.25 E-value=0.00012 Score=42.18 Aligned_cols=287 Identities=15% Similarity=0.069 Sum_probs=131.8 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc---ccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV---VSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~---~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |+ .+|. -+ -|+..+++...+.++++.+-.. 
-.+--+.|++|+..+-.- T Consensus 1 MA----~~n~----------------a~-~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~-------- 51 (299) T protein:vir:79 1 MA----ALNY----------------AK-EYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTIST-------- 51 (299) T ss_pred Cc----cchh----------------HH-HHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccc-------- Confidence 22 0111 01 2677777788899998875432 223336789999987632 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) .|+.. .+-+. ..-+.|+++.. . .+-+|-|-=.|-..=| -.|.++-..++ .+ T Consensus 52 -~gl~D----Y~R~~---~g~~~g~~~~~-----------------~--~t~~ldqdr~~~f~vD-~~Dvdet~~~~-~~ 102 (299) T protein:vir:79 52 -TGRVD----SNRDT---IAVAQRNYDNA-----------------W--EPKVLTNQRKWSTLVH-PADINQTNYVA-SI 102 (299) T ss_pred -ccccc----cccCC---CcccccccCcc-----------------e--eEEEeeccccceeccc-hhhHHHHhhhh-HH Confidence 11110 11000 00000111111 1 1222333212222222 23333322222 23 Q ss_pred HHHHHHhhhHHHH---HHH-HHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 158 STELMNGAVQLTE---AVL-QKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 158 ~~~lg~~a~~~~e---~~l-~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) ..+++++..+..- |.. ...|.++++.+ |..++..++|+. . -++.|+.+...|++++.|. T Consensus 103 a~v~~~~~~~~v~pEiDay~~skl~~~a~~~---g~~~~~~~~T~~-----n-~y~~i~~~~~~lde~~vP~-------- 165 (299) T protein:vir:79 103 GNITKVYNEEQKFPEMDAYCISKIYADWTAL---GNTADTTVLTTT-----N-VLEVFDKLMEKMTEARVPE-------- 165 (299) T ss_pred HHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc---CCcccccccCHH-----H-HHHHHHHHHHHHHhcCCCC-------- Confidence 4445544432221 111 22233444221 222222223221 1 3789999999999999985 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc-----cCC Q lcl|NC_019514. 
234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG-----AGA 308 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~-----aGa 308 (399) ..+|+||.|+.-..|+. ++.|.............+|-||++.+|.++++|.- .+.. .|. T Consensus 166 ---------~~rvl~vtp~~~~~L~~------~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~-r~~t~~~~~~G~ 229 (299) T protein:vir:79 166 ---------NGRILYVTPVVNTLIKN------AKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSN-LMKTAYDFTTGW 229 (299) T ss_pred ---------CCeEEEeCHHHHHHHhh------chhhhcccccccccceeeeeeeeecceEEEEechh-hcCccceeccCc Confidence 34899999999988864 57898888888877789999999999999997762 1111 233 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhcccc Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPE 388 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~ 388 (399) .++.+ .- -..++|+=+.| .+.... -.+++ +-.||. ...+|=|-| .--+-=.|...-..+. T Consensus 230 ~~~~~--------ak-~in~ii~~~~a--~~~~~K---~~~~~--~~~P~~--~~~~~~~~~--~r~y~d~~v~~nk~~~ 289 (299) T protein:vir:79 230 KVGAG--------AK-QIFMSLVHPSA--IITPVS---YQFSK--LDEPTA--VTEGKYFYF--EESFEDVFILNKKADA 289 (299) T ss_pred cccCc--------cc-ccceEEEcCCe--eeeeEe---eeeEE--eecCCC--CCccceeee--eeeeeeeeeeccccCe Confidence 32221 11 23344443322 221111 00111 224553 222232221 0111111222233333 Q ss_pred ceEEEEEecc Q lcl|NC_019514. 
389 RLALVKTVAP 398 (399) Q Consensus 389 ~m~~ie~~a~ 398 (399) --+-++.|-- T Consensus 290 i~~~~~~a~~ 299 (299) T protein:vir:79 290 IQFVVEGAGA 299 (299) T ss_pred EEEEeeecCC Confidence 3333333333 No 139 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.24 E-value=8.1e-05 Score=43.02 Aligned_cols=208 Identities=17% Similarity=0.178 Sum_probs=106.3 Q ss_pred ccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019514. 100 IGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLA 179 (399) Q Consensus 100 ~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~la 179 (399) |+++ |+ +. .+=|.+.+..+.-.+..+.++++|+.-++.++ .....+++ T Consensus 1 iD~l------L~-----------------a~--------~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D-~~i~~~~~ 48 (221) T protein:vir:17 1 MDDL------LV-----------------AS--------QFVYDLDEILAQWNTRSEISKQIGEALAIHYD-ERIARVLA 48 (221) T ss_pred CCcc------hh-----------------HH--------HHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 1000 00 00 01122233333445777777777777666664 33333333 Q ss_pred cCCe--EEecCC-Cccccccccccc-CCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchH Q lcl|NC_019514. 180 GAGT--IVYTGA-ATQDSEITGEGA-TPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIP 255 (399) Q Consensus 180 g~~~--v~yag~-ats~~~~t~~~~-~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~ 255 (399) .+.. .-..+. ...+..+++..+ .++.+ ++.|+.|...|.+++.|. 
.-+++++.|+.-+ T Consensus 49 ~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l-~dai~~a~~~LdekdVP~-----------------~gR~~vv~P~~y~ 110 (221) T protein:vir:17 49 SASIAAAPVTGQDGGFSVNIGAGNTNNAQAI-VDGFFEAAAVLDERSAPM-----------------DGRVAVLSPRQYY 110 (221) T ss_pred hhhhhcCcccccccCcceeccccccCCHHHH-HHHHHHHHHHHhhcCCCC-----------------CCCEEEeCcHHHH Confidence 2211 111100 001111211111 12233 688899999999999995 3489999998888 Q ss_pred HHHHhhccCCCccceehhhcCCccccccc-cceeEcCeEEEecCccchhcccCCCccCCccc----cccCccce----EE Q lcl|NC_019514. 256 LIRKLVDPFGNAAFVPVHQYADAGTILNG-EIGTVDQFRLVVVPEMLHWAGAGATVGTNPGY----RETNGKYD----IY 326 (399) Q Consensus 256 dirdl~d~~~~p~fi~v~~Ya~~~~i~~g-EIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~----~~t~~~~D----Vy 326 (399) .|-.- .++-+......++.+.+.+| |||++.||++++++++-.-. |..-..++.. .+..+++. == T Consensus 111 ~LL~~----~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~--gt~~~~~ag~~~~~~~~~~~yr~~fs~~ 184 (221) T protein:vir:17 111 SLISS----VDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLY--GTNLVTDPGDATTSGENNGSYRPAITDR 184 (221) T ss_pred HHHHh----cCcceeeeecccccccccccceeeeecCcEEEEeccCCccc--ccccccCCccccccccccccccccccce Confidence 77321 24556666555666667777 89999999999999874311 1111111100 11111221 12 Q ss_pred EEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCC Q lcl|NC_019514. 
327 PMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRN 365 (399) Q Consensus 327 p~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~ 365 (399) .-||+=.+|-|+|-|=|--+.-.+.+ -+.---.||+- T Consensus 185 ~glv~~~~Avgtvkl~~~~~~~~~~~--~~~~~~~~~~~ 221 (221) T protein:vir:17 185 AGLVFHKEAADTVEVLLPPSRPPLVI--SMFSIRRPDRR 221 (221) T ss_pred EEEEEcchheeeeeeecCCCCCceee--eeeeccCCCCC Confidence 36889999999998877432212222 11111234443 No 140 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.10 E-value=9e-05 Score=42.76 Aligned_cols=282 Identities=13% Similarity=0.083 Sum_probs=150.1 Q ss_pred CCcCCee---------ecCCCC-cccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASKGML---------YNDPNT-TPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~~---------~n~~~~-t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) +...... ++.... ++++..+.+-|+ -+..+.+....+...+.+++...+++ |.+ ++-+... T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~----~~~~~Ii~~l~~~~~i~~~~~~~~~~---g~~-~ip~~~~- 188 (425) T protein:vir:95 118 MLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPE----VVVNRIMDIMGDYTTLYPLVDKIRVK---GTT-RILVDTD- 188 (425) T ss_pred HHhhhhhhhhhHHHHHHHHHHhhcccccCceeccH----HHHHHHHHHHHhhhhHHHhhceeecC---cee-EEEEecC- Confidence 1111111 111111 122222223343 23455666677777788888777774 432 2222211 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccc-cceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRV-GFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~-~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) .+.+.|..||- .+... 
..+...++.+.++++.|+.+|+++++ ++ T Consensus 189 -----------~~~a~~v~E~~-----------------------~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~-ds 233 (425) T protein:vir:95 189 -----------TSPATWIEQSG-----------------------ALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQ-DS 233 (425) T ss_pred -----------Ccccccccccc-----------------------ccccccccccceeeeeheeeeeeehhhHHHHh-cc Confidence 13344566551 11111 12345678889999999999999665 33 Q ss_pred chHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeE------EecCCCcccccccccccCCceecHHHHHHHHHHHHhccC Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTI------VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRT 223 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v------~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nra 223 (399) +.++...+...+.+..+...+ ..+++|.+.- ++++.. ..... .......+++++.++...+..... T Consensus 234 ~~~l~~~i~~~l~~~i~~~~d----~~il~G~G~~~~~p~Gil~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 305 (425) T protein:vir:95 234 IINLDDYVTKKIARAIAKALD----LAIVKGTGAANKQPLGIIPSLP-PENQV---TVEADNNLLKNLVKQIGLIDTGDD 305 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHhhccCCCCccccceeecccc-ccccc---ccccccchHHHHHHHHHhhhhhcc Confidence 334777777777766554443 4566776431 233211 11111 123356688888887755544222 Q ss_pred ccccceeccccccCccccCceeEEEeCCCc-hHHH---HHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCc Q lcl|NC_019514. 224 PKQTKVITGSRMIDTRTISAGRVLYIGSEL-IPLI---RKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPE 299 (399) Q Consensus 224 p~~t~~i~~s~~~~T~~I~~~yv~~~h~d~-~~di---rdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~ 299 (399) . +..+ +.++|+.. ...| +-++|--+.+-|. .-.++.+.+-|.++|.++. 
T Consensus 306 ~----------------~~~~-~~v~~~~~~~~~l~~l~~~kd~~g~~i~~----------~~~~~~~~l~G~pvv~~~~ 358 (425) T protein:vir:95 306 S----------------VGEI-VAVMKRSTYYNRLVEFSIQVDSNGNVVGK----------LPNLRTPDLLGLRVVFNNF 358 (425) T ss_pred c----------------cCce-EEEEeChHHHHHHHHHHhhcCCCCceeec----------cCCCCCccccceeeEEcCc Confidence 1 1222 34667654 3334 4344433322222 1134566788889998887 Q ss_pred cchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH-- Q lcl|NC_019514. 300 MLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK-- 377 (399) Q Consensus 300 ~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK-- 377 (399) |.. + .++||.-++-.++..++ +.+-+ ..|.+-.++..+++ T Consensus 359 ~~~----------~--------------~i~~Gd~~~~~~~~~~~-----~~i~~---------~~~~~f~~~~~~~~~~ 400 (425) T protein:vir:95 359 LDD----------D--------------TVLFGEFEQYTLVEREN-----ITIDS---------STHVKFTEDQTAFRGK 400 (425) T ss_pred CCC----------c--------------cEEEEecccEEEEeecc-----eEEEe---------ecccccccCceEEEEE Confidence 610 0 25677666555544321 22211 22556667777777 Q ss_pred HHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 378 WYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie~~a~~ 399 (399) +++.+.+.+++=.+.++...|+ T Consensus 401 ~r~d~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 401 GRFDGKPVKPEAFVLVTITDPV 422 (425) T ss_pred EeeCcEeecccceEEEEecCcC Confidence 4688999999999999999999 No 141 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.01 E-value=8.7e-05 Score=42.86 Aligned_cols=278 Identities=11% Similarity=0.021 Sum_probs=135.8 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCC Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQG 80 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~g 80 (399) +..+ ..+.....+++..+-+-|+ .+ . ..+......-.+.+++...+++.+.++-.....- T Consensus 148 ~~~~--e~~~~~~~~~~~~g~lvp~---~~-~-~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------------- 207 (437) T protein:vir:10 148 LKTG--EVRDVTGIALKDGKVIIPE---TI-L-TPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNS------------- 207 (437) T ss_pred HHhh--hhhhhhhcccccccccchH---HH-H-HHHHHhhhhhhhhhcceeEeeccCceeeEEeecc------------- Confidence 1000 0111111122222222232 11 2 2233333333466777777777665542211111 Q ss_pred CCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHH Q lcl|NC_019514. 81 IDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTE 160 (399) Q Consensus 81 i~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~ 160 (399) .+..++..++- .+......++..++...++++.++.+|+++++ ++..++...+..+ T Consensus 208 -~~~~~~~~e~~----------------------~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~~ 263 (437) T protein:vir:10 208 -TDLLTAHTEYG----------------------QTTKNATPVITPILWDLKTYTGGYVFSQELIS-DSSYDWQAELQSR 263 (437) T ss_pred -ccccccccccc----------------------cccccccccceeeeeehhheeeehhhhHHHHh-hhHHHHHHHHHHH Confidence 12223444431 11111223455677889999999999999765 4444577777777 Q ss_pred HHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCcc Q lcl|NC_019514. 161 LMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSI-TLDENRTPKQTKVITGSRMIDTR 239 (399) Q Consensus 161 lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~-~L~~nrap~~t~~i~~s~~~~T~ 239 (399) |.+..+...+ ..+++|.+...-.+ ....+.++|..+.. 
.|+..-.+ T Consensus 264 l~~~~~~~~~----~~i~~g~g~~~~~~--------------~~~~~~~~~~~~~~~~l~~~~~~--------------- 310 (437) T protein:vir:10 264 LIELRDNTDD----SLIITALTDGIKKT--------------TSTYLLGDLKKVLNVTLKPQDSA--------------- 310 (437) T ss_pred HHHHHHHHHH----HHHhhhhccccccc--------------ccccchhhHHHHHHhhhhhhhhc--------------- Confidence 7766554443 44666654321111 11224455555432 33332111 Q ss_pred ccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCcccccc Q lcl|NC_019514. 240 TISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRET 319 (399) Q Consensus 240 ~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~t 319 (399) .+ +-+|||.+...|+.|+|-.+.|-|.|- +. .|.-+++-|..++.++.|.. ..+++ T Consensus 311 ---~~-~~~~~~~~~~~l~~lkd~~g~~~~~~~--~~------~~~~~~l~G~pv~~~~~~~~--~~~~~---------- 366 (437) T protein:vir:10 311 ---AA-SIVMSQSAYNLFDMATDAMGRPLLQPN--VT------AATGYTLLGKTVVIVDDKLF--PSASA---------- 366 (437) T ss_pred ---CC-EEEEcHHHHHHHHHhhccCCCeeeccC--cc------CCCCcccccceeEEeccccc--CCcCC---------- Confidence 11 348999999999999987777777652 22 23446788888888776521 11111 Q ss_pred CccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEE--- Q lcl|NC_019514. 320 NGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVK--- 394 (399) Q Consensus 320 ~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie--- 394 (399) +. ..++||.= +|....-++ +.+ . . -+..|.+.|...+- +-+.+.++++.-++.|. T Consensus 367 -~~----~~~~~gd~~~~~~~~~r~~------~~~--~-~----~~~~~~~~~~~~~~--~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 367 -GD----VNIVVAPLKKAVINFKLTE------ITG--Q-F----QDTYDIWYKQLGIF--LRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred -Cc----eEEEEeeccccEEEEeeec------eEE--E-E----ecccccccceeeEE--EEEccEEecccceEEEEeec Confidence 11 12457753 333222222 111 1 0 12335555533222 34588888888888765 Q ss_pred EeccC Q lcl|NC_019514. 
395 TVAPL 399 (399) Q Consensus 395 ~~a~~ 399 (399) .++++ T Consensus 427 ~~~~~ 431 (437) T protein:vir:10 427 KAVTV 431 (437) T ss_pred ccccc Confidence 22222 No 142 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.98 E-value=2.4e-05 Score=45.92 Aligned_cols=273 Identities=14% Similarity=0.109 Sum_probs=135.7 Q ss_pred CCcCCee---------ecCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASKGML---------YNDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~~---------~n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) |..+... -|.- .+.++..+ -+-|+ -+..+.+......-.+.+++...++. +.++-..-+ . T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~-~- 168 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSY-T- 168 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecC---Cceeeeeec-c- Confidence 1111100 0000 11122222 12233 23456666666666677888877664 222211111 1 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 
71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) -+...|..|| +..+...++...++...++|+.|+.+|+++++ +++ T Consensus 169 -----------~~~a~~v~Eg-----------------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~ 213 (387) T protein:vir:26 169 -----------LDDDDFITDV-----------------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSD 213 (387) T ss_pred -----------CCcccccccc-----------------------ccccccccccceeeechheeeeechhhHHHHh-hhH Confidence 1123455554 22333344556788899999999999999665 555 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) .++...+...|.+..+.. |+. ..+..|.++-.-.|..+.. .++ .+....++++|.++.-.|+....+. T Consensus 214 ~~l~~~i~~~la~~~~~~-e~~--~~~~~g~g~g~~~g~~~~~-~~~---~~~~~~~~d~i~~~~~~l~~~y~~n----- 281 (387) T protein:vir:26 214 VDLVNWVENALQSGLAAK-ERK--DALAVSPKSGLEHMSFYNG-SVK---EVEGADMYDAIINALADLHEDYRDN----- 281 (387) T ss_pred HHHHHHHHHHHHHHHHHH-HHH--hHhhcCCCccccceeeecc-ccc---cccccchHHHHHHHHhccChhhhcC----- Confidence 567777777777765533 221 1233444322211111110 011 1222346888888887776532221 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 
231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) + ..++|+.....++.+.+.-+.+ ++.|.-+.+=|..++.+.-+ T Consensus 282 ------------a--~~imn~~t~~~~~~~~~~~~~~-------------~~~~~~~~llG~PV~~~~~~---------- 324 (387) T protein:vir:26 282 ------------A--TIYMRYADYVKIISVLSNGTTN-------------FFDTPAEKVFGKPVVFTDAA---------- 324 (387) T ss_pred ------------C--EEEEechHHHHHHHHHhcCCCc-------------ccccCCccccccceEEecCC---------- Confidence 1 2367877767776665433322 33333344555555543311 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCc-cchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDP-YGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DP-lgQrg~~gwK~~~~~~iLn~~~ 389 (399) +.++||.=++..+.+.+.. +. + ..|. -|+++|..+. ++.+.+.+++- T Consensus 325 ----------------~~~~~GDf~~~~~~~~~~~----~~-----~------~~~~~~~~~~~~~~~-r~Dg~v~~~~A 372 (387) T protein:vir:26 325 ----------------VKPIVGDFNYFGINYDGTT----YD-----T------DKDVKKGEYLFVLTA-WYDQQRTLDSA 372 (387) T ss_pred ----------------Cceeeechhhhhhhhhhhh----he-----e------cccccCCceEEEEEE-EeCcEeechhh Confidence 1246777655555554422 11 0 0111 2455554443 68888999998 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) ++.++.-+.- T Consensus 373 ~~~l~~ka~~ 382 (387) T protein:vir:26 373 FRIAKAKENT 382 (387) T ss_pred eEEEEeecCC Confidence 8888875544 No 143 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.98 E-value=2.4e-05 Score=45.92 Aligned_cols=273 Identities=14% Similarity=0.109 Sum_probs=135.7 Q ss_pred CCcCCee---------ecCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 
1 MASKGML---------YNDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~~---------~n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) |..+... -|.- .+.++..+ -+-|+ -+..+.+......-.+.+++...++. +.++-..-+ . T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~-~- 168 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSY-T- 168 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecC---Cceeeeeec-c- Confidence 1111100 0000 11122222 12233 23456666666666677888877664 222211111 1 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) -+...|..|| +..+...++...++...++|+.|+.+|+++++ +++ T Consensus 169 -----------~~~a~~v~Eg-----------------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~ 213 (387) T protein:vir:96 169 -----------LDDDDFITDV-----------------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSD 213 (387) T ss_pred -----------CCcccccccc-----------------------ccccccccccceeeechheeeeechhhHHHHh-hhH Confidence 1123455554 22333344556788899999999999999665 555 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) .++...+...|.+..+.. |+. ..+..|.++-.-.|..+.. .++ .+....++++|.++.-.|+....+. 
T Consensus 214 ~~l~~~i~~~la~~~~~~-e~~--~~~~~g~g~g~~~g~~~~~-~~~---~~~~~~~~d~i~~~~~~l~~~y~~n----- 281 (387) T protein:vir:96 214 VDLVNWVENALQSGLAAK-ERK--DALAVSPKSGLEHMSFYNG-SVK---EVEGADMYDAIINALADLHEDYRDN----- 281 (387) T ss_pred HHHHHHHHHHHHHHHHHH-HHH--hHhhcCCCccccceeeecc-ccc---cccccchHHHHHHHHhccChhhhcC----- Confidence 567777777777765533 221 1233444322211111110 011 1222346888888887776532221 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) + ..++|+.....++.+.+.-+.+ ++.|.-+.+=|..++.+.-+ T Consensus 282 ------------a--~~imn~~t~~~~~~~~~~~~~~-------------~~~~~~~~llG~PV~~~~~~---------- 324 (387) T protein:vir:96 282 ------------A--TIYMRYADYVKIISVLSNGTTN-------------FFDTPAEKVFGKPVVFTDAA---------- 324 (387) T ss_pred ------------C--EEEEechHHHHHHHHHhcCCCc-------------ccccCCccccccceEEecCC---------- Confidence 1 2367877767776665433322 33333344555555543311 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCc-cchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDP-YGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DP-lgQrg~~gwK~~~~~~iLn~~~ 389 (399) +.++||.=++..+.+.+.. +. + ..|. -|+++|..+. ++.+.+.+++- T Consensus 325 ----------------~~~~~GDf~~~~~~~~~~~----~~-----~------~~~~~~~~~~~~~~~-r~Dg~v~~~~A 372 (387) T protein:vir:96 325 ----------------VKPIVGDFNYFGINYDGTT----YD-----T------DKDVKKGEYLFVLTA-WYDQQRTLDSA 372 (387) T ss_pred ----------------Cceeeechhhhhhhhhhhh----he-----e------cccccCCceEEEEEE-EeCcEeechhh Confidence 1246777655555554422 11 0 0111 2455554443 68888999998 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 
390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) ++.++.-+.- T Consensus 373 ~~~l~~ka~~ 382 (387) T protein:vir:96 373 FRIAKAKENT 382 (387) T ss_pred eEEEEeecCC Confidence 8888875544 No 144 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.98 E-value=2.4e-05 Score=45.92 Aligned_cols=273 Identities=14% Similarity=0.109 Sum_probs=135.7 Q ss_pred CCcCCee---------ecCCCCccccccc-ccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASKGML---------YNDPNTTPSGIDA-PDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~~---------~n~~~~t~tT~~~-~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) |..+... -|.- .+.++..+ -+-|+ -+..+.+......-.+.+++...++. +.++-..-+ . T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~---~~~~p~~~~-~- 168 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHAL-PTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSY-T- 168 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecC---Cceeeeeec-c- Confidence 1111100 0000 11122222 12233 23456666666666677888877664 222211111 1 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 
71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) -+...|..|| +..+...++...++...++|+.|+.+|+++++ +++ T Consensus 169 -----------~~~a~~v~Eg-----------------------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~ 213 (387) T protein:vir:94 169 -----------LDDDDFITDV-----------------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSD 213 (387) T ss_pred -----------CCcccccccc-----------------------ccccccccccceeeechheeeeechhhHHHHh-hhH Confidence 1123455554 22333344556788899999999999999665 555 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVI 230 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i 230 (399) .++...+...|.+..+.. |+. ..+..|.++-.-.|..+.. .++ .+....++++|.++.-.|+....+. T Consensus 214 ~~l~~~i~~~la~~~~~~-e~~--~~~~~g~g~g~~~g~~~~~-~~~---~~~~~~~~d~i~~~~~~l~~~y~~n----- 281 (387) T protein:vir:94 214 VDLVNWVENALQSGLAAK-ERK--DALAVSPKSGLEHMSFYNG-SVK---EVEGADMYDAIINALADLHEDYRDN----- 281 (387) T ss_pred HHHHHHHHHHHHHHHHHH-HHH--hHhhcCCCccccceeeecc-ccc---cccccchHHHHHHHHhccChhhhcC----- Confidence 567777777777765533 221 1233444322211111110 011 1222346888888887776532221 Q ss_pred ccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCc Q lcl|NC_019514. 
231 TGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATV 310 (399) Q Consensus 231 ~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~ 310 (399) + ..++|+.....++.+.+.-+.+ ++.|.-+.+=|..++.+.-+ T Consensus 282 ------------a--~~imn~~t~~~~~~~~~~~~~~-------------~~~~~~~~llG~PV~~~~~~---------- 324 (387) T protein:vir:94 282 ------------A--TIYMRYADYVKIISVLSNGTTN-------------FFDTPAEKVFGKPVVFTDAA---------- 324 (387) T ss_pred ------------C--EEEEechHHHHHHHHHhcCCCc-------------ccccCCccccccceEEecCC---------- Confidence 1 2367877767776665433322 33333344555555543311 Q ss_pred cCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCc-cchhhHHHHHHHHHHhhccccc Q lcl|NC_019514. 311 GTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDP-YGEMGFSSIKWYYGTLILRPER 389 (399) Q Consensus 311 ~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DP-lgQrg~~gwK~~~~~~iLn~~~ 389 (399) +.++||.=++..+.+.+.. +. + ..|. -|+++|..+. ++.+.+.+++- T Consensus 325 ----------------~~~~~GDf~~~~~~~~~~~----~~-----~------~~~~~~~~~~~~~~~-r~Dg~v~~~~A 372 (387) T protein:vir:94 325 ----------------VKPIVGDFNYFGINYDGTT----YD-----T------DKDVKKGEYLFVLTA-WYDQQRTLDSA 372 (387) T ss_pred ----------------Cceeeechhhhhhhhhhhh----he-----e------cccccCCceEEEEEE-EeCcEeechhh Confidence 1246777655555554422 11 0 0111 2455554443 68888999998 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) ++.++.-+.- T Consensus 373 ~~~l~~ka~~ 382 (387) T protein:vir:94 373 FRIAKAKENT 382 (387) T ss_pred eEEEEeecCC Confidence 8888875544 No 145 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=96.74 E-value=0.0001 Score=42.47 Aligned_cols=274 Identities=14% Similarity=0.108 Sum_probs=130.9 Q ss_pred CCcCCe--------eecCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 
1 MASKGM--------LYNDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~--------~~n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) ...... ..+. -++.++..++ +-|+ -+..+.+......-.+.+++.+.++. |.++- +...+. T Consensus 100 ~~~~~~~~~~~~~~~~~a-l~~~t~s~gG~~IP~----~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p-~~~~~~- 169 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHA-LPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIP-RVSYTL- 169 (387) T ss_pred hhhhhhhhhhhhHHHHHh-hccCcCCCCceeech----hHHHHHHHHHHhhchhhhheeeeecC---CceEE-EEeecC- Confidence 000000 0011 0111222222 2233 12345555555555567777776654 22221 112111 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) +...|+.||-. .+...++...++.+.++|+.|+.+|+++++ +++. T Consensus 170 -----------~~a~~v~E~~~-----------------------~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-Ds~~ 214 (387) T protein:vir:93 170 -----------DDDDFITDVET-----------------------AKELKLKGDTVKFTTNKFKVFAAISDTVIH-GSDV 214 (387) T ss_pred -----------CccccccCccc-----------------------ccccccccceeeeeheeeeeechhhHHHHh-hhHH Confidence 22345555521 122223445677889999999999999664 5555 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) +|...+...+.+..+.. |+ ..-+..|.++-.-.|.-+. +.++ ++....++++|.++.-.|+...... 
T Consensus 215 ~l~~~i~~~la~~~~~~-e~--~~~~~~g~g~g~p~g~l~~-~~~~---~v~~~~~~d~i~~~~~~l~~~~~~~------ 281 (387) T protein:vir:93 215 DLVNWVENALQSGLAAK-ER--KDALAVSPKSGLDHMSFYN-GSVK---EVEGADMYDAIINALADLHEDYRDN------ 281 (387) T ss_pred HHHHHHHHHHHHHHHHH-HH--HhHhhcCCCccccceeeec-cccc---cccccchHHHHHHHHhccChhhhcC------ Confidence 57777777777765533 21 1123444443222221111 0011 1222335788888777766532221 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) + +.++|+.....+..+.+.-+. +++.|+-.++-|..++.+.-+ T Consensus 282 -----------a--~~~mn~~t~~~~~~~~~d~~~-------------~~~~~~~~~llG~PV~~~~~~----------- 324 (387) T protein:vir:93 282 -----------A--TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDAA----------- 324 (387) T ss_pred -----------C--EEEEechHHHHHHHHHhcCCC-------------cccccCCccccccceEEecCC----------- Confidence 1 236777666666555443222 223333345555555554311 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceE Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLA 391 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~ 391 (399) +.++||.=++..+.+.+.- +. + ++.-=-|+.+|..+ .++.+.+.+++-+. T Consensus 325 ---------------~~~~~GDf~~~~~~~~~~~--------~~-~-----~~~~~~~~~~~~~~-~r~d~~v~~~eA~~ 374 (387) T protein:vir:93 325 ---------------VKPIVGDFNYFGINYDGTT--------YD-T-----DKDVKKGEYLFVLT-AWYDQQRTLDSAFR 374 (387) T ss_pred ---------------Cceeeeehhhhheehhhhe--------ee-e-----cccccCCceeEEEE-eeeCceeechhheE Confidence 1245777666555444321 11 1 00001244555444 48899999999888 Q ss_pred EEEEeccC Q lcl|NC_019514. 
392 LVKTVAPL 399 (399) Q Consensus 392 ~ie~~a~~ 399 (399) .++..+.- T Consensus 375 ~l~~k~~~ 382 (387) T protein:vir:93 375 IAKAKENT 382 (387) T ss_pred EEEeecCC Confidence 88774444 No 146 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=96.59 E-value=0.00037 Score=39.41 Aligned_cols=279 Identities=15% Similarity=0.130 Sum_probs=130.1 Q ss_pred CCcC------CeeecCCCCcccccccccccceehhhhhHH---HHHHHHHHHHhhhhcc-cccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASK------GMLYNDPNTTPSGIDAPDGKQMNTFFWWKK---ALIEARKDQYFMPLAD-VVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~------~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk---~L~~A~p~lv~~~fA~-~~~mPkN~GktIk~rry~pl 70 (399) |.+. ..++|+-.- +.-|.+-|+.-+.+| .|.+......+..... ....=.+.|++|+..+-.-- T Consensus 12 ~~~~~~~~~~~~~~~~~~~------~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~ 85 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHF------ANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVT 85 (329) T ss_pred hhhhhhcccceeEEehhhh------cCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeeccc Confidence 3221 123343210 011122233222222 2322222212222110 12223678999999887321 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 
71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) |+..= .-.+|..| ..++++ ..+-+|-| .-+..|.=.-.|-++- T Consensus 86 ---------gl~DY--~R~~g~~~-----------g~vt~~--------------~~t~tidq-dR~~~F~VD~~D~dEt 128 (329) T protein:vir:10 86 ---------ELKDY--KRNATNEF-----------DHPQIQ--------------ETTYFLDQ-EKYWGRFVDALDRRDT 128 (329) T ss_pred ---------ccccc--cCCCCccc-----------cccccc--------------eeEEEeec-ccceeeecchhhHhhh Confidence 11000 00112111 111111 12333444 3334443223444443 Q ss_pred hHHHHHHHHHHHHhhhHHHH---HHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTE---AVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQT 227 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e---~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t 227 (399) +..+ .+..+++++..+..- |-.+-..+++. ++. .....+++ .=.++.|+.+...|++++.|. T Consensus 129 n~~l-~a~~i~~~~~~~~v~pEiDay~~skla~~-----a~~-~~~~~~t~------~nay~~i~~a~~~Lde~~vp~-- 193 (329) T protein:vir:10 129 EGNI-DINYVVAKQASEVVAPYLDNLRFATLARN-----KAK-HLTVGSGA------DAQYDAVLDVSVELDEIGAGA-- 193 (329) T ss_pred hhhh-hHHHHHHHHHHHHhhhHHHHHHHHHHHhh-----ccc-ccccccCH------HHHHHHHHHHHHHHHhcCCCC-- Confidence 3222 234455555443332 22233333332 111 11111111 124899999999999987652 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccC Q lcl|NC_019514. 228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAG 307 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aG 307 (399) .+|+||.|+.-..|+. ++.|+......+. 
...+|.||++++|.++++|.-.- T Consensus 194 ----------------~Rvl~VtP~~~~~Lk~------~~~f~~~~~~~~~-~~~~g~Vg~idG~~Ii~vps~~~----- 245 (329) T protein:vir:10 194 ----------------SRILFVTPKFYKGIKK------FVIELPQGDNRQQ-VLGKGVQGELDGFTIVKVPSKML----- 245 (329) T ss_pred ----------------CcEEEeCHHHHHHHHh------hhhhhcccccccc-ceeeeeeeeecCeEEEEecCCcc----- Confidence 4899999999888864 5788877666554 67899999999999999876211 Q ss_pred CCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEE-ecCCCCCCCCCCccchhhHHHH--HHHHHHhh Q lcl|NC_019514. 308 ATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTT-KMPGEATADRNDPYGEMGFSSI--KWYYGTLI 384 (399) Q Consensus 308 a~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~iv-k~pG~~~ad~~DPlgQrg~~gw--K~~~~~~i 384 (399) .+ +.+|++-+.|...+ .++.-+- -.|.+ ++.| -.| ..||++.+ T Consensus 246 -------------k~---in~ii~~~~A~~~~--------~K~~~~~~~~p~~---------~~~a-~~v~gr~yyd~~V 291 (329) T protein:vir:10 246 -------------QG---VEAMAVIGEVMASP--------IQANEAKLNSNVP---------GMFG-TLAEQMLYTGAFV 291 (329) T ss_pred -------------cc---eeEEEEcCCceeee--------eeeeeeeeeCCCC---------ccch-heeeeeeeeeeEE Confidence 01 24577777776655 1222211 12221 1111 122 37888888 Q ss_pred ccccceEEE---EEeccC Q lcl|NC_019514. 385 LRPERLALV---KTVAPL 399 (399) Q Consensus 385 Ln~~~m~~i---e~~a~~ 399 (399) +++.-.... +.+.+. T Consensus 292 ~~~k~~~I~~~~~~a~~~ 309 (329) T protein:vir:10 292 PEHLQKYIFTIGGKEVET 309 (329) T ss_pred EccccCEEEEecccCccc Confidence 887744432 222222 No 147 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=96.51 E-value=0.00011 Score=42.37 Aligned_cols=274 Identities=14% Similarity=0.103 Sum_probs=129.5 Q ss_pred CCcCCeee--------cCCCCcccccccc-cccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 
1 MASKGMLY--------NDPNTTPSGIDAP-DGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~~~--------n~~~~t~tT~~~~-i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) |..+...- .+--++.++..++ +-|+ -+..+.+......-.+.+++...++. |.++-..-+ + T Consensus 114 ~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~----~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p~~~~-~-- 183 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSY-T-- 183 (402) T ss_pred HhhhhHHHHHHhHHHHHhhhccCCCcCCccccch----hHHHHHHHhHHhhhhhhhhceeeecC---Cceeeeeec-c-- Confidence 11110000 0000111222221 2233 13455555555556667788776653 222211111 1 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..|| +..+...++...++...++|+.|+.+|+++++ +++. T Consensus 184 ----------~~~a~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~-Ds~~ 229 (402) T protein:vir:93 184 ----------LDDDDFITDV-----------------------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSDV 229 (402) T ss_pred ----------CCcccccccc-----------------------ccccccccccceeeecceeeeeechhhHHHHh-hhHH Confidence 1122345544 22223334556688889999999999999665 4555 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) .|...+...|.+..+.. |+. ..+..|.++-.-.|..+. +.++ ++.....+++|.++.-.|+...... 
T Consensus 230 ~l~~~i~~~la~~~~~~-e~~--~~~~~g~g~g~p~g~~~~-~~~~---~~~~~~~~d~l~~~~~~l~~~y~~n------ 296 (402) T protein:vir:93 230 DLVNWVENALQSGLAAK-ERK--DALAVSPKSGLEHMSFYN-GSVK---EVEGADMYDAIINALADLHEDYRDN------ 296 (402) T ss_pred HHHHHHHHHHHHHHHHH-HHH--hHhhcCCCccccceeeec-cccc---cccccchHHHHHHHHhccChhhhcC------ Confidence 57777777777765543 221 223444433222221111 0111 1112235788888887776532221 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) + +.++|+.....++.+.+.-+. +++.|.-+.+-|.-++.++- T Consensus 297 -----------a--~~imn~~t~~~~~~~~~d~~~-------------~~~~~~~~~llG~PV~~t~~------------ 338 (402) T protein:vir:93 297 -----------A--TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDA------------ 338 (402) T ss_pred -----------C--EEEEechHHHHHHHHHhcCCC-------------cccccCCccccccceEEecC------------ Confidence 1 236777776777666543222 23333333444444443321 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCc-cchhhHHHHHHHHHHhhccccce Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDP-YGEMGFSSIKWYYGTLILRPERL 390 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DP-lgQrg~~gwK~~~~~~iLn~~~m 390 (399) . +.++||.=++..+.+.+.. ++ + ..|+ .|+++|.++. ++.+.+.+++=+ T Consensus 339 -------------~-~~i~~GDf~~~~~~~~~~~----~~-----~------~~~~~~~~~~~~~~~-r~Dg~v~~~~A~ 388 (402) T protein:vir:93 339 -------------A-VKPIVGDFNYFGINYDGTT----YD-----T------DKDVKKGEYLFVLTA-WYDQQRTLDSAF 388 (402) T ss_pred -------------C-Cceeeechhhhhhhhhhhh----hh-----h------hhcccCCceEEEEEE-EeCcEEechhhe Confidence 1 1246676555544444321 11 0 0122 2455555444 668888888877 Q ss_pred EEEEEeccC Q lcl|NC_019514. 
391 ALVKTVAPL 399 (399) Q Consensus 391 ~~ie~~a~~ 399 (399) ..++.-+.- T Consensus 389 ~~l~ik~~~ 397 (402) T protein:vir:93 389 RIAKAKENT 397 (402) T ss_pred EEEEeecCC Confidence 776664433 No 148 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=96.47 E-value=0.00038 Score=39.31 Aligned_cols=299 Identities=12% Similarity=0.108 Sum_probs=135.7 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcc-cccccc---cCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLAD-VVSMPK---NYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~-~~~mPk---N~GktIk~rry~pl~~~~~~ 76 (399) |+-. +....+.-+ |+.|+.=+-.+||.+.++ .++.-. +.|.||..++-..+ + T Consensus 1 Ma~~-----------------~~~~lti~~--~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~-----~ 56 (430) T protein:vir:21 1 MALN-----------------EGQIVTLAV--DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-----P 56 (430) T ss_pred Cccc-----------------cchhhHHHH--HHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccc-----c Confidence 3322 111122222 788888888888888755 333333 88999865544333 2 Q ss_pred ccCCCCCCCce--eccCccccccccccccccccccccccccccccccceeeeeEeee-eeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 77 NDQGIDAAGAT--IVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRI-QKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 77 ~~~gi~aaga~--lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l-~qYG~~~e~Td~~~d~~~D~~l 153 (399) +..|.+-.+.+ ++|+. .+ ..| +|.+..++|+++= + +|. 
T Consensus 57 ~~~G~~~t~~~~~~~e~~----v~------------------------------~~~~~~~~V~~~~~~kE--l-~~~-- 97 (430) T protein:vir:21 57 TQEGWDLTDKATGLLELN----VA------------------------------VNMGEPDNDFFQLRADD--L-RDE-- 97 (430) T ss_pred ccccccccCCCccceeee----Ee------------------------------EEEeeeccceEEeehhH--h-cCh-- Confidence 23333322222 33321 11 222 2456677777542 2 222 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHh-----cCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLA-----GAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~la-----g~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) ++ .++-.+.|.+-+-..+-.+|++ +..++-.+-+ +...+.+ ..+++-.+-..|..|.+|+-. T Consensus 98 -~~-~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~-------t~~~~~~---~~~~~A~a~~~L~~~~vP~~~- 164 (430) T protein:vir:21 98 -TA-YRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDA-------IGTNTAD---AWNFVADAEEIMFSRELNRDM- 164 (430) T ss_pred -hh-HHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCC-------CCCCCCc---chhhHHHHHHHHHHhcCCCCC- Confidence 22 2333334433333445555542 3333322211 1111111 367788888899999999721 Q ss_pred eeccccccCccccCceeEEEeCCCchHHH-HHhhccCCCccceehhhcCCcccccccccee-EcCeEEE-ecCccchhcc Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLI-RKLVDPFGNAAFVPVHQYADAGTILNGEIGT-VDQFRLV-VVPEMLHWAG 305 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~di-rdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~-i~~vRfV-~~~~~~~~~~ 305 (399) -+-+++-|+....+ +.+. .+-- .+=...+.+-+|+||+ +.+|+++ .+...-+-.. T Consensus 165 ---------------~R~~~~~p~~~~~l~~~l~------~~~~-~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~ 222 (430) T protein:vir:21 165 ---------------GTSYFFNPQDYKKAGYDLT------KRDI-FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) T ss_pred ---------------CcEEEeChHHHHHHhhhhc------cccc-cccchhHHHhhcccccccchhhhhhhcCCcccccC Confidence 26677788877666 3331 2211 1112345567899998 9999976 4454444321 Q ss_pred -cCCCcc---------------------------------CCccc--------------------------------ccc Q lcl|NC_019514. 
306 -AGATVG---------------------------------TNPGY--------------------------------RET 319 (399) Q Consensus 306 -aGa~~~---------------------------------~~~~~--------------------------------~~t 319 (399) .|+... +++++ ..+ T Consensus 223 gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~ 302 (430) T protein:vir:21 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) T ss_pred ccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecC Confidence 111110 11110 022 Q ss_pred CccceEEEEEEE--------cccceeeec--------------------cccCCCCccceEEEecC----CC----CCCC Q lcl|NC_019514. 320 NGKYDIYPMLCV--------GAESFTTIG--------------------FQTDGKTLKFKVTTKMP----GE----ATAD 363 (399) Q Consensus 320 ~~~~DVyp~lV~--------G~~Afg~v~--------------------l~g~g~~~~~~~ivk~p----G~----~~ad 363 (399) ++.+.|||-||- +.-+|..|. |.=. +..|.....++ |- .+.. T Consensus 303 ~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh--~~A~~La~~pl~~p~~~~~~~~~~~ 380 (430) T protein:vir:21 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWA--DDAIRIVSQPIPANHELFAGMKTTS 380 (430) T ss_pred CceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEc--cceeEEEEecccCCCChhHhhheee Confidence 445666666541 011121110 0000 00111111111 10 0000 Q ss_pred CCCc--------------cchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
364 RNDP--------------YGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 364 ~~DP--------------lgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .++| -.....+.|=.+|++..|++||..++=...+- T Consensus 381 ~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 381 FSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred eeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 1111 11122334457889999999987554333333 No 149 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=96.10 E-value=0.00017 Score=41.20 Aligned_cols=291 Identities=15% Similarity=0.117 Sum_probs=134.9 Q ss_pred cccccccccceehhhhhHHHHHHHHHHHHhhhhccccc----cc--ccCCCEEEEEEccccccccccccCCCCCCCc--- Q lcl|NC_019514. 16 SGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVS----MP--KNYGKEIRVYHYIPLLDDRNVNDQGIDAAGA--- 86 (399) Q Consensus 16 tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~----mP--kN~GktIk~rry~pl~~~~~~~~~gi~aaga--- 86 (399) +. .+...-| +|.=.+.....|...|++.-+ || .=.|.+..+.|-.-+ ++++-+.- T Consensus 1 mp-------altLaea-~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~--------~~~~~~~v~~~ 64 (310) T protein:vir:97 1 MA-------SVTLAES-AKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVL--------GDVIMAGVGTT 64 (310) T ss_pred Cc-------ccchHHH-hhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeecc--------CCccccccccc Confidence 11 1111111 111122333444444443221 12 112444333333222 11111110 Q ss_pred eeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhh-cch--HHHHHHHHHHHH Q lcl|NC_019514. 87 TIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFD-SDS--ELFSHISTELMN 163 (399) Q Consensus 87 ~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~-~D~--~l~~~~~~~lg~ 163 (399) .+.+| ..+ ...+...++.+|.=.|.-.++-..+.|+. +++ .+.+. 
.++..+ T Consensus 65 ~~~~g----~~~---------------------~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Q-l~~~ie 118 (310) T protein:vir:97 65 FSGAG----AGK---------------------AAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQ-IASKAK 118 (310) T ss_pred ccCCC----ccc---------------------cccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHH-HHHHHH Confidence 01111 111 11222345556666677777777778886 554 11111 234444 Q ss_pred hhhHHHHHHHHHHHHhcCC-eEEecCCC---cccccccccccCCceecHHHHHHHHHHHH-hccCccccceeccccccCc Q lcl|NC_019514. 164 GAVQLTEAVLQKDLLAGAG-TIVYTGAA---TQDSEITGEGATPSVVDYDDLMRLSITLD-ENRTPKQTKVITGSRMIDT 238 (399) Q Consensus 164 ~a~~~~e~~l~~~~lag~~-~v~yag~a---ts~~~~t~~~~~~~~vt~~~lr~a~~~L~-~nrap~~t~~i~~s~~~~T 238 (399) ...+-.|+++ ++|-. +--+.|-. +....+.. ++....+|+++|+.+.-... ..+.|. T Consensus 119 a~~~~~e~~l----INGD~a~n~F~GL~~~~~~~q~i~~-~~~gg~~t~d~LDeLl~~v~~~~g~p~------------- 180 (310) T protein:vir:97 119 SAGRKYQDQL----INGNGAGNEFAGLIQLCASGQKATT-GATGSAISFAILDELMDLVVDKDGQVD------------- 180 (310) T ss_pred HHHHHHHHHh----hccccCCCcccchhhcCCccceeec-CCCCCCCCHHHHHHHHHHHhcCCCCCC------------- Confidence 4444444444 44311 00121110 01111111 12234578999988875442 333442 Q ss_pred cccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccCCccccc Q lcl|NC_019514. 239 RTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNPGYRE 318 (399) Q Consensus 239 ~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~~~~~ 318 (399) +-++||-+-.-|+++.--....+..|+.. ..|-.-+=++.+|-|+....... +... 
.+ T Consensus 181 -------~~l~~~~~~r~i~A~~R~~~~~g~~~~~~-----~~~G~~v~~~~GiPi~~~d~ip~----~~~~------~~ 238 (310) T protein:vir:97 181 -------YLTMHARTLRSYKALLRALGGASINEVVE-----LPSGAEVPAYSGTPIFRNDYIPT----NQTK------GG 238 (310) T ss_pred -------EEEecHHHHHHHHHHHHHhcCCCCCCccc-----cCCCCEEeeeCCeEEEEeCccCC----Cccc------cc Confidence 46889988777766432111222222111 11111223568888888754322 2111 11 Q ss_pred cCccceEEEEEEEcccc--eeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe Q lcl|NC_019514. 319 TNGKYDIYPMLCVGAES--FTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVKTV 396 (399) Q Consensus 319 t~~~~DVyp~lV~G~~A--fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~ 396 (399) +++.-.+|. +-+|+++ .|.+|+...|.+ -+.|.-+|+ -.|.---+..+ +||+++.+|++.=.++||-+ T Consensus 239 ~~gtTsIya-~r~Ge~~~~~Gv~Gl~~~~~~---glsVr~~G~----~~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 239 TTGCTTIFA-GTLDDGSRTHGIAGLTATQAA---GIQVVDVGE----SEDSDEHIWRV--KWYCGLALFSEKGLACADGI 308 (310) T ss_pred cCCceeEEE-EeeCccccccceeccccCCcc---ceeEEeCCc----ccCCcceeEEE--EEeeeEEEecccceeeeccc Confidence 122234664 5789986 799999876643 255666774 12333444445 46999999999999999988 Q ss_pred cc Q lcl|NC_019514. 397 AP 398 (399) Q Consensus 397 a~ 398 (399) .- T Consensus 309 ~~ 310 (310) T protein:vir:97 309 TN 310 (310) T ss_pred cC Confidence 88 No 150 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=96.03 E-value=0.0011 Score=36.81 Aligned_cols=278 Identities=15% Similarity=0.133 Sum_probs=133.7 Q ss_pred CCcCC------eeecCCCCcccccccccccceehhhhh---HHHHHHHHHHHHhhhhc--ccccccccCCCEEEEEEccc Q lcl|NC_019514. 1 MASKG------MLYNDPNTTPSGIDAPDGKQMNTFFWW---KKALIEARKDQYFMPLA--DVVSMPKNYGKEIRVYHYIP 69 (399) Q Consensus 1 ~~~~~------~~~n~~~~t~tT~~~~i~p~m~~~y~~---kk~L~~A~p~lv~~~fA--~~~~mPkN~GktIk~rry~p 69 (399) |.+.. .++|+-. .+--|++-++..+. -.+|.+......+.... ... 
.=-+.|++|++.+-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~------~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i~~ 73 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQH------FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKGDT 73 (319) T ss_pred CCcccccccceeEeehhh------hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeeecc Confidence 43221 2233321 00112222222221 34566666666655433 322 2236899999988742 Q ss_pred cccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 70 LLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 70 l~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) - |+..= .-.+|..| ..++++ ..+-+|-| .-+..|.=.-.|-++ T Consensus 74 ~---------gl~DY--~R~~g~~~-----------g~vt~~--------------~~t~tidq-dR~~~F~VD~~D~~E 116 (319) T protein:vir:97 74 T---------ELKDY--KRNATNEF-----------DHPKIE--------------ETTYFLDQ-EKYWGRFVDALDRKD 116 (319) T ss_pred c---------ccccc--cCCCCccc-----------CCcccc--------------eeEEEeec-ccccccccchhhHhh Confidence 1 11100 00112111 111121 12233444 222333322234444 Q ss_pred chHHHHHHHHHHHHhhhHHHH---HHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTE---AVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e---~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~ 226 (399) -+..+ .+...++++..+..- |..+-..+++. +|. .....+|+ .=.++.|+.+...|++++.|. 
T Consensus 117 tn~~l-~a~~i~~~~~~~~v~PEiDay~~skla~~-----a~~-~~~~~~t~------~n~y~~i~~a~~~Lde~~VP~- 182 (319) T protein:vir:97 117 TEGNI-DINYVVARQGAEVVAPYLDNLRFATLARN-----KAK-HLTVGTGS------DAQYDAVLDVSVELDEIKAPE- 182 (319) T ss_pred hhchh-hHHHHHHHHHHHHhhhhhhHHHHHHHHhh-----ccc-ccccccCH------HHHHHHHHHHHHHHHhcCCCC- Confidence 33222 234556665543332 22233334332 111 11111111 124899999999999988862 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .+|+||.|+.-..|+. ++.|+.....+. +.+.+|-||+++||.++++|.-.- T Consensus 183 -----------------~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG~~Vi~vps~~~---- 234 (319) T protein:vir:97 183 -----------------NRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDGFVIVKVPTKLL---- 234 (319) T ss_pred -----------------CcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecCeEEEEeccccc---- Confidence 3899999999888853 578988777765 457899999999999999875210 Q ss_pred CCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEE-ecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTT-KMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~iv-k~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) . -+.++++-+.|...+ . ++.-+- -.|.+ +++| -.|| .||++. T Consensus 235 --------------k---~in~i~~h~~A~~~~-~-------k~~~~~~~~p~~---------~~~a-~~v~gr~y~d~~ 279 (319) T protein:vir:97 235 --------------Q---GLQAIAVVGEVLASP-I-------QADLAKTNSNIP---------GMFG-TLAEQLLYTGAF 279 (319) T ss_pred --------------c---cceEEEEcCCeeeee-e-------eeeeeeccCCCc---------cccc-eeeeeeeeeeeE Confidence 0 124555556555443 1 111111 12211 1112 1233 788899 Q ss_pred hccccceEEE--EEeccC Q lcl|NC_019514. 384 ILRPERLALV--KTVAPL 399 (399) Q Consensus 384 iLn~~~m~~i--e~~a~~ 399 (399) ++++.-..+. 
....+- T Consensus 280 V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 280 VPEHLQKYIFTIGGTEVA 297 (319) T ss_pred EeccccceEEEeecCCcc Confidence 9887754443 222222 No 151 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=96.03 E-value=0.0011 Score=36.81 Aligned_cols=278 Identities=15% Similarity=0.133 Sum_probs=133.7 Q ss_pred CCcCC------eeecCCCCcccccccccccceehhhhh---HHHHHHHHHHHHhhhhc--ccccccccCCCEEEEEEccc Q lcl|NC_019514. 1 MASKG------MLYNDPNTTPSGIDAPDGKQMNTFFWW---KKALIEARKDQYFMPLA--DVVSMPKNYGKEIRVYHYIP 69 (399) Q Consensus 1 ~~~~~------~~~n~~~~t~tT~~~~i~p~m~~~y~~---kk~L~~A~p~lv~~~fA--~~~~mPkN~GktIk~rry~p 69 (399) |.+.. .++|+-. .+--|++-++..+. -.+|.+......+.... ... .=-+.|++|++.+-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~------~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i~~ 73 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQH------FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKGDT 73 (319) T ss_pred CCcccccccceeEeehhh------hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeeecc Confidence 43221 2233321 00112222222221 34566666666655433 322 2236899999988742 Q ss_pred cccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 
70 LLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 70 l~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) - |+..= .-.+|..| ..++++ ..+-+|-| .-+..|.=.-.|-++ T Consensus 74 ~---------gl~DY--~R~~g~~~-----------g~vt~~--------------~~t~tidq-dR~~~F~VD~~D~~E 116 (319) T protein:vir:94 74 T---------ELKDY--KRNATNEF-----------DHPKIE--------------ETTYFLDQ-EKYWGRFVDALDRKD 116 (319) T ss_pred c---------ccccc--cCCCCccc-----------CCcccc--------------eeEEEeec-ccccccccchhhHhh Confidence 1 11100 00112111 111121 12233444 222333322234444 Q ss_pred chHHHHHHHHHHHHhhhHHHH---HHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccc Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTE---AVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e---~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~ 226 (399) -+..+ .+...++++..+..- |..+-..+++. +|. .....+|+ .=.++.|+.+...|++++.|. T Consensus 117 tn~~l-~a~~i~~~~~~~~v~PEiDay~~skla~~-----a~~-~~~~~~t~------~n~y~~i~~a~~~Lde~~VP~- 182 (319) T protein:vir:94 117 TEGNI-DINYVVARQGAEVVAPYLDNLRFATLARN-----KAK-HLTVGTGS------DAQYDAVLDVSVELDEIKAPE- 182 (319) T ss_pred hhchh-hHHHHHHHHHHHHhhhhhhHHHHHHHHhh-----ccc-ccccccCH------HHHHHHHHHHHHHHHhcCCCC- Confidence 33222 234556665543332 22233334332 111 11111111 124899999999999988862 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) .+|+||.|+.-..|+. ++.|+.....+. 
+.+.+|-||+++||.++++|.-.- T Consensus 183 -----------------~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG~~Vi~vps~~~---- 234 (319) T protein:vir:94 183 -----------------NRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDGFVIVKVPTKLL---- 234 (319) T ss_pred -----------------CcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecCeEEEEeccccc---- Confidence 3899999999888853 578988777765 457899999999999999875210 Q ss_pred CCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEE-ecCCCCCCCCCCccchhhHHHHH--HHHHHh Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTT-KMPGEATADRNDPYGEMGFSSIK--WYYGTL 383 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~iv-k~pG~~~ad~~DPlgQrg~~gwK--~~~~~~ 383 (399) . -+.++++-+.|...+ . ++.-+- -.|.+ +++| -.|| .||++. T Consensus 235 --------------k---~in~i~~h~~A~~~~-~-------k~~~~~~~~p~~---------~~~a-~~v~gr~y~d~~ 279 (319) T protein:vir:94 235 --------------Q---GLQAIAVVGEVLASP-I-------QADLAKTNSNIP---------GMFG-TLAEQLLYTGAF 279 (319) T ss_pred --------------c---cceEEEEcCCeeeee-e-------eeeeeeccCCCc---------cccc-eeeeeeeeeeeE Confidence 0 124555556555443 1 111111 12211 1112 1233 788899 Q ss_pred hccccceEEE--EEeccC Q lcl|NC_019514. 384 ILRPERLALV--KTVAPL 399 (399) Q Consensus 384 iLn~~~m~~i--e~~a~~ 399 (399) ++++.-..+. ....+- T Consensus 280 V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 280 VPEHLQKYIFTIGGTEVA 297 (319) T ss_pred EeccccceEEEeecCCcc Confidence 9887754443 222222 No 152 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=95.86 E-value=0.00054 Score=38.48 Aligned_cols=273 Identities=14% Similarity=0.098 Sum_probs=128.0 Q ss_pred CCcCC-----eee----cCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 
1 MASKG-----MLY----NDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~-----~~~----n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) |.... ... |.-...+++..+-+-|+ -+..+.+........+.+++.+.++. |.++-...+ . T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~----~~~~~Ii~~l~~~s~l~~~~~v~~~~---~~~~p~~~~-~-- 133 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIPRVSY-T-- 133 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCCCCCceeccH----hHHHHHHHHHHhhcchhhheeeEecC---CceEEEEec-C-- Confidence 11000 000 11011111112223333 12334444444444456667665542 223221111 1 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..|| +......++...++...++|+.|+.+|+++++ +++. T Consensus 134 ----------~~~a~~v~E~-----------------------~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~-Ds~~ 179 (352) T protein:vir:78 134 ----------LDDDDFITDV-----------------------ETAKELKLKGDTVKFTTNKFKVFAAISDTVIH-GSDV 179 (352) T ss_pred ----------CCcccccccc-----------------------cccccccccceeeeecceeEEeechhhHHHHh-hhhH Confidence 1223455554 22223344556688889999999999999554 4555 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVIT 231 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~ 231 (399) +|.+.+.+.|++..+.. |+ ..-+..|.++-.-.|.-+... +. ++...-++++|.++.-.|+..-... 
T Consensus 180 ~l~~~i~~~la~~~~~~-e~--~~~~~~g~g~~~~~g~l~~~~-~~---~~t~~~~~d~i~~~~~~l~~~~~~~------ 246 (352) T protein:vir:78 180 DLVNWVENALQSGLAAK-ER--KDALAVSPKSGLEHMSFYNGS-VK---EVEGANMYDAIINALADLHEDYRDN------ 246 (352) T ss_pred HHHHHHHHHHHHHHHHH-HH--HhhhhcCCCCcccccceeccc-cc---cccccchHHHHHHHHhccChhhhcC------ Confidence 58888888888775533 22 122334443322222111100 00 1112224788888887765432210 Q ss_pred cccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCCCcc Q lcl|NC_019514. 232 GSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVG 311 (399) Q Consensus 232 ~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~ 311 (399) + +.++++.....|+.+++.-+.+ ++.|.-..+-|..++.+.-+ T Consensus 247 -----------a--~~~mn~~t~~~l~~~~~~~~~~-------------~~~~~~~~llG~PV~~~~~~----------- 289 (352) T protein:vir:78 247 -----------A--TIYMRYADYVKIISVLSNGTTN-------------FFDTPAEKVFGKPVVFTDAA----------- 289 (352) T ss_pred -----------C--EEEEehHHHHHHHHHHhccCCc-------------ccccCCccccccceEEecCC----------- Confidence 1 2466887777887776543333 33333334445455443311 Q ss_pred CCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_019514. 312 TNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYGTLILRPER 389 (399) Q Consensus 312 ~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) +.++||.=++..+.+.+.. ++. . .|++ .|.++++ .++.+.+.+++= T Consensus 290 ---------------~~~~~Gdf~~~~~~~~~~~----~~~----~-------~~~~--~g~~~f~~~~r~Dg~~~~~eA 337 (352) T protein:vir:78 290 ---------------VKPIVGDFNYFGINYDGTT----YDT----D-------KDVK--KGEYLFVLTAWYDQQRTLDSA 337 (352) T ss_pred ---------------CceeEeehhhhhhhhhhhe----eee----e-------cccc--CCeeEEEEEeeeCceeechhh Confidence 1235677666555444321 111 0 1221 2334443 467788888888 Q ss_pred eEEEEEeccC Q lcl|NC_019514. 
390 LALVKTVAPL 399 (399) Q Consensus 390 m~~ie~~a~~ 399 (399) ++.+++.+.= T Consensus 338 ~~~l~~~a~~ 347 (352) T protein:vir:78 338 FRIAKAKEST 347 (352) T ss_pred eEEEEeeccc Confidence 8777776655 No 153 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=95.81 E-value=0.0014 Score=36.19 Aligned_cols=303 Identities=11% Similarity=0.010 Sum_probs=137.6 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc---ccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV---VSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~---~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |++ +|+ .|...--|-|++=+-|-..+.-++. ..+.-+.++.. ..+-...|.++.+=.|.+|..+. .| T Consensus 1 M~~----~~~----~T~l~Dii~pEvF~~Yv~~~~~e~~-~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~-~n 70 (367) T protein:vir:80 1 MPD----FNN----QVRLVDAVIPEVYTSYTAIDRPELT-AFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLE-PN 70 (367) T ss_pred Ccc----hhh----hhhhhhccchhhhhHHHhhhhhhhh-hhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCc-cc Confidence 543 332 1222222444433333221111110 11111111111 11224578999998888872211 10 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHH Q lcl|NC_019514. 
78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHI 157 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~ 157 (399) |+..-|...+++..+ +.....+.+...|.=...+|-+.+..-++ .++++ T Consensus 71 ----------------~~~d~~~~~~t~~ki--------------ttg~~~a~v~~r~kaw~~~Dla~~lsG~d-pm~~I 119 (367) T protein:vir:80 71 ----------------YGSDNPNVEAPIDGL--------------GSGEMKTTKTWLNKAYGAMDLTAELAGSN-PMTRI 119 (367) T ss_pred ----------------cCCCCCccccccccc--------------ccchheeeeehhcccchhhhHHHHhhCch-HHHHH Confidence 111111111222211 11223345556666666677777776664 56666 Q ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCeEEec-----------------------CCCcccccccccc-cCCceecHHHHHH Q lcl|NC_019514. 158 STELMNGAVQLTEAVLQKDLLAGAGTIVYT-----------------------GAATQDSEITGEG-ATPSVVDYDDLMR 213 (399) Q Consensus 158 ~~~lg~~a~~~~e~~l~~~~lag~~~v~ya-----------------------g~ats~~~~t~~~-~~~~~vt~~~lr~ 213 (399) ..+++.--. +.-|..|++=..=++.. +.-+-| +++.. ..+..++.+.+-+ T Consensus 120 a~qva~yW~----r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~D--is~~t~~~~~~~s~~~~~~ 193 (367) T protein:vir:80 120 RNRFGVYWT----RQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVID--ISGQTNPADAVFNREAFVD 193 (367) T ss_pred HHHHHHHhh----hhhHHHHHHHHHHhhccccccchhhhhhhhccccccccccCceeee--eeccCCCccceecHHHHHH Confidence 666554433 34444444422111111 011111 12222 2456799999999 Q ss_pred HHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeE Q lcl|NC_019514. 214 LSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFR 293 (399) Q Consensus 214 a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vR 293 (399) |...|-.+... . =++++||.....|+.+ ..+.-.+|.+.. 
.+|+.+-|.| T Consensus 194 A~~~lGD~~~~-l------------------~~i~mHS~V~~~L~~~-------~li~~i~~sd~~----~~i~ty~G~~ 243 (367) T protein:vir:80 194 AAFTMGDHVGS-I------------------AAIAVHSMVYKRMTNN-------DEIEFIPDSKGQ----LTIPTYMGKV 243 (367) T ss_pred HHHHhcccccc-c------------------cEEEEchHHHHHHHhc-------cccccccCCCCc----cccceeccee Confidence 97777665442 2 4689999999999875 466666888763 4899999999 Q ss_pred EEecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchh-- Q lcl|NC_019514. 294 LVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEM-- 371 (399) Q Consensus 294 fV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQr-- 371 (399) +|+..-|-... ++ ...||-..+||.+|++.-... +..+.-+...|-...-++-|=|=.| T Consensus 244 VIvDD~~Pv~~--------------~~-a~~~yttYlfg~GAi~~~~~~----~~~~~E~~Rd~~~~~~gG~d~L~~Rr~ 304 (367) T protein:vir:80 244 VIVDDGMPVFG--------------TG-ADKTYLSILFGGAAFGYADGA----PQVPVAVGRRELRGNGSGLEYILERKE 304 (367) T ss_pred EEEeCCCcccc--------------cC-CCceEEEEEEecceeeecccC----CccceecccchhhhcCCceEEEEeeee Confidence 99988873311 11 123899999999999865322 1112222111100000011111111 Q ss_pred ---hHHHHHHHHHHhhccccc--------------eEEEEEeccC Q lcl|NC_019514. 372 ---GFSSIKWYYGTLILRPER--------------LALVKTVAPL 399 (399) Q Consensus 372 ---g~~gwK~~~~~~iLn~~~--------------m~~ie~~a~~ 399 (399) +-.|.||.-++.+--.-. 
..-|+.++-- T Consensus 305 ~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW 349 (367) T protein:vir:80 305 WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANLANPDNW 349 (367) T ss_pred EEeecceeeecccccccccccccccccccccCCCChHHhcCCccc Confidence 112333322221100000 0000000000 No 154 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=95.48 E-value=0.001 Score=37.03 Aligned_cols=298 Identities=12% Similarity=0.090 Sum_probs=132.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-cc---ccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VS---MPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~---mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+-. ++-.++.. -|+.|+--+-.+||.+.+.+ ++ -=.+.|.||..++-..+ + T Consensus 1 MAn~-----------------l~~~~~ii--~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~-----~ 56 (430) T protein:vir:10 1 MALN-----------------EGQIVTLA--VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-----P 56 (430) T ss_pred Cccc-----------------hhhHHHHH--HHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccc-----c Confidence 3211 11122222 36777777777888876442 21 12377898754433332 3 Q ss_pred ccCCCCCCCce--eccCccccccccccccccccccccccccccccccceeeeeEeee-eeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 77 NDQGIDAAGAT--IVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRI-QKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 77 ~~~gi~aaga~--lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l-~qYG~~~e~Td~~~d~~~D~~l 153 (399) +..|.+..+.+ ++|+. .+ ..| +|.+..++|++.=+ .|.+. 
T Consensus 57 ~~~G~~~t~~~~~i~e~~----v~------------------------------~~v~~~k~V~~~~~~kel---~~~~~ 99 (430) T protein:vir:10 57 TQEGWDLTDKATGLLELN----VA------------------------------VNMGEPDNDFFQLRADDL---RDETA 99 (430) T ss_pred cccCcccCCCCCccccce----EE------------------------------EEEeeeccceEEechhHh---cChhH Confidence 34454433332 33332 11 122 35677888886432 23212 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcC-----CeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGA-----GTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~-----~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) ..+.+ +.|..-+-..+-.||++-. .++-.+-+ +...+.+ ..+++-.+-+.|..|.+|+-. T Consensus 100 ---~~~~i-~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~-------t~~~~~~---~~~~~A~a~~~L~~~~vP~~~- 164 (430) T protein:vir:10 100 ---YRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDA-------IGTNTAD---AWNFVADAEELMFSRELNRDM- 164 (430) T ss_pred ---HHHHh-HHHHHHHHHHHHHHHHHHhhhccccccccccc-------CCCcCCc---chhhHHHHHHHHHHhcCCCCC- Confidence 22333 4444444455566665332 11211111 1111111 357777888899999999711 Q ss_pred eeccccccCccccCceeEEEeCCCchHHH-HHhhccCCCccceehhhcCCcccccccccee-EcCeEEE-ecCccchhcc Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLI-RKLVDPFGNAAFVPVHQYADAGTILNGEIGT-VDQFRLV-VVPEMLHWAG 305 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~di-rdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~-i~~vRfV-~~~~~~~~~~ 305 (399) -+-+++-|+....+ +.+...++ .+=...+.+-+||||+ +.+|+++ .++..-+-.+ T Consensus 165 ---------------~R~~vldp~~~~~l~~~l~~l~~-------~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~ 222 (430) T protein:vir:10 165 ---------------GTSYFFNPQDYKKAGYDLTKRDI-------FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) T ss_pred ---------------CcEEEeChHHHHHHHhhhccccc-------cccchhHHHhhccccccchhhhhhhhcCCcccccC Confidence 15677888887777 33321111 1112345567899998 9999976 4444443321 Q ss_pred -cCCCcc---------------------------------CCccc--------------------------------ccc Q lcl|NC_019514. 
306 -AGATVG---------------------------------TNPGY--------------------------------RET 319 (399) Q Consensus 306 -aGa~~~---------------------------------~~~~~--------------------------------~~t 319 (399) .|+... ++++. ... T Consensus 223 g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~ 302 (430) T protein:vir:10 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) T ss_pred ccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecC Confidence 111110 11100 012 Q ss_pred CccceEEEEEE-----------------------------EcccceeeeccccCCCCccceEEEecC----C----CCCC Q lcl|NC_019514. 320 NGKYDIYPMLC-----------------------------VGAESFTTIGFQTDGKTLKFKVTTKMP----G----EATA 362 (399) Q Consensus 320 ~~~~DVyp~lV-----------------------------~G~~Afg~v~l~g~g~~~~~~~ivk~p----G----~~~a 362 (399) ++.+.|||-++ +|+.++.. +|-=- +..|-....++ | ..+. T Consensus 303 atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~-Nl~fh--r~A~aLa~~pL~~~~~~~~~~~~~ 379 (430) T protein:vir:10 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDART-NVFWA--DDAIRIVSQPIPANHELFAGMKTT 379 (430) T ss_pred CceeEEeccccccccccccccccccceeccccccCceeEEeccCCccc-ceeEc--ccceEEEEecccCCCCHHHhhhhh Confidence 34455666553 22221100 00000 00111111111 0 0000 Q ss_pred CCC--------------CccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
363 DRN--------------DPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 363 d~~--------------DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .-+ |.-.....+-|=.+|++..|++||..++=...+- T Consensus 380 ~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 380 SFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred eeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 001 1111112223446888888888886554333333 No 155 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=95.48 E-value=0.001 Score=37.03 Aligned_cols=298 Identities=12% Similarity=0.090 Sum_probs=132.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhccc-cc---ccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV-VS---MPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~-~~---mPkN~GktIk~rry~pl~~~~~~ 76 (399) |+-. ++-.++.. -|+.|+--+-.+||.+.+.+ ++ -=.+.|.||..++-..+ + T Consensus 1 MAn~-----------------l~~~~~ii--~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~-----~ 56 (430) T protein:vir:92 1 MALN-----------------EGQIVTLA--VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQES-----P 56 (430) T ss_pred Cccc-----------------hhhHHHHH--HHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccc-----c Confidence 3211 11122222 36777777777888876442 21 12377898754433332 3 Q ss_pred ccCCCCCCCce--eccCccccccccccccccccccccccccccccccceeeeeEeee-eeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 77 NDQGIDAAGAT--IVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRI-QKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 77 ~~~gi~aaga~--lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l-~qYG~~~e~Td~~~d~~~D~~l 153 (399) +..|.+..+.+ ++|+. .+ ..| +|.+..++|++.=+ .|.+. 
T Consensus 57 ~~~G~~~t~~~~~i~e~~----v~------------------------------~~v~~~k~V~~~~~~kel---~~~~~ 99 (430) T protein:vir:92 57 TQEGWDLTDKATGLLELN----VA------------------------------VNMGEPDNDFFQLRADDL---RDETA 99 (430) T ss_pred cccCcccCCCCCccccce----EE------------------------------EEEeeeccceEEechhHh---cChhH Confidence 34454433332 33332 11 122 35677888886432 23212 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcC-----CeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGA-----GTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~-----~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) ..+.+ +.|..-+-..+-.||++-. .++-.+-+ +...+.+ ..+++-.+-+.|..|.+|+-. T Consensus 100 ---~~~~i-~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~-------t~~~~~~---~~~~~A~a~~~L~~~~vP~~~- 164 (430) T protein:vir:92 100 ---YRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDA-------IGTNTAD---AWNFVADAEELMFSRELNRDM- 164 (430) T ss_pred ---HHHHh-HHHHHHHHHHHHHHHHHHhhhccccccccccc-------CCCcCCc---chhhHHHHHHHHHHhcCCCCC- Confidence 22333 4444444455566665332 11211111 1111111 357777888899999999711 Q ss_pred eeccccccCccccCceeEEEeCCCchHHH-HHhhccCCCccceehhhcCCcccccccccee-EcCeEEE-ecCccchhcc Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLI-RKLVDPFGNAAFVPVHQYADAGTILNGEIGT-VDQFRLV-VVPEMLHWAG 305 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~di-rdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~-i~~vRfV-~~~~~~~~~~ 305 (399) -+-+++-|+....+ +.+...++ .+=...+.+-+||||+ +.+|+++ .++..-+-.+ T Consensus 165 ---------------~R~~vldp~~~~~l~~~l~~l~~-------~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~ 222 (430) T protein:vir:92 165 ---------------GTSYFFNPQDYKKAGYDLTKRDI-------FGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTK 222 (430) T ss_pred ---------------CcEEEeChHHHHHHHhhhccccc-------cccchhHHHhhccccccchhhhhhhhcCCcccccC Confidence 15677888887777 33321111 1112345567899998 9999976 4444443321 Q ss_pred -cCCCcc---------------------------------CCccc--------------------------------ccc Q lcl|NC_019514. 
306 -AGATVG---------------------------------TNPGY--------------------------------RET 319 (399) Q Consensus 306 -aGa~~~---------------------------------~~~~~--------------------------------~~t 319 (399) .|+... ++++. ... T Consensus 223 g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~ 302 (430) T protein:vir:92 223 STATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD 302 (430) T ss_pred ccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecC Confidence 111110 11100 012 Q ss_pred CccceEEEEEE-----------------------------EcccceeeeccccCCCCccceEEEecC----C----CCCC Q lcl|NC_019514. 320 NGKYDIYPMLC-----------------------------VGAESFTTIGFQTDGKTLKFKVTTKMP----G----EATA 362 (399) Q Consensus 320 ~~~~DVyp~lV-----------------------------~G~~Afg~v~l~g~g~~~~~~~ivk~p----G----~~~a 362 (399) ++.+.|||-++ +|+.++.. +|-=- +..|-....++ | ..+. T Consensus 303 atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~-Nl~fh--r~A~aLa~~pL~~~~~~~~~~~~~ 379 (430) T protein:vir:92 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDART-NVFWA--DDAIRIVSQPIPANHELFAGMKTT 379 (430) T ss_pred CceeEEeccccccccccccccccccceeccccccCceeEEeccCCccc-ceeEc--ccceEEEEecccCCCCHHHhhhhh Confidence 34455666553 22221100 00000 00111111111 0 0000 Q ss_pred CCC--------------CccchhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 
363 DRN--------------DPYGEMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 363 d~~--------------DPlgQrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) .-+ |.-.....+-|=.+|++..|++||..++=...+- T Consensus 380 ~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 380 SFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred eeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 001 1111112223446888888888886554333333 No 156 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=94.87 E-value=0.0033 Score=34.19 Aligned_cols=293 Identities=12% Similarity=0.009 Sum_probs=138.9 Q ss_pred CCcCCeee--------cCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 1 MASKGMLY--------NDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~~~~~--------n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) |+.|-+-= |.-.....+....+.|++.. +++.+....-.+.+.+...++....|+-.++ -+.+- T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~-----~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~-~~~~~-- 72 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWD-----EFWTDMIEETPLLDAIRTETVGAKKTRIPTL-NIGER-- 72 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHH-----HHHHHHHHhhhhhhhceeeeccCcceeeeee-ccCCc-- Confidence 66553211 11111112223456666543 3444444444577888888887666653322 12111 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhh-hcch Q lcl|NC_019514. 
73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDF-DSDS 151 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~-~~D~ 151 (399) .+..-+++ +.+-+...++...++-.++|+..+.++|++.++- ++-+ T Consensus 73 -----------~~~~~~e~----------------------~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~ 119 (321) T protein:vir:31 73 -----------HRRPQDEG----------------------EWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGE 119 (321) T ss_pred -----------cccccccc----------------------ccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcch Confidence 00111111 1222333344556778899999999999998753 3334 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEe------cCCCcc-cccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVY------TGAATQ-DSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~y------ag~ats-~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) ++.+.+...+.++-+.- +...+++|.++-.= .|--+. .............++++.+.++...|...... T Consensus 120 d~e~~i~~~ia~~~a~~----~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~ 195 (321) T protein:vir:31 120 ALADRILNLMTDAWSAD----VEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRA 195 (321) T ss_pred hHHHHHHHHHHHHHHHH----HHhheeeccccCCCcccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhc Confidence 57777777666654322 23334455432100 010000 00000111234557888888888777543210 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) .+..+.+|++++..+++.....-+.|-|.+. +..+.-.++.|+.++.+|.|-. 
T Consensus 196 -----------------~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~--------l~~~~~~tl~G~pvv~~~~mP~-- 248 (321) T protein:vir:31 196 -----------------RMNPALIVSEDQLLSYHYTLTDRDTPLGDNV--------IMGEADVNPFSFPIIGSGLWPD-- 248 (321) T ss_pred -----------------CCCeEEEechHHHHHHHHHHhcCCCccccch--------hhccccccccceeEEEcCCCCC-- Confidence 0127899999999888764333333333332 3445556788999999998721 Q ss_pred ccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCcc-chh-hHH-HHHHHHH Q lcl|NC_019514. 305 GAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPY-GEM-GFS-SIKWYYG 381 (399) Q Consensus 305 ~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPl-gQr-g~~-gwK~~~~ 381 (399) +..+-+. +.=|++ ++..+. ++..... .|+. .++ .+. -...-.. T Consensus 249 --------~~il~t~------~~nl~~--------~~~~~~---~~~~~~~---------~~~~~~~~~~~~~~~~~~~~ 294 (321) T protein:vir:31 249 --------DKAMFTD------PQNLIY--------ALYRDL---EIDVLTE---------SDKVSERDLHARYFMRGDDD 294 (321) T ss_pred --------CcEEEec------cccEEE--------EEeecc---EEEEeec---------CccccccceeeEeeeeeecc Confidence 1111111 111121 122211 1111111 0111 010 000 0112345 Q ss_pred HhhccccceEEEEE-eccC Q lcl|NC_019514. 382 TLILRPERLALVKT-VAPL 399 (399) Q Consensus 382 ~~iLn~~~m~~ie~-~a~~ 399 (399) +.|.+.+..+.+|= ..|+ T Consensus 295 ~~ve~~~a~a~~~~i~~~~ 313 (321) T protein:vir:31 295 FAIENTEAVVLAEGLGDPL 313 (321) T ss_pred eeEeccccEEEEecCCcch Confidence 66777777777762 2222 No 157 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=94.21 E-value=0.0051 Score=33.17 Aligned_cols=300 Identities=15% Similarity=0.130 Sum_probs=131.6 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccc-ccc---ccCCCEEEEEEccc---ccccccccc Q lcl|NC_019514. 
6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVV-SMP---KNYGKEIRVYHYIP---LLDDRNVND 78 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~-~mP---kN~GktIk~rry~p---l~~~~~~~~ 78 (399) |.+|. -+- |+..+.+...+.++...+.-.. .-. =|.|++||.-+-.. |.|- +.+. T Consensus 1 Mainy----------------a~~-~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY-~R~~ 62 (346) T protein:vir:10 1 MTINY----------------AEK-YQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDR-QRRT 62 (346) T ss_pred Ccchh----------------HHH-HHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccc-cccC Confidence 22222 011 3333333344554432221111 111 16789999887632 2111 0000 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHIS 158 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~ 158 (399) |..+. | .++++ .. +-+|.| --+.+|+=..+|.+|-..++ .+. T Consensus 63 -g~~~~------g---------------~v~~~------------~e--t~tl~q-DR~~~F~vD~mDvDETn~~~-~~a 104 (346) T protein:vir:10 63 -ITTPV------A---------------NYSND------------WD--SYELKN-ERYWSTLVDPSDIDETNMVV-SLA 104 (346) T ss_pred -Ccccc------c---------------ccccc------------ee--EEEeec-cccceecccccchHHHHHHh-HHH Confidence 00000 0 01111 11 111222 11222222223444443222 356 Q ss_pred HHHHHhhhHHHH---HHHH-HHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_019514. 159 TELMNGAVQLTE---AVLQ-KDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSR 234 (399) Q Consensus 159 ~~lg~~a~~~~e---~~l~-~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~ 234 (399) .+++++..+..- |..+ .-|.+++..+- ++.....++|+ .=-++.|+.+...|++++.|. 
T Consensus 105 nv~~ef~r~~vvPEiDayrfskLa~~a~~~~--~~~~~~~a~T~------~ni~~~i~~~~~~lde~~vp~--------- 167 (346) T protein:vir:10 105 NITKQFNLDSKMPEKDRYMFSHLYSGKEAAH--DGGITTNTLDE------KNILPAFDNMMLDFDEARIPS--------- 167 (346) T ss_pred HHHHHHHHHhhcchhhHHHHHHHHHhhhhhc--cccccccccCH------HHHHHHHHHHHHHHHHccCCC--------- Confidence 666665544432 2222 22333332111 11111111111 124688999999999998885 Q ss_pred ccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch-----hcccCCC Q lcl|NC_019514. 235 MIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH-----WAGAGAT 309 (399) Q Consensus 235 ~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~-----~~~aGa~ 309 (399) ..+|+||.|+.-.-|+. .++|.....-++... .++-||++++|-+|++|.-.= |. .|.. T Consensus 168 --------~~rvl~vTp~~~~lLk~------s~~f~k~~~v~~~~~-i~~~V~siDGv~Ii~VPs~r~~t~~~f~-~G~~ 231 (346) T protein:vir:10 168 --------TNRILYVTPKTNAILKR------AEAMNRALTLKDPNN-IQRTVYSLDDVTIRVVPSDLMQTAYDFS-DGSK 231 (346) T ss_pred --------CCeEEEECHHHHHHHhh------chhheeccccccccc-cceeeeeecCeEEEEcchhhcccchhhc-cCcc Confidence 33899999999887753 577886555566554 599999999999998874211 11 2333 Q ss_pred ccCCc-----cc-ccc----CccceEEEEEEEcccc----------eeeeccccCCCCccceEEEecC---CCCCCCCCC Q lcl|NC_019514. 310 VGTNP-----GY-RET----NGKYDIYPMLCVGAES----------FTTIGFQTDGKTLKFKVTTKMP---GEATADRND 366 (399) Q Consensus 310 ~~~~~-----~~-~~t----~~~~DVyp~lV~G~~A----------fg~v~l~g~g~~~~~~~ivk~p---G~~~ad~~D 366 (399) +++.+ .. +.+ -.|+|-+.+.==+.+. |..+=+--..+..-+.++--.| +.......+ T Consensus 232 ~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~~~~~~k 311 (346) T protein:vir:10 232 IIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQSGQDAK 311 (346) T ss_pred ccCCccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecccccCccCcccccC Confidence 32211 00 000 0011110000001111 1111111122222222222222 211222456 Q ss_pred ccchhhHHHHHHH-------HHHhhccccceEEEE Q lcl|NC_019514. 
367 PYGEMGFSSIKWY-------YGTLILRPERLALVK 394 (399) Q Consensus 367 PlgQrg~~gwK~~-------~~~~iLn~~~m~~ie 394 (399) |=.+--.--+|+| |+.+-+-++-+++++ T Consensus 312 pt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (346) T protein:vir:10 312 PTAESTLEEIKAYLDKNHIDYTGKTKKDELLALVK 346 (346) T ss_pred cccccchHHHHHHhcccccccccccchhhHHhhcC Confidence 6667677779998 456677788888888 No 158 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=93.87 E-value=0.0061 Score=32.72 Aligned_cols=286 Identities=10% Similarity=0.035 Sum_probs=113.8 Q ss_pred CCcCCee-ecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhh--c----ccccccccCCCEEEEEEccccccc Q lcl|NC_019514. 1 MASKGML-YNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPL--A----DVVSMPKNYGKEIRVYHYIPLLDD 73 (399) Q Consensus 1 ~~~~~~~-~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~f--A----~~~~mPkN~GktIk~rry~pl~~~ 73 (399) |+.--|+ ||. ...+.+++ .|.+ ...+|... + +..++ .|+.++.=.|.+|..+ T Consensus 1 m~lsD~~vfN~-~~~~a~~e---------------~~~q--~~~~fn~as~gai~l~~~~~---~Gd~~~~pf~~~l~g~ 59 (325) T protein:vir:95 1 MALSDLAVYSE-YAYSAFSE---------------TLRQ--QVDLFNTATGGAIMLQSAAH---QGDFSDVAFFAKVTGG 59 (325) T ss_pred Cchhhhhhhhh-hhhhhhhh---------------hhhh--hHhhhhhcccceeEeccccc---cCceeecccccccccc Confidence 4433332 333 22211111 1111 11222211 0 11111 3788887778776332 Q ss_pred ----cccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 
74 ----RNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 74 ----~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) ++.++++ +-....|+.+ + ++..-+..=-.|.+.+-....--+ T Consensus 60 ~~~~~~~~~~~-~vt~~kitt~--------------------~-------------~~av~~~r~~g~~~~d~~~~~~g~ 105 (325) T protein:vir:95 60 LVRRRNAYGSG-TVAEKVLKHL--------------------V-------------DTSVKVAAGTPPVRLDPGQFRWIQ 105 (325) T ss_pred ccccccCCCCc-eeccceeccc--------------------c-------------ceeeEEecccCcccccHHHHhhcC Confidence 1222221 1111122221 1 111112221223333333333345 Q ss_pred chHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccc-cccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITG-EGATPSVVDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~-~~~~~~~vt~~~lr~a~~~L~~nrap~~t~ 228 (399) || ++++....+++.++-...-++..++++..-.+-.. +.--..+++ .+..+..+|...|-++...|=.+ .-++ T Consensus 106 ~~--~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~-~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~-~~~l-- 179 (325) T protein:vir:95 106 QN--PEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV-SDVVYDATANTDAADKLPTWNNLNNGQAKFGDQ-SSQI-- 179 (325) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-ccceeeeecccCcccccccHHHHHHHHHHhccc-ccce-- Confidence 54 34455666666554433333333433332111111 000011111 11234557899999998888554 3332 Q ss_pred eeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccCC Q lcl|NC_019514. 229 VITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGA 308 (399) Q Consensus 229 ~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa 308 (399) =..++||....+|++. +.+..+++-+.+.+. .|+.+-|-|+|+++-|-. 
T Consensus 180 ----------------~~~~MHS~v~~~L~~~-------~L~~~~~~~~~~g~~--~i~t~~G~~VIVdD~~p~------ 228 (325) T protein:vir:95 180 ----------------AAWIMHSTPMHKLYGS-------NLTNGERLFTYGTVN--VVRDPFGKLLVMTDSPNL------ 228 (325) T ss_pred ----------------eEEEEchHHHHHHHHh-------hccccccccccCCcc--cccccCCcEEEEeCCCCC------ Confidence 3578999999999874 233333332332221 466777899998885311 Q ss_pred CccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccc--hh-------hHHHHHHH Q lcl|NC_019514. 309 TVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYG--EM-------GFSSIKWY 379 (399) Q Consensus 309 ~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlg--Qr-------g~~gwK~~ 379 (399) ..+++. .+|-+++||++|++.-.=+. +.. .+.+. ++.+=++ +| +-.|+||- T Consensus 229 --------~~~g~~-~~ytty~lg~GAi~~~~~~~------~~~---~~~~~--~~~~~~~~~~~~~~tf~lhp~G~sw~ 288 (325) T protein:vir:95 229 --------FAAGTP-NVYHILGLVPGGVLIGQNND------FDA---NEETK--NGDENIIRTYQAEWSYNIGVKGFAWD 288 (325) T ss_pred --------CCccCc-eeEEEEEEecCeEEecCCCC------ccc---ccccc--CcccceeeeeeeeeeEEeecceeeee Confidence 111111 28999999999976443111 111 12210 1111111 00 11222221 Q ss_pred HHHhhccccceEEEEEeccC Q lcl|NC_019514. 380 YGTLILRPERLALVKTVAPL 399 (399) Q Consensus 380 ~~~~iLn~~~m~~ie~~a~~ 399 (399) -+..-.++-. +-|++++-- T Consensus 289 ~s~~g~sPt~-aeL~~~~NW 307 (325) T protein:vir:95 289 KANGGKSPTD-AALFTSTNW 307 (325) T ss_pred cccccCCcCh-HhhcCCcCc Confidence 1111111100 011111111 No 159 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=93.19 E-value=0.0084 Score=31.95 Aligned_cols=297 Identities=13% Similarity=0.111 Sum_probs=136.4 Q ss_pred ecCCC-Ccccccccc--ccccee-hhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCC Q lcl|NC_019514. 
8 YNDPN-TTPSGIDAP--DGKQMN-TFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDA 83 (399) Q Consensus 8 ~n~~~-~t~tT~~~~--i~p~m~-~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~a 83 (399) +-+|- .+++...+. ++.-|+ +-+-.+++.+.++|+++-..|=+.-. -+.+..++|++-.|...+..+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~--a~~~~~v~f~~~~p~~~~~d~------- 71 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG--ANPNGVVAYNEGNPSFLEDDV------- 71 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc--ccccceeEEEecccccccCcH------- Confidence 55662 233322222 223333 22223344444667776554433222 566779999998876332111 Q ss_pred CCceeccCcccccccccccccccccccccccccccccccee-eeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHH Q lcl|NC_019514. 84 AGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSR-ISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELM 162 (399) Q Consensus 84 aga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~-~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg 162 (399) ....||-=| | ++ +.+. ...-+..+|||-=..+||+..+----+.+-+++ +-|+ T Consensus 72 --e~VaEggEi---P-----------~~---------~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~-~~l~ 125 (318) T protein:vir:10 72 --ADVAEFGEI---P-----------VS---------AGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQM-LQLR 125 (318) T ss_pred --hhccCcccc---c-----------cc---------CCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHH-HHHH Confidence 112333111 1 00 0011 111234569999999999988766666555554 5566 Q ss_pred HhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccc-cceeccccccCcccc Q lcl|NC_019514. 163 NGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQ-TKVITGSRMIDTRTI 241 (399) Q Consensus 163 ~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~-t~~i~~s~~~~T~~I 241 (399) +.-....+.....-|..+.+-...+.+.... .+.+ ..++-.|....+..++-.. ..+-.-..++|=. 
T Consensus 126 Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~---------~~~~-~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~-- 193 (318) T protein:vir:10 126 NTFIRANDRSAKALLQSPIVPTLAVPTAWDN---------GGKV-RTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFI-- 193 (318) T ss_pred HHHHHHHHHHHHHHHhccccccccCCcCCCC---------cccc-cccchhhhhhhhhhhhhhhhhhhhhhhhccCcc-- Confidence 6665555433333333333322222211111 0111 1233334433333222100 0000000111200 Q ss_pred CceeEEEeCCCchHHHHHhhccCCCccceehhhcC-Ccccc-----ccccc-eeEcCeEEEecCccchhcccCCCccCCc Q lcl|NC_019514. 242 SAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYA-DAGTI-----LNGEI-GTVDQFRLVVVPEMLHWAGAGATVGTNP 314 (399) Q Consensus 242 ~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya-~~~~i-----~~gEI-G~i~~vRfV~~~~~~~~~~aGa~~~~~~ 314 (399) .=+.++||.+.+-|++ ++.|.++ |- +..++ +.|.+ |++-|+++|.+|..-. + T Consensus 194 --pdtIVlhP~~~~~l~~------n~~~~~~--y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~----------~- 252 (318) T protein:vir:10 194 --PDTIVMHYALLPILMD------NENFMKV--YERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI----------D- 252 (318) T ss_pred --ceeeEECHHHHHHHhc------chhhhhh--hhccchhhhhcccccccccceeeceEEeecCccCC----------C- Confidence 0157999999999964 5777654 31 11222 34554 6778899999998511 1 Q ss_pred cccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEEEE Q lcl|NC_019514. 315 GYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLALVK 394 (399) Q Consensus 315 ~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie 394 (399) + .+||=++..| +-++..+..++.+ .++++||. +-...+|. ....+-.-+.+.| T Consensus 253 -------~-----alvlq~g~vG---~~~d~~pl~~t~~-------~~egg~~~-g~~~~s~~----~~~~~~~~~~V~~ 305 (318) T protein:vir:10 253 -------R-----VLIMERGTVG---FYSDTRPLQFTAL-------YPEGNGPN-GGPTESYR----ADASHKRALAVDQ 305 (318) T ss_pred -------e-----eEEEecCCcc---eeeccccceeeec-------ccCCCCCC-CCcchhhh----eehheeeeeeeeC Confidence 1 3677676555 4455444333333 35678886 44566654 1111111122211 Q ss_pred --EeccC Q lcl|NC_019514. 
395 --TVAPL 399 (399) Q Consensus 395 --~~a~~ 399 (399) .+..+ T Consensus 306 PkA~~~i 312 (318) T protein:vir:10 306 PKAALWL 312 (318) T ss_pred cceeEEE Confidence 11111 No 160 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=92.61 E-value=0.011 Score=31.39 Aligned_cols=284 Identities=13% Similarity=-0.001 Sum_probs=110.6 Q ss_pred CCcCCe---------eecCC-CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASKGM---------LYNDP-NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~~~---------~~n~~-~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) +..|+. .+|.- ...+++..+-+-|+ -+..+.+......-.+.+++...++..+..+ |..+.- T Consensus 64 ~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~-i~~~~~--- 135 (390) T protein:vir:40 64 LASRGANALTSDESKYYNEVIAGNGFAGVTALLPP----TVFERVFEDLTVEHPLLSKINFVNTTATTEW-IISVGD--- 135 (390) T ss_pred HHhcCchhccHHHHHHHHHHHhccCcccCcccccH----HHHHHHHHHHHhhhhhhhhceeeecCCceeE-EEEEcC--- Confidence 111110 11111 11222223333343 2345566666677677888888877543222 211111 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) .+...|..|+- ........+...++.+.++|+.+..+|+++++ ++. 
T Consensus 136 -----------~~~a~~~~E~~----------------------~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~-ds~ 181 (390) T protein:vir:40 136 -----------VATAWWGPLCA----------------------EIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLD-LGP 181 (390) T ss_pred -----------Ccceeeecccc----------------------ccCccccccceeeEeeeeeEEEeehhhHHHHh-cch Confidence 12334555431 11111223455678889999999999999877 444 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCcccccccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV------YTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~------yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) .++.+.+...+++..+...+ ..+++|.++-. ..+.++..... ......++..+.-.+...|+..--. T Consensus 182 ~~l~~~i~~~la~~i~~~~~----~a~l~G~G~~~P~Gil~~~~~~~~~~~~---~~~~~~~t~~~~~~~~~~l~~~~~~ 254 (390) T protein:vir:40 182 SWLDQYVRTILGEAMALGLE----AGIVNGSGKDQPIGMMRDLNNVTAGEHP---VKTATPLTDLTPATLATKVMLPLTD 254 (390) T ss_pred HHHHHHHHHHHHHHHHHHHH----hhhhcccCCCccceeeeccccccccccc---cccccccchhhHHHHHHHHHHHhhc Confidence 45777777777776554443 45666654321 11111110000 0112234444444444444321100 Q ss_pred cccceeccccccCccccCceeEEEeCCCchH----HHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCcc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIP----LIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEM 300 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~----dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~ 300 (399) . +.+ ..+. =+-+||+.... .++.++|.-+.+-|.. ..-++.+|.++.| T Consensus 255 ~------~~~-----~~~~-a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~----------------~~~g~pvv~~~~~ 306 (390) T protein:vir:40 255 N------GKK-----SVSD-AILVINPADYWSKIYAATSYMTPQGVWVTGI----------------LPVPLEIVQSVAV 306 (390) T ss_pred c------hhh-----hhcC-ceEEEcchhHHHHHHHHhhccCCCCcccccc----------------CCCceeEEEcCCC Confidence 0 000 0011 12367775532 2334444322221110 0123444444433 Q ss_pred chhcccCCCccCCccc-----------------cccCccceEEEE-----EEEcccceeeeccccCCCCccceEEEe-cC Q lcl|NC_019514. 
301 LHWAGAGATVGTNPGY-----------------RETNGKYDIYPM-----LCVGAESFTTIGFQTDGKTLKFKVTTK-MP 357 (399) Q Consensus 301 ~~~~~aGa~~~~~~~~-----------------~~t~~~~DVyp~-----lV~G~~Afg~v~l~g~g~~~~~~~ivk-~p 357 (399) .. |...-.+.-. +-..+...++-. -+.=.+||-.+.|+.-...-.+.+-+. .+ T Consensus 307 p~----~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~ 382 (390) T protein:vir:40 307 PV----GKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNA 382 (390) T ss_pred CC----CcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCC Confidence 10 0000000000 000001111111 122244555555555221111112222 23 Q ss_pred CCCCCCCCC Q lcl|NC_019514. 358 GEATADRND 366 (399) Q Consensus 358 G~~~ad~~D 366 (399) + .+++++. T Consensus 383 ~-~~~~~~~ 390 (390) T protein:vir:40 383 T-PSETPAE 390 (390) T ss_pred C-CCCCCCC Confidence 3 2344444 No 161 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=87.34 E-value=0.039 Score=28.29 Aligned_cols=272 Identities=15% Similarity=0.123 Sum_probs=125.3 Q ss_pred eehhh---hhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCCCCCceeccCccccccccccc Q lcl|NC_019514. 26 MNTFF---WWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGT 102 (399) Q Consensus 26 m~~~y---~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~ 102 (399) |..-| |+..+++.....++.+.+... ..==+.|++|++-+-.- .|+..=.. 
.+|..+|.- + T Consensus 1 Main~a~~~~~~Ld~~~~~~~~t~~l~~~-~~~~~ggktVkI~~i~~---------~gl~DY~R--~~g~~~g~v----~ 64 (290) T protein:vir:78 1 MAINYVDKYGKELDQKLVFGTYTNELETP-NLLWLDAKTFKIQTITT---------TGLKAHTR--NKGYNEGSA----S 64 (290) T ss_pred CchhHHHHHHHHHHHHHHhhheeeecccc-ceeeccCCEEEEeeecc---------Cccccccc--CCCcccCcc----c Confidence 32222 566666667788888777543 33346789999887532 11111000 112211110 1 Q ss_pred cccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhhHHHHHHH---HHH-HH Q lcl|NC_019514. 103 IVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQLTEAVL---QKD-LL 178 (399) Q Consensus 103 i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l---~~~-~l 178 (399) .+.+-. +|.| .-+.+|+=+..|.+|-. ..-.+..++++++.+..-..+ +-- |. T Consensus 65 ~~~et~---------------------tl~q-dR~~~F~vD~~DvDEt~-~~~~~~nv~~ef~~~~v~PEiDayr~skla 121 (290) T protein:vir:78 65 NTNKSY---------------------TIDF-DRDVEFFVDVMDVDETG-QALSAANVTKEFNSRHAGPEMDAYRFSKLA 121 (290) T ss_pred cceeeE---------------------Eeec-cccceeeccccchhHHh-hhhhHHHHHHHHHHHHhhhhhhHHHHHHHH Confidence 111111 1222 12233322223444433 333345566666554443333 222 22 Q ss_pred hcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHH Q lcl|NC_019514. 179 AGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIR 258 (399) Q Consensus 179 ag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dir 258 (399) .++.. .+...+ .++| ..=-++.|+.+...|++ .|. ..+++||.|+.-.-|+ T Consensus 122 ~~a~~---~~~~~~-~t~t------~~n~~~~i~~~~~~lde--vp~-----------------~~rvl~vtp~~~~lL~ 172 (290) T protein:vir:78 122 TAAKT---NSNSVA-EEIT------KDNVFTKLKAAIRKVKK--YGT-----------------QNLVMYVSPDVMAALE 172 (290) T ss_pred hhhhc---cCcccc-cccC------HHHHHHHHHHHHHHHHh--cCC-----------------CCeEEEECHHHHHHHh Confidence 33211 011000 0111 11246777788777764 442 3499999999988775 Q ss_pred HhhccCCCcccee---hhhcCCccccccccceeEcCeEEEecCccch------hcccCCCccCCccccccCccceEEEEE Q lcl|NC_019514. 
259 KLVDPFGNAAFVP---VHQYADAGTILNGEIGTVDQFRLVVVPEMLH------WAGAGATVGTNPGYRETNGKYDIYPML 329 (399) Q Consensus 259 dl~d~~~~p~fi~---v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~------~~~aGa~~~~~~~~~~t~~~~DVyp~l 329 (399) . +++|.. +.+.+.. ..++-||++++|.+|++|--.. |.+ |..+++ ...-..+| T Consensus 173 ~------~~~f~r~~~~~~~~~~--~i~~~V~~idG~~ii~vps~~r~~t~~~f~~-G~~~~~---------~ak~in~i 234 (290) T protein:vir:78 173 L------SDDFVRAINVQNIGPS--SIETRITAIDGTRIVEVEAEDRFYDTFDFTD-GYKPAA---------GAKKLNFL 234 (290) T ss_pred h------Chhhhccccccccccc--cccceeeeecCcEEEEecccchhhhhhhhcc-cccccC---------CccceeEE Confidence 3 577865 4444433 3699999999999999883211 222 333222 11123455 Q ss_pred EEcccceeeeccccCCCCccceEEEecCCCCCCCCCC--ccchhhHHHHHHHHHHhhccccceEEEEE Q lcl|NC_019514. 330 CVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRND--PYGEMGFSSIKWYYGTLILRPERLALVKT 395 (399) Q Consensus 330 V~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~D--PlgQrg~~gwK~~~~~~iLn~~~m~~ie~ 395 (399) |+=..|=-.+ .+- .++. .-.||.. .++| =+.+|-+- =.|...-..+.-.+-+++ T Consensus 235 i~~~~a~i~~-~K~----~~~~--~~~P~~~--~~~d~~~~~~r~y~---d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 235 LVNKGSVVGG-AKH----ASIY--LHAPGSV--GQGDGWLYQYRVYH---DIFVLDQQKDGVIASTEV 290 (290) T ss_pred EEcCCceeee-eee----eEEE--eeCCCCC--cCcceeeeeeeeee---eeeeeccccCeeEEEeeC Confidence 5555432222 111 1111 2346642 1122 22222111 124445555555666666 No 162 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=86.10 E-value=0.048 Score=27.82 Aligned_cols=288 Identities=9% Similarity=0.022 Sum_probs=131.3 Q ss_pred CCcCCe---------eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MASKGM---------LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~---------~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) ++.|.. 
.||.-...+++..+-+-|+ .+..+.+++....=.+.+++...+++- . .++-+... T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~i~~~~~v~~~~~---~-~~i~~~~~-- 126 (381) T protein:vir:95 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPE----ETIDRIFEDLTTNHPLLADLGIKNAGL---R-LKFLKSET-- 126 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCH----HHHHHHHHHHHhhccceeheeeEecCc---c-eEEEEecC-- Confidence 222211 1232212222222223333 234566655555556777888777752 2 22222111 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..++. .+.....++...++.+.++++++..+|+++++-... T Consensus 127 ----------~~~a~w~~e~~----------------------~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~- 173 (381) T protein:vir:95 127 ----------SGVAVWGKIYG----------------------EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA- 173 (381) T ss_pred ----------Ccceeeecccc----------------------cccccccccceeeeecceeEEeechhhHHHhhcCHH- Confidence 12223444431 111122344567888999999999999998754333 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC-------------CCccccccccc-ccCCceecHHHHHHHHHH Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG-------------AATQDSEITGE-GATPSVVDYDDLMRLSIT 217 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag-------------~ats~~~~t~~-~~~~~~vt~~~lr~a~~~ 217 (399) .|...+...+.+.-+... -.-+++|.++-.=-| +...+...... .-.+....++.|..+.+. 
T Consensus 174 ~ie~~i~~~la~~~a~~~----~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~ 249 (381) T protein:vir:95 174 WIERFVRVQIEEAFAVAL----ETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKY 249 (381) T ss_pred HHHHHHHHHHHHHHHHHh----hheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHh Confidence 366666666665544333 234666665421101 00000000000 011222335566666666 Q ss_pred HHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEec Q lcl|NC_019514. 218 LDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVV 297 (399) Q Consensus 218 L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~ 297 (399) |..+-.-+ .+......+.++|+....+|+.+.+..+ ++...+ . -.| -+..+|++ T Consensus 250 ~~~~~~~~------------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v-~-~l~--~g~~vv~s 303 (381) T protein:vir:95 250 HSTNEKGK------------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV-T-ALP--FNLNVIES 303 (381) T ss_pred hccccccc------------cccccCceEEEEccccHHhhccccccCC----------CCCcee-e-cCC--CCceEEec Confidence 65432211 0112234677899999999987644221 111110 0 000 24566776 Q ss_pred CccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH Q lcl|NC_019514. 298 PEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK 377 (399) Q Consensus 298 ~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK 377 (399) +.|.. +. ++||.-++-.+...+ + +.+ ... ...+-..+..+++ T Consensus 304 ~~~p~------------------~~------iifgDfs~Y~i~~r~-~----~~i--~~~-------~~~~~~~d~~~f~ 345 (381) T protein:vir:95 304 TVQEA------------------GK------VLTYVKGLYDGYLAG-G----INV--QKF-------KETLALDDMDLYT 345 (381) T ss_pred CCCCc------------------Cc------EEEEecccEEEEEec-c----cEE--Eee-------chhHhhcCCeEEE Confidence 65410 01 577776664554333 2 111 100 1122223233333 Q ss_pred --HHHHHhhccccceEEEEEec----cC Q lcl|NC_019514. 
378 --WYYGTLILRPERLALVKTVA----PL 399 (399) Q Consensus 378 --~~~~~~iLn~~~m~~ie~~a----~~ 399 (399) ..+.+++++++=++.++... ++ T Consensus 346 a~~r~dg~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:95 346 AKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred EEEEEcCEEecCceEEEEEEEecCCCcC Confidence 45566677777776655432 11 No 163 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=86.10 E-value=0.048 Score=27.82 Aligned_cols=288 Identities=9% Similarity=0.022 Sum_probs=131.3 Q ss_pred CCcCCe---------eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MASKGM---------LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~---------~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) ++.|.. .||.-...+++..+-+-|+ .+..+.+++....=.+.+++...+++- . .++-+... T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~i~~~~~v~~~~~---~-~~i~~~~~-- 126 (381) T protein:vir:10 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPE----ETIDRIFEDLTTNHPLLADLGIKNAGL---R-LKFLKSET-- 126 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCH----HHHHHHHHHHHhhccceeheeeEecCc---c-eEEEEecC-- Confidence 222211 1232212222222223333 234566655555556777888777752 2 22222111 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..++. .+.....++...++.+.++++++..+|+++++-... 
T Consensus 127 ----------~~~a~w~~e~~----------------------~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~- 173 (381) T protein:vir:10 127 ----------SGVAVWGKIYG----------------------EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA- 173 (381) T ss_pred ----------Ccceeeecccc----------------------cccccccccceeeeecceeEEeechhhHHHhhcCHH- Confidence 12223444431 111122344567888999999999999998754333 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC-------------CCccccccccc-ccCCceecHHHHHHHHHH Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG-------------AATQDSEITGE-GATPSVVDYDDLMRLSIT 217 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag-------------~ats~~~~t~~-~~~~~~vt~~~lr~a~~~ 217 (399) .|...+...+.+.-+... -.-+++|.++-.=-| +...+...... .-.+....++.|..+.+. T Consensus 174 ~ie~~i~~~la~~~a~~~----~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~ 249 (381) T protein:vir:10 174 WIERFVRVQIEEAFAVAL----ETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKY 249 (381) T ss_pred HHHHHHHHHHHHHHHHHh----hheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHh Confidence 366666666665544333 234666665421101 00000000000 011222335566666666 Q ss_pred HHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEec Q lcl|NC_019514. 218 LDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVV 297 (399) Q Consensus 218 L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~ 297 (399) |..+-.-+ .+......+.++|+....+|+.+.+..+ ++...+ . -.| -+..+|++ T Consensus 250 ~~~~~~~~------------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v-~-~l~--~g~~vv~s 303 (381) T protein:vir:10 250 HSTNEKGK------------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV-T-ALP--FNLNVIES 303 (381) T ss_pred hccccccc------------cccccCceEEEEccccHHhhccccccCC----------CCCcee-e-cCC--CCceEEec Confidence 65432211 0112234677899999999987644221 111110 0 000 24566776 Q ss_pred CccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH Q lcl|NC_019514. 
298 PEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK 377 (399) Q Consensus 298 ~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK 377 (399) +.|.. +. ++||.-++-.+...+ + +.+ ... ...+-..+..+++ T Consensus 304 ~~~p~------------------~~------iifgDfs~Y~i~~r~-~----~~i--~~~-------~~~~~~~d~~~f~ 345 (381) T protein:vir:10 304 TVQEA------------------GK------VLTYVKGLYDGYLAG-G----INV--QKF-------KETLALDDMDLYT 345 (381) T ss_pred CCCCc------------------Cc------EEEEecccEEEEEec-c----cEE--Eee-------chhHhhcCCeEEE Confidence 65410 01 577776664554333 2 111 100 1122223233333 Q ss_pred --HHHHHhhccccceEEEEEec----cC Q lcl|NC_019514. 378 --WYYGTLILRPERLALVKTVA----PL 399 (399) Q Consensus 378 --~~~~~~iLn~~~m~~ie~~a----~~ 399 (399) ..+.+++++++=++.++... ++ T Consensus 346 a~~r~dg~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 346 AKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred EEEEEcCEEecCceEEEEEEEecCCCcC Confidence 45566677777776655432 11 No 164 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=85.50 E-value=0.052 Score=27.61 Aligned_cols=288 Identities=10% Similarity=-0.020 Sum_probs=123.4 Q ss_pred CC-cCCeeecCCCC--cccccccccccceehhhhhHHHHHHHHHHHHhhhhcccc-cccccCCCEEEEEEcccccccccc Q lcl|NC_019514. 1 MA-SKGMLYNDPNT--TPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVV-SMPKNYGKEIRVYHYIPLLDDRNV 76 (399) Q Consensus 1 ~~-~~~~~~n~~~~--t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~-~mPkN~GktIk~rry~pl~~~~~~ 76 (399) |- =--+..|+|.. ...|+...-+-.+.+..++ +++.+....-.+.+.|.+. +|-...+ .|.. 
+ T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~-~i~~-----~------ 67 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEK-DISR-----L------ 67 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccc-cccc-----c------ Confidence 11 11112333321 1111111111222333333 4554455566677777752 3321111 1110 0 Q ss_pred ccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhh-hhcchHHHH Q lcl|NC_019514. 77 NDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLD-FDSDSELFS 155 (399) Q Consensus 77 ~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d-~~~D~~l~~ 155 (399) |+ |..+..|-. -+..+.+.+....+...++...+++..+.++|+++++ .+..+++.+ T Consensus 68 ---g~---~~~~~~g~~----------------~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~ 125 (315) T protein:vir:41 68 ---SL---VLDVGPGRD----------------ETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQ 125 (315) T ss_pred ---cc---Ccccccccc----------------cccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHH Confidence 00 000000100 0111223334445556678889999999999999885 333345777 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCC------------eEEecCCCcccccccccccCCceecHHHHHHHHHHHHhc-c Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAG------------TIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDEN-R 222 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~------------~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~n-r 222 (399) .+..++.+.-+.-. ...+++|-+ ...+++..... +........++.+.|..++..|+.. 
| T Consensus 126 ~l~~~~a~~~a~~~----~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~---~~~~~~a~~~~~d~l~~l~~sl~~~yr 198 (315) T protein:vir:41 126 KIVTLLGEGISYVL----EKYYLHGDTSSSDPLLRMSDGWLKLASEKLTE---SDVDPEAEDWPMNLFDTMIESLPTPYR 198 (315) T ss_pred HHHHHHHHHHHHHH----HHHhhccCCcCcCccccccccceecccccccc---cccccccccccHHHHHHHHHhcChHHh Confidence 77777766554333 334556633 22232221111 1111222345778888888888652 1 Q ss_pred CccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch Q lcl|NC_019514. 223 TPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH 302 (399) Q Consensus 223 ap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~ 302 (399) .- + ..-+-+++.++...||.++|.-+.+.|-|..+=|.+. .+-|..++..|.|-. T Consensus 199 ~~-------~----------~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~--------tl~G~PV~~~~~m~~ 253 (315) T protein:vir:41 199 NN-------L----------PNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSI--------LYDGRPVQYVPALEA 253 (315) T ss_pred hc-------C----------CceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCc--------eecccceEecccccc Confidence 10 0 0156789999999999999888888888775555444 455677777777633 Q ss_pred hcc-cCCCccCCc--ccc--ccCccceEEEEEEEcccce-eeecccc--CCCCc-cceEEEe Q lcl|NC_019514. 303 WAG-AGATVGTNP--GYR--ETNGKYDIYPMLCVGAESF-TTIGFQT--DGKTL-KFKVTTK 355 (399) Q Consensus 303 ~~~-aGa~~~~~~--~~~--~t~~~~DVyp~lV~G~~Af-g~v~l~g--~g~~~-~~~~ivk 355 (399) ... .+.---++. ... ...-+.++++-.-.+...| .+.-+.+ ...+. 
.++.+-+ T Consensus 254 ~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 254 LNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 211 111000000 000 0000111111000000000 0000000 00111 1111111 No 165 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=83.33 E-value=0.069 Score=26.94 Aligned_cols=292 Identities=10% Similarity=0.034 Sum_probs=130.1 Q ss_pred CCcCCe---------eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MASKGM---------LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~---------~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) +..|.. .||.-...+.+..+-+-|+ .+..+.++.....-.+.+++...++. |. .++-+... T Consensus 67 ~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~----~~~~~Ii~~l~~~s~i~~~~~v~~~~---~~-~~i~~~~~-- 136 (395) T protein:vir:95 67 LAKRSQDPLTSEERKFFNDINYDVGYTDEKILPE----TVVERVFDDLQKDHPLLSKINFQNAG---IK-TRVIKADP-- 136 (395) T ss_pred HhhcCccccchHHHHHHHHHhhccCCCCceeccH----HHHHHHHHHHHhhhhhhhhceeEecC---Cc-eEEEEecC-- Confidence 222211 1232211112222223333 23456666667777788888887774 33 22211110 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..+. +.+......+...++...++++++..+|+++++-... 
T Consensus 137 ----------~~~a~w~~e~----------------------~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~- 183 (395) T protein:vir:95 137 ----------AGQAVWGKVF----------------------GEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPA- 183 (395) T ss_pred ----------CcceEEeecc----------------------cccCccccccceeeeeceeeEEEeecccHHHHhcchh- Confidence 1122232221 1122223345567888899999999999997753333 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeE--------EecCCCcccccccccccCCceecHHHHHHHHHHHHhccC Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTI--------VYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRT 223 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v--------~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nra 223 (399) +|...+...+.+.-+... -..+++|.++- -+.++.+. ..+ .......++.+++..+...|..... T Consensus 184 ~ie~~i~~~la~~ia~~~----~~a~i~G~G~~~~qP~Gil~~~~~~~~--~~~-~~~~~~~~t~~~~~~~~~~l~~~~~ 256 (395) T protein:vir:95 184 WIERFVRTQIQEAISVAL----ESAIINGGGAAKTQPVGLMKDVNTNSG--AVT-DKASSGTLTFADADTTILELNDVLK 256 (395) T ss_pred HHHHHHHHHHHHHHHHHH----hhheeeccCCCCcCceeeeeccccccc--ccc-cccccchhhhhhhHhhHHHHHHHHH Confidence 366666666666544333 34566776542 11111111 111 1122233455555555444433221 Q ss_pred ccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchh Q lcl|NC_019514. 224 PKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHW 303 (399) Q Consensus 224 p~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~ 303 (399) .- ....++... ..++. -..++|+....|++ ..+-|.+. -|.+.+++. -++++|+++.|.. 
T Consensus 257 ~~-~~~~~~~~~---~~~~~-~~~~mn~~t~~~~~------g~~~~~~~--~G~~~~~lg------~g~~v~~~~~~p~- 316 (395) T protein:vir:95 257 NL-SVDEKGKEL---KIDGK-VALVVNPRDSWDVQ------ARYTYLTA--NGGFVTVLP------YNVTIITSEFVPE- 316 (395) T ss_pred hh-ccccccchh---hhcCc-eEEEEcchhhhhcC------CcceeccC--CCcceeccC------CcceEEEcCCCCC- Confidence 10 000011100 01111 23467776555553 23445542 222222221 2566777665420 Q ss_pred cccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHH--HHHH Q lcl|NC_019514. 304 AGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIK--WYYG 381 (399) Q Consensus 304 ~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK--~~~~ 381 (399) + + ++||.=++-.++..+ + +.+-+ ..+.+..++.++++ +++. T Consensus 317 ---------~----------~----i~fgdfs~y~i~~r~-~----~~i~~---------~~~~~~~~d~~~f~~~~r~d 359 (395) T protein:vir:95 317 ---------G----------K----LVAFVTDRYNAVRGG-G----LTVKK---------FDQTLALEDAVLFTAKTFAY 359 (395) T ss_pred ---------C----------c----EEEEecccEEEEEec-c----eEEEe---------ccchhhhCCcEEEEEEEEEC Confidence 0 1 467765554444332 1 11111 11233333444444 3567 Q ss_pred HhhccccceEEEEEeccC Q lcl|NC_019514. 382 TLILRPERLALVKTVAPL 399 (399) Q Consensus 382 ~~iLn~~~m~~ie~~a~~ 399 (399) +++.+++=++.++....- T Consensus 360 g~~~~~~A~~~l~i~~~~ 377 (395) T protein:vir:95 360 GQPDDNKASAVYDLKVAS 377 (395) T ss_pred CEEeccccEEEEEeeccC Confidence 888888777776664222 No 166 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=77.11 E-value=0.13 Score=25.47 Aligned_cols=298 Identities=11% Similarity=0.034 Sum_probs=130.0 Q ss_pred CCcCCeeecCCCCcccccccccccc--eehhhhhHHHHHHHHHHHHhhhhcccccc---cccCCCEEEEEEcccccccc- Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQ--MNTFFWWKKALIEARKDQYFMPLADVVSM---PKNYGKEIRVYHYIPLLDDR- 74 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~--m~~~y~~kk~L~~A~p~lv~~~fA~~~~m---PkN~GktIk~rry~pl~~~~- 74 (399) |+ .|...--|-|+ +-+-|..++..++-+ .+.-+..+....| -...|..++.=.|.+|..+. T Consensus 1 Ma------------~T~l~D~iipe~~vf~~Yv~~~~~e~~~-l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e 67 (349) T protein:vir:78 1 MA------------ITTIGDIVTGNIPVLASYMTEDPVEKTA-FFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIE 67 (349) T ss_pred CC------------ceEEeeeeccCHHHHHHHHHHhhHHhhh-hhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcc Confidence 33 12223334555 233354444433311 1112222222222 23569999999998884322 Q ss_pred -ccccCCC--CCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 75 -NVNDQGI--DAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 75 -~~~~~gi--~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) |+.+.+- +.....++.+ ...+.+..-|.=...+|-+.+...++ T Consensus 68 ~nv~~D~~~~~~t~~kitt~----------------------------------~~~a~~~~r~kaw~~~Dla~~lsG~d 113 (349) T protein:vir:78 68 PNYSNDVYQDIATPRAIQTG----------------------------------EMMARVAYLNEGFGQADLTVELTSQN 113 (349) T ss_pred cccCCCCccccccccccccc----------------------------------ceeeeeeeeccccchhHHHHHhhCch Confidence 2211111 0011122222 12233333333345566666655554 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCC-Ccc----cccccccccCCceecHHHHHHHHHHHHhc----c Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGA-ATQ----DSEITGEGATPSVVDYDDLMRLSITLDEN----R 222 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~-ats----~~~~t~~~~~~~~vt~~~lr~a~~~L~~n----r 222 (399) .++++...+++--.+ .-|..|++=..=++.... +++ ....+...++....+...+..+...|-.. 
+ T Consensus 114 -pm~~Ia~~va~yW~r----~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~ 188 (349) T protein:vir:78 114 -PLQSVASRLDNFWQR----QAQRRLIATALGLYNDNVSATDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNG 188 (349) T ss_pred -HHHHHHHHHHHHHhh----HHHHHHHHHHHHhhcccccccchhhhcccceeeeccccCCChhhhhhhHHHHHHHhcccc Confidence 466676666654443 334443332222222111 110 01111111233345777777776555442 1 Q ss_pred CccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccch Q lcl|NC_019514. 223 TPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLH 302 (399) Q Consensus 223 ap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~ 302 (399) ..+ -=++++||.....|+.. ..+.-.+..+.. -.|+.+-|-|+|+..-|-. T Consensus 189 ~~~------------------lt~i~mHS~v~~~L~~~-------~li~~i~~s~~~----~~i~ty~G~~VivDD~~Pv 239 (349) T protein:vir:78 189 GEV------------------LGAIAMHSFVYAQARKA-------QLIDFIRDAENN----TMFATYQGYRVIVDDSMTV 239 (349) T ss_pred ccc------------------eeEEEEchHHHHHHHhh-------hhhhhccCcccC----cccceecCeEEEEeCCCcc Confidence 111 24689999999999865 234333434332 2689999999999988733 Q ss_pred hcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccch-----hhHHHHH Q lcl|NC_019514. 303 WAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGE-----MGFSSIK 377 (399) Q Consensus 303 ~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ-----rg~~gwK 377 (399) . ++ +...||-+.+||.+|++.-. +.+..+..+...|-..+-.+-|=|=. .+-.|.| T Consensus 240 ~--------------~~-g~~~~yttylfg~GAi~~~~----~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s 300 (349) T protein:vir:78 240 V--------------GQ-GAQRKFISIIFGQGAIGYGE----GNPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYR 300 (349) T ss_pred c--------------cC-CCCceEEEEEeecceEEEcc----CCCccceeeecccccCCcceeEEEEEeeEEEeeeeeee Confidence 1 11 12458999999999998753 22211222222221000111122222 1222333 Q ss_pred HHHHHhhc--------cccceEEEEEeccC Q lcl|NC_019514. 
378 WYYGTLIL--------RPERLALVKTVAPL 399 (399) Q Consensus 378 ~~~~~~iL--------n~~~m~~ie~~a~~ 399 (399) |--++..- .+-| +-|+.++-- T Consensus 301 ~~~a~v~~~~~~~~~~sPt~-aeLa~~~NW 329 (349) T protein:vir:78 301 FTSAVITGNGTETIARSASW-QDLANATNW 329 (349) T ss_pred eccccccCCccccccCCCCh-HHhcCCcCc Confidence 32222110 0000 001111111 No 167 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=75.32 E-value=0.15 Score=25.13 Aligned_cols=280 Identities=13% Similarity=0.135 Sum_probs=128.5 Q ss_pred eeecCCCCcccccccccccceehhhhhHHHHHH---HHHHHHhhhhcccccccccCCCEEEEEEccccccccccccCCCC Q lcl|NC_019514. 6 MLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIE---ARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGID 82 (399) Q Consensus 6 ~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~---A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~ 82 (399) |+.|- +. . . .+.+.+ ++.|.+ +.|. -+.+||.. .| .-.+..++-+...++ ++. T Consensus 1 m~it~-~~----l-~----~l~~~~--~~~~~~~y~~a~~-~~~~~a~~--~~-sdf~~~~~~~lg~~p--------~l~ 56 (302) T protein:vir:10 1 MLINK-QS----L-N----AAFVAI--KTIFNNAFAAAPT-TWQKIAME--VP-SNTSSNDYKWLSTFP--------KMR 56 (302) T ss_pred CcccH-HH----H-H----HHHHHH--HHHHHHHHHhhhh-hhhceeee--cC-CCcceeeceecCCCC--------Ccc Confidence 44442 11 0 1 122222 344443 2243 35778753 45 345655555554431 111 Q ss_pred CCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch-HHHHHHHHHH Q lcl|NC_019514. 83 AAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS-ELFSHISTEL 161 (399) Q Consensus 83 aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~-~l~~~~~~~l 161 (399) . ++ | -.+ +++ ++-..-+..++.||.=+.+|.+.. 
.-|+ .++..+-+.| T Consensus 57 e---~~--G----e~~-~~~-------------------l~~~~~~i~~~~~g~~v~i~R~~i--~nDdlg~~~~~~~~~ 105 (302) T protein:vir:10 57 R---WI--G----AKV-VKN-------------------LKAYKYVVENEDFEATVEVDRNDI--EDDQIGIYSPQAKMA 105 (302) T ss_pred c---cc--c----cee-ecc-------------------ccccceeEEeecccceecccHHhh--cccccchhHHHHHHH Confidence 1 00 1 111 122 223335567999999999999743 2232 5777788899 Q ss_pred HHhhhHHHHHHHHHHHHhcCCeEEecCCCcccc----------cccccc--cCCceec---HHHHHHHHHHHHhccCccc Q lcl|NC_019514. 162 MNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDS----------EITGEG--ATPSVVD---YDDLMRLSITLDENRTPKQ 226 (399) Q Consensus 162 g~~a~~~~e~~l~~~~lag~~~v~yag~ats~~----------~~t~~~--~~~~~vt---~~~lr~a~~~L~~nrap~~ 226 (399) |+.|++..++++..-|.+|-+...|-|..=.++ ++.... .....++ +...|.+.+.++...-+. T Consensus 106 G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~- 184 (302) T protein:vir:10 106 GYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRS- 184 (302) T ss_pred HHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccc- Confidence 999988876666655555545444444322111 110000 0112344 445555555555433322 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccc---hh Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEML---HW 303 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~---~~ 303 (399) + .|.+.| .+|+|+++..-+.|.. .+.+. -+..-+ +. ..++.|++|.+. 
.| T Consensus 185 ---------L---~i~P~~-LiVp~~le~~A~~ll~----~~~~~---~g~~Np-~~------g~~~~vv~p~L~s~~aW 237 (302) T protein:vir:10 185 ---------L---NVSPNV-LLVGPALEDVAKMLLT----NPKLA---DNTPNP-YV------GTAELVVDGRIESDTAW 237 (302) T ss_pred ---------c---ccCCCE-EEecchhHHHHHHHhh----ccccC---CCCcce-ec------cceEEEEeeccCCCCce Confidence 2 355666 6899999999987632 11110 122222 21 346788887752 22 Q ss_pred cccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHh Q lcl|NC_019514. 304 AGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTL 383 (399) Q Consensus 304 ~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~ 383 (399) -=. . ....+++. ..=|.+..-..-..+-+..+-+-.+...-| .|-=+-.|+.-|.++|++. T Consensus 238 yL~----------a-~~~~i~~~--~l~g~~~P~~~~~~~~~~dgv~~k~~~d~G------vd~R~~~G~~~wq~a~~s~ 298 (302) T protein:vir:10 238 FLL----------D-TTKPVKPF--IFQPRKQPEFVSQVNLDSDDVFNLRKLKFG------AEARAAAGYGFWQLAYGST 298 (302) T ss_pred EEE----------e-cCCccceE--EEcCccccEEEeccCCCCCceEEEEEEEEe------eeeeeecchhhhhhhhccC Confidence 100 0 11122222 222333222222222111111222222223 2555567777778888765 Q ss_pred hccccceEEEEEec Q lcl|NC_019514. 384 ILRPERLALVKTVA 397 (399) Q Consensus 384 iLn~~~m~~ie~~a 397 (399) = ++| T Consensus 299 g----------~~~ 302 (302) T protein:vir:10 299 G----------TGA 302 (302) T ss_pred c----------cCC Confidence 3 222 No 168 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=72.91 E-value=0.18 Score=24.71 Aligned_cols=298 Identities=10% Similarity=0.050 Sum_probs=120.6 Q ss_pred CCcCC------eeecCCCC-cccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccc Q lcl|NC_019514. 
1 MASKG------MLYNDPNT-TPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDD 73 (399) Q Consensus 1 ~~~~~------~~~n~~~~-t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~ 73 (399) +..++ -.||.--. +..+..+-+-|+ -+.++.+++....=.+.+.+...++. |. .++-+. T Consensus 62 ~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~----~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~~~~~------ 127 (377) T protein:vir:98 62 DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPE----ETMVQVFDDLVAEHPLLKVINFKNTS---LR-LKALTA------ 127 (377) T ss_pred cCCcccCHHHHHHHHHHHhccCCCCCccccCH----HHHHHHHHHHHHhhhhhhheeeEecC---cc-eEEEEe------ Confidence 11110 11221101 111111222232 13456666555554556667666653 33 222111 Q ss_pred cccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHH Q lcl|NC_019514. 74 RNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSEL 153 (399) Q Consensus 74 ~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l 153 (399) . -.+...|..++ +.+......+...++...++++.+..+|+++++-...+ + T Consensus 128 -----~-~~~~a~w~~e~----------------------~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~-i 178 (377) T protein:vir:98 128 -----E-TSGTAVWGDIF----------------------GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKW-I 178 (377) T ss_pred -----c-CCcceeEeecc----------------------cccCcccCccceeEeecceeEEeeecccHHhhhccHhH-H Confidence 0 01223354443 22222333455678899999999999999987544433 6 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCC----CcccccccccccCCce-ecHHHHHHHHHHHHhccCccccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGA----ATQDSEITGEGATPSV-VDYDDLMRLSITLDENRTPKQTK 228 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~----ats~~~~t~~~~~~~~-vt~~~lr~a~~~L~~nrap~~t~ 228 (399) .+-+..++++.-+.. +-..+++|.++-.=-|- +..-........+.+. 
...+.+-++.-.|+..-.....- T Consensus 179 e~~i~~~la~~~a~~----~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~ 254 (377) T protein:vir:98 179 KQFITEQLKEAIAVA----LELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVP 254 (377) T ss_pred HHHHHHHHHHHHHHH----HhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHH Confidence 666666666554433 33456676664321111 0000000111111111 11122222221111110000000 Q ss_pred eecccccc---CccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcc Q lcl|NC_019514. 229 VITGSRMI---DTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAG 305 (399) Q Consensus 229 ~i~~s~~~---~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~ 305 (399) +++....- --+-.-..|+.+++|...+++. |+......-|...+++. -++++|+++.|.. T Consensus 255 ~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~--------p~~~~~~~~G~~~t~lg------~p~~vv~s~~~p~--- 317 (377) T protein:vir:98 255 VMKHLSVNDKKRPLKIAGQVKLILNPEDRWALE--------AQFTSRNQFGEYVTVLP------HGITILESLAVET--- 317 (377) T ss_pred HHHHHHHHHHhhhhccCCceEEEecccchhhcc--------ccccccCCCCccccccC------CCceEEecCCCCc--- Confidence 00000000 0000112366677776554442 22221112222222221 2567777665421 Q ss_pred cCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCcc---chhhHHHHHHHHHH Q lcl|NC_019514. 306 AGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPY---GEMGFSSIKWYYGT 382 (399) Q Consensus 306 aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPl---gQrg~~gwK~~~~~ 382 (399) + + ++||.-++-.+...++ +.+-+- .+-+ +|.+|.++ +...+ T Consensus 318 -------~----------~----i~fgdf~~Y~i~~r~~-----~~i~~~---------~~~~~~~d~~~f~~~-~r~dg 361 (377) T protein:vir:98 318 -------G----------K----AIAFVANRYDAFMATA-----STIEEY---------DQTFAMEDLQLYLTK-NYFYG 361 (377) T ss_pred -------c----------c----EEEEEecceeEEeecc-----eEEEee---------chhhhhcCceEEEEE-EEEcC Confidence 0 1 3455555434433321 111110 1222 34444332 34666 Q ss_pred hhccccceEEEEEecc Q lcl|NC_019514. 
383 LILRPERLALVKTVAP 398 (399) Q Consensus 383 ~iLn~~~m~~ie~~a~ 398 (399) ++.+++=++.+..+.- T Consensus 362 ~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 362 KAKDNHTAALLTLAGG 377 (377) T ss_pred EEeccCcEEEEEEecC Confidence 7777777888877777 No 169 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=70.47 E-value=0.21 Score=24.31 Aligned_cols=293 Identities=10% Similarity=0.035 Sum_probs=109.5 Q ss_pred CCcCCe---------eecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccc Q lcl|NC_019514. 1 MASKGM---------LYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLL 71 (399) Q Consensus 1 ~~~~~~---------~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~ 71 (399) +..|+. .||.-...+++..+-+-|+ -+..+.+++-...-.+.+.+...++. |. .++-+... T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~l~~~~~v~~~~---~~-~~i~~~~~-- 133 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEETLLPQ----TVVDEIFEDLTTEHPFLASIGMRTTG---LR-TKFLKSET-- 133 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCccccCH----HHHHHHHHHHHhhccceeeeeeEecC---Cc-eEEEEEcC-- Confidence 222222 2343322223333333333 23456665555555566777766653 33 23222211 Q ss_pred cccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch Q lcl|NC_019514. 72 DDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS 151 (399) Q Consensus 72 ~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~ 151 (399) .+...|..++ +.+......++..++.+.++++.+..+|+++++-... 
T Consensus 134 ----------~~~a~w~~e~----------------------~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~- 180 (383) T protein:vir:78 134 ----------SGVAVWGKIF----------------------GEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPA- 180 (383) T ss_pred ----------CcceEEeecc----------------------cccccccCcceeeEeecceeeEeeccchHHHhhccHH- Confidence 1122344442 2222233455677888999999999999998764444 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCccccc-ccccccCCceecHHHHHHHHHHHHhccCc Q lcl|NC_019514. 152 ELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV------YTGAATQDSE-ITGEGATPSVVDYDDLMRLSITLDENRTP 224 (399) Q Consensus 152 ~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~------yag~ats~~~-~t~~~~~~~~vt~~~lr~a~~~L~~nrap 224 (399) +|...+...+++.-+... -..+++|.++-. ..+..++... ..+..+....++.+++..+...|+.-+ + T Consensus 181 ~ie~~i~~~l~~~~a~~~----~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~ 255 (383) T protein:vir:78 181 WVKRFVVTQIEEAFAVAL----ESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVY-K 255 (383) T ss_pred HHHHHHHHHHHHHHHHHH----hhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHH-h Confidence 366666666665544333 344667765422 1111111000 000111123344555555444443211 0 Q ss_pred cccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhc Q lcl|NC_019514. 225 KQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWA 304 (399) Q Consensus 225 ~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~ 304 (399) ...-+.++. +...-..-+.+++|..-+++.- .......=|...+++. -++.+|+++.|.. 
T Consensus 256 ~~~~~~~~~----~~~~~~~~~~~~n~~~~~~~~~--------~~~~~~~~G~~~t~l~------~~~~iv~s~~~p~-- 315 (383) T protein:vir:78 256 YHSVKENGH----PLNVAGKVTLLVNPTDAWDVKK--------QYTSLNANGVYVTALP------FNLNIIESLFVPE-- 315 (383) T ss_pred ccchhcccc----hhhhcCceEEEEcCcchhhhcc--------chhccCCCCceeeecC------CCceEEecCCCCc-- Confidence 000011100 0011111245666644444421 1110000011111111 2556676655421 Q ss_pred ccCCCccCCcc----ccccCccceEEEEEEE--cccceeeeccccCC---CCccceE--EEecCCCCCCCC Q lcl|NC_019514. 305 GAGATVGTNPG----YRETNGKYDIYPMLCV--GAESFTTIGFQTDG---KTLKFKV--TTKMPGEATADR 364 (399) Q Consensus 305 ~aGa~~~~~~~----~~~t~~~~DVyp~lV~--G~~Afg~v~l~g~g---~~~~~~~--ivk~pG~~~ad~ 364 (399) +.....+-- ....+-.++++.-.-+ ++.+|-.+- ..+| .+..|.. +-..+++.+|.+ T Consensus 316 --~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~-r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 316 --KKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQ-FAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred --ccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEE-EEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 111000000 0000111111111101 111121111 1122 1222222 223333344444 No 170 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=61.62 E-value=0.35 Score=23.08 Aligned_cols=282 Identities=12% Similarity=0.036 Sum_probs=128.0 Q ss_pred CCc---CCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEccccccccccc Q lcl|NC_019514. 1 MAS---KGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVN 77 (399) Q Consensus 1 ~~~---~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~ 77 (399) |-. -..+...- +++....+-+.|+ -++ |++......-.+.+.|.+.+.-++....|-..-.. 
T Consensus 1 ~~~~~~~~~~~k~i-t~~d~~gG~L~P~----~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g--------- 65 (314) T protein:vir:41 1 MDFLNKPFQITPKI-DVPDLGKGILAVQ----RFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLG--------- 65 (314) T ss_pred CchhhhHHHhhccc-ccccCCCceeChH----HHH-HHHHHHHhccchhhheeeecccCccceeecccccC--------- Confidence 432 22222111 1112222334444 333 55555667777888887654333332333221110 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhh-hhcchHHHHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLD-FDSDSELFSH 156 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d-~~~D~~l~~~ 156 (399) ..-......+|. +...++...+...++...+|+..++++|+++++ -..-+++.+. T Consensus 66 ---~~~~~~~~~~~~---------------------~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~ 121 (314) T protein:vir:41 66 ---VELEPGRNTSGT---------------------KVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQT 121 (314) T ss_pred ---cccccccccccC---------------------CccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHH Confidence 000001111111 011112223344567778999999999998774 3333467777 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEe--------cCCCc-ccccccccccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVY--------TGAAT-QDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQT 227 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~y--------ag~at-s~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t 227 (399) +..++.++-+.-.|+ -+++|.+.... -|--+ ..+.+++.++.....+.+.+.+++..|+.-..+ T Consensus 122 i~~~~Ae~~g~~~~~----~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~--- 194 (314) T protein:vir:41 122 ITSLLASGVTYDLEC----FFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQ--- 194 (314) T ss_pred HHHHHHHHHHHHHHH----HhhccccCCcCcccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhc--- Confidence 777777765544433 34455432110 01000 001112222233456788888888888662111 Q ss_pred ceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhcccC Q lcl|NC_019514. 
228 KVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAG 307 (399) Q Consensus 228 ~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aG 307 (399) +++ -.+-++|++....+|.+.+.-..+.|-+.-.=| .--.+-|+.++..|.|. +.| T Consensus 195 ---~~~----------~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~--------~~~~l~G~PV~~~~~~~---~~~ 250 (314) T protein:vir:41 195 ---LKP----------RMKFYVSNEIYNGYRKQLLVRETGLGDSALIGA--------TGLQYDGIPIQYVPALD---ALG 250 (314) T ss_pred ---CCC----------ceEEEecHHHHHHHHHHHhccCCcccchhhhCC--------CCceecceeeEeccccc---ccC Confidence 011 177889999999999988766666666653333 33446788888887763 222 Q ss_pred CCccCCccccccCccceEEEEEEEccc------------c-------eeeeccccCCCCccceEEEecCCCCCCCCC Q lcl|NC_019514. 308 ATVGTNPGYRETNGKYDIYPMLCVGAE------------S-------FTTIGFQTDGKTLKFKVTTKMPGEATADRN 365 (399) Q Consensus 308 a~~~~~~~~~~t~~~~DVyp~lV~G~~------------A-------fg~v~l~g~g~~~~~~~ivk~pG~~~ad~~ 365 (399) +.. ...+-+.-.++ || +++.+ - +..+.+.= .+.-.+.++.+ ++.+ T Consensus 251 ~~~--~~i~fgd~~nl-v~---~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~--~~aa~~~~~~~-----~~~~ 314 (314) T protein:vir:41 251 DDK--ARALLTVPTNL-VY---GFWRNIRIEPKRDAAMRRTEYIASLRADCNYED--ENAAVAAVIDM-----SSGG 314 (314) T ss_pred CCC--ceEEEechhhe-EE---EeeceeEEeecccCcCCeEEEEEEEEeceEEEE--cCcEEEEEeec-----cCCC Confidence 221 12222221121 11 11111 1 11111110 11112223332 2333 No 171 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=61.12 E-value=0.36 Score=23.02 Aligned_cols=302 Identities=10% Similarity=-0.013 Sum_probs=130.8 Q ss_pred CCcCCeeecCCCCcccccccccccc--eehhhhhHHHHHHHHHHHHhhhhccccccc---ccCCCEEEEEEccccccccc Q lcl|NC_019514. 
1 MASKGMLYNDPNTTPSGIDAPDGKQ--MNTFFWWKKALIEARKDQYFMPLADVVSMP---KNYGKEIRVYHYIPLLDDRN 75 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~--m~~~y~~kk~L~~A~p~lv~~~fA~~~~mP---kN~GktIk~rry~pl~~~~~ 75 (399) |+ .|...--|-|+ +-+-|..++..++-+ .+.-+.+.....|. ...|..++.=.|.+|.-+.- T Consensus 1 Ma------------~T~l~D~iipe~~vf~~Yv~~~~~e~~~-l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e 67 (349) T protein:vir:94 1 MA------------ITTIGNIVTGNIPVLASYMTEDPVEKTA-FFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIE 67 (349) T ss_pred CC------------ceEEeeeeccChHHHHHHHHHhHHHhhh-hhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcc Confidence 43 12223335555 233354444433311 11123333222232 35699999988888733222 Q ss_pred cccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchHHHH Q lcl|NC_019514. 76 VNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSELFS 155 (399) Q Consensus 76 ~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~ 155 (399) +|-.+.+ |++.+++..++-.+ ..+.+..-|.=...+|-+.+...++ .++ T Consensus 68 ~n~~~dt----------------~~~~~t~~kit~~~--------------~~a~~~~r~kaw~~~Dla~~lsG~d-pm~ 116 (349) T protein:vir:94 68 PNYSNDV----------------YQDIATPRAIQTGE--------------MMARVAYLNEGFGQADLTVELTSQN-PLQ 116 (349) T ss_pred cccCCCC----------------cccccccccccccc--------------eeeeeeeeccccchhHHHHHhhCch-HHH Confidence 2222211 11222222222111 1122222333344566666655554 466 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecC-CCccc----ccccccccCCceecHHHHHHHHHHHHhc----cCccc Q lcl|NC_019514. 156 HISTELMNGAVQLTEAVLQKDLLAGAGTIVYTG-AATQD----SEITGEGATPSVVDYDDLMRLSITLDEN----RTPKQ 226 (399) Q Consensus 156 ~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag-~ats~----~~~t~~~~~~~~vt~~~lr~a~~~L~~n----rap~~ 226 (399) ++..++++--. +.-|..|++=..=++... ++++. ...+...+.....+...+-.+...|-.. 
+..+ T Consensus 117 ~Ia~~va~yW~----r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~- 191 (349) T protein:vir:94 117 SVASRLDNFWQ----RQAQRRLIATALGLYNDNVSATDAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEV- 191 (349) T ss_pred HHHHHHHHHHh----hHHHHHHHHHHHhhhcccccccccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccc- Confidence 66666665444 444444444332222221 11110 0001111122335666666665444332 1111 Q ss_pred cceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEEEecCccchhccc Q lcl|NC_019514. 227 TKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGA 306 (399) Q Consensus 227 t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~a 306 (399) -=++++||....+|+.++ .+.-.+..+.. -.|+.+-|-|+|+..-|-... T Consensus 192 -----------------lt~i~mHS~v~~~L~~~~-------li~~i~~s~~~----~~i~ty~G~~VivDD~~Pv~~-- 241 (349) T protein:vir:94 192 -----------------LGAIAMHSFVYAQARKAQ-------LIDFIRDAENN----TMFATYQGYRVIVDDSMTVVG-- 241 (349) T ss_pred -----------------eeEEEEchHHHHHHHhcc-------hhhhccCcccC----cccceecCcEEEEeCCCcccc-- Confidence 246799999999998752 33333433332 268999999999998873311 Q ss_pred CCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccch-----hhHHHHHHHHH Q lcl|NC_019514. 307 GATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGE-----MGFSSIKWYYG 381 (399) Q Consensus 307 Ga~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ-----rg~~gwK~~~~ 381 (399) .+...+|-+.+||.+|++.-.-. +..+..+...|-..+-.+-|=|=. .+-.|.||--+ T Consensus 242 -------------~g~~~~yttylfg~GAi~~~~~~----~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a 304 (349) T protein:vir:94 242 -------------QDTSRKFISIIFGQGAIGYGEGN----PEMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSA 304 (349) T ss_pred -------------CCCCceEEEEEeecceEEeecCC----CCcceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccc Confidence 11234899999999999886422 212233333221100111122222 12223333222 Q ss_pred Hhhc--------cccceEEEEEeccC Q lcl|NC_019514. 
382 TLIL--------RPERLALVKTVAPL 399 (399) Q Consensus 382 ~~iL--------n~~~m~~ie~~a~~ 399 (399) +..= .+-| +-|+.++-- T Consensus 305 ~v~~~~~~~~~~sPt~-aeLa~~~NW 329 (349) T protein:vir:94 305 VITGNGTETIARSASW-QDLANAANW 329 (349) T ss_pred ccCCCccccccCCCCh-HHhcCCcCc Confidence 1110 0000 001111111 No 172 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=60.97 E-value=0.36 Score=23.00 Aligned_cols=298 Identities=13% Similarity=0.109 Sum_probs=129.7 Q ss_pred CCc----------CCeeecCCCCcccc-cccccccceehhhhhHHHHHHHHHHHHhhhhccc----cccc--ccCCCEEE Q lcl|NC_019514. 1 MAS----------KGMLYNDPNTTPSG-IDAPDGKQMNTFFWWKKALIEARKDQYFMPLADV----VSMP--KNYGKEIR 63 (399) Q Consensus 1 ~~~----------~~~~~n~~~~t~tT-~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~----~~mP--kN~GktIk 63 (399) |-. |-...--|. ..++ .+..-..++. -+.....+...|++. +-|| .-.|.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~-l~m~alTLaea~~l~---------~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~ 70 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPE-LKMPTVTLAESAKLS---------QDHLVSGLIETIVEVNPLYEMMPFTEIEGNALA 70 (330) T ss_pred CceecCCccccceeehhccccc-cchhhhhhhHHhhcC---------chhhHHHHHHhhhccchHHhhcccccccCCcce Confidence 210 000000111 0000 0000000000 011111222222211 1111 11122211 Q ss_pred EEEccccccccccccCCCCCCCceeccCcccccccccc-ccccccccccccccccccccceeeeeEeeeeeecceeehhh Q lcl|NC_019514. 64 VYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIG-TIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQ 142 (399) Q Consensus 64 ~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~-~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td 142 (399) ..|-.-| +..++ +|+. .+.... ..|+..++.++.-++.+.++.. 
T Consensus 71 ~~r~~~l------------p~a~~----------r~~n~~~~~~~-------------~~Tf~q~t~~l~~l~~~~~Vd~ 115 (330) T protein:vir:94 71 YNRENVL------------GDVQF----------LAVGGTITAKN-------------PATFTKVTSELTTLIGDAEVNG 115 (330) T ss_pred eeeeecC------------Cccee----------eeccccccccC-------------cceeeeeeechhhhhhhHHHHH Confidence 1111111 11111 1110 011100 1234567778999999999999 Q ss_pred hhhhhhcch-HHHHHHHHHHHHhhhHHHHHHHHHHHHhc--CCeEEecCCCc---ccccccccccCCceecHHHHHHHHH Q lcl|NC_019514. 143 ESLDFDSDS-ELFSHISTELMNGAVQLTEAVLQKDLLAG--AGTIVYTGAAT---QDSEITGEGATPSVVDYDDLMRLSI 216 (399) Q Consensus 143 ~~~d~~~D~-~l~~~~~~~lg~~a~~~~e~~l~~~~lag--~~~v~yag~at---s~~~~t~~~~~~~~vt~~~lr~a~~ 216 (399) .++|+++++ +....-.++..+..++-.|+. +++| +++ -+.|-.. ....+.+ ++....+|+++|+.+.- T Consensus 116 ~iadl~g~~~d~~~~q~~~~ieal~~~~e~~----linGDs~~~-~F~GL~~~~~~~q~i~t-g~~gg~~T~d~LDeLl~ 189 (330) T protein:vir:94 116 LIQATRSDFMDQTSVQVASKAKSIGRQYQAS----MITGDGTGN-SFQGMMGLVAASQTISA-GANGGTLTFELLDQLLD 189 (330) T ss_pred HHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH----hhccCCCCc-cccchhhcCCcccEEec-CCCCCCCCHHHHHHHHH Confidence 999999875 222222334444444444444 4443 111 2222211 1111111 22345688999988874 Q ss_pred HHHhcc-CccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCcccccccccee----EcC Q lcl|NC_019514. 217 TLDENR-TPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGT----VDQ 291 (399) Q Consensus 217 ~L~~nr-ap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~----i~~ 291 (399) ...+-+ .| -+-+++.-+...|+++.. ..-.|+-..+ .....|+ +.+ T Consensus 190 ~v~~~~g~~--------------------~~~l~n~a~~r~I~a~~R--------~~~~~~v~~~-~~~~~G~~v~~~~G 240 (330) T protein:vir:94 190 LVKDKDGQV--------------------DYLMSSFAMRRKYFSLLR--------ALGGAAIGEV-MTLPSGRQIPTYRG 240 (330) T ss_pred HhcCCCCCC--------------------cEEEechhHHHHHHHHHH--------hccCCCCCCc-ccccCCCEEeeeCC Confidence 442222 12 133445555566665422 1223444332 2223443 678 Q ss_pred eEEEecCccchhcccCCCccCCccccccCccceEEEEEEEccc--ceeeeccccCCCCccceEEEecCCCCCCCCCCccc Q lcl|NC_019514. 
292 FRLVVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAE--SFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYG 369 (399) Q Consensus 292 vRfV~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~--Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlg 369 (399) |-++.+..... +.++ .+.++.-.||.+ =||.+ --|.+||...|.+ -+.|.-+|+. |.-+ T Consensus 241 vPi~~~d~ip~--~~~~--------~~~~~ttsIyav-~~G~~~~~qgV~Gl~~~g~~---glsVr~~G~~-----~~k~ 301 (330) T protein:vir:94 241 VPWFVNDFIPS--NMTQ--------GTATNATAIFAG-TFDDGSNKYGIAGLTARGSA---GLRVQNVGAK-----ENAD 301 (330) T ss_pred eEEEecccccC--CCCc--------ccCCCceeEEEE-eecccccccceEeecCCCCC---cceeeeCCCc-----cccc Confidence 88776543221 1111 111222347765 46654 3589999887632 2556767741 2222 Q ss_pred -hhhHHHHHHHHHHhhccccceEEEEEeccC Q lcl|NC_019514. 370 -EMGFSSIKWYYGTLILRPERLALVKTVAPL 399 (399) Q Consensus 370 -Qrg~~gwK~~~~~~iLn~~~m~~ie~~a~~ 399 (399) -++.+ +||+++.+|++.-.++||-+..= T Consensus 302 v~~~~v--~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 302 ETITRV--KMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred eeeEEE--EEeeeeEEechhheeeeccccCC Confidence 22333 57999999999999999877666 No 173 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=57.23 E-value=0.44 Score=22.54 Aligned_cols=262 Identities=16% Similarity=0.134 Sum_probs=114.5 Q ss_pred CCcCC-------------------------------eeecCC-CC--cccccccccccc-eehhhhhHHHHHHHHHHHHh Q lcl|NC_019514. 1 MASKG-------------------------------MLYNDP-NT--TPSGIDAPDGKQ-MNTFFWWKKALIEARKDQYF 45 (399) Q Consensus 1 ~~~~~-------------------------------~~~n~~-~~--t~tT~~~~i~p~-m~~~y~~kk~L~~A~p~lv~ 45 (399) |..+| ..|++- +. |..+ .+-|.++ ++.. 
-++++...|.- T Consensus 89 ~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~-~~~i~~~~v~d~---i~li~q~r~i~-- 162 (410) T protein:vir:83 89 MRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDL-QGVIPDPIVGPV---IDFIDSARPLV-- 162 (410) T ss_pred CcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCccccc-ccccchhHhhhH---HHHHhhccchh-- Confidence 11111 123322 11 1111 1123333 1110 13333355543 Q ss_pred hhhcccccccccCCCEEEEEEccccccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceee Q lcl|NC_019514. 46 MPLADVVSMPKNYGKEIRVYHYIPLLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRI 125 (399) Q Consensus 46 ~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~ 125 (399) +.|+. ||.+ |.|+++ ++. ..+.++-.+ ..+|++--|| ++++... ++.. T Consensus 163 slf~t---LP~~-g~T~eY-~v~--t~~~tV~~q--~~~~kqa~EG---------d~L~~gK--------------l~~~ 210 (410) T protein:vir:83 163 STLGT---LPLN-NATFYR-PIV--SQRPAVGLQ--GVAGGASDEK---------TELDSQK--------------MVID 210 (410) T ss_pred hhhhh---CCCC-CCeeEE-eee--ccccccccc--cccccccccc---------ccccccc--------------eeee Confidence 34443 7776 888887 553 222222111 1222333333 2333333 2334 Q ss_pred eeEeeeeeecceeehhhhhhhhhcchHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCce Q lcl|NC_019514. 126 SRVGRIQKFGFFTEFSQESLDFDSDSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSV 205 (399) Q Consensus 126 ~~~~~l~qYG~~~e~Td~~~d~~~D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~ 205 (399) .-++.|+-||+|.-+|.+..+.-.-+ ++.+.-+-|..+.+.-+|...+.-|...-+ + +. .-+. T Consensus 211 t~tA~ikTyGGyt~LSRQ~IERs~v~-~L~~~lraL~~AYA~atea~vra~L~~t~t-----~----------~~-a~~~ 273 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDFSSPS-ALDLVVNGLGQQYAIETEALVGAALASTST-----G----------AV-GYGN 273 (410) T ss_pred eccceeehhcCcccccceeeecCChh-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----h----------hh-hhhh Confidence 45788999999999999988876665 676666888888888887666665533221 1 11 1134 Q ss_pred ecHHHHHHHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCc------c Q lcl|NC_019514. 
206 VDYDDLMRLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADA------G 279 (399) Q Consensus 206 vt~~~lr~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~------~ 279 (399) +|.+.+-.++-+-+. -+++.+++++.. |+ .+.||.--|+ - |-|.++.--+.- . T Consensus 274 ~Tad~~~~~i~da~~-------~v~da~~~~~~~-----~i-~vS~DVl~~~---~-----~~f~~~~~~~~dt~Gfg~~ 332 (410) T protein:vir:83 274 ATADNVASAIWQAAG-------AVYTAVKGMGRL-----VI-AIAPDVLGDF---G-----PLFAPVNPTNAHSTGFEAG 332 (410) T ss_pred ccHHHHHHHHHHHHH-------HHhhhhccceee-----eE-Eechhhhhhc---c-----ceeeccCCCCccccccccc Confidence 466666555543221 234444555543 44 4566664333 2 345444321111 1 Q ss_pred ccccccceeEcCeEEEecCccchhcccCCCccCCc---cccccCc---------------cceEEE-EEEEcccceeeec Q lcl|NC_019514. 280 TILNGEIGTVDQFRLVVVPEMLHWAGAGATVGTNP---GYRETNG---------------KYDIYP-MLCVGAESFTTIG 340 (399) Q Consensus 280 ~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~~~---~~~~t~~---------------~~DVyp-~lV~G~~Afg~v~ 340 (399) ++=.|=-|++.++=++..|.+. +|-++=.++ .++.++. .|-+|- +.++=+ =|.|| T Consensus 333 ~lg~gi~G~~~~ipVvm~~~a~----AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~ySgY~a~a~~~~--~gliP 406 (410) T protein:vir:83 333 RFGQGVMGSISGIPVVMSAALG----SGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYAGYFSTLVVNE--DAIVP 406 (410) T ss_pred ccccchhhhhcccceEEecCCC----cCeeeEeccceeeeeecCCceeEeeCCchhhhhhhheeeeeeccccc--cceee Confidence 1112233566666666655531 121111111 1112210 111221 111111 13333 Q ss_pred cccC Q lcl|NC_019514. 
341 FQTD 344 (399) Q Consensus 341 l~g~ 344 (399) |.|+ T Consensus 407 v~g~ 410 (410) T protein:vir:83 407 LVGS 410 (410) T ss_pred eccC Confidence 3333 No 174 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=46.48 E-value=0.73 Score=21.32 Aligned_cols=290 Identities=13% Similarity=0.097 Sum_probs=149.3 Q ss_pred CCcCCee--ecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccccccccc Q lcl|NC_019514. 1 MASKGML--YNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLDDRNVND 78 (399) Q Consensus 1 ~~~~~~~--~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~~~~~~~ 78 (399) |+-+... -|.-+.-+|.+.+=+-|. ..+--+++-|+|.....++-|.-. -|.|+...|-++.-+ T Consensus 59 m~G~~p~~eV~~~e~mtt~~a~IliP~----vis~v~~Eaaepl~~~~kl~qk~~--L~~Grsm~F~~~g~~-------- 124 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPSAQILIPR----VIVGTMREAAEPLYIGTKMLQKIR--LKSGQSMIFPSIGIM-------- 124 (393) T ss_pred hcCCCchhheehhhhhcCCCcceechh----hhhhhhhhcccchhHHHHHHHHHh--hhcCcceeccchhee-------- Confidence 4422111 111111222222222333 234568888999988888776544 478888888777543 Q ss_pred CCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcch--HHHHH Q lcl|NC_019514. 79 QGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDS--ELFSH 156 (399) Q Consensus 79 ~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~--~l~~~ 156 (399) ..--+.+| |-.|++ -+.+ +|..-++....+||--++|||+.. +|. 
.|++- T Consensus 125 -----Ra~~IgEG---gE~~~~------sld~-----------~T~dsv~~~~gK~G~~Ia~SqEmI---sDSg~Dvin~ 176 (393) T protein:vir:79 125 -----RAYDVAEG---QEIPED------SIDW-----------QTHESPEIRVGKSGIRLRFTDEMI---SDSQWDLMSM 176 (393) T ss_pred -----eecccccc---cccccc------chhh-----------hcCCceeEEechhhhhhhhHHHHh---hcchHHHHHH Confidence 01124444 222221 1111 112235677889999999999865 333 24444 Q ss_pred HHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccc---cCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 157 ISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEG---ATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 157 ~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~---~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) .-+.+++..++-.|....+.+.+-+.+++=+=.++..+..++|+ .-|..+|+++|-+.......++= ++ T Consensus 177 ~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hy-------t~- 248 (393) T protein:vir:79 177 MIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEY-------TP- 248 (393) T ss_pred HHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccC-------Cc- Confidence 44555666666677777888888888888776666666666654 45677999999998876665543 22 Q ss_pred cccCccccCceeEEEeCCCchHHHH--HhhccCCCccceehhhcCCccccccccc--------eeE-cCeEEEecCccch Q lcl|NC_019514. 234 RMIDTRTISAGRVLYIGSELIPLIR--KLVDPFGNAAFVPVHQYADAGTILNGEI--------GTV-DQFRLVVVPEMLH 302 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dir--dl~d~~~~p~fi~v~~Ya~~~~i~~gEI--------G~i-~~vRfV~~~~~~~ 302 (399) =+-+.||-|-..+- ++-.-.--.+|- .|+.+. 
|+-|- |++ -|+-+|.+|.- T Consensus 249 -----------svi~MHPLAWnv~AKna~me~~~~na~g---N~~~~~--~~ts~algp~~i~~~~~~nlnv~~sPfv-- 310 (393) T protein:vir:79 249 -----------SDLMMHPLAWTVFAKNELMGSLQANPYG---NYPAKG--APSSMALGPDSIQGRLPFNFNVNLSPFI-- 310 (393) T ss_pred -----------ceEEEcCchhhhhhhhhhhcceeecccc---ccCccc--cchhhhhchhhhccccccceeEEEeccc-- Confidence 36789998877661 110000011111 233221 32221 222 26888888874 Q ss_pred hcccCCCccCCccccccCccceEEEE-------EEEc--------------------ccceeeeccccCCCCccceEEEe Q lcl|NC_019514. 303 WAGAGATVGTNPGYRETNGKYDIYPM-------LCVG--------------------AESFTTIGFQTDGKTLKFKVTTK 355 (399) Q Consensus 303 ~~~aGa~~~~~~~~~~t~~~~DVyp~-------lV~G--------------------~~Afg~v~l~g~g~~~~~~~ivk 355 (399) ++.+.+.++|+|.+ |.|- ++-||.==|.++..-.-++.|.. T Consensus 311 ------------p~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~ 378 (393) T protein:vir:79 311 ------------PLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISM 378 (393) T ss_pred ------------ccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEeccee Confidence 33344566777764 2221 22333322333322222333322 Q ss_pred cCCCCCCCCCCccchhhHHH Q lcl|NC_019514. 356 MPGEATADRNDPYGEMGFSS 375 (399) Q Consensus 356 ~pG~~~ad~~DPlgQrg~~g 375 (399) +.+=.||+--...-. T Consensus 379 -----~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 379 -----DKSYAEPMLIKNVGN 393 (393) T ss_pred -----ecccccchhhhccCC Confidence 122234432111000 No 175 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=42.72 E-value=0.87 Score=20.91 Aligned_cols=285 Identities=14% Similarity=0.092 Sum_probs=104.0 Q ss_pred CCcCCe---eecCC-----CCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccccc Q lcl|NC_019514. 
1 MASKGM---LYNDP-----NTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPLLD 72 (399) Q Consensus 1 ~~~~~~---~~n~~-----~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl~~ 72 (399) +.++.. .|+.. ..++.+..+-+-|+ .+ ..+.+........+.+.+.+.++. |. .++.... T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~---~~-~~~i~~~l~~~~~l~~~~~v~~~~---g~-~~~~~~~---- 198 (466) T protein:vir:80 131 LIARSEVKEFLAQVRTLAQQKRAVSGAELTIPD---VM-LELLRDNMHRYSKLISKVRLRPLK---GT-ARQNIAG---- 198 (466) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhccccccccH---HH-HHHHHHhhhhhhhhhhheeeeecC---ce-eEeeeec---- Confidence 100000 00000 01111111123333 12 233333333444455555555553 22 1111111 Q ss_pred ccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcchH Q lcl|NC_019514. 73 DRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSDSE 152 (399) Q Consensus 73 ~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D~~ 152 (399) + .+.+.|..+| +.......++..++.++++|+.|+.+|+++++ +++.+ T Consensus 199 -------~-~~~a~wv~E~-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~ 246 (466) T protein:vir:80 199 -------A-IPEGVWTEAV-----------------------ANLNELSLSFSQIEVDGYKVGGFIPIPNSTLE-DSDLN 246 (466) T ss_pred -------C-Ccceeecccc-----------------------cccccccccccceeecceeeeeehhhhHHHHh-cchHH Confidence 1 1123444444 22223334455688899999999999999775 55555 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEE------ecCCCcccccccccccCCceecHHHHH-------------- Q lcl|NC_019514. 153 LFSHISTELMNGAVQLTEAVLQKDLLAGAGTIV------YTGAATQDSEITGEGATPSVVDYDDLM-------------- 212 (399) Q Consensus 153 l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~------yag~ats~~~~t~~~~~~~~vt~~~lr-------------- 212 (399) +...+...|.+.-+... -..+++|.++-. 
+.+.++..............++...+- T Consensus 247 l~~~i~~~la~~~~~~~----~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (466) T protein:vir:80 247 LADEILDAIGQAIGFAL----DKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFS 322 (466) T ss_pred HHHHHHHHHHHHHHHHH----hhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHH Confidence 77777777776655333 345566654421 111111100000000111222222222 Q ss_pred HHHHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCe Q lcl|NC_019514. 213 RLSITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQF 292 (399) Q Consensus 213 ~a~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~v 292 (399) +++..+....++ ..-+.++..+++.+.+.|+.+.... ++....+..=++. ..+-|. T Consensus 323 ~~~~~~~~~~~~---------------~~~~~~~w~~~~~~~~~l~~~~~~~-~~~g~~~~~~~~~--------~~i~G~ 378 (466) T protein:vir:80 323 ELVLKLSKARAN---------------YSNGMKFWAMSSNTHAVLMSKAITF-NSAGALVASLNNT--------MPIVGG 378 (466) T ss_pred HHHHHHHhhhcc---------------ccCCceeEEecchhHHHhhcccccc-cCCccccccCCCc--------cccccc Confidence 221111111111 1123466678999888887764221 1111111111111 123344 Q ss_pred EEEecCccchhcccCCCccCCccccc----c-------------CccceEEEEE------EEcccceeeeccccCCCCcc Q lcl|NC_019514. 293 RLVVVPEMLHWAGAGATVGTNPGYRE----T-------------NGKYDIYPML------CVGAESFTTIGFQTDGKTLK 349 (399) Q Consensus 293 RfV~~~~~~~~~~aGa~~~~~~~~~~----t-------------~~~~DVyp~l------V~G~~Afg~v~l~g~g~~~~ 349 (399) .+|.++.|-. |.....+.-.+. . .+.. +|-.. ++=.+||-.+-+..- .. T Consensus 379 pvv~s~~~~~----~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~-~~r~~~r~dg~~~~~~afv~~~~~~~---~~ 450 (466) T protein:vir:80 379 DIVILDFIPD----NDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQT-VFKGTARYDGKPVFGEGFVAVNIANA---NP 450 (466) T ss_pred ceeecCccCc----cceeeeccccEEEEeecceEEEechhhhhhcCcE-EEEEEEEEccEEeccCceEEEEecCC---Cc Confidence 5555554411 000000000000 0 0000 11111 111234433322221 12 Q ss_pred ceEEEecCCCCCCCCCCc Q lcl|NC_019514. 
350 FKVTTKMPGEATADRNDP 367 (399) Q Consensus 350 ~~~ivk~pG~~~ad~~DP 367 (399) +.....-|-+ ++.-|- T Consensus 451 ~~~~~~~~~~--~~~~~~ 466 (466) T protein:vir:80 451 TTSITFAPDE--ANVPEV 466 (466) T ss_pred ccceeeecCc--CcCCCC Confidence 3333333332 222222 No 176 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=42.56 E-value=0.88 Score=20.89 Aligned_cols=288 Identities=19% Similarity=0.238 Sum_probs=127.8 Q ss_pred CCcCCee---ecCCC------CcccccccccccceehhhhhHHHHHHH--HHHHHhhhhcccccccccCCCEEEEEEccc Q lcl|NC_019514. 1 MASKGML---YNDPN------TTPSGIDAPDGKQMNTFFWWKKALIEA--RKDQYFMPLADVVSMPKNYGKEIRVYHYIP 69 (399) Q Consensus 1 ~~~~~~~---~n~~~------~t~tT~~~~i~p~m~~~y~~kk~L~~A--~p~lv~~~fA~~~~mPkN~GktIk~rry~p 69 (399) +..|+.- +|... +.+|+.... -+.. .. .|.|.++ +-.--|.+|.-...++.=+ ..++.+... T Consensus 376 L~~rg~~~~~~~~~~~~~~a~~htTSDFp~---IL~~-~~-nk~l~~~y~~a~~t~~~~~~~~~~~DFk--~~~~~~lg~ 448 (693) T protein:vir:95 376 LVDRGIGVASLNAPQMVGLAFTHTSSDFGL---ILLD-VA-NKSVLAGWEEAEETFPLWTKSGILTDFK--PARRVGLGE 448 (693) T ss_pred HHhcCCccCCCCHHHHHHHHHhcCcchhHH---HHHH-HH-HHHHHHHHHhhhhHHHHHhccCCCCccc--ccceeecCC Confidence 1112211 12111 011221111 1111 11 2344442 1222356666655555433 223334443 Q ss_pred cccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhc Q lcl|NC_019514. 70 LLDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDS 149 (399) Q Consensus 70 l~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~ 149 (399) +.+=. ++.|+- -+ |..+++|.+ -+..|+.||--+.||.+.. 
.-- T Consensus 449 ~~~L~------------~V~E~g---Ey--------k~~t~~e~~------------e~~~l~tyG~~~~iTRqai-IND 492 (693) T protein:vir:95 449 FSSLR------------QVREGA---EY--------KYVTLGERG------------EQIILATYGELFSITRQAI-IND 492 (693) T ss_pred CCChh------------hcCCCC---ce--------eeeecCCcc------------ceeehhhcCCeeeecHHhh-hcc Confidence 32211 222221 11 111222211 2356999999999999743 222 Q ss_pred chHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccc----cccccccCCceecHHHHHHHHHHHHhccCcc Q lcl|NC_019514. 150 DSELFSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDS----EITGEGATPSVVDYDDLMRLSITLDENRTPK 225 (399) Q Consensus 150 D~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~----~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~ 225 (399) |=.++..+-+.+|+.|..+..+....- |++- -+++-|.+=..+ .++ ++.+.++++.|-.+...+..++... T Consensus 493 DLga~~~ip~~~g~aA~~~~~~~vy~~-L~~N-p~m~DGk~LFhadH~Nl~t---ga~sals~~sl~~a~~am~~qk~~~ 567 (693) T protein:vir:95 493 DLQMLSDIPFKLGQAAKATIGDLVYAV-LTGN-PAMSDGKTLFHADHSNLLT---GAASALSIDSLSKAKTQMATQKAQV 567 (693) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH-HhcC-ccccCCcceeecccccccc---ccccccChHHHHHHHHHHHHhhcch Confidence 336778888899999988887766644 4432 233444332221 122 2346788888888888888877642 Q ss_pred ccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCe-EEEecCccc--- Q lcl|NC_019514. 226 QTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQF-RLVVVPEML--- 301 (399) Q Consensus 226 ~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~v-RfV~~~~~~--- 301 (399) .. -.| + .--|.+.|+ +|+|+++-..+.+.+ ...+|..+ +-.|.+--+.++ ..|..|.+. 
T Consensus 568 ~~--~~g-~---~L~i~P~~l-lvP~~le~~a~~l~~----s~~~~~a~------~~~~~~NP~~~~~~vi~~prL~~~s 630 (693) T protein:vir:95 568 EK--GKG-R---TLNIRPGFV-LTPVALEDKANQIIN----SESVPGAD------VNSGIVNPIRAFAQVIGEPRLDDAS 630 (693) T ss_pred hc--cCC-c---eeecccceE-EecchHHHHHHHHhc----cccccccc------cccccccchhccccccccceecCCC Confidence 11 011 1 123566665 569999999988743 33343211 122222223332 356666662 Q ss_pred --hhcccCCCccCCccc---cccCccceEEEEEEEcccceeeecccc------CCCCccceEEEecCCC Q lcl|NC_019514. 302 --HWAGAGATVGTNPGY---RETNGKYDIYPMLCVGAESFTTIGFQT------DGKTLKFKVTTKMPGE 359 (399) Q Consensus 302 --~~~~aGa~~~~~~~~---~~t~~~~DVyp~lV~G~~Afg~v~l~g------~g~~~~~~~ivk~pG~ 359 (399) .|-=+ ++++. ++. +=.+.. =|.|-- ++-|.+-+++- +-+..-+.=++|+||. T Consensus 631 ~~~Wyl~-a~~~~-dtie~~yL~G~~---~P~ie~-~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 631 ATAWYMA-AKKGS-DTIEVAYLDGVD---TPYLEQ-QEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred CCceEEe-cCCCC-CeEEEEEecCCC---CCeEee-cCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 45321 12221 111 000000 012211 22233322222 1111223335889984 No 177 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=29.30 E-value=1.7 Score=19.36 Aligned_cols=286 Identities=15% Similarity=0.142 Sum_probs=132.9 Q ss_pred CCcCCeeecCCCCcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc---ccccccc Q lcl|NC_019514. 1 MASKGMLYNDPNTTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL---LDDRNVN 77 (399) Q Consensus 1 ~~~~~~~~n~~~~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl---~~~~~~~ 77 (399) |..- . ..+...++.+.+..-| .+.+.+-..+| +.-++-.|..|...|.+||-+++... 
.++ T Consensus 1 M~~e-------~--nl~~~~dL~~a~siDF--~~~f~~~i~~L-~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda---- 64 (303) T protein:vir:10 1 MSAE-------N--NLINVEALGKAKSIDF--ANKLGVGLNKL-FEALAIQNKIPMNVGSALKQYRFKVEDSEKPN---- 64 (303) T ss_pred CCCC-------c--CCcchhhcccceeehh--hhhhhhhHHHH-HHHhhhhccccccCCceeeeeeeeceeecccc---- Confidence 4321 1 1233344556666555 33344433333 23355667788889999998887432 111 Q ss_pred cCCCCCCCceeccCccccccccccccccccccccccccccccccceee---eeEeeeeeecceeehhhhhh-hhhcchHH Q lcl|NC_019514. 78 DQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRI---SRVGRIQKFGFFTEFSQESL-DFDSDSEL 153 (399) Q Consensus 78 ~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~---~~~~~l~qYG~~~e~Td~~~-d~~~D~~l 153 (399) .-..||.. +.| ..++++ ..+.+++||+- ..||++. +.--++.+ T Consensus 65 --------~dVaEGe~--------------Ipl---------skvt~~~~~t~~~~~kK~rK--~tTdEAIqlsGyg~aV 111 (303) T protein:vir:10 65 --------GDVAEGDV--------------IPL---------TKVTREQVDITELQFAKYRK--STSAEAIQAHGYDLAI 111 (303) T ss_pred --------ccccCCcc--------------cch---------hhheeeecceEEEEeecccc--cccHHHHHhhcCCchh Confidence 12333321 112 223332 57889999988 4499985 33333335 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhcCCeEEecCCCcccccccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_019514. 154 FSHISTELMNGAVQLTEAVLQKDLLAGAGTIVYTGAATQDSEITGEGATPSVVDYDDLMRLSITLDENRTPKQTKVITGS 233 (399) Q Consensus 154 ~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v~yag~ats~~~~t~~~~~~~~vt~~~lr~a~~~L~~nrap~~t~~i~~s 233 (399) -+. .+.|++.-. .-+..|+++... .++.+...+..+.++.+-|..|.-....+-.-. .+. T Consensus 112 get-d~qL~~~Iq----~kIdnd~~~~lk----------taT~t~~~t~~t~~s~~glq~Al~~~~~kl~~~----~ed- 171 (303) T protein:vir:10 112 NQT-DNEMIKYVQ----KKFRAKFFETLK----------SAIENGKRTNKTKLSAENLQGALSKGRANLSVL----LDD- 171 (303) T ss_pred HHH-HHHHHHHHH----hhhhHHHHHHHh----------hcccccccccceeecHHHHHHHHHhhhhhcccc----ccc- Confidence 554 333332211 233444443321 111123345567789988888875443222211 010 Q ss_pred cccCccccCceeEEEeCCCchHHHHHhhccCCCccce-ehhhcCCccccccccceeEcCeEEEecCccchhcccCCCccC Q lcl|NC_019514. 
234 RMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFV-PVHQYADAGTILNGEIGTVDQFRLVVVPEMLHWAGAGATVGT 312 (399) Q Consensus 234 ~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi-~v~~Ya~~~~i~~gEIG~i~~vRfV~~~~~~~~~~aGa~~~~ 312 (399) + ..+|.||+|.-..++|. ++... ...++|.. ...+ +=|+-+|+++.... | T Consensus 172 ---~-----~~~V~FvNP~Daa~yl~------~A~i~~~~t~fG~n--~L~n----fLG~~II~S~kv~~----G----- 222 (303) T protein:vir:10 172 ---E-----ITPIAFVNPNDTAEYLA------NGFINSTGAQFGVN--LLTP----YVGVKIVEFADVPQ----G----- 222 (303) T ss_pred ---c-----ccEEEEEchHHHHHHhh------cCCcchhhhhhhhh--hhhh----hhcceEEEeccCCC----c----- Confidence 1 13799999999999874 23332 22456554 2332 66666777766422 2 Q ss_pred CccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCccchhhHHHHHHHHHHhhccccceEE Q lcl|NC_019514. 313 NPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPYGEMGFSSIKWYYGTLILRPERLAL 392 (399) Q Consensus 313 ~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ 392 (399) ..+.+-..++-+|..-+-|+-+ ...+|-.+..+ +-=+.. ++..++.++- -.+..+..|-+|+.-= T Consensus 223 -~~~~T~~~Ni~~ay~~~~g~l~-~~f~~t~D~tg--lIGv~h----------~~~~~~~t~e-T~~~~~~~lfpE~~dg 287 (303) T protein:vir:10 223 -EVWMTVAENLNVAYANPRGELS-RAFAFATDATG--FVGVLH----------DIQPQRLTSD-TIYASAISMFPENIDA 287 (303) T ss_pred -eEEEeeccceEEEEecCchhhh-hhhhhcccccc--ceEEEe----------ccccceeeeh-hHhHhHHHhcccccce Confidence 2233333445455555566433 33333333211 111111 2222222111 1233444444554321 Q ss_pred E---EE-ec---cC Q lcl|NC_019514. 393 V---KT-VA---PL 399 (399) Q Consensus 393 i---e~-~a---~~ 399 (399) | .. +. 
.+ T Consensus 288 iv~~ti~~~e~~~~ 301 (303) T protein:vir:10 288 VIKVTIKKDEAGEL 301 (303) T ss_pred EEEEEEeccccCCC Confidence 1 11 11 11 No 178 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=26.26 E-value=2 Score=18.98 Aligned_cols=290 Identities=11% Similarity=0.037 Sum_probs=125.1 Q ss_pred CCcC-C--------eeecCCC-CcccccccccccceehhhhhHHHHHHHHHHHHhhhhcccccccccCCCEEEEEEcccc Q lcl|NC_019514. 1 MASK-G--------MLYNDPN-TTPSGIDAPDGKQMNTFFWWKKALIEARKDQYFMPLADVVSMPKNYGKEIRVYHYIPL 70 (399) Q Consensus 1 ~~~~-~--------~~~n~~~-~t~tT~~~~i~p~m~~~y~~kk~L~~A~p~lv~~~fA~~~~mPkN~GktIk~rry~pl 70 (399) .+++ . -.||.-- .+..+..+-+-|+ -+.++.+++....=.+.+.+...++. |. .++-+- T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~i~~~--- 127 (377) T protein:vir:96 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPE----ETMVQVFDDLVAEHPLLKVINFKNTS---LR-LKALTA--- 127 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcCCCCCCceecCH----HHHHHHHHHHHhhhhhhhhceeEecC---Cc-eEEEEe--- Confidence 1110 0 0122110 1112222223344 13456666655555667778777764 32 221111 Q ss_pred ccccccccCCCCCCCceeccCccccccccccccccccccccccccccccccceeeeeEeeeeeecceeehhhhhhhhhcc Q lcl|NC_019514. 71 LDDRNVNDQGIDAAGATIVNGNLYGSSKDIGTIVGKIPTLTETGGRVNRVGFSRISRVGRIQKFGFFTEFSQESLDFDSD 150 (399) Q Consensus 71 ~~~~~~~~~gi~aaga~lt~g~~~G~s~d~~~i~~~~~~vt~~g~rvn~~~~t~~~~~~~l~qYG~~~e~Td~~~d~~~D 150 (399) .+ .+...|..++ +.+......+...++...++++.+..+|+++++-... 
T Consensus 128 --------~~-~~~a~wv~e~----------------------~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~ 176 (377) T protein:vir:96 128 --------ET-SGTAVWGDIF----------------------GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPK 176 (377) T ss_pred --------cC-CcceeEeecc----------------------cccccccCccceeEeeeeeeEEeechhhHHHhhcchh Confidence 00 1233455543 1122222345567888899999999999997754333 Q ss_pred hHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCeE------EecCCCccccccccc----------ccCCceecHHHHHHH Q lcl|NC_019514. 151 SELFSHISTELMNGAVQLTEAVLQKDLLAGAGTI------VYTGAATQDSEITGE----------GATPSVVDYDDLMRL 214 (399) Q Consensus 151 ~~l~~~~~~~lg~~a~~~~e~~l~~~~lag~~~v------~yag~ats~~~~t~~----------~~~~~~vt~~~lr~a 214 (399) .|.+.+...+.+.-+.. +-..+++|.++- -+.++.+.......- ...-+.++.+.+-++ T Consensus 177 -~le~~i~~~l~~~~~~~----~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (377) T protein:vir:96 177 -WLKQFITEQLKEAIAVA----LELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVEL 251 (377) T ss_pred -hHHHHHHHHHHHHHHHH----HhhceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHH Confidence 36666666666554433 334456665522 222211111110000 000112333444444 Q ss_pred HHHHHhccCccccceeccccccCccccCceeEEEeCCCchHHHHHhhccCCCccceehhhcCCccccccccceeEcCeEE Q lcl|NC_019514. 215 SITLDENRTPKQTKVITGSRMIDTRTISAGRVLYIGSELIPLIRKLVDPFGNAAFVPVHQYADAGTILNGEIGTVDQFRL 294 (399) Q Consensus 215 ~~~L~~nrap~~t~~i~~s~~~~T~~I~~~yv~~~h~d~~~dirdl~d~~~~p~fi~v~~Ya~~~~i~~gEIG~i~~vRf 294 (399) ...|..+-...-.. . ..-....-+.++||....+++. ..-|.+ .-|.+.+++. -++++ T Consensus 252 ~~~l~~~~~~~~~~----~----~~~~~~~a~~~mn~~t~~~~~~------~~~~~~--~~G~~~~~l~------~p~~v 309 (377) T protein:vir:96 252 LVPVMKHLSVNDKK----H----PLKIAGQVKLLLNPEDRWTLEA------KFTSRN--QFGEYVTVLP------HGITI 309 (377) T ss_pred HHHHHHhhcccccc----c----cccccCceEEEEchhhHHhccc------cccccC--CCCCceeccC------CCceE Confidence 43443322211000 0 0001122467789887777631 122333 1233333321 35678 Q ss_pred EecCccchhcccCCCccCCccccccCccceEEEEEEEcccceeeeccccCCCCccceEEEecCCCCCCCCCCcc---chh Q lcl|NC_019514. 
295 VVVPEMLHWAGAGATVGTNPGYRETNGKYDIYPMLCVGAESFTTIGFQTDGKTLKFKVTTKMPGEATADRNDPY---GEM 371 (399) Q Consensus 295 V~~~~~~~~~~aGa~~~~~~~~~~t~~~~DVyp~lV~G~~Afg~v~l~g~g~~~~~~~ivk~pG~~~ad~~DPl---gQr 371 (399) |+++.|.. | + ++||.-.+=.+..++ + +.+ +.. .+-+ +|+ T Consensus 310 ~~s~~~p~----~----------------~----i~fgdf~~Y~i~~r~-~----~~i--~~~-------~~~~~~~d~~ 351 (377) T protein:vir:96 310 LESLAVET----G----------------K----AIAFVANRYDAFMAT-A----STI--EEY-------DQTFAMEDLQ 351 (377) T ss_pred EecCCCCc----c----------------c----EEEEEcCcEEEEEec-c----cEE--Eee-------hhhhhhcCCe Confidence 87766521 0 1 345543332333222 1 111 111 0122 344 Q ss_pred hHHHHHHHHHHhhccccceEEEEEecc Q lcl|NC_019514. 372 GFSSIKWYYGTLILRPERLALVKTVAP 398 (399) Q Consensus 372 g~~gwK~~~~~~iLn~~~m~~ie~~a~ 398 (399) +|.++ .++..++.+++=++.+..+.- T Consensus 352 ~f~~~-~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 352 LYLTK-NYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEE-EEEcCEEecCCcEEEEEEecC Confidence 44332 456667777777777777777 Done!