Query lcl|NC_020862.1_cdsid_YP_007675801.1 [gene=SUFG_00013] [protein=major capsid coat protein] [protein_id=YP_007675801.1] [location=9216..10433] Match_columns 405 No_of_seqs 30 out of 32 Neff 4.0 Searched_HMMs 1612 Date Thu Nov 7 18:51:33 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_12 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_12_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95875 Length: 401 100.0 3E-185 2E-188 1032.2 29.0 397 2-405 1-401 (401) 2 protein:vir:105334 Length: 276 99.6 8.9E-18 5.5E-21 114.1 15.4 270 9-405 1-271 (276) 3 protein:vir:95898 Length: 274 99.6 2.2E-16 1.4E-19 106.4 16.0 270 5-405 1-271 (274) 4 protein:vir:96262 Length: 274 99.6 2.2E-16 1.4E-19 106.4 16.0 270 5-405 1-271 (274) 5 protein:vir:3613 Length: 272 # 99.5 3.5E-16 2.2E-19 105.3 15.7 271 9-404 1-272 (272) 6 protein:vir:96123 Length: 274 99.5 6.6E-16 4.1E-19 103.8 16.3 270 9-405 1-271 (274) 7 protein:vir:97433 Length: 274 99.5 1E-15 6.3E-19 102.8 16.6 270 1-405 1-273 (274) 8 protein:vir:94494 Length: 274 99.5 1E-15 6.3E-19 102.8 16.6 270 1-405 1-273 (274) 9 protein:vir:1239 Length: 274 # 99.5 7.9E-16 4.9E-19 103.4 15.9 270 9-405 1-273 (274) 10 protein:vir:93742 Length: 274 99.5 9.5E-16 5.9E-19 102.9 16.0 268 1-405 1-273 (274) 11 protein:vir:96833 Length: 275 99.5 1.2E-15 7.4E-19 102.4 15.2 270 5-405 1-271 (275) 12 protein:vir:94622 Length: 341 99.5 1.3E-14 7.8E-18 96.8 19.0 325 3-405 1-340 (341) 13 protein:vir:3033 Length: 272 # 99.4 1.2E-14 7.7E-18 96.8 17.1 269 9-405 1-270 (272) 14 protein:vir:9820 Length: 272 # 99.4 1.2E-14 7.7E-18 96.8 17.1 269 9-405 1-270 (272) 15 protein:vir:80930 Length: 278 99.4 9.2E-15 5.7E-18 97.5 15.5 277 9-405 1-278 (278) 16 protein:vir:93696 Length: 364 99.3 8.8E-13 5.5E-16 86.7 19.4 313 1-405 1-362 (364) 17 protein:vir:80180 Length: 381 99.2 1.9E-12 1.2E-15 84.9 16.4 322 1-405 1-339 (381) 18 protein:vir:739 Length: 231 # 99.2 5.1E-13 3.2E-16 88.0 12.4 231 47-404 1-231 (231) 19 protein:vir:105610 Length: 430 99.1 3.9E-11 2.4E-14 77.7 21.0 340 1-405 1-425 (430) 20 protein:vir:95107 Length: 270 99.1 4.5E-12 2.8E-15 82.8 14.1 265 1-405 1-269 (270) 21 protein:vir:7990 Length: 273 # 99.0 1E-10 6.5E-14 75.3 17.8 266 1-365 1-273 (273) 22 protein:vir:105905 Length: 304 99.0 1.5E-10 9.3E-14 74.5 17.4 294 1-403 1-304 (304) 23 protein:vir:94142 Length: 304 99.0 1.5E-10 9.3E-14 74.5 17.4 294 1-403 1-304 (304) 24 protein:vir:108303 Length: 418 98.9 2.1E-10 1.3E-13 73.7 17.2 306 1-405 1-418 (418) 25 protein:vir:80213 Length: 334 98.9 1.3E-09 7.8E-13 69.4 21.1 312 1-405 1-333 (334) 26 protein:vir:105822 Length: 273 98.9 4.9E-10 3.1E-13 71.6 17.9 264 16-365 1-273 (273) 27 protein:vir:102605 Length: 273 98.9 4.9E-10 3.1E-13 71.6 17.9 264 16-365 1-273 (273) 28 protein:vir:41 Length: 299 # N 98.9 7.3E-10 4.5E-13 70.7 18.4 292 3-405 1-299 (299) 29 protein:vir:78739 Length: 332 98.9 1E-09 6.4E-13 69.9 18.9 318 1-402 1-332 (332) 30 protein:vir:7771 Length: 330 # 98.9 1.3E-09 8.1E-13 69.3 18.6 307 1-405 1-328 (330) 31 protein:vir:104439 Length: 404 98.8 4.4E-09 2.8E-12 66.4 21.1 335 1-405 1-404 (404) 32 protein:vir:3298 Length: 404 # 98.8 4.4E-09 2.8E-12 66.4 21.1 335 1-405 1-404 (404) 33 protein:vir:819 Length: 404 # 98.8 4.4E-09 2.8E-12 66.4 21.1 335 1-405 1-404 (404) 34 protein:vir:10123 Length: 404 98.8 4.4E-09 2.8E-12 66.4 21.1 335 1-405 1-404 (404) 35 protein:vir:1541 Length: 347 # 98.8 2.4E-09 1.5E-12 67.8 18.2 324 1-405 1-344 (347) 36 protein:vir:95763 Length: 297 98.8 2.1E-09 1.3E-12 68.2 17.5 288 1-405 1-297 (297) 37 protein:vir:10450 Length: 344 98.8 8.7E-09 5.4E-12 64.8 20.6 322 1-404 1-344 (344) 38 protein:vir:79987 Length: 415 98.8 1.9E-09 1.2E-12 68.3 16.8 291 1-405 113-405 (415) 39 protein:vir:98339 Length: 415 98.8 1.9E-09 1.2E-12 68.3 16.8 291 1-405 113-405 (415) 40 protein:vir:81100 Length: 415 98.8 1.9E-09 1.2E-12 68.3 16.8 291 1-405 113-405 (415) 41 protein:vir:94576 Length: 347 98.7 7.3E-09 4.5E-12 65.2 19.0 322 1-404 1-347 (347) 42 protein:vir:2201 Length: 345 # 98.7 2E-08 1.2E-11 62.8 21.3 324 1-404 1-345 (345) 43 protein:vir:485 Length: 407 # 98.7 1.1E-09 6.8E-13 69.7 14.3 290 1-405 98-401 (407) 44 protein:vir:103323 Length: 364 98.7 8.4E-09 5.2E-12 64.9 19.1 316 3-405 1-340 (364) 45 protein:vir:2770 Length: 318 # 98.7 5.3E-09 3.3E-12 66.0 17.0 260 1-319 1-318 (318) 46 protein:vir:9410 Length: 415 # 98.7 3.7E-09 2.3E-12 66.8 15.9 291 1-405 113-405 (415) 47 protein:vir:100247 Length: 425 98.7 1.1E-09 6.9E-13 69.7 13.1 287 1-405 126-425 (425) 48 protein:vir:94771 Length: 298 98.7 6.5E-09 4.1E-12 65.5 16.9 285 9-403 1-298 (298) 49 protein:vir:3364 Length: 347 # 98.7 1.1E-08 6.9E-12 64.2 18.1 324 1-405 1-344 (347) 50 protein:vir:4830 Length: 397 # 98.7 6.3E-09 3.9E-12 65.5 16.5 282 1-405 98-386 (397) 51 protein:vir:104085 Length: 320 98.6 1.3E-08 8E-12 63.9 18.0 300 1-405 1-319 (320) 52 protein:vir:4511 Length: 409 # 98.6 4.2E-09 2.6E-12 66.5 14.5 292 1-405 103-407 (409) 53 protein:vir:8187 Length: 311 # 98.6 2E-08 1.3E-11 62.7 17.8 298 5-405 1-311 (311) 54 protein:vir:4339 Length: 395 # 98.6 1.9E-08 1.2E-11 63.0 17.5 280 1-404 106-395 (395) 55 protein:vir:104256 Length: 458 98.6 1.2E-08 7.2E-12 64.1 16.1 294 1-404 155-458 (458) 56 protein:vir:4456 Length: 401 # 98.6 1.6E-09 9.8E-13 68.8 11.3 290 1-404 99-401 (401) 57 protein:vir:96223 Length: 324 98.6 2.2E-08 1.3E-11 62.6 17.3 288 1-405 20-316 (324) 58 protein:vir:97148 Length: 324 98.6 4.2E-08 2.6E-11 61.0 18.8 290 1-405 20-316 (324) 59 protein:vir:102655 Length: 322 98.6 4.8E-08 3E-11 60.7 18.8 316 3-405 1-320 (322) 60 protein:vir:4600 Length: 415 # 98.6 9.9E-09 6.1E-12 64.5 15.0 291 1-405 113-405 (415) 61 protein:vir:4700 Length: 415 # 98.6 9.9E-09 6.1E-12 64.5 15.0 291 1-405 113-405 (415) 62 protein:vir:78523 Length: 338 98.6 2.7E-08 1.7E-11 62.1 17.3 308 1-405 10-337 (338) 63 protein:vir:3136 Length: 322 # 98.5 1.2E-08 7.3E-12 64.1 15.0 304 3-405 1-319 (322) 64 protein:vir:174 Length: 423 # 98.5 4.2E-08 2.6E-11 61.0 17.8 301 16-404 1-423 (423) 65 protein:vir:99675 Length: 324 98.5 2.8E-08 1.7E-11 62.0 16.6 281 44-405 1-304 (324) 66 protein:vir:102119 Length: 404 98.5 3.1E-08 1.9E-11 61.8 16.8 295 1-405 103-403 (404) 67 protein:vir:2344 Length: 397 # 98.5 3.3E-08 2.1E-11 61.6 16.8 296 3-405 1-307 (397) 68 protein:vir:94711 Length: 347 98.5 5.1E-08 3.2E-11 60.6 17.5 322 1-405 1-347 (347) 69 protein:vir:1638 Length: 298 # 98.5 4E-08 2.5E-11 61.2 16.9 279 9-403 1-298 (298) 70 protein:vir:8885 Length: 347 # 98.5 6.3E-08 3.9E-11 60.1 17.9 323 1-405 1-347 (347) 71 protein:vir:4953 Length: 397 # 98.5 3.6E-08 2.3E-11 61.4 16.5 283 1-405 98-386 (397) 72 protein:vir:80684 Length: 315 98.5 9.2E-08 5.7E-11 59.1 18.6 302 1-405 1-308 (315) 73 protein:vir:9309 Length: 324 # 98.5 5.7E-08 3.5E-11 60.3 17.3 289 1-405 20-316 (324) 74 protein:vir:78223 Length: 333 98.5 6.6E-08 4.1E-11 59.9 17.4 310 1-404 8-333 (333) 75 protein:vir:99075 Length: 392 98.5 3E-08 1.9E-11 61.8 15.3 312 1-405 1-335 (392) 76 protein:vir:99749 Length: 324 98.4 1.1E-07 7.1E-11 58.6 17.8 290 1-405 20-316 (324) 77 protein:vir:4856 Length: 293 # 98.4 1.4E-07 8.7E-11 58.1 17.8 278 4-405 1-282 (293) 78 protein:vir:105522 Length: 423 98.4 9.2E-08 5.7E-11 59.2 16.7 304 1-404 1-423 (423) 79 protein:vir:3525 Length: 423 # 98.4 2.1E-07 1.3E-10 57.2 18.3 300 16-405 1-334 (423) 80 protein:vir:2430 Length: 318 # 98.4 1.1E-07 6.5E-11 58.8 16.1 298 1-405 8-314 (318) 81 protein:vir:8102 Length: 543 # 98.3 1.7E-07 1.1E-10 57.7 16.6 299 1-405 237-543 (543) 82 protein:vir:8420 Length: 477 # 98.3 3.6E-07 2.2E-10 55.9 18.1 310 1-405 145-475 (477) 83 protein:vir:78830 Length: 324 98.3 4.8E-07 3E-10 55.2 18.5 290 1-405 20-318 (324) 84 protein:vir:96392 Length: 324 98.3 4.8E-07 3E-10 55.2 18.5 290 1-405 20-318 (324) 85 protein:vir:4997 Length: 397 # 98.3 2.1E-07 1.3E-10 57.2 16.4 282 1-405 98-386 (397) 86 protein:vir:9759 Length: 303 # 98.3 3.3E-07 2.1E-10 56.1 17.3 287 10-404 1-303 (303) 87 protein:vir:4226 Length: 326 # 98.3 3.5E-07 2.1E-10 56.0 17.0 304 1-405 9-324 (326) 88 protein:vir:78935 Length: 335 98.3 1.9E-06 1.2E-09 52.0 21.6 314 1-405 1-329 (335) 89 protein:vir:103955 Length: 324 98.3 4.9E-07 3E-10 55.2 17.6 290 1-405 20-316 (324) 90 protein:vir:99920 Length: 311 98.3 1.9E-07 1.2E-10 57.5 15.3 297 9-403 1-311 (311) 91 protein:vir:191 Length: 385 # 98.3 2.3E-07 1.4E-10 56.9 15.7 285 1-405 97-385 (385) 92 protein:vir:1886 Length: 385 # 98.3 2.3E-07 1.4E-10 56.9 15.7 285 1-405 97-385 (385) 93 protein:vir:1268 Length: 397 # 98.2 1.9E-07 1.2E-10 57.5 15.0 278 1-404 102-397 (397) 94 protein:vir:3991 Length: 404 # 98.2 4.2E-07 2.6E-10 55.6 16.8 282 1-405 105-394 (404) 95 protein:vir:3845 Length: 395 # 98.2 2.2E-07 1.3E-10 57.1 14.8 280 1-405 101-386 (395) 96 protein:vir:81227 Length: 413 98.2 6.6E-07 4.1E-10 54.5 17.1 296 1-405 110-411 (413) 97 protein:vir:100135 Length: 418 98.2 4.8E-07 3E-10 55.2 15.6 279 1-405 126-416 (418) 98 protein:vir:81160 Length: 371 98.1 4.1E-07 2.5E-10 55.6 14.9 284 1-404 80-371 (371) 99 protein:vir:100057 Length: 375 98.1 7.3E-07 4.5E-10 54.2 16.1 330 3-405 1-371 (375) 100 protein:vir:1328 Length: 392 # 98.1 6.3E-07 3.9E-10 54.6 15.3 283 1-405 107-392 (392) 101 protein:vir:7409 Length: 408 # 98.1 1.2E-06 7.3E-10 53.1 16.6 283 1-405 103-396 (408) 102 protein:vir:105004 Length: 392 98.1 1.6E-06 9.9E-10 52.4 17.2 282 1-405 99-385 (392) 103 protein:vir:107593 Length: 392 98.1 1.6E-06 9.9E-10 52.4 17.2 282 1-405 99-385 (392) 104 protein:vir:102873 Length: 392 98.1 1.6E-06 9.9E-10 52.4 17.2 282 1-405 99-385 (392) 105 protein:vir:102082 Length: 392 98.1 1.6E-06 9.9E-10 52.4 17.2 282 1-405 99-385 (392) 106 protein:vir:9574 Length: 300 # 98.1 1.6E-06 9.8E-10 52.4 17.2 289 5-404 1-300 (300) 107 protein:vir:81070 Length: 390 98.1 7E-07 4.4E-10 54.3 14.8 281 1-402 106-390 (390) 108 protein:vir:105374 Length: 423 98.1 2.6E-06 1.6E-09 51.2 17.8 296 16-405 1-334 (423) 109 protein:vir:97053 Length: 390 98.0 1.2E-06 7.3E-10 53.1 15.4 281 1-402 106-390 (390) 110 protein:vir:1025 Length: 408 # 98.0 1.2E-06 7.7E-10 53.0 15.2 282 1-405 103-396 (408) 111 protein:vir:80376 Length: 435 98.0 4.5E-06 2.8E-09 49.9 18.2 301 1-405 124-434 (435) 112 protein:vir:6324 Length: 335 # 98.0 9.1E-06 5.7E-09 48.2 22.2 317 1-405 1-329 (335) 113 protein:vir:6242 Length: 390 # 97.9 9.9E-07 6.1E-10 53.5 13.1 281 1-405 97-390 (390) 114 protein:vir:101607 Length: 379 97.9 4E-06 2.5E-09 50.2 16.1 276 1-404 98-379 (379) 115 protein:vir:1583 Length: 351 # 97.9 1E-05 6.2E-09 48.0 18.1 283 1-405 1-298 (351) 116 protein:vir:94673 Length: 419 97.9 8.3E-06 5.1E-09 48.4 17.6 292 1-405 116-418 (419) 117 protein:vir:5974 Length: 324 # 97.9 1.4E-05 8.8E-09 47.2 20.4 283 1-405 1-294 (324) 118 protein:vir:105038 Length: 428 97.9 6.6E-06 4.1E-09 49.0 16.7 297 1-404 113-428 (428) 119 protein:vir:6212 Length: 434 # 97.8 3.3E-06 2E-09 50.7 14.9 294 1-405 131-432 (434) 120 protein:vir:1383 Length: 421 # 97.8 2.6E-06 1.6E-09 51.2 14.4 272 1-405 109-384 (421) 121 protein:vir:100172 Length: 394 97.8 6.1E-06 3.8E-09 49.2 16.0 280 1-405 104-385 (394) 122 protein:vir:10364 Length: 390 97.8 5.3E-06 3.3E-09 49.5 15.5 281 1-402 104-390 (390) 123 protein:vir:97031 Length: 402 97.8 2.1E-05 1.3E-08 46.2 21.0 310 3-405 1-341 (402) 124 protein:vir:1433 Length: 435 # 97.6 3.5E-05 2.2E-08 45.0 18.6 297 1-405 124-434 (435) 125 protein:vir:96762 Length: 632 97.6 7.9E-06 4.9E-09 48.6 13.9 281 1-403 347-632 (632) 126 protein:vir:3870 Length: 400 # 97.6 5.3E-06 3.3E-09 49.5 12.6 269 1-405 125-399 (400) 127 protein:vir:95376 Length: 425 97.6 1.2E-05 7.3E-09 47.6 14.4 280 1-405 130-422 (425) 128 protein:vir:2504 Length: 305 # 97.5 2E-05 1.2E-08 46.3 14.6 293 5-405 1-304 (305) 129 protein:vir:5739 Length: 366 # 97.5 4.7E-05 2.9E-08 44.3 16.3 290 1-404 53-366 (366) 130 protein:vir:9704 Length: 394 # 97.4 2.1E-05 1.3E-08 46.2 13.3 269 1-405 121-391 (394) 131 protein:vir:962 Length: 397 # 97.3 1.1E-05 7E-09 47.7 11.5 269 1-404 128-397 (397) 132 protein:vir:100884 Length: 389 97.3 5.4E-05 3.4E-08 44.0 14.6 276 1-405 105-383 (389) 133 protein:vir:102944 Length: 330 97.0 0.00021 1.3E-07 40.8 18.5 282 1-405 1-308 (330) 134 protein:vir:1084 Length: 437 # 96.7 0.00019 1.2E-07 41.0 12.8 277 1-405 149-428 (437) 135 protein:vir:1781 Length: 221 # 96.5 0.00041 2.6E-07 39.1 13.9 214 121-371 1-221 (221) 136 protein:vir:93616 Length: 645 96.2 0.00079 4.9E-07 37.6 13.5 303 1-405 316-639 (645) 137 protein:vir:105645 Length: 400 96.1 0.00099 6.2E-07 37.1 20.4 315 1-405 1-334 (400) 138 protein:vir:96978 Length: 387 96.1 0.00019 1.2E-07 41.0 9.8 271 1-405 111-382 (387) 139 protein:vir:94424 Length: 387 96.1 0.00019 1.2E-07 41.0 9.8 271 1-405 111-382 (387) 140 protein:vir:2685 Length: 387 # 96.1 0.00019 1.2E-07 41.0 9.8 271 1-405 111-382 (387) 141 protein:vir:100939 Length: 430 95.9 0.0011 6.9E-07 36.8 13.2 307 12-405 1-430 (430) 142 protein:vir:9265 Length: 430 # 95.9 0.0011 6.9E-07 36.8 13.2 307 12-405 1-430 (430) 143 protein:vir:101650 Length: 497 95.5 0.002 1.2E-06 35.4 15.3 315 1-405 144-494 (497) 144 protein:vir:7855 Length: 497 # 95.5 0.002 1.2E-06 35.4 15.3 315 1-405 144-494 (497) 145 protein:vir:4092 Length: 390 # 95.3 0.0023 1.4E-06 35.1 14.9 287 1-405 72-369 (390) 146 protein:vir:80446 Length: 367 95.3 0.0024 1.5E-06 34.9 15.6 308 1-405 1-339 (367) 147 protein:vir:93881 Length: 387 95.1 0.0018 1.1E-06 35.7 11.7 272 1-405 109-382 (387) 148 protein:vir:103886 Length: 302 94.9 0.0033 2E-06 34.2 17.0 287 3-403 1-302 (302) 149 protein:vir:9361 Length: 402 # 94.3 0.0017 1.1E-06 35.7 9.6 271 1-405 124-397 (402) 150 protein:vir:78640 Length: 352 93.9 0.0051 3.2E-06 33.2 11.3 268 1-405 65-347 (352) 151 protein:vir:7019 Length: 401 # 93.5 0.0074 4.6E-06 32.3 18.8 313 1-405 1-338 (401) 152 protein:vir:107120 Length: 329 93.0 0.0091 5.6E-06 31.8 16.5 273 1-405 1-308 (329) 153 protein:vir:2106 Length: 430 # 92.2 0.012 7.6E-06 31.1 16.0 308 1-405 1-430 (430) 154 protein:vir:94800 Length: 319 91.1 0.018 1.1E-05 30.2 15.1 271 1-405 15-297 (319) 155 protein:vir:97331 Length: 319 91.1 0.018 1.1E-05 30.2 15.1 271 1-405 15-297 (319) 156 protein:vir:95963 Length: 395 90.6 0.02 1.3E-05 29.9 11.1 288 1-405 75-377 (395) 157 protein:vir:4197 Length: 314 # 88.8 0.03 1.9E-05 28.9 17.7 292 1-369 4-314 (314) 158 protein:vir:94933 Length: 330 88.2 0.034 2.1E-05 28.6 13.8 298 1-405 16-330 (330) 159 protein:vir:4159 Length: 315 # 86.9 0.042 2.6E-05 28.1 16.0 290 1-393 8-315 (315) 160 protein:vir:97255 Length: 310 86.2 0.047 2.9E-05 27.8 12.4 286 1-404 1-310 (310) 161 protein:vir:79008 Length: 299 85.7 0.051 3.1E-05 27.7 16.8 285 1-405 1-296 (299) 162 protein:vir:79928 Length: 393 84.8 0.057 3.6E-05 27.4 12.8 290 1-381 59-393 (393) 163 protein:vir:98635 Length: 377 79.4 0.096 5.9E-05 26.2 8.3 293 1-404 67-377 (377) 164 protein:vir:9509 Length: 381 # 78.4 0.11 7.1E-05 25.7 11.2 288 1-405 65-369 (381) 165 protein:vir:101291 Length: 381 78.4 0.11 7.1E-05 25.7 11.2 288 1-405 65-369 (381) 166 protein:vir:95512 Length: 693 78.2 0.12 7.2E-05 25.7 12.4 291 1-365 386-693 (693) 167 protein:vir:3158 Length: 321 # 74.7 0.16 9.6E-05 25.0 15.1 283 1-405 1-311 (321) 168 protein:vir:108211 Length: 318 69.7 0.22 0.00014 24.2 13.8 296 1-404 1-318 (318) 169 protein:vir:80128 Length: 466 64.9 0.29 0.00018 23.5 11.2 285 1-405 144-449 (466) 170 protein:vir:79548 Length: 652 63.6 0.31 0.00019 23.3 10.0 280 1-403 357-652 (652) 171 protein:vir:78350 Length: 383 57.7 0.43 0.00027 22.6 11.8 293 1-405 76-376 (383) 172 protein:vir:106647 Length: 303 48.0 0.68 0.00042 21.5 18.7 289 1-405 1-300 (303) 173 protein:vir:9927 Length: 295 # 43.0 0.86 0.00054 20.9 19.1 286 5-405 1-289 (295) 174 protein:vir:9875 Length: 296 # 40.8 0.96 0.00059 20.7 15.1 284 1-403 3-296 (296) 175 protein:vir:100632 Length: 381 34.1 1.3 0.00081 19.9 10.1 292 1-405 65-369 (381) 176 protein:vir:1991 Length: 305 # 25.8 2 0.0012 18.9 14.7 225 3-299 1-305 (305) 177 protein:vir:95131 Length: 325 20.8 2.7 0.0017 18.2 14.8 283 1-402 6-325 (325) No 1 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=100.00 E-value=3e-185 Score=1032.24 Aligned_cols=397 Identities=51% Similarity=0.892 Sum_probs=384.8 Q ss_pred CccccCcCCCcc----cccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 2 PHIYNDPAAGDA----STVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 2 ~~~y~~~~~t~~----~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) -+.||+|+..+. ++++|||++|||.+|+|++|+|+|||++|||++|||||+|||||||||.||++++|||+||||| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 578999976554 4559999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) +|++++||||||||||||+||+|||+|+|+|||||||||+|+|++++|+|||+|+||||++.||++|+.|.+|+++|||+ T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) +|++++||+||+|+||+|++|.|||++++.++.+++.+ ++++||+++|||++++|++|||||||+||+||+|+|||+ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~---~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~ 237 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGS---TPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKV 237 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeecccccc---ccceechhHHHHHHHHHHhcccccchhhhhhhhccCccc Confidence 99999999999999999999999999999998887765 567999999999999999999999999999999999999 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) |++|||+||||||+|||++|+|.++||+||||||||++++||+||||+| +|||||++|+|+||+|+|+++++++++|+ T Consensus 238 i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i--~~vR~i~~p~~~~w~~ag~~a~~~~~~y~ 315 (401) T protein:vir:95 238 IGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSI--DKFRIIQVPEMLHWAGAGAQATGANPGYR 315 (401) T ss_pred cccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCcccccccccccc--CceeEEecccceeecCCcccccccccccc Confidence 9999999999999999999999999999999999999999999999999 78999999999999999999999999999 Q ss_pred cccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceE Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIA 397 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ma 397 (405) +++++++++|||||+||||+|||++|+|+|++ .+-||+||||+||+++||++|||||||||||||||++++||++||+ T Consensus 316 ~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g--~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~ 393 (401) T protein:vir:95 316 TSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDG--KSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLA 393 (401) T ss_pred cccccCCCcceeeeeeEEccccceecccccCC--ccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeE Confidence 99999999999999999999999999999853 3358999999999999999999999999999999999999999999 Q ss_pred EEEEecCC Q lcl|NC_020862. 398 VAYSVIPE 405 (405) Q Consensus 398 rie~~a~~ 405 (405) |||+|||= T Consensus 394 ~ies~a~~ 401 (401) T protein:vir:95 394 LIKTVAPL 401 (401) T ss_pred EEEeecCC Confidence 99999999 No 2 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.63 E-value=8.9e-18 Score=114.06 Aligned_cols=270 Identities=16% Similarity=0.123 Sum_probs=181.9 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+..++.++.-+.+.+|..-.+....+.++|.+++... .+.-..|++|+|-+|..+.++ ....||.+-.-.++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda-~~~~eg~~i~~~~l----- 74 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDA-TVVPEGQKIPVDKI----- 74 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCcc-ccccCCCccCcccc----- Confidence 12235788888889999988888888889999999764 466677999999999888554 44666654433222 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) +..+.+++++|+|..+++||++.... -.|+++++.+++...-+..-...+ T Consensus 75 -----------------------------t~~~~~a~i~~~~k~~~~tD~a~~~~-~~dp~~~~~~~~~~~~a~~~d~~~ 124 (276) T protein:vir:10 75 -----------------------------ETNRREAKIHKIGKGTDITDEALLSG-YGDPQGEAVRQHGLAIANKVDNDV 124 (276) T ss_pred -----------------------------ccceeeEEeehccccccccHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHH Confidence 33455799999999999999875554 447788766665544332211112 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ...+.++.. . .+ -..+|++.+.++...|..+... -++++|| T Consensus 125 ~~~l~~~~~----------~--~~--------~~~~t~d~i~~A~~~lgd~~~~-------------------~~~ivv~ 165 (276) T protein:vir:10 125 LEALRGTKL----------T--VS--------ADIGTLAGLEAAIDTFDDEDLE-------------------PMVLFIN 165 (276) T ss_pred HHHHhcccc----------c--cc--------ccccCHHHHHHHHHHhccccCc-------------------ccEEEEc Confidence 222222111 1 11 1248899999999999765442 2788999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |+....||.++ .+.|+..+++|+. .+.+|+||++-| +|+|.++.+- T Consensus 166 p~~~~~L~k~~----~~~f~~~s~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p--------------------------- 211 (276) T protein:vir:10 166 PKDAGKLRSSA----SDNFTRATELGDN-IIVKGAFGEALG--AVIVRSKKLD--------------------------- 211 (276) T ss_pred HHHHHHHHHhc----ccccccccccccc-ceeccccceecc--eeEEEcCCCC--------------------------- Confidence 99999998754 4789999999987 478999999965 8999987432 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .|-.+++|+.|++...-+ ++. +. .|| |+.-++=.+.-...|++.+.+++..+.+..++=- T Consensus 212 -~~t~~l~~~gAi~~~~~~--------~~~--vE------~dR-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 212 -EGEAILAKRGAVKLITKR--------DFF--LE------TDR-DPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred -cceEEEEeccceeeeecC--------Cce--ee------ccc-chhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 334468888888754221 122 11 122 5555544455556788888888888888654422 No 3 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.55 E-value=2.2e-16 Score=106.38 Aligned_cols=270 Identities=14% Similarity=0.127 Sum_probs=178.1 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCcCCCCEEEEEecccCCCCCCccccCCCccccccc Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYA 83 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~ 83 (405) .+| ..+.++..+.+.-|..-.+....+.+++++++.. ..+.-..|+||++.+|..+.++. ..++|-+-.-+++ T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~~~~l- 74 (274) T protein:vir:95 1 MAQ----GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKIPTDIL- 74 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCCccchhhc- Confidence 222 2568888888989998888877788999999854 34665679999999998775544 3455433222222 Q ss_pred CCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHH Q lcl|NC_020862. 84 GGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEIT 163 (405) Q Consensus 84 ~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~t 163 (405) +....++.|+|+|.-.++||++..... .++++++.+++...-+. T Consensus 75 ---------------------------------t~~~~~~~i~~~~~a~~i~D~~~~~~~-~d~~~~~~~~~~~~~a~-- 118 (274) T protein:vir:95 75 ---------------------------------ETKKREAKIRKIAKGTSISDEALLSGY-GDPQGEQVRQHGLAHAN-- 118 (274) T ss_pred ---------------------------------ccceeEEEeeeeecceeehHHHHhhcc-chHHHHHHHHHHHHHHH-- Confidence 334557899999999999998655543 36777766655543332 Q ss_pred HHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEE Q lcl|NC_020862. 164 EDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRI 243 (405) Q Consensus 164 ed~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv 243 (405) .+..++++... +++..+ + .+.++++.+.+|...|..+... -++ T Consensus 119 --~vd~~i~~~l~------~a~~~~--~--------~~~~~~d~i~~A~~~lgd~~~~-------------------~~~ 161 (274) T protein:vir:95 119 --KVDDDVLEALK------SAKLTV--E--------ADITKLTGLQTAIDKFNDEDLE-------------------PMV 161 (274) T ss_pred --HHHHHHHHHHh------cccccc--c--------ccccCHHHHHHHHHHhcccccc-------------------ccE Confidence 22333333211 011111 1 1347899999999999765432 378 Q ss_pred EEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccC Q lcl|NC_020862. 244 AYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAG 323 (405) Q Consensus 244 ~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g 323 (405) .+|||+....|+... ...|++.+++|+ ..+.+|+||++-| ||+|++.. T Consensus 162 ivv~p~~~~~L~k~~----~~~f~~~s~~g~-~~~~~G~ig~~~G--~~Vi~s~~------------------------- 209 (274) T protein:vir:95 162 LFISPLDAGKLRGDA----TTNFTRATELGD-DVIVKGAFGEALG--AVIVRSNK------------------------- 209 (274) T ss_pred EEeCHHHHHHHHhhc----cccccccccccc-cceeccccceecC--eEEEEeCC------------------------- Confidence 999999999997531 126999999997 5678999999965 89998752 Q ss_pred CcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEec Q lcl|NC_020862. 324 TDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVI 403 (405) Q Consensus 324 ~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a 403 (405) +++|-.+++|+.|++...-+ .+. +- .+| ||.-+.=.+--...|++.++|++..+++...+ T Consensus 210 ---~~~~t~~l~~~gA~~~~~~~--------~~~--vE------~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 210 ---LEAGTAILAKKGAVKLITKR--------DFF--LE------TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---CCCceEEEEeccceeeeecC--------Ccc--cc------ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 23556689999999864321 111 11 122 44443333444477899999999999888665 Q ss_pred CC Q lcl|NC_020862. 404 PE 405 (405) Q Consensus 404 ~~ 405 (405) =- T Consensus 270 ~~ 271 (274) T protein:vir:95 270 GS 271 (274) T ss_pred cc Confidence 44 No 4 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.55 E-value=2.2e-16 Score=106.38 Aligned_cols=270 Identities=14% Similarity=0.127 Sum_probs=178.1 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCcCCCCEEEEEecccCCCCCCccccCCCccccccc Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYA 83 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~ 83 (405) .+| ..+.++..+.+.-|..-.+....+.+++++++.. ..+.-..|+||++.+|..+.++. ..++|-+-.-+++ T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~~~~l- 74 (274) T protein:vir:96 1 MAQ----GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKIPTDIL- 74 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCCccchhhc- Confidence 222 2568888888989998888877788999999854 34665679999999998775544 3455433222222 Q ss_pred CCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHH Q lcl|NC_020862. 84 GGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEIT 163 (405) Q Consensus 84 ~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~t 163 (405) +....++.|+|+|.-.++||++..... .++++++.+++...-+. T Consensus 75 ---------------------------------t~~~~~~~i~~~~~a~~i~D~~~~~~~-~d~~~~~~~~~~~~~a~-- 118 (274) T protein:vir:96 75 ---------------------------------ETKKREAKIRKIAKGTSISDEALLSGY-GDPQGEQVRQHGLAHAN-- 118 (274) T ss_pred ---------------------------------ccceeEEEeeeeecceeehHHHHhhcc-chHHHHHHHHHHHHHHH-- Confidence 334557899999999999998655543 36777766655543332 Q ss_pred HHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEE Q lcl|NC_020862. 164 EDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRI 243 (405) Q Consensus 164 ed~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv 243 (405) .+..++++... +++..+ + .+.++++.+.+|...|..+... -++ T Consensus 119 --~vd~~i~~~l~------~a~~~~--~--------~~~~~~d~i~~A~~~lgd~~~~-------------------~~~ 161 (274) T protein:vir:96 119 --KVDDDVLEALK------SAKLTV--E--------ADITKLTGLQTAIDKFNDEDLE-------------------PMV 161 (274) T ss_pred --HHHHHHHHHHh------cccccc--c--------ccccCHHHHHHHHHHhcccccc-------------------ccE Confidence 22333333211 011111 1 1347899999999999765432 378 Q ss_pred EEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccC Q lcl|NC_020862. 244 AYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAG 323 (405) Q Consensus 244 ~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g 323 (405) .+|||+....|+... ...|++.+++|+ ..+.+|+||++-| ||+|++.. T Consensus 162 ivv~p~~~~~L~k~~----~~~f~~~s~~g~-~~~~~G~ig~~~G--~~Vi~s~~------------------------- 209 (274) T protein:vir:96 162 LFISPLDAGKLRGDA----TTNFTRATELGD-DVIVKGAFGEALG--AVIVRSNK------------------------- 209 (274) T ss_pred EEeCHHHHHHHHhhc----cccccccccccc-cceeccccceecC--eEEEEeCC------------------------- Confidence 999999999997531 126999999997 5678999999965 89998752 Q ss_pred CcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEec Q lcl|NC_020862. 324 TDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVI 403 (405) Q Consensus 324 ~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a 403 (405) +++|-.+++|+.|++...-+ .+. +- .+| ||.-+.=.+--...|++.++|++..+++...+ T Consensus 210 ---~~~~t~~l~~~gA~~~~~~~--------~~~--vE------~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 210 ---LEAGTAILAKKGAVKLITKR--------DFF--LE------TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---CCCceEEEEeccceeeeecC--------Ccc--cc------ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 23556689999999864321 111 11 122 44443333444477899999999999888665 Q ss_pred CC Q lcl|NC_020862. 404 PE 405 (405) Q Consensus 404 ~~ 405 (405) =- T Consensus 270 ~~ 271 (274) T protein:vir:96 270 GS 271 (274) T ss_pred cc Confidence 44 No 5 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.54 E-value=3.5e-16 Score=105.34 Aligned_cols=271 Identities=14% Similarity=0.138 Sum_probs=179.7 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+.+.+.++.-+.+..|..-.+....+.+++++++... .+..+.|+||++.+|..+.++ ..+.||.+-+.+++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda-~~~~eg~~i~~~~l----- 74 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDA-ADVAEGGEISLDKI----- 74 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccc-cccCCCCccChhhc----- Confidence 12336777777888888887767666789999998764 467777999999999877443 45677754433332 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) +..+.+++++|+|...++||++.. ....+++.++.+++...-+ ..+ T Consensus 75 -----------------------------t~~~~~~~i~~~~k~~~vtD~~~~-~~~~d~~~~~~~~~a~~~a----~~~ 120 (272) T protein:vir:36 75 -----------------------------GTTTKSVTIKKAAKGTEITDEAAL-SGYGDPIGESNKQLGLSLA----NKV 120 (272) T ss_pred -----------------------------CCcceeEeeehhhccccccHHHHh-hccchHHHHHHHHHHHHHH----HHH Confidence 334567899999999999997544 4556777776665544332 233 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ..+|++... |+ + .+....++++.|.++...|.....+ -++.+|| T Consensus 121 d~~i~~~l~-----~~-~-----------~~~~~~~~~d~i~~A~~~lgd~~~~-------------------~~~ivv~ 164 (272) T protein:vir:36 121 DDDLLSAAK-----TT-S-----------QTVSTKANVDGVQAALDIFNDEDAQ-------------------AYVLIVN 164 (272) T ss_pred HHHHHHHhc-----cc-c-----------ccccccccHHHHHHHHHHhhhcCCC-------------------ceEEEEc Confidence 344443221 11 0 1112457899999999888766553 3789999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |.....||. ++.|..+..++....+.+|+||++-| +|+|++..+- .| + T Consensus 165 p~~~~~L~k------~~~~~~~~~~~~~~~~~~G~ig~~~G--~~Vv~s~~~p----~~-----~--------------- 212 (272) T protein:vir:36 165 PKDAAKIRK------DANAKNIGSEVGANALINGTYADVLG--AQIVRSKKLA----EG-----S--------------- 212 (272) T ss_pred HHHHHHHhc------ccccccccccccccceeeeccceecC--eeEEEeCCCC----CC-----c--------------- Confidence 999999875 46788887787777889999999966 8999997542 11 1 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) .+|..+++|+.|++...-++ ++ +- . +| |+.-+.=..--...|++.+++++-.+.+...=- T Consensus 213 ~~~~~~~~~~gA~~~~~~~~--------~~--vE-~-----~R-~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 213 ALMFKIVSNSPALKLVLKRG--------VQ--VE-T-----DR-DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeEEEEEecccceeeeecCC--------cc--cc-c-----cc-chhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 27888899999998643221 11 11 1 11 333222222234668999999988877754333 No 6 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.52 E-value=6.6e-16 Score=103.82 Aligned_cols=270 Identities=14% Similarity=0.092 Sum_probs=175.5 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+..++.++.-+.+..|..-.+....+.+++++++... .++-..|++|++.+|..+.++.+ .++|-+-.-.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~-~~~g~~i~~~~------ 73 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQV-IAEGEKIPVDQ------ 73 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccc-cCCCCcCchhh------ Confidence 12334667777888899988888887889999998764 46767799999999985554433 33332211111 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) .+....+++++|+|.-.++||++... ...++++++.+.+...-+... |.. T Consensus 74 ----------------------------it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~~~a~~~-d~~ 123 (274) T protein:vir:96 74 ----------------------------IGTSKREAKVRKIGKGTELTDEAVLS-GFGDPQGEAVRQHGLAIANKV-DND 123 (274) T ss_pred ----------------------------cccceeEEEEEeeeceeeecHHHHHh-hcchHHHHHHHHHHHHHHHHH-HHH Confidence 13445678999999999999976444 555788776665554333322 222 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ....+.+++ ..+ ....++++.|.+|...|..+.-. -++.+|| T Consensus 124 i~~~l~~a~---------~~~----------~~~~~~~d~i~dA~~~l~d~~~~-------------------~~~ivv~ 165 (274) T protein:vir:96 124 VLEALKGAT---------LTV----------EADITKLDGLQTAIDKFNDEDLE-------------------PMVLFVN 165 (274) T ss_pred HHHHHhcCC---------CCc----------CcccccHHHHHHHHHHhcccCCC-------------------ceEEEeC Confidence 222233221 111 01357899999999999765432 2789999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |+....|+.+. ...|++..++|+. .+.+|.||++-| ||+|.++.+- T Consensus 166 p~~~~~L~k~~----~~~f~~~~~~g~~-~~~~g~ig~~~G--~~Vi~s~~~p--------------------------- 211 (274) T protein:vir:96 166 PLDAGGLRTSA----SDNFTRPTQLGDN-IIVKGAFGEALG--AVIVRSNKLN--------------------------- 211 (274) T ss_pred HHHHHHHHhcc----ccccccccccccc-ceeecccceecC--eeEEEcCCCC--------------------------- Confidence 99999997642 2469999999885 578999999965 8999887542 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+-.+++|+.|++...-+ ++. +. .+ -||.-+.=..--...|++.++|++..+.+..++-. T Consensus 212 -~~t~~l~~~gA~~~~~~~--------~~~--vE------~~-Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 212 -KGEALLAKKGAVKLITKR--------DFF--LE------KD-RDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) T ss_pred -cceEEEEeCcceeeeecC--------Ccc--cc------cc-cchhhcccEEEEeeEEEEEEEcCccEEEEEcCccc Confidence 223468999998864221 111 11 01 23332222222336799999999999998877666 No 7 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.51 E-value=1e-15 Score=102.79 Aligned_cols=270 Identities=14% Similarity=0.095 Sum_probs=178.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+ ...+.++..+.+..|..-.+....+.+++++++... .++-..|+||++.+|..+.++.+ .++|-+-.- T Consensus 1 ma--------~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~-~~~g~~i~~ 71 (274) T protein:vir:97 1 MP--------QGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGEKIPT 71 (274) T ss_pred CC--------ccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc-ccCCCcccc Confidence 32 236788888999999998888888889999999774 57777799999999987655443 343322111 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) ++ .+..+.+++++|+|.-.+++|+.... .-.+++++..+++.... T Consensus 72 ~~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~ 116 (274) T protein:vir:97 72 DI----------------------------------LETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAH 116 (274) T ss_pred cc----------------------------------cccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHH Confidence 11 13445678999999999999986444 44467777555554322 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) +. ..|.-....+.++. ..+ . ...++++.+.+|...|..+... T Consensus 117 a~-~vd~~~~~~l~~a~---------~~~--~--------~~~~~~d~i~dA~~~l~d~~~~------------------ 158 (274) T protein:vir:97 117 AN-KVDNDVLEALMGAK---------LTV--N--------ADITKLNGLQSAIDKFNDEDLE------------------ 158 (274) T ss_pred HH-HHHHHHHHHHhccC---------ccc--c--------ccccCHHHHHHHHHHhhccCCC------------------ Confidence 22 22222223333221 111 1 1247899999999999765432 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) -++.+|||.....|+... ...|++.+++|+. .+.+|.||++-| ||+++++.+- T Consensus 159 -~~~ivv~p~~~~~L~k~~----~~~f~~~s~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p------------------- 211 (274) T protein:vir:97 159 -PMVLFVNPLDAGKLRGDA----STNFTRATELGDD-IIVKGAFGEALG--AIIVRTNKLE------------------- 211 (274) T ss_pred -ceEEEeCHHHHHHHHhhh----hhhccccCccccc-ceeccccceecC--eeEEEcCCCC------------------- Confidence 278999999999997421 1379999999985 578999999966 8999987431 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mari 399 (405) +|-.+++|+.|++...-+ .+. +- .+| ||.-+.-...-...|++.++|++-.+.+ T Consensus 212 ---------~~t~~l~~~gA~~~~~~~--------~~~--vE------~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~ 265 (274) T protein:vir:97 212 ---------AGTAILAKKGAVKLILKR--------DFF--LE------VAR-DASTKTTALYSDKHYVAYLYDESKAVKI 265 (274) T ss_pred ---------cceEEEEeCcceEeeecC--------Cce--ec------ccc-chhhcccEEEEEEEEEEEEEcCCceEEE Confidence 344679999998863221 111 11 122 4444433344446789999999998887 Q ss_pred EE--ecCC Q lcl|NC_020862. 400 YS--VIPE 405 (405) Q Consensus 400 e~--~a~~ 405 (405) .- ++.| T Consensus 266 t~~~~~~~ 273 (274) T protein:vir:97 266 TKGSGSLE 273 (274) T ss_pred ecCccccc Confidence 64 4556 No 8 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.51 E-value=1e-15 Score=102.79 Aligned_cols=270 Identities=14% Similarity=0.095 Sum_probs=178.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+ ...+.++..+.+..|..-.+....+.+++++++... .++-..|+||++.+|..+.++.+ .++|-+-.- T Consensus 1 ma--------~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~-~~~g~~i~~ 71 (274) T protein:vir:94 1 MP--------QGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGEKIPT 71 (274) T ss_pred CC--------ccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc-ccCCCcccc Confidence 32 236788888999999998888888889999999774 57777799999999987655443 343322111 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) ++ .+..+.+++++|+|.-.+++|+.... .-.+++++..+++.... T Consensus 72 ~~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~ 116 (274) T protein:vir:94 72 DI----------------------------------LETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAH 116 (274) T ss_pred cc----------------------------------cccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHH Confidence 11 13445678999999999999986444 44467777555554322 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) +. ..|.-....+.++. ..+ . ...++++.+.+|...|..+... T Consensus 117 a~-~vd~~~~~~l~~a~---------~~~--~--------~~~~~~d~i~dA~~~l~d~~~~------------------ 158 (274) T protein:vir:94 117 AN-KVDNDVLEALMGAK---------LTV--N--------ADITKLNGLQSAIDKFNDEDLE------------------ 158 (274) T ss_pred HH-HHHHHHHHHHhccC---------ccc--c--------ccccCHHHHHHHHHHhhccCCC------------------ Confidence 22 22222223333221 111 1 1247899999999999765432 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) -++.+|||.....|+... ...|++.+++|+. .+.+|.||++-| ||+++++.+- T Consensus 159 -~~~ivv~p~~~~~L~k~~----~~~f~~~s~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p------------------- 211 (274) T protein:vir:94 159 -PMVLFVNPLDAGKLRGDA----STNFTRATELGDD-IIVKGAFGEALG--AIIVRTNKLE------------------- 211 (274) T ss_pred -ceEEEeCHHHHHHHHhhh----hhhccccCccccc-ceeccccceecC--eeEEEcCCCC------------------- Confidence 278999999999997421 1379999999985 578999999966 8999987431 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mari 399 (405) +|-.+++|+.|++...-+ .+. +- .+| ||.-+.-...-...|++.++|++-.+.+ T Consensus 212 ---------~~t~~l~~~gA~~~~~~~--------~~~--vE------~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~ 265 (274) T protein:vir:94 212 ---------AGTAILAKKGAVKLILKR--------DFF--LE------VAR-DASTKTTALYSDKHYVAYLYDESKAVKI 265 (274) T ss_pred ---------cceEEEEeCcceEeeecC--------Cce--ec------ccc-chhhcccEEEEEEEEEEEEEcCCceEEE Confidence 344679999998863221 111 11 122 4444433344446789999999998887 Q ss_pred EE--ecCC Q lcl|NC_020862. 400 YS--VIPE 405 (405) Q Consensus 400 e~--~a~~ 405 (405) .- ++.| T Consensus 266 t~~~~~~~ 273 (274) T protein:vir:94 266 TKGSGSLE 273 (274) T ss_pred ecCccccc Confidence 64 4556 No 9 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.51 E-value=7.9e-16 Score=103.39 Aligned_cols=270 Identities=15% Similarity=0.109 Sum_probs=177.0 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+..++.++..+.+..|..-.+....+.+++++++.. ..+.-+.|+||++.+|..+.++. ..++|-+-.-+++ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~~~~l----- 74 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGEKIPTDIL----- 74 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccc-cccCCCccchhhc----- Confidence 1223578888899999998888777778999999887 45666779999999998776544 3444433222222 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) +..+.+++|+|+|.-.++||+..... -.+++++..+++...-+. .+ T Consensus 75 -----------------------------t~~~~~~~i~~~~~~~~i~D~~~~~~-~~d~~~~~~~q~~~~~a~----~v 120 (274) T protein:vir:12 75 -----------------------------ETKKREAKIRKIAKGTSITDEALLSG-YGDPQGEQVRQHGLAHAN----KV 120 (274) T ss_pred -----------------------------ccceeeEEeeeecceeeecHHHHHhc-ccchHHHHHHHHHHHHHH----HH Confidence 33455789999999999999765444 446777766555443222 22 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ..++++... +++..+ + .+.++++.+.+|...|..+... -++.+|| T Consensus 121 d~~~l~~~~------~a~~~~--~--------~~a~~~d~i~dA~~~lgd~~~~-------------------~~~ivv~ 165 (274) T protein:vir:12 121 DNDVLEALM------GAKLTV--N--------ADITKLNGLQSAIDKFNDEDLE-------------------PMVLFIN 165 (274) T ss_pred HHHHHHHHh------cccccc--c--------ccccCHHHHHHHHHHhcccccc-------------------ccEEEeC Confidence 223332210 011111 1 1358899999999999765432 2789999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |.....|+... ...|++.++||+. .+.+|+||++-| +|+|++..+ T Consensus 166 p~~~~~L~k~~----~~~fv~~s~~g~~-~~~~G~ig~~~G--~~Vi~s~~~---------------------------- 210 (274) T protein:vir:12 166 PLDAGKLRGDA----STNFTRATELGDD-IIVKGAFGEALG--AIIVRSNKL---------------------------- 210 (274) T ss_pred HHHHHHHHhhh----hhhcccccccccc-ceecccceeecC--eeEEEeCCC---------------------------- Confidence 99999997531 1369999999984 578999999965 899998632 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEe--cCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSV--IPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~--a~~ 405 (405) ++|-..++|+.|++...-+ .+.+=. +| ||.-+.=..--...|++.++|+...+.+-.+ +.| T Consensus 211 p~~t~~l~~~gA~~~~~~~--------~~~vE~--------~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:12 211 EAGTAILAKKGAVKLILKR--------DFFLEV--------AR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) T ss_pred CcceEEEEeccceeeeecC--------Cceecc--------cc-chhhcccEEEeeeEEEEEEEcCCceEEEEcCCcccc Confidence 2345579999999864321 122111 22 4443332333346689999999999888754 455 No 10 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.51 E-value=9.5e-16 Score=102.93 Aligned_cols=268 Identities=15% Similarity=0.113 Sum_probs=175.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCcCCCCEEEEEecccCCCCCCccccC--CCc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPKHFGKELKVFYYVPLLDDLNVNDQG--LDA 77 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPKn~GktIkfrry~pl~~~~t~l~eG--vtp 77 (405) |+ ..++.++..+.+..|..-.+....+.+++.+++.. ..++-..|+||++.++..+.++.+ ..+| +++ T Consensus 1 ma--------~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~-~~eg~~i~~ 71 (274) T protein:vir:93 1 MP--------QGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGEKIPT 71 (274) T ss_pred CC--------ccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccc-ccCCCcccc Confidence 42 34678888889999998888888888999999976 467777899999999987765543 3333 222 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) . + .+..+.+++++|+|.-.++||++... +-.++++++.+.+.. T Consensus 72 ~--~----------------------------------it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~ 114 (274) T protein:vir:93 72 D--I----------------------------------LETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGL 114 (274) T ss_pred c--c----------------------------------cccceeEEEeeeecccccccHHHHHh-hccchHHHHHHHHHH Confidence 1 1 23445678999999999999985444 445777776655554 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) ..+. .+..++++.. .+++..+ + ...++++.+.+|...|..+... T Consensus 115 ~~a~----~~d~~~~~~~------~~a~~~~--~--------~~~~~~d~i~dA~~~l~d~~~~---------------- 158 (274) T protein:vir:93 115 AHAN----KVDNDVLEAL------MGAKLTV--N--------ADITKLNGLQSAIDKFNDEDLE---------------- 158 (274) T ss_pred HHHH----HHHHHHHHHH------hcccccc--c--------ccccCHHHHHHHHHHhhhccCC---------------- Confidence 3332 2223333221 0011111 0 1347899999999988765431 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) -++.+|||+....|+.-. ...|++.+++|+. .+.+|.||++-| ||+++++.+- T Consensus 159 ---~~~ivv~p~~~~~L~k~~----~~~f~~~s~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p----------------- 211 (274) T protein:vir:93 159 ---PMVLFINPLDAGKLRGDA----STNFTRATELGDD-IIVKGAFGEALG--AIIVRTNKLE----------------- 211 (274) T ss_pred ---ccEEEeCHHHHHHHHhhh----hhccccccccccc-ceeecccceecC--eeEEEcCCCC----------------- Confidence 268999999999997421 1369999999985 578999999966 8999987431 Q ss_pred cccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceE Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIA 397 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ma 397 (405) .|-.+++|+.|++...-+ ++.+-. + -||.-+.=..--..+|++.+++++-++ T Consensus 212 -----------~~t~~l~~~gai~~~~~~--------~~~vE~--------~-Rd~~~~~d~i~~~~~y~~~~~~~~~~v 263 (274) T protein:vir:93 212 -----------AGTAILAKKGAVKLILKR--------DFFLEV--------A-RDASTKTTALYSDKHYVAYLYDESKAV 263 (274) T ss_pred -----------cceEEEEeCCeEEEEecC--------Cccccc--------c-cchhhcccEEEEEEEEEEEEEcCCceE Confidence 234568999998864221 111111 1 133322222223367899999999988 Q ss_pred EEEE--ecCC Q lcl|NC_020862. 398 VAYS--VIPE 405 (405) Q Consensus 398 rie~--~a~~ 405 (405) .+.- ++.| T Consensus 264 ~~t~~~~s~~ 273 (274) T protein:vir:93 264 KITKGSGSLE 273 (274) T ss_pred EEeeCccccC Confidence 8764 3455 No 11 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.49 E-value=1.2e-15 Score=102.41 Aligned_cols=270 Identities=14% Similarity=0.097 Sum_probs=175.3 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCccccccc Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYA 83 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~ 83 (405) -+ ..+++.++.-+++..|..-.+....+.++|++++... .+.-..|+||++.+|..+.++. ...+|-+-.-+++ T Consensus 1 ~~---~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i~~~~l- 75 (275) T protein:vir:96 1 MA---LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAK-VVPEGEEIPIDLI- 75 (275) T ss_pred CC---CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccc-cccCCCCcchhhc- Confidence 12 2335777777788888888888887889999998654 3555669999999998775544 4455543322222 Q ss_pred CCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHH Q lcl|NC_020862. 84 GGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEIT 163 (405) Q Consensus 84 ~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~t 163 (405) +..+.++.++|+|.-+++||++.... -.|++++..+++...-+. . T Consensus 76 ---------------------------------t~~~~~~~i~~~~~~~~i~D~~~~~~-~~d~~~~~~~~~a~~~a~-~ 120 (275) T protein:vir:96 76 ---------------------------------ETKKRQATIRKIGKGTVLTDEALLSG-YGDPKGEAVRQHGLAIAN-K 120 (275) T ss_pred ---------------------------------ccceeeEEeehhcccccccHHHHHhh-ccchHHHHHHHHHHHHHH-H Confidence 34456799999999999999864444 347777766665543322 2 Q ss_pred HHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEE Q lcl|NC_020862. 164 EDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRI 243 (405) Q Consensus 164 ed~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv 243 (405) .|.-....++++ +..+ . ...++++.+.++...|..+... -++ T Consensus 121 ~d~~ll~~l~~a---------~~~~--~--------~~~~~~d~i~dA~~~lgd~~~~-------------------~~~ 162 (275) T protein:vir:96 121 VDNDVLEALQGA---------TLKV--E--------ADITKLAGLQTAIDKFNDEDLE-------------------PMV 162 (275) T ss_pred HHHHHHHHHhcc---------cccc--c--------ccccCHHHHHHHHHHhccccCC-------------------ccE Confidence 222222223322 1111 0 1357899999999999654321 378 Q ss_pred EEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccC Q lcl|NC_020862. 244 AYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAG 323 (405) Q Consensus 244 ~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g 323 (405) .+|||+....||.+. ...|++..++|+. .+.+|+||++-| +|+|++..+- T Consensus 163 ivv~p~~~~~L~k~~----~~~f~~~~~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p----------------------- 212 (275) T protein:vir:96 163 LFVNPLDAGKLRASA----TDNFTRATLLGDN-VIVKGAFGEALG--AIIVRSNKIK----------------------- 212 (275) T ss_pred EEeCHHHHHHHHhcc----ccccccccccccc-ceeccccceecC--eeEEEeCCCC----------------------- Confidence 999999999998764 2469999999986 478999999965 8999887432 Q ss_pred CcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEec Q lcl|NC_020862. 324 TDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVI 403 (405) Q Consensus 324 ~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a 403 (405) ++-.+++|+.|++...-+ .+.+ - .+ -||.-+.=.+--...|++.+++++-.+.+.. . T Consensus 213 -----~~t~~i~~~gA~~~~~~~--------~~~v--E------~~-Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~-~ 269 (275) T protein:vir:96 213 -----EGEAILAKRGAVKLITKR--------DFFL--E------TE-RHASHKSTALFSDKHYVAYLYDESKVVKITK-S 269 (275) T ss_pred -----cceEEEEeccceeeeecC--------Cccc--c------cc-cchhhcCcEEEEeEEEEEEEEcCccEEEEEe-c Confidence 334578899888863211 1111 1 12 2444443334444678888889888888744 3 Q ss_pred CC Q lcl|NC_020862. 404 PE 405 (405) Q Consensus 404 ~~ 405 (405) |= T Consensus 270 ~~ 271 (275) T protein:vir:96 270 AS 271 (275) T ss_pred cc Confidence 33 No 12 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.46 E-value=1.3e-14 Score=96.80 Aligned_cols=325 Identities=13% Similarity=0.069 Sum_probs=174.6 Q ss_pred ccccCcCCC---cccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcc--ccCCCc Q lcl|NC_020862. 3 HIYNDPAAG---DASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVN--DQGLDA 77 (405) Q Consensus 3 ~~y~~~~~t---~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l--~eGvtp 77 (405) |-|.|.-++ +++. ..+..+.-|....|..-++.+++..+...++.--.+|+||+|.+.- +.+.. ++|-+- T Consensus 1 ~~~~~~~~~~~~~t~~-v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g----~~~~~d~~~~~~i 75 (341) T protein:vir:94 1 MALGNTITGPSINTQR-GQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS----ELGVEDKATDVPV 75 (341) T ss_pred Ccchhhhccccccchh-HHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC----cceeeeecCCCcc Confidence 666665222 2333 3355677899999888888999998876665544569999999732 22221 111111 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) ..+. .+..+++.+|.|+ .+=..++|+.. ....-++..++..+.. T Consensus 76 ~~~~----------------------------------~~~~~~~itiD~~~~~~~~i~d~d~-~~~~~d~~~~~~~~~~ 120 (341) T protein:vir:94 76 GVQP----------------------------------VNDTDFVITVDTDRTTAVALDDLLE-IQASYDLRAPYLEAMG 120 (341) T ss_pred cccc----------------------------------ccCceEEEEEeeeeecceeechHHH-HhhccchHHHHHHHHH Confidence 1111 1233566777554 44456777533 3344467777777666 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) ..-++..+..+-. +++++... ..+...+..+.... . ....++++.|..+.+.|+++..|. T Consensus 121 ~aLA~~~D~~i~~-~~a~~~~~-~~~~~~~~~~~~~t---~-~~~~~~~~~i~~a~~~Lde~~VP~-------------- 180 (341) T protein:vir:94 121 YALAKDMTGSILG-LRAAVQNT-ASQNVFSSSNGAIT---G-NGQAFSFAVFLAARRLLLEADVPE-------------- 180 (341) T ss_pred HHHHHHHHHHHHH-Hhhhcccc-ccCccccCcccccc---C-chhhhhHHHHHHHHHHHhhcCCCc-------------- Confidence 5555544433333 33322211 11111111111110 0 113478899999999999999983 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc-----C Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT-----A 311 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~-----~ 311 (405) .-++++|+|+...+|+. ++.|......++ ..+.+|+||++.| |.+++++.+-.-.+.+.... . T Consensus 181 ---~gR~lvv~P~~~~~Ll~------~~~~~~~~~~g~-~~l~~G~ig~i~G--~~V~~Sn~lp~~~~~~~~~~~~~~~~ 248 (341) T protein:vir:94 181 ---EKIVLLISPGQESALFT------IPQFISKDFINN-APIAQGQIGSLMG--VRVIRTSLIGNNSATGWRNGAPTIAP 248 (341) T ss_pred ---cCCEEEeCHHHHHHHhh------chhhhhhhcccc-chhheeeeeeEec--eEEEEeccccccccccccccccceec Confidence 23889999999999964 578999865554 5688999999977 89999987643221110000 0 Q ss_pred CCcccccccccCCcceeeeEEEEEc----cccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVG----DQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYG 387 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G----~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~ 387 (405) ....-+.+......+++.+--.+.| ++|=+.+-+.- ..-+..++-++-. +-..=++--|.-++=-|..|| T Consensus 249 ~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~-----~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~G 322 (341) T protein:vir:94 249 AEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCH-----MDWAAAVVSKAPR-VTQSFENREQVWLMVGRQAYG 322 (341) T ss_pred ccccccccccccccccccccccEEEEEEecccccceeeec-----chhhhcccccccc-ccccchhhhhhhhhhhhhhhc Confidence 0011111111112222222222222 22222221110 0011111222111 112223334444444588999 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.+||++..+.|++.++- T Consensus 323 ~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 323 ARLYRPLHAVNIHTTGDT 340 (341) T ss_pred ccccCcceeEEEecCcCC Confidence 999999999888887777 No 13 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.44 E-value=1.2e-14 Score=96.82 Aligned_cols=269 Identities=12% Similarity=0.116 Sum_probs=173.9 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+.++++++.-+.+..|....+....+.+++.+++... .++...|++|++.++.....+.. ..||-+.+-.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~-v~eg~~i~~~~------ 73 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAED-VAEGEAIPMTQ------ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCccc-ccCCCcccccc------ Confidence 13334667777888888887766666678999988763 45666799999999876655433 34443222111 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) .+...++.++++++.+.++||++ ..++..+++.++...+...-+...+.. T Consensus 74 ----------------------------~~~~~~~~~~~~~~~~~~itd~~-~~~s~~d~~~~~~~~~~~~~a~~~d~~- 123 (272) T protein:vir:30 74 ----------------------------LGFKKTTMTIKKAGKGVEITDEA-ILSGYGDPVGQAAKQIVEAIDHKVDAD- 123 (272) T ss_pred ----------------------------cccceEEEEeeeeeeeeeecHHH-HhhccccHHHHHHHHHHHHHHHHHHHH- Confidence 24556789999999999999985 455667888887776665444322222 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ....+.++. .. .....+++.|.++...|..+..+ -.+.+|| T Consensus 124 i~~~~~~a~---------~~-----------~~~~~t~d~i~da~~~l~~~~~~-------------------~~~~vv~ 164 (272) T protein:vir:30 124 VLDALSKST---------QT-----------VEATATVDGVSKALDIFNDEDDA-------------------ETVIVMN 164 (272) T ss_pred HHHHhcccc---------cc-----------cccccCHHHHHHHHHHHhccCCC-------------------ccEEEEc Confidence 222222221 10 12346789999998888766543 2578999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |+....|+... .+.|....+++.. .+.+|.||++.| +|+|+++.|- T Consensus 165 p~~~~~L~k~~----~~~~~~~~~~~~~-~~~~g~ig~i~G--~~Vi~s~~~p--------------------------- 210 (272) T protein:vir:30 165 PADASTLRLDA----AKEWLGATEVGAN-RVVSGVYGEVLG--VQIVRSRKCP--------------------------- 210 (272) T ss_pred HHHHHHHHHhc----ccccccccccccc-ccccccchhhcC--eeEEEcCCCC--------------------------- Confidence 99999998653 3568888888876 467999999966 8999998652 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+-++++++.+++...-+ .+++-.. -|+..++=..--..+|++.+++++.++.+...+-= T Consensus 211 -~~t~~~~~~~a~~~~~~~--------~~~ve~~---------r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 211 -KGTAYMVRKGALRIMLKR--------NTMVETD---------RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -cceEEEEcCCeEEEEecC--------Cceeeec---------cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 122467888888765421 1121111 23332222222235688889999999888776443 No 14 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.44 E-value=1.2e-14 Score=96.82 Aligned_cols=269 Identities=12% Similarity=0.116 Sum_probs=173.9 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+.++++++.-+.+..|....+....+.+++.+++... .++...|++|++.++.....+.. ..||-+.+-.+ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~-v~eg~~i~~~~------ 73 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAED-VAEGEAIPMTQ------ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCccc-ccCCCcccccc------ Confidence 13334667777888888887766666678999988763 45666799999999876655433 34443222111 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) .+...++.++++++.+.++||++ ..++..+++.++...+...-+...+.. T Consensus 74 ----------------------------~~~~~~~~~~~~~~~~~~itd~~-~~~s~~d~~~~~~~~~~~~~a~~~d~~- 123 (272) T protein:vir:98 74 ----------------------------LGFKKTTMTIKKAGKGVEITDEA-ILSGYGDPVGQAAKQIVEAIDHKVDAD- 123 (272) T ss_pred ----------------------------cccceEEEEeeeeeeeeeecHHH-HhhccccHHHHHHHHHHHHHHHHHHHH- Confidence 24556789999999999999985 455667888887776665444322222 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ....+.++. .. .....+++.|.++...|..+..+ -.+.+|| T Consensus 124 i~~~~~~a~---------~~-----------~~~~~t~d~i~da~~~l~~~~~~-------------------~~~~vv~ 164 (272) T protein:vir:98 124 VLDALSKST---------QT-----------VEATATVDGVSKALDIFNDEDDA-------------------ETVIVMN 164 (272) T ss_pred HHHHhcccc---------cc-----------cccccCHHHHHHHHHHHhccCCC-------------------ccEEEEc Confidence 222222221 10 12346789999998888766543 2578999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |+....|+... .+.|....+++.. .+.+|.||++.| +|+|+++.|- T Consensus 165 p~~~~~L~k~~----~~~~~~~~~~~~~-~~~~g~ig~i~G--~~Vi~s~~~p--------------------------- 210 (272) T protein:vir:98 165 PADASTLRLDA----AKEWLGATEVGAN-RVVSGVYGEVLG--VQIVRSRKCP--------------------------- 210 (272) T ss_pred HHHHHHHHHhc----ccccccccccccc-ccccccchhhcC--eeEEEcCCCC--------------------------- Confidence 99999998653 3568888888876 467999999966 8999998652 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+-++++++.+++...-+ .+++-.. -|+..++=..--..+|++.+++++.++.+...+-= T Consensus 211 -~~t~~~~~~~a~~~~~~~--------~~~ve~~---------r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 211 -KGTAYMVRKGALRIMLKR--------NTMVETD---------RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred -cceEEEEcCCeEEEEecC--------Cceeeec---------cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 122467888888765421 1121111 23332222222235688889999999888776443 No 15 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.42 E-value=9.2e-15 Score=97.55 Aligned_cols=277 Identities=11% Similarity=0.087 Sum_probs=170.6 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhcccc-ccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLAD-NKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~-~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) =+..+++++..+.+..|..-.+....+.+++.+++. ...++-..|++|++.+|..+.++ ..+.+|-+..-+++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a-~~~~~g~~i~~~~l----- 74 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDA-QDVAEGAAIDYSAL----- 74 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcc-eeecCCCcCccccc----- Confidence 122356778888898999988888888999999985 44566667999999999877543 33544433221122 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l 167 (405) +....++.|+|+|.-.+++|++... +..++++++..++....+...+ .. T Consensus 75 -----------------------------t~~~~~~~i~~~~~a~~v~D~~~~~-~~~d~~~~~~~~~a~~~a~~~d-~~ 123 (278) T protein:vir:80 75 -----------------------------ETESVKHGIKKAGKGVKLTDESVLS-GYGDPVEEAQKQIRMAIASKVD-ND 123 (278) T ss_pred -----------------------------ccceeeEeeehhhccccccHHHHhh-ccccHHHHHHHHHHHHHHHHHH-HH Confidence 3445678999999999999975443 4457777766655543333322 22 Q ss_pred HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEc Q lcl|NC_020862. 168 QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIG 247 (405) Q Consensus 168 ~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h 247 (405) ....++++... ..++.+. . .. .-.++.+-.+...|.....| ..++++|| T Consensus 124 l~~~l~~a~~~-~~~~~t~----~------~~--~~~~~~~~da~~~l~~~~~~------------------~~~~ivv~ 172 (278) T protein:vir:80 124 ILEEALTTTLE-VKGAINI----G------LI--DKIENTFTDAPDAIEDESIT------------------TTGVLFLN 172 (278) T ss_pred HHHHHhccccc-ccccccc----c------hh--hhHHHHHHHHHHhhcccCCC------------------cccEEEEC Confidence 22333333211 1111000 0 00 01234444444445444333 34578999 Q ss_pred ccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 248 SELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 248 ~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |.....|+... ...|++..+||+. .+.+|+||++-| ||+++++.+- T Consensus 173 p~~~~~L~k~~----~~~~~~~~~~g~~-~~~~G~ig~~~G--~~Vi~s~~~p--------------------------- 218 (278) T protein:vir:80 173 YKDTAKLREEA----AGSWTKASQLGDD-LLVKGAFGELLG--WEIVRTKKLA--------------------------- 218 (278) T ss_pred HHHHHHHHhhh----hhhcccccccccc-ceeeccceeecc--eeEEEcCCCC--------------------------- Confidence 99999997542 2469999999987 467999999966 8999998652 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+-..++++.|++...-+ .+. +. .+ -||.-+.=.+--...|++.++|++..+.|=.+|-. T Consensus 219 -~~t~~l~~~gAi~~~~~~--------~~~--vE------~~-Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 219 -DGNALAVKAGALKTFLKR--------NLL--AE------SG-RDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred -cceEEEEeccceeeeecC--------Ccc--cc------cc-cchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 123467889888864322 111 11 12 23433222223346689999999999888777766 No 16 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=99.30 E-value=8.8e-13 Score=86.67 Aligned_cols=313 Identities=12% Similarity=0.103 Sum_probs=182.3 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhc-ccc---------ccccCcCCCCEEEEEecccCCCCCCc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSP-LAD---------NKQMPKHFGKELKVFYYVPLLDDLNV 70 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~-fA~---------~~~mPKn~GktIkfrry~pl~~~~t~ 70 (405) |. -|....-+|+... .|.+++.-++...-.|.+ |-. ...+-|+.|.+|.|.=-.+|.-. | T Consensus 1 Ma-------~T~~~~~~p~a~~-~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~--g 70 (364) T protein:vir:93 1 MS-------QTVIPFGDPKAVK-RWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGK--P 70 (364) T ss_pred Cc-------eeccCcCCHHHHH-HHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccC--C Confidence 32 2223333566554 678888877777655544 433 23577888999998855555322 2 Q ss_pred cccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhcc--chH Q lcl|NC_020862. 71 NDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDS--DLY 148 (405) Q Consensus 71 l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~--~l~ 148 (405) ..++-+.+| |-..+.+.+-++.+.|...=+..... +.+.-+ +|. T Consensus 71 v~Gd~~leG--------------------------------nee~L~~~~~~i~idq~r~~V~~~g~--ms~qRt~~dlr 116 (364) T protein:vir:93 71 TYGDARVEG--------------------------------KEESLRFYQDEVRIDQVRHSVSAGGR--MSRKRTVHNIR 116 (364) T ss_pred cccCceeec--------------------------------cccceeEEeeEEEEeeccccccccCc--hhhhhhHHHHH Confidence 222222222 11123344444555555544443321 111111 233 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHhccC--------------------------ceEEecCCCccceeeecccccccCCce Q lcl|NC_020862. 149 GHLSREMLRGANEITEDLLQADILASA--------------------------DVKVFTGAATSMVTMTGEAADAEDDGL 202 (405) Q Consensus 149 ~~~~~ell~~~~~~ted~l~~~ilag~--------------------------~~v~yag~ats~~~~t~~~~~~~~n~~ 202 (405) + ..+..|..-.....|++.-..|+|+ +.++|+|.+|+..+++.. +. T Consensus 117 ~-~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~st-------D~ 188 (364) T protein:vir:93 117 R-IARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAAT-------DI 188 (364) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccccccccCCCCCCcEEeccccCchhhcccc-------cc Confidence 3 3566666666667777766666653 456777778887777643 77 Q ss_pred ecHHHHHHHHHHHHhccCc-----cccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCC--- Q lcl|NC_020862. 203 ITLKDLKRLSITLTDNYTP-----KKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYAD--- 274 (405) Q Consensus 203 it~~~lr~~~~~Lk~nrAp-----k~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~--- 274 (405) +|++.|+++...++..+++ ++..+--++ + .+||+|+||.-..|||--+ +|.|...++|+. T Consensus 189 ~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g-----~---~~yV~~l~p~q~~~Lr~~t----~~~w~d~qk~A~~~~ 256 (364) T protein:vir:93 189 MAPLVIEKAVEKAAMMQAENPDVANMVPVSIDG-----D---DHYVCVMSEYQATDMRTAA----GGTWIDFQKAAAAAE 256 (364) T ss_pred ccHHHHHHHHHHHHHhCCCCCCCcccceeEecC-----c---ceeEEEEcchhhhhhhhcC----CHHHHHHHHHhhhcc Confidence 9999999999999887643 233332222 2 4899999999999998433 488999999864 Q ss_pred --cccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCC Q lcl|NC_020862. 275 --AATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKG 352 (405) Q Consensus 275 --~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g 352 (405) .-|||.||+|.+ .|+.+++.+++-.+-+-|++ +.+.|-=-|.+|.+|.+.-=-+ .+ T Consensus 257 g~~nPlF~G~~gm~--ngvii~~~~~vi~~~~~~~~----------------~~v~~~ralllGaQA~~~a~g~----~~ 314 (364) T protein:vir:93 257 GRNNPIFKGGLGMI--NNVVLHKHRNVIRFNDYGAG----------------ANVEAARALFMGRQAGVIAYGT----AN 314 (364) T ss_pred cccCCceecCeeeE--cCeEEeccCCcccccccccC----------------ccccchhhheecceeeEEEeec----CC Confidence 456999999999 66788888877655433221 1223444689999996644322 11 Q ss_pred CCCceEEEecCCCCCCCCCCccchhhhHHHH-HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 353 KSKFRIIVKKPGEATADRNDPYGKVGFSSIK-FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 353 ~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK-~~~~~~iL~~~~marie~~a~~ 405 (405) -..+.-+-+.-.+ ++.--++....+||| .=|-. .|-=...|-++|+= T Consensus 315 g~~~~w~Ee~~D~---gn~~~i~~~~i~G~kK~rF~~---~DfGvi~idtaa~~ 362 (364) T protein:vir:93 315 GLRFDWEETVKDY---GNEPAIAAGFIAGMKKARFNN---KDFGVISIDTAAKK 362 (364) T ss_pred CCCceeeecccCC---CCchhhhhhhHhhhhhcccCC---ccceEEEecccccc Confidence 2456666665432 222225555566665 22211 13334456666666 No 17 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.20 E-value=1.9e-12 Score=84.89 Aligned_cols=322 Identities=13% Similarity=0.038 Sum_probs=159.5 Q ss_pred CCccccCc----CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDP----AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~----~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) |-++=.+- ++-++++ ...+.+.-|....+..-++.+++.++...+......|+||+|.+.-. +. ..-.++|-+ T Consensus 1 ~~~~~~~~~~~~~~~~~t~-~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~-~~-a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSN-VQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISR-AA-VYDKQPQTP 77 (381) T ss_pred CceecccccccCcccchhh-HHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCc-ce-eeeecCCCc Confidence 55554321 1111111 13355567888888888889999999888788778899999986432 11 111333322 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeee-EEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFF-MEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~-~e~Td~~~~~d~d~~l~~~~~~el 155 (405) ...+.+ +..+++.+|.|+=.+ ..++|+. .....-++..++..++ T Consensus 78 i~~~~~----------------------------------~~~~~~itID~~~~~~~~Idd~D-~~~~~~D~~~~~~~~~ 122 (381) T protein:vir:80 78 VNLQAR----------------------------------TDSEFTFTVTKYKESSFMIEDIV-NTQASYTLRQYYTKEA 122 (381) T ss_pred cccccc----------------------------------CCceEEEEEeeeeecceeechHH-HHhhccChHHHHHHHH Confidence 222221 223456777665444 5666643 3333446667766666 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccc---cCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADA---EDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~---~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) ...-+... |.....+++...........+....+.+....+ -....++++.|.++.+.|+.++.|. T Consensus 123 ~~aLA~~~-D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~---------- 191 (381) T protein:vir:80 123 GYALARDM-DNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQ---------- 191 (381) T ss_pred HHHHHHHH-HHHHHHHHhhcccccccccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCc---------- Confidence 65444433 332222332222111111111111111111000 0113468899999999999999983 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) .-++++|+|+...+|+. ++.|... +|++.+.+.+|+||++.| |++++++.+-.-...+-...+. T Consensus 192 -------egR~lvv~P~~~~~Ll~------~~~~~~a-d~~~~~~l~~G~Ig~i~G--~~Vv~Sn~lp~~~~t~~~~~ag 255 (381) T protein:vir:80 192 -------EGRIVMVSPAQYIDLLS------INQFISV-DFSQVKPVTSGVVGTILG--MEVIVTTQIGINSLTGYVNGQG 255 (381) T ss_pred -------CCcEEEeCHHHHHHHhh------chhhhhh-hhccchhhhceeeeEEcc--eEEEeecccccccccceeeecc Confidence 23789999999999974 4678887 588888899999999977 8999998764311111000000 Q ss_pred CcccccccccCCcceeeeEEEEEccccce---eecceeccCCCCCCceEEEecCCCCCC--CCCCccchhhhHH----HH Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFA---TIGLQGMSGKGKSKFRIIVKKPGEATA--DRNDPYGKVGFSS----IK 383 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg---~i~l~g~~~~g~~~~~~ivk~pG~~ta--d~~DPlgQrg~~g----wK 383 (405) .+.... ....+++ ..|+.+|. .-.++.++ +++....+|-.+. -.+++=.+-+++| .+ T Consensus 256 ap~~~~-~~~~~~~-------~~g~~s~~a~av~~~k~yd------~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 321 (381) T protein:vir:80 256 APTQPT-PGVLGSP-------YLPDQAGTANVVNTGSASD------LAVSLSYFGLPVFSGAGATAADGGQTLGSFGGAN 321 (381) T ss_pred cccccc-ccccccc-------cccccccceeeeeeeeeec------eeeeeeeccceeeecceeeecCCCceeeeehhhh Confidence 000100 0111111 12222221 11111111 1111111111100 0122222222222 11 Q ss_pred HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 384 FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 384 ~~~~~~iL~~~~marie~~a~~ 405 (405) -|-++.+-...| |.+.|| T Consensus 322 ~~~~~~~~~~~~----~~~~~~ 339 (381) T protein:vir:80 322 RWATAVVCHPDW----LAVGVQ 339 (381) T ss_pred hhhhhccccccc----ccccce Confidence 111455545555 456677 No 18 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.19 E-value=5.1e-13 Score=87.97 Aligned_cols=231 Identities=12% Similarity=0.098 Sum_probs=148.9 Q ss_pred ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEee Q lcl|NC_020862. 47 QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLT 126 (405) Q Consensus 47 ~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~ 126 (405) +=--|.|+||.|-.| .-+...+.||..-+-.++ +..+.+++|+ T Consensus 1 ~~~~~~Gdtit~P~~---iGda~~v~eG~~i~~~~l----------------------------------~~t~~~atIk 43 (231) T protein:vir:73 1 ENGINLANLCEYPND---IGDAADVAEGGEISLDKI----------------------------------GTTTKSVTIK 43 (231) T ss_pred CccccCCceEEeccc---ccchhhhcCCCcCChhhc----------------------------------cccceeeeEe Confidence 334578999999988 345577888876554443 3445679999 Q ss_pred eeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHH Q lcl|NC_020862. 127 EYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLK 206 (405) Q Consensus 127 qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~ 206 (405) |+|.-+++||++.+.. --|++++.++.+.... -+.+..|+++... .++ -+.++.+|++ T Consensus 44 ~~gk~~~itD~a~l~~-~gDp~~ea~~Q~~~~i----A~kvD~di~~~~~------~a~-----------l~~~~~~t~d 101 (231) T protein:vir:73 44 KAAKGTEITDEAALSG-YGDPIGESNKQLGLSL----ANKVDDDLLKAAK------TTS-----------QTVSTKANVD 101 (231) T ss_pred eeccceeeeHHHHhhc-cCchHHHHHHHHHHHH----HHhhhHHHHHhhc------ccc-----------ccccccccHH Confidence 9999999999876554 3466777555544332 2334445554322 011 1123569999 Q ss_pred HHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEe Q lcl|NC_020862. 207 DLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAI 286 (405) Q Consensus 207 ~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi 286 (405) .|.++...|..+... -+++||||.....||.. +.|.....++...-+++|+||++ T Consensus 102 ~i~~A~~~fgde~~~-------------------~~vivv~p~~~~~Lrk~------~~~~~~~~~~g~~i~~~G~iG~i 156 (231) T protein:vir:73 102 GVQAALDIFNDEDAQ-------------------AYVLIVNPKDAAKIRKD------ANAKNIGSEVGANALINGTYADV 156 (231) T ss_pred HHHHHHHHhcccccc-------------------ceEEEEcchHHHhhhhc------cchhhhhhhhccceeeecccceE Confidence 999999999776543 38999999988888764 44666554555556899999999 Q ss_pred cCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCC Q lcl|NC_020862. 287 PGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEA 366 (405) Q Consensus 287 ~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~ 366 (405) -| +|++.+..+-. |. -++..+++.+.|.+...=++ +. +- T Consensus 157 ~G--~~Vi~S~~~~~----~~--------------------~~~~~~i~~~gAl~~~~k~~--------~~--vE----- 195 (231) T protein:vir:73 157 LG--AQIVRSKKLAE----GS--------------------ALMFKIVSNSPALKLVLKRG--------VQ--VE----- 195 (231) T ss_pred cc--eEEEEcCCCCC----Cc--------------------eeeeeEEeeccceeeeeccc--------ce--ee----- Confidence 76 79999976541 11 02333455555555433221 11 11 Q ss_pred CCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 367 TADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 367 tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) .+| ||.-+.=.+--..+|++.+.|+.-.+.|-..-- T Consensus 196 -tdR-d~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 196 -TDR-DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred -ccc-cccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 133 565555555556778899999988888765444 No 19 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.15 E-value=3.9e-11 Score=77.67 Aligned_cols=340 Identities=13% Similarity=0.095 Sum_probs=172.4 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhh-hhhhcccc-------------------------ccccCcCCCC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLAD-------------------------NKQMPKHFGK 54 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~-------------------------~~~mPKn~Gk 54 (405) |+-.+-+. +.-+|+ ....|.+.+..++.+. -.+.+|.- ..++-|+.|. T Consensus 1 ~~~a~T~~-----~~~~p~-a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD 74 (430) T protein:vir:10 1 MTASKTTM-----RYGDPN-AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGD 74 (430) T ss_pred Ccceeeec-----ccCChh-HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCcc Confidence 54332221 222333 3345666655555443 22233322 3457789999 Q ss_pred EEEEEecccCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEe Q lcl|NC_020862. 55 ELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEY 134 (405) Q Consensus 55 tIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~ 134 (405) +|.|-=-.+|.-.. ...+-+.+|.+ ..+.+.+-.+.|.|...=+-. T Consensus 75 ~Vtf~L~~~L~g~g--v~Gd~~lEGne--------------------------------e~L~~~~d~l~IDq~R~~V~~ 120 (430) T protein:vir:10 75 EVRFHFVQPANAFP--IMGSEYAEGKG--------------------------------TGLKIGSDQLRVNQARFPVDL 120 (430) T ss_pred EEEEeEeeccccCc--eecCceeeccc--------------------------------cceEEEeeEEEEeeecccccc Confidence 99887555553322 22222222211 112333333444444333222 Q ss_pred cchhhhhhhc--cchHHHHHHHHHHHHhhHHHHHHHHHHhcc-------------------------------C-c-eEE Q lcl|NC_020862. 135 TEDSLMFDTD--SDLYGHLSREMLRGANEITEDLLQADILAS-------------------------------A-D-VKV 179 (405) Q Consensus 135 Td~~~~~d~d--~~l~~~~~~ell~~~~~~ted~l~~~ilag-------------------------------~-~-~v~ 179 (405) .. -+.+.- -+|.++ .+..|..=..-..|++.-..|+| + + .++ T Consensus 121 gg--~msqQRt~~dlR~~-ar~~L~~w~~~~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~ 197 (430) T protein:vir:10 121 GD--VMSQIRNPYDLRRL-GRPKAKWFMDAYLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFV 197 (430) T ss_pred CC--chhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEe Confidence 21 111111 122222 33333333333344333333332 2 2 455 Q ss_pred ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhc Q lcl|NC_020862. 180 FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVD 259 (405) Q Consensus 180 yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d 259 (405) ++|.+++...+.+....-...++.+++.|.++...++.++-|.+-=.|.|..+-+.+|. ||+||||....|||. T Consensus 198 ~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~---yV~~~~p~q~~~Lr~--- 271 (430) T protein:vir:10 198 ASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELPPPPVKFEGDEAAEDSPI---RVLLCSPAQYNSFAK--- 271 (430) T ss_pred ecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCCCCcceEeecccccCCccE---EEEEechHHHHHHhh--- Confidence 56777665544433222223588999999999999999987766666788877775544 999999999999985 Q ss_pred ccCCCcceeh-------hhcCCcccccCcceeEecCCcEEEEeCcchhhhh-----cCCCcccCCCcccccccccCCcce Q lcl|NC_020862. 260 SLGNPAFVPV-------EKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYA-----GAGATATAANRGYQVSDVAGTDKY 327 (405) Q Consensus 260 ~~~~p~fi~v-------~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~-----~aGa~~~~t~~~~~~~~~~g~~~~ 327 (405) |+.|..- ...|+.-|||.||+|.+. |+-+++-|..-.|- .-|++.........+....+++.+ T Consensus 272 ---dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~n--gvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~ 346 (430) T protein:vir:10 272 ---QEKFRSWQAAALARASNAKQHPIFRVDAGLWS--NTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQY 346 (430) T ss_pred ---CcchHHHHHHHHHhhcccccCCceecceeeec--CeEEecCCceeeecCCCccccccCCcccccccccccccccccc Confidence 6777631 234678999999999994 45665555333332 112211111111111122234455 Q ss_pred eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcccc------------c Q lcl|NC_020862. 328 DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGE------------R 395 (405) Q Consensus 328 DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~------------~ 395 (405) .|==-|.+|.+|-+ +.+ ++..++.+...++- ..-| ||.+=-++.++.++.+..+-. = T Consensus 347 ~v~RalllGaQA~~-~A~---g~~~~~g~~f~w~E------e~~D-~g~~~~i~~~~i~G~kK~rF~~~~~~~~~~~DfG 415 (430) T protein:vir:10 347 AVDRALLLGGQALA-QAW---AASEHSGMPFFWSE------KDMD-HGDKLELLIGAILGCSKIRFAVEATNGLEYTDHG 415 (430) T ss_pred cchhhhhccchhhe-eee---eccCCCCcceeeee------eccc-cCchhhhhhhHHhccceeeecCCCCCCceeeeeE Confidence 66667889988752 222 22222334444442 1122 333333666677776655432 2 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) ...|-++|+= T Consensus 416 vi~idtaa~~ 425 (430) T protein:vir:10 416 VMAIDTAVKI 425 (430) T ss_pred EEEhhhhhhh Confidence 3445566655 No 20 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.10 E-value=4.5e-12 Score=82.79 Aligned_cols=265 Identities=13% Similarity=0.114 Sum_probs=165.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |++ +.++.-+.+..|..-..+...+.++|.++|.. ..|+-..|++|.|-.|. +..+...+.||-+-.- T Consensus 1 Ma~----------T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~~ 69 (270) T protein:vir:95 1 MTQ----------TKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMDT 69 (270) T ss_pred CCc----------eehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeec-CCCccccccCCCccch Confidence 433 44444455556666666777777999999987 45666679999999997 5445566666654332 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) .++ +..+-.++++++|.-+++||++....-. |+++++.+.+...- T Consensus 70 ~~l----------------------------------t~~~~~a~i~~~gk~~~itD~a~~~~~~-dp~~~~~~q~a~~~ 114 (270) T protein:vir:95 70 TQM----------------------------------SMTTTKVTVKETGKAVEVTQTAIITNVN-GTLQEASRQLAMSL 114 (270) T ss_pred hhc----------------------------------ccchheeeeehhhCcceecHHHHhhhcc-chHHHHHHHHHHHH Confidence 222 3334579999999999999986555433 66776555544322 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) + ..++.++++.... ++. +.+..+|++.+-++...|....- T Consensus 115 a----~~~d~~li~~l~~------a~~-----------~~~~~~t~~~~~dA~~~lgd~~~------------------- 154 (270) T protein:vir:95 115 A----DKVEIDYIAELNK------SKQ-----------TATVSADATGILDAIEVFNSEND------------------- 154 (270) T ss_pred H----HHHHHHHHHHhcc------ccc-----------ccccccCHHHHHHHHHHhccccC------------------- Confidence 2 3334444433211 111 11234788888888877754322 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) .-.+.+|||.+...||.. +|+.-.+|++.. +.+|+||.+-| +|.|+.... T Consensus 155 ~~~~i~vhs~~~~~Lrk~-------~~~~~~~~~~~~-~~~G~ig~~~G--~~Viv~s~~-------------------- 204 (270) T protein:vir:95 155 EDYVLYVNPKDYNKLVKS-------LFKVGGNVQDRA-ISKGDLVEIVG--VSDIVKSKR-------------------- 204 (270) T ss_pred CCcEEEEcHHHHHHHHhh-------hcccccccccch-hcccccceecc--eeEEEeCCC-------------------- Confidence 136899999999999853 477777898864 68999999966 787654321 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mari 399 (405) .+.|-..++++.|.+...-+ .+. +- . +| ||+-+.=..--...|++.+.++.-.+.| T Consensus 205 -------~~~~~~~l~~~gAi~~~~~~--------~~~--vE-t-----dR-d~~~~~d~i~~~~~y~v~~~~~skvv~~ 260 (270) T protein:vir:95 205 -------VSENTAFLQRYGAMEIVNKK--------KPE--AY-T-----DF-DILKRTHLLSTNYHYSVNLKDETGVVKV 260 (270) T ss_pred -------CCceeEEEEeccceeeeecC--------Cce--ee-e-----cc-chhhcccEEEeeeEEEEEEEccceEEEE Confidence 12344568888887753322 122 21 1 22 5555444444556789999998887776 Q ss_pred EE---ecCC Q lcl|NC_020862. 400 YS---VIPE 405 (405) Q Consensus 400 e~---~a~~ 405 (405) .. .++| T Consensus 261 t~~~a~~~~ 269 (270) T protein:vir:95 261 TFKPSGSLE 269 (270) T ss_pred EecCCCCcC Confidence 43 2333 No 21 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.01 E-value=1e-10 Score=75.31 Aligned_cols=266 Identities=15% Similarity=0.184 Sum_probs=145.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc--ccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK--QMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~--~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) |.. + ...+-.|....+....+.+++.+++... .. .+.|+||.|++.-... ..+-..+|-... T Consensus 1 MA~--~------------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~-~~~GdTv~ip~~~~~~-~~d~~~~~~~~~ 64 (273) T protein:vir:79 1 MAF--N------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGI-ASKGNVVHIAGVVAPT-VKDYKAAGRQTS 64 (273) T ss_pred Ccc--h------------hhhHHHHHHHHHHHHHhhccchhhhhcccccc-ccCCcEEEEeecCccc-ccccccCCCccC Confidence 322 0 0233467888888888889999987443 23 3569999999854322 222222332222 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .+. .+..+++.+|.|+ +.=..++|+....... ++ ..+.+++.. T Consensus 65 ~~~----------------------------------~~~~~~~~tid~~~~~~~~i~d~d~~~~~~-~~-~~~~~~~~~ 108 (273) T protein:vir:79 65 ADA----------------------------------ISDTGVDLLIDQEKSIDFLVDDIDRVQVAG-SL-EAYTRAGAT 108 (273) T ss_pred ccc----------------------------------cccceEEEEEeeecccceeeccHHHHhhcc-cH-HHHHHHHHH Confidence 111 2344677899774 5555777754433333 34 334444433 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .-+ ...|.-...+++++.... ++ ...+ + ..-.++.|..+.+.|++++.|. T Consensus 109 ala-~~vD~~i~~~~~~a~~~~-~~----~~~~-----~----~~~~~~~i~~a~~~ld~~~vP~--------------- 158 (273) T protein:vir:79 109 ALA-TDTDKFIADMLVDNGTAL-TG----SAPS-----D----ADDAFDLIASALKELTKANVPN--------------- 158 (273) T ss_pred HHH-HHHHHHHHHHHhhccccc-cc----cccc-----c----hhhHHHHHHHHHHHhhhccCCc--------------- Confidence 222 222322333343322111 11 0000 0 1124678999999999999984 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCc-ceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc---ccCCC Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPA-FVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT---ATAAN 313 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~-fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~---~~~t~ 313 (405) .-|+++|+|+...+|+.. +. |......++...+.+|.||++.| |.++.+..+-...+.... ..+.. T Consensus 159 --~~R~lvv~p~~~~~Ll~~------~~~~~~~~~~~~~~~l~~G~ig~~~G--~~i~~s~~lp~~~~~~~~a~~~~A~~ 228 (273) T protein:vir:79 159 --VGRVVVVNAEMAFWLRSS------GSKLTSADTSGDAAGLRAGTIGNLLG--ARIVESNNLRDTDDEQFVAFHPSAAA 228 (273) T ss_pred --cCcEEEECHHHHHHHhhc------hhhhhhhhhcccccceeeeEeeEEec--eEEEecccccccCceEEEEEecccee Confidence 247899999999998753 33 56666667777788999999977 899998776543321110 11111 Q ss_pred cccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC Q lcl|NC_020862. 314 RGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE 365 (405) Q Consensus 314 ~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~ 365 (405) -..+.......-.-+=|..+|-|.+.||.--++ +.++ +.+++.|. T Consensus 229 ~a~~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~------p~~v-v~~~~~g~ 273 (273) T protein:vir:79 229 YVSQIDTVEALRDQDSFSDRIRALHVYGGKVVR------PTGV-VVFNKTGS 273 (273) T ss_pred eeeehhhhhcccCcccceeeeeeeeeeeeEEec------CceE-EEEeccCC Confidence 011111111111122467789999999988885 1222 23444442 No 22 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.97 E-value=1.5e-10 Score=74.45 Aligned_cols=294 Identities=13% Similarity=0.108 Sum_probs=167.2 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) |..--.++..+++++-+.-.-+-.+.++.+....+..++.+++...+|+. ..+++-++..-+.+ ..-..++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a------~~v~E~~ 71 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGA------YWVSETE 71 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcce------EEeecCc Confidence 66555455444444444433444456677777777888899998888764 23444433221111 1111111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+.-...++..|+.++++++.++.+|+++ .-|+.-++.+.+..++.+.-+ T Consensus 72 -----------------------------~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia 121 (304) T protein:vir:10 72 -----------------------------RIQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFY 121 (304) T ss_pred -----------------------------ccccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHH Confidence 11222236677899999999999999985 445555777777777665333 Q ss_pred hHHHHHHHHHHhccCceEEecCCC-ccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAA-TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~a-ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) .- +-..+++|.+.-.-.|.. .+.............++.+++++|.++...|+.+.... T Consensus 122 ~~----~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----------------- 180 (304) T protein:vir:10 122 KA----FDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----------------- 180 (304) T ss_pred HH----HHhhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----------------- Confidence 32 233445554432222211 11111111112222345688999999998887755431 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + ..+||+.+...|+.++|-.+.|-|. ...|++-| +..+.++.|... T Consensus 181 ~--~~v~~~~~~~~L~~lkd~~G~~l~~-------------~~~~~l~G--~PV~~~~~~~~~----------------- 226 (304) T protein:vir:10 181 N--GVLTTRSFRSKMRNALDANDRPLFD-------------ANGNEIMG--LPLSYTGADVYD----------------- 226 (304) T ss_pred C--EEEEcHHHHHHHHHhhccCCcEeec-------------CCCccccc--eeeEEecccccC----------------- Confidence 2 3478999999999988766655553 33466755 455556544210 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEE-ecCCCCCCCCCCcc------chhhhHHHH--HHHHHhh Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIV-KKPGEATADRNDPY------GKVGFSSIK--FFYGFIK 390 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~iv-k~pG~~tad~~DPl------gQrg~~gwK--~~~~~~i 390 (405) .++. .+++|.-+.-.+++.+ .+++-+ ..++-......|+. -|++.+.|+ +++++.+ T Consensus 227 ----~~~~----~~~~gd~~~~~~~~~~-------~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:10 227 ----KKKS----LALMGDWDYARYGILQ-------GIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred ----CCCc----EEEEEehhhEEEEEec-------ceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 0111 2567776665565542 123222 11111112334554 467778887 6899999 Q ss_pred ccccceEEEEEec Q lcl|NC_020862. 391 LRGERIAVAYSVI 403 (405) Q Consensus 391 L~~~~marie~~a 403 (405) ++++-+++|+.+= T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:10 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 9999999999887 No 23 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.97 E-value=1.5e-10 Score=74.45 Aligned_cols=294 Identities=13% Similarity=0.108 Sum_probs=167.2 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) |..--.++..+++++-+.-.-+-.+.++.+....+..++.+++...+|+. ..+++-++..-+.+ ..-..++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a------~~v~E~~ 71 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGA------YWVSETE 71 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcce------EEeecCc Confidence 66555455444444444433444456677777777888899998888764 23444433221111 1111111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+.-...++..|+.++++++.++.+|+++ .-|+.-++.+.+..++.+.-+ T Consensus 72 -----------------------------~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia 121 (304) T protein:vir:94 72 -----------------------------RIQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFY 121 (304) T ss_pred -----------------------------ccccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHH Confidence 11222236677899999999999999985 445555777777777665333 Q ss_pred hHHHHHHHHHHhccCceEEecCCC-ccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAA-TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~a-ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) .- +-..+++|.+.-.-.|.. .+.............++.+++++|.++...|+.+.... T Consensus 122 ~~----~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----------------- 180 (304) T protein:vir:94 122 KA----FDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIEDEELDP----------------- 180 (304) T ss_pred HH----HHhhheeccCCCcccccccccccccccccccccccccchHHHHHHHHHHhhhccCCc----------------- Confidence 32 233445554432222211 11111111112222345688999999998887755431 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + ..+||+.+...|+.++|-.+.|-|. ...|++-| +..+.++.|... T Consensus 181 ~--~~v~~~~~~~~L~~lkd~~G~~l~~-------------~~~~~l~G--~PV~~~~~~~~~----------------- 226 (304) T protein:vir:94 181 N--GVLTTRSFRSKMRNALDANDRPLFD-------------ANGNEIMG--LPLSYTGADVYD----------------- 226 (304) T ss_pred C--EEEEcHHHHHHHHHhhccCCcEeec-------------CCCccccc--eeeEEecccccC----------------- Confidence 2 3478999999999988766655553 33466755 455556544210 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEE-ecCCCCCCCCCCcc------chhhhHHHH--HHHHHhh Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIV-KKPGEATADRNDPY------GKVGFSSIK--FFYGFIK 390 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~iv-k~pG~~tad~~DPl------gQrg~~gwK--~~~~~~i 390 (405) .++. .+++|.-+.-.+++.+ .+++-+ ..++-......|+. -|++.+.|+ +++++.+ T Consensus 227 ----~~~~----~~~~gd~~~~~~~~~~-------~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v 291 (304) T protein:vir:94 227 ----KKKS----LALMGDWDYARYGILQ-------GIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN 291 (304) T ss_pred ----CCCc----EEEEEehhhEEEEEec-------ceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe Confidence 0111 2567776665565542 123222 11111112334554 467778887 6899999 Q ss_pred ccccceEEEEEec Q lcl|NC_020862. 391 LRGERIAVAYSVI 403 (405) Q Consensus 391 L~~~~marie~~a 403 (405) ++++-+++|+.+= T Consensus 292 ~~~~a~~~l~~a~ 304 (304) T protein:vir:94 292 VKPEAFATLKPTE 304 (304) T ss_pred ecccceEEEEecC Confidence 9999999999887 No 24 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.94 E-value=2.1e-10 Score=73.66 Aligned_cols=306 Identities=12% Similarity=0.100 Sum_probs=153.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcccccc--ccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK--QMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~--~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) |+-.=|. + +++--|.+.+|..-++.+|+.++.... .-..++|+||++++ +...+..+ |-+-. T Consensus 1 m~~~~N~--------~---ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~v----p~~~~v~d-g~~~~ 64 (418) T protein:vir:10 1 MAVQDNN--------L---LTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKL----PYRVKSAS-GRTLV 64 (418) T ss_pred CCccccc--------c---ccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEee----CCceeecc-cCCcc Confidence 3322221 1 123468899998888999988866542 23367899999985 33333332 21111 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEe-eeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTL-TEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l-~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .+ ..+-..++.+| ++-.+-.+++|+...++.+ ++.. +.++ T Consensus 65 ~~----------------------------------~~te~~v~l~id~~k~~~~~itD~e~a~~~~-d~~~----~~l~ 105 (418) T protein:vir:10 65 KQ----------------------------------PMVDQTIPFKIAYQEHVGLEYTVKDKTLDIM-QFSE----RYLK 105 (418) T ss_pred cc----------------------------------ccccceEEEEEecccccceeechHHHhhhhh-HHHH----HHHH Confidence 11 11233556777 4455667888865444333 3333 3333 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .|+.---+.+-.+|+. .|.++..... ...++ .-.++++.++.+.|.++++|+ T Consensus 106 ~A~~aLA~~vD~~ia~-----l~~~a~~~~g----t~gt~----~~~~~~i~~a~~~Ld~~~VP~--------------- 157 (418) T protein:vir:10 106 SGMVQIANQIDRSLAL-----TLKKAFHSSG----TPGVR----PGAFIDFANAGAKQTTYAVPQ--------------- 157 (418) T ss_pred HHHHHHHHHHHHHHHH-----HHhhcccccc----cCCcC----cchHHHHHHHHHHHHhcCCCC--------------- Confidence 3322211222222221 0111111111 01111 124899999999999999984 Q ss_pred ccce-EEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc-CC--- Q lcl|NC_020862. 238 ISAS-RIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT-AA--- 312 (405) Q Consensus 238 I~~s-yv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~-~t--- 312 (405) .. |++++.|+....|.+ ++.|.. .++++.+.+-+|+||+|.| |.++++..+-. .-+|.... .+ T Consensus 158 --~G~R~lVv~P~~~~~L~~------~~~~~~-~~~~~~~~lr~G~IG~i~G--F~V~~S~nip~-~tag~~~~t~~v~g 225 (418) T protein:vir:10 158 --DGMRHAVLDPFTCASLSD------EVTKLF-KESMVEQAYKMGYRGNVAA--YEVYESQNLPK-HTVGDHGGTPLVNG 225 (418) T ss_pred --CCceEEEeCHHHHHHHhh------hccccc-cccccchhhheeeeeeeec--eEEEEecCCCc-ccccccccceeeec Confidence 12 788899998887753 345553 5778888889999999977 88888876652 22222111 00 Q ss_pred --Ccccccc------------------------------------------------cccCCcceeeeEEEEEccc---- Q lcl|NC_020862. 313 --NRGYQVS------------------------------------------------DVAGTDKYDIAPLLVVGDQ---- 338 (405) Q Consensus 313 --~~~~~~~------------------------------------------------~~~g~~~~DVYp~lV~G~~---- 338 (405) ..+..+. ...|+..+.+||-|+-+.. T Consensus 226 a~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~ 305 (418) T protein:vir:10 226 TVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINN 305 (418) T ss_pred ccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccccc Confidence 0011110 0111222333433320000 Q ss_pred ---------cceee----------------------cceeccCCCCCCceEEEecC----CCCCCCC-CCcc-------- Q lcl|NC_020862. 339 ---------AFATI----------------------GLQGMSGKGKSKFRIIVKKP----GEATADR-NDPY-------- 374 (405) Q Consensus 339 ---------Afg~i----------------------~l~g~~~~g~~~~~~ivk~p----G~~tad~-~DPl-------- 374 (405) +|..+ +|-+ -+..|.++..++ |...... .+|+ T Consensus 306 ~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f----~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~ 381 (418) T protein:vir:10 306 ENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLF----HRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLT 381 (418) T ss_pred cccccccccCCCcccccccCcceeeeecccccceeeeeee----ecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEE Confidence 00000 0000 001222222222 2211111 1333 Q ss_pred ------chhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 375 ------GKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 375 ------gQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+--.+.|=.+|++..|++||.+||-=.|-- T Consensus 382 ~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 382 GAYDINEQSEIHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred EcccccccceEEEEEeecCceeecccceEEEEeecCC Confidence 3334445556999999999999988765555 No 25 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.93 E-value=1.3e-09 Score=69.39 Aligned_cols=312 Identities=11% Similarity=0.065 Sum_probs=179.2 Q ss_pred CCccccCcCCCcccc-ccc--ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIYNDPAAGDAST-VGP--QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y~~~~~t~~~~-v~~--qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) |.-+|++.....--+ -+. .+...-|.-+.+..-+...+|..+-..|.+ ..||++.|-|--..-.. -.+.|-.+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i--~~G~s~~~~~iG~~~~~--~~~~g~~l 76 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSL--RGTNQLRVDRVGASTIA--GRKAGEEL 76 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeec--cccceEEEeeecceeee--eecCCCCC Confidence 888887643331111 122 233345677888877788888899999988 55999999854332221 12223322 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .++.+ + - .+.+-+|-|.=-+-.+=|+.-+...+-|+..++++++.. T Consensus 77 ~~~~~---------------~-----------------~--~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~ 122 (334) T protein:vir:80 77 VVQKN---------------V-----------------S--DKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGI 122 (334) T ss_pred CCCCc---------------c-----------------c--CceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHH Confidence 22222 1 1 122233333211111112233445556788888888776 Q ss_pred HHhhHHHHHHHHHH-hccCceE-------EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_020862. 158 GANEITEDLLQADI-LASADVK-------VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKG 229 (405) Q Consensus 158 ~~~~~ted~l~~~i-lag~~~v-------~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~g 229 (405) .-+.....-+.+.+ +++...- +-+|..+ ....++..+....+--.-++-++.+...|.++.-|.. T Consensus 123 aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~-~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~------ 195 (334) T protein:vir:80 123 ALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILL-PSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQ------ 195 (334) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccccccCCcce-eecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCC------ Confidence 55554433333333 3332211 1112211 1222222221111111122445577778888887721 Q ss_pred ccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCC---cccccCcceeEecCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 230 SRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYAD---AATIMNGEIGAIPGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 230 s~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~---~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aG 306 (405) ...-+++++.|..-..|.+ ++.|+.+ .|+. ...+-.|+|+++.| |+++++++|=.-.+ T Consensus 196 --------~~~~R~~vv~P~~y~~Ll~------~~r~~n~-d~~~s~~~~~~~~g~i~~v~G--~~V~~Sn~~P~~~~-- 256 (334) T protein:vir:80 196 --------LMSEGVTLLDPVIFSFLLE------HDRLMNV-EFGAKEGGNSFVGGRIAMLNG--VRVVETPRFPQSAI-- 256 (334) T ss_pred --------cCCceEEEeChHHHHHHhc------ccccccc-eeccccccccccceeEEEEec--eEEEeecCCCCccc-- Confidence 1124999999999999965 5789988 5643 44678999999976 89999998742111 Q ss_pred CcccCCCcccccccccCCcceeee-------EEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhh Q lcl|NC_020862. 307 ATATAANRGYQVSDVAGTDKYDIA-------PLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGF 379 (405) Q Consensus 307 a~~~~t~~~~~~~~~~g~~~~DVY-------p~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~ 379 (405) +.+..++.+.+| +.+++...|-+++-++....+ --.|+--|..+ T Consensus 257 ------------t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e-----------------~~~~~~~~~d~ 307 (334) T protein:vir:80 257 ------------TANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQ-----------------FWEEKKDFGHY 307 (334) T ss_pred ------------cccccccccccccccccceEEEEEeCceEEEEEEeeccee-----------------eeechhhHHHH Confidence 111112333344 568888888888777532211 12455568888 Q ss_pred HHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 380 SSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 380 ~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) +==|..|++.+||++..+.+|.-.+. T Consensus 308 i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 308 LDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred HHHHHHcCCceeccceEEEEEEeeec Confidence 88899999999999999999987777 No 26 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.90 E-value=4.9e-10 Score=71.61 Aligned_cols=264 Identities=14% Similarity=0.157 Sum_probs=142.7 Q ss_pred cccc-eeehhhhhHHHHHhhhhhhhhcccccc-c-cCcCCCCEEEEEecccCCCCC-CccccCCCcccccccCCcccccc Q lcl|NC_020862. 16 VGPQ-FNVHYWDRKSLIDEAEEMFFSPLADNK-Q-MPKHFGKELKVFYYVPLLDDL-NVNDQGLDATGASYAGGNLYGGS 91 (405) Q Consensus 16 v~~q-m~t~y~~~k~L~~a~p~lv~~~fA~~~-~-mPKn~GktIkfrry~pl~~~~-t~l~eGvtp~g~~~~~gnly~ss 91 (405) ++.+ ..+--|....|..-.+.+++.++.... + -.+ .|+||+|++.-...... ++....+++. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~~~~~d~~~~~~~~~~~------------- 66 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVAPTVKDYKAAGRQTSAD------------- 66 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeecccccccccccCCCccCcc------------- Confidence 2221 123357777777777889999887542 1 234 59999999865443211 1111112211 Q ss_pred cccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_020862. 92 RDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLLQAD 170 (405) Q Consensus 92 ~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l~~~ 170 (405) ..+..+++.+|.|+ .+=..++|+....... ++ ..+.++....-+. ..|.-... T Consensus 67 -----------------------~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~-~~-~~~~~~~~~alA~-~vD~~i~~ 120 (273) T protein:vir:10 67 -----------------------AISDTGVDLLIDQEKSIDFLVDDIDRVQVAG-SL-EAYTRAGATALAT-DTDKFIAD 120 (273) T ss_pred -----------------------ccccceEEEEEeeeeecceEeecHHHhhhhc-cH-HHHHHHHHHHHHH-HHHHHHHH Confidence 12345677888664 6666788865444443 34 3344444433332 22222223 Q ss_pred HhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccc Q lcl|NC_020862. 171 ILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSEL 250 (405) Q Consensus 171 ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl 250 (405) +++++.... .+ ..+++ ..-.++.|..+.+.|++++.|. .-|+++|+|+. T Consensus 121 ~~~~a~~~~-~~----~~~~~---------~~~~~~~i~~a~~~ld~~~vP~-----------------~~R~lvv~p~~ 169 (273) T protein:vir:10 121 MLVDNGTAL-TG----SAPTD---------ADDAFDLIAKALKELTKANVPN-----------------VGRVVVVNAEM 169 (273) T ss_pred HHhcccccc-cc----ccccc---------hhHHHHHHHHHHHHhhhcCCCc-----------------CCCEEEECHHH Confidence 333322111 11 01111 1234788999999999999983 23788999999 Q ss_pred hHHHHHHhcccCCCcce-ehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc---ccCCCcccccccccCCcc Q lcl|NC_020862. 251 EIYITELVDSLGNPAFV-PVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT---ATAANRGYQVSDVAGTDK 326 (405) Q Consensus 251 ~~dir~l~d~~~~p~fi-~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~---~~~t~~~~~~~~~~g~~~ 326 (405) ...|+.. +.|+ ....+++...+.+|.||++.| |.++.+..+-...+..+- ..+..-..+......... T Consensus 170 ~~~L~~~------~~~~~~~~~~~~~~~l~~G~ig~i~G--~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r~ 241 (273) T protein:vir:10 170 AFWLRSS------GSKLTSADTSGDAAGLRAGTIGNLLG--ARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRD 241 (273) T ss_pred HHHHhcc------hhhhhhhhccccccceeeeeeeEEec--eEEEEecccccCCccEEEEEeccceeeeeeeehhhcccC Confidence 9999753 4544 566777777778999999977 899988666432221000 011111111111111111 Q ss_pred eeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC Q lcl|NC_020862. 327 YDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE 365 (405) Q Consensus 327 ~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~ 365 (405) -+=|.-+|-|...||.--++- .++ +.+++.|. T Consensus 242 ~~~~~~~v~~~~~yg~~v~~~------~~~-~~l~~~g~ 273 (273) T protein:vir:10 242 QDSFSDRIRALHVYGGKVVRP------TGV-VVFNKTGS 273 (273) T ss_pred CCcceeeeeeeeeeeeeEecc------ceE-EEEeccCC Confidence 123566788888888877751 122 23444442 No 27 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.90 E-value=4.9e-10 Score=71.61 Aligned_cols=264 Identities=14% Similarity=0.157 Sum_probs=142.7 Q ss_pred cccc-eeehhhhhHHHHHhhhhhhhhcccccc-c-cCcCCCCEEEEEecccCCCCC-CccccCCCcccccccCCcccccc Q lcl|NC_020862. 16 VGPQ-FNVHYWDRKSLIDEAEEMFFSPLADNK-Q-MPKHFGKELKVFYYVPLLDDL-NVNDQGLDATGASYAGGNLYGGS 91 (405) Q Consensus 16 v~~q-m~t~y~~~k~L~~a~p~lv~~~fA~~~-~-mPKn~GktIkfrry~pl~~~~-t~l~eGvtp~g~~~~~gnly~ss 91 (405) ++.+ ..+--|....|..-.+.+++.++.... + -.+ .|+||+|++.-...... ++....+++. T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~~~~~d~~~~~~~~~~~------------- 66 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVAPTVKDYKAAGRQTSAD------------- 66 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeecccccccccccCCCccCcc------------- Confidence 2221 123357777777777889999887542 1 234 59999999865443211 1111112211 Q ss_pred cccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_020862. 92 RDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLLQAD 170 (405) Q Consensus 92 ~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l~~~ 170 (405) ..+..+++.+|.|+ .+=..++|+....... ++ ..+.++....-+. ..|.-... T Consensus 67 -----------------------~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~-~~-~~~~~~~~~alA~-~vD~~i~~ 120 (273) T protein:vir:10 67 -----------------------AISDTGVDLLIDQEKSIDFLVDDIDRVQVAG-SL-EAYTRAGATALAT-DTDKFIAD 120 (273) T ss_pred -----------------------ccccceEEEEEeeeeecceEeecHHHhhhhc-cH-HHHHHHHHHHHHH-HHHHHHHH Confidence 12345677888664 6666788865444443 34 3344444433332 22222223 Q ss_pred HhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccc Q lcl|NC_020862. 171 ILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSEL 250 (405) Q Consensus 171 ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl 250 (405) +++++.... .+ ..+++ ..-.++.|..+.+.|++++.|. .-|+++|+|+. T Consensus 121 ~~~~a~~~~-~~----~~~~~---------~~~~~~~i~~a~~~ld~~~vP~-----------------~~R~lvv~p~~ 169 (273) T protein:vir:10 121 MLVDNGTAL-TG----SAPTD---------ADDAFDLIAKALKELTKANVPN-----------------VGRVVVVNAEM 169 (273) T ss_pred HHhcccccc-cc----ccccc---------hhHHHHHHHHHHHHhhhcCCCc-----------------CCCEEEECHHH Confidence 333322111 11 01111 1234788999999999999983 23788999999 Q ss_pred hHHHHHHhcccCCCcce-ehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc---ccCCCcccccccccCCcc Q lcl|NC_020862. 251 EIYITELVDSLGNPAFV-PVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT---ATAANRGYQVSDVAGTDK 326 (405) Q Consensus 251 ~~dir~l~d~~~~p~fi-~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~---~~~t~~~~~~~~~~g~~~ 326 (405) ...|+.. +.|+ ....+++...+.+|.||++.| |.++.+..+-...+..+- ..+..-..+......... T Consensus 170 ~~~L~~~------~~~~~~~~~~~~~~~l~~G~ig~i~G--~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r~ 241 (273) T protein:vir:10 170 AFWLRSS------GSKLTSADTSGDAAGLRAGTIGNLLG--ARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRD 241 (273) T ss_pred HHHHhcc------hhhhhhhhccccccceeeeeeeEEec--eEEEEecccccCCccEEEEEeccceeeeeeeehhhcccC Confidence 9999753 4544 566777777778999999977 899988666432221000 011111111111111111 Q ss_pred eeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC Q lcl|NC_020862. 327 YDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE 365 (405) Q Consensus 327 ~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~ 365 (405) -+=|.-+|-|...||.--++- .++ +.+++.|. T Consensus 242 ~~~~~~~v~~~~~yg~~v~~~------~~~-~~l~~~g~ 273 (273) T protein:vir:10 242 QDSFSDRIRALHVYGGKVVRP------TGV-VVFNKTGS 273 (273) T ss_pred CCcceeeeeeeeeeeeeEecc------ceE-EEEeccCC Confidence 123566788888888877751 122 23444442 No 28 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.89 E-value=7.3e-10 Score=70.67 Aligned_cols=292 Identities=14% Similarity=0.126 Sum_probs=167.7 Q ss_pred ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccc Q lcl|NC_020862. 3 HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASY 82 (405) Q Consensus 3 ~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~ 82 (405) +=||.-..++++.-+.-+-+ .+.++.+....+..++.+++...+|+.+ +.++.+.. -+. ..-..| | T Consensus 1 ~g~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~~~~~-~~~-a~~v~E-----~--- 66 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPI-NISEQIITGVKNGSAAMKLAKAVPMTKP---EEEFTFMS-GVG-AFWVDE-----A--- 66 (299) T ss_pred CCcCCCcccccCCCceecch-hHHHHHHHHHHhcchhhhhceeeecCCC---cEEEEEEc-CCc-eeeeec-----C--- Confidence 44444333333333333333 3456666666677889999988888643 33333211 111 111122 2 Q ss_pred cCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhH Q lcl|NC_020862. 83 AGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEI 162 (405) Q Consensus 83 ~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ 162 (405) ........++..|+-+.++++.++++|+++++ |+..++.+.+..++.+.. .. T Consensus 67 --------------------------~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~~a~-~~ 118 (299) T protein:vir:41 67 --------------------------ERIQTSKPTFTKAKMRSKKMGVIIPTTKENLN-YSVTNFFSLMQAEIVEAF-YK 118 (299) T ss_pred --------------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHH-HH Confidence 22222224667789999999999999998554 566677776665555433 33 Q ss_pred HHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceE Q lcl|NC_020862. 163 TEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASR 242 (405) Q Consensus 163 ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~sy 242 (405) .+| ..+++|.+.-.=.|..+.. .........+.+++++|.++...|..+..+. - T Consensus 119 ~~d---~a~l~G~g~~~~~gil~~~----~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~-------------------~ 172 (299) T protein:vir:41 119 KFD---QAVFTGVESPYNWNILKSA----TDASNLVEETANKYDDLNEAIGLIEAEDLEP-------------------N 172 (299) T ss_pred HHH---HHHhhcccCcccccccccc----cccceeeccccccHHHHHHHHHhhhcccCCc-------------------C Confidence 333 3455554332111111111 0111111234578999999999888766541 2 Q ss_pred EEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccccc Q lcl|NC_020862. 243 IAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVA 322 (405) Q Consensus 243 v~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~ 322 (405) ..+|||.....|+.|+|.-+.|-|.|.- .+..+.+-| +.++.++.|- +|+ T Consensus 173 ~~v~n~~~~~~L~~lkd~~G~~l~~~~~---------~~~~~~l~G--~PV~~~~~~~----~~~--------------- 222 (299) T protein:vir:41 173 GIATIRKQRVKYRSTKDGNGMPIFNTAT---------SNGVDDVLG--LPIAYTPKYT----FGD--------------- 222 (299) T ss_pred EEEEcHHHHHHHHHhhccCCceeecCCc---------CCCCceecc--eeeEEecccC----CCC--------------- Confidence 3589999999999999988888887542 334456655 5667776553 111 Q ss_pred CCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCC--CCCCCCc--cchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 323 GTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEA--TADRNDP--YGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 323 g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~--tad~~DP--lgQrg~~gwK--~~~~~~iL~~~~ 395 (405) ++ +.+++|.-++..+++++ .+.+-+.. .... .-..+.| +-|++.+.+| +++++.+++++- T Consensus 223 --~~----~~~~~gdfs~~~i~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A 289 (299) T protein:vir:41 223 --KD----ISELVGDWNQAYYGILR-------GVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEA 289 (299) T ss_pred --Cc----eEEEEEecccEEEEEec-------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Confidence 11 24678887776676652 13332221 1110 0011222 2477778888 678999999999 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) +++|+..+=- T Consensus 290 ~~~l~~~aa~ 299 (299) T protein:vir:41 290 FSAVQPKAGN 299 (299) T ss_pred eEEEEeccCC Confidence 9999988877 No 29 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.88 E-value=1e-09 Score=69.86 Aligned_cols=318 Identities=13% Similarity=0.121 Sum_probs=174.5 Q ss_pred CCcccc--CcCCCcccc----c-cc-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccc Q lcl|NC_020862. 1 MPHIYN--DPAAGDAST----V-GP-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVND 72 (405) Q Consensus 1 ~~~~y~--~~~~t~~~~----v-~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~ 72 (405) |+.+-| +|.-....- . .+ ++...-|....|..-++.-++..+-..+++ ..|++++|.|--..-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i--~~G~tv~i~~ig~~~~------ 72 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDL--RGGKSKQFMFTGKLSA------ 72 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccc--cccceEEEEeccceeE------ Confidence 665532 221111111 1 11 344445677777766666666666666666 2699999997653311 Q ss_pred cCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 73 QGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 73 eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) ++.+| |+.+. + ..|+ +..+++-+|-|.=-|..+=|+.-..+.+-++..+++ T Consensus 73 ~~~~~-g~~l~-~-----~~~~----------------------~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~ 123 (332) T protein:vir:78 73 GYHTP-GTPIV-G-----DAGI----------------------KANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVS 123 (332) T ss_pred eeecC-CCCCC-C-----CCCC----------------------CCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHH Confidence 22222 11110 0 0011 122344566664444444455555556667888888 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEecCC--CccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVFTGA--ATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGS 230 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~yag~--ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs 230 (405) .+....-++....-+-+.+..++....=++. ..+.+.+++..++ +-.--++.|+++...|.+++.|. T Consensus 124 ~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~---~~~~~~~~i~~a~~~Lde~~VP~-------- 192 (332) T protein:vir:78 124 KQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHVNIGAGNTN---DAQAIVDGFFEAAAVLDERSAPQ-------- 192 (332) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccccccCCcccc---CHHHHHHHHHHHHHHHhhcCCCc-------- Confidence 8877655554433333333332221110100 0122223332221 11234677899999999999983 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcc-eeEecCCcEEEEeCcchhhhhcCC--- Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGE-IGAIPGAHLRIVVVPQMMHYAGAG--- 306 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gE-IGsi~g~n~Rfv~~p~~~~~~~aG--- 306 (405) .-|++++.|+.-..|.... +|.|+.....+..+.+.+|. ||+|.| |++++++++-.=.+.. T Consensus 193 ---------~gR~~vv~P~~y~~Ll~~~----d~~~~n~~~~~~~~~~~~g~~i~~i~G--~~V~~Sn~lp~~~g~~~~~ 257 (332) T protein:vir:78 193 ---------EGRVAVLSPRQYYSLISSV----DTNILNREIGNSQGDMNSGKGLYSIAG--IRILKSNNLAGLYGQDLSS 257 (332) T ss_pred ---------cCCEEEeCHHHHHHHHhhc----CceeeeeeccccccceecceeeeEEee--eEEEecCccccCccccccc Confidence 2489999999999996533 57788776666677788885 999965 8999999884111100 Q ss_pred CcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHH Q lcl|NC_020862. 307 ATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFY 386 (405) Q Consensus 307 a~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~ 386 (405) ++.++.+..|... ++-.--|+|.++|-|++-+++.. +-+. .+--|+--|.-.+=-+..| T Consensus 258 ~~~~~~~n~~~~~-------~~~~~~~~~h~~a~~~v~~~~~~--------~~~t------~~~~~~~~~~d~i~~~~~~ 316 (332) T protein:vir:78 258 AAVTGENNDYQVD-------ASALAGLIFHREAAGCIQSVAPT--------IQTT------SGDFNVQYQGDLIVGKLAM 316 (332) T ss_pred ccccccccccccc-------cccceEEeecccceeeeeeeccc--------hhhh------hcccchhhhHhhhhhhhhh Confidence 0000111111111 12334688999998887765421 1111 0112333344445556789 Q ss_pred HHhhccccceEEEEEe Q lcl|NC_020862. 387 GFIKLRGERIAVAYSV 402 (405) Q Consensus 387 ~~~iL~~~~marie~~ 402 (405) ++.+||++..+.|+++ T Consensus 317 G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 317 GCGSLRTSVAGSFQAA 332 (332) T ss_pred cCceecccceEEEeeC Confidence 9999999999999999 No 30 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.85 E-value=1.3e-09 Score=69.30 Aligned_cols=307 Identities=9% Similarity=0.033 Sum_probs=173.0 Q ss_pred CCccccCcC-CCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPA-AGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~-~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |.---..+. .+.+.+-+.-+-+.+ .++.+....+..++.+++...+|+.+. +++-+..--+ .. ...+.| T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~---~~~p~~~~~~---~a---~~v~Eg 70 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQ-SQDYFAEIEKTSIVQRIARKVPMGPTG---ISIPHWTGAV---SA---SWTGEA 70 (330) T ss_pred CcccccchhhccccCCCcceechhH-HHHHHHHHHhccchhhhcceeeccCCc---eEEEEEcCCc---ce---eEecCC Confidence 554433332 222333333344545 356677777788999999998887533 3333322111 00 011122 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +.+ .-...++..|+.+.++++.++++|+++ +.|+..++...+..++.+.. T Consensus 71 ~~~-----------------------------~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~ai 120 (330) T protein:vir:77 71 ERK-----------------------------PITKGSFGKQELEPVKITTIFAESAEV-VRLNPLNYLNTMRTKIAEAI 120 (330) T ss_pred Ccc-----------------------------ccccceeeEEEEeEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHH Confidence 221 122235677899999999999999984 45666677777777666544 Q ss_pred hhHHHHHHHHHHhccCceE-----EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 160 NEITEDLLQADILASADVK-----VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v-----~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) +.- ++ ..+++|.+.- ..++.............+........+++|.++...|..+.... T Consensus 121 ~~~-~~---~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~------------ 184 (330) T protein:vir:77 121 ALK-FD---AAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW------------ 184 (330) T ss_pred HHH-HH---HHhhcccCCCCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCc------------ Confidence 332 22 3555654421 11111111111111112222335567888888888888776542 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) -..+||+.....|+.|+|-.+.|-|.|..+-++.. ...-+.+-| +.++.++.|-. |. T Consensus 185 -------~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~---~~~~~~l~G--~PV~~~~~~p~----~~------- 241 (330) T protein:vir:77 185 -------TGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVG---AIREGRILG--RPTYVADNVVN----GT------- 241 (330) T ss_pred -------cEEEEcHHHHHHHHHHhccCCceeecCcccccccc---ccCCceecc--eeeEEeccccC----CC------- Confidence 23579999999999999999999998765544443 334455645 56677765531 11 Q ss_pred ccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC---------CCCCCCCccchhhhHHHH-- Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE---------ATADRNDPYGKVGFSSIK-- 383 (405) Q Consensus 315 ~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~---------~tad~~DPlgQrg~~gwK-- 383 (405) .+++ +.+++|+-+...++..+ .+++-+..-.. ......--+-|++...|| T Consensus 242 --------~~~~----~~~~~gd~s~~~i~~~~-------~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 302 (330) T protein:vir:77 242 --------VGNR----VVGVMGDFSQVIWGQIG-------GLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCE 302 (330) T ss_pred --------CCCc----cEEEEEecceEEEEEec-------CcEEEEeecceeeecccccccccccccchhhcCcEEEEEE Confidence 1122 33667776666666642 23333221100 000111123467778888 Q ss_pred HHHHHhhccccceEEEEEecC----C Q lcl|NC_020862. 384 FFYGFIKLRGERIAVAYSVIP----E 405 (405) Q Consensus 384 ~~~~~~iL~~~~marie~~a~----~ 405 (405) +++.+.+++++-+++|+.+++ | T Consensus 303 ~r~d~~v~~~~a~~~i~~~~~~~~~~ 328 (330) T protein:vir:77 303 AEFAFMVNDKDAFVKLTDQVAGTDPE 328 (330) T ss_pred EEeccEEecccceEEEEeccCCcCCC Confidence 588999999999999988764 3 No 31 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.84 E-value=4.4e-09 Score=66.37 Aligned_cols=335 Identities=15% Similarity=0.160 Sum_probs=166.2 Q ss_pred CCccccCcCCC---------cccccccceeehhhhhHHHHHhhhh-hhhhccccc--------cccCcCCCCEEEEEecc Q lcl|NC_020862. 1 MPHIYNDPAAG---------DASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADN--------KQMPKHFGKELKVFYYV 62 (405) Q Consensus 1 ~~~~y~~~~~t---------~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~--------~~mPKn~GktIkfrry~ 62 (405) |+- |-+|.+- ....-.++.+. |.+|+....+.. -.+..+... .++-|+.|.+|.|.=-. T Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:10 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCC-cCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 432 3333221 11122333232 334444443332 233333333 45669999999988555 Q ss_pred cCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhh Q lcl|NC_020862. 63 PLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFD 142 (405) Q Consensus 63 pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d 142 (405) +|.-. |..++-+.+| |-..+.+.+-++.|.|-..=+-.... +.+ T Consensus 78 ~L~g~--gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~--msq 121 (404) T protein:vir:10 78 KLSKR--PTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGR--MSQ 121 (404) T ss_pred ecccC--CcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCc--hhh Confidence 55332 2222222222 11123333444455554444433331 111 Q ss_pred hc--cchHHHHHHHHHHHHhhHHHHHHHHHHhccCc--------------------------------eEEecCCCccce Q lcl|NC_020862. 143 TD--SDLYGHLSREMLRGANEITEDLLQADILASAD--------------------------------VKVFTGAATSMV 188 (405) Q Consensus 143 ~d--~~l~~~~~~ell~~~~~~ted~l~~~ilag~~--------------------------------~v~yag~ats~~ 188 (405) .- =+|.++ .+..|..-..-..|++.-..|+|+. .+++.|.+|+.. T Consensus 122 QRt~~dlr~~-ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~ 200 (404) T protein:vir:10 122 QRTKFNLASS-ARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE 200 (404) T ss_pred hhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchh Confidence 11 123332 4444444444444555444444433 245556666665 Q ss_pred eeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCccee Q lcl|NC_020862. 189 TMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVP 268 (405) Q Consensus 189 ~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~ 268 (405) +++. .+.+|++.|.++.+.++...-|.+-=.+.|-.+-+. .+-||+||||....|||. +..-+.|.. T Consensus 201 ~l~s-------tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d 267 (404) T protein:vir:10 201 QIEA-------ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQ 267 (404) T ss_pred hhhh-------cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHH Confidence 5543 378999999999999998776643223333322222 334999999999999985 111134777 Q ss_pred hhhc------CCcccccCcceeEecCCcEEEEeCcch--hhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccc Q lcl|NC_020862. 269 VEKY------ADAATIMNGEIGAIPGAHLRIVVVPQM--MHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAF 340 (405) Q Consensus 269 v~~Y------a~~~~i~~gEIGsi~g~n~Rfv~~p~~--~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Af 340 (405) ..++ |..-|||.||.|.+.| +-+++-|.+ ...++....+... .+...+..++....|==-|.+|.+|- T Consensus 268 ~q~~A~a~~rg~~nPlF~G~~gm~ng--vii~~~~~~~Irf~~g~~~~~~~n--~~~a~~~~~aa~~~v~RallLGaQAl 343 (404) T protein:vir:10 268 MMVRAVNRAKGFNHPLFKGECAMWRN--ILVRKYAGMPIRFYQGSKVLVSEN--NLTATTKEVAAATNIDRAMLLGAQAL 343 (404) T ss_pred HHHHHhhccccccCCceecCeeEEcC--EEEEecCCceeeecccceeeecCC--ccccccccccccccchhheeecceeE Confidence 6665 4678999999999954 555554433 2223322211111 11111112222223444588898774 Q ss_pred eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_020862. 341 ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLR---------GERIAVAYSVIPE 405 (405) Q Consensus 341 g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~---------~~~marie~~a~~ 405 (405) +. .+ | +.+...+...-+.- | ||.+=-++.++.+|.+.++ |.=...|-++|+= T Consensus 344 ~~-A~-g--~~~g~~~~w~Ee~~--------D-~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 344 AN-AY-G--QKAGGHFNMVEKKT--------D-MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred EE-Ee-e--ccCCCCceeEeecc--------c-cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 32 22 1 11223455544432 2 2333337888888888776 3334556677777 No 32 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.84 E-value=4.4e-09 Score=66.37 Aligned_cols=335 Identities=15% Similarity=0.160 Sum_probs=166.2 Q ss_pred CCccccCcCCC---------cccccccceeehhhhhHHHHHhhhh-hhhhccccc--------cccCcCCCCEEEEEecc Q lcl|NC_020862. 1 MPHIYNDPAAG---------DASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADN--------KQMPKHFGKELKVFYYV 62 (405) Q Consensus 1 ~~~~y~~~~~t---------~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~--------~~mPKn~GktIkfrry~ 62 (405) |+- |-+|.+- ....-.++.+. |.+|+....+.. -.+..+... .++-|+.|.+|.|.=-. T Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:32 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCC-cCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 432 3333221 11122333232 334444443332 233333333 45669999999988555 Q ss_pred cCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhh Q lcl|NC_020862. 63 PLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFD 142 (405) Q Consensus 63 pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d 142 (405) +|.-. |..++-+.+| |-..+.+.+-++.|.|-..=+-.... +.+ T Consensus 78 ~L~g~--gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~--msq 121 (404) T protein:vir:32 78 KLSKR--PTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGR--MSQ 121 (404) T ss_pred ecccC--CcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCc--hhh Confidence 55332 2222222222 11123333444455554444433331 111 Q ss_pred hc--cchHHHHHHHHHHHHhhHHHHHHHHHHhccCc--------------------------------eEEecCCCccce Q lcl|NC_020862. 143 TD--SDLYGHLSREMLRGANEITEDLLQADILASAD--------------------------------VKVFTGAATSMV 188 (405) Q Consensus 143 ~d--~~l~~~~~~ell~~~~~~ted~l~~~ilag~~--------------------------------~v~yag~ats~~ 188 (405) .- =+|.++ .+..|..-..-..|++.-..|+|+. .+++.|.+|+.. T Consensus 122 QRt~~dlr~~-ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~ 200 (404) T protein:vir:32 122 QRTKFNLASS-ARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE 200 (404) T ss_pred hhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchh Confidence 11 123332 4444444444444555444444433 245556666665 Q ss_pred eeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCccee Q lcl|NC_020862. 189 TMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVP 268 (405) Q Consensus 189 ~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~ 268 (405) +++. .+.+|++.|.++.+.++...-|.+-=.+.|-.+-+. .+-||+||||....|||. +..-+.|.. T Consensus 201 ~l~s-------tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d 267 (404) T protein:vir:32 201 QIEA-------ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQ 267 (404) T ss_pred hhhh-------cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHH Confidence 5543 378999999999999998776643223333322222 334999999999999985 111134777 Q ss_pred hhhc------CCcccccCcceeEecCCcEEEEeCcch--hhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccc Q lcl|NC_020862. 269 VEKY------ADAATIMNGEIGAIPGAHLRIVVVPQM--MHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAF 340 (405) Q Consensus 269 v~~Y------a~~~~i~~gEIGsi~g~n~Rfv~~p~~--~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Af 340 (405) ..++ |..-|||.||.|.+.| +-+++-|.+ ...++....+... .+...+..++....|==-|.+|.+|- T Consensus 268 ~q~~A~a~~rg~~nPlF~G~~gm~ng--vii~~~~~~~Irf~~g~~~~~~~n--~~~a~~~~~aa~~~v~RallLGaQAl 343 (404) T protein:vir:32 268 MMVRAVNRAKGFNHPLFKGECAMWRN--ILVRKYAGMPIRFYQGSKVLVSEN--NLTATTKEVAAATNIDRAMLLGAQAL 343 (404) T ss_pred HHHHHhhccccccCCceecCeeEEcC--EEEEecCCceeeecccceeeecCC--ccccccccccccccchhheeecceeE Confidence 6665 4678999999999954 555554433 2223322211111 11111112222223444588898774 Q ss_pred eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_020862. 341 ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLR---------GERIAVAYSVIPE 405 (405) Q Consensus 341 g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~---------~~~marie~~a~~ 405 (405) +. .+ | +.+...+...-+.- | ||.+=-++.++.+|.+.++ |.=...|-++|+= T Consensus 344 ~~-A~-g--~~~g~~~~w~Ee~~--------D-~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 344 AN-AY-G--QKAGGHFNMVEKKT--------D-MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred EE-Ee-e--ccCCCCceeEeecc--------c-cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 32 22 1 11223455544432 2 2333337888888888776 3334556677777 No 33 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.84 E-value=4.4e-09 Score=66.37 Aligned_cols=335 Identities=15% Similarity=0.160 Sum_probs=166.2 Q ss_pred CCccccCcCCC---------cccccccceeehhhhhHHHHHhhhh-hhhhccccc--------cccCcCCCCEEEEEecc Q lcl|NC_020862. 1 MPHIYNDPAAG---------DASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADN--------KQMPKHFGKELKVFYYV 62 (405) Q Consensus 1 ~~~~y~~~~~t---------~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~--------~~mPKn~GktIkfrry~ 62 (405) |+- |-+|.+- ....-.++.+. |.+|+....+.. -.+..+... .++-|+.|.+|.|.=-. T Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:81 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCC-cCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 432 3333221 11122333232 334444443332 233333333 45669999999988555 Q ss_pred cCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhh Q lcl|NC_020862. 63 PLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFD 142 (405) Q Consensus 63 pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d 142 (405) +|.-. |..++-+.+| |-..+.+.+-++.|.|-..=+-.... +.+ T Consensus 78 ~L~g~--gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~--msq 121 (404) T protein:vir:81 78 KLSKR--PTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGR--MSQ 121 (404) T ss_pred ecccC--CcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCc--hhh Confidence 55332 2222222222 11123333444455554444433331 111 Q ss_pred hc--cchHHHHHHHHHHHHhhHHHHHHHHHHhccCc--------------------------------eEEecCCCccce Q lcl|NC_020862. 143 TD--SDLYGHLSREMLRGANEITEDLLQADILASAD--------------------------------VKVFTGAATSMV 188 (405) Q Consensus 143 ~d--~~l~~~~~~ell~~~~~~ted~l~~~ilag~~--------------------------------~v~yag~ats~~ 188 (405) .- =+|.++ .+..|..-..-..|++.-..|+|+. .+++.|.+|+.. T Consensus 122 QRt~~dlr~~-ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~ 200 (404) T protein:vir:81 122 QRTKFNLASS-ARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE 200 (404) T ss_pred hhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchh Confidence 11 123332 4444444444444555444444433 245556666665 Q ss_pred eeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCccee Q lcl|NC_020862. 189 TMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVP 268 (405) Q Consensus 189 ~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~ 268 (405) +++. .+.+|++.|.++.+.++...-|.+-=.+.|-.+-+. .+-||+||||....|||. +..-+.|.. T Consensus 201 ~l~s-------tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d 267 (404) T protein:vir:81 201 QIEA-------ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQ 267 (404) T ss_pred hhhh-------cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHH Confidence 5543 378999999999999998776643223333322222 334999999999999985 111134777 Q ss_pred hhhc------CCcccccCcceeEecCCcEEEEeCcch--hhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccc Q lcl|NC_020862. 269 VEKY------ADAATIMNGEIGAIPGAHLRIVVVPQM--MHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAF 340 (405) Q Consensus 269 v~~Y------a~~~~i~~gEIGsi~g~n~Rfv~~p~~--~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Af 340 (405) ..++ |..-|||.||.|.+.| +-+++-|.+ ...++....+... .+...+..++....|==-|.+|.+|- T Consensus 268 ~q~~A~a~~rg~~nPlF~G~~gm~ng--vii~~~~~~~Irf~~g~~~~~~~n--~~~a~~~~~aa~~~v~RallLGaQAl 343 (404) T protein:vir:81 268 MMVRAVNRAKGFNHPLFKGECAMWRN--ILVRKYAGMPIRFYQGSKVLVSEN--NLTATTKEVAAATNIDRAMLLGAQAL 343 (404) T ss_pred HHHHHhhccccccCCceecCeeEEcC--EEEEecCCceeeecccceeeecCC--ccccccccccccccchhheeecceeE Confidence 6665 4678999999999954 555554433 2223322211111 11111112222223444588898774 Q ss_pred eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_020862. 341 ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLR---------GERIAVAYSVIPE 405 (405) Q Consensus 341 g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~---------~~~marie~~a~~ 405 (405) +. .+ | +.+...+...-+.- | ||.+=-++.++.+|.+.++ |.=...|-++|+= T Consensus 344 ~~-A~-g--~~~g~~~~w~Ee~~--------D-~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 344 AN-AY-G--QKAGGHFNMVEKKT--------D-MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred EE-Ee-e--ccCCCCceeEeecc--------c-cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 32 22 1 11223455544432 2 2333337888888888776 3334556677777 No 34 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.84 E-value=4.4e-09 Score=66.37 Aligned_cols=335 Identities=15% Similarity=0.160 Sum_probs=166.2 Q ss_pred CCccccCcCCC---------cccccccceeehhhhhHHHHHhhhh-hhhhccccc--------cccCcCCCCEEEEEecc Q lcl|NC_020862. 1 MPHIYNDPAAG---------DASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADN--------KQMPKHFGKELKVFYYV 62 (405) Q Consensus 1 ~~~~y~~~~~t---------~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~--------~~mPKn~GktIkfrry~ 62 (405) |+- |-+|.+- ....-.++.+. |.+|+....+.. -.+..+... .++-|+.|.+|.|.=-. T Consensus 1 ~~~-~~~~~a~~~~~~~lft~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~ 77 (404) T protein:vir:10 1 MTT-VTSAQANKLYQVALFTAANRNRSMVNI--LTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMH 77 (404) T ss_pred CCC-cCCcchhhhHHHHHHHHHhcCChhHhh--hhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEee Confidence 432 3333221 11122333232 334444443332 233333333 45669999999988555 Q ss_pred cCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhh Q lcl|NC_020862. 63 PLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFD 142 (405) Q Consensus 63 pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d 142 (405) +|.-. |..++-+.+| |-..+.+.+-++.|.|-..=+-.... +.+ T Consensus 78 ~L~g~--gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~--msq 121 (404) T protein:vir:10 78 KLSKR--PTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGR--MSQ 121 (404) T ss_pred ecccC--CcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCc--hhh Confidence 55332 2222222222 11123333444455554444433331 111 Q ss_pred hc--cchHHHHHHHHHHHHhhHHHHHHHHHHhccCc--------------------------------eEEecCCCccce Q lcl|NC_020862. 143 TD--SDLYGHLSREMLRGANEITEDLLQADILASAD--------------------------------VKVFTGAATSMV 188 (405) Q Consensus 143 ~d--~~l~~~~~~ell~~~~~~ted~l~~~ilag~~--------------------------------~v~yag~ats~~ 188 (405) .- =+|.++ .+..|..-..-..|++.-..|+|+. .+++.|.+|+.. T Consensus 122 QRt~~dlr~~-ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~ 200 (404) T protein:vir:10 122 QRTKFNLASS-ARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE 200 (404) T ss_pred hhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchh Confidence 11 123332 4444444444444555444444433 245556666665 Q ss_pred eeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCccee Q lcl|NC_020862. 189 TMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVP 268 (405) Q Consensus 189 ~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~ 268 (405) +++. .+.+|++.|.++.+.++...-|.+-=.+.|-.+-+. .+-||+||||....|||. +..-+.|.. T Consensus 201 ~l~s-------tD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~---~~~yV~~~~p~q~~~Lr~---dt~~~~w~d 267 (404) T protein:vir:10 201 QIEA-------ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGE---DPYYVLYVTPRQWNDWYT---STSGKDWNQ 267 (404) T ss_pred hhhh-------cccccHHHHHHHHHHHHHhCCCCcceEeccccccCc---cceEEEEechHHHHHHhh---CCCcHHHHH Confidence 5543 378999999999999998776643223333322222 334999999999999985 111134777 Q ss_pred hhhc------CCcccccCcceeEecCCcEEEEeCcch--hhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccc Q lcl|NC_020862. 269 VEKY------ADAATIMNGEIGAIPGAHLRIVVVPQM--MHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAF 340 (405) Q Consensus 269 v~~Y------a~~~~i~~gEIGsi~g~n~Rfv~~p~~--~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Af 340 (405) ..++ |..-|||.||.|.+.| +-+++-|.+ ...++....+... .+...+..++....|==-|.+|.+|- T Consensus 268 ~q~~A~a~~rg~~nPlF~G~~gm~ng--vii~~~~~~~Irf~~g~~~~~~~n--~~~a~~~~~aa~~~v~RallLGaQAl 343 (404) T protein:vir:10 268 MMVRAVNRAKGFNHPLFKGECAMWRN--ILVRKYAGMPIRFYQGSKVLVSEN--NLTATTKEVAAATNIDRAMLLGAQAL 343 (404) T ss_pred HHHHHhhccccccCCceecCeeEEcC--EEEEecCCceeeecccceeeecCC--ccccccccccccccchhheeecceeE Confidence 6665 4678999999999954 555554433 2223322211111 11111112222223444588898774 Q ss_pred eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_020862. 341 ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLR---------GERIAVAYSVIPE 405 (405) Q Consensus 341 g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~---------~~~marie~~a~~ 405 (405) +. .+ | +.+...+...-+.- | ||.+=-++.++.+|.+.++ |.=...|-++|+= T Consensus 344 ~~-A~-g--~~~g~~~~w~Ee~~--------D-~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 344 AN-AY-G--QKAGGHFNMVEKKT--------D-MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred EE-Ee-e--ccCCCCceeEeecc--------c-cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 32 22 1 11223455544432 2 2333337888888888776 3334556677777 No 35 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.79 E-value=2.4e-09 Score=67.79 Aligned_cols=324 Identities=16% Similarity=0.117 Sum_probs=172.3 Q ss_pred CCccccCcCCCccccccc---c---eeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGP---Q---FNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~---q---m~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |-++.-.+...+....++ . +...-|....|..-+..-++..+-..+++ ..|++++|-|--.... .-.+.| T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~--~~G~sv~i~~ig~~t~--~~~~~g 76 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTKA--AYLKPG 76 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccc--cccceeEeeeccceee--eeeccC Confidence 766665554433333321 1 11122334444433343444444444433 3589999987664322 112222 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) -...+ +-.|+ +..+++-+|-|+=-|..+=|+.-..+.+-|+..+++.+ T Consensus 77 ~~l~~----------~~~~~----------------------~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~ 124 (347) T protein:vir:15 77 ENLDD----------KRKDI----------------------KHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQ 124 (347) T ss_pred CCCCC----------CCCCC----------------------ccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHH Confidence 22211 00011 12233445555544443335444555566788887777 Q ss_pred HHHHHhhHHHHHHHHHHhccCc-------eEEecCCCccce--eeecccc-cccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_020862. 155 MLRGANEITEDLLQADILASAD-------VKVFTGAATSMV--TMTGEAA-DAEDDGLITLKDLKRLSITLTDNYTPKKT 224 (405) Q Consensus 155 ll~~~~~~ted~l~~~ilag~~-------~v~yag~ats~~--~~t~~~~-~~~~n~~it~~~lr~~~~~Lk~nrApk~T 224 (405) ....-+......+.+.+..++. +.-..|..+... +.+++.. .+..+..-=++.|+.+.+.|.++..|. T Consensus 125 ~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~-- 202 (347) T protein:vir:15 125 LGESLAMAADGAVLAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPA-- 202 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCc-- Confidence 7765555444444433332211 111112111111 1111110 000000011677888899999999983 Q ss_pred ceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 225 TIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 225 ~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) ..|++++.|+.-.+|.. ++.|+.. .|+....+.+|.||+|.| |++++++++-...+ T Consensus 203 ---------------~gR~~vv~P~~y~~LL~------~~~~~~~-d~~~~~~~~~G~Vg~i~G--~~V~~Sn~lp~~~~ 258 (347) T protein:vir:15 203 ---------------ADRTFYTTPDNYSAILA------ALMPNAA-NYQALIDHERGTIRNVMG--FEVVEVPHLTAGGA 258 (347) T ss_pred ---------------cCCEEEeCHHHHHHHhc------ccccccc-cccccccccceEEEEEec--eEEEeccccccccc Confidence 24899999999999953 4678876 688878889999999966 99999998853322 Q ss_pred CCCcccC-CCcccccc---cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhH Q lcl|NC_020862. 305 AGATATA-ANRGYQVS---DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFS 380 (405) Q Consensus 305 aGa~~~~-t~~~~~~~---~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~ 380 (405) .+....+ ++..|-.. ..+-...++....|++-++|.|++-++....+ ..-|+--|.-.+ T Consensus 259 t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e-----------------~~~~~~~~~d~i 321 (347) T protein:vir:15 259 GDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALE-----------------RARRANYQADQI 321 (347) T ss_pred ccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeee-----------------ecccchhhhhhh Confidence 1110000 11111110 00112234556789999999999888742111 123555666666 Q ss_pred HHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 381 SIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 381 gwK~~~~~~iL~~~~marie~~a~~ 405 (405) --|..|++.+||++..+.| ..|- T Consensus 322 ~~~~~~G~~vlrP~~av~~--~~~~ 344 (347) T protein:vir:15 322 IAKYAMGHGGLRPEAAGAI--VLPK 344 (347) T ss_pred ehhhhcCCceeccccEEEE--ecCC Confidence 6688899999999997666 4554 No 36 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.78 E-value=2.1e-09 Score=68.20 Aligned_cols=288 Identities=11% Similarity=0.051 Sum_probs=164.7 Q ss_pred CCccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+-.=.++ ..+++++-+.-+ +-.+.++.+....+.-.+.+++...+|+.+.+.++....-.+ .-+.-++| T Consensus 1 m~~~~~~~~~~~~t~~~~~lv-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~--------~a~~v~Eg 71 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTL-HKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGI--------SAYWVNET 71 (297) T ss_pred CCccccccccccccCCCccee-chhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCc--------eeEEeecC Confidence 77653344 333333333333 335566777777777889999999998876655543321111 01111222 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +++. . .-.++..++.+.++++.++.+|++++ -|+..++.+.+..++.+.- T Consensus 72 ~~~~---------------~--------------~~~~f~~v~l~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai 121 (297) T protein:vir:95 72 EKIK---------------T--------------DKPEVVPVTLKAHKLGIILVTSREAL-NYTWKKFFEDMKPQIVEAF 121 (297) T ss_pred cccc---------------c--------------cccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHH Confidence 2221 1 11256678899999999999999854 4666677777776666544 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) +.-.+. .+++|.+...-.|..+... ... ....+.+++++|.++...|..+... . T Consensus 122 ~~~~d~----a~l~G~g~~~~~gi~~~~~----~~~-~~~~~~~t~~~i~~~~~~l~~~~~~-----------------~ 175 (297) T protein:vir:95 122 YKKIDE----AGLLGHDTPFANSVAKAAK----DAN-KVIGGPINYDNILKLQDALYDADVE-----------------P 175 (297) T ss_pred HHHHHH----HHhcccCCccccccccccc----ccc-eecccccCHHHHHHHHHHhhhccCC-----------------c Confidence 433333 3345543322222111111 111 1123468999999999999876542 1 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + +.+|||.....|+.|+|..+.|- +.+..|.+.| +..+.++... . T Consensus 176 ~--~~v~~~~~~~~L~~l~d~~G~~i-------------~~~~~~~l~G--~Pv~~~~~~~--------~---------- 220 (297) T protein:vir:95 176 N--AFVSKIQNRSALREARDGNKVSI-------------YDKAANTIDG--ITTVDLKSAR--------F---------- 220 (297) T ss_pred C--EEEEcHHHHHHHHHhhccCCcee-------------ecCCCCcccc--eeeEeecCCC--------C---------- Confidence 1 35789999999999987555444 3444455544 2333222100 0 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCC----cc--chhhhHHHH--HHHHHhhc Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRND----PY--GKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~D----Pl--gQrg~~gwK--~~~~~~iL 391 (405) ..+ .+++|+-+...++.. +.+.+-+...+.- ....| ++ -|++.+.+| +++.+.++ T Consensus 221 ---~~~------~~~~gd~s~~~~~~~-------~~~~i~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 283 (297) T protein:vir:95 221 ---EKG------DLLAGDFDNLIYGVP-------YNITYKISEEGQI-STITNADGTPINLFEQEMIAIRATMDIAVMIT 283 (297) T ss_pred ---CCc------eEEEEecccEEEEEe-------cCeEEEEeecccc-ccccccCccchhhhhcCcEEEEEEEEeccEee Confidence 001 256787776666664 2244434333211 12222 32 467777777 78899999 Q ss_pred cccceEEEEEecCC Q lcl|NC_020862. 392 RGERIAVAYSVIPE 405 (405) Q Consensus 392 ~~~~marie~~a~~ 405 (405) +++-+++|+.+.|= T Consensus 284 ~~~a~~~l~~at~~ 297 (297) T protein:vir:95 284 KTDAFAKLTPAERV 297 (297) T ss_pred cccceEEEeecCCC Confidence 99999999999999 No 37 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.77 E-value=8.7e-09 Score=64.77 Aligned_cols=322 Identities=16% Similarity=0.142 Sum_probs=169.9 Q ss_pred CCccccCcCCCccc--cccc-----ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDAS--TVGP-----QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~~~y~~~~~t~~~--~v~~-----qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) |.++-.-+.+.+.+ ..+. .+...-|..+.+..-+..-++..+-..|.+= .||+++|-|.-..- .+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~--~g~s~~~~~iG~~~------~~ 72 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRTQ------AA 72 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeec--ccceEEEEeeceeE------EE Confidence 65442222222111 1100 1222345666666666666777777777654 59999998653221 23 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) +.+| |+.+ .++..|+ ...+++-+|-|.=-|..+=|+.-+.+.+-|+..+++. T Consensus 73 ~~~~-G~~l-----~~t~~~~----------------------~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~ 124 (344) T protein:vir:10 73 YLAP-GENL-----DDIRKDI----------------------KHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTS 124 (344) T ss_pred eeec-CCCC-----CCCCCCc----------------------ccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHH Confidence 3333 3222 1111122 1223334444433333222333455566688888888 Q ss_pred HHHHHHhhHHHHHHHHHHhccCce-----EEecCCCccceeeecccccccCCcee-----cHHHHHHHHHHHHhccCccc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADV-----KVFTGAATSMVTMTGEAADAEDDGLI-----TLKDLKRLSITLTDNYTPKK 223 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~-----v~yag~ats~~~~t~~~~~~~~n~~i-----t~~~lr~~~~~Lk~nrApk~ 223 (405) ++...-+......+-+.+..++.. ..-+|.-.+ ..+.........++.. -++.|+++...|.++..|. T Consensus 125 ~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~~~-~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~- 202 (344) T protein:vir:10 125 QLGESLAMAADGAVLAEIAGLCNVESQYNENITGLGTA-TVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPS- 202 (344) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccccccccccc-ceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc- Confidence 887655554443333444222211 111111111 1111111000111112 2567899999999999983 Q ss_pred cceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhh Q lcl|NC_020862. 224 TTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYA 303 (405) Q Consensus 224 T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~ 303 (405) .-|++++.|+.-..|.+ ++.|... .|+....+-+|.||+|.| |++++++++-.- T Consensus 203 ----------------~gR~~vv~P~~y~~Ll~------~~~~~~~-~~~~~~~~~~G~V~~v~G--~~V~~Sn~lp~~- 256 (344) T protein:vir:10 203 ----------------SDRVFYCDPDSYSAILA------ALMPNAA-NYAALIDPEKGSIRNVMG--FEVVEVPHLTAG- 256 (344) T ss_pred ----------------cCCEEEeChHHHHHHhh------ccccccc-ccccccceeeeEEEEEec--eEEEeccccccc- Confidence 24889999999888854 4667665 588888899999999965 999999987531 Q ss_pred cCCCcccC-CCcccccccccCCcceee----eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhh Q lcl|NC_020862. 304 GAGATATA-ANRGYQVSDVAGTDKYDI----APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVG 378 (405) Q Consensus 304 ~aGa~~~~-t~~~~~~~~~~g~~~~DV----Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg 378 (405) ..+....+ ++..+-.....+ ..+.+ -.-|||=++|-+++-++... +. .-.|+--|.- T Consensus 257 ~~~~~~~~~tg~~~~~~~~~~-~~~~~~~s~~~~l~~h~~A~~~v~~~~~~----------~e-------~~r~~~~~~d 318 (344) T protein:vir:10 257 GAGTSREGTTGQKHAFPATKS-GNDKVAKDNVIGLFMHRSAVGTVKLRDLA----------LE-------RARRANFQAD 318 (344) T ss_pred cCCcccccccCccccccCCcc-cceeeecceeEEEeechhhhhhhhhccce----------ee-------cccchhHHHH Confidence 11111111 111111111111 11111 12356666666666554211 11 1235556666 Q ss_pred hHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 379 FSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 379 ~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) ++==|+.|++.+||++..+.||-..- T Consensus 319 ~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 319 QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHhhcccceecccceEEEEeecC Confidence 66678999999999999999998877 No 38 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.76 E-value=1.9e-09 Score=68.34 Aligned_cols=291 Identities=11% Similarity=0.028 Sum_probs=153.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.......+++-+.-+-+-.|.+..+....+...+.+++.+.+|+-+.|+... .++..-.... ...||-+.. T Consensus 113 ~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~v~E~~~~~-- 188 (415) T protein:vir:79 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEVAALE-KVEELEENP-- 188 (415) T ss_pred HhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCCccce-eeccccccC-- Confidence 000000111111122122233335566655555667888999999999988876333 2333332211 122332111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .....++..|+.++++++.++.+|++++ -|+..++.+.+..++.+..+ T Consensus 189 -------------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~ 236 (415) T protein:vir:79 189 -------------------------------ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIA 236 (415) T ss_pred -------------------------------cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHH Confidence 0001246677899999999999999854 56666777777777665444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .. ++ ..+++|.+.-.-.+...... .........+.+++++|.++...|.....+ .+ T Consensus 237 ~~-~~---~~il~g~g~g~~~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----------------~~ 292 (415) T protein:vir:79 237 AT-RN---KAIIDVITKGSTGSTSSGFE---KEGKKLEVKKAKSLDDIKDAINLNVKPNYE-----------------HN 292 (415) T ss_pred HH-HH---HHHhhccccCcccccccccc---ccccccccccccchhHHHHHHHhhhhhccC-----------------CC Confidence 33 22 33444432210000000000 001111234568899999988887654332 11 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) ..+||+.+...|+.|+|-.++|-|.|- +..+-.++|-| +.+++++.+. ..++| T Consensus 293 --~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G--~pV~~~~~~~-~~~~~-------------- 345 (415) T protein:vir:79 293 --VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLG--AKIEILPDEV-LGQKG-------------- 345 (415) T ss_pred --EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecc--eeeEEecccc-cCCCC-------------- Confidence 247899999999999998888888763 22444567766 4555555432 11111 Q ss_pred ccCCcceeeeEEEEEc--cccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVG--DQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G--~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) | . .++|| +++|-...-++ +++-+.+ +-..++++.+. +++.+.+++++=+++ T Consensus 346 -------~-~-~~~~Gd~~~~~~~~~~~~--------~~v~~~~---------~~~~~~~~~~~-~r~d~~v~~~~a~~~ 398 (415) T protein:vir:79 346 -------N-N-TLIIGNLKDAIVLFDRSQ--------YQASWTD---------YMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred -------c-c-EEEEEehhccEEEEeecc--------eEEEEec---------cccCceEEEEE-EEeccEEeccccEEE Confidence 1 1 26778 34443222221 2332221 11223333332 477888899999988 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:79 399 IEYDDSE 405 (415) T ss_pred EEEeccC Confidence 8765554 No 39 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.76 E-value=1.9e-09 Score=68.34 Aligned_cols=291 Identities=11% Similarity=0.028 Sum_probs=153.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.......+++-+.-+-+-.|.+..+....+...+.+++.+.+|+-+.|+... .++..-.... ...||-+.. T Consensus 113 ~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~v~E~~~~~-- 188 (415) T protein:vir:98 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEVAALE-KVEELEENP-- 188 (415) T ss_pred HhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCCccce-eeccccccC-- Confidence 000000111111122122233335566655555667888999999999988876333 2333332211 122332111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .....++..|+.++++++.++.+|++++ -|+..++.+.+..++.+..+ T Consensus 189 -------------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~ 236 (415) T protein:vir:98 189 -------------------------------ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIA 236 (415) T ss_pred -------------------------------cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHH Confidence 0001246677899999999999999854 56666777777777665444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .. ++ ..+++|.+.-.-.+...... .........+.+++++|.++...|.....+ .+ T Consensus 237 ~~-~~---~~il~g~g~g~~~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----------------~~ 292 (415) T protein:vir:98 237 AT-RN---KAIIDVITKGSTGSTSSGFE---KEGKKLEVKKAKSLDDIKDAINLNVKPNYE-----------------HN 292 (415) T ss_pred HH-HH---HHHhhccccCcccccccccc---ccccccccccccchhHHHHHHHhhhhhccC-----------------CC Confidence 33 22 33444432210000000000 001111234568899999988887654332 11 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) ..+||+.+...|+.|+|-.++|-|.|- +..+-.++|-| +.+++++.+. ..++| T Consensus 293 --~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G--~pV~~~~~~~-~~~~~-------------- 345 (415) T protein:vir:98 293 --VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLG--AKIEILPDEV-LGQKG-------------- 345 (415) T ss_pred --EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecc--eeeEEecccc-cCCCC-------------- Confidence 247899999999999998888888763 22444567766 4555555432 11111 Q ss_pred ccCCcceeeeEEEEEc--cccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVG--DQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G--~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) | . .++|| +++|-...-++ +++-+.+ +-..++++.+. +++.+.+++++=+++ T Consensus 346 -------~-~-~~~~Gd~~~~~~~~~~~~--------~~v~~~~---------~~~~~~~~~~~-~r~d~~v~~~~a~~~ 398 (415) T protein:vir:98 346 -------N-N-TLIIGNLKDAIVLFDRSQ--------YQASWTD---------YMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred -------c-c-EEEEEehhccEEEEeecc--------eEEEEec---------cccCceEEEEE-EEeccEEeccccEEE Confidence 1 1 26778 34443222221 2332221 11223333332 477888899999988 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:98 399 IEYDDSE 405 (415) T ss_pred EEEeccC Confidence 8765554 No 40 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.76 E-value=1.9e-09 Score=68.34 Aligned_cols=291 Identities=11% Similarity=0.028 Sum_probs=153.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.......+++-+.-+-+-.|.+..+....+...+.+++.+.+|+-+.|+... .++..-.... ...||-+.. T Consensus 113 ~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~v~E~~~~~-- 188 (415) T protein:vir:81 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VRQSEVAALE-KVEELEENP-- 188 (415) T ss_pred HhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-EeecCCccce-eeccccccC-- Confidence 000000111111122122233335566655555667888999999999988876333 2333332211 122332111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .....++..|+.++++++.++.+|++++ -|+..++.+.+..++.+..+ T Consensus 189 -------------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~ 236 (415) T protein:vir:81 189 -------------------------------ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIA 236 (415) T ss_pred -------------------------------cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHH Confidence 0001246677899999999999999854 56666777777777665444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .. ++ ..+++|.+.-.-.+...... .........+.+++++|.++...|.....+ .+ T Consensus 237 ~~-~~---~~il~g~g~g~~~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----------------~~ 292 (415) T protein:vir:81 237 AT-RN---KAIIDVITKGSTGSTSSGFE---KEGKKLEVKKAKSLDDIKDAINLNVKPNYE-----------------HN 292 (415) T ss_pred HH-HH---HHHhhccccCcccccccccc---ccccccccccccchhHHHHHHHhhhhhccC-----------------CC Confidence 33 22 33444432210000000000 001111234568899999988887654332 11 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) ..+||+.+...|+.|+|-.++|-|.|- +..+-.++|-| +.+++++.+. ..++| T Consensus 293 --~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G--~pV~~~~~~~-~~~~~-------------- 345 (415) T protein:vir:81 293 --VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLG--AKIEILPDEV-LGQKG-------------- 345 (415) T ss_pred --EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecc--eeeEEecccc-cCCCC-------------- Confidence 247899999999999998888888763 22444567766 4555555432 11111 Q ss_pred ccCCcceeeeEEEEEc--cccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVG--DQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G--~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) | . .++|| +++|-...-++ +++-+.+ +-..++++.+. +++.+.+++++=+++ T Consensus 346 -------~-~-~~~~Gd~~~~~~~~~~~~--------~~v~~~~---------~~~~~~~~~~~-~r~d~~v~~~~a~~~ 398 (415) T protein:vir:81 346 -------N-N-TLIIGNLKDAIVLFDRSQ--------YQASWTD---------YMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred -------c-c-EEEEEehhccEEEEeecc--------eEEEEec---------cccCceEEEEE-EEeccEEeccccEEE Confidence 1 1 26778 34443222221 2332221 11223333332 477888899999988 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:81 399 IEYDDSE 405 (415) T ss_pred EEEeccC Confidence 8765554 No 41 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.73 E-value=7.3e-09 Score=65.21 Aligned_cols=322 Identities=13% Similarity=0.112 Sum_probs=181.6 Q ss_pred CCccccCcCCCcccccc-----c-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVG-----P-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~-----~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |-++---....+....+ + .+...-|..+.+..-+..-+|..+-..+.+ ..||+++|.|.-..-.. -.+.| T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti--~~G~sv~~~~iG~~~~~--~~~~G 76 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSI--QSGKSAQFPVLGRTKAA--YLQPG 76 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheec--cccceEEeeeccceeEe--eeecC Confidence 44222111111111111 1 233445677777777777788888888877 35999999976654331 11222 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) =++.+ .+|.+ +..+.+-+|-|+--+..+=|+.-+.+.+-|+..+++.+ T Consensus 77 ~~l~~------------------------------~~~~~--~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~ 124 (347) T protein:vir:94 77 ENLDD------------------------------KRKDM--KHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQ 124 (347) T ss_pred cCCCC------------------------------CcCCc--cccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHH Confidence 22211 01111 12234455566544443333444555666788888888 Q ss_pred HHHHHhhHHHHHHHHHHhccCceE-----EecC-CCccceeeeccc---ccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_020862. 155 MLRGANEITEDLLQADILASADVK-----VFTG-AATSMVTMTGEA---ADAEDDGLITLKDLKRLSITLTDNYTPKKTT 225 (405) Q Consensus 155 ll~~~~~~ted~l~~~ilag~~~v-----~yag-~ats~~~~t~~~---~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ 225 (405) +...-+......+.+.+..++... -..| ...+.+.+.... ..+..+...-++.|+++...|+++..|. T Consensus 125 ~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~--- 201 (347) T protein:vir:94 125 LGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPS--- 201 (347) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCC--- Confidence 877666665555555554333211 1111 111112221111 1111112233678999999999999983 Q ss_pred eeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 226 IIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 226 ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) .-+++++.|..-..|....+ +. ...|..-..+-+|.||++.| |++++++++-.|.+. T Consensus 202 --------------~~R~~vv~P~~y~~LLk~~~------~~-~~~~~~~~~~~~G~V~~v~G--~~V~~Sn~~p~~~~~ 258 (347) T protein:vir:94 202 --------------SDRVFYTTPDNYSAILAALM------PN-AANYQALIDPSTGSIRNVMG--FEVIEVPHLTAGGAG 258 (347) T ss_pred --------------CCCEEEeChHHHHHHHHhhc------cc-ccccccccccccceeEEeec--eEEEEcCccccccCc Confidence 24899999999888864321 33 23666666677899999976 899999999776542 Q ss_pred CCcccCCCcccccccc------cCCcceee----eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccc Q lcl|NC_020862. 306 GATATAANRGYQVSDV------AGTDKYDI----APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYG 375 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~------~g~~~~DV----Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlg 375 (405) ... .+.++....+ ...++|++ ---||+-.+|-+++-++.+..+ ..-|+-- T Consensus 259 ~~~---~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e-----------------~~~~~~~ 318 (347) T protein:vir:94 259 DNR---AEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALE-----------------RARRANF 318 (347) T ss_pred ccc---cccccccccccccccccccccccccccceEEEEechhhhhhhhhccccee-----------------eeechhh Confidence 111 1111111111 11222221 1257888888888777532211 1257777 Q ss_pred hhhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 376 KVGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 376 Qrg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) |..+.=-|..|++.+||+|.-+.|+.-+- T Consensus 319 ~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 319 QADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhcCcccccceeEEEEecCC Confidence 88888889999999999999998887666 No 42 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.73 E-value=2e-08 Score=62.78 Aligned_cols=324 Identities=17% Similarity=0.159 Sum_probs=167.5 Q ss_pred CCccccCcCCCcccccc---c----ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVG---P----QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~---~----qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) |..+-.--...+.+..+ + .+...-|..+.+..=+..-++..+=..|.+= .||+++|-|.-..- .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~--~gks~~~~~iG~~~------~~ 72 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRTQ------AA 72 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeecc--ccceEEEeeecceE------EE Confidence 55544321111111111 1 1222234555555545544555555556543 58999998653321 22 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) +.+| |+++-. +-.|+ ...+..-+|-|.=-+..+=|+.-+.+.+-|+..+++. T Consensus 73 ~~~~-G~~l~~-----~~~~~----------------------~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~ 124 (345) T protein:vir:22 73 YLAP-GENLDD-----KRKDI----------------------KHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTS 124 (345) T ss_pred eeec-CCCCCC-----CCCCc----------------------ccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHH Confidence 2222 222210 00011 1112223333333333222333455566678888888 Q ss_pred HHHHHHhhHHHHHHHHHHhccCceE----Eec-CCCc-cc--eeeecccccccC-CceecHHHHHHHHHHHHhccCcccc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADVK----VFT-GAAT-SM--VTMTGEAADAED-DGLITLKDLKRLSITLTDNYTPKKT 224 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~v----~ya-g~at-s~--~~~t~~~~~~~~-n~~it~~~lr~~~~~Lk~nrApk~T 224 (405) ++...-+......+-+.|..++... -+. |..+ .. ++.++...++.+ ...--++.|+.+...|.++..|. T Consensus 125 ~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~-- 202 (345) T protein:vir:22 125 QLGESLAMAADGAVLAEIAGLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPA-- 202 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCc-- Confidence 8876555544444444443222110 011 1001 11 111111111110 01123688899999999999984 Q ss_pred ceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 225 TIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 225 ~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) .-|+++|.|+.-..|.+ ++.|... .|+.....-+|.||+|.| |++++++.+-. .. T Consensus 203 ---------------~~R~~vv~P~~y~~Ll~------~~~~~~~-~~~~~~~~~~G~V~~i~G--~~V~~sn~lp~-~~ 257 (345) T protein:vir:22 203 ---------------ADRVFYCDPDSYSAILA------ALMPNAA-NYAALIDPEKGSIRNVMG--FEVVEVPHLTA-GG 257 (345) T ss_pred ---------------cCCEEEeChHHHHHHhc------ccccccc-ccccccccccceEEEEec--eEEEecccccc-cc Confidence 24899999999998864 4667765 588888888999999966 89999987642 21 Q ss_pred CCCcccCCCccccc-ccccCCcce----eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhh Q lcl|NC_020862. 305 AGATATAANRGYQV-SDVAGTDKY----DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGF 379 (405) Q Consensus 305 aGa~~~~t~~~~~~-~~~~g~~~~----DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~ 379 (405) .|-...++-..... .+.++...+ +==..|+|-++|-+++.++... +. ...|+--|.-+ T Consensus 258 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~----------~e-------~~r~~~~~~d~ 320 (345) T protein:vir:22 258 AGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLA----------LE-------RARRANFQADQ 320 (345) T ss_pred cCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecce----------ee-------eeechhHHHHH Confidence 11111111111111 111111100 0013577778887777665311 11 22466677777 Q ss_pred HHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 380 SSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 380 ~gwK~~~~~~iL~~~~marie~~a~ 404 (405) +==|..|++.+||++..+.|+.-.- T Consensus 321 I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 321 IIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHhcCCcccccceeEEEEEeeC Confidence 7778999999999999999887776 No 43 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.73 E-value=1.1e-09 Score=69.71 Aligned_cols=290 Identities=15% Similarity=0.090 Sum_probs=156.7 Q ss_pred CCccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) +...-... ..++.++ +.-.-+.-|+++.+....+..++.+++...++... .+++.+.. .... .+..+.| T Consensus 98 ~~~~e~~a~~~~t~~~-gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~---~~~~---a~~v~E~ 167 (407) T protein:vir:48 98 LRELERKALQVGNDED-GGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS---DYKKLVNL---GGTT---SGWVGET 167 (407) T ss_pred hhHHHHHhhhcccCCC-CcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC---ceEEEEec---CCcc---eeeeccc Confidence 11111110 1111111 11112234566666666667788888887776543 23322111 1111 1112222 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) ..... .+. .++..|+-++++++.|+.+|+++ +-|+.-++.+.+..++.+.. T Consensus 168 ~~~~~--------------~~~--------------~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~i 218 (407) T protein:vir:48 168 DARPE--------------TAT--------------SKLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEDWINSELALEF 218 (407) T ss_pred ccccc--------------ccc--------------ccceeEEeeeeeeEeehhhHHHH-HhcchHHHHHHHHHHHHHHH Confidence 11100 011 24566788999999999999995 44666677777777666544 Q ss_pred hhHHHHHHHHHHhccCceEE------ecCCCccceeee---cccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKV------FTGAATSMVTMT---GEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGS 230 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~------yag~ats~~~~t---~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs 230 (405) + ..++. .+++|.+... +........... .........+.+++++|.++...|+..... T Consensus 219 ~-~~~~~---a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~--------- 285 (407) T protein:vir:48 219 A-EQEEI---AFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRS--------- 285 (407) T ss_pred H-HHHHh---hhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhc--------- Confidence 3 33433 3566654421 111110000000 000111224568999999999988765432 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT 310 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~ 310 (405) .++| ++|+....-|+.|+|.-+.|-|.|--+.|. .++|-| ...+.++.|- -.++| T Consensus 286 --------~a~~--v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~--------~~~l~G--~PV~~~~~~p-~~~~~---- 340 (407) T protein:vir:48 286 --------GAKF--MMNNSSLFAIRLLKDNDGNYLWRPGIELGQ--------PSSLAG--YGIVENEQMP-DIAAD---- 340 (407) T ss_pred --------CCEE--EEcHHHHHHHHHhhccCCceeeccCcCCCC--------Cceecc--eeeEEecCcC-CccCC---- Confidence 1233 689999999999999989999987534343 345544 4556665542 11111 Q ss_pred CCCcccccccccCCcceeeeEEEEEccc--cceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH--HH Q lcl|NC_020862. 311 AANRGYQVSDVAGTDKYDIAPLLVVGDQ--AFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF--FY 386 (405) Q Consensus 311 ~t~~~~~~~~~~g~~~~DVYp~lV~G~~--Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~ 386 (405) -. .++||+= +|-...-.| +++ . .|||.+++.++|++ .+ T Consensus 341 ------------------~~-~i~~Gd~~~~~~i~~~~~--------~~i--~---------~d~~~~~~~~~~~~~~r~ 382 (407) T protein:vir:48 341 ------------------AK-AIAFGNFKRGYTIVDRIG--------TRI--L---------RDPYTNKPFVGFYTTKRT 382 (407) T ss_pred ------------------cc-EEEEEeccccEEEEEeec--------eEE--E---------eeccccCCcEEEEEEEEe Confidence 11 2456753 233222221 232 1 47888889999996 58 Q ss_pred HHhhccccceEEEEEecCC Q lcl|NC_020862. 387 GFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 387 ~~~iL~~~~marie~~a~~ 405 (405) .+.+++++-++.+++++.= T Consensus 383 d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 383 GGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred ccEEecccceEEEEeeccC Confidence 9999999999999887665 No 44 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.72 E-value=8.4e-09 Score=64.86 Aligned_cols=316 Identities=11% Similarity=0.034 Sum_probs=161.2 Q ss_pred ccccCcCCCccccc--cc-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 3 HIYNDPAAGDASTV--GP-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 3 ~~y~~~~~t~~~~v--~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |-|.|+...--.+- ++ .+...-|..+.+..-+..-++..+=.++.+ ..||+.+|-|--..-. .-.+.|-.+.| T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti--~~gkS~q~~~iG~~~~--~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEV--VGTNSVSNKYIGETEL--QVLSPGKSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEeeeeeeeEE--eeeccCcccCC Confidence 44445422211111 11 122222344444444444455555556654 3788888876432211 11222222233 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccc-hHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSD-LYGHLSREMLRG 158 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~-l~~~~~~ell~~ 158 (405) +.+.+. | ..|+++=..|-.+ +-++..+...|=+ +-.+++.|++.. T Consensus 77 ~~~~~~--------------k------------------~~itID~ll~a~~--~V~diDe~q~~~D~vR~e~s~e~G~A 122 (364) T protein:vir:10 77 SPTEFD--------------K------------------NRLVVDTTVIARN--TVAHFHDVQNDIDGLKSKLSVNQAKK 122 (364) T ss_pred CCcccC--------------c------------------EEEEecceeeech--hhhhHHHHhcCccchhHHHHHHHHHH Confidence 222111 0 1122222222211 1222233444444 456777777765 Q ss_pred HhhHHHHHHHHHHhcc--CceEEec--C---CCccceeeecccccccCCceecH----HHHHHHHHHHHhccCcccccee Q lcl|NC_020862. 159 ANEITEDLLQADILAS--ADVKVFT--G---AATSMVTMTGEAADAEDDGLITL----KDLKRLSITLTDNYTPKKTTII 227 (405) Q Consensus 159 ~~~~ted~l~~~ilag--~~~v~ya--g---~ats~~~~t~~~~~~~~n~~it~----~~lr~~~~~Lk~nrApk~T~ii 227 (405) -+......+.+.++++ ++..-+. + .+....++....+ ...++. +-++.+...|+++..|. T Consensus 123 LA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~i~~~~~a~----~~~~~~~~l~~ai~~a~~~LdEkdVP~----- 193 (364) T protein:vir:10 123 LKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFSIHIVGLAS----SFLTSPQYMMAAIEMAMEQQTEQEVDT----- 193 (364) T ss_pred HHHHHHHHHHHHHHhhhhhcccccccCCcccCCcceeeecccCc----chhhhHHHHHHHHHHHHHHHhhcCCCc----- Confidence 5544433444445443 2222221 0 0011111111111 123333 34456788899999883 Q ss_pred ccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcC--CcccccCcceeEecCCcEEEEeCcchhhhh-c Q lcl|NC_020862. 228 KGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYA--DAATIMNGEIGAIPGAHLRIVVVPQMMHYA-G 304 (405) Q Consensus 228 ~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya--~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~-~ 304 (405) .-+++++.|..-..|.+ ++.|+++ +|+ ......+|+|+++.| |+++.++++ |+. + T Consensus 194 ------------~~R~~vv~P~~y~~Ll~------~~~lvn~-d~~~~~~~~~~~G~v~~v~G--v~Vv~Sn~l-P~~~~ 251 (364) T protein:vir:10 194 ------------SELCGLMPWTAFNCLRD------ADRIVDK-SYTIAASDNTVDGFVLKSWN--TPIVPSNRF-PKLSD 251 (364) T ss_pred ------------cccEEEeChHHHHHHhc------CCccccc-cccccCCCccccceeEEEec--eEEEecccc-ccccc Confidence 24899999998888865 5678887 554 445578999999965 899999988 553 1 Q ss_pred CCCcccCCCcccccccccCCccee------eeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhh Q lcl|NC_020862. 305 AGATATAANRGYQVSDVAGTDKYD------IAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVG 378 (405) Q Consensus 305 aGa~~~~t~~~~~~~~~~g~~~~D------VYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg 378 (405) -+. +.+......++..-.+++|+ -.-.++|=++|-+++-++....+ ...|+--|.. T Consensus 252 ~~~-~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e-----------------~~~~~~~~~~ 313 (364) T protein:vir:10 252 NTE-GTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGD-----------------IFYEKKEKTW 313 (364) T ss_pred ccc-ccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceee-----------------eeeccceeee Confidence 110 00000111111111134444 35578888899888887653321 1123333444 Q ss_pred hHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 379 FSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 379 ~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) +.=-|..|++.+||++.-+.|.++++- T Consensus 314 ~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 314 YIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred eeeeehcccCcccCccceEEEEecCCC Confidence 444588899999999999999998887 No 45 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.69 E-value=5.3e-09 Score=65.95 Aligned_cols=260 Identities=12% Similarity=0.083 Sum_probs=142.0 Q ss_pred CCcc-ccCcCCCc-------ccccccceeehhhhhHHHHHhhhhhhhhccccc---------cccCcCCCCEEEEEeccc Q lcl|NC_020862. 1 MPHI-YNDPAAGD-------ASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN---------KQMPKHFGKELKVFYYVP 63 (405) Q Consensus 1 ~~~~-y~~~~~t~-------~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~---------~~mPKn~GktIkfrry~p 63 (405) |+-+ |.+|.+.. ...-.|.++ .|.+|+-.++++...+..|-+. .++-|+.|.+|.|.=-.+ T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk--~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~ 78 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVN--ILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHH--HHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeec Confidence 5543 44443221 111123333 6899887777776566565543 357799999999986666 Q ss_pred CCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhh Q lcl|NC_020862. 64 LLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDT 143 (405) Q Consensus 64 l~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~ 143 (405) |.-... ..+-+.+|. -..+.+.+-++.|.|...=+-... -+.+. T Consensus 79 L~g~gv--~Gd~~lEGn--------------------------------ee~L~~~~d~l~IDq~r~~V~~gg--~msqq 122 (318) T protein:vir:27 79 LSKRPT--MGDERVEGR--------------------------------GEDLSHADFSLKINQGRHLVDAGG--RMSQQ 122 (318) T ss_pred cccCcc--ccCceeecc--------------------------------ccceEEEeeEEEEeeecccccccc--chhhh Confidence 644322 222222221 111233333344444443332222 11111 Q ss_pred --ccchHHHHHHHHHHHHhhHHHHHHHHHHhccCc--------------------------------eEEecCCCcccee Q lcl|NC_020862. 144 --DSDLYGHLSREMLRGANEITEDLLQADILASAD--------------------------------VKVFTGAATSMVT 189 (405) Q Consensus 144 --d~~l~~~~~~ell~~~~~~ted~l~~~ilag~~--------------------------------~v~yag~ats~~~ 189 (405) -=+|-++ .+..|..-..-..|++.-..|+|+. .++++|.+|+... T Consensus 123 Rt~~dlR~~-ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~ 201 (318) T protein:vir:27 123 RTKFNLASS-ARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ 201 (318) T ss_pred hhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhh Confidence 1123332 4444544444555555544454433 3666677777665 Q ss_pred eecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceeh Q lcl|NC_020862. 190 MTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPV 269 (405) Q Consensus 190 ~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v 269 (405) ++. .+.++++.|.++...++...-|.+-=.+.|-.+-+ ..+-||+|+||....|||. ....+.|... T Consensus 202 l~s-------tD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~---~~~~yV~~~~p~q~~~Lrt---dt~~~~w~d~ 268 (318) T protein:vir:27 202 IEA-------ADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG---EDPYYVLYVTPRQWNDWYT---STSGKDWNQM 268 (318) T ss_pred hhh-------cccccHHHHHHHHHHHHHhCCCCcceeeccccccC---CcceEEEEechHHHHHHhh---cCCCHHHHHH Confidence 553 37899999999999998876664323333333222 2345999999999999974 1122347777 Q ss_pred hhc------CCcccccCcceeEecCCcEEEEeCcch-hhhhcCCCcccCCCcccccc Q lcl|NC_020862. 270 EKY------ADAATIMNGEIGAIPGAHLRIVVVPQM-MHYAGAGATATAANRGYQVS 319 (405) Q Consensus 270 ~~Y------a~~~~i~~gEIGsi~g~n~Rfv~~p~~-~~~~~aGa~~~~t~~~~~~~ 319 (405) .++ |+.-|||.||+|.+ .|+-+.+-|.+ -.| -+|..+...+ .+ T Consensus 269 q~~A~~r~~g~knPLF~G~~gm~--ngvil~~~~~vpIrf-~~G~~v~~~~----~~ 318 (318) T protein:vir:27 269 MVRAVNRAKGFNHPLFKGECAMW--RNILVRKYAGMPIRF-YQGQRFWYQR----IT 318 (318) T ss_pred HHHHHhcccccCCCceecceeee--cCEEEeecCCccEEE-cCCCeeeeee----cC Confidence 665 35678999999999 55677777765 222 2554332211 11 No 46 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.68 E-value=3.7e-09 Score=66.81 Aligned_cols=291 Identities=11% Similarity=0.041 Sum_probs=153.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.....+++++-+.-+.+-.+..+.+....+...+.+++...+|+-+.++.. +.++..-+.. ....||-+.+ T Consensus 113 ~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~v~Eg~~~~-- 188 (415) T protein:vir:94 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYP-VVRQSEVAAL-EKVEELEENP-- 188 (415) T ss_pred hhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEE-EEeecCCccc-eecccccccc-- Confidence 00000111111111112222333445555555567789999999999997766532 2233332221 1223332211 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) . . ...++..|+.++++++.++.+|+++ +-|++.++.+.+..++.+..+ T Consensus 189 ---~-------------------~---------~~~~~~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~~ 236 (415) T protein:vir:94 189 ---E-------------------L---------AVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELKLWMARTIA 236 (415) T ss_pred ---c-------------------c---------ccccceeeEeeheeeeeechhhHHH-HhhchHHHHHHHHHHHHHHHH Confidence 0 0 0124667789999999999999984 446666777777777665443 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .. ++ ..+++|.+.-.-.+...... .........+..++++|.++...|....... T Consensus 237 ~~-~~---~~il~g~g~g~~~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------------ 291 (415) T protein:vir:94 237 AT-RN---KAIIDVITKGSTGSTSSGFE---KEGKKLEVKKAKSLDDIKDAINLNVKPNYEH------------------ 291 (415) T ss_pred HH-HH---HHHhhccccCcccccccccc---ccccccccccccchHHHHHHHHhhhhhccCC------------------ Confidence 32 22 33344322110011000000 0011122235688999999988776544321 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + ..+|||.+...|+.|+|-.++|-|.|- +.++-.+.|-| +.+++++.|. ..++ T Consensus 292 ~-~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G--~pV~~~~~~~-~~~~--------------- 344 (415) T protein:vir:94 292 N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLG--AKIEILPDEV-LGQK--------------- 344 (415) T ss_pred C-EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCceecc--eeeEEecccc-cCCC--------------- Confidence 2 347899999999999998888888753 23444567766 4556666442 1111 Q ss_pred ccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) + | .+ +++|. ++|-...-++ +++-+.+ +-..|+++.++ .++.+.+++++-+++ T Consensus 345 ----~--~-~~-i~~gd~~~~~~~~~~~~--------~~v~~~~---------~~~~~~~~r~~-~r~d~~~~~~~a~~~ 398 (415) T protein:vir:94 345 ----G--N-NT-LIIGNLKDAIVLFDRSQ--------YQASWTD---------YMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred ----C--c-cE-EEEEehhccEEEEeecc--------eEEEEec---------cccCceEEEEE-EEeccEEeccccEEE Confidence 0 1 12 56773 4443222111 2322221 22344444443 467888999999999 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++...+- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:94 399 IEYDDSE 405 (415) T ss_pred EEEeccC Confidence 9765544 No 47 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.68 E-value=1.1e-09 Score=69.66 Aligned_cols=287 Identities=18% Similarity=0.142 Sum_probs=158.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) .-...+.-..++.+-+-|+ -|..+.+....+.-++.+++...+++.+.++-.+. ....+. +....|+ T Consensus 126 ~~~al~~~t~~~gG~lvP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~------~~~~~a---~wv~E~~ 192 (425) T protein:vir:10 126 VQAALNKGEDSEGGYLTPI----EWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFN------MGGTTS---GWVGEAS 192 (425) T ss_pred hHHHhhcCcCCCCceeccH----hHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE------cCCcce---eeecccc Confidence 0000111111111113332 34455555566677888999999888665433221 111111 1111221 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) +... .+. .++..|+-+.++++.++.+|++++ -|+..++.+.+..++.+.- T Consensus 193 ~~~~--------------~~~--------------~~f~~v~~~~~k~~~~i~iS~ell-~ds~~~l~~~i~~~la~ai- 242 (425) T protein:vir:10 193 QRPQ--------------TNA--------------ATFQPLSFASGEIYANPAATQQIL-DDAEIDLESWLATEVQTEF- 242 (425) T ss_pred cccc--------------ccc--------------cccceeeeeheeeEeehHhHHHHH-hcchhHHHHHHHHHHHHHH- Confidence 1100 011 245667889999999999999854 4665677787766666433 Q ss_pred hHHHHHHHHHHhccCceEEecC------CCccceee---ecccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTG------AATSMVTM---TGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag------~ats~~~~---t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) ...+|. .+++|.++..=.| ..+..... .-........+.+++++|.++...|+...... T Consensus 243 ~~~~d~---~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~--------- 310 (425) T protein:vir:10 243 AKQEGK---AFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGN--------- 310 (425) T ss_pred HHHHHh---hhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccC--------- Confidence 333433 4667654322111 11100000 00001112235689999999988887554321 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) -+-++|+.+..-|+.|+|.-++|-|.|-.+ .|.-++|-| ..+++++.| +..++| T Consensus 311 ----------a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~--------~g~~~~l~G--~PV~~~~~~-p~~~~~----- 364 (425) T protein:vir:10 311 ----------ARFAMNRNTQRQVRKLKDGQGNYLWQPSYV--------AGQPATLAG--YPVTEVPDM-PDVAAN----- 364 (425) T ss_pred ----------CEEEEchHHHHHHHHhhcCCCceeeccCcc--------CCCCceecc--eeeEEecCc-CCccCC----- Confidence 134799999999999999999999986433 334455644 455666544 322211 Q ss_pred CCcccccccccCCcceeeeEEEEEcccc--ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH--HHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQA--FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF--FYG 387 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~A--fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~ 387 (405) ..+ ++||+-+ |-...-+ .+++ ..|||.+.+.++|++ ++. T Consensus 365 -----------------~~~-i~~Gd~~~~~~i~~~~--------~~~v-----------~~d~~~~~~~~~~~~~~r~d 407 (425) T protein:vir:10 365 -----------------STP-ILFGDFQQTYLIIDRI--------GVRV-----------LRDPYTAKPYVLFYTTKRVG 407 (425) T ss_pred -----------------ccE-EEEEehhccEEEEEec--------ceEE-----------EecccccCCcEEEEEEEEec Confidence 122 3557543 3222111 1222 257788888888884 589 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.+++++-++.+++.|.| T Consensus 408 ~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 408 GGLLNPEPMRAMKVAASE 425 (425) T ss_pred cEeecccceEEEEeeccC Confidence 999999999999999999 No 48 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.66 E-value=6.5e-09 Score=65.45 Aligned_cols=285 Identities=13% Similarity=0.064 Sum_probs=160.6 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCccc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLY 88 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly 88 (405) =+++.+.+-|+ .+..+.+....+.-++.+++...+|+.+. +++-+...-+.+ +.-+.|+++ T Consensus 1 ma~~gG~lip~----~~~~~ii~~~~~~s~i~~~~~~~~~~~~~---~~~p~~~~~~~a------~~v~Eg~~~------ 61 (298) T protein:vir:94 1 MVLNKGTLFDP----ELVTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTMDSEI------DVVAESGKK------ 61 (298) T ss_pred CeeccccccCh----hHHHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEecCcce------EEeeCCccc------ Confidence 13334444443 22455555666677889999998887753 344333221111 112222222 Q ss_pred ccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc--cchHHHHHHHHHHHHhhHHHHH Q lcl|NC_020862. 89 GGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD--SDLYGHLSREMLRGANEITEDL 166 (405) Q Consensus 89 ~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d--~~l~~~~~~ell~~~~~~ted~ 166 (405) .....++..++.+.++++.++.+|++++....| .++++.+..++.+..+.-.+. T Consensus 62 -----------------------~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~- 117 (298) T protein:vir:94 62 -----------------------THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDL- 117 (298) T ss_pred -----------------------cccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHH- Confidence 112235677889999999999999997654443 346777666666544332222 Q ss_pred HHHHHhccC----c----eEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 167 LQADILASA----D----VKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 167 l~~~ilag~----~----~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) .+++|. + .+-..+......+... ......-.+++|.++...|..++.+. T Consensus 118 ---~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~---------------- 174 (298) T protein:vir:94 118 ---MAFHGVNPRLGTASAVIGTNHFDSKVTQKVE----APRGIADPNGAIENAVELLTGVDADV---------------- 174 (298) T ss_pred ---HhhcccccCCCcccccccccccccccccccc----cccccccHHHHHHHHHHhhhhcCCCc---------------- Confidence 233331 1 0111110100000000 00112345678999988888766541 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) -..++||.....|+.|+|.-+.|-|.+.- ..+.-|+|-| +.++.++.+. .++ T Consensus 175 ---~~~vmn~~~~~~l~~lkd~~G~~l~~~~~--------~~~~~~tl~G--~PV~~~~~v~----~~~----------- 226 (298) T protein:vir:94 175 ---TGIAINPSFRSALAKQKDLQGNALFPELK--------WGATPDTING--LPVDVNKTVS----DMS----------- 226 (298) T ss_pred ---cEEEEcHHHHHHHHHhhccCCCeeecCcc--------cCCCCceecc--eeeEEecccc----ccc----------- Confidence 25789999999999999988888887642 2444567766 4666665443 111 Q ss_pred ccccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) +.++ + .+++|+-+-+ .+++. +.+++-+..-+.. .++...|-|++.+.|+ +++++.+++++- T Consensus 227 ----~~~~-~---~~~~Gdfs~~~~~~~~-------~~~~~~~~~~~~~-d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a 290 (298) T protein:vir:94 227 ----LTQR-D---RAIIGDFANGFKWGYA-------KEVPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGWGILDATK 290 (298) T ss_pred ----CCCc-c---EEEEeeccceEEEEEe-------cCceEEEeecCCC-cCcchhhhhcCcEEEEEEEEeccEeecccc Confidence 1122 1 3677875543 34443 2245545543210 1122346788888887 588999999999 Q ss_pred eEEEEEec Q lcl|NC_020862. 396 IAVAYSVI 403 (405) Q Consensus 396 marie~~a 403 (405) +++|+-+- T Consensus 291 ~~~l~~~t 298 (298) T protein:vir:94 291 FARVTEAN 298 (298) T ss_pred eEEEEecC Confidence 99998777 No 49 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.66 E-value=1.1e-08 Score=64.18 Aligned_cols=324 Identities=15% Similarity=0.126 Sum_probs=169.8 Q ss_pred CCcc-ccCcCCCccccccc-----ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHI-YNDPAAGDASTVGP-----QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~-y~~~~~t~~~~v~~-----qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |-++ +.+-..|.+..-+. .+-..-|....|..-+..-++..+-..+.+ ..|++++|-|--...... .+.| T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~--~~G~sv~i~~iG~~t~~~--~~~g 76 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTKAAY--LKPG 76 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccc--cccceeEeeeccceeeee--ecCC Confidence 5422 22211121111111 122234566666666666677777666655 459999998766543311 1222 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) -...+. -.|+ ...+.+-+|-|+=-|-.+=|+.-..+.+-|+..+++.+ T Consensus 77 ~~l~~~----------~~~~----------------------~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~ 124 (347) T protein:vir:33 77 ENLDDK----------RKDI----------------------KHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQ 124 (347) T ss_pred CCCCCC----------CCCC----------------------ccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHH Confidence 222110 0011 11123334444332221112223344555777777777 Q ss_pred HHHHHhhHHHHHHHHHHhc--c-C----ceEEecCCCccceee---ecccccccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_020862. 155 MLRGANEITEDLLQADILA--S-A----DVKVFTGAATSMVTM---TGEAADAEDDGLITLKDLKRLSITLTDNYTPKKT 224 (405) Q Consensus 155 ll~~~~~~ted~l~~~ila--g-~----~~v~yag~ats~~~~---t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T 224 (405) ....-+......+-+.+.. + + ...-..|.++..... ++....+..+..--++.|+.+...|.++..|. T Consensus 125 ~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~-- 202 (347) T protein:vir:33 125 LGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPA-- 202 (347) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCc-- Confidence 7766655555544433321 1 1 111111222211111 11100000001112678899999999999983 Q ss_pred ceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 225 TIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 225 ~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) ..|++++.|+....|.. ++.|+.. .|+..+.+.+|.||++.| |+++++++|-.-.+ T Consensus 203 ---------------~gR~~vv~P~~y~~Ll~------~~~~~~~-d~~~~~~~~~G~V~~i~G--~~V~~Sn~lp~~~~ 258 (347) T protein:vir:33 203 ---------------ADRTFYTTPDNYSAILA------ALMPNAA-NYQALLDPERGTIRNVMG--FEVVEVPHLTAGGA 258 (347) T ss_pred ---------------cCcEEEeCHHHHHHHhc------ccccccc-ccccccccccceeEEEec--eeEEEecccccCcc Confidence 24889999999999864 4667765 688778899999999965 99999998754221 Q ss_pred CCCc---ccCCCcccccccccC-CcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhH Q lcl|NC_020862. 305 AGAT---ATAANRGYQVSDVAG-TDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFS 380 (405) Q Consensus 305 aGa~---~~~t~~~~~~~~~~g-~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~ 380 (405) -+-. .++....+....+.. ....+--.-|++-++|-|++-++... +- ..-|+--|.-.+ T Consensus 259 ~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~----------~e-------~~r~~~~~~d~i 321 (347) T protein:vir:33 259 GDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLA----------LE-------RARRANYQADQI 321 (347) T ss_pred ccccccccccccccccCCcccceeccccceeeeeecchhheeeeeecee----------ee-------eccchhhhhHhh Confidence 1100 000000111111110 11122334578899999888876311 11 224667777778 Q ss_pred HHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 381 SIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 381 gwK~~~~~~iL~~~~marie~~a~~ 405 (405) --|..|++.+||++..+.|| .|- T Consensus 322 ~~~~~~G~~vlrP~~av~i~--~~~ 344 (347) T protein:vir:33 322 IAKYAMGHGGLRPEAAGAIV--LPK 344 (347) T ss_pred hhhhhcCCceecccceEEEe--cCC Confidence 88899999999999977664 444 No 50 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.66 E-value=6.3e-09 Score=65.53 Aligned_cols=282 Identities=9% Similarity=0.011 Sum_probs=157.9 Q ss_pred CCccccCc----CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDP----AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~----~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) +..-..+. ..++.+. +.-.-+-.+.++.+....+..++.+++...+|+-+.|+...+++-..-+. ..-..|| T Consensus 98 ~~~~~~~~~~~~~~~t~~~-gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~E~-- 173 (397) T protein:vir:48 98 VRGRYQNLLDSKTDASGSD-AGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGL-AKLDDEA-- 173 (397) T ss_pred HhhhhhHHHHHhhccCCcc-ccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcc-eeeeccc-- Confidence 11111000 1111111 11122335555555555567888999999999998887665553221111 1111122 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) +.+. .-...++..|+.+.++++.++.+|+++ +-|++.++...+..++. T Consensus 174 ---~~~~----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~v~~~l~ 221 (397) T protein:vir:48 174 ---GSIG----------------------------TNDDPKLYPIRYAIKRYAGISTVTNSL-LADSAENILAWLSGWIA 221 (397) T ss_pred ---cccc----------------------------cccccceeeEEeeheeeeeehhhHHHH-HhhchHHHHHHHHHHHH Confidence 1110 111125667889999999999999984 45667678888777766 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) +..+.. +-..|++|.+... . .-+.+++++|.++...|+....+ T Consensus 222 ~~~~~~----~d~~il~G~g~~~------~------------~~~~~~~d~i~~~~~~l~~~~~~--------------- 264 (397) T protein:vir:48 222 KKVVVT----RNKAILEAIATLP------T------------KPTLTKWDDIIDLQAKVDPAIKQ--------------- 264 (397) T ss_pred HHHHHH----HHHHHhhcccccc------c------------ccccccHHHHHHHHHHhhhhhcC--------------- Confidence 544432 3334566643211 0 12457899999999888765432 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) .-+.+|||.+...|+.|+|..+.|-|.|- +..|.-+.|.|-.+.++.+. .++ .++ T Consensus 265 ----~a~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~-~~~---~~~--------- 319 (397) T protein:vir:48 265 ----TSFFLTNTSGFTALKKVKNAFGDYLMERD--------VKSPTGYSIDGFAVKEVADR-WLA---NAS--------- 319 (397) T ss_pred ----CCEEEECHHHHHHHHHhhcCCCceeeccC--------cCCCCCceeccceeEEeccc-ccC---CcC--------- Confidence 12446899999999999998888888642 23455567766433333321 111 111 Q ss_pred ccccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~ 393 (405) . +.. .+++|.-+ +-.+..+ +.+.+.+..- .+-+-+.+.+.|+ +++.+.++++ T Consensus 320 -------~---~~~-~~~~gd~~~~~~~~~~-------~~~~i~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~ 374 (397) T protein:vir:48 320 -------S---GAM-PLYFGDLKQAVTLFDR-------QQMSLLSTNI-------GGGAFETDTTKIRVIDRFDVVATDT 374 (397) T ss_pred -------C---Cce-EEEEEeccceEEEEee-------cceEEEEecc-------chhhhhcCceeEEEEeeeccEEecc Confidence 0 112 24567533 1222222 1234444321 3345566667776 5678999999 Q ss_pred cceEEEEEecCC Q lcl|NC_020862. 394 ERIAVAYSVIPE 405 (405) Q Consensus 394 ~~marie~~a~~ 405 (405) +-++.++..+.. T Consensus 375 ~a~~~~~~~~~~ 386 (397) T protein:vir:48 375 ESFVPASFKAIA 386 (397) T ss_pred cceEEEEecccc Confidence 999888865554 No 51 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.65 E-value=1.3e-08 Score=63.85 Aligned_cols=300 Identities=11% Similarity=0.079 Sum_probs=161.9 Q ss_pred CCcccc-C-----cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHIYN-D-----PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~-~-----~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |----. | -..+.++.-+.-+-+ .+.+..+....+...+.+++...+|+.+ ++++-++..-+.+ + T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~p~~~~~~~a------~ 70 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEP-EQAKDYFAEAEKTSIVQQFAQKVPMGTT---GQKIPHWIGDVSA------Q 70 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccH-HHHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeCCcce------E Confidence 221111 1 111222222223333 4567777777777888899999988743 3454443321111 1 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) .-++|+++ .-...++.+++.+.+++|.++.+|++++. |+..++.+.+..+ T Consensus 71 ~v~E~~~~-----------------------------~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~ 120 (320) T protein:vir:10 71 WIGEGDMK-----------------------------PITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTMRTK 120 (320) T ss_pred EecCCccc-----------------------------cccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHHHHH Confidence 11222222 12223667789999999999999999544 6666788887777 Q ss_pred HHHHHhhHHHHHHHHHHhccCceEEecCCCc--cceeeecccccccCCceecHH-HHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 155 MLRGANEITEDLLQADILASADVKVFTGAAT--SMVTMTGEAADAEDDGLITLK-DLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 155 ll~~~~~~ted~l~~~ilag~~~v~yag~at--s~~~~t~~~~~~~~n~~it~~-~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) +.+..+.- +| ..+++|.+.-.-.+-.. ..++....... ..++....+ ++.++...|+.+... T Consensus 121 l~~a~a~~-~d---~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---------- 185 (320) T protein:vir:10 121 VATAFAMA-FD---SAALNGTDSPFPTYLAQTTKSVSLADPGGA-TASDLTAYDAVAVNGLSLLVNAKKK---------- 185 (320) T ss_pred HHHHHHHH-HH---HHhhcccCCCCCcccccccccccceecccc-cccccccHHHHHHHHHhhhhcccCC---------- Confidence 76544442 32 23566655322221110 11111111111 112223332 345555555544332 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) .-+.+|||.....|+.|+|..+.+-|.+...-+....+..+ .+-| +.++.++.+. +| T Consensus 186 ---------~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~---~i~g--~pv~~~~~~~----~~----- 242 (320) T protein:vir:10 186 ---------WTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAG---RIVS--RPTILSDHVA----DG----- 242 (320) T ss_pred ---------CcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCc---eeee--eeeEecCCCC----CC----- Confidence 12457899999999999998888888876555555443333 3322 4555555331 11 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCC-------ccchhhhHHHH- Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRND-------PYGKVGFSSIK- 383 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~D-------PlgQrg~~gwK- 383 (405) + ..+++|.-+...+++.+ .+.+.+..-. +....+ -+-|++.+.|+ T Consensus 243 --------------~----~~~~~gd~~~~~~~~~~-------~~~i~~~~~~--~~~~~~~~~~~~~~~f~~~~~~~r~ 295 (320) T protein:vir:10 243 --------------T----TVGYMGDFRNVIWGQVG-------GLSFDVTDQA--TLNLGTPTEPNFVSLWQHNLVAVRV 295 (320) T ss_pred --------------c----eEEEEeecceEEEEEec-------CeEEEEeecc--eeeeccccccccchhhhcCcEEEEE Confidence 1 13456765555565542 2333333211 111111 23467777877 Q ss_pred -HHHHHhhccccceEEEE-EecCC Q lcl|NC_020862. 384 -FFYGFIKLRGERIAVAY-SVIPE 405 (405) Q Consensus 384 -~~~~~~iL~~~~marie-~~a~~ 405 (405) +++++.+++++-+++|+ .+||+ T Consensus 296 ~~~~d~~v~~~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 296 EAEYAFHNNDKDAFVKLTNVVTPD 319 (320) T ss_pred EEeeccEEecccceEEEEeccCCC Confidence 78999999999999998 78999 No 52 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.61 E-value=4.2e-09 Score=66.51 Aligned_cols=292 Identities=12% Similarity=0.091 Sum_probs=154.5 Q ss_pred CCc----cccC---cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MPH----IYND---PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~~----~y~~---~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) ++. .+.. -..++.+.-+.-+-+.+ ....+....+...+.+++.+.++. .+..+.+.+......... T Consensus 103 ~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~-~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~~~---- 175 (409) T protein:vir:45 103 LTSEERKALRELRAQGVAQDEKGGYTVPETF-LAKVVEKMKSYGGIASVAQILTTS--DGRTMEWATADGTSEVGV---- 175 (409) T ss_pred ccHHHHHHHHHHhhccCccCcCCceeccHhH-HHHHHHHHHhhhhhhhhceeeecC--CCceEEEEeeccCccccc---- Confidence 000 0011 01111111111122222 333444444556777888887764 344455554443222211 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) ..+.|...... -.++..++-+.+++ +.++.+|+++ +-|++.++.+.|. T Consensus 176 -~v~E~~~~~~~-----------------------------~~~f~~~~l~~~k~~~~~i~is~el-l~ds~~~l~~~i~ 224 (409) T protein:vir:45 176 -LLGENEEAGEE-----------------------------DTDFGMGSLGALKMTSKIIRVSNEL-LQDSAIDMEAYLA 224 (409) T ss_pred -ccccccccccc-----------------------------ccccceeeeeeeeeeeeehhhhHHH-HhccHHHHHHHHH Confidence 22222222111 11333444455564 6899999985 4556667777766 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEe---cCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVF---TGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKG 229 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~y---ag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~g 229 (405) .++.+..+ ..+ -..+++|.++-.- .|--.. .++ .......+.+++++|.++...|+..... T Consensus 225 ~~la~a~~-~~~---~~a~l~G~G~~~~~~p~Gil~~---~~~-~~~~~~~~~~~~d~i~~l~~~l~~~~~~-------- 288 (409) T protein:vir:45 225 RRIAERIG-RGE---ARYLIQGTGAGTPKQPKGLAAS---VTG-TTQTAAANAVKWQEILALKHSIDPAYRR-------- 288 (409) T ss_pred HHHHHHHH-HHH---HHHhhccCCCCCccccceeeec---ccc-ccccccccccchHHHHHHHHhhhhhhcc-------- Confidence 66654333 233 3446766543110 010000 000 0111123568999999999999765442 Q ss_pred ccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcc Q lcl|NC_020862. 230 SRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATA 309 (405) Q Consensus 230 s~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~ 309 (405) .+.++.++|+.....|+.|+|..+.|-|.|- +..|.-++|-| +.++.+..| +..++| T Consensus 289 ---------~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G--~PV~~~~~~-p~~~~~--- 345 (409) T protein:vir:45 289 ---------GPKFRLAFNDNTLKLISEMEDGQGRPLWLPD--------IVGVAPASVLN--VPYVIDQEI-DDIGAG--- 345 (409) T ss_pred ---------CCeEEEEECHHHHHHHHHhhcCCCceeeccC--------cCCCCCceecc--eeeEEecCc-CCccCC--- Confidence 2357889999999999999998888888652 12344456755 577777655 322111 Q ss_pred cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHH Q lcl|NC_020862. 310 TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYG 387 (405) Q Consensus 310 ~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~ 387 (405) .+ .++||.=+...+..++ .+.++ ...|||.+.+.++++ .++. T Consensus 346 -------------------~~-~i~~Gd~~~~~i~~~~---------~~~~~-------~~~d~~~~~~~~~~~~~~r~d 389 (409) T protein:vir:45 346 -------------------KK-FMFCGDFDRFIIRRVR---------YMILK-------RLVERYAEYDQTGFLAFHRFD 389 (409) T ss_pred -------------------cc-EEEEeehhhhheeecc---------ceEEE-------EeecccccCCcEEEEEEEEec Confidence 12 2556764333333321 11122 125888888888877 4789 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.+.+++-+++++..+.= T Consensus 390 ~~~~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 390 CILEDTSAIKALVGKGSV 407 (409) T ss_pred cEeechhheEEEEeccCC Confidence 999999999999985555 No 53 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.60 E-value=2e-08 Score=62.74 Aligned_cols=298 Identities=10% Similarity=0.026 Sum_probs=159.8 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccC Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAG 84 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~ 84 (405) -+- .++...+-|+ .+.++.+..+.+.-++.+++.+.+|+.+. +++-++..-+.+ .-.. T Consensus 1 mat--~~~gg~lvP~----~~~~~ii~~~~~~s~i~~~~~~i~~~~~~---~~~p~~~~~~~a-~wv~------------ 58 (311) T protein:vir:81 1 MVA--LATGTFQLPK----HLVPGVWQKAQGQSVLARLSMAEPQEFGE---QQYMTLTAPPRG-EVVG------------ 58 (311) T ss_pred Cce--ecCCceEcch----hHHHHHHHHHHhcchhhhhcceeecCCCc---eEEEEEeCCcee-EEee------------ Confidence 011 1111223333 23466666677788899999999887753 444333221111 1111 Q ss_pred CcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhcc--chHHHHHHHHHHHHhhH Q lcl|NC_020862. 85 GNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDS--DLYGHLSREMLRGANEI 162 (405) Q Consensus 85 gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~--~l~~~~~~ell~~~~~~ 162 (405) .|........++.+++-..++++.++.+|++++.+..|+ ++++.+..++.+. ... T Consensus 59 ----------------------Eg~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a-i~~ 115 (311) T protein:vir:81 59 ----------------------EGAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVA-LGR 115 (311) T ss_pred ----------------------cCcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHH-HHH Confidence 122223334567788899999999999999976544443 3566655555433 333 Q ss_pred HHHHHHHHHhccCc----eE---EecCC--CccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 163 TEDLLQADILASAD----VK---VFTGA--ATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 163 ted~l~~~ilag~~----~v---~yag~--ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) .+|. .+++|.+ .. .-++. .++.++.+ +........++.++...+..++.. T Consensus 116 ~~d~---a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~~~~------------ 174 (311) T protein:vir:81 116 ALDL---IGIHGINPLTGAALSGSPAKILDTTNIVELT------TGTSATPDLAVEAAVGLVLGDNLS------------ 174 (311) T ss_pred HHHH---hhhccccCCCCcccccccccccccceeeeec------ccccchHHHHHHHHHHHhhhcCCC------------ Confidence 3332 2344421 10 00110 11111111 111223345566666666554432 Q ss_pred CcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCC Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAAN 313 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~ 313 (405) .-..++||.....|+.|+|-.++|-|.+.- ..+.-|++.| +.++.+..|.-=..++ .+ T Consensus 175 -------~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~--------~~~~~~tl~G--~Pv~~~~~i~~~~~~~-----~~ 232 (311) T protein:vir:81 175 -------PDGVALDNTFSFMLATQRDSQGRKLYPELG--------FGTDVASFAG--LNAAVSDTVRGGPEAV-----TA 232 (311) T ss_pred -------ceEEEEcHHHHHHHHhhhccCCCeeecCcc--------ccCCCceecc--eeEEeccccccccccc-----cc Confidence 123578999999999999999999997642 2445567766 3444444333101110 00 Q ss_pred cccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhc Q lcl|NC_020862. 314 RGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 314 ~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 391 (405) .. .....+.++. .+++|+-+.-.+++.+ .+++-+..-+ ..+....|-|++.+.|| +.+++.++ T Consensus 233 -~~-~~~~~~~~~~----~~~~gDfs~~~i~~~~-------~~~~~~~~~~--~~~~~~~~~~~~~v~~r~~~r~d~~v~ 297 (311) T protein:vir:81 233 -ST-GVYRTTNPNV----KAIAGDFSAFRWGVQV-------SIPLELIEFG--DPDGLGDLKRQNQIAIRAEVVYGIGIM 297 (311) T ss_pred -cc-chhcccCCcc----EEEEEecccEEEEEec-------cceEEEeccC--CCCcchhhhhcCcEEEEEEEEeccEee Confidence 00 0011122221 3568887766666652 2444444332 12334467889999998 68899999 Q ss_pred cccceEEEEEecCC Q lcl|NC_020862. 392 RGERIAVAYSVIPE 405 (405) Q Consensus 392 ~~~~marie~~a~~ 405 (405) +++-+++++-+.-- T Consensus 298 ~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 298 STDAFAVVRDADES 311 (311) T ss_pred cccceEEEEeeccC Confidence 99999999766544 No 54 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.59 E-value=1.9e-08 Score=62.96 Aligned_cols=280 Identities=10% Similarity=0.029 Sum_probs=154.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +......-..+++++-+.-+-+ -+.+.-+....+...+.+++...+|+.+ ++++-++.--....... +.|+ T Consensus 106 ~~~~~~~~~~~~~~~~g~~vp~-~~~~~ii~~~~~~~~l~~l~~~~~~~~~---~~~~~~~~~~~~~a~~v-----~E~~ 176 (395) T protein:vir:43 106 RVSMPRSAITSIDGSGGALVAP-DRRPGVVAAPQRRLTIRDLVAPGTTESN---SVEYVRETGFVNNAAPV-----SEGT 176 (395) T ss_pred hhhhhhhhhcccCCCCccccch-hhHHHHHHHHHhhhhHHhhccceecCCC---ceEEEEEecCCCceeee-----cCCc Confidence 1111111111112221222223 2345544445566788899999998754 45554432211111111 2222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ++ .-...++..|+.++++++.++.+|++++. |+. ++...+..++.+..+ T Consensus 177 ~~-----------------------------~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~v~~~la~a~~ 225 (395) T protein:vir:43 177 QK-----------------------------PYSDLTFELENAPVRTIAHLFKASRQILD-DAS-ALQSYIDARARYGLM 225 (395) T ss_pred cc-----------------------------cccccceeEEEEeeeeEEEeehhhHHHHH-hHH-HHHHHHHHHHHHHHH Confidence 21 11223677789999999999999999654 554 566665555554333 Q ss_pred hHHHHHHHHHHhccCce-------EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 161 EITEDLLQADILASADV-------KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 161 ~~ted~l~~~ilag~~~-------v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) . .++ ..+++|.++ .-..+..+.. ..........+++|.++...|+.+..+. T Consensus 226 ~-~~d---~~~l~G~g~~~~~~Gi~~~~~~~~~~-------~~~~~~~~~~~~~i~~~~~~~~~~~~~~----------- 283 (395) T protein:vir:43 226 L-VEE---CQLLYGNGTGANLHGIIPQAQAYAPP-------SGVVVTAEQRIDRIRLAILQAQLAEFPA----------- 283 (395) T ss_pred H-HHH---HHHHhccCCCCccccccccccccccc-------cccccccchhHHHHHHHHHhhccccCCC----------- Confidence 3 333 345565432 1111111111 1111223467889999988887665531 Q ss_pred CcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCC Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAAN 313 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~ 313 (405) + +-++||.+...|+.|+|..+.|-|.+. ..+.-+.+-| +++++++.|. +| T Consensus 284 -------~-~~vmn~~~~~~l~~lkd~~G~~i~~~~---------~~~~~~~l~G--~pVv~~~~~~----~~------- 333 (395) T protein:vir:43 284 -------S-GIVLNPIDWALIELNKDAENRYIIGSP---------QNGTTPTLWR--LPVVETQAIT----QD------- 333 (395) T ss_pred -------c-EEEEcHHHHHHHHHhhccCCceecccc---------ccCCCceecc--eeeEEcCCCC----CC------- Confidence 1 357999999999999987777766322 3455567755 6777776542 11 Q ss_pred cccccccccCCcceeeeEEEEEccccc-eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 314 RGYQVSDVAGTDKYDIAPLLVVGDQAF-ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 314 ~~~~~~~~~g~~~~DVYp~lV~G~~Af-g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 390 (405) + +++|.-.. -.+..+ ..+.+-+.. ..+.+-+++...|+ +++.+.+ T Consensus 334 --------------~----~~~gd~~~~~~~~~~-------~~~~i~~~~-------~~~~~f~~~~~~~r~~~r~d~~v 381 (395) T protein:vir:43 334 --------------E----FLTGAFSLGAQIFDR-------MDIEVLVST-------ENDKDFENNMVTIRAEERLAFAV 381 (395) T ss_pred --------------c----EEEEeccceEEEEEe-------cceEEEEec-------cccchhhcCcEEEEEEEeeccEE Confidence 1 34555332 122221 113333332 13445678888888 5889999 Q ss_pred ccccceEEEEEecC Q lcl|NC_020862. 391 LRGERIAVAYSVIP 404 (405) Q Consensus 391 L~~~~marie~~a~ 404 (405) ++++-++++++.+- T Consensus 382 ~~~~a~~~~~~taa 395 (395) T protein:vir:43 382 YRPEAFVTGSLTAS 395 (395) T ss_pred ecccceEEEEeccC Confidence 99999999999888 No 55 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.59 E-value=1.2e-08 Score=64.08 Aligned_cols=294 Identities=13% Similarity=0.077 Sum_probs=158.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCC--ccccCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLN--VNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t--~l~eGvtp~ 78 (405) +-.+-+...+++.+..+..+-+ -+.+..+....+...+.+++++.+|+-+ .....+ .....+ -..||-..+ T Consensus 155 ~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~~----~~~~~~a~~v~e~~~~~ 227 (458) T protein:vir:10 155 QRHLKAVNQSSSVEVSSESYET-IFSQRIIRDLQKELVVGALFEELPMSSK--ILTMLV----EPDAGKATWVAASTYGT 227 (458) T ss_pred hhhhhhhhhcccCccccceehh-hHhHHHHHHHHhhhhHHhhcceeecCCc--ceEEEE----ecCCcceeecccccccc Confidence 1111111122222222222223 3456666666677788899998888643 222222 111111 122222111 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) . . ....... .++..|+.+.++++.++.+|++ ++-|++.++.+.+..+|.+. T Consensus 228 ~-----~---------~~~~~~~--------------~~~~~i~~~~~k~~~~v~is~e-ll~ds~~~~~~~i~~~l~~~ 278 (458) T protein:vir:10 228 D-----T---------TTGEEVK--------------GALKEIHFSTYKLAAKSFITDE-TEEDAIFSLLPLLRKRLIEA 278 (458) T ss_pred c-----c---------ccccccc--------------ccceeeEeeeeeEEeeehhhHH-HHhcchHHHHHHHHHHHHHH Confidence 1 0 0001111 2456678899999999999998 55666667888877776654 Q ss_pred HhhHHHHHHHHHHhccCceEE------ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKV------FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~------yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) .+. .+ -..+++|.++-. +++.....+ .. ..++...+.+++++|.++...|+.+... T Consensus 279 i~~-~~---d~~~l~G~G~~~p~Gi~~~~~~~~~~~-~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~----------- 340 (458) T protein:vir:10 279 HAV-SI---EEAFMTGDGSGKPKGLLTLASEDSAKV-VT--EAKADGSVLVTAKTISKLRRKLGRHGLK----------- 340 (458) T ss_pred HHH-HH---HHHhhcCCCCCccceeeecccccccce-ee--cccccccccccHHHHHHHHHhhhhhhcC----------- Confidence 443 23 335566654422 222111111 11 1122235679999999999888765432 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) .++ -+||+.....|+.|+|..+.|-|.|- +. .....|..+++-| ++++.+..| | +++ T Consensus 341 ------~~~--~v~~~~~~~~l~~lkd~~G~~i~~~~--~~--~~~~~~~~~~l~G--~pv~~~~~~-p---~~~----- 397 (458) T protein:vir:10 341 ------LSK--LVLIVSMDAYYDLLEDEEWQDVAQVG--ND--SVKLQGQVGRIYG--LPVVVSEYF-P---AKA----- 397 (458) T ss_pred ------CCE--EEEcHHHHHHHHhhcccCCceeeccc--cc--cccccCcCceecc--eeeEEcccc-c---ccc----- Confidence 112 37899999999999987777777643 22 2234666777876 466666544 2 111 Q ss_pred CcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 390 (405) +..+|+ +..|+ +.|-...-. .+++. .|||.+.+++++. ..++..+ T Consensus 398 ------------~~~~~~-~~~f~-~~~~~~~~~--------~~~v~-----------~d~~~~~~~~~~~~~~r~~~~v 444 (458) T protein:vir:10 398 ------------NSAEFA-VIVYK-DNFVMPRQR--------AVTVE-----------RERQAGKQRDAYYVTQRVNLQR 444 (458) T ss_pred ------------CCcceE-EEEec-ccEEEEEee--------ceEEE-----------eecccCCCceEEEEEEEecceE Confidence 111332 33333 233222221 12321 3777777776665 3456788 Q ss_pred ccccceEEEEEecC Q lcl|NC_020862. 391 LRGERIAVAYSVIP 404 (405) Q Consensus 391 L~~~~marie~~a~ 404 (405) +++..++.+..+|- T Consensus 445 ~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 445 YFANGVVSGTYAAS 458 (458) T ss_pred ecccceEEEeeccC Confidence 89999999999888 No 56 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.58 E-value=1.6e-09 Score=68.84 Aligned_cols=290 Identities=15% Similarity=0.074 Sum_probs=154.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +..+-...-.+.+.+-+.-.-+--|.++.+....+..++.+++...++..+.. +..+ ...... .+....|. T Consensus 99 ~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~---~~~~---~~~~~~---a~wv~E~~ 169 (401) T protein:vir:44 99 LRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDY---KKLV---NLGGTA---SGWVGETD 169 (401) T ss_pred hHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCce---EEEE---ecCCcc---ceeecccc Confidence 11110000011111111111122334444444445667888888887754432 2221 111000 11111111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) . ++.....++..|+-+.++++.++.+|+++ +-|+..++.+.+..+|.+.-+ T Consensus 170 ~----------------------------~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~ai~ 220 (401) T protein:vir:44 170 T----------------------------RSQTATSRLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEAWINSELATEFA 220 (401) T ss_pred c----------------------------cCccccccceeeeeehhheeeehhhhHHH-HhcchHHHHHHHHHHHHHHHH Confidence 1 11111235667889999999999999985 446666788887777665443 Q ss_pred hHHHHHHHHHHhccCceEEecC------CCccceeee---cccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTG------AATSMVTMT---GEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag------~ats~~~~t---~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) . . +-..+++|.++-.=.| ..+...... .........+.+++++|.++...|+...... T Consensus 221 ~-~---~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~--------- 287 (401) T protein:vir:44 221 E-Q---EEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTG--------- 287 (401) T ss_pred H-H---HHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcC--------- Confidence 2 2 3344556544321111 100000000 0000111245689999999999887653321 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) + +.++|+.+..-|+.|+|-.+.|-|.|--+.|.+ ++|-| +.++.++.| +..++|+ T Consensus 288 ---------a-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~--------~~l~G--~PVv~~~~~-p~~~~~~---- 342 (401) T protein:vir:44 288 ---------A-KFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQP--------SSLAG--YGIAENEQM-PDIAADA---- 342 (401) T ss_pred ---------C-EEEEcHHHHHHHHHhhccCCceeecCCcCCCCC--------ceecc--eeeEEecCc-CCccCCc---- Confidence 1 357999999999999998888988775344433 45655 344555433 3222111 Q ss_pred CCcccccccccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH--HHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF--FYG 387 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~ 387 (405) ..++||+ ++|....-.| +++ ..|||.+++.++|++ .+. T Consensus 343 -------------------~~i~~Gd~~~~~~i~~~~~--------~~~-----------~~~~~~~~~~v~~~a~~r~d 384 (401) T protein:vir:44 343 -------------------KAIAFGNFKRGYTIVDRIG--------TRI-----------LRDPYTNKPFVGFYTTKRTG 384 (401) T ss_pred -------------------cEEEEeehhccEEEEEecc--------eEE-----------eeeccccCCcEEEEEEEEec Confidence 1244565 3343222211 222 156788899999996 599 Q ss_pred HhhccccceEEEEEecC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIP 404 (405) Q Consensus 388 ~~iL~~~~marie~~a~ 404 (405) +.+++++-+++|+.+|- T Consensus 385 ~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 385 GMLVDSQAIKLLKIAAA 401 (401) T ss_pred cEEecccceEEEEeecC Confidence 99999999999999988 No 57 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.57 E-value=2.2e-08 Score=62.60 Aligned_cols=288 Identities=11% Similarity=0.092 Sum_probs=154.4 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) =.+.++.-..+.+++-+.-+-+. +.++.+..+.+..++.+++...+|+. .++++.++...+.+ ... +.|+ T Consensus 20 ~~~~~~a~~~~~~~~~~~lip~~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a-~~v-----~Eg~ 89 (324) T protein:vir:96 20 KPQVFNPDNVMMHEKKDGTLLND-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA-YWV-----GEGQ 89 (324) T ss_pred hhhhcccccccccCCCcceechh-HHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcce-eee-----cCCc Confidence 11222211111111112122222 34566666667788889999988874 34666554322221 111 2222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ++ -....++..++.+.++++.++.+|++++ -|++.++...+..++.+..+ T Consensus 90 ~~-----------------------------~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~aia 139 (324) T protein:vir:96 90 KI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred cc-----------------------------cccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHH Confidence 21 1122367778899999999999999854 46666788887777775544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .-.+. .+++|.+.-.....-.+. . ........+.+++++|.++...|+.+.... T Consensus 140 ~~~d~----~~l~G~g~~~~~~~~~~~---~-~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~------------------ 193 (324) T protein:vir:96 140 KKFDE----AGILNQGNNPFGKSIAQS---I-KKTNKVIKGDFTQDNIIDLEALLEDDELEA------------------ 193 (324) T ss_pred HHHHH----HhhhcCCCCCcCcccccc---c-cccceecccccchHHHHHHHHhhhhccCCC------------------ Confidence 43333 333443211111100000 0 111122235588999999998887754421 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + ..+||+.....|+.++|..+.|.|.+ +--+++-| +..+.++... . T Consensus 194 ~-~~i~n~~~~~~L~~lkd~~G~~~~~~------------~~~~~l~G--~PV~~~~~~~----~--------------- 239 (324) T protein:vir:96 194 N-AFISKTQNRSLLRKIVDPETKERIYD------------RNSDSLDG--LPVVNLKSSN----L--------------- 239 (324) T ss_pred C-EEEEcHHHHHHHHHhhCCCCCeeecC------------CCCCcccc--eeeEeecCCC----C--------------- Confidence 1 35799999999999988766666531 22344544 2333332110 0 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCCCCCCCCc------cchhhhHHHH--HHHHHhhc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEATADRNDP------YGKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~tad~~DP------lgQrg~~gwK--~~~~~~iL 391 (405) ++ ..+++|+-+.-.+++. +.+++-+.. .+.. ...|+ |-|++.+.|+ +++++.++ T Consensus 240 ----~~----~~~~~gd~s~~~~~~~-------~~~~i~~~~~~~~~--~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~ 302 (324) T protein:vir:96 240 ----KR----GELITGDFDKLIYGIP-------QLIEYKIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred ----Cc----ceEEEEecceEEEEEe-------cCcEEEEeeccccc--ccccccccchhhhhcCcEEEEEEEEeccEEe Confidence 01 1256776665555553 124433322 1111 11222 2466677777 78899999 Q ss_pred cccceEEEEEecCC Q lcl|NC_020862. 392 RGERIAVAYSVIPE 405 (405) Q Consensus 392 ~~~~marie~~a~~ 405 (405) +++-+++|+.+.+- T Consensus 303 ~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 303 DDKAFAKLVPADKR 316 (324) T ss_pred cccceEEEeccccc Confidence 99999999966554 No 58 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.57 E-value=4.2e-08 Score=61.01 Aligned_cols=290 Identities=12% Similarity=0.113 Sum_probs=158.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.++.-..+.+++-+.-+-+ .+..+.+..+.+..++.+++...+|+. .++++.++...+.+ ...+.|+ T Consensus 20 ~~~~~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ip~~~~~~~a------~~v~Eg~ 89 (324) T protein:vir:97 20 KPQVFNPDNVMMHEKKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA------YWVGEGQ 89 (324) T ss_pred hhhhhccccccccCCCcceech-hHHHHHHHHHHhhcchhhhcceeeccC---CceEEEEEecCcce------eEeccCc Confidence 2222322222212222222222 335556666667788889998888873 44666554322221 1112222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ++ .-...++..++.+.++++.++.+|++++. |+..++...+..++.+..+ T Consensus 90 ~~-----------------------------~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~aia 139 (324) T protein:vir:97 90 KI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred cc-----------------------------cccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHH Confidence 22 11123667788999999999999998544 5555777776666665433 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . .+ -..+++|.+.--..+.-.+.. ........+.+++++|.++...|+.+.-. ++ T Consensus 140 ~-~~---d~a~l~G~g~~~~~~gi~~~~----~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-----------------~~ 194 (324) T protein:vir:97 140 K-KF---DEAGILNQGNNPFGKSIAQSI----EKTNKVIKGDFTQDNIIDLEALLEDDELE-----------------AN 194 (324) T ss_pred H-HH---HHHhhccCCCCccCccccccc----cccceeccccCCHHHHHHHHHhhhhccCC-----------------CC Confidence 3 22 234555554322221111111 11122234678999999999988876532 12 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + .+|||.....|+.|+|..+.|.|.+ +--|.+-| +.++.++... . T Consensus 195 ~--~v~n~~~~~~L~~lkd~~g~~~~~~------------~~~~tl~G--~PV~~~~~~~----~--------------- 239 (324) T protein:vir:97 195 A--FISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDG--LPVVNLKSSN----L--------------- 239 (324) T ss_pred E--EEEcHHHHHHHHHhhcCCCceeecC------------CCCccccc--eeeEeecCCC----C--------------- Confidence 2 4789999999999998777776642 22244544 3444443100 0 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC-CC--CCCCCc--cchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE-AT--ADRNDP--YGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~-~t--ad~~DP--lgQrg~~gwK--~~~~~~iL~~ 393 (405) ++ ..+++|.-+...++.++ .+++-+..-.. .+ ....-+ +-|+..+.++ +++.+.++++ T Consensus 240 ----~~----~~~~~gd~~~~~i~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:97 240 ----KR----GELITGDFDKLIYGIPQ-------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred ----Cc----ceEEEEecccEEEEEec-------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecc Confidence 11 12567776665565541 24443332211 00 011112 2356667766 6789999999 Q ss_pred cceEEEEEecCC Q lcl|NC_020862. 394 ERIAVAYSVIPE 405 (405) Q Consensus 394 ~~marie~~a~~ 405 (405) +-++.|+.+.|- T Consensus 305 ~a~~~l~~~~~~ 316 (324) T protein:vir:97 305 KAFAKLVPADKK 316 (324) T ss_pred cceEEEEeccCC Confidence 999999988884 No 59 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.56 E-value=4.8e-08 Score=60.72 Aligned_cols=316 Identities=13% Similarity=0.061 Sum_probs=155.9 Q ss_pred ccccCcCCCc---ccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 3 HIYNDPAAGD---ASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 3 ~~y~~~~~t~---~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+.|+.=+.- .++|+.....-|-++=.+.-. +.-++|-....++-+.++--+|-.+...... ..|=+..+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~q---q~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 73 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQ---QKSAKLKQYCQHKNESSESHNWETLASMDPD----AVKRKRSR 73 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHH---Hhhhhhhcccccccccccccceeeccccccc----cccccccc Confidence 5555532220 123433333323222122222 2335555555566666664443333222110 01111111 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) ++. +|.. -..| +|-+ .+.++.+.+++|..+..+.|.. .....-|+......+..-.. T Consensus 74 ~~~---------~d~~---~dtp--------~~~~--~~~~r~~~~~d~~~~~~VDd~D-~~k~~~D~~~~~~~~~a~AL 130 (322) T protein:vir:10 74 QQS---------ADGT---YPTP--------VNNK--PFAKRRTNVDTYDTGHVVEQED-ISQMLLDPNSALITSQAYAM 130 (322) T ss_pred ccc---------cCcc---cCCC--------cccc--ccceEEEeecccccceecchHH-HHHhhcCchHHHHHHHHHHh Confidence 111 1110 0000 0111 2445668999998887776642 23333344444333333222 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) +.-..+.|..-++.++ .+ |..+..+...+....+..++-+|++.|+.|.+.|..+.+|.. T Consensus 131 ~R~~D~~I~~a~~g~a-~~---~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d---------------- 190 (322) T protein:vir:10 131 ARKTDDLIIAGAWKPA-SI---KGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPE---------------- 190 (322) T ss_pred hhHHHHHHHhhhhccc-cc---cccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCC---------------- Confidence 2222333332232222 21 222222322222233444678999999999999999999831 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCccccc-CcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIM-NGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~-~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) .-+++++.|....+|-. ++.|+.+ .|.+.+.+. +|.||.+-| |.++.+-.+-.-...+- ..+ T Consensus 191 ~~R~~vv~p~~~~~LL~------d~~~ts~-D~~~~~~l~~~G~ig~~lG--f~~i~s~~lp~~~~t~~-~~~------- 253 (322) T protein:vir:10 191 VSKVIVIGPTQARKLLQ------ITEATSA-DYTSAMDLQSKGIITNWMG--YTWIVSTRLDKFDPTQW-GMA------- 253 (322) T ss_pred CCeEEEeCHHHHHHHhc------chhhhhh-hcccchhhhhcCeeeeeee--EEEEEeccCCccccccc-ccc------- Confidence 12678899999888843 6889985 787777775 599999977 89888865542111100 000 Q ss_pred ccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) ......-|+.+.++.=+.|.+-.--+ + -..+ |...|++. --..+.-++.|+|.+++|+.++. T Consensus 254 --~~~~~~~~~~~~~a~~k~Av~~a~~~----d--v~~~-i~~~~~~~---------~a~~I~~~~~~Ga~ri~~~gVv~ 315 (322) T protein:vir:10 254 --AEDGPQGDEIWCIAMTDMALGYHSCK----D--IWTK-VAEDPSAS---------FAWRIYSAFTADCVRVEDEHIFK 315 (322) T ss_pred --ccCCCCccceeEEEEecCceeEEEee----e--eeEE-eeccCCcc---------hhhhhhhhhhhCceEeccCcEEE Confidence 00011125677554444444322110 1 0111 13344321 11235556889999999999999 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) |+| -| T Consensus 316 i~~--~e 320 (322) T protein:vir:10 316 LRL--KN 320 (322) T ss_pred EEE--ec Confidence 999 67 No 60 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.56 E-value=9.9e-09 Score=64.46 Aligned_cols=291 Identities=11% Similarity=0.017 Sum_probs=153.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.......+++-+.-+.+-.+...-+....+...+.+++...+|+.+.++....+ ..+-+... -..||- T Consensus 113 ~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~v~Eg~----- 185 (415) T protein:vir:46 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR-QSEVAALE-KVEELE----- 185 (415) T ss_pred HhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEE-ecCCccee-eccccc----- Confidence 00000011111122222222333445555555566788999999999998877643322 22221111 112221 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+. .....++..|+-+.++++.++.+|++++ -|++.++..++..++.+..+ T Consensus 186 ~~~----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~ 236 (415) T protein:vir:46 186 ENP----------------------------ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIA 236 (415) T ss_pred ccc----------------------------cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHH Confidence 110 0011256678899999999999999854 56666788877777765544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) ..-+ ..+++|.++-.-.+....... ........+.+++++|.++...|....... T Consensus 237 ~~~d----~~il~g~g~g~~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------------ 291 (415) T protein:vir:46 237 ATRN----KAIIDVITKGSTGSTSSGFEK---EGKKLEVKKAKSLDDIKDAINLNVKPNYEH------------------ 291 (415) T ss_pred HHHH----HHHhhccccCCcccccccccc---ccceeccccccchHHHHHHHHhhhhhccCC------------------ Confidence 4322 334444322100111100000 011112345689999999988887654321 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + ..++|+.....|+.|+|-.++|-|.|- + .++-.++|-| +.+++++.+ +..++| T Consensus 292 ~-~~v~n~~~~~~L~~lkd~~G~~i~~~~--~------~~~~~~~l~G--~pV~~~~~~-~~~~~~-------------- 345 (415) T protein:vir:46 292 N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--V------KEKTQQRLLG--AKIEILPDE-VLGQKG-------------- 345 (415) T ss_pred C-EEEEcHHHHHHHHHhhccCCCeeeccC--c------CCCCCccccc--eeeEEeccc-cccCCC-------------- Confidence 1 346999999999999998888888652 1 2444567766 455555533 211110 Q ss_pred ccCCcceeeeEEEEEccc--cceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQ--AFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~--Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) |+ .+++|.= +|-...-++ +.+-+. .+-..|+++.++ +++.+.+++++-+++ T Consensus 346 -------~~--~~~~gd~~~~~~~~~~~~--------~~v~~~---------~~~~~~~~~~~~-~r~d~~v~~~~a~~~ 398 (415) T protein:vir:46 346 -------NN--TLIIGNLKDAIVLFDRSQ--------YQASWT---------DYMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred -------cc--EEEEEehhccEEEEeecc--------eEEEee---------ccccCceEEEEE-EEeccEEeccccEEE Confidence 11 2677743 332221111 222221 122334555443 568899999999999 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:46 399 IEYDDSE 405 (415) T ss_pred EEeeccC Confidence 9876555 No 61 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.56 E-value=9.9e-09 Score=64.46 Aligned_cols=291 Identities=11% Similarity=0.017 Sum_probs=153.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-.......+++-+.-+.+-.+...-+....+...+.+++...+|+.+.++....+ ..+-+... -..||- T Consensus 113 ~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~v~Eg~----- 185 (415) T protein:vir:47 113 LETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR-QSEVAALE-KVEELE----- 185 (415) T ss_pred HhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEE-ecCCccee-eccccc----- Confidence 00000011111122222222333445555555566788999999999998877643322 22221111 112221 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+. .....++..|+-+.++++.++.+|++++ -|++.++..++..++.+..+ T Consensus 186 ~~~----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~ 236 (415) T protein:vir:47 186 ENP----------------------------ELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQELKLWMARTIA 236 (415) T ss_pred ccc----------------------------cccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHHHHHHHHHHHH Confidence 110 0011256678899999999999999854 56666788877777765544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) ..-+ ..+++|.++-.-.+....... ........+.+++++|.++...|....... T Consensus 237 ~~~d----~~il~g~g~g~~~~~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~------------------ 291 (415) T protein:vir:47 237 ATRN----KAIIDVITKGSTGSTSSGFEK---EGKKLEVKKAKSLDDIKDAINLNVKPNYEH------------------ 291 (415) T ss_pred HHHH----HHHhhccccCCcccccccccc---ccceeccccccchHHHHHHHHhhhhhccCC------------------ Confidence 4322 334444322100111100000 011112345689999999988887654321 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + ..++|+.....|+.|+|-.++|-|.|- + .++-.++|-| +.+++++.+ +..++| T Consensus 292 ~-~~v~n~~~~~~L~~lkd~~G~~i~~~~--~------~~~~~~~l~G--~pV~~~~~~-~~~~~~-------------- 345 (415) T protein:vir:47 292 N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--V------KEKTQQRLLG--AKIEILPDE-VLGQKG-------------- 345 (415) T ss_pred C-EEEEcHHHHHHHHHhhccCCCeeeccC--c------CCCCCccccc--eeeEEeccc-cccCCC-------------- Confidence 1 346999999999999998888888652 1 2444567766 455555533 211110 Q ss_pred ccCCcceeeeEEEEEccc--cceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQ--AFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~--Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) |+ .+++|.= +|-...-++ +.+-+. .+-..|+++.++ +++.+.+++++-+++ T Consensus 346 -------~~--~~~~gd~~~~~~~~~~~~--------~~v~~~---------~~~~~~~~~~~~-~r~d~~v~~~~a~~~ 398 (415) T protein:vir:47 346 -------NN--TLIIGNLKDAIVLFDRSQ--------YQASWT---------DYMHFGECLMIA-VRQDCRILDYKSAIV 398 (415) T ss_pred -------cc--EEEEEehhccEEEEeecc--------eEEEee---------ccccCceEEEEE-EEeccEEeccccEEE Confidence 11 2677743 332221111 222221 122334555443 568899999999999 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++- T Consensus 399 ~~~~~~~ 405 (415) T protein:vir:47 399 IEYDDSE 405 (415) T ss_pred EEeeccC Confidence 9876555 No 62 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.55 E-value=2.7e-08 Score=62.07 Aligned_cols=308 Identities=14% Similarity=0.065 Sum_probs=163.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCC--CCccccCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDD--LNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~--~t~l~eGvtp~ 78 (405) |.. -.++....+++-+ ..-+-.+..+.+....+.-.+.+++...+|+.+ .+++.++.--+.+ .....-....+ T Consensus 10 ~~~-~~~~~~~~~~~~~-~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~~~~a~~v~~~~~~~~~E 84 (338) T protein:vir:78 10 NTA-GSNHQGRLAHVPS-DLLPKEIVGPIFDKAQESSLVLRLGENIPISYG---ETIIPTTVKRPEVGQVGVGTSNEQRE 84 (338) T ss_pred hhc-ccccccceecccc-cccchHHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecCccceeecccccccccc Confidence 222 1333332222222 233334566777777778888999999998854 3333332222111 11111111122 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |+.......++..|+.+.++++.++.+|+++ +-|+..++.+.|..++.+. T Consensus 85 -----------------------------g~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~a 134 (338) T protein:vir:78 85 -----------------------------GGTKPLSGTAWDTRSVAPIKLATIVTVSEEF-ARMNPSGLYTKLQADLAYA 134 (338) T ss_pred -----------------------------cccccccccceeEEEEEEEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHH Confidence 2222333346778889999999999999985 4455567888777666643 Q ss_pred HhhHHHHHHHHHHhccCceEEec---CCCccce--eeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFT---GAATSMV--TMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~ya---g~ats~~--~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) .+. .+| ..+++|.+...-+ |-.+... ..+... ....+....++++.++...+..|.... T Consensus 135 ~~~-~~d---~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 198 (338) T protein:vir:78 135 IGR-GID---LAVFHGKSPLTGSALQGIDTNNVIVNTTNVD-YLQTGTTPLLDRFLDGYDLVSANTDVD----------- 198 (338) T ss_pred HHH-HHH---HHhhcccCCCccccccccccccccccccccc-cccccchhhHHHHHHHHHHhhhhcccc----------- Confidence 333 232 2455654422111 1000000 000100 111234566788888887776655431 Q ss_pred CcccccceEEEEEcccchHHHH---HHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYIT---ELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT 310 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir---~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~ 310 (405) .-+.++||.+...|+ .++|-.+.|-|.+. .+.+.-+.|.| +.++.++.|---.++ T Consensus 199 -------~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~--------~~~~~~~~l~G--~PV~~~~~ip~~~~~----- 256 (338) T protein:vir:78 199 -------FNGWAADPRYRARLLRSQAYRDANGNVDPTRI--------NLAASAGDLLG--LPVQFGKAVGGDLGA----- 256 (338) T ss_pred -------ceEEEEchHHHHHHHHHhhhccCCCceeeccc--------ccCCCCceeee--eeEEEccccCccccc----- Confidence 124678998877774 45565666666543 34556677766 466666655311111 Q ss_pred CCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCCCCCCCCcc------chhhhHHHH Q lcl|NC_020862. 311 AANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEATADRNDPY------GKVGFSSIK 383 (405) Q Consensus 311 ~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~tad~~DPl------gQrg~~gwK 383 (405) ....+ ..+++|+-+.-.++..+ .+.+-+.. .+. -+..||- -|+....|| T Consensus 257 -----------~~~~~----~~~~~gdfs~~~~~~~~-------~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~r 312 (338) T protein:vir:78 257 -----------ATDSK----VRVVGGDFSQLKYGFAD-------EIRVKMSDTATL--TDNTSPTPQTVSMWQTNQIAIL 312 (338) T ss_pred -----------cCCcc----cEEEEEecceEEEEeec-------ccEEEEeecccc--cccccccccchhhhhcCcEEEE Confidence 11122 24557877766666542 24443332 221 1445564 455667776 Q ss_pred --HHHHHhhccccceEEEE-EecCC Q lcl|NC_020862. 384 --FFYGFIKLRGERIAVAY-SVIPE 405 (405) Q Consensus 384 --~~~~~~iL~~~~marie-~~a~~ 405 (405) +++.+.+++++-+++|+ ..+|+ T Consensus 313 ~~~r~d~~v~~~~a~~~l~~~~~~~ 337 (338) T protein:vir:78 313 IEVTFGWLLGDKQAFVKFVDDEDPD 337 (338) T ss_pred EEEEeccEeecccceEEEecccCCC Confidence 68899999999887775 45677 No 63 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.54 E-value=1.2e-08 Score=64.06 Aligned_cols=304 Identities=13% Similarity=0.112 Sum_probs=163.8 Q ss_pred ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccc Q lcl|NC_020862. 3 HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASY 82 (405) Q Consensus 3 ~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~ 82 (405) |.-.|+.+.+..- +.+.-|+...|.-..+.|+...++....- .+|.||++.+-.....-.-.....++..- T Consensus 1 ~~~~n~ts~~qaf----i~~EiWsa~il~~l~~~Lv~~~~~~~~d~--g~GDtV~InsIg~~tV~dY~~~~~i~~d~--- 71 (322) T protein:vir:31 1 MSTGNNTSNTQAL----IVSEIWADEIEDILHEKLLDVNIARVVDF--PDGDKLTIPSVGTPVVRSRPEQGDFTFDN--- 71 (322) T ss_pred CCCCCCcccceEE----eehhhhHHHHHHHhhhhhhhhhhhccccc--CCCCeEEeccccccccccccCCCCccccc--- Confidence 3333332222333 33447988888888888888888775443 46999999876555433333333333321 Q ss_pred cCCcccccccccccccccccccccccccccccceeeeeEEEEeee--eeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 83 AGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTE--YGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 83 ~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~q--yG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+..+++-.|-| |=.|. ++|+ . -+.--+|......++. .+. T Consensus 72 ---------------------------------ltt~~~~l~IDq~KYfaf~-VdDD-~-~Qa~~dl~~~~~~~aa-~al 114 (322) T protein:vir:31 72 ---------------------------------LDTGEISIILRDEVYAGNA-ISKK-L-RQDSRWISNVGAMLPA-EQA 114 (322) T ss_pred ---------------------------------CCCceEEEEEehhhhhccc-cchh-H-HHhhhhHHHHHHHHHH-HHH Confidence 122234444555 65554 5552 2 2333345554333333 333 Q ss_pred hHHHHHHHHHHhc-cCceEEecCCCc-cceee-ecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 161 EITEDLLQADILA-SADVKVFTGAAT-SMVTM-TGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 161 ~~ted~l~~~ila-g~~~v~yag~at-s~~~~-t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .-..|.-..++|+ |+. .+++..+ +.++- ...--....+....++.|+++...|++.+.|+ T Consensus 115 a~~~D~fva~lL~~gA~--~~~~~~~p~vin~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~--------------- 177 (322) T protein:vir:31 115 RAIMERYQTDLLALGNA--QFAGQNDPNVINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPM--------------- 177 (322) T ss_pred HHHHHHHHHHHHHHHhh--hhhccCCcceecCCccceeccCCCchhhHHHHHHHHHHhccccCCC--------------- Confidence 4455666666544 221 1111111 00000 00000011134578899999999999999994 Q ss_pred ccceEEEEEcccchHHHHHHhc---ccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhh---hhcCCCcccC Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVD---SLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMH---YAGAGATATA 311 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d---~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~---~~~aGa~~~~ 311 (405) ..|+++|.|.....|..+.. ..+|+-|..+.+.|..+.+- =||++.| |++++|-.+-. =.-+|.+.. T Consensus 178 --~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~~~G--F~V~~SN~l~~~~~~i~aG~d~~- 250 (322) T protein:vir:31 178 --GGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRSVYG--IDLFVSNLLADANETINAGGDAR- 250 (322) T ss_pred --CCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHHHhc--eeeeeeccccccccccccCcccc- Confidence 35999999999887755422 25689999888888754432 3899977 99999987620 001222221 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecC----CCCCCCCCCccchhhhHHHHHHHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKP----GEATADRNDPYGKVGFSSIKFFYG 387 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~p----G~~tad~~DPlgQrg~~gwK~~~~ 387 (405) .+++++|-.|+.+- + +++.++ -.-+|+. ++ .-+..+-.+-||.+ .|+ T Consensus 251 ---------~t~ag~~n~f~~~~---~-~~~~~~-----------~~~~~~l~~~e~~-r~~~~~~d~~~~~~----~~g 301 (322) T protein:vir:31 251 ---------STTAGKCNMFMNVS---D-MGLLPF-----------VVAWKEMPTTKSF-IDDYNDDLNTATTA----RWG 301 (322) T ss_pred ---------cccceeeccccccc---c-hhhhhh-----------hhHhhhhhhhhcc-cCccccccceeeee----eec Confidence 23345543443211 1 111111 1111111 11 11334445556654 679 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.++|+|=++.++.-|-- T Consensus 302 ~g~~r~e~l~~~~a~~~~ 319 (322) T protein:vir:31 302 NGLVRDENLVCVLANADK 319 (322) T ss_pred ceeecccceEEEEecccc Confidence 999999999998876655 No 64 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=98.53 E-value=4.2e-08 Score=61.02 Aligned_cols=301 Identities=13% Similarity=0.102 Sum_probs=149.9 Q ss_pred ccccee---ehhhhhHHHHHhhhhhhhhcccccccc----CcCCCCEEEEEecccCCCCCCcccc--CCCcccccccCCc Q lcl|NC_020862. 16 VGPQFN---VHYWDRKSLIDEAEEMFFSPLADNKQM----PKHFGKELKVFYYVPLLDDLNVNDQ--GLDATGASYAGGN 86 (405) Q Consensus 16 v~~qm~---t~y~~~k~L~~a~p~lv~~~fA~~~~m----PKn~GktIkfrry~pl~~~~t~l~e--Gvtp~g~~~~~gn 86 (405) +.+++. +--|.+++|.--++.||+.++....-- -.++|.||++|+..++.....+... |+++. T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~-------- 72 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKN-------- 72 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccC-------- Confidence 333332 224888999888899998886543221 2468999999964444332221111 12221 Q ss_pred ccccccccccccccccccccccccccccceeeeeEEEEeeeee-eeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHH Q lcl|NC_020862. 87 LYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYG-FFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITED 165 (405) Q Consensus 87 ly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG-~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted 165 (405) .|+| ..++.+|.|.= +-.+++|+...++.+ ++-+ .++.|..---+ T Consensus 73 ----------------~l~e------------~~v~l~id~~k~va~~v~d~E~~~~i~-~~~~-----~l~~A~~aLA~ 118 (423) T protein:vir:17 73 ----------------NLIS------------GKATGRVGNYITVAVEYQQLEEAIKLN-QLEE-----ILAPVRQRIVT 118 (423) T ss_pred ----------------cccc------------ceeEEEeeceeeeeeeecHHHHhcChh-HHHH-----HHHHHHHHHHH Confidence 1222 23445665443 345899976544443 2222 23333221122 Q ss_pred HHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEE Q lcl|NC_020862. 166 LLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAY 245 (405) Q Consensus 166 ~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~ 245 (405) .+-.+|.+-. +..+... .+...++. + .++++.++.+.|+++++|+ .-|+++ T Consensus 119 ~vd~~ia~~~----~~~a~~~----~gt~~t~~--~--a~~~i~~a~~~Ld~~~vP~-----------------~~R~~V 169 (423) T protein:vir:17 119 DLETELAHFM----MNNGALS----LGSPNTPI--T--KWSDVAQTASFLKDLGVNE-----------------GENYAV 169 (423) T ss_pred HHHHHHHHHH----hhccccc----cccCCccc--c--cHHHHHHHHHHHHhccCCc-----------------CCCEEE Confidence 2222332211 0000000 01111111 1 3899999999999999995 138889 Q ss_pred EcccchHHHHHHhcccCCCcceehhhcCCcccccCcce-eEecCCcEEEEeCcchhhhh-cC--------------CCcc Q lcl|NC_020862. 246 IGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEI-GAIPGAHLRIVVVPQMMHYA-GA--------------GATA 309 (405) Q Consensus 246 ~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEI-Gsi~g~n~Rfv~~p~~~~~~-~a--------------Ga~~ 309 (405) +.|+....|.. ++.+.-..+-+..+.+-+++| |++.| |++.++..+-.-. +. ++++ T Consensus 170 v~p~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~G--Fdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~ 241 (423) T protein:vir:17 170 MDPWSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGG--IRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAV 241 (423) T ss_pred eChHHHHHHhc------cccceecccccchHHHhhccceeeecc--eEEEEeCCCccccccceeceeeeccccccccccc Confidence 99999888753 344554556666777888888 99966 8988887665321 11 0000 Q ss_pred cCCC-----------cc----------------------------------cc--cc-----cccCCcceeeeEE----- Q lcl|NC_020862. 310 TAAN-----------RG----------------------------------YQ--VS-----DVAGTDKYDIAPL----- 332 (405) Q Consensus 310 ~~t~-----------~~----------------------------------~~--~~-----~~~g~~~~DVYp~----- 332 (405) .+++ .. .+ +. ...+...+.+||. T Consensus 242 ~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~ 321 (423) T protein:vir:17 242 KDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDT 321 (423) T ss_pred ccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceEEEecCcccccc Confidence 0000 00 00 00 0112223444442 Q ss_pred -------------------------------EEEccccce--eecceeccCC------CCCCceEEEecCCCCCCCCCCc Q lcl|NC_020862. 333 -------------------------------LVVGDQAFA--TIGLQGMSGK------GKSKFRIIVKKPGEATADRNDP 373 (405) Q Consensus 333 -------------------------------lV~G~~Afg--~i~l~g~~~~------g~~~~~~ivk~pG~~tad~~DP 373 (405) |++.++||+ +.+|. +.+. ...++.+.|-.- .|. T Consensus 322 ~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~-~~~~~~~~~~~~~g~s~r~~~~-------~d~ 393 (423) T protein:vir:17 322 TNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLP-KLHSIDSAVATYEGFSIRVHKY-------ADG 393 (423) T ss_pred CCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEccc-CCCccceeecccCCcEEEEEEe-------ccc Confidence 455555554 23332 1111 011222222211 122 Q ss_pred cchhhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 374 YGKVGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 374 lgQrg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) -...-.+.|=.+|++..|++||..|+ ...| T Consensus 394 ~~~~~~~r~d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:17 394 DANVQKMRFDLLPAYVCFNPHMGGQF-FGNP 423 (423) T ss_pred ccceeEEEEEeecceeeeccceEEEE-EecC Confidence 22222344556799999999998777 5666 No 65 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.52 E-value=2.8e-08 Score=62.02 Aligned_cols=281 Identities=20% Similarity=0.181 Sum_probs=147.7 Q ss_pred cccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEE Q lcl|NC_020862. 44 DNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTG 123 (405) Q Consensus 44 ~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~ 123 (405) .+|+| ..||+++|-|=-..- ....+.|-.+.|.. .|+ .-.+..- T Consensus 1 ~vr~i--~~g~s~~~~~iG~~~--~~~~~~G~~l~~~~----------~~~----------------------~~~e~~i 44 (324) T protein:vir:99 1 MTRTI--TSGKSAQFPVMGRTK--ARYLKQGQSLDDGR----------EDI----------------------KHTEKVI 44 (324) T ss_pred Ceeee--ecCceEEEeeeeeeE--eccccCCCCcCCCc----------CCc----------------------CcccEEE Confidence 44544 348888887542211 11122232222100 000 1111122 Q ss_pred EeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHHHHHHhcc--------CceEEecCCCccceeeecccc Q lcl|NC_020862. 124 TLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLLQADILAS--------ADVKVFTGAATSMVTMTGEAA 195 (405) Q Consensus 124 ~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l~~~ilag--------~~~v~yag~ats~~~~t~~~~ 195 (405) +|-|+=-|-.+=|+.-+.+.+-|+..+++.++...-+......+-+.+..+ ..++.-.|... ....++... T Consensus 45 tID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~-~~~~~~~~~ 123 (324) T protein:vir:99 45 TIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAAS-LVKITGKKE 123 (324) T ss_pred EecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccc-eeccccccc Confidence 333332222222223344555578888777777655554443332222211 11222222211 122222222 Q ss_pred cccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCc Q lcl|NC_020862. 196 DAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADA 275 (405) Q Consensus 196 ~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~ 275 (405) .+..+..--++.|+.+...|+++..|. .-+++++.|+.-..|.+ ..+.-...|+.- T Consensus 124 ~~~~~~~~~~dai~~a~~~Lde~~VP~-----------------~gR~~vv~P~~y~~Ll~-------~~~~~~~~~~~~ 179 (324) T protein:vir:99 124 DPAKYGTQVIQALTYARAAFAKKYIPA-----------------GDRTFYTDPDTYSAILA-------ALMPNAANYAAL 179 (324) T ss_pred ccccCHHHHHHHHHHHHHHHhhcCCCC-----------------CCCEEEeChHHHHHHhh-------cccccccccccc Confidence 111111122678889999999999983 24899999999887753 235556688888 Q ss_pred ccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCC-ccc---ccccccCCcceee----eEEEEEccccceeeccee Q lcl|NC_020862. 276 ATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAAN-RGY---QVSDVAGTDKYDI----APLLVVGDQAFATIGLQG 347 (405) Q Consensus 276 ~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~-~~~---~~~~~~g~~~~DV----Yp~lV~G~~Afg~i~l~g 347 (405) +.+-+|.||+|.| |++++++++-.-.+.... .+.+ .+- .........+|++ =.-|+|=++|-+++-++. T Consensus 180 ~~~~~G~V~~i~G--f~V~~Sn~lp~~~~t~~~-~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~ 256 (324) T protein:vir:99 180 IDPETGNIRNVMG--FEVVETPHMTAQMVTNPT-DAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKD 256 (324) T ss_pred cceecceEEEEec--eEEEecCCcccccccccc-ccccccccccccccccccccccccccCceeEEEEehhheEEEeeec Confidence 8899999999966 999999988743221100 0000 000 0001111122222 123677777777777653 Q ss_pred ccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEE-------EecCC Q lcl|NC_020862. 348 MSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAY-------SVIPE 405 (405) Q Consensus 348 ~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie-------~~a~~ 405 (405) ...+ ..-|+--|.-++==|..|++.+||++..+.|| .+.|| T Consensus 257 ~~~e-----------------~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~ 304 (324) T protein:vir:99 257 MALE-----------------RARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPD 304 (324) T ss_pred ceec-----------------ceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccch Confidence 2111 22466667777777899999999999999999 45566 No 66 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.52 E-value=3.1e-08 Score=61.77 Aligned_cols=295 Identities=11% Similarity=0.017 Sum_probs=151.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) .......-..+++++-+.- -+..+....+....+..++.+++...+||...|+....++... +. .....||-.... T Consensus 103 ~~~e~~a~~~~~~~~gg~~-vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~-~~-~~~v~e~~~~~~- 178 (404) T protein:vir:10 103 SEKEINAISENIDEDGGYA-VPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQ-KP-MKPLSENQQIPT- 178 (404) T ss_pred hhhHHhhhccccCCCCcee-echhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCC-cc-eeeccccccccc- Confidence 1111111111112221211 2223344555455567789999999999999987554443221 11 111222211000 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) + ..-.++..|+.+.++++.++.+|+++ +.|+..++...+..++.+..+ T Consensus 179 -----------------~--------------~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~ 226 (404) T protein:vir:10 179 -----------------N--------------GDNGKLERFNFKLKDLADFMSIPNDL-LKFADKSLEDWIINWFVDKVR 226 (404) T ss_pred -----------------c--------------ccccceeeeEeeheeeEeeehhhHHH-HhhcHHHHHHHHHHHHHHHHH Confidence 0 00124667789999999999999985 456666788887777766555 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) ...+ ..|++|.+.-.-.++-.... ...+....+..+++++..+.. .|+....+ T Consensus 227 ~~~~----~~il~G~g~~~~~~gi~~~~----~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~------------------ 280 (404) T protein:vir:10 227 ITRN----AEILYGAGGDEHATGIMTAN----KFKKITLPKSPALKDFKKCKNVELLNVFKA------------------ 280 (404) T ss_pred HHHH----HHHhhcCCCCCcccceeecc----ccceeeccccccHHHHHHHHHhhhhccccC------------------ Confidence 4333 34566643211111000000 001111224467788877654 44433322 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) .-+.+|||.....|+.|+|-.++|-|.|-. ..+..+.|-|. .+++++..++=. + T Consensus 281 -~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~--------~~~~~~~l~G~--PV~~~~~~~~~~---~------------ 334 (404) T protein:vir:10 281 -TSSWIVNQDGFNYLDSLEDKTGRPYLQPDP--------KDPTQYRFLGL--PVIELPNDLLLS---T------------ 334 (404) T ss_pred -CCEEEEcHHHHHHHHHhhccCCceeeccCc--------CCCCCccccce--eeEEecccccCC---C------------ Confidence 123589999999999999988888887532 23344566553 333333332100 0 Q ss_pred cccCCcceeeeEEEEEccccc-eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAF-ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Af-g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) .+ -. .+++|+-+- -.+..+ ..+++.+..- ....-+++...|+ +++.+.+++++-+ T Consensus 335 ----~~---~~-~~~~gd~s~~~~~~~~-------~~~~i~~~~~-------~~~~~~~~~~~~~~~~r~d~~v~~~~a~ 392 (404) T protein:vir:10 335 ----ES---AI-PVLLGDTKEAYKYVSD-------GAYELATTNI-------GAGAFETNTTKARIIMRIDGNVKDSEAL 392 (404) T ss_pred ----CC---cc-EEEEEeccccEEEEEe-------cceEEEEecc-------ccchhhcCceEEEEEEeeccEEecccce Confidence 01 11 246786542 223322 1244444321 1122345656655 6788899999999 Q ss_pred EEEEEecC--C Q lcl|NC_020862. 397 AVAYSVIP--E 405 (405) Q Consensus 397 arie~~a~--~ 405 (405) +.++..+. + T Consensus 393 ~~~~~~~aa~~ 403 (404) T protein:vir:10 393 LIAEIPVESVQ 403 (404) T ss_pred EEEEeecccCC Confidence 87765443 3 No 67 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.51 E-value=3.3e-08 Score=61.59 Aligned_cols=296 Identities=9% Similarity=0.013 Sum_probs=163.6 Q ss_pred ccccCc----CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 3 HIYNDP----AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 3 ~~y~~~----~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) |=||-. ..+.++..+.-+-+.+ ..+.+.++.+.-.+.+++...+|+.+ ++++.+...-+.+ ..-.+ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~-~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~~~~a------~wv~E 70 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQ-AKDYFAEAEKTSIVQRVAQKIPMGAT---GIVIPHWTGDVSA------QWIGE 70 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhH-HHHHHHHHHhccchhhhcceeeccCC---ceEEEEEcCCcce------EEecC Confidence 334332 1111222222222323 35666777777788889999888743 3454443221111 01112 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) | ........++..|+.++++++.++.+|++++. |+..++..+|..++.+. T Consensus 71 g-----------------------------~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~a 120 (397) T protein:vir:23 71 G-----------------------------DMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATA 120 (397) T ss_pred C-----------------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHH Confidence 2 22222234677789999999999999999544 66667888877777754 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) .+...+. .+++|.+.-.-.+....... .........+++++..+...|..+..+. T Consensus 121 ia~~~d~----a~l~G~gt~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~l~~~~~~~---------------- 175 (397) T protein:vir:23 121 IAMAFDN----AALHGTNAPSAFQGYLDQSN-----KTQSISPNAYQGLGVSGLTKLVTDGKKW---------------- 175 (397) T ss_pred HHHHHHH----HHhhcccCCccccccccccc-----ceeeecccchhHHHHHHHHhhhhcccCC---------------- Confidence 4443333 33454432111111101000 0111123466777777777777665431 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) -..++|+.....|+.++|..+.|-|.|..+-+.... .-.|.+-| +..+.++.|. +|. T Consensus 176 ---a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~---~~~~tl~G--~Pv~~s~~~~----~g~----------- 232 (397) T protein:vir:23 176 ---THTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTP---FREGRILG--RPTILSDHVA----EGD----------- 232 (397) T ss_pred ---CEEEEcHHHHHHHHHhhccCCceeeccccccccccc---ccCceeee--eeEEEeCCCC----CCc----------- Confidence 235899999999999999999999998655444432 23355633 5666666542 111 Q ss_pred ccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEe-cCCCC--CCCCCCcc--chhhhHHHH--HHHHHhhc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVK-KPGEA--TADRNDPY--GKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk-~pG~~--tad~~DPl--gQrg~~gwK--~~~~~~iL 391 (405) ..+++|+-+...++..+ .+.+-+. ..... ....++|+ -|++.+.|+ +++.+.++ T Consensus 233 ------------~~~~~gDfs~~~i~~~~-------~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~ 293 (397) T protein:vir:23 233 ------------VVGYAGDFSQIIWGQVG-------GLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLIN 293 (397) T ss_pred ------------eEEEEeecceEEEEEEe-------ceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeecccee Confidence 13456654444444431 1222111 11100 01223444 477778887 68899999 Q ss_pred cccceEEEEEecCC Q lcl|NC_020862. 392 RGERIAVAYSVIPE 405 (405) Q Consensus 392 ~~~~marie~~a~~ 405 (405) +++-+++++...-| T Consensus 294 ~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 294 DVNAFVKLTFDPVL 307 (397) T ss_pred cccceEEEeecccc Confidence 99999999986666 No 68 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.50 E-value=5.1e-08 Score=60.56 Aligned_cols=322 Identities=18% Similarity=0.150 Sum_probs=158.9 Q ss_pred CCccccCcCC-Cccccccc-----ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHIYNDPAA-GDASTVGP-----QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~~~~~-t~~~~v~~-----qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |.+. +-... |.+..-++ .|..--|.-+.+..-+..-++..+-..+++ ..|++++|-|--+.. .++ T Consensus 1 m~~~-~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i--~~G~sv~i~~iG~~t------v~~ 71 (347) T protein:vir:94 1 MANV-PGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTI--QNGKSAQFPVMGRTS------GVY 71 (347) T ss_pred CCCC-CccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccc--cccceEEEeccccee------eee Confidence 4332 11111 11111000 011111122222211222233334444443 358999997654432 233 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) .+| |+.+ .| |+...+-.+++-+|-|+=-+-.+-|..-+.+.+-|+..+++.+ T Consensus 72 ~t~-G~~l-----~~----------------------~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~ 123 (347) T protein:vir:94 72 LAP-GERL-----SD----------------------KRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQ 123 (347) T ss_pred ecC-CCCc-----CC----------------------CCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHH Confidence 333 2221 00 1111123344455556554444444444455556788887777 Q ss_pred HHHHHhhHHHHHHHH--HHhccC---ceEEecCCCccceeeecccc---cccCCceecHHHHHHHHHHHHhccCccccce Q lcl|NC_020862. 155 MLRGANEITEDLLQA--DILASA---DVKVFTGAATSMVTMTGEAA---DAEDDGLITLKDLKRLSITLTDNYTPKKTTI 226 (405) Q Consensus 155 ll~~~~~~ted~l~~--~ilag~---~~v~yag~ats~~~~t~~~~---~~~~n~~it~~~lr~~~~~Lk~nrApk~T~i 226 (405) +...-+......+-+ ..+++. ....-+|--...+...+..+ ++..+..=-++.|+++.+.|+++..|. T Consensus 124 ~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~---- 199 (347) T protein:vir:94 124 LGEALAIAADGAVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPA---- 199 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC---- Confidence 775555444333322 222221 11111111111110000000 110000111577889999999999984 Q ss_pred eccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 227 IKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 227 i~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aG 306 (405) .-|++++.|+....|.+ ++.|... .|+....+.+|.||++.| |+++++++|-. .+.+ T Consensus 200 -------------~~R~~vv~P~~~~~Ll~------~~~~~~~-~~~~~~~~~~G~Vg~i~G--~~V~~Sn~lp~-~~~t 256 (347) T protein:vir:94 200 -------------GDRYFYTTPDNYSAILA------ALMPNAA-NYAALIDPETGNIRNVMG--FVVVEVPHLVQ-GGAG 256 (347) T ss_pred -------------CCcEEEeCHHHHHHHhc------cchhhhh-hccccccccccceEEEec--eEEEecCcccc-cccc Confidence 24899999999888842 4556654 688888888999999966 99999998752 2222 Q ss_pred CcccCCCcccccccc-------cCCcce----eeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccc Q lcl|NC_020862. 307 ATATAANRGYQVSDV-------AGTDKY----DIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYG 375 (405) Q Consensus 307 a~~~~t~~~~~~~~~-------~g~~~~----DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlg 375 (405) ..+ .+.++.+... ...++| +---.|+|=++|-+++-++....+ .--|+-- T Consensus 257 ~~~--~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e-----------------~~r~~~~ 317 (347) T protein:vir:94 257 ETR--GDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALE-----------------RDRDVDA 317 (347) T ss_pred ccc--ccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccccccc-----------------chhchhh Confidence 111 1111111110 001111 122456666677666655421111 2245555 Q ss_pred hhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 376 KVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 376 Qrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) |.-.+==|..|++.+||++..+.||..+-| T Consensus 318 ~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 318 QGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred HHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 566666689999999999999999999999 No 69 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.50 E-value=4e-08 Score=61.16 Aligned_cols=279 Identities=13% Similarity=0.070 Sum_probs=154.2 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCccc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLY 88 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly 88 (405) =+++.+.+-|+ .+ ..+.+..+.+...+.+++...+|+.+. +++-+...-+.+ .-.. +|+++. T Consensus 1 ma~~gG~lvp~---~~-~~~ii~~~~~~s~i~~l~~~~~~~~~~---~~ip~~~~~~~a-~~v~-----E~~~~~----- 62 (298) T protein:vir:16 1 MVLNKGTLFDP---TL-VTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTMDSEI-DVVA-----ESGKKT----- 62 (298) T ss_pred CcccCcceech---hH-HHHHHHHHHhhhhhhhhcceeeccCCc---eEEEEEecCcce-EEec-----CCcccc----- Confidence 23444555543 22 355566666778889999988887533 333332221111 1122 222221 Q ss_pred ccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc--cchHHHHHHHHHHHHhhHHHHH Q lcl|NC_020862. 89 GGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD--SDLYGHLSREMLRGANEITEDL 166 (405) Q Consensus 89 ~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d--~~l~~~~~~ell~~~~~~ted~ 166 (405) ....++..++.+.++++.++.+|++++....| .++.+.+..++.+..+.- T Consensus 63 ------------------------~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~---- 114 (298) T protein:vir:16 63 ------------------------HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARG---- 114 (298) T ss_pred ------------------------ccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHH---- Confidence 11135667889999999999999997644333 356676666665443332 Q ss_pred HHHHHhccCc--------eEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 167 LQADILASAD--------VKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 167 l~~~ilag~~--------~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) +-..+++|.+ .+-.++......+... ....+.-.+.+|.++...|..++.+. T Consensus 115 ~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~---------------- 174 (298) T protein:vir:16 115 IDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE----APRGIADPNGAIENAVELLTGVDADV---------------- 174 (298) T ss_pred HHHHhhccccCCCCcccccccccccccccccccc----cccccccHHHHHHHHHHHhhhcCCCc---------------- Confidence 2233344411 1111111100000000 00112233678889988888776541 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) -..++||.+...|+.|+|..+.|-|.+.. ..+.-|+|-| +..+.++.+.. +. T Consensus 175 ---~~~vmn~~~~~~l~~lkd~~G~~i~~~~~--------~~~~~~~l~G--~PV~~~~~v~~----~~----------- 226 (298) T protein:vir:16 175 ---TGIAINPSFRSALAKQKDLQDNALFPELK--------WGATPDTING--LPVDVNKTVSD----MS----------- 226 (298) T ss_pred ---cEEEEcHHHHHHHHHhhccCCCeeecCcc--------cCCCCceecc--eeeEEeccccc----cc----------- Confidence 13567999999999999988888887543 3444466755 45666654431 00 Q ss_pred ccccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCc------cchhhhHHHH--HHHHHh Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDP------YGKVGFSSIK--FFYGFI 389 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP------lgQrg~~gwK--~~~~~~ 389 (405) +.++. .+++|+-+-+ .++++. .+++-+..- .|| |-|++.++|+ +++.+. T Consensus 227 ----~~~~~----~~~~GDfs~~~~~~~~~-------~~~~~~~~~-------~~~~~~~~~~f~~~~v~~ra~~r~d~~ 284 (298) T protein:vir:16 227 ----LTQRD----RAIIGDFANGFKWGYAK-------EVPLEVIQY-------GDPDNSGLDLKGYNQVYIRAELFLGWG 284 (298) T ss_pred ----CCCcc----EEEEeeccceEEEEEec-------CceEEEeec-------cCCcCcchhhhhcCcEEEEEEEEEccE Confidence 11221 3677875432 345431 233333332 233 3466777777 478899 Q ss_pred hccccceEEEEEec Q lcl|NC_020862. 390 KLRGERIAVAYSVI 403 (405) Q Consensus 390 iL~~~~marie~~a 403 (405) +++++-+++|+-+- T Consensus 285 v~~~~a~~~l~~at 298 (298) T protein:vir:16 285 ILDATKFARVTEAN 298 (298) T ss_pred eecccceEEEeecC Confidence 99999999998777 No 70 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.50 E-value=6.3e-08 Score=60.07 Aligned_cols=323 Identities=16% Similarity=0.129 Sum_probs=171.8 Q ss_pred CCcc-ccCcCCCcccc--c-c-c-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHI-YNDPAAGDAST--V-G-P-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~-y~~~~~t~~~~--v-~-~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) |-++ +..--.+.... - + + .+...-|..+.+..-+..-+|..+-..+++ ..||+++|.|.-.....- .+.| T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i--~~G~sv~~~~iG~~~~~~--~~~g 76 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTI--QNGKSASFPVMGRTKGYY--LAPG 76 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccc--cCcceEEEeeecceeeee--eccc Confidence 4321 11110011111 0 1 1 122233455555555555566666666654 469999999877654421 1122 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) -.+.+ +-.|+ ...+++-+|-|+=-+-.+=|+.-..+.+-|+..+++.+ T Consensus 77 ~~l~~----------~~~~~----------------------~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~ 124 (347) T protein:vir:88 77 ENLDD----------KRKDI----------------------KHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQ 124 (347) T ss_pred cCCCC----------CCCCC----------------------ccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHH Confidence 11111 00111 22344566666544433323333445555788888888 Q ss_pred HHHHHhhHHHHHHHHHHhccCc-----eEEecCCCc-cceeeecc--cccccCCceecHHHHHHHHHHHHhccCccccce Q lcl|NC_020862. 155 MLRGANEITEDLLQADILASAD-----VKVFTGAAT-SMVTMTGE--AADAEDDGLITLKDLKRLSITLTDNYTPKKTTI 226 (405) Q Consensus 155 ll~~~~~~ted~l~~~ilag~~-----~v~yag~at-s~~~~t~~--~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~i 226 (405) +...-+......+.+.+..++. .-..+|..+ ..+++... ...+..+...-++.|+++.+.|++++.|. T Consensus 125 ~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~---- 200 (347) T protein:vir:88 125 LGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA---- 200 (347) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC---- Confidence 8877666655444444433221 111222111 11111111 11122222334788999999999999984 Q ss_pred eccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 227 IKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 227 i~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aG 306 (405) .-+++++.|+.-.+|.+ ++.|.. ..|.+...+-+|.||++.| |+++++|++- ....| T Consensus 201 -------------~gR~~vv~P~~y~~Ll~------~~~~~~-~~~~~~~~~~~G~vg~i~G--~~V~~s~nlp-~~~~~ 257 (347) T protein:vir:88 201 -------------GDRRFYCAPEDYSAILS------ALMPNA-ANYAALIDPETGNIRNVMG--FEVIEVPHLT-VGGAG 257 (347) T ss_pred -------------CCCEEEeCHHHHHHHhc------chhhhh-hhhccccchhcceeeeecc--ceEEEeeccc-ccccc Confidence 24889999998888853 345654 4888777888999999977 8999999884 21111 Q ss_pred CcccCCCccccccc------ccCCcceee----eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccch Q lcl|NC_020862. 307 ATATAANRGYQVSD------VAGTDKYDI----APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGK 376 (405) Q Consensus 307 a~~~~t~~~~~~~~------~~g~~~~DV----Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQ 376 (405) ....+ .++..+. ..-.++|.. ---||+-..|-|++-++... +. .--||--| T Consensus 258 ~~~~~--~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~----------~e-------~~r~~~~~ 318 (347) T protein:vir:88 258 DNNPA--DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA----------LE-------RARRPEFQ 318 (347) T ss_pred ccccc--ccccccccccccccccccccccccCcEEEEEechhhhhheecccce----------ee-------eeechhhH Confidence 11100 0110000 000111111 12366667777776664211 11 12455566 Q ss_pred hhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 377 VGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 377 rg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .-.+==|..|++.+||++..+.|++-+.- T Consensus 319 ~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 66677789999999999999999987777 No 71 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.49 E-value=3.6e-08 Score=61.37 Aligned_cols=283 Identities=10% Similarity=0.030 Sum_probs=154.1 Q ss_pred CCccccCc---CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIYNDP---AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y~~~---~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) +..-..+. -++.+++-+.-.-+..+.++.+....+...+.+++...+|+.+.|+....+. ....... +..+ T Consensus 98 l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~a-----~~v~ 171 (397) T protein:vir:49 98 VRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKW-TDITGLA-----NIDD 171 (397) T ss_pred HhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEee-ccCCcce-----eeec Confidence 11111111 0111111111122224445544444566788899999999999987553332 1111111 1222 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .|+.+.. ....++..|+.++++++.++.+|+++ +-|++.++...+..++.+ T Consensus 172 E~~~~~~----------------------------~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~ 222 (397) T protein:vir:49 172 EAGKIAD----------------------------VDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAWLSGWIAK 222 (397) T ss_pred Ccccccc----------------------------ccccceeeEEeeeeeEEeeehhHHHH-HhhhHHHHHHHHHHHHHH Confidence 2222211 11235667789999999999999985 456666777777777665 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) ..+. .++ ..|++|.+... ..-+..++++|.++...|+.+..+. T Consensus 223 ~~~~-~~d---~ai~~G~g~~~------------------~~~~~~~~d~i~~~~~~l~~~~~~~--------------- 265 (397) T protein:vir:49 223 KVVV-TRN---KAILEAIAALP------------------TKPTLTKWDDIIDLEAKVDPAIKQT--------------- 265 (397) T ss_pred HHHH-HHH---HHHHhhccccc------------------cccccccHHHHHHHHHhhhhhhcCC--------------- Confidence 4433 233 34566543211 1124578999999999998765431 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) =+.++|+.+...|+.|+|..+.|-|.|- +..+.-+.|.|- .+++++. .|...++. T Consensus 266 ----a~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~--PV~~~~~--~~~~~~~~--------- 320 (397) T protein:vir:49 266 ----SFFLTNTSGFTALKKVKNALGDYLMERD--------VKSPTGYSIDGF--AVKEVAD--RWLANGTG--------- 320 (397) T ss_pred ----CEEEEcHHHHHHHHHhhcCCCceeeccC--------cCCCCCceecce--eeEEecc--cccccccC--------- Confidence 1357899999999999998888888652 234555677663 3333221 12212111 Q ss_pred cccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcccc Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGE 394 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~ 394 (405) +-. .+++|.-. |-.+..+ ..+++-+..- .+-+-+.+.+.++ ..+.+.+++++ T Consensus 321 ----------~~~-~i~~gd~~~~~~~~~~-------~~~~i~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~~ 375 (397) T protein:vir:49 321 ----------GAM-PLYFGDLKQAVTLFDR-------QHMSLLSTNI-------GGGAFETDTTKVRVIDRFDVVATDTE 375 (397) T ss_pred ----------Cce-eEEEeeccceEEEEee-------cceEEEEecc-------ccchhhcCceeEEEEeeeCcEEeccc Confidence 112 24567433 2222222 1233333321 2233344555554 57789999999 Q ss_pred ceEEEEEecCC Q lcl|NC_020862. 395 RIAVAYSVIPE 405 (405) Q Consensus 395 ~marie~~a~~ 405 (405) -++.++..+.- T Consensus 376 a~~~~~~~~~~ 386 (397) T protein:vir:49 376 AFVPASFKAIA 386 (397) T ss_pred ceEEEEeeccc Confidence 99998865533 No 72 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.49 E-value=9.2e-08 Score=59.15 Aligned_cols=302 Identities=10% Similarity=0.011 Sum_probs=153.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) |. -+.. .+....|..+ + ..+.+....+..++.+++.+.+|+.+ .+++-+...-+.+ .-..| T Consensus 1 Ma--~~~~-~~gg~~vP~~----~-~~~ii~~l~~~s~i~~l~~~i~~~~~---~~~ip~~~~~~~a-~wv~E------- 61 (315) T protein:vir:80 1 MA--DDFL-SAGKLELPGS----M-IGAVRDRAIDSGVLAKLSPEQPTIFG---PVKGAVFSGVPRA-KIVGE------- 61 (315) T ss_pred CC--CCcC-CcCceEcchH----H-HHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcce-EEeeC------- Confidence 32 1111 1112222223 2 45556666677889999999988754 2444333221111 11122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |+.+.....++.+++.+.++++.++.+|++++ -++..+....|...+.+..+ T Consensus 62 ---------------------------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell-~~s~~~~~~~l~~~i~~~la 113 (315) T protein:vir:80 62 ---------------------------GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFM-WADADYRLGVLQDLISPALG 113 (315) T ss_pred ---------------------------CccccccccceeeeEeeeeeEEeeehhhHHHh-hcCchhHHHHHHHHHHHHHH Confidence 22222223467778899999999999999954 34444443333332222211 Q ss_pred hHHHHHHHHHHhccCceEE---ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKV---FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~---yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) +-.-..+-..+++|.+..- -.|..++.. ..+........++.+|.++...|..+.... T Consensus 114 ~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~--------------- 174 (315) T protein:vir:80 114 ASIGRAVDLIAFHGIDPATGKAASAVHTSLN----KTKNIVDATDSATADLVKAVGLIAGAGLQV--------------- 174 (315) T ss_pred HHHHHHHhhheeeccCCCCCccccccccccc----cccceeeccccchHHHHHHHHHHhhccCcc--------------- Confidence 1111122233455532110 001001100 000011112245678888876665443211 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) .-.-++||.....|+.|++.-+.|.|-.- -| . .+..+.-|+|-| +.++.++.|-.-.+.+ T Consensus 175 ---~~~~imn~~~~~~L~~l~~~~g~~~~g~~-~~--~-~~~~g~~~tl~G--~PV~~~~~~~~~~~~~----------- 234 (315) T protein:vir:80 175 ---PNGVALDPAFSFALSTEVYPKGSPLAGQP-MY--P-AAGFAGLDNWRG--LNVGASSTVSGAPEMS----------- 234 (315) T ss_pred ---ceEEEEcHHHHHHHHHHhhccCCcccccc-cc--c-ccccCCCceecc--eeeEecCcCCcccccc----------- Confidence 12356899999999999876665555421 11 1 123555678866 5667776554211110 Q ss_pred cccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) ..+++ .+++|+-+...+++.+ .+.+-+..-+. .......|-|++.+.|| +.+++.+.+++- T Consensus 235 -----~~~~~----~~~~GDfs~~~~g~~~-------~~~i~i~~~~~-~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a 297 (315) T protein:vir:80 235 -----PASGV----KAIVGDFSRVHWGFQR-------NFPIELIEYGD-PDQTGRDLKGHNEVMVRAEAVLYVAIESLDS 297 (315) T ss_pred -----ccccc----EEEEeecccEEEEEec-------CeeEEEecccc-ccCcccchhhcCcEEEEEEEEecceeecccc Confidence 01221 3577887766666642 23333333221 11223457788888888 778999999999 Q ss_pred eEEEEEec-CC Q lcl|NC_020862. 396 IAVAYSVI-PE 405 (405) Q Consensus 396 marie~~a-~~ 405 (405) +++++.++ |- T Consensus 298 ~~~l~~~~a~~ 308 (315) T protein:vir:80 298 FAVVKEKAAPK 308 (315) T ss_pred eEEEeeccCCC Confidence 99999766 44 No 73 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.48 E-value=5.7e-08 Score=60.32 Aligned_cols=289 Identities=12% Similarity=0.107 Sum_probs=153.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.|+- ..++.++-+.-.-+..+.++.+....+.-++.+++...+|+.+ ++++-+...-+.+ .... .|. T Consensus 20 ~~~~~~a-~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~~~~a-~~v~-----Eg~ 89 (324) T protein:vir:93 20 KPQVFNP-DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT---EKKFTFWADKPGA-YWVG-----EGQ 89 (324) T ss_pred hhhhccc-ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecCcce-eeec-----CCc Confidence 2344432 2222222122123334566666666677788889988888743 4454443322211 1112 222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) . ......++..|+.+.++++.++.+|+++ +-|++.++...+..++.+..+ T Consensus 90 ~-----------------------------~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~aia 139 (324) T protein:vir:93 90 K-----------------------------IETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred c-----------------------------ccccccceeEEEEEeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHH Confidence 2 1222236677889999999999999984 456666788887777765544 Q ss_pred hHHHHHHHHHHhccCceEEe-cCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVF-TGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~y-ag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) .-.+. .++.|.+.--. .|..... ........+.+++++|.++...|..+.... T Consensus 140 ~~~d~----a~l~G~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~----------------- 193 (324) T protein:vir:93 140 KKFDE----AGILNQGNNPFGKSIAQSI-----EKTNKVIKGDFTQDNIIDLEALLEDDELEA----------------- 193 (324) T ss_pred HHHHH----HHhcCCCCCCcCccccccc-----cccceeccccccHHHHHHHHHhhhhccCCC----------------- Confidence 43333 23444321110 1111100 011112235689999999999998865431 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + ..+||+.....|+.|+|..+.|-|.+ +--+++-|- .++.++... T Consensus 194 -~-~~v~n~~~~~~L~~l~d~~G~~~~~~------------~~~~~l~G~--PVv~~~~~~------------------- 238 (324) T protein:vir:93 194 -N-AFISKTQNRSLLRKIVDPETKERIYD------------RNSDSLDGL--PVVNLKSSN------------------- 238 (324) T ss_pred -C-EEEEcHHHHHHHHHhhCCCCCeeecC------------CCCCcccce--eeEeecCCC------------------- Confidence 1 36789999999999998777666532 222444442 223222100 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEe-cCCCCCC--CCCCc--cchhhhHHHH--HHHHHhhcc Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVK-KPGEATA--DRNDP--YGKVGFSSIK--FFYGFIKLR 392 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk-~pG~~ta--d~~DP--lgQrg~~gwK--~~~~~~iL~ 392 (405) .++- .+++|+-+...+++.+ .+++-+. ..+.... ...-+ +-|++.+.++ +++++.+++ T Consensus 239 ----~~~~----~i~~gdfs~~~~~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~ 303 (324) T protein:vir:93 239 ----LKRG----ELITGDFDKLIYGIPQ-------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) T ss_pred ----CCcc----eEEEEecceEEEEEec-------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Confidence 0111 2457765555454431 1333221 1111000 11111 2455666666 678999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_020862. 393 GERIAVAYSVIPE 405 (405) Q Consensus 393 ~~~marie~~a~~ 405 (405) ++-+++|..+.+- T Consensus 304 ~~a~~~l~~a~~~ 316 (324) T protein:vir:93 304 DKAFAKLVPADKR 316 (324) T ss_pred ccceEEEeccccc Confidence 9999999855544 No 74 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.47 E-value=6.6e-08 Score=59.94 Aligned_cols=310 Identities=13% Similarity=0.061 Sum_probs=161.5 Q ss_pred CCc-cccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPH-IYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~-~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) .++ ...++...-.+.-+.-+-+ .+..+.+....+..++.+++...+|+- .++++.+..--+.+- -..||-.... T Consensus 8 ~~~~~~~~~~g~~~~~~~~liP~-~~~~~ii~~l~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~-~v~eg~~~~~ 82 (333) T protein:vir:78 8 LPNSAGSNHQGRLAHVPSDLLPK-EIVGPIFDKAQESSLVLRMGEQIPISY---GETIIPTTVKRPEVG-QVGVGTSNEQ 82 (333) T ss_pred hhhcccccccCceecCCccccch-hHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCceeE-eecCcccccc Confidence 111 1223322212211222222 334566666667788899999988873 233333322211110 1122221110 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) . + |+.......++..|+.+.++++.+..+|++++. |+..++.+.+..+|.+.. T Consensus 83 ~------------e--------------~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~-~s~~~~~~~i~~~la~ai 135 (333) T protein:vir:78 83 R------------E--------------GGLKPLSGTAWDTRSVSPIKLATIVTVSEEFAR-MNPSGLYTKLQGDLAYAI 135 (333) T ss_pred c------------c--------------cccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHH Confidence 0 1 111222334677778999999999999998544 555577777776666433 Q ss_pred hhHHHHHHHHHHhccCceEEec---CCC--ccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFT---GAA--TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~ya---g~a--ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) +.- + -..+++|.+...-. |-. +...+.+...+. .....+++++|.++...+..|.... T Consensus 136 ~~~-~---d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~------------ 198 (333) T protein:vir:78 136 GRG-I---DLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYL-QETGDPLLDRLLDGYDLVSANTDVE------------ 198 (333) T ss_pred HHH-H---HHHHhcccCCCCCccccccccccccccccccccc-ccccchhHHHHHHHHHhhccccccC------------ Confidence 332 2 23345554432111 100 011111111111 1234578899999988877665421 Q ss_pred cccccceEEEEEcccchHHHHH---HhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITE---LVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~---l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) .=+.++||.+...|+. ++|..+.|-|.+. .+.+..|+|.| +.+++++.+..-.+. T Consensus 199 ------~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~--------~~~~~~~~l~G--~Pv~~~~~i~~~~~~------ 256 (333) T protein:vir:78 199 ------FNGWAVDPRFRAHLLRAQAYRDANGNVDPSRI--------NLAAQTGDVLG--LPAQFGRAVGGDLGA------ 256 (333) T ss_pred ------ceEEEEcchHHHHHHHHhhhcCCCCceeecCc--------cccCCCceeec--eeeEEccccCCCccc------ Confidence 1145669998877765 4454555555432 35667788866 577777655421111 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCc----cchhhhHHHH--HH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDP----YGKVGFSSIK--FF 385 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP----lgQrg~~gwK--~~ 385 (405) ...++. .+++|+-..-.+++. +.+++.+..=+ +....|. +-|++.+.++ .+ T Consensus 257 ----------~~~~~~----~~~~gD~~~~~~g~~-------~~~~i~~~~~~--~~~~~~~~~~~~~~~~~v~~r~~~r 313 (333) T protein:vir:78 257 ----------AVDSKT----RIIGGDFSQLKFGFA-------DEIRIKMSDTA--TLTDSGSATVSMWQTNQIAILIEVT 313 (333) T ss_pred ----------cCCCcc----EEEEEecccEEEEEe-------eccEEEEeccc--cccccccceeehhhcCcEEEEEEEE Confidence 011221 355776665556654 23555554322 2223332 3466666666 57 Q ss_pred HHHhhccccceEEEE-EecC Q lcl|NC_020862. 386 YGFIKLRGERIAVAY-SVIP 404 (405) Q Consensus 386 ~~~~iL~~~~marie-~~a~ 404 (405) +.+.+++++-+++|+ ..|| T Consensus 314 ~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 314 FGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EccEEecccceEEEeccCCC Confidence 899999999999887 5677 No 75 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.46 E-value=3e-08 Score=61.79 Aligned_cols=312 Identities=16% Similarity=0.137 Sum_probs=129.3 Q ss_pred CCccccCcCCCcccccccc-eeehhhhhHHHHHhhhhhhhhcccccc--ccCcC-CCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQ-FNVHYWDRKSLIDEAEEMFFSPLADNK--QMPKH-FGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~q-m~t~y~~~k~L~~a~p~lv~~~fA~~~--~mPKn-~GktIkfrry~pl~~~~t~l~eGvt 76 (405) |. .+ +++.-|.+.+|..-.+.|||.++.+.. .-.++ .|.||++|++.+......- ..+. T Consensus 1 Ma---------------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~-~~~~- 63 (392) T protein:vir:99 1 MA---------------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK-LRGA- 63 (392) T ss_pred Cc---------------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeee-cccc- Confidence 33 22 334478888998888999998886433 12244 6999999976544321110 0111 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEe-eeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTL-TEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l-~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) ..+.. .+-...+...++.+| ++..+=.+++|+....+.. ++..++.++. T Consensus 64 ~~~~~-----------------------------~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~-~~~~~~~~~a 113 (392) T protein:vir:99 64 GAERN-----------------------------LTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLE-SFATQILPRQ 113 (392) T ss_pred ccCCc-----------------------------ccccccccceEEEEEeeeeecceeechHHHhhhhh-hhHHHHHHHH Confidence 00111 111112334566777 3444445688865444333 3333322222 Q ss_pred HHHHh-hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 156 LRGAN-EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 156 l~~~~-~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) +..-+ ++..+++ . ++.++ .+..... .+..+..-.++.|..+.+.|+++++|. T Consensus 114 ~~ala~~vd~~i~-~-~~~~a--~~~~~~~-----------~~~~~~~~~~~~i~~a~~~L~~~~vP~------------ 166 (392) T protein:vir:99 114 VRGVADILEEGVR-D-MIVGA--PYEAAGA-----------VHEVAPDEFFKGVNGARRALNELYIPQ------------ 166 (392) T ss_pred HHHHHHHHHHHHH-H-HHhcc--ccccccc-----------ccccChhhhHHHHHHHHHHHhhcCCCC------------ Confidence 22222 2222221 1 22221 1221111 111122356789999999999999983 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCc--ccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADA--ATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~--~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) .|++++.|+....|.. ++.|+..+.+|+. +.+.+|+||.+.| |.++.++.+..-.+......+. T Consensus 167 ------~R~~vv~p~~~~~l~~------~~~~~~~~~~g~~~~~~l~~G~vg~i~G--~~v~~s~~~~~~t~~a~~~~a~ 232 (392) T protein:vir:99 167 ------GRVLVVGTAVTEQILN------DDRFIKYESQGQSAVSALQEARLGRIYG--YEIVESTLIPHGDAYLYHPTAF 232 (392) T ss_pred ------CCEEEEcHHHHHHHhc------ccceeecccccchhhhhhhcceeeeeee--eEEEeecccccccceeeecccc Confidence 3788899998888863 5889999888875 4577999999966 8888887654321110000000 Q ss_pred CcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCCCCCCCCccchhhhHHHHHHHHH--- Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEATADRNDPYGKVGFSSIKFFYGF--- 388 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~tad~~DPlgQrg~~gwK~~~~~--- 388 (405) ......+.. ..+ ..+...+-|..++..--+......-+. ....++. .|..+....+--+..-.+.++..-.. T Consensus 233 ~~at~a~v~-~~~--~~~~~s~s~~~~v~~~~~~~~~~t~~s-~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v 308 (392) T protein:vir:99 233 IMATRAPAP-PMG--AVRSTAISGDQRIAMRWLVDYDSTITS-NRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) T ss_pred ccccccccc-ccc--ccceeEEecccceecceeecccceeec-cccccceeEEEEEEeeccccceeeeeeeeeecceeee Confidence 000000000 000 011111222221110000000000000 0000000 00000000000000000000000000 Q ss_pred -----------hhccccceEEEEEecCC Q lcl|NC_020862. 389 -----------IKLRGERIAVAYSVIPE 405 (405) Q Consensus 389 -----------~iL~~~~marie~~a~~ 405 (405) .-..+.+-..+. +.|+ T Consensus 309 ~~v~~~~~~~~~~~~~~~~~~~t-~~~~ 335 (392) T protein:vir:99 309 APEAGANATITAAAGEDHTVQLK-VTDA 335 (392) T ss_pred eeeecccceeEeeeccceeEEEE-EEec Confidence 000011111111 1111 No 76 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.43 E-value=1.1e-07 Score=58.63 Aligned_cols=290 Identities=13% Similarity=0.115 Sum_probs=157.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.++....+.+.+-+.-+ +.-|..+.+....+.-.+.+++...+++ +.++++.++..-+.+ ....| |. T Consensus 20 ~~~~~~a~~~~~~~~~~~li-p~~~~~~ii~~~~~~s~l~~~~~~~~~~---~~~~~~p~~~~~~~a-~~v~E-----g~ 89 (324) T protein:vir:99 20 KPQVFNPDNVMMHEKKDGTL-LNDFTTPILQEVMENSKIMRLGKYEPME---GTEKKFTFWADKPGA-YWVGE-----GQ 89 (324) T ss_pred hhhhccccceeccCCCccee-chhHHHHHHHHHHhhchhhhhcceeecc---CCceEEEEEecCcce-eEecc-----Cc Confidence 22333322222122212223 3344556665666677888888888877 345666554322111 11122 22 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+ .-...++..++.+.+++|.++.+|++++. |+..++.+.+..++.+..+ T Consensus 90 ~~-----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~ai~ 139 (324) T protein:vir:99 90 KI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred cc-----------------------------cccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHH Confidence 21 11223667788999999999999998544 5555788877776665444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . .+| ..+++|.+.--..+.-.+. .........+.+++++|.++...|+.+... ++ T Consensus 140 ~-~~d---~~~l~G~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-----------------~~ 194 (324) T protein:vir:99 140 K-KFD---EAGILNQGNNPFGKSIAQS----IEKTNKVIKGDFTQDNIIDLEALLEDDELE-----------------AN 194 (324) T ss_pred H-HHH---HHhhhcCCCCccCcccccc----ccccceeccccCCHHHHHHHHHhhhhccCC-----------------CC Confidence 3 222 2345554332111111110 111122234678999999999999876542 11 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) +.++||.....|+.|+|..++|-|.+ +.-+++-| +.++.++.+. T Consensus 195 --~~v~n~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G--~PVv~~~~~~-------------------- 238 (324) T protein:vir:99 195 --AFISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDG--LPVVNLKSSN-------------------- 238 (324) T ss_pred --EEEEcHHHHHHHHHhhcCCCceeecC------------CCCccccc--eeEEeecCCC-------------------- Confidence 24789999999999998777766532 12244544 2333332111 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCCC--CCCCCcc--chhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEAT--ADRNDPY--GKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~t--ad~~DPl--gQrg~~gwK--~~~~~~iL~~ 393 (405) .++. .+++|.-+.-.+++.+ .+++-+.. .+-.+ .....++ -|++.+.|+ +++++.++++ T Consensus 239 ---~~~~----~~i~gd~~~~~~~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:99 239 ---LKRG----ELITGDFDKLIYGIPQ-------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred ---CCcc----eEEEEecccEEEEEec-------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 0111 2567776665555541 23332222 11111 1122232 467778887 6789999999 Q ss_pred cceEEEEEecCC Q lcl|NC_020862. 394 ERIAVAYSVIPE 405 (405) Q Consensus 394 ~~marie~~a~~ 405 (405) +-+++|..+.+- T Consensus 305 ~a~~~lt~a~~~ 316 (324) T protein:vir:99 305 KAFAKLVPADKK 316 (324) T ss_pred cceEEEEeccCC Confidence 999999877665 No 77 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.41 E-value=1.4e-07 Score=58.14 Aligned_cols=278 Identities=12% Similarity=0.049 Sum_probs=153.4 Q ss_pred cccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccccccc Q lcl|NC_020862. 4 IYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYA 83 (405) Q Consensus 4 ~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~ 83 (405) +-+.-..++++. +.-+-+-.+..+.+....+...+.+++...+|+.+.|+....++-.--+. -+..++|+++. T Consensus 1 ~l~~~~~~t~~~-gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~------a~~v~Eg~~~~ 73 (293) T protein:vir:48 1 MLDSKTDHSGSD-AGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGL------ANIDDEAGKIA 73 (293) T ss_pred CceeecccccCc-CceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcc------eeeecCCcccc Confidence 222222222222 22222334455555555567889999999999999986554442111111 11222222221 Q ss_pred CCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHH Q lcl|NC_020862. 84 GGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEIT 163 (405) Q Consensus 84 ~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~t 163 (405) .....++..|+-+.++++.++++|+++ +-|+..++.+.+..++.+..+ .. T Consensus 74 ----------------------------~~~~~~~~~i~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~-~~ 123 (293) T protein:vir:48 74 ----------------------------DIDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKVV-VT 123 (293) T ss_pred ----------------------------cccccceeEEEEeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHHHHHH-HH Confidence 111235677889999999999999985 456666777877777654433 33 Q ss_pred HHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEE Q lcl|NC_020862. 164 EDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRI 243 (405) Q Consensus 164 ed~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv 243 (405) |+ ..|++|.+. . +...+.+++++|.++...|+....+. + . T Consensus 124 ~~---~~i~~g~~~------~------------~~~~~~~~~d~i~~~~~~l~~~~~~~------------------a-~ 163 (293) T protein:vir:48 124 RN---KAILGVVDK------L------------PTKPTLTKWDDIIDLEAKVDPAIKQT------------------S-F 163 (293) T ss_pred HH---hHHhhcccc------c------------cccccccCHHHHHHHHHhhhhhhcCC------------------C-E Confidence 43 344444221 0 01125689999999998886443220 1 3 Q ss_pred EEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccC Q lcl|NC_020862. 244 AYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAG 323 (405) Q Consensus 244 ~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g 323 (405) -+||+.+...|+.|+|.-+.|-|.|- +.++--++|-|-.++++.+. .++ +. + T Consensus 164 ~vmn~~~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G~Pv~~~~~~-~~~--~~-----------------~ 215 (293) T protein:vir:48 164 FLTNTSGFTALKKVKNALGDYLMERD--------VKSPTGYSIAGFAVKEISDR-WLP--NA-----------------S 215 (293) T ss_pred EEEcHHHHHHHHHhhccCCceEeecC--------cCCCCCceecceeeEEeccc-ccC--Cc-----------------c Confidence 36899999999999998888888652 23344466766444433321 111 00 0 Q ss_pred CcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccceEEE Q lcl|NC_020862. 324 TDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERIAVA 399 (405) Q Consensus 324 ~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~mari 399 (405) . +.++ +++|. ++|-...-++ +++-+..- .+-+-+++..+++ +.+.+.+.+++-++++ T Consensus 216 ~---~~~~-~~~gd~~~~~~~~~~~~--------~~i~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l 276 (293) T protein:vir:48 216 S---GVMP-LYFGDLKQAVTLFDRQQ--------MSLLSTNI-------GGGAFETDTTKVRVIDRFDVVATDTEAFVPA 276 (293) T ss_pred C---CceE-EEEEeccceEEEEEecc--------eEEEEecc-------cchhhhcCeEEEEEEEeeCcEEecccceEEE Confidence 0 1334 34664 3443222221 33333321 1223344444444 5678899999999999 Q ss_pred EEecCC Q lcl|NC_020862. 400 YSVIPE 405 (405) Q Consensus 400 e~~a~~ 405 (405) +..+.. T Consensus 277 ~~~~~~ 282 (293) T protein:vir:48 277 SFKAIA 282 (293) T ss_pred Eeeccc Confidence 866555 No 78 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=98.40 E-value=9.2e-08 Score=59.17 Aligned_cols=304 Identities=16% Similarity=0.124 Sum_probs=151.2 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccC----cCCCCEEEEEecccCCCCCCcc--ccC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMP----KHFGKELKVFYYVPLLDDLNVN--DQG 74 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mP----Kn~GktIkfrry~pl~~~~t~l--~eG 74 (405) |.. +-+.+.|| -|.+++|.--++.+|+.++.+..--. .++|.||++|+...+....... ..+ T Consensus 1 MAN--------sl~~l~p~----iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~ 68 (423) T protein:vir:10 1 MAN--------NLDANVSQ----IVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITG 68 (423) T ss_pred Ccc--------ccccccHH----HHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCc Confidence 331 11223344 57889999899999998885543322 3479999998655443322110 111 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) .++ +. |+| ..++.+|.|+ .+-++++|+...++.+ ++- T Consensus 69 ~~~--~~----------------------l~e------------~~v~l~id~~k~~a~~v~d~E~~l~i~-~~~----- 106 (423) T protein:vir:10 69 KSK--NS----------------------LIS------------AKATGEVGNYITVAVEYRQIEEALKLN-QLD----- 106 (423) T ss_pred ccc--cc----------------------ccc------------ceEEEEecceeeeeeeeChHHHhcChh-HHH----- Confidence 111 11 111 1244555443 4567899976554443 222 Q ss_pred HHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) +.++.|..---+.+-.+|.... ....+. ..+..+++. =.++++.++.+.|+++++|+ T Consensus 107 ~~l~~A~~aLA~~vd~~ia~~~-~~~~~~-------~vgt~~t~~----~a~~~~a~a~~~L~~~~vP~----------- 163 (423) T protein:vir:10 107 QILVPINERMVTDLETELALFM-MKHGAL-------SLGSPNTPI----KKWSDVAQTASFLKDLGINS----------- 163 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHh-hhcccc-------ccccccccc----ccHHHHHHHHHHHhhccCCc----------- Confidence 2333333322222222332110 000000 001111111 13789999999999999995 Q ss_pred CcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcce-eEecCCcEEEEeCcchhhh-hcC-CC--- Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEI-GAIPGAHLRIVVVPQMMHY-AGA-GA--- 307 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEI-Gsi~g~n~Rfv~~p~~~~~-~~a-Ga--- 307 (405) .-|++++.|+....|.. +..+..+.+-+..+.+-+++| |++.| |++..+..+-.- .|. |+ T Consensus 164 ------~~R~~Vv~p~~~a~Ll~------~~~~~~~~~~~~~~alr~~~i~G~~~G--Fdi~~Sn~vp~~T~g~~~ga~~ 229 (423) T protein:vir:10 164 ------GENYAVMDPWAAQRLAD------AQSGLHVSEQLVRTAWENAQISGNFGG--IRALMSNGLASRTQGAFGGKLT 229 (423) T ss_pred ------CCCEEEeCHHHHHHHhh------hhhhhccccccchHHHHhcccceeecc--eEEEEecCCcccccccccceee Confidence 13889999999888742 234555556677777888877 99966 898888766422 111 11 Q ss_pred ----------ccc-----------CCCc------------------------------------ccccc-----cccCCc Q lcl|NC_020862. 308 ----------TAT-----------AANR------------------------------------GYQVS-----DVAGTD 325 (405) Q Consensus 308 ----------~~~-----------~t~~------------------------------------~~~~~-----~~~g~~ 325 (405) .+. ++.. -+.+. ...|.. T Consensus 230 ~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~ 309 (423) T protein:vir:10 230 VKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDV 309 (423) T ss_pred eeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCce Confidence 000 0000 00010 012233 Q ss_pred ceeeeEE------------------------------------EEEccccce--eecceeccCC------CCCCceEEEe Q lcl|NC_020862. 326 KYDIAPL------------------------------------LVVGDQAFA--TIGLQGMSGK------GKSKFRIIVK 361 (405) Q Consensus 326 ~~DVYp~------------------------------------lV~G~~Afg--~i~l~g~~~~------g~~~~~~ivk 361 (405) .+.+||. |++.++||+ +.+|. +.+. -..++.+.+- T Consensus 310 tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~-~~~~~~~~~~~~~g~s~r~~ 388 (423) T protein:vir:10 310 TVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLP-KLHSIDSAVATYEGFSIRVH 388 (423) T ss_pred EEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEccc-CCCccceeecccccceEEEE Confidence 3455553 355555554 33332 1111 0112222222 Q ss_pred cCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 362 KPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 362 ~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) .- .|.-..--.+.|=.+|++..|++||..|+ ...| T Consensus 389 ~~-------~d~~~~~~~~r~d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:10 389 KY-------ADGDANKQMMRFDLLPAYVCYNPHMGGQF-FGNP 423 (423) T ss_pred Ee-------eeccccceEEEEEeecceeeeccceEEEE-EecC Confidence 11 11111122234556799999999998777 5666 No 79 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.39 E-value=2.1e-07 Score=57.21 Aligned_cols=300 Identities=13% Similarity=0.101 Sum_probs=139.9 Q ss_pred cccceee---hhhhhHHHHHhhhhhhhhccccccccC-----cCCCCEEEEEecccCCCCCCccccCCCcccccccCCcc Q lcl|NC_020862. 16 VGPQFNV---HYWDRKSLIDEAEEMFFSPLADNKQMP-----KHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNL 87 (405) Q Consensus 16 v~~qm~t---~y~~~k~L~~a~p~lv~~~fA~~~~mP-----Kn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnl 87 (405) +.+++.+ --|.+++|.--++.||+.++.. +..+ ++.|.||++|+--+.. +.+ +-+..+ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~-r~y~ge~~~a~~GDTV~I~~p~~~~----v~d-~~~~~~-------- 66 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVD-RQLLSGEINSNTGDSVSFKRPHQFK----SER-TETGDI-------- 66 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcc-cCCCcccccccCCCEEEEeeCCcce----eec-ccCcCC-------- Confidence 3344323 3588899988999999998753 3332 4679999999654432 211 111111 Q ss_pred cccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHH Q lcl|NC_020862. 88 YGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDL 166 (405) Q Consensus 88 y~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~ 166 (405) ...+..+++...++.+|-|. .+-.+++|+...++.. ++.+.+...+-.-+.++..+. T Consensus 67 ---------------------~~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~-~~~~~l~~a~~ala~~vd~~l 124 (423) T protein:vir:35 67 ---------------------TGKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLN-QLDQILSPIHERMVTDLETEL 124 (423) T ss_pred ---------------------CCccccccccceeeEEeccceeccceeCHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHH Confidence 11112222333455666543 3456899976555444 444444444443333443333 Q ss_pred HHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEE Q lcl|NC_020862. 167 LQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYI 246 (405) Q Consensus 167 l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~ 246 (405) + ..+..++.+ ..|.. .++ .-.++++..+.+.|+++++|+ .-|++++ T Consensus 125 ~-~~l~~~a~~--~vgt~----------~t~----~~~~~~i~~a~~~Ld~~~vP~-----------------~~R~~Vv 170 (423) T protein:vir:35 125 A-HFMMNNGAL--SLGSP----------NTA----IKKWADVAQTASFIKDIGIKT-----------------GENYAIM 170 (423) T ss_pred H-HHHhhcccc--ccccc----------cCC----cchHHHHHHHHHHHHHhcCCc-----------------CCCEEEe Confidence 3 333332211 11211 111 124799999999999999995 1389999 Q ss_pred cccchHHHHHHhcccCCCcceehhhcCCcccccCcce-eEecCCcEEEEeCcchhhh-hcCCCcccCCCccccccc--cc Q lcl|NC_020862. 247 GSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEI-GAIPGAHLRIVVVPQMMHY-AGAGATATAANRGYQVSD--VA 322 (405) Q Consensus 247 h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEI-Gsi~g~n~Rfv~~p~~~~~-~~aGa~~~~t~~~~~~~~--~~ 322 (405) .|+....|.. ++.+.-..+-+..+.+-+++| |++.| |.+.++..+-.- ++..+.....+.+-.++. .. T Consensus 171 ~p~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~G--Fdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~ 242 (423) T protein:vir:35 171 DPWSAQRLAD------AQSGLHAADQLVRTAWENAQISGNFGG--IRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVK 242 (423) T ss_pred CHHHHHHHhc------cccceeccccchhHHHhhccceeeecc--eEEEEcCCCccccccccccceeecccccccccccc Confidence 9998888742 233333334445566788876 99966 899998776632 222111111111111100 00 Q ss_pred --CC-------cceeeeEEEEEccccceeecceeccC---------CCCCCceEEEecCCCCCCCCCCccchhhhHHHHH Q lcl|NC_020862. 323 --GT-------DKYDIAPLLVVGDQAFATIGLQGMSG---------KGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF 384 (405) Q Consensus 323 --g~-------~~~DVYp~lV~G~~Afg~i~l~g~~~---------~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~ 384 (405) +. -....|..|..|+ .|.--|++.-.- +-...++..|..--...+..+. ..|. T Consensus 243 ~~~~~~~~~~~~~~~~~g~l~~GD-~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~--------~v~i 313 (423) T protein:vir:35 243 DSYQFTVALTGATPSKTGFLKAGD-QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDV--------TVKL 313 (423) T ss_pred ccccceeeeeeeeeccCCcEEecc-eEEeeeeeeccccccceeecccCCceeEEEEeccccccccCce--------eEEc Confidence 00 0123467777776 555444432100 0011233333211000000000 0111 Q ss_pred HHHHhhc---cccceEEEEEecCC Q lcl|NC_020862. 385 FYGFIKL---RGERIAVAYSVIPE 405 (405) Q Consensus 385 ~~~~~iL---~~~~marie~~a~~ 405 (405) += +.+. +..|-.+ +++|= T Consensus 314 ~p-~~~~~~~~~~~~~v--~a~~a 334 (423) T protein:vir:35 314 SG-VPIYDEKNSQYNAV--DAKVK 334 (423) T ss_pred cc-cccccCCCcccccc--ccccc Confidence 10 0111 0000000 00111 No 80 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.36 E-value=1.1e-07 Score=58.84 Aligned_cols=298 Identities=10% Similarity=0.057 Sum_probs=156.2 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) =|+.-+. ..+++++-+.-+-+ .+..+.+....+.-++.++++..+|+.+ ++++-+...-+ ..+ T Consensus 8 ~~e~~~~-~~~~~~~~~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~~~------------~a~ 70 (318) T protein:vir:24 8 AVDHAQI-AQTGDTMFKGYLEP-EQAKDYFAEAEKTSIVQQFAQKVPMGTT---GQKIPHWVGDV------------SAQ 70 (318) T ss_pred CHHHHHh-hcccCcccceeech-hHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEeCCc------------ceE Confidence 1111111 11222222222333 3345566666677788999999988743 34443332111 111 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) -+. +|..+.....++.+++.+.++++.++++|+++ +.|+..++.+.+..++.+..+ T Consensus 71 ~v~-----------------------Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~~~~ 126 (318) T protein:vir:24 71 WIG-----------------------EGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFA 126 (318) T ss_pred Eec-----------------------CCccccccccceeEEEEeeEEEEEeehhhHHH-hhcChHHHHHHHHHHHHHHHH Confidence 111 12222222346677889999999999999985 445666788877666665443 Q ss_pred hHHHHHHHHHHhccCceEEecCCC--ccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAA--TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~a--ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) . .+ -..+++|.+.-.-.|.. +..++.+.. ...+....+++..+...+...... T Consensus 127 ~-~~---d~a~l~G~g~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------------- 181 (318) T protein:vir:24 127 M-AF---DGAAMHGTDSPFPTYIGQTTKAISIADT----TGATTVYDQVAVNGLSLLVNDGKK----------------- 181 (318) T ss_pred H-HH---HHhhhcccCCCCCccccccccccccccc----ccccchHHHHHHHHHHhhccccCC----------------- Confidence 3 22 23446665432222111 111211111 112233444555555555443332 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) .+ +-+|||.....|+.|+|..+.|-|.|...-+...++ +.+.+.| +.++.++.+ .+| T Consensus 182 -~~-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~---~~~~i~g--~pv~~~~~~----~~~------------ 238 (318) T protein:vir:24 182 -WT-HTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPF---RSGRIVA--RPTILSDHV----VEG------------ 238 (318) T ss_pred -CC-EEEEcHHHHHHHHHhhccCCceeecCccccCccccc---cCceEEE--EeeEEeCCC----CCC------------ Confidence 11 348999999999999999888888875444444332 2244433 234444322 111 Q ss_pred ccccCCcceeeeEEEEEccccceeecceeccCCCCCCceE-EEecCCCC--CCCCCCcc--chhhhHHHH--HHHHHhhc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRI-IVKKPGEA--TADRNDPY--GKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~-ivk~pG~~--tad~~DPl--gQrg~~gwK--~~~~~~iL 391 (405) + .++++|+-+...++..+ .+.+ +....+-. +.+.++|. -|++.+.|| +++.+.++ T Consensus 239 -------~----~~~~~gdfs~~~~~~~~-------~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 300 (318) T protein:vir:24 239 -------T----TVGFMGDFSQLIWGQIG-------GLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCN 300 (318) T ss_pred -------c----cEEEEeecceEEEEEec-------CeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 1 13466766655555431 1222 12222111 11233343 466677776 78999999 Q ss_pred cccceEEEEEecCC Q lcl|NC_020862. 392 RGERIAVAYSVIPE 405 (405) Q Consensus 392 ~~~~marie~~a~~ 405 (405) +++-+++|..++.- T Consensus 301 ~~~a~~~i~~~~a~ 314 (318) T protein:vir:24 301 DAEAFVALTNVVSG 314 (318) T ss_pred cccceEEEEeeccC Confidence 99999999987766 No 81 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.33 E-value=1.7e-07 Score=57.65 Aligned_cols=299 Identities=11% Similarity=0.047 Sum_probs=153.8 Q ss_pred CC----ccccCc--CCCcccccccceeehhhhhHHHHHhhhh-hhhhccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MP----HIYNDP--AAGDASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~----~~y~~~--~~t~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) +. +.+.+- ..+++++-+.-+-+.| ....++++... -.+.+++.+.++ .|+... .+...-+. .....| T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg~lip~~~-~~~ii~~~~~~~~~l~~~~~~~~~---~g~~~~-~~~~~~~~-a~~v~E 310 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGGYLVPFQL-DPTVIITSNGSLNDIRRFARQVVA---TGDVWH-GVSSAAVQ-WSWDAE 310 (543) T ss_pred hhhhhhhhhhhhhhcccccccCcccCchhh-hhHHHHHHHhhhchhhhhcccccC---CcceEE-EEecCCcc-eeeccc Confidence 00 111111 0111111111111112 23344444433 355666654333 354322 22211111 111122 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) | +.+ .-...++..|+.+.++++.|+.+|++++. |+ .++...|.. T Consensus 311 g-----~~~-----------------------------~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-~~~~~~i~~ 354 (543) T protein:vir:81 311 F-----EEV-----------------------------SDDSPEFGQPEIPVKKAQGFVPISIEALQ-DE-ANVTETVAL 354 (543) T ss_pred C-----ccc-----------------------------cccccccceeeeeeeeeEeeehhhHHHHh-cc-HHHHHHHHH Confidence 2 111 11223566778999999999999999764 54 478887777 Q ss_pred HHHHHHhhHHHHHHHHHHhccCceE-EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADVK-VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~v-~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) .|....+. .++ ..|++|.++- ...|-.+...... ...+....+.++++++.++...|+.+.... T Consensus 355 ~l~~~~~~-~~d---~ail~G~Gt~~~p~Gi~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~---------- 419 (543) T protein:vir:81 355 LFAEGKDE-LEA---VTLTTGTGQGNQPTGIVTALAGTA-AEIAPVTAETFALADVYAVYEQLAARHRRQ---------- 419 (543) T ss_pred HHHHHHHH-HHH---HHHhccCCCCcccccchhhccccc-ccccccccccccHHHHHHHHHhhhccccCC---------- Confidence 76654443 332 2445654321 1111111000000 000111235689999999999997765531 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) -+.++||.+...|+.|+|..+.|-|-|. ..|.-++|.| +.+++++.|-.-...++ T Consensus 420 ---------~~~v~n~~~~~~l~~lkd~~G~~l~~~~---------~~g~~~~l~G--~pv~~~~~~~~~~~~~~----- 474 (543) T protein:vir:81 420 ---------GAWLANNLIYNKIRQFDTQGGAGLWTTI---------GNGEPSQLLG--RPVGEAEAMDANWNTSA----- 474 (543) T ss_pred ---------cEEEEcHHHHHHHHHhhcCCCceeccCc---------CCCCCccccc--eeeEEeccccccccccc----- Confidence 2457999999999999998888888653 2334456755 56677765542111110 Q ss_pred CcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcc Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLR 392 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~ 392 (405) +. +.+ .++||.-+.-.++.. ..+++.+.+-+. .+.....|++++..| +++++.+++ T Consensus 475 ----------~~---~~~-~i~~gd~~~~~i~~~-------~~~~i~~~~~~~--~~~~~~~~~~~~~~~-~r~d~~v~~ 530 (543) T protein:vir:81 475 ----------SA---DNF-VLLYGNFQNYVIADR-------IGMTVEFIPHLF--GTNRRPNGSRGWFAY-YRMGADVVN 530 (543) T ss_pred ----------cC---Ccc-eEEEeeccceeEEee-------cccEEEEecccc--ccchhhcCceEEEEE-EeeccEeec Confidence 01 123 356687665556554 235665554332 233345566655543 468889999 Q ss_pred ccceEEEEEecCC Q lcl|NC_020862. 393 GERIAVAYSVIPE 405 (405) Q Consensus 393 ~~~marie~~a~~ 405 (405) +.-++.++..+.= T Consensus 531 ~~A~~~l~~~~~a 543 (543) T protein:vir:81 531 PNAFRLLNVETAS 543 (543) T ss_pred ccceEEEEecccC Confidence 9998888776555 No 82 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.32 E-value=3.6e-07 Score=55.89 Aligned_cols=310 Identities=11% Similarity=0.068 Sum_probs=141.6 Q ss_pred CC---ccccCcCCCcccccccc-eeehhhhhHHHHH-hhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MP---HIYNDPAAGDASTVGPQ-FNVHYWDRKSLID-EAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~---~~y~~~~~t~~~~v~~q-m~t~y~~~k~L~~-a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) .. ..-.....+++++-+.. +-+-+ ..+.+++ ..+..++.+++...+||-..|. +++-+...-+.......||- T Consensus 145 ~~~~~~~~~~~~~~~~~~~gg~lv~~~~-~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~-~~ip~~~~~~~~a~~~~Eg~ 222 (477) T protein:vir:84 145 RKIAKVGEEYRDLDRNGGTGGYAVPPLW-MMNRFIELARAGRTYANLCPTEPLPGGTSS-INIPKILTGTSTAIQAADNA 222 (477) T ss_pred HHHHHhhhhhccccccCCCcceeeccch-hHHHHHHHhhhcchHHHhhceeeecCCcce-eEEEEEecCcceeeeeccCc Confidence 00 00000111111111111 11212 2334444 4456777888888899887765 33322211111111122222 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) .... + ...-...++..|+.+.++++.++.+|++ ++-|+..++...|..+| T Consensus 223 ~~~~-----~------------------------~~~~s~~~f~~i~~~~~k~~~~~~iS~e-ll~ds~~~l~~~i~~~l 272 (477) T protein:vir:84 223 ALTA-----P------------------------SAHEVDLTDGFVQANVKTIAGQQGIAIQ-LLDQAAVSVDEFVFRDL 272 (477) T ss_pred cccc-----c------------------------cccccccceeeEEEeeeeEEeeeHHHHH-HHhccchhHHHHHHHHH Confidence 1110 0 0111123677789999999999999998 45555557777777776 Q ss_pred HHHHhhHHHHHHHHHHhccCce-------EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADV-------KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIK 228 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~-------v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~ 228 (405) .+..+ ..+| ..+++|.++ +-+.+. ..++.+... ..-.+....+++|.++...+..+... T Consensus 273 ~~~~~-~~~d---~~~l~G~Gt~~~p~Gi~~~~~~--~~~~~~~~~-~t~~~~~~~~~~i~~~~~~~~~~~~~------- 338 (477) T protein:vir:84 273 AADYA-NKLN---VQVISGTGSNNQVVGVRATAGI--TQVTATSAG-SALEKHQIIYQKIADAIQRVHTSRFL------- 338 (477) T ss_pred HHHHH-HHHH---HHHhccCCCCCccceeeecccc--ccccccccc-cchhhHHHHHHHHHHHHhhccccccC------- Confidence 65433 3333 245666543 111111 011111100 00001112233334433333333221 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCccc-----ccCcceeEecCCcEEEEeCcchhhhh Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAAT-----IMNGEIGAIPGAHLRIVVVPQMMHYA 303 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~-----i~~gEIGsi~g~n~Rfv~~p~~~~~~ 303 (405) + .-+.++||.....|+.|+|-.+.|-|.|...-.+... +.++=.|.+-| +++++++.|-. T Consensus 339 ----------~-~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G--~pVv~s~~~p~-- 403 (477) T protein:vir:84 339 ----------E-PEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG--LPVVTDPTLPT-- 403 (477) T ss_pred ----------C-ccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc--cceEecCcccc-- Confidence 1 2355889999999999999999999987644333333 33444567755 57788875531 Q ss_pred cCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH Q lcl|NC_020862. 304 GAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK 383 (405) Q Consensus 304 ~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK 383 (405) +.|+. + | ...++||.-+.-.+.-.| +.+.+. + + ...+ .++..|.-+ T Consensus 404 ~~~~~---------------~---d-~~~i~~gd~~~~~i~~~~--------~~~~~~-~-~---~~~~-~~~~~~~v~- 449 (477) T protein:vir:84 404 TLGTG---------------T---D-QDVIHVLRASDLALFESS--------VRMRAL-Q-E---TRAE-NLSVLLQVY- 449 (477) T ss_pred ccccc---------------C---C-cceEEEEEeceEEEEeec--------eeEEec-c-c---cccc-cceeeeeeh- Confidence 22211 1 1 124566766655553321 222222 1 0 1111 222222111 Q ss_pred HHHHHhhcc-ccceEEEEEe---cCC Q lcl|NC_020862. 384 FFYGFIKLR-GERIAVAYSV---IPE 405 (405) Q Consensus 384 ~~~~~~iL~-~~~marie~~---a~~ 405 (405) .++.+...| ++-.++|=-. ||- T Consensus 450 ~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 450 GYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred hhhhhhhhccccceEEeecccccccc Confidence 234554444 6666655433 233 No 83 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.31 E-value=4.8e-07 Score=55.23 Aligned_cols=290 Identities=11% Similarity=0.102 Sum_probs=155.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.++....+.++.-+.-+-+. +..+.+....+.-.+.+++...+|+. .++++.+...-+.+ +.-+. T Consensus 20 ~~~~~~a~~~~~~~~~~~~iP~~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a------~~v~E-- 87 (324) T protein:vir:78 20 KPQVFNPDNVMMHEKKDGTLMNE-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA------YWVGE-- 87 (324) T ss_pred hhhhhccccccccCcCccccchh-HHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcce------eEecC-- Confidence 22233332233223323333333 35566666666778888888888773 34554443221111 11122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |+.......++..++.+.++++.++.+|++++. |+..++..++..++.+..+ T Consensus 88 ---------------------------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:78 88 ---------------------------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred ---------------------------CccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHH Confidence 222223334677788999999999999998544 5666788887777765544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .-.+. .+++|.+.-...+.- ..... .......+.+++++|.++...|+.+.... + T Consensus 140 ~~~d~----a~l~G~g~~~~~~gi---~~~~~-~~~~~~~~~~t~~~i~~~~~~l~~~~~~~-----------------~ 194 (324) T protein:vir:78 140 KKFDE----AGILNQGNNPFGKSI---AQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEA-----------------N 194 (324) T ss_pred HHHHH----HHhccCCCCCcCccc---ccccc-ccceeccccccHHHHHHHHHhhhhccCCC-----------------C Confidence 43333 334443322111111 00011 11112235689999999999888765431 1 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) +.++||.....|+.++|..+.|.|. .+.-+++-|. .++.++.+. T Consensus 195 --~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~--PV~~~~~~~-------------------- 238 (324) T protein:vir:78 195 --AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGL--PVVNLKSSN-------------------- 238 (324) T ss_pred --EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccce--eeEeeCCCC-------------------- Confidence 3579999999999998876666553 1223445442 333332110 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCC--CCCCCCcc--chhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEA--TADRNDPY--GKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~--tad~~DPl--gQrg~~gwK--~~~~~~iL~~ 393 (405) .++- .+++|.-+...++.. ..+.+-+.. +... ......|+ -|+....|+ +++.+.++++ T Consensus 239 ---~~~~----~~~~gd~~~~~~g~~-------~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:78 239 ---LKRG----ELITGDFDKLIYGIP-------QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred ---CCcc----eEEEEecceEEEEEe-------cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 0111 245776555555553 123332322 1110 01112232 455667766 6889999999 Q ss_pred cceEEEEEec--CC Q lcl|NC_020862. 394 ERIAVAYSVI--PE 405 (405) Q Consensus 394 ~~marie~~a--~~ 405 (405) +-+++|.-+- +| T Consensus 305 ~A~~~l~~a~~~~~ 318 (324) T protein:vir:78 305 KAFAKLVPADKRTD 318 (324) T ss_pred cceEEEecccccCC Confidence 9999998543 33 No 84 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.31 E-value=4.8e-07 Score=55.23 Aligned_cols=290 Identities=11% Similarity=0.102 Sum_probs=155.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.++....+.++.-+.-+-+. +..+.+....+.-.+.+++...+|+. .++++.+...-+.+ +.-+. T Consensus 20 ~~~~~~a~~~~~~~~~~~~iP~~-~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a------~~v~E-- 87 (324) T protein:vir:96 20 KPQVFNPDNVMMHEKKDGTLMNE-FTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA------YWVGE-- 87 (324) T ss_pred hhhhhccccccccCcCccccchh-HHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcce------eEecC-- Confidence 22233332233223323333333 35566666666778888888888773 34554443221111 11122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |+.......++..++.+.++++.++.+|++++. |+..++..++..++.+..+ T Consensus 88 ---------------------------g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:96 88 ---------------------------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred ---------------------------CccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHH Confidence 222223334677788999999999999998544 5666788887777765544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .-.+. .+++|.+.-...+.- ..... .......+.+++++|.++...|+.+.... + T Consensus 140 ~~~d~----a~l~G~g~~~~~~gi---~~~~~-~~~~~~~~~~t~~~i~~~~~~l~~~~~~~-----------------~ 194 (324) T protein:vir:96 140 KKFDE----AGILNQGNNPFGKSI---AQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEA-----------------N 194 (324) T ss_pred HHHHH----HHhccCCCCCcCccc---ccccc-ccceeccccccHHHHHHHHHhhhhccCCC-----------------C Confidence 43333 334443322111111 00011 11112235689999999999888765431 1 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) +.++||.....|+.++|..+.|.|. .+.-+++-|. .++.++.+. T Consensus 195 --~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~--PV~~~~~~~-------------------- 238 (324) T protein:vir:96 195 --AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGL--PVVNLKSSN-------------------- 238 (324) T ss_pred --EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccce--eeEeeCCCC-------------------- Confidence 3579999999999998876666553 1223445442 333332110 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCC--CCCCCCcc--chhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEA--TADRNDPY--GKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~--tad~~DPl--gQrg~~gwK--~~~~~~iL~~ 393 (405) .++- .+++|.-+...++.. ..+.+-+.. +... ......|+ -|+....|+ +++.+.++++ T Consensus 239 ---~~~~----~~~~gd~~~~~~g~~-------~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:96 239 ---LKRG----ELITGDFDKLIYGIP-------QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred ---CCcc----eEEEEecceEEEEEe-------cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 0111 245776555555553 123332322 1110 01112232 455667766 6889999999 Q ss_pred cceEEEEEec--CC Q lcl|NC_020862. 394 ERIAVAYSVI--PE 405 (405) Q Consensus 394 ~~marie~~a--~~ 405 (405) +-+++|.-+- +| T Consensus 305 ~A~~~l~~a~~~~~ 318 (324) T protein:vir:96 305 KAFAKLVPADKRTD 318 (324) T ss_pred cceEEEecccccCC Confidence 9999998543 33 No 85 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.30 E-value=2.1e-07 Score=57.17 Aligned_cols=282 Identities=11% Similarity=0.061 Sum_probs=153.8 Q ss_pred CCc----cccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPH----IYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~----~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) +.. ..+--..++.+.-+.-+ +..+....+....+..++.+++.+.+|+-+.|+....++ .-..... ... T Consensus 98 l~~~~~~~~~~~~~~t~~~gg~~i-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~a-----~~v 170 (397) T protein:vir:49 98 VRGRYQNLLDSKTDGSGSDAGLTI-PQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW-ADITGLA-----KLD 170 (397) T ss_pred hhcchhhHHHhhhccCCccCccee-cHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee-ccCCcce-----eee Confidence 110 00110111111111222 223445555555567788899999999998887544332 1111111 122 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) +.|.++. .....++..|+-+.++++.++.+|++++ -|+..++...+..++. T Consensus 171 ~E~~~~~----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~ 221 (397) T protein:vir:49 171 DEGGQIG----------------------------QNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIA 221 (397) T ss_pred ccccccc----------------------------cccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHH Confidence 2222211 0011256677899999999999999854 5666677777777766 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) +..+. .+| ..+++|.++. +. .-+.+++++|.++...|+.+..+. T Consensus 222 ~~~~~-~~d---~ail~G~g~~------~~------------~~~~~~~d~i~~~~~~l~~~~~~~-------------- 265 (397) T protein:vir:49 222 KKVVV-TRN---KAILEAIGTL------PN------------KPTLAKWDDIIDLQAKVDPAIKQT-------------- 265 (397) T ss_pred HHHHH-HHH---HHHHhccccc------cc------------cccccCHHHHHHHHHhhhhhhcCC-------------- Confidence 54443 333 3456664321 10 124688999999998887655431 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) -.-++||.....|+.|+|-.+.|-|.|- + ..|--++|-|-.++++.+- .++ .++ T Consensus 266 -----a~~v~n~~~~~~l~~lkd~~g~~l~~~~--~------~~g~~~~l~G~pV~~~~~~-~~~---~~~--------- 319 (397) T protein:vir:49 266 -----SLFLTNTSGFTALKKVKNAMGDYLMERD--V------KSPTGYSIDGFVVKEISDR-FLP---NGT--------- 319 (397) T ss_pred -----CEEEEcHHHHHHHHHhhccCCceeeccc--c------cCCCCceecceeeEEeccc-ccc---ccc--------- Confidence 2457999999999999998888877652 1 2233355655333333321 111 111 Q ss_pred ccccccCCcceeeeEEEEEccccc-eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQAF-ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~Af-g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~ 393 (405) .+ -++ ++||.-.- -.+..+ +.+++.+..- .+-+-+++.+.++ +++.+.++++ T Consensus 320 -------~~---~~~-~~~gd~~~~~~~~~~-------~~~~i~~~~~-------~~~~~~~~~~~~~~~~r~d~~~~~~ 374 (397) T protein:vir:49 320 -------GG---AMP-LYFGDLKQAVTLFDR-------QHLSLLSTNI-------GGGAFETDTTKVRVIDRFDVVSTDT 374 (397) T ss_pred -------CC---cee-EEEeeccceEEEEee-------cccEEEEecc-------ccchhhcCeeeEEEEEeeccEEecc Confidence 11 122 45775331 222222 1234444321 2334456666655 6788899999 Q ss_pred cceEEEEEecCC Q lcl|NC_020862. 394 ERIAVAYSVIPE 405 (405) Q Consensus 394 ~~marie~~a~~ 405 (405) +-++.++..++= T Consensus 375 ~a~~~~~~~~~~ 386 (397) T protein:vir:49 375 EAFVPASFKAIA 386 (397) T ss_pred cceEEEEecccc Confidence 999999865443 No 86 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.29 E-value=3.3e-07 Score=56.11 Aligned_cols=287 Identities=13% Similarity=0.134 Sum_probs=155.3 Q ss_pred CCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCcccc Q lcl|NC_020862. 10 AGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLYG 89 (405) Q Consensus 10 ~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly~ 89 (405) +.+.++-+--+-+.+ ..+.+..+.+..++.+++...+|+.+ ++++-++..-+.+ .-..| T Consensus 1 m~t~t~gg~liP~~~-~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~~~~a-~wv~E---------------- 59 (303) T protein:vir:97 1 MGTETSKASLFDKHL-VSDLINKVKGHSSLAKLSSQKPIPFN---GSKEFTFTLDSDI-DVVAE---------------- 59 (303) T ss_pred CcccCCCCeEcchhH-HHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEecCcce-EEeec---------------- Confidence 222222222223333 45566666678899999999998853 3444433221111 11112 Q ss_pred cccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhcc--chHHHHHHHHHHHHhhHHHHHH Q lcl|NC_020862. 90 GSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDS--DLYGHLSREMLRGANEITEDLL 167 (405) Q Consensus 90 ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~--~l~~~~~~ell~~~~~~ted~l 167 (405) |+.+.....++..++-..++.+.++++|++++..+.|+ ++.+.+..++.+..+ .. + T Consensus 60 ------------------~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~-~~---l 117 (303) T protein:vir:97 60 ------------------NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLA-RG---I 117 (303) T ss_pred ------------------CccccccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHH-HH---H Confidence 22223333466778899999999999999976544443 456666555554332 22 2 Q ss_pred HHHHhccCce-----E------EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 168 QADILASADV-----K------VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 168 ~~~ilag~~~-----v------~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) -..+++|.+. . .+.+..+..+..+ ....++++|.++...|..+.... T Consensus 118 d~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~i~~~~~~~~~~~~~~-------------- 175 (303) T protein:vir:97 118 DLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFT--------ESEDADANIEAAVNLIQGAEGVV-------------- 175 (303) T ss_pred HhhhhcccccCCccccccccccccccccccccccc--------cccchHHHHHHHHHHHhhcCCCc-------------- Confidence 2344444211 1 1111111111111 23467889999888886644321 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) =..++||.....|+.|+|..+.|-|.|-.. ..+..|+|-| ++++.+..|. ..+ . . T Consensus 176 -----~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~-------~~~~~~~l~G--~Pv~~s~~v~-~~~--~-~------- 230 (303) T protein:vir:97 176 -----TGLAMDTEFSTALAKVTNGEMGPKMYPELA-------WGANPDSING--LKSSVNTTVG-AGA--D-E------- 230 (303) T ss_pred -----cEEEEcHHHHHHHHHhhccCCCeEEecCcc-------CCCCCceecc--eeeEEecccC-Ccc--c-c------- Confidence 126779999999999998777777765211 1344567866 6777777554 111 0 0 Q ss_pred ccccccCCcceeeeEEEEEccccc-eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQAF-ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~Af-g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~ 393 (405) +.++ ..+++|+-+. ..++.+ +.+++-+..-+.. -++..-|-|+..+.++ .++++.++++ T Consensus 231 ------~~~~----~~~~~Gdf~~~~~~~~~-------~~~~~~~~~~~~~-d~~~~~~~~~n~~~~r~~~r~~~~v~~p 292 (303) T protein:vir:97 231 ------AESK----DLVIIGDFESMFKWGYA-------KQIPMEIIKYGDP-DNSGKDLKGYNQIYLRAEAYIGWGILDA 292 (303) T ss_pred ------CCCc----cEEEEeeccccEEEEEe-------cCcEEEEeeccCC-CCcchhhhhcCcEEEEEEEEeccEeecc Confidence 0111 2468887533 245554 1233333332210 0111225566666775 5788999999 Q ss_pred cceEEEEEecC Q lcl|NC_020862. 394 ERIAVAYSVIP 404 (405) Q Consensus 394 ~~marie~~a~ 404 (405) +-+++|+=+== T Consensus 293 ~af~~l~~~~~ 303 (303) T protein:vir:97 293 KSFARVTKGEV 303 (303) T ss_pred cceEEeeCCCC Confidence 99888764322 No 87 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.27 E-value=3.5e-07 Score=56.01 Aligned_cols=304 Identities=11% Similarity=0.058 Sum_probs=158.1 Q ss_pred CCccccCcCC---CcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIYNDPAA---GDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y~~~~~---t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) +..+..+... +.+++-+.-+.+. +.+..+....+...+.+++...+|+.+ ++++-+...-+.+.- .. T Consensus 9 ~~~~~~~e~~a~~~~~~~~g~~ip~~-~~~~ii~~~~~~s~i~~~~~~~~~~~~---~~~~p~~~~~~~a~~-v~----- 78 (326) T protein:vir:42 9 TPFLGVNDPKVAQTGDSMFEGYLEPE-QAQDYFAEAEKISIVQQFAQKIPMGTT---GQKIPHWTGDVSASW-IG----- 78 (326) T ss_pred hhhcCcchhhheeccccCCcceechh-hHHHHHHHHHhcchhhhhcceeeccCC---ceEEEEEeCCcceEE-ec----- Confidence 3333222211 1122222234443 356666666777788889999998854 344433321111100 11 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) . |+.......++..|+.+.+++|.++.+|+++ .-|+..++...+..++.+ T Consensus 79 E-----------------------------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~el-l~~s~~~~~~~i~~~l~~ 128 (326) T protein:vir:42 79 E-----------------------------GDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVAT 128 (326) T ss_pred C-----------------------------CccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHH Confidence 1 2222223346777899999999999999985 456666788887777665 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCcc--ceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATS--MVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDT 235 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats--~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT 235 (405) ..+. .+| ..+++|.+.-.-.|-... ..+.....++ ..+..++..++..+..........+ T Consensus 129 a~~~-~~d---~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~------------- 190 (326) T protein:vir:42 129 AFAM-AFD---NAAINGTDSPFPTFLAQTTKEVSLVDPDGT-GSNADLTVYDAVAVNALSLLVNAGK------------- 190 (326) T ss_pred HHHH-HHH---HHhhcccCCCccccccccccccceeecccc-cccccchhHHHHHHHHHhhhhhhcc------------- Confidence 3333 333 344566543222221111 0111111111 1123355555443322222222210 Q ss_pred ccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 236 KTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 236 ~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) ..-+-++||.....|+.|+|..++|-|.+..+-+...++ ..|.+-| +..+.++.+. +|. T Consensus 191 ----~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~---~~~~l~G--~pv~~~~~~~----~~~-------- 249 (326) T protein:vir:42 191 ----KWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPF---RLGRIVA--RPTILSDHVA----SGT-------- 249 (326) T ss_pred ----CccEEEEeHHHHHHHHHhhccCCceeeccccccCccccc---cCceeee--eeEEEcCCCC----CCc-------- Confidence 012346899999999999999999999876555554433 3455633 5666665332 211 Q ss_pred cccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCC--CCCCCCCcc--chhhhHHHH--HHHHH Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGE--ATADRNDPY--GKVGFSSIK--FFYGF 388 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~--~tad~~DPl--gQrg~~gwK--~~~~~ 388 (405) .++++|+=+...++..+ .+.+-+.. ... .+....+|+ -|++...|| +++.+ T Consensus 250 ---------------~~~~~Gd~s~~~~~~~~-------~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~ 307 (326) T protein:vir:42 250 ---------------VVGYQGDFRQLVWGQVG-------GLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAF 307 (326) T ss_pred ---------------eEEEEeecceEEEEEec-------ceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEecc Confidence 13445554333343331 12221111 000 012333443 466778887 78899 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_020862. 389 IKLRGERIAVAYSVIPE 405 (405) Q Consensus 389 ~iL~~~~marie~~a~~ 405 (405) .+.+++-+++|+.++-- T Consensus 308 ~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 308 HCNDKDAFVKLTNVDAT 324 (326) T ss_pred EEecccceEEEeecccc Confidence 99999999999877666 No 88 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.27 E-value=1.9e-06 Score=51.98 Aligned_cols=314 Identities=11% Similarity=0.077 Sum_probs=171.7 Q ss_pred CCcccc---CcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIYN---DPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y~---~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) |+-. | .|.-...+.. ..+...-|.-+.|..-+...+|..+-..|.+ ..||++.|-|--..-. .-.+.|-.+ T Consensus 1 ms~~-~~~t~~~~~~s~~d-~al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~~~--~~~~pG~~l 74 (335) T protein:vir:78 1 MSFL-NDLTRPNYAGKNAD-VDIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNVEA--KGRRAGEEL 74 (335) T ss_pred CCcc-ccccccccccccch-hhhhhhhhhhHHHHHHHHhhhhccccceeee--ccceeEEEeeeeeeee--cccccCccc Confidence 5433 2 1111111111 1455556788888888888888899999988 5599999885433322 112333333 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .|+...+ .-..|+++=..|-.+. + |..-+...+-|+..+++.|++. T Consensus 75 ~~~~~~~--------------------------------~k~~itID~ll~a~~~-V-ddlDe~~~~yDvR~e~s~~~G~ 120 (335) T protein:vir:78 75 ERSRVVN--------------------------------DKWNLTVDTLLYLRHQ-F-DHQDEWTQSFDMRKEVAELDGQ 120 (335) T ss_pred CCCCccc--------------------------------CCeEEEecceeechhh-H-hhHHHhhcCchhHHHHHHHHHH Confidence 3322211 1111222211222211 2 2223445556788888888876 Q ss_pred HHhhHHHHHHHHHHhccCc---eEEecCC----CccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_020862. 158 GANEITEDLLQADILASAD---VKVFTGA----ATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGS 230 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~---~v~yag~----ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs 230 (405) .=+.....-+.+.+..++. .+-+.++ -+....+++. +++.|-..=.+-++.+...|.++.-|. T Consensus 121 aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~~~~~tg~--~~~~~~~~l~~a~~~a~~~l~ekdvP~-------- 190 (335) T protein:vir:78 121 ELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGL--TAKEAAEKIVRMHRRVVETFIERDLGD-------- 190 (335) T ss_pred HHHHHHHHHHHHHHHhhcccccccccCCCcCCCcceeeeeccc--cccccHHHHHHHHHHHHHHHHhccCCC-------- Confidence 5555433333334433332 1222111 1122222222 222222222355666777788777763 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCc---ccccCcceeEecCCcEEEEeCcchhhhhcCCC Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADA---ATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGA 307 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~---~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa 307 (405) ....-+++++-|..-..|.+ ++.|+.. .|++. ...-+|+|+++.| ||++.+++|-.-.+.+- T Consensus 191 ------~~~~~rv~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~G--v~V~~Sn~lP~~~~t~~ 255 (335) T protein:vir:78 191 ------AVYSEGLTPMSPRVFSLLLE------HDKLMSV-EYQATGATNDYVKSRVAILNG--VKVLETPRFATKAISAH 255 (335) T ss_pred ------CCCCccEEEeChHHHHHHhc------ccccccc-cccccccccccccceeEEeec--eEEEeeccCCCCCCccc Confidence 11123899999999999975 4678887 56532 3357899999965 89999998843211111 Q ss_pred cccCCCcccccccccCCcceee--eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHH Q lcl|NC_020862. 308 TATAANRGYQVSDVAGTDKYDI--APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFF 385 (405) Q Consensus 308 ~~~~t~~~~~~~~~~g~~~~DV--Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~ 385 (405) . -+..+++ .++|. =..+++=++|-+++-++....+ -..|+--|..++=-|.. T Consensus 256 ~---lg~a~n~------~~~d~~~~~~~~~~~~Al~t~~~~~~~~e-----------------~~~~~~~~~~~i~~~~a 309 (335) T protein:vir:78 256 P---LGRHFNV------SAEEAERQIALFLPSKTLITAQVAPVQAK-----------------LWEDHDQFSWVLDTFQM 309 (335) T ss_pred c---ccccCCc------ccccccceEEEEEecceEEEEEEEecccc-----------------eeeccchhhHhhhHHHH Confidence 0 0111111 12222 2456677777777766532211 12334446677778999 Q ss_pred HHHhhccccceEEEEEecCC Q lcl|NC_020862. 386 YGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 386 ~~~~iL~~~~marie~~a~~ 405 (405) |++.+||+|.-+.||+---| T Consensus 310 ~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 310 YNIGARRPDTAGAIELKGIE 329 (335) T ss_pred cCCcccCcceEEEEEecCCC Confidence 99999999999999987777 No 89 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.26 E-value=4.9e-07 Score=55.19 Aligned_cols=290 Identities=12% Similarity=0.113 Sum_probs=154.1 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -.+.++....+.+.+-+.-+ +--+..+.+....+.-.+.+++...+|+. .++++-+....+.+ ....|| + T Consensus 20 ~~~~~~a~~~~~~~~~~~li-P~~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a-~~v~Eg-----~ 89 (324) T protein:vir:10 20 KPQVFNPDNVMMHEKKDGTL-LNDFTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA-YWVGEG-----Q 89 (324) T ss_pred ccceecccceeccCCCccee-chhHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCcce-eEeccC-----c Confidence 12333222122222222222 22345555555666678888888888773 34555544322111 111222 2 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+ .-...++..++.+.++++.++.+|++++. |+..++...+..++.+..+ T Consensus 90 ~~-----------------------------~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~l~~ai~ 139 (324) T protein:vir:10 90 KI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFY 139 (324) T ss_pred cc-----------------------------cccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHH Confidence 21 11223667788999999999999998554 5555788877766665444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . .+| ..+++|.+.-.....-.+. . ........+.+++++|.++...|+.+... ++ T Consensus 140 ~-~~d---~a~l~G~g~~~~~~~i~~~---~-~~~~~~~~~~~t~~~i~~~~~~l~~~~~~-----------------~~ 194 (324) T protein:vir:10 140 K-KFD---EAGILNQGNNPFGKSIAQS---I-EKTNKVIKGDFTQDNIIDLEALLEDDELE-----------------AN 194 (324) T ss_pred H-HHH---HHhhhcCCCCccCcccccc---c-cccceeccccCCHHHHHHHHHhhhhccCC-----------------CC Confidence 3 222 2344444322111110010 0 11112234578999999999999886542 12 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + .++||.....|+.++|..+.|-|.+ +.-+++-|. -++.++.+. T Consensus 195 ~--~v~n~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G~--PV~~~~~~~-------------------- 238 (324) T protein:vir:10 195 A--FISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDGL--PVVNLKSSN-------------------- 238 (324) T ss_pred E--EEEcHHHHHHHHHhhccCCceeecC------------CCCccccce--eEEeecCCC-------------------- Confidence 2 4689999999999998777776642 122445442 233332110 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCC-C--CCCCCCc--cchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGE-A--TADRNDP--YGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~-~--tad~~DP--lgQrg~~gwK--~~~~~~iL~~ 393 (405) .++. .+++|.-+...+++.+ .+.+-+..-+. . +...+.+ +-|++.+.|+ +++++.++++ T Consensus 239 ---~~~~----~~~~gd~~~~~~~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:10 239 ---LKRG----ELITGDFDKLIYGIPQ-------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred ---CCcc----eEEEEecccEEEEEec-------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 0111 2556665555555431 13332221110 0 0111222 2467778887 6789999999 Q ss_pred cceEEEEEecCC Q lcl|NC_020862. 394 ERIAVAYSVIPE 405 (405) Q Consensus 394 ~~marie~~a~~ 405 (405) +-+++|..+.+- T Consensus 305 ~A~~~l~~a~~~ 316 (324) T protein:vir:10 305 KAFAKLVPADKK 316 (324) T ss_pred cceEEEEeccCC Confidence 999999877665 No 90 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.26 E-value=1.9e-07 Score=57.47 Aligned_cols=297 Identities=13% Similarity=0.072 Sum_probs=153.6 Q ss_pred CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCccc Q lcl|NC_020862. 9 AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGNLY 88 (405) Q Consensus 9 ~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gnly 88 (405) =+|.+++-+.-+-+.| .++.+..+++..++.+++...+|+.+. +++-+...-+.+ +.-.+ T Consensus 1 Mat~tt~~g~~vP~~~-~~~ii~~~~~~s~l~~~~~~i~~~~~~---~~~p~~~~~~~a------~wv~E---------- 60 (311) T protein:vir:99 1 MATFGTGNLKNLPRNI-ADGMVKDVVQGSTVAVLSARKPQRFGN---EDIITFNGRPKA------EFVGE---------- 60 (311) T ss_pred CceecCCCceeccHHH-HHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEeCCcee------EEeec---------- Confidence 1222233222222233 456666677788999999998888543 233322111100 01111 Q ss_pred ccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc--cchHHHHHHHHHHHHhhHHHHH Q lcl|NC_020862. 89 GGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD--SDLYGHLSREMLRGANEITEDL 166 (405) Q Consensus 89 ~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d--~~l~~~~~~ell~~~~~~ted~ 166 (405) |....-...++.+++-+.++++.++.+|++++....| .++.+.+..+|.+.-+.-.+. T Consensus 61 -------------------g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~- 120 (311) T protein:vir:99 61 -------------------GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDL- 120 (311) T ss_pred -------------------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHH- Confidence 2222223346677889999999999999997654333 457777776666544433332 Q ss_pred HHHHHhccCceEE---------ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 167 LQADILASADVKV---------FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 167 l~~~ilag~~~v~---------yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .+++|.+... ..+..+..++.+.. +......++..+...+..+++.. T Consensus 121 ---~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~------~~~~~~~~i~~~~~~~~~~~~~~--------------- 176 (311) T protein:vir:99 121 ---GLYHRINPLTGTVIPGWSNYLGAASKRVELTAD------TIANPDLAIEAAVGLLVANGHPT--------------- 176 (311) T ss_pred ---HhhcccCcccCccccccccccccccceeecccc------ccchhHHHHHHHHHHHhhhccCC--------------- Confidence 2334322111 11222222222211 11223355666666666665531 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) +.+ .-++||.+...|+.|+|..+.|-|.|. ...++.|++.| ++.+.+..+..-.+...+. T Consensus 177 -~~~-~~vmn~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G--~Pv~~s~~i~~~~~~~~~~-------- 236 (311) T protein:vir:99 177 -PVN-GLALHPSIAWGLSTARYTDGRKKFPEL--------GLGIGVSSFEG--IDASVSDTVNGGDEADPDD-------- 236 (311) T ss_pred -Ccc-EEEEcHHHHHHHHhhhccCCCeeecCc--------ccCCCCceecc--eeeEeeccccccccccccc-------- Confidence 011 257899999999999998888888654 13445677765 5666665443111111000 Q ss_pred cccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcccc Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGE 394 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~ 394 (405) .....+++ ..+++|+-+ +-.++++ ..+++-+.+-+ ..+..=.|-|+....+| +++++.+++++ T Consensus 237 -~~~~~~~~----~~~~~Gdf~~~~~~~~~-------~~~~~~~~~~~--~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~ 302 (311) T protein:vir:99 237 -EDLDAARA----VRGIVGDFANGIHWGVQ-------RDIPVELIKYG--DPDGQGDLKRHNQIALRLEIVYGWYVFTDR 302 (311) T ss_pred -chhhccCc----ceEEEeeccccEEEEEe-------cCceEEEeecC--CCCcchhhhhcCcEEEEEEEeecceecChh Confidence 00111122 234556532 1123332 11222222211 11111134667777776 78899999999 Q ss_pred ceEEEEEec Q lcl|NC_020862. 395 RIAVAYSVI 403 (405) Q Consensus 395 ~marie~~a 403 (405) +.+..+.+| T Consensus 303 ~v~~~~~~A 311 (311) T protein:vir:99 303 FVVIENAVA 311 (311) T ss_pred HeeeecccC Confidence 999999999 No 91 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.25 E-value=2.3e-07 Score=56.94 Aligned_cols=285 Identities=13% Similarity=0.074 Sum_probs=152.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....++....++++.-+.-+-+ -+.+..+....+...+.+++...+|..+ .+++.+........... +.|+ T Consensus 97 ~~~~~~~~~~~~~~~~g~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~a~~v-----~E~~ 167 (385) T protein:vir:19 97 GAKTFNKSLGSDADSAGSLIQP-MQIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREEVFTNNADVV-----AEKA 167 (385) T ss_pred hhhHHHhhhccccccCCceecc-hhhhHHHHHhhhccchhhhcceecccCc---ceEEEEEecCCcceeee-----ccCc Confidence 1111111111111111111112 2234555555667788888888887643 45554442211111111 1222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) . ..-...++..++.++++++.++.+|+++++ |+. ++.+.+..++.+.- T Consensus 168 ~-----------------------------~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~~i~~~la~a~- 215 (385) T protein:vir:19 168 L-----------------------------KPESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQSYINNRLMYGL- 215 (385) T ss_pred c-----------------------------ccccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHHHHHHHHHHHH- Confidence 2 122223566789999999999999999655 554 47777666665443 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) ...++ ..+++|.++-.-.++-.+.+.. ..........+++++|.++...|+.+.... T Consensus 216 ~~~~d---~~~l~G~g~~~~~~Gi~~~~~~--~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~------------------ 272 (385) T protein:vir:19 216 ALKEE---GQLLNGDGTGDNLEGLNKVATA--YDTSLNATGDTRADIIAHAIYQVTESEFSA------------------ 272 (385) T ss_pred HHHHH---HHHHhccCCCCccccccccccc--ccccccccccchHHHHHHHHHhhccccCCC------------------ Confidence 33333 3456654221110000000000 011112234578999999999887765531 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) =+.+|||.+...|+.|+|..+.|-|-+. ..+..+.+-| +++++++.|- +| T Consensus 273 -~~~~~~~~~~~~l~~lkd~~G~~l~~~~---------~~~~~~~l~G--~pV~~~~~~p----~~-------------- 322 (385) T protein:vir:19 273 -SGIVLNPRDWHNIALLKDNEGRYIFGGP---------QAFTSNIMWG--LPVVPTKAQA----AG-------------- 322 (385) T ss_pred -CEEEEcHHHHHHHHHhhcCCCceeccCc---------ccCCCceecc--eeeEEcCcCC----CC-------------- Confidence 2468899999999999988887777432 3455566755 5777777542 11 Q ss_pred ccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) .+++|. ++|....-+ .+.+-+-. ...| +-+++.+.|+ +++++.++++.-+ T Consensus 323 -----------~~~~gd~~~~~~~~~~~--------~~~v~~~~------~~~~-~~~~~~~~~~~~~r~~~~v~~~~a~ 376 (385) T protein:vir:19 323 -----------TFTVGGFDMASQVWDRM--------DATVEVSR------EDRD-NFVKNMLTILCEERLALAHYRPTAI 376 (385) T ss_pred -----------cEEEeecccEEEEEEec--------ceEEEEec------cccc-hhhcCcEEEEEEEeeccEEecccce Confidence 144554 233322111 12322221 1123 3467778877 5788999999999 Q ss_pred EEEEEecCC Q lcl|NC_020862. 397 AVAYSVIPE 405 (405) Q Consensus 397 arie~~a~~ 405 (405) ++++..+-= T Consensus 377 ~~~~~~aa~ 385 (385) T protein:vir:19 377 IKGTFSSGS 385 (385) T ss_pred EEEEeccCC Confidence 999876555 No 92 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.25 E-value=2.3e-07 Score=56.94 Aligned_cols=285 Identities=13% Similarity=0.074 Sum_probs=152.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....++....++++.-+.-+-+ -+.+..+....+...+.+++...+|..+ .+++.+........... +.|+ T Consensus 97 ~~~~~~~~~~~~~~~~g~~i~~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~a~~v-----~E~~ 167 (385) T protein:vir:18 97 GAKTFNKSLGSDADSAGSLIQP-MQIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREEVFTNNADVV-----AEKA 167 (385) T ss_pred hhhHHHhhhccccccCCceecc-hhhhHHHHHhhhccchhhhcceecccCc---ceEEEEEecCCcceeee-----ccCc Confidence 1111111111111111111112 2234555555667788888888887643 45554442211111111 1222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) . ..-...++..++.++++++.++.+|+++++ |+. ++.+.+..++.+.- T Consensus 168 ~-----------------------------~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~~i~~~la~a~- 215 (385) T protein:vir:18 168 L-----------------------------KPESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQSYINNRLMYGL- 215 (385) T ss_pred c-----------------------------ccccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHHHHHHHHHHHH- Confidence 2 122223566789999999999999999655 554 47777666665443 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) ...++ ..+++|.++-.-.++-.+.+.. ..........+++++|.++...|+.+.... T Consensus 216 ~~~~d---~~~l~G~g~~~~~~Gi~~~~~~--~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~------------------ 272 (385) T protein:vir:18 216 ALKEE---GQLLNGDGTGDNLEGLNKVATA--YDTSLNATGDTRADIIAHAIYQVTESEFSA------------------ 272 (385) T ss_pred HHHHH---HHHHhccCCCCccccccccccc--ccccccccccchHHHHHHHHHhhccccCCC------------------ Confidence 33333 3456654221110000000000 011112234578999999999887765531 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) =+.+|||.+...|+.|+|..+.|-|-+. ..+..+.+-| +++++++.|- +| T Consensus 273 -~~~~~~~~~~~~l~~lkd~~G~~l~~~~---------~~~~~~~l~G--~pV~~~~~~p----~~-------------- 322 (385) T protein:vir:18 273 -SGIVLNPRDWHNIALLKDNEGRYIFGGP---------QAFTSNIMWG--LPVVPTKAQA----AG-------------- 322 (385) T ss_pred -CEEEEcHHHHHHHHHhhcCCCceeccCc---------ccCCCceecc--eeeEEcCcCC----CC-------------- Confidence 2468899999999999988887777432 3455566755 5777777542 11 Q ss_pred ccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) .+++|. ++|....-+ .+.+-+-. ...| +-+++.+.|+ +++++.++++.-+ T Consensus 323 -----------~~~~gd~~~~~~~~~~~--------~~~v~~~~------~~~~-~~~~~~~~~~~~~r~~~~v~~~~a~ 376 (385) T protein:vir:18 323 -----------TFTVGGFDMASQVWDRM--------DATVEVSR------EDRD-NFVKNMLTILCEERLALAHYRPTAI 376 (385) T ss_pred -----------cEEEeecccEEEEEEec--------ceEEEEec------cccc-hhhcCcEEEEEEEeeccEEecccce Confidence 144554 233322111 12322221 1123 3467778877 5788999999999 Q ss_pred EEEEEecCC Q lcl|NC_020862. 397 AVAYSVIPE 405 (405) Q Consensus 397 arie~~a~~ 405 (405) ++++..+-= T Consensus 377 ~~~~~~aa~ 385 (385) T protein:vir:18 377 IKGTFSSGS 385 (385) T ss_pred EEEEeccCC Confidence 999876555 No 93 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.25 E-value=1.9e-07 Score=57.48 Aligned_cols=278 Identities=13% Similarity=0.090 Sum_probs=150.2 Q ss_pred CC-------------ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCC Q lcl|NC_020862. 1 MP-------------HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDD 67 (405) Q Consensus 1 ~~-------------~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~ 67 (405) +. ..-.....+.+++-+.-.-+-.+.........+..++.+++...+|+...|+....+ -..-+. T Consensus 102 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~- 179 (397) T protein:vir:12 102 LRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEK-NADMVP- 179 (397) T ss_pred HhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEE-ecCCcc- Confidence 00 000000111122212111222334443334445678889999999998887633322 111111 Q ss_pred CCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccch Q lcl|NC_020862. 68 LNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDL 147 (405) Q Consensus 68 ~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l 147 (405) .....||-+ +. .....++..|+-+.++++.++.+|++++ -|+..++ T Consensus 180 a~~v~Eg~~-----~~----------------------------~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~l 225 (397) T protein:vir:12 180 FSPVEELGN-----LP----------------------------EIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQAI 225 (397) T ss_pred eeeeccccc-----cc----------------------------ccccccceeEEeeheeeEeeehhhHHHH-hhchHHH Confidence 112223221 10 0011256677899999999999999854 4666677 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccce Q lcl|NC_020862. 148 YGHLSREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTI 226 (405) Q Consensus 148 ~~~~~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~i 226 (405) .+.+..++.+.-+ ..+| ..|++|.+.. .+.+.+++++|.++. ..|+....+ T Consensus 226 ~~~i~~~l~~~~~-~~~d---~~il~G~g~~-------------------~~~g~~~~~~i~~~~~~~l~~~~~~----- 277 (397) T protein:vir:12 226 MTYVAKWFAKKSV-VTRN---NLILAAIASL-------------------KKVDIDGLDGIKKALNVTLDPMVAP----- 277 (397) T ss_pred HHHHHHHHHHHHH-HHHH---HHHHhccccc-------------------cccccccHHHHHHHHhhccchhhhC----- Confidence 7776666664433 3333 3466665321 123567888888765 355543332 Q ss_pred eccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 227 IKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 227 i~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aG 306 (405) .-+-+|||.....|+.|+|.-+.|-|.|-- ..|--++|-| +.+++++..++-.++ T Consensus 278 --------------~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~--------~~g~~~~l~G--~pv~~~~~~~~~~~~- 332 (397) T protein:vir:12 278 --------------GSIVLTNQDGYDWLDTLKDGTGRYLLQPDP--------TNPTKKLLDG--RPVVPFTNRVLKTQK- 332 (397) T ss_pred --------------CCEEEEcHHHHHHHHHhhccCCceeecccc--------cCCCCccccc--eeeEEecccccccCC- Confidence 113579999999999999988888776532 2333455545 455555544321111 Q ss_pred CcccCCCcccccccccCCcceeeeEEEEEccc--cceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH- Q lcl|NC_020862. 307 ATATAANRGYQVSDVAGTDKYDIAPLLVVGDQ--AFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK- 383 (405) Q Consensus 307 a~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~--Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK- 383 (405) + -.+ ++||.- +|- +..+ ..+.+-+. +..+.+-+++...|+ T Consensus 333 ------------------~---~~~-~~~gd~~~~~~-~~~~-------~~~~i~~~-------~~~~~~f~~~~~~~r~ 375 (397) T protein:vir:12 333 ------------------G---KAP-LIIGNLKEAIV-LFDR-------EQQSIAST-------DTGAGAFETNSTKVRG 375 (397) T ss_pred ------------------C---ccE-EEEEehhceEE-EEee-------cceEEEEe-------ccccchhhcCceEEEE Confidence 1 112 457753 232 2221 11222222 223444566777777 Q ss_pred -HHHHHhhccccceEEEEEecC Q lcl|NC_020862. 384 -FFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 384 -~~~~~~iL~~~~marie~~a~ 404 (405) +++.+.++++.-++.+...|- T Consensus 376 ~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 376 IEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEeeccEEecccceEEEEEeeC Confidence 458899999999999999998 No 94 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.24 E-value=4.2e-07 Score=55.55 Aligned_cols=282 Identities=9% Similarity=0.001 Sum_probs=153.4 Q ss_pred CCccccC----cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYND----PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~----~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) +-.+... -..+++++-+. .-+-.+....+....+...+.+++...+|+...|+...++... -........||- T Consensus 105 ~~~~~~~e~~a~~~~t~~~gg~-~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~Eg~- 181 (404) T protein:vir:39 105 MAFLNTVSSKTETSGSDSAAGL-TIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTD-VTPLTVMDAEDG- 181 (404) T ss_pred hhhhhhhhhhhhhcccccCCce-eccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecC-CccceeeecCcc- Confidence 1100000 01111121111 1222444444555556778899999999999988876555221 101111122221 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) ++. .-...++..|+-++++++.++.+|++++ -|+..++...+..++. T Consensus 182 ----~~~----------------------------~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~ 228 (404) T protein:vir:39 182 ----KIP----------------------------DLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSWIA 228 (404) T ss_pred ----ccc----------------------------cccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHHHH Confidence 111 0111356778899999999999999854 5677788888877777 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDT 235 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT 235 (405) +..+.. ++. .|++|.++.. . .-+.+++++|..+.. .++....+ T Consensus 229 ~~~~~~-~d~---~il~g~g~~~------~------------~~~~~~~~~i~~~~~~~~~~~~~~-------------- 272 (404) T protein:vir:39 229 KKVVVT-RNQ---AIIAAMGTVP------K------------KPTIAKFDDVITMINTSVDPAIIA-------------- 272 (404) T ss_pred HHHHHH-HHH---HHHhcccccc------c------------ccccccHHHHHHHHHHhhhhhhcc-------------- Confidence 655443 332 4566643211 0 113467777777654 34333221 Q ss_pred ccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 236 KTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 236 ~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) .=+-+|||.....|+.|+|-.+.|-|.+- +..+-.++|-|-.+.+..+. +...++ T Consensus 273 -----~a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~----~~~~~~-------- 327 (404) T protein:vir:39 273 -----TSSLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKKVIVVADR----WLPNSG-------- 327 (404) T ss_pred -----CCEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceecceeEEEeccc----ccCccC-------- Confidence 01468999999999999998888887643 23444467766444433321 111111 Q ss_pred cccccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcc Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLR 392 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~ 392 (405) .+ -+ .+++|+-. +-.+..+ ..+++-+.. -.+.+.+.+...++ +.+++.+++ T Consensus 328 --------~~---~~-~~~~gd~~~~~~~~~~-------~~~~i~~~~-------~~~~~~~~~~~~~r~~~r~d~~~~~ 381 (404) T protein:vir:39 328 --------ST---VY-PLYYGDMSQAITLFDR-------ENMSLLPTN-------IGAGAFETDTTKIRVIDRFDVKTTD 381 (404) T ss_pred --------CC---cc-EEEEEeccccEEEEee-------cceEEEEec-------cchhhhhhceeeEEEEeeeccEEec Confidence 11 12 24567533 2222222 113333332 23345566667766 678999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_020862. 393 GERIAVAYSVIPE 405 (405) Q Consensus 393 ~~~marie~~a~~ 405 (405) ++-++.++..+.. T Consensus 382 ~~a~~~~~~~~~a 394 (404) T protein:vir:39 382 SEALVAGSFTAIA 394 (404) T ss_pred ccceEEEEeeccc Confidence 9999999955544 No 95 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.22 E-value=2.2e-07 Score=57.13 Aligned_cols=280 Identities=11% Similarity=0.026 Sum_probs=151.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +-..-. ..+++++-+.-.-+..|.+..+....+..++.+++...+|+.+.|+-..+.+...-+. .+..+.|+ T Consensus 101 ~~~~~~--~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~------a~~v~E~~ 172 (395) T protein:vir:38 101 FKNLVT--SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPL------KDLDDESA 172 (395) T ss_pred HHHHHh--hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcc------cccccccc Confidence 000000 0111111122223334555555555567789999999999999997655443322111 11122222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .+. .-...++..|+-+.++++.++.+|+++ .-|++.++.+.+..++.+..+ T Consensus 173 ~~~----------------------------~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~ 223 (395) T protein:vir:38 173 LIG----------------------------DNDDPELTVVKYLIHRYAGITTVTNTL-LKDTVDNIIQWLVNWAAKKDV 223 (395) T ss_pred ccc----------------------------cccccceeeEEeeeeeeEeehhhHHHH-HhhhHHHHHHHHHHHHHHHHH Confidence 211 011235667889999999999999984 457777888887777765444 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) .. ++ ..|++|.+. | .. ..+..++++|.++.- .|+..-.+ T Consensus 224 ~~-~~---~~il~g~g~----~--~~------------~~~~~~~~~i~~~~~~~l~~~~~~------------------ 263 (395) T protein:vir:38 224 VT-RN---AKILEVMGK----A--PK------------KPTISQFDNIKDLENNTLDPAIES------------------ 263 (395) T ss_pred HH-HH---HHHhhcccc----c--cc------------ccccccHHHHHHHHHHhhhhhhcC------------------ Confidence 32 32 345555431 0 00 113456777766543 34322221 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) .-+-+|||.....|+.|+|..+.|-|.|- +..+.-+.|-|- .+++++.+. . +.+ T Consensus 264 -~a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~--pV~~~~~~~--~--~~~----------- 317 (395) T protein:vir:38 264 -TSSFITNQSGYNILSKVKDADGRYLMQPD--------VTSPDKYLIDGK--PVIRIADKW--L--PDV----------- 317 (395) T ss_pred -CCEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceeccc--eeEEecccc--c--CcC----------- Confidence 12347999999999999998888888652 223444566553 444443221 1 110 Q ss_pred cccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH--HHHHhhccccce Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF--FYGFIKLRGERI 396 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iL~~~~m 396 (405) ++. . .++||.-+ +-.+..+ ..+.+-+. +..+.+-+++.++|++ ++.+.+++++-+ T Consensus 318 ----~~~---~-~i~~gd~~~~~~i~~~-------~~~~i~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 375 (395) T protein:vir:38 318 ----SGS---H-PLYFGDLKQGITLFDR-------QQMQIDTT-------NVGAGSFEHDTTKLRFIDRFDVQLIDDGAF 375 (395) T ss_pred ----CCc---c-eEEEEeccccEEEEEe-------cceEEEEe-------ccccchhhcCceEEEEEEeeccEEecccce Confidence 011 1 24677533 2233322 11232232 1233455677777774 489999999999 Q ss_pred EEEEEe--cCC Q lcl|NC_020862. 397 AVAYSV--IPE 405 (405) Q Consensus 397 arie~~--a~~ 405 (405) +.++.. ++| T Consensus 376 ~~~~~~~~~~~ 386 (395) T protein:vir:38 376 AAASFKTVANQ 386 (395) T ss_pred EEEEeecccCC Confidence 998865 344 No 96 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.20 E-value=6.6e-07 Score=54.47 Aligned_cols=296 Identities=16% Similarity=0.044 Sum_probs=153.1 Q ss_pred CCccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) +-. ..+. ..+.+++.+...-+-.|.++.+....+...+.+++.+.+|+.+. +++.+..+...... .-+..+.| T Consensus 110 ~~~-~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~a~~v~Eg 183 (413) T protein:vir:81 110 VKA-ASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTT---IKYLMEKANRVVEG--GFKTVAEG 183 (413) T ss_pred HHh-hhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCc---eeEEEecccccccc--ccceecCc Confidence 111 1111 11112222222223345666677777788888999999887543 44444333222111 00111222 Q ss_pred ccccCCcccccccccccccccccccccccccccccce-eeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGY-TRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~-t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) .... .... ++..|+-+.++++.++.+|++++ -|++ .+...+..++.+. T Consensus 184 ~~~~-----------------------------~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~-~l~~~i~~~la~~ 232 (413) T protein:vir:81 184 GKKP-----------------------------YMRFADFDIVTESLSKIAGLTKITDEMI-EDYD-FLVSYINARLLEE 232 (413) T ss_pred cccc-----------------------------ccCcccceeeEeeeeeEEEeehhhHHHH-HHHH-HHHHHHHHHHHHH Confidence 2221 1111 45667899999999999999955 4665 4777776666544 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) .+ ..+|. .+++|.++-.-. ++-.+.++..+.+.....-.++++.++...+..++... T Consensus 233 ~~-~~~d~---~~l~G~G~~~~~---~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---------------- 289 (413) T protein:vir:81 233 LA-IEEER---QLLLGDGTGNNL---TGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQ---------------- 289 (413) T ss_pred HH-HHHHH---HHhccCCCCCcc---cccccccccccccccccchhHHHHHHHHHHhhhhccCC---------------- Confidence 33 33333 356664321100 01111111111111112234666777765554443321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhh--cCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEK--YADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~--Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) ++ ..++|+.....|+.|+|-.+.|-|.+... +++.. ..-.+.+-| +++++++.|. +| T Consensus 290 -~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~---~~~~~~l~G--~pv~~s~~~~----~~---------- 348 (413) T protein:vir:81 290 -AD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGG---IMLDPAPWG--LRTVQSQVVP----VG---------- 348 (413) T ss_pred -Cc-EEEEcHHHHHHHHHhhccCCceeccccccccccccc---cccCceecc--eeeEEcCCCC----cc---------- Confidence 12 24789999999999999888888876422 22221 222345644 5667666442 11 Q ss_pred ccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGE 394 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~ 394 (405) -++||.-+.+-..+.. +.+.+-+. +..+++-+++.++|+ +++.+.+.+++ T Consensus 349 ---------------~~~~gd~~~~~~~~~~------~~~~v~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~ 400 (413) T protein:vir:81 349 ---------------KPVVGAFRSAASVLRK------GGVRIDST-------NTNVDDFENNLITVRAEERVGLMVTFPE 400 (413) T ss_pred ---------------cEEEEecccEEEEEEe------cceEEEEe-------ccccchhhcCcEEEEEEEeeccEEeccc Confidence 1456765433222210 12333332 224466777778887 57899999999 Q ss_pred ceEEEEEecCC Q lcl|NC_020862. 395 RIAVAYSVIPE 405 (405) Q Consensus 395 ~marie~~a~~ 405 (405) -+++++..++= T Consensus 401 a~~~l~~~~~~ 411 (413) T protein:vir:81 401 AIVQLDVAEVV 411 (413) T ss_pred ceEEEEecCCC Confidence 99988865433 No 97 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.16 E-value=4.8e-07 Score=55.22 Aligned_cols=279 Identities=15% Similarity=0.054 Sum_probs=154.2 Q ss_pred CCcccc--CcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYN--DPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~--~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +....+ ....++++.-+. .-+-.+....+....+...+.+++...+|+.+.. .+-+... ..+.. +.... T Consensus 126 ~~~~~~~~~~~~~~~~~~g~-lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~---~~~~~~~----~~~~a-~~v~E 196 (418) T protein:vir:10 126 RKSIMNVPATVGSGVSGSNS-LVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSI---EYTVETG----FTNNA-AAVAE 196 (418) T ss_pred HHHHHHhhhhccCCCCCCcc-ccchhHHHHHHHHHhhhhhHHhhcceeeccCCce---eEEEEec----CCCce-eeecc Confidence 111111 111122222121 2222344554555556778888888888875543 2222111 11110 11122 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |+.+ .-...++..|+.+.++++.++.+|++++. ++. ++...+..++.+. T Consensus 197 ~~~~-----------------------------~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~-~l~~~i~~~l~~a 245 (418) T protein:vir:10 197 GAQK-----------------------------PTSDLKFNLKNQPVRTIAHLFKASRQILD-DAP-ALQSYIDGRARYG 245 (418) T ss_pred Cccc-----------------------------cccccceeeEEEeeeeEEEeehhhHHHHH-hHH-HHHHHHHHHHHHH Confidence 2222 11223566789999999999999999664 565 6777766666654 Q ss_pred HhhHHHHHHHHHHhccCce-------EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADV-------KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~-------v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) .+. .+| ..+++|.++ ...++..+. .....+..++++|..+...+.....+. T Consensus 246 ~~~-~~d---~a~l~G~g~~~~p~Gi~~~~~~~~~---------~~~~~~~~~~~~i~~~~~~~~~~~~~~--------- 303 (418) T protein:vir:10 246 LQL-TEE---GQILKGDGTGANILGILPQASAFMP---------SITLANATPIDKIRLALLQAVLAEFPA--------- 303 (418) T ss_pred HHH-HHH---HHHhccCCCCccccccccccccccc---------cccccccccHHHHHHHHHhhccccCCC--------- Confidence 433 333 244555432 111111111 111234577889998888776554431 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) =..+|||.+...|+.|+|..+.|-|-+. ..+..|.|-| +.+++++.|. +| T Consensus 304 ----------~~~v~n~~~~~~L~~lkd~~G~~i~~~~---------~~~~~~~l~G--~pV~~~~~~p----~~----- 353 (418) T protein:vir:10 304 ----------TGIVLNPIDWASIELTKDSQGRYIVGNP---------VNGTTPRLWN--LPVVETQAMT----AN----- 353 (418) T ss_pred ----------CEEEEcHHHHHHHHHhhcCCCceecccc---------ccCCCceecc--eeeEEcCCCC----CC----- Confidence 1356899999999999988777767322 3555677755 6777776543 11 Q ss_pred CCcccccccccCCcceeeeEEEEEcccccee-ecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFAT-IGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGF 388 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~-i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~ 388 (405) + +++|.-+... +..+ ..+++.+..- .+.+-+++.+.|+ +++.+ T Consensus 354 ----------------~----~~~gd~s~~~~~~~~-------~~~~i~~~~~-------~~~~f~~~~~~~r~~~~~d~ 399 (418) T protein:vir:10 354 ----------------E----FLVGAFSMAAQIFDR-------MEIEVLLSTE-------NVDDFEKNMVSIRAEERLAL 399 (418) T ss_pred ----------------c----EEEeeccceEEEEEe-------cceEEEEecc-------cchhhhcCceEEEEEEeecc Confidence 1 3556533221 2111 1244433321 3345677778887 46888 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_020862. 389 IKLRGERIAVAYSVIPE 405 (405) Q Consensus 389 ~iL~~~~marie~~a~~ 405 (405) .++++.-+++++..+|= T Consensus 400 ~~~~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 400 AVYRPESFVTGALVEQA 416 (418) T ss_pred EEecccceEEEEeccCC Confidence 99999999999998888 No 98 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.14 E-value=4.1e-07 Score=55.61 Aligned_cols=284 Identities=13% Similarity=0.083 Sum_probs=151.7 Q ss_pred CCccc---cCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIY---NDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y---~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) +.++. .+. ..+++++-+.-+-+ .+..+.+....+..++.+++...+||.+.|+-...+ ...-+. .....|| T Consensus 80 ~~~l~~~~~~a~~~~t~~~gg~~vP~-~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~-~~~~~~-a~~v~Eg-- 154 (371) T protein:vir:81 80 VNHIRTRFRNAMSEGSNQDGGYTVPQ-DIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK-RSQQTG-FVEVAEG-- 154 (371) T ss_pred HHHHHHHHHHhhccCCCccCceeecH-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe-ecCCcc-eeeeccc-- Confidence 11111 011 11111211222222 334555555556788999999999998776533222 111111 1112222 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) +.. +.....++..|+.+.++++.++.+|+++ +-|++.++...+..++. T Consensus 155 ---~~~----------------------------~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~ 202 (371) T protein:vir:81 155 ---AAI----------------------------GEKATPQFTLLQYQVKKYAGFFRVTNEL-LNDSTEAIVNTLVRWIG 202 (371) T ss_pred ---ccc----------------------------ccccccceeeEEeeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHH Confidence 111 1111236777899999999999999985 44666677777777666 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDT 235 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT 235 (405) +..+. .+ -..+++|.+.. .+.+..+++++..+.. .|+...... T Consensus 203 ~a~~~-~~---~~~i~~g~g~~-------------------~~~~~~~~~~i~~~~~~~l~~~~~~~------------- 246 (371) T protein:vir:81 203 DESRV-TR---NGLIINVLNTK-------------------AKTAIADLDGLKQIINVQLDPVFRST------------- 246 (371) T ss_pred HHHHH-HH---HHHHHhhcccc-------------------cccccccHHHHHHHHHhhcchhhhcC------------- Confidence 44332 33 34556654321 0124567778777653 443332210 Q ss_pred ccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 236 KTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 236 ~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) + +.++||.....|+.|+|..+.|-|.|- +-.+-.|.|.| +..++++. +++-..+.. T Consensus 247 -----a-~~vmn~~~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G--~pV~~~~~-~~~~~~~~~------- 302 (371) T protein:vir:81 247 -----S-SVIVNQDAFNWLDTLKDQNGQYLLQPS--------ISSPTGRQLLG--LPVVIVSN-KVLANRVDG------- 302 (371) T ss_pred -----C-EEEEcHHHHHHHHHhhccCCCeeeecc--------cCCCCCceecc--eeEEEecc-cccCccccc------- Confidence 1 357899999999999998888888752 12445567766 45555543 332211110 Q ss_pred cccccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcc Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLR 392 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~ 392 (405) +.+. .-..+++|.=. |-.+..+ ..+++-+..- .+.+-+++.+.|+ .++.+.+++ T Consensus 303 -----~~~~----~~~~i~~Gd~~~~~~~~~~-------~~~~i~~~~~-------~~~~f~~~~v~~~~~~r~d~~~~~ 359 (371) T protein:vir:81 303 -----GTGA----QFAPIIVGDLKEAVVMFDR-------QRTEIMSSNV-------AMDAFETDATLWRAIERMDVKMRD 359 (371) T ss_pred -----cccC----CcceEEEEehhceEEEEee-------cceEEEEecc-------ccchhhcCceEEEEEEeeccEEec Confidence 1111 12336777422 1122211 1233333321 2233456667777 456889999 Q ss_pred ccceEEEEEecC Q lcl|NC_020862. 393 GERIAVAYSVIP 404 (405) Q Consensus 393 ~~~marie~~a~ 404 (405) +.-++.++..+- T Consensus 360 ~~a~~~~~~~~A 371 (371) T protein:vir:81 360 DEAFVFGEVQLA 371 (371) T ss_pred ccceEEEEEecC Confidence 999999998777 No 99 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.13 E-value=7.3e-07 Score=54.23 Aligned_cols=330 Identities=13% Similarity=0.103 Sum_probs=169.9 Q ss_pred ccccCc------CCCccccccc-----ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcc Q lcl|NC_020862. 3 HIYNDP------AAGDASTVGP-----QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVN 71 (405) Q Consensus 3 ~~y~~~------~~t~~~~v~~-----qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l 71 (405) |-+.++ +..+....+. .+...-|..+.|..=+..-++..+-..+.+= .||+++|-|--..-. .-. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~--~Gksv~f~~iG~~t~--~~~ 76 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLK--NGKSLQFIYTGRMTS--SFH 76 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccc--cCceEEEEeeeeeEE--eee Confidence 222222 1111111110 1222234555555545555555555555432 489999886432211 112 Q ss_pred ccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHH Q lcl|NC_020862. 72 DQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHL 151 (405) Q Consensus 72 ~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~ 151 (405) +.|-...|.... |+ ...+++-+|-|.=-|..+=|+.-+.+.+-|+..++ T Consensus 77 t~G~~i~~~~~~---------d~----------------------~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~ 125 (375) T protein:vir:10 77 TPGTPILGNADK---------AP----------------------PVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEI 125 (375) T ss_pred cCCcCcCCcccc---------CC----------------------CCCceEEEecchhhhhhhHhhHHHHhcCchhHHHH Confidence 223322221110 11 01122233333322222222334455566788888 Q ss_pred HHHHHHHHhhHHHHHHHHHHhccC--------ceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccc Q lcl|NC_020862. 152 SREMLRGANEITEDLLQADILASA--------DVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKK 223 (405) Q Consensus 152 ~~ell~~~~~~ted~l~~~ilag~--------~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~ 223 (405) +.++...-+......+-+.+..++ ....+.|+ +.-...++..+.+..+...-++.|+.+...|.++..|. T Consensus 126 s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~Gg-~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~- 203 (375) T protein:vir:10 126 SKKIGYALAEKYDRLIFRSITRGARSASPVSATNFVEPGG-TQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSS- 203 (375) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCc-ceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCC- Confidence 888776555544443444443222 11222221 12111122221111122244677889999999999983 Q ss_pred cceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhh Q lcl|NC_020862. 224 TTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYA 303 (405) Q Consensus 224 T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~ 303 (405) .-+++++.|+.-.-|..-. +.+.|+.. .|+.....-+|.+|+|. .|+++++.++=.-. T Consensus 204 ----------------~~R~~vv~P~~y~~Ll~~~---d~~~~~n~-d~~~~~~~~~g~v~~i~--Gv~V~~Sn~lP~~~ 261 (375) T protein:vir:10 204 ----------------QGRCAVLNPRQYYALIQDI---GSNGLVNR-DVQGSALQSGNGVIEIA--GIHIYKSMNIPFLG 261 (375) T ss_pred ----------------CCCEEEeChHHHHHHHhcC---Cccceeee-cccccceeccceEEEEe--ceEEEEeccccccc Confidence 2388999998877774322 13457776 57777777799999995 48999988766544 Q ss_pred cC----CCcccCCCccc----------ccccccC-CcceeeeE-------EEEEccccceeecceeccCCCCCCceEEEe Q lcl|NC_020862. 304 GA----GATATAANRGY----------QVSDVAG-TDKYDIAP-------LLVVGDQAFATIGLQGMSGKGKSKFRIIVK 361 (405) Q Consensus 304 ~a----Ga~~~~t~~~~----------~~~~~~g-~~~~DVYp-------~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk 361 (405) +- |+....+.+.. ..+.+.| .++|++=. -|+|=++|-|++.+.+...+ + T Consensus 262 ~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~--------~- 332 (375) T protein:vir:10 262 KYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQ--------V- 332 (375) T ss_pred cccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccccc--------c- Confidence 32 11111111110 1111111 12344222 47778888888877643211 1 Q ss_pred cCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 362 KPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 362 ~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) - -+.=++.-|..+.=-|+.+|+.+||+|.-+.|.+.+|- T Consensus 333 -~----~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 333 -T----NGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred -c----cchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 1 12237777888888899999999999999999998776 No 100 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.11 E-value=6.3e-07 Score=54.56 Aligned_cols=283 Identities=13% Similarity=0.089 Sum_probs=148.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhh-hhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEE-MFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~-lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) -+..-+..++.....+.++ .| ++.+.+..+. .++.+++.+.++ +.+..+.+.+...-+.+ +....| T Consensus 107 ~~~~~~~t~~~~g~~~~~~----~~-~~~i~~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~a------~~v~E~ 173 (392) T protein:vir:13 107 APEKRDGTKAGNPNVLSRT----LY-GQLIAQAVERSAIMRGGASTFTT--SDANPMDFTVITGRATA------GIVGET 173 (392) T ss_pred hhhhhcccccCCCcccccc----ch-HHHHHHHHhhhhhhhhcceeeec--CCCceeEEEEEcCCcce------eeeccc Confidence 0111111111111122222 22 4445555554 456677766543 44556666544322211 111222 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) .+ ..-...++..|+-+.++++.++.+|++++ -|++.++...|..++.+.- T Consensus 174 ~~-----------------------------~~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i 223 (392) T protein:vir:13 174 AE-----------------------------IPESYPATTQRSMGGFKYGFASVVSYEFA-TDQVLDLVGFLVSDAGPAI 223 (392) T ss_pred cc-----------------------------ccccccceeeEEeeeeeEEeeehhHHHHH-hcchHHHHHHHHHHHHHHH Confidence 12 11222356667899999999999999854 4666677777777766544 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) +. .++ ..+++|.++..=.|--++....+..... ...+.+++++|.++...|+..... . T Consensus 224 ~~-~~d---~~~l~G~Gt~~p~Gil~~~~~~~~~~~~-~~~~~~~~d~l~~~~~~l~~~~~~-----------------~ 281 (392) T protein:vir:13 224 GD-AMG---RHFLTGTGTGQPRGILTDATGANAAFGE-ADADSKVSDALIDLFHEVPSAYRK-----------------N 281 (392) T ss_pred HH-HHH---HHHhcccCCccccccccccccccccccc-cccccccHHHHHHHHHhhhhhhhc-----------------C Confidence 33 233 3566665542111111111100000111 123568899999988888654321 1 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) ++| ++|+.....|+.|+|..+.|-|.|--+- |.-+.+-| +.++.++.|- + T Consensus 282 a~~--v~n~~~~~~l~~lkd~~G~~l~~~~~~~--------g~~~~l~G--~Pv~~~~~~~----~-------------- 331 (392) T protein:vir:13 282 AKF--VVNDLRAAQMRKLKDANGQYLWQSALTV--------GAPDTFNG--KVVETDDGMP----A-------------- 331 (392) T ss_pred CEE--EEcHHHHHHHHHhhccCCceeecCCcCC--------CCCceecc--eeeEEcCCCC----C-------------- Confidence 123 6799999999999998888888764333 33345544 4555554321 0 Q ss_pred cccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccceE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERIA 397 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ma 397 (405) + .++||+=+.-.++..+ .++ ++. ..|++-+++...++ .++.+.+.+++-++ T Consensus 332 -----~------~i~~Gdf~~~~i~~~~-------~~~--i~~-------~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~ 384 (392) T protein:vir:13 332 -----D------KVLFADLSKYRVRFAG-------SLR--VDR-------SVDAKFSTDQIVYRFLQRADGLLVDARGAK 384 (392) T ss_pred -----C------cEEEeeccceeEEeec-------ceE--EEe-------eccccccCCcEEEEEEEEeccEEecccceE Confidence 0 1456764433444431 122 331 25677666666665 56789999999888 Q ss_pred EEEEecCC Q lcl|NC_020862. 398 VAYSVIPE 405 (405) Q Consensus 398 rie~~a~~ 405 (405) ++++.+-= T Consensus 385 ~~~~~~aa 392 (392) T protein:vir:13 385 VLTVTPAA 392 (392) T ss_pred EEEeeccC Confidence 66543222 No 101 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.10 E-value=1.2e-06 Score=53.10 Aligned_cols=283 Identities=10% Similarity=0.024 Sum_probs=147.2 Q ss_pred CCccccCc----C-CCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MPHIYNDP----A-AGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~~~y~~~----~-~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) ....+-.. . ...++.-+.-.-+..+....+....+...+.+++...+|+.+.|+-...+.- .-.. ...+ T Consensus 103 ~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~-----~~~~ 176 (408) T protein:vir:74 103 NPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWT-DVTP-----LKAM 176 (408) T ss_pred cchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeec-CCcc-----cccc Confidence 00000000 0 0111111112233355554444455667889999999999887765433321 1111 0112 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) .+.|+.+ +.-...++..|+.++++++.++.+|++++ -|+..++..++..++ T Consensus 177 v~E~~~~----------------------------~~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l 227 (408) T protein:vir:74 177 DEEDGKI----------------------------PDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSWI 227 (408) T ss_pred ccccccc----------------------------ccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHHH Confidence 2222221 11112366778999999999999999954 466667888877777 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceeccccccC Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~g 234 (405) .+..+ ..+| ..+++|.++..- .-+.+++++|..+. ..|+.+.... T Consensus 228 ~~~~~-~~~d---~~il~G~G~~~~------------------~~~~~~~~~i~~~~~~~l~~~~~~~------------ 273 (408) T protein:vir:74 228 AKKVV-VTRN---QAIIAAMGTVPK------------------KPTIANFDDVITMINTSVDPAIIAT------------ 273 (408) T ss_pred HHHHH-HHHH---HHHhhccccccc------------------ccccccHHHHHHHHHHhhhhhhcCC------------ Confidence 65443 3333 356666543111 11457788887764 4665443220 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) + +-+|||.+...|+.|+|..+.|-|.|- +..+--++|-|-.+.+..+ ..++ .. T Consensus 274 ------a-~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~-~~~~---~~-------- 326 (408) T protein:vir:74 274 ------S-SLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKQVIVVAD-RWLP---NS-------- 326 (408) T ss_pred ------C-EEEEcHHHHHHHHHhhcCCCceEeccC--------cCCCCCceecceeeEEecC-cccc---cc-------- Confidence 1 346899999999999987777777542 1233334665633333322 1121 00 Q ss_pred ccccccccCCcceeeeEEEEEccccc-eeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhc Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIAPLLVVGDQAF-ATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKL 391 (405) Q Consensus 315 ~~~~~~~~g~~~~DVYp~lV~G~~Af-g~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 391 (405) +.++ ..+++|.-+- -.+..+ +.+.+-+.. ..+..-+++.+.|+ +.+.+.++ T Consensus 327 --------~~~~----~~i~~gd~~~~~~~~~~-------~~~~i~~~~-------~~~~~f~~~~~~~r~~~r~d~~~~ 380 (408) T protein:vir:74 327 --------GSTV----YPLYYGDMSQAITLFDR-------ENMSLLPTN-------IGAGAFETDTTKIRVIDRFDVKAT 380 (408) T ss_pred --------cCCc----ceEEEEehhccEEEEEe-------cceEEEEec-------cccchhhcceeeEEEEEeeCcEEe Confidence 1111 2256775432 122221 113333321 12223345555555 56788999 Q ss_pred cccceEEEEEe--cCC Q lcl|NC_020862. 392 RGERIAVAYSV--IPE 405 (405) Q Consensus 392 ~~~~marie~~--a~~ 405 (405) +++-++.++.. +++ T Consensus 381 ~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:74 381 DSEALVAGSFTAIADQ 396 (408) T ss_pred cccceEEEEeecccCC Confidence 99999888853 333 No 102 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.09 E-value=1.6e-06 Score=52.37 Aligned_cols=282 Identities=10% Similarity=0.044 Sum_probs=145.1 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-.+-..++++.-+.-+-+ -+....+....+..++.+++...+|+-+.|+...++. ..-+.. .-..||- T Consensus 99 ~~~~~~~~~~~t~~~gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a-~~v~E~~----- 170 (392) T protein:vir:10 99 DDLEQRAMSGLTGEDGGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-SDMIPF-AEITEMG----- 170 (392) T ss_pred hhhhhhhccccccCCCceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-cCCccc-eeecccc----- Confidence 0000011111111111111212 2233333334455778889999999988887544431 111110 0122221 Q ss_pred cccCCccccccccccccccccccccccccccccc-ceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRV-GYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~-~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +. ... ..++..|+-+.++++.++.+|+++ +-|++.++...|..++.+. T Consensus 171 ~~-----------------------------~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~- 219 (392) T protein:vir:10 171 EI-----------------------------PETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK- 219 (392) T ss_pred cc-----------------------------cccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH- Confidence 11 111 125667789999999999999985 4566667888776666543 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) ....++. .+++|.+. ..+.+..++++|.++. ..|+....+. T Consensus 220 i~~~~d~---~~~~g~g~-------------------~~~~~~~~~d~i~~~~~~~l~~~~~~~---------------- 261 (392) T protein:vir:10 220 SKVTRNV---LILGVIEK-------------------LTKQAIKSLDDIKDVLNVKLDPAISPN---------------- 261 (392) T ss_pred HHHHHHH---HHhhcccc-------------------ccccCccCHHHHHHHHHHhhhhhhccC---------------- Confidence 3344433 33444321 0112457788887765 4555544321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + .-++||.+...|+.|+|-.+.|-|.|= + -.+.-++|-|..+.+++...+....+.+ T Consensus 262 -a--~~vm~~~~~~~L~~lkd~~G~~l~~~~--~------~~~~~~tllG~~~v~~~~~~~~~~~~~~------------ 318 (392) T protein:vir:10 262 -A--ILLTNQDGFNYLDKLKDKDGKYILQSD--P------TQKNKKLFAGTNPVVVVSNRFLKSKGTT------------ 318 (392) T ss_pred -C--EEEEcHHHHHHHHHhhccCCCeEeecC--c------cCCccccccCcccEEEecccccCCCccc------------ Confidence 1 247899999999999998888888652 1 2344455655323322222223221110 Q ss_pred ccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) . +.++ +++|.=+ |-.+..+ ..+++-+. +-.+.+-+++.+.++ +++++.+++++- T Consensus 319 -----~---~~~~-~~~gdfs~~~~i~~~-------~~~~~~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 375 (392) T protein:vir:10 319 -----A---KKAP-LIIGDLKEAIVLFKR-------EDMELAST-------DVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) T ss_pred -----C---CceE-EEEEehhceEEEEee-------cceEEEEe-------ccccchhhcCceEEEEEEeeccEEecccc Confidence 1 1222 4566422 2222222 11333332 123445566666666 558889999999 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) ++.+...... T Consensus 376 ~~~l~~~~~a 385 (392) T protein:vir:10 376 AVYGEIDLSA 385 (392) T ss_pred eEEEEecccc Confidence 9997664333 No 103 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.09 E-value=1.6e-06 Score=52.37 Aligned_cols=282 Identities=10% Similarity=0.044 Sum_probs=145.1 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-.+-..++++.-+.-+-+ -+....+....+..++.+++...+|+-+.|+...++. ..-+.. .-..||- T Consensus 99 ~~~~~~~~~~~t~~~gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a-~~v~E~~----- 170 (392) T protein:vir:10 99 DDLEQRAMSGLTGEDGGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-SDMIPF-AEITEMG----- 170 (392) T ss_pred hhhhhhhccccccCCCceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-cCCccc-eeecccc----- Confidence 0000011111111111111212 2233333334455778889999999988887544431 111110 0122221 Q ss_pred cccCCccccccccccccccccccccccccccccc-ceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRV-GYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~-~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +. ... ..++..|+-+.++++.++.+|+++ +-|++.++...|..++.+. T Consensus 171 ~~-----------------------------~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~- 219 (392) T protein:vir:10 171 EI-----------------------------PETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK- 219 (392) T ss_pred cc-----------------------------cccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH- Confidence 11 111 125667789999999999999985 4566667888776666543 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) ....++. .+++|.+. ..+.+..++++|.++. ..|+....+. T Consensus 220 i~~~~d~---~~~~g~g~-------------------~~~~~~~~~d~i~~~~~~~l~~~~~~~---------------- 261 (392) T protein:vir:10 220 SKVTRNV---LILGVIEK-------------------LTKQAIKSLDDIKDVLNVKLDPAISPN---------------- 261 (392) T ss_pred HHHHHHH---HHhhcccc-------------------ccccCccCHHHHHHHHHHhhhhhhccC---------------- Confidence 3344433 33444321 0112457788887765 4555544321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + .-++||.+...|+.|+|-.+.|-|.|= + -.+.-++|-|..+.+++...+....+.+ T Consensus 262 -a--~~vm~~~~~~~L~~lkd~~G~~l~~~~--~------~~~~~~tllG~~~v~~~~~~~~~~~~~~------------ 318 (392) T protein:vir:10 262 -A--ILLTNQDGFNYLDKLKDKDGKYILQSD--P------TQKNKKLFAGTNPVVVVSNRFLKSKGTT------------ 318 (392) T ss_pred -C--EEEEcHHHHHHHHHhhccCCCeEeecC--c------cCCccccccCcccEEEecccccCCCccc------------ Confidence 1 247899999999999998888888652 1 2344455655323322222223221110 Q ss_pred ccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) . +.++ +++|.=+ |-.+..+ ..+++-+. +-.+.+-+++.+.++ +++++.+++++- T Consensus 319 -----~---~~~~-~~~gdfs~~~~i~~~-------~~~~~~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 375 (392) T protein:vir:10 319 -----A---KKAP-LIIGDLKEAIVLFKR-------EDMELAST-------DVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) T ss_pred -----C---CceE-EEEEehhceEEEEee-------cceEEEEe-------ccccchhhcCceEEEEEEeeccEEecccc Confidence 1 1222 4566422 2222222 11333332 123445566666666 558889999999 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) ++.+...... T Consensus 376 ~~~l~~~~~a 385 (392) T protein:vir:10 376 AVYGEIDLSA 385 (392) T ss_pred eEEEEecccc Confidence 9997664333 No 104 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.09 E-value=1.6e-06 Score=52.37 Aligned_cols=282 Identities=10% Similarity=0.044 Sum_probs=145.1 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-.+-..++++.-+.-+-+ -+....+....+..++.+++...+|+-+.|+...++. ..-+.. .-..||- T Consensus 99 ~~~~~~~~~~~t~~~gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a-~~v~E~~----- 170 (392) T protein:vir:10 99 DDLEQRAMSGLTGEDGGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-SDMIPF-AEITEMG----- 170 (392) T ss_pred hhhhhhhccccccCCCceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-cCCccc-eeecccc----- Confidence 0000011111111111111212 2233333334455778889999999988887544431 111110 0122221 Q ss_pred cccCCccccccccccccccccccccccccccccc-ceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRV-GYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~-~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +. ... ..++..|+-+.++++.++.+|+++ +-|++.++...|..++.+. T Consensus 171 ~~-----------------------------~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~- 219 (392) T protein:vir:10 171 EI-----------------------------PETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK- 219 (392) T ss_pred cc-----------------------------cccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH- Confidence 11 111 125667789999999999999985 4566667888776666543 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) ....++. .+++|.+. ..+.+..++++|.++. ..|+....+. T Consensus 220 i~~~~d~---~~~~g~g~-------------------~~~~~~~~~d~i~~~~~~~l~~~~~~~---------------- 261 (392) T protein:vir:10 220 SKVTRNV---LILGVIEK-------------------LTKQAIKSLDDIKDVLNVKLDPAISPN---------------- 261 (392) T ss_pred HHHHHHH---HHhhcccc-------------------ccccCccCHHHHHHHHHHhhhhhhccC---------------- Confidence 3344433 33444321 0112457788887765 4555544321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + .-++||.+...|+.|+|-.+.|-|.|= + -.+.-++|-|..+.+++...+....+.+ T Consensus 262 -a--~~vm~~~~~~~L~~lkd~~G~~l~~~~--~------~~~~~~tllG~~~v~~~~~~~~~~~~~~------------ 318 (392) T protein:vir:10 262 -A--ILLTNQDGFNYLDKLKDKDGKYILQSD--P------TQKNKKLFAGTNPVVVVSNRFLKSKGTT------------ 318 (392) T ss_pred -C--EEEEcHHHHHHHHHhhccCCCeEeecC--c------cCCccccccCcccEEEecccccCCCccc------------ Confidence 1 247899999999999998888888652 1 2344455655323322222223221110 Q ss_pred ccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) . +.++ +++|.=+ |-.+..+ ..+++-+. +-.+.+-+++.+.++ +++++.+++++- T Consensus 319 -----~---~~~~-~~~gdfs~~~~i~~~-------~~~~~~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 375 (392) T protein:vir:10 319 -----A---KKAP-LIIGDLKEAIVLFKR-------EDMELAST-------DVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) T ss_pred -----C---CceE-EEEEehhceEEEEee-------cceEEEEe-------ccccchhhcCceEEEEEEeeccEEecccc Confidence 1 1222 4566422 2222222 11333332 123445566666666 558889999999 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) ++.+...... T Consensus 376 ~~~l~~~~~a 385 (392) T protein:vir:10 376 AVYGEIDLSA 385 (392) T ss_pred eEEEEecccc Confidence 9997664333 No 105 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.09 E-value=1.6e-06 Score=52.37 Aligned_cols=282 Identities=10% Similarity=0.044 Sum_probs=145.1 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-.+-..++++.-+.-+-+ -+....+....+..++.+++...+|+-+.|+...++. ..-+.. .-..||- T Consensus 99 ~~~~~~~~~~~t~~~gg~~vP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a-~~v~E~~----- 170 (392) T protein:vir:10 99 DDLEQRAMSGLTGEDGGLVIPQ-DIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-SDMIPF-AEITEMG----- 170 (392) T ss_pred hhhhhhhccccccCCCceecch-hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-cCCccc-eeecccc----- Confidence 0000011111111111111212 2233333334455778889999999988887544431 111110 0122221 Q ss_pred cccCCccccccccccccccccccccccccccccc-ceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRV-GYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~-~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) +. ... ..++..|+-+.++++.++.+|+++ +-|++.++...|..++.+. T Consensus 171 ~~-----------------------------~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~- 219 (392) T protein:vir:10 171 EI-----------------------------PETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK- 219 (392) T ss_pred cc-----------------------------cccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH- Confidence 11 111 125667789999999999999985 4566667888776666543 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) ....++. .+++|.+. ..+.+..++++|.++. ..|+....+. T Consensus 220 i~~~~d~---~~~~g~g~-------------------~~~~~~~~~d~i~~~~~~~l~~~~~~~---------------- 261 (392) T protein:vir:10 220 SKVTRNV---LILGVIEK-------------------LTKQAIKSLDDIKDVLNVKLDPAISPN---------------- 261 (392) T ss_pred HHHHHHH---HHhhcccc-------------------ccccCccCHHHHHHHHHHhhhhhhccC---------------- Confidence 3344433 33444321 0112457788887765 4555544321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + .-++||.+...|+.|+|-.+.|-|.|= + -.+.-++|-|..+.+++...+....+.+ T Consensus 262 -a--~~vm~~~~~~~L~~lkd~~G~~l~~~~--~------~~~~~~tllG~~~v~~~~~~~~~~~~~~------------ 318 (392) T protein:vir:10 262 -A--ILLTNQDGFNYLDKLKDKDGKYILQSD--P------TQKNKKLFAGTNPVVVVSNRFLKSKGTT------------ 318 (392) T ss_pred -C--EEEEcHHHHHHHHHhhccCCCeEeecC--c------cCCccccccCcccEEEecccccCCCccc------------ Confidence 1 247899999999999998888888652 1 2344455655323322222223221110 Q ss_pred ccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccc Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGER 395 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~ 395 (405) . +.++ +++|.=+ |-.+..+ ..+++-+. +-.+.+-+++.+.++ +++++.+++++- T Consensus 319 -----~---~~~~-~~~gdfs~~~~i~~~-------~~~~~~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 375 (392) T protein:vir:10 319 -----A---KKAP-LIIGDLKEAIVLFKR-------EDMELAST-------DVGGKAFTRNTLDLRAIQRDDVQMWDNEA 375 (392) T ss_pred -----C---CceE-EEEEehhceEEEEee-------cceEEEEe-------ccccchhhcCceEEEEEEeeccEEecccc Confidence 1 1222 4566422 2222222 11333332 123445566666666 558889999999 Q ss_pred eEEEEEecCC Q lcl|NC_020862. 396 IAVAYSVIPE 405 (405) Q Consensus 396 marie~~a~~ 405 (405) ++.+...... T Consensus 376 ~~~l~~~~~a 385 (392) T protein:vir:10 376 AVYGEIDLSA 385 (392) T ss_pred eEEEEecccc Confidence 9997664333 No 106 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.09 E-value=1.6e-06 Score=52.38 Aligned_cols=289 Identities=10% Similarity=0.061 Sum_probs=154.1 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccC Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAG 84 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~ 84 (405) -++...+ .+.+-| +.+ ..+.++...+.-++.+++...+||.+. +++.++..-+. ..-..|| +++ T Consensus 1 ma~~t~~-~G~lip---~~~-~~~ii~~l~~~s~i~~l~~~~~~~~~~---~~~p~~~~~~~-a~wv~Eg-----~~~-- 64 (300) T protein:vir:95 1 MSEAQLS-KGNLFN---PEL-VTKVINKVKGHSSIAKLSPQKPIPFNG---QREFVFDFDSD-IDIVAEN-----GKK-- 64 (300) T ss_pred CcccccC-Ccceec---hhh-HHHHHHHHHhhhhhhhhcceeeccCCc---eEEEEEecCcc-eEEeeCC-----ccc-- Confidence 2221111 222223 233 466666677778888999999998863 23333222111 1112222 221 Q ss_pred CcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc--cchHHHHHHHHHHHHhhH Q lcl|NC_020862. 85 GNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD--SDLYGHLSREMLRGANEI 162 (405) Q Consensus 85 gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d--~~l~~~~~~ell~~~~~~ 162 (405) .....++.+++.+.++++.++.+|++++....| .++.+.+..++.+..+. T Consensus 65 ---------------------------~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~- 116 (300) T protein:vir:95 65 ---------------------------THGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLAR- 116 (300) T ss_pred ---------------------------ccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHH- Confidence 122246677788999999999999996643322 35677766666543332 Q ss_pred HHHHHHHHHhccCc------eEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 163 TEDLLQADILASAD------VKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 163 ted~l~~~ilag~~------~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) .+|. .+++|.+ .-........... ..+...+...++++|.++...|...+... T Consensus 117 ~~d~---~~l~G~~~~~g~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 175 (300) T protein:vir:95 117 GLDI---MSIHGINPRTKQASTIIGDNCFDKKV----TQTVPFKDTNPDESMEDAVGMIDGSERDI-------------- 175 (300) T ss_pred HHHH---hhhhcccCCCCCCccccccccccccc----ceeecccccchHHHHHHHHHHhhhcCCCc-------------- Confidence 2222 2333311 1000000000000 00111234577899999998888755431 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) + +.++||.....|+.|+|..+.|-|.+.. ..+.-+++-| ++++.++.+.. ++ + T Consensus 176 ---~--~~vmn~~~~~~L~~lkd~~G~~i~~~~~--------~~~~~~~l~G--~Pv~~s~~v~~----~~----~---- 228 (300) T protein:vir:95 176 ---T--GAILDPIFTTALSKMKNAEGGKLYPELA--------WGGVPDAING--LAVDKNRTVSY----SQ----T---- 228 (300) T ss_pred ---c--EEEECHHHHHHHHHhhccCCCeeccCcc--------ccCCCceecc--eeeEEecCCCC----CC----C---- Confidence 1 3578999999999999988877775331 2344567766 57777765421 11 0 Q ss_pred ccccccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRG 393 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~ 393 (405) ..+ .++++|+=+-+ .++++ ..+++-+..-+.. -++.--|-|..-++++ +.+++.++++ T Consensus 229 -------~~~----~~~~~GDf~~~~~~~~~-------~~~~~~v~~~~~~-d~~~~~~f~~~~v~~r~~~r~d~~v~~~ 289 (300) T protein:vir:95 229 -------DPK----NTAIVGDFETMFKWGYA-------KEVPMEIIKYGDP-DNSGRDLKGYNQIYIRCEAYIGWGIMDA 289 (300) T ss_pred -------CCc----cEEEEeeccceEEEEEe-------cccEEEEeeccCC-CCcchhhhhcCcEEEEEEEeecceeecc Confidence 011 23556652211 23333 1133334432210 0111124566667777 3678899999 Q ss_pred cceEEEEEecC Q lcl|NC_020862. 394 ERIAVAYSVIP 404 (405) Q Consensus 394 ~~marie~~a~ 404 (405) +.+++|.-++= T Consensus 290 ~a~~~l~~~~g 300 (300) T protein:vir:95 290 ASFARIVKTGG 300 (300) T ss_pred cceEEEecCCC Confidence 99999988888 No 107 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.07 E-value=7e-07 Score=54.32 Aligned_cols=281 Identities=16% Similarity=0.095 Sum_probs=148.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +....+.-..++++.-+.-+-+.+ .+..+....+...+.+++...+++.+. +++.+...-.... ...+.|+ T Consensus 106 ~~~~~~~~~~~~~~~~g~~~~~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~a-----~~v~Eg~ 176 (390) T protein:vir:81 106 IKAALNTASTDAAGSAGALTTPNR-LPGFITPPDARLTVRDLIGSGRTDSAL---IEYVQETGFVNNA-----AIVAEGA 176 (390) T ss_pred HHHHHHhhccccccCCcceechhh-hHHHHHHHhhhhhhhhhcceeeccCCc---eEEEEEecCCcce-----eeecCCc Confidence 111111111122222222222333 344444455567888889888887543 4444332211111 1112222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ++ .....++..++.++++++.++.+|++++. |+. ++...+..++.+..+ T Consensus 177 ~~-----------------------------~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~~~~~i~~~l~~~~~ 225 (390) T protein:vir:81 177 LK-----------------------------PESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QLASYMNNRLIRGLK 225 (390) T ss_pred cc-----------------------------ccccceeeEEEEeeeEEEEeehhhHHHHH-hHH-HHHHHHHHHHHHHHH Confidence 22 11223667789999999999999999654 554 577776666664443 Q ss_pred hHHHHHHHHHHhccCceEE-ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKV-FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~-yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) . .++ ..+++|.+.-. ..|--+. +... ..........++++|..+...|.....+. T Consensus 226 ~-~~d---~a~l~G~g~~~~~~Gi~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------- 281 (390) T protein:vir:81 226 V-KED---AEILRGTGANDGLLGLIPQ-ATTY--AAPTTIAGATRVDQLRLAMLQASLAEYNP----------------- 281 (390) T ss_pred H-HHH---HHHHhcCCCCCcccceeec-cccc--ccccccccchhHHHHHHHHHhhccccCCC----------------- Confidence 3 333 34566533211 1110000 0000 00111224577888988888887665531 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + .-++||.....|+.|+|-.+.|-|-+. ..+..+.+-| +.++.++.|- +| T Consensus 282 -~-~~v~~~~~~~~l~~lkd~~G~~l~~~~---------~~~~~~~l~G--~pv~~~~~~p----~~------------- 331 (390) T protein:vir:81 282 -S-GIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWG--LPVVATQAMA----PG------------- 331 (390) T ss_pred -C-EEEEcHHHHHHHHHhhcCCCceeecCc---------ccccCceecc--eeeEEcCCCC----CC------------- Confidence 1 347899999999999987777777542 2334456655 4666666432 11 Q ss_pred cccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) .+++|.-+.+ .+..+ ..+.+-+. +.+.+-+++.+.|+ .++.+.+++++-+ T Consensus 332 ------------~~~~gd~~~~~~~~~~-------~~~~v~~~--------~~~~~~~~~~v~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:81 332 ------------EFLVGAFDLAAQIFDQ-------WDARVEIG--------YVGEDFQRNMITVLAEERLALVVYRPEAL 384 (390) T ss_pred ------------cEEEEehhceEEEEEe-------cceEEEEe--------cccchhhcCcEEEEEEEeeccEEecccce Confidence 1345654322 12111 11233222 12235566667765 6788999999999 Q ss_pred EEEEEe Q lcl|NC_020862. 397 AVAYSV 402 (405) Q Consensus 397 arie~~ 402 (405) +++..+ T Consensus 385 v~~t~a 390 (390) T protein:vir:81 385 ISGSFA 390 (390) T ss_pred EEEEeC Confidence 999999 No 108 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=98.05 E-value=2.6e-06 Score=51.17 Aligned_cols=296 Identities=12% Similarity=0.097 Sum_probs=135.5 Q ss_pred cccceee---hhhhhHHHHHhhhhhhhhcccccc----ccCcCCCCEEEEEecccCCCCCCccc--cCCCcccccccCCc Q lcl|NC_020862. 16 VGPQFNV---HYWDRKSLIDEAEEMFFSPLADNK----QMPKHFGKELKVFYYVPLLDDLNVND--QGLDATGASYAGGN 86 (405) Q Consensus 16 v~~qm~t---~y~~~k~L~~a~p~lv~~~fA~~~----~mPKn~GktIkfrry~pl~~~~t~l~--eGvtp~g~~~~~gn 86 (405) +.+++-+ --|.+.+|.--++.||+.++.... ....++|.||++|+-.+.....-... .++++. T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~-------- 72 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKN-------- 72 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccC-------- Confidence 3333322 248889998888999988875432 22346899999986555433222111 112221 Q ss_pred ccccccccccccccccccccccccccccceeeeeEEEEeeee-eeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHH Q lcl|NC_020862. 87 LYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEY-GFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITED 165 (405) Q Consensus 87 ly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qy-G~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted 165 (405) .|+| ..++.+|.|. .+-.+++|+...++.. ++-+ .++.|..---+ T Consensus 73 ----------------dl~e------------~~v~l~id~~k~va~~v~d~E~~~~i~-~~~~-----~l~~A~~aLA~ 118 (423) T protein:vir:10 73 ----------------NLIS------------GKATGRVGNYITVAVEYQQLEEAIKLN-QLEE-----ILAPVRQRIVT 118 (423) T ss_pred ----------------cccc------------ceeEEEeeceeeeeeeechHHHhcChh-hHHH-----HHHHHHHHHHH Confidence 1222 2334555433 3456788866544433 2222 23333322222 Q ss_pred HHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEE Q lcl|NC_020862. 166 LLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAY 245 (405) Q Consensus 166 ~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~ 245 (405) .+-.+|++-. .+.... + .+...++. =.++++.++.+.|+++++|+ .-|.++ T Consensus 119 ~vd~~ia~~~-----~~~~~~-~--~gt~~t~~----~a~~~i~~a~~~Ld~~~vP~-----------------~~R~~V 169 (423) T protein:vir:10 119 DLETELAHFM-----MNNGAL-S--LGSPNTPI----TKWSDVAQTASFLKDLGVNE-----------------GENYAV 169 (423) T ss_pred HHHHHHHHHH-----hhcccc-c--cccCCccc----chHHHHHHHHHHHHhccCCc-----------------CCCEEE Confidence 2222332211 110000 0 01111111 13789999999999999995 138889 Q ss_pred EcccchHHHHHHhcccCCCcceehhhcCCcccccCcce-eEecCCcEEEEeCcchhhhhcCCCccc--CCCccccc---- Q lcl|NC_020862. 246 IGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEI-GAIPGAHLRIVVVPQMMHYAGAGATAT--AANRGYQV---- 318 (405) Q Consensus 246 ~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEI-Gsi~g~n~Rfv~~p~~~~~~~aGa~~~--~t~~~~~~---- 318 (405) +.|+....|.. ++.+.-..+-+..+.+-+++| |++.| |.+.++..+-.-. +|+... ....+..+ T Consensus 170 v~p~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~G--Fdv~~Snnip~~T-~gt~~~t~~~~~~~~v~~~a 240 (423) T protein:vir:10 170 MDPWSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGG--IRALMSNGLASRT-QGAFGGTLTVKTQPTVTYNA 240 (423) T ss_pred eChHHHHHHhc------cccceecccccchhhhhhccceeeecc--eEEEEeCCCcccc-ccccccceeeeecceecccc Confidence 99999888752 234544545566677888887 99966 8999887666321 121110 00011111 Q ss_pred -------ccccCCcceeeeEEEEEccccceeecceecc---C------CCCCCceEEEecCCCCCCCC-CCccchhhhHH Q lcl|NC_020862. 319 -------SDVAGTDKYDIAPLLVVGDQAFATIGLQGMS---G------KGKSKFRIIVKKPGEATADR-NDPYGKVGFSS 381 (405) Q Consensus 319 -------~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~---~------~g~~~~~~ivk~pG~~tad~-~DPlgQrg~~g 381 (405) ..+....-...|..|..|+ .|.--|+..-. + +-...++..|..... ++. +| .. T Consensus 241 ~~~a~~~~~~~~~~~~~~~~~l~~GD-~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~--~~~~g~-------~t 310 (423) T protein:vir:10 241 VKDSYQFTVTLTGATASVTGFLKAGD-QVKFTNTYWLQQQTKQALYNGATPISFTATVTADAN--SDSGGD-------VT 310 (423) T ss_pred ccccceeeeeeeeccccccCceeecc-eEEecceeeecccccccccccccCcceEEEEEeeee--eccCCc-------ee Confidence 1111111234577777776 55544443210 0 000123333332110 000 00 00 Q ss_pred HHHHHHHhhccc----cceEEEEEecCC Q lcl|NC_020862. 382 IKFFYGFIKLRG----ERIAVAYSVIPE 405 (405) Q Consensus 382 wK~~~~~~iL~~----~~marie~~a~~ 405 (405) .|.+= + ++.. .+=.+ +++|. T Consensus 311 v~i~p-~-~i~~~~~~~~~~v--~a~~a 334 (423) T protein:vir:10 311 VTLSG-V-PIYDTTNPQYNSV--SRQVE 334 (423) T ss_pred eeccC-c-cccccCCcccccc--ccccc Confidence 11110 0 0000 00000 01111 No 109 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.03 E-value=1.2e-06 Score=53.08 Aligned_cols=281 Identities=15% Similarity=0.093 Sum_probs=150.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) .-...+....++++.-+.-+-+ .+.+..+....+...+.+++...+|+.+ ++++.+........ ...+.|+ T Consensus 106 ~~~~~~~~~~~~~~~~g~lip~-~~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~a-----~~v~Eg~ 176 (390) T protein:vir:97 106 IKAALNTASTDAAGSAGALTTP-NRLPGFITPPDARLTVRDLIGSGRTDSA---LIEYVQETGFVNNA-----AIVAEGA 176 (390) T ss_pred HHHHHHhhhcccccccccccch-hhhHHHHHHHhhhhhhHhhcceeeccCC---ceEEEEEecCCcce-----eeecCCc Confidence 1111122122222222222222 3345555555566777788888888743 34444333211111 1112222 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ++ .-...++..++.++++++.++++|++++. |++ ++...+..++.+..+ T Consensus 177 ~~-----------------------------~~~~~~~~~i~~~~~k~~~~~~is~ell~-ds~-~l~~~i~~~la~a~~ 225 (390) T protein:vir:97 177 LK-----------------------------PESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QLASYMNNRLIRGLK 225 (390) T ss_pred cc-----------------------------cccccceeEEEEeeeeEEEeehhhHHHHH-hHH-HHHHHHHHHHHHHHH Confidence 22 11123567789999999999999999654 564 577766666654433 Q ss_pred hHHHHHHHHHHhccCceEE-ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKV-FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~-yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) . .+| ..+++|.+.-. -.|-.+. +..... ........++++|..+...|+.+..+. T Consensus 226 ~-~~d---~a~l~G~g~~~~p~Gi~~~-~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~~~----------------- 281 (390) T protein:vir:97 226 V-KED---AEILRGTGANDGLLGLIPQ-ATTYAA--PTTIAGATRVDQLRLAMLQASLAEYPA----------------- 281 (390) T ss_pred H-HHH---HHHhhcCCCCccccceeec-cccccc--cccccccchHHHHHHHHHhhccccCCC----------------- Confidence 3 333 34555533211 0110000 000000 001123567788888888877666541 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) =..++||.....|+.|+|-.+.|-|-+. ..+.-+.+.| +.+++++.|. +| T Consensus 282 --~~~v~n~~~~~~L~~lkd~~G~~l~~~~---------~~~~~~~l~G--~pV~~~~~~~----~~------------- 331 (390) T protein:vir:97 282 --SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWG--LPVVATQAMA----PG------------- 331 (390) T ss_pred --CEEEEcHHHHHHHHHhhcCCCceeecCc---------cCCCCceecc--eeeEEcCCCC----CC------------- Confidence 1357899999999999987777766542 1333456655 4666666432 11 Q ss_pred cccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) .+++|.-+.+ .+..+ ..+.+.+. +.+++-+++.+.|+ +++...+++++-+ T Consensus 332 ------------~~~~gd~~~~~~~~~~-------~~~~i~~~--------~~~~~f~~~~~~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:97 332 ------------EFLVGAFDLAAQIFDQ-------WDARVEIG--------YVNDDFQRNMVTVLAEERLALVVYRPEAL 384 (390) T ss_pred ------------cEEEEeccceEEEEEe-------cceEEEEe--------ecccccccCcEEEEEEEeeccEEeccccE Confidence 1355653321 12221 11222222 23356678888888 6899999999999 Q ss_pred EEEEEe Q lcl|NC_020862. 397 AVAYSV 402 (405) Q Consensus 397 arie~~ 402 (405) +.++.+ T Consensus 385 v~~~~a 390 (390) T protein:vir:97 385 ITGSFA 390 (390) T ss_pred EEEEeC Confidence 999999 No 110 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.01 E-value=1.2e-06 Score=52.95 Aligned_cols=282 Identities=11% Similarity=0.054 Sum_probs=148.9 Q ss_pred CCccc--cC--c--CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccC Q lcl|NC_020862. 1 MPHIY--ND--P--AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y--~~--~--~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eG 74 (405) ..+.+ .. . ..++.+.-+. .-+.-+....+....+...+.+++...+|+.+.|+....+.- +.++. -. T Consensus 103 ~~~~~~~~~~~~a~~~~t~~~gg~-~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-a~ 175 (408) T protein:vir:10 103 NPMAFMNTVSSKTETSGSDSAAGL-TIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT-----DVTPL-TV 175 (408) T ss_pred cchhhhhhhhhhhhhcccccCCce-eccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc-----ccccc-ee Confidence 00000 00 0 1111111111 112233444555555667889999999999999875433311 11111 11 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSRE 154 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~e 154 (405) ..+.|+++. .-...++..|+-+.++++.++.+|+++ +-|+..++...+..+ T Consensus 176 ~v~E~~~~~----------------------------~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~ 226 (408) T protein:vir:10 176 MDAEDGKIP----------------------------DLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSW 226 (408) T ss_pred eecCccccc----------------------------cccCcceeeEEeeeeeEEeeehhHHHH-HhhchHHHHHHHHHH Confidence 112222221 111125667889999999999999985 446666777776666 Q ss_pred HHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHH-HHHHhccCccccceecccccc Q lcl|NC_020862. 155 MLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLS-ITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 155 ll~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~-~~Lk~nrApk~T~ii~gs~~~ 233 (405) +.+..+ .+++ ..|++|.+.. +. .-+..++++|..+. ..|+....+ T Consensus 227 l~~~~~-~~~~---~~il~g~g~~------~~------------~~~~~~~~~l~~~~~~~~~~~~~~------------ 272 (408) T protein:vir:10 227 IAKKVV-VTRN---QAIIEVMKAA------PK------------KPTIAKFDDVITMINTAVDPAIIA------------ 272 (408) T ss_pred HHHHHH-HHHH---HHHhhccccc------cc------------ccccccHHHHHHHHHHhhhhhhcc------------ Confidence 654333 3333 3456664321 00 11457788887765 445443322 Q ss_pred CcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCC Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAAN 313 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~ 313 (405) .=+-+||+.+...|+.|+|..+.|-|.|- . -.+..++|-|-.+.++.+. +.++.+ T Consensus 273 -------~a~~v~n~~~~~~l~~lkd~~G~~i~~~~--~------~~~~~~~l~G~PV~~~~~~----~~~~~~------ 327 (408) T protein:vir:10 273 -------TSSLLTNQSGLNKLALVKTAEGKYLLEPD--P------TKPNSYLIKGKQVIVVADR----WLPNTG------ 327 (408) T ss_pred -------CCEEEEcHHHHHHHHHhhccCCceEeccC--c------CCCCCceecceeeEEeccc----ccCccC------ Confidence 01357999999999999998888888652 1 2345567766433333221 111111 Q ss_pred cccccccccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 314 RGYQVSDVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 314 ~~~~~~~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 390 (405) .+ .++ +++|+=+ |-.+..+ ..+++-+.. -....-+++...++ +++.+.+ T Consensus 328 ----------~~---~~~-i~~gd~~~~~~~~~~-------~~~~v~~~~-------~~~~~f~~~~~~~r~~~r~d~~v 379 (408) T protein:vir:10 328 ----------ST---VYP-LYYGDMSQAITLFDR-------ENMSLLPTN-------IGAGAFETDTTKIRVIDRFDVKA 379 (408) T ss_pred ----------CC---ceE-EEEEehhccEEEEEe-------cceEEEEcc-------cccchhhcCceEEEEEEeeccEE Confidence 11 223 5677533 2223222 112322221 01122356677777 4589999 Q ss_pred ccccceEEEEEec--CC Q lcl|NC_020862. 391 LRGERIAVAYSVI--PE 405 (405) Q Consensus 391 L~~~~marie~~a--~~ 405 (405) ++++-++.++..+ |+ T Consensus 380 ~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 380 TDSEALVAGSFSAIADQ 396 (408) T ss_pred eccccEEEEEeeccccC Confidence 9999999998665 45 No 111 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.00 E-value=4.5e-06 Score=49.88 Aligned_cols=301 Identities=12% Similarity=0.031 Sum_probs=142.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ...-.+..-.+.++.-+.-+-+-.+.+..+....+..++.++. .+.+|-..|. +++-+...-+.+ ..-+.|. T Consensus 124 ~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~-~~~v~~~~~~-~~~p~~~~~~~a------~~v~E~~ 195 (435) T protein:vir:80 124 FGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-ITIPRLKGGAIV------GYIGADT 195 (435) T ss_pred hhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhcc-ceeeecCCCc-eEEEEEeCCcce------eeeccCc Confidence 1111111111111111111222234444444445556666762 3345555553 444333322111 1111221 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhh-ccchHHHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDT-DSDLYGHLSREMLRGA 159 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~-d~~l~~~~~~ell~~~ 159 (405) .+ .....++..|+.+.++++.++.+|++++.... +.++.+.|..++.+.. T Consensus 196 ~~-----------------------------~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~ 246 (435) T protein:vir:80 196 DI-----------------------------PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAI 246 (435) T ss_pred cc-----------------------------cccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHH Confidence 11 11123566788999999999999998654322 3356666666665433 Q ss_pred hhHHHHHHHHHHhccCceE-----EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 160 NEITEDLLQADILASADVK-----VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v-----~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) ...++. .+++|.++- ..+.+....+....... .......++.+++..|+.+.... T Consensus 247 -~~~~d~---a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~----~~~~~~~d~~~~~~~~~~~~~~~------------ 306 (435) T protein:vir:80 247 -GAREDK---AFIRDDGTANTPKGLRFWALPGNVITASDGS----TLQKIETDLGKAILALENADANL------------ 306 (435) T ss_pred -HHHHHH---HhhccCCCCCcccceeecccccceeeccccc----chhhHHHHHHHHHHHhhcccccc------------ Confidence 333333 345554321 01111111111111111 01233567888888887765421 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) -.++| ++||.....|+.|+|..+.|-|-.. + =|++.| +.+++++.|-.-.+. T Consensus 307 ---~~~~~--vmn~~~~~~L~~lkd~~G~~l~~~~---~---------~~~l~G--~pv~~~~~~p~~~~~--------- 358 (435) T protein:vir:80 307 ---TQPGW--IMAPRTFRFLEGLRDGNGNKVYPEL---A---------NGMLKG--YPVGKTTQVPINLGE--------- 358 (435) T ss_pred ---ccCEE--EEcHHHHHHHHhhhccCCceeccCC---C---------CCeEee--eeeEEeccccccccC--------- Confidence 01233 6899999999999998888877321 1 145655 567777655321111 Q ss_pred ccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCC-CCC-CCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPG-EAT-ADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 315 ~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG-~~t-ad~~DPlgQrg~~gwK--~~~~~~i 390 (405) ++++ ..+++|+=++..++.+ +.+++-+..-+ ... ...--.+-|++.+.|+ +++.+.+ T Consensus 359 --------~~~~----~~i~~gd~s~~~i~~~-------~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~ 419 (435) T protein:vir:80 359 --------AGKE----SEIYFTDFGDVFIGEE-------ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGP 419 (435) T ss_pred --------CCCc----ceEEEEEcccEEEEee-------cceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEe Confidence 1111 1356776555555543 23454444322 100 0111245666777777 5567777 Q ss_pred ccccceEEEEEecCC Q lcl|NC_020862. 391 LRGERIAVAYSVIPE 405 (405) Q Consensus 391 L~~~~marie~~a~~ 405 (405) .+++-++.|.-+.== T Consensus 420 ~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 420 RHVESIAVLSGVAWG 434 (435) T ss_pred ecccceEEEeccCCC Confidence 777766655432111 No 112 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=97.96 E-value=9.1e-06 Score=48.21 Aligned_cols=317 Identities=11% Similarity=0.059 Sum_probs=170.0 Q ss_pred CCcccc--CcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYN--DPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~--~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) |+-+=| .|.-...+..- .+...-|.-+.+..-+...++..+-.+|.+ ..||++.|-|--..-.. -.+.|-.+. T Consensus 1 ms~~~~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~~~~--~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNVEAK--GRRAGEELE 75 (335) T ss_pred CCCcccchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeee--ccceeEEEeeeeeeeee--cccCCcCcC Confidence 544311 11111111111 344455677777777778888889999998 45999998865433221 122222222 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |+... +.| . .|+++=..|-... + |..-+...+=|+.++++.|++.. T Consensus 76 ~~~~~--------------~~k----------------~--~itVD~ll~a~~~-I-~dlDe~~~~yDvRse~s~e~G~a 121 (335) T protein:vir:63 76 RSRVV--------------NDK----------------W--NLTVDTLLYLRHQ-F-DHQDEWTQSFDMRKEVAELDGQE 121 (335) T ss_pred CCCcc--------------ccc----------------e--EEEecceeechhh-h-hhHHHHhcCchhHHHHHHHHHHH Confidence 22111 001 1 1112211122111 1 22233445557788888887765 Q ss_pred HhhHHHHHHHHHHhccCce---EE-ecC---CCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADV---KV-FTG---AATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~---v~-yag---~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) =+.....-+-+.|..++.. +- .+| +-+..+.+++..+ +.+-.--.+-++.+...|.++..|. T Consensus 122 LA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~--~~~~~~l~~a~~~a~~~L~e~dVP~--------- 190 (335) T protein:vir:63 122 LARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTA--KQAADKIVRMHRRVVETFIDRDLGD--------- 190 (335) T ss_pred HHHHHHHHHHHHHHhhccccCccccCCCcCCCcceeeeeccCcc--cccHHHHHHHHHHHHHHHHhccCCC--------- Confidence 5554333333333333321 11 121 2233334444322 2121122245667888888888762 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCc---ccccCcceeEecCCcEEEEeCcchhhhhcCCCc Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADA---ATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT 308 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~---~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~ 308 (405) +.-.-+++++.|+.-..|.+ ++.|+.+ .|++. ...-+|+|+++.| |+++.+++|-.-.+.+-. T Consensus 191 -----~~~~dr~~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~G--v~V~~sn~lP~~~~t~~~ 256 (335) T protein:vir:63 191 -----AVYSEGLTPMSPRVFSLLLE------HDKLMNV-EYQATGATNDYVKSRVAILNG--VKVLETPRFATKAIAAHP 256 (335) T ss_pred -----cccCceEEEeChHHHHHHhc------ccccccc-ccccccccccccCceeEEeec--eEEEeeccCCCCCccccc Confidence 11113899999999999965 4679888 67643 3357899999965 899999988432221111 Q ss_pred ccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHH Q lcl|NC_020862. 309 ATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGF 388 (405) Q Consensus 309 ~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~ 388 (405) . +.++++.... ..=...+++-++|-+++-++....+ --.|+--|..++=-|..|++ T Consensus 257 l---g~a~n~~~~d----~~~~~~~~~~~~Al~t~~~~~vt~e-----------------~~~~~~~~~~~i~~~~a~G~ 312 (335) T protein:vir:63 257 L---GRHFNVSAEE----SERQIALFLPSKTLITAQVAPVQAK-----------------LWEDNEKFSWVLDTFQMYNI 312 (335) T ss_pred c---cccCCccccc----cceeEEEEEecceEEEEEEeecccc-----------------eeeccchhhHHhHHHHHcCC Confidence 0 1112111111 1124688888888888877632211 11333346677778999999 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_020862. 389 IKLRGERIAVAYSVIPE 405 (405) Q Consensus 389 ~iL~~~~marie~~a~~ 405 (405) .+||++.-+.||+---- T Consensus 313 g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:63 313 GARRPDTAGAIELKGIG 329 (335) T ss_pred cccccceEEEEEEcCCC Confidence 99999999999973221 No 113 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.91 E-value=9.9e-07 Score=53.50 Aligned_cols=281 Identities=12% Similarity=0.026 Sum_probs=149.9 Q ss_pred CC--cc----ccCcCCCcccccccceeehhhhhHHHHHhhh-hhhhhccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MP--HI----YNDPAAGDASTVGPQFNVHYWDRKSLIDEAE-EMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~--~~----y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p-~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) +. +. -..-..++.+.-+.-+-+-+ ..+.+++..+ ..++.+++.+.++. .+..+++.+...-+.+ T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~-~~~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~p~~~~~~~a------ 167 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTL-YGQLIAQAVERSAIMRGGATTFTTS--DANPLDFTVITGRSSA------ 167 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCccccccc-hHHHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCCcce------ Confidence 00 00 00000111111111111112 2556666654 34667788876653 3333444433211111 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) +..+.| +...-...++..|+-+.++++.++.+|++++. |+.-++...+.. T Consensus 168 ~wv~E~-----------------------------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~~ 217 (390) T protein:vir:62 168 SIVGET-----------------------------AEIPESYPATAQRSMGGFKYGFASVVSYEFAT-DQVLDLVGFLVS 217 (390) T ss_pred eeeccc-----------------------------ccccccccceeeeEeeeeeEEeehHHHHHHHh-hhhHHHHHHHHH Confidence 111122 12222233566788999999999999999654 566577777777 Q ss_pred HHHHHHhhHHHHHHHHHHhccCce----EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADV----KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKG 229 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~----v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~g 229 (405) ++.+.-+ ..+|. .+++|.+. +-.....+.. ++.. ..+.+++++|.++...|+...... T Consensus 218 ~l~~~i~-~~~d~---~~l~G~G~p~Gi~~~~~~~~~~--~~~~-----~~~~~~~~~l~~~~~~l~~~~~~~------- 279 (390) T protein:vir:62 218 DAGPAIG-DAMGR---HFITGTGQPRGILTDASPATAT--FLAT-----DTDSKVSDALIDLFHEVPSAYRAN------- 279 (390) T ss_pred HHHHHHH-HHHHh---hhhccCCccccccccccccccc--eecc-----cccccchHHHHHHHHhhhhhhhcC------- Confidence 6665443 33333 46676542 1111111111 1111 124689999999888886543321 Q ss_pred ccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcc Q lcl|NC_020862. 230 SRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATA 309 (405) Q Consensus 230 s~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~ 309 (405) + +-+||+.+...|+.|+|..++|-|.|--.- |.-+.|-| +.+++++.+- T Consensus 280 ----------a--~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~--------g~~~~l~G--~Pv~~~~~~p--------- 328 (390) T protein:vir:62 280 ----------A--KYVVNDLRAAQMRKLKDANGQYLWQSGLTV--------GAPSLFNG--KVVETDDGMP--------- 328 (390) T ss_pred ----------C--EEEEchHHHHHHHHhhccCCCeeecCCcCC--------Cccceecc--cceEEecCCC--------- Confidence 1 347899999999999988888878664333 33344544 3444443221 Q ss_pred cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH--HHH Q lcl|NC_020862. 310 TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF--FYG 387 (405) Q Consensus 310 ~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~ 387 (405) .-+ ++||+=++..++.++ .+. ++. ..|++-+++.+.+++ .+. T Consensus 329 -------------------~~~-i~~gd~s~~~i~~~~-------~~~--v~~-------~~~~~~~~~~~~~~~~~r~d 372 (390) T protein:vir:62 329 -------------------ADK-ILFADLSKYRVRFAG-------SLR--VDR-------SVDAKFSTDQIVYRFLQRAD 372 (390) T ss_pred -------------------Ccc-EEEeeccceeEEeec-------ceE--EEe-------eccccccCCcEEEEEEEEeC Confidence 001 457875555565542 122 332 246776777777664 489 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.+++++-++.+++.+-= T Consensus 373 ~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 373 GLLVDARGAKVLTVTPGA 390 (390) T ss_pred cEeechhheEEEEeecCC Confidence 999999999888865544 No 114 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.89 E-value=4e-06 Score=50.17 Aligned_cols=276 Identities=9% Similarity=-0.027 Sum_probs=149.9 Q ss_pred CCccccC--cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYND--PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~--~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +-....+ -..+++++.+.-+-+. |...-+........+.+++.+.++.-+ ++++-+..-... ..-+. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~ip~~-~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~----~~~~~--- 166 (379) T protein:vir:10 98 GKSIQVKAVGDMTLPVNLTGAQPKD-YNFDVVLNPSQMLNVSDIVGAVSISGG---TYTFVRENGAGE----GAIGA--- 166 (379) T ss_pred hhhhhhhhhcccccCCCCccccchh-hhhHHHHhHHhhhhHHhhceeeeccCC---ceEEEEeecCCC----ccccc--- Confidence 1111111 1223333333333443 344444444456777888887777533 333332111000 00000 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) ...|+.......++..|+-++++|+.++.+|++++. |+. .+...+..+|.+. T Consensus 167 --------------------------v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-D~~-~l~~~i~~~la~~ 218 (379) T protein:vir:10 167 --------------------------QVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMAN-NLP-FLTSFIPNALRRD 218 (379) T ss_pred --------------------------ccCCccccccccceeeeEeeeeeEEeeehhhHHHHh-hHH-HHHHHHHHHHHHH Confidence 112333334445778889999999999999999654 554 4666666665543 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) ....++.....++++.++ ... .+ .....++++|.++.-.|..+..+ T Consensus 219 -~~~~~~~~~~~g~~~~~~--------~~~-~~-------~~~~~~~d~i~~~~~~~~~~~~~----------------- 264 (379) T protein:vir:10 219 -YAKAENAAFNAVLAANAT--------AST-EI-------ITNKNKVEMLINEIAKQENLDFP----------------- 264 (379) T ss_pred -HHHHHHHHHhcccccccc--------ccc-cc-------ccCcccHHHHHHHHHhhhhccCC----------------- Confidence 344555543333322110 000 00 11235678888887777655432 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceeh--hhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPV--EKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGY 316 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v--~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~ 316 (405) .-+.++||.+...|+.|+|..+.|-|.|- .+.+.+. ++-| +++++++.|. +| T Consensus 265 --~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~--------~l~G--~pvv~s~~~~----ag---------- 318 (379) T protein:vir:10 265 --VTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVL--------RING--IPLFRATWLA----AN---------- 318 (379) T ss_pred --CCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcc--------eecc--eeeEecCCCC----CC---------- Confidence 12466899999999999998888877652 1222222 3434 5777776543 11 Q ss_pred ccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcccc Q lcl|NC_020862. 317 QVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGE 394 (405) Q Consensus 317 ~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~ 394 (405) + ++||+-+...+.+. ..+.+-+..- -.| +-+++.+.|+ .-+++.+++++ T Consensus 319 -----------~----~~~gdf~~~~~~~~-------~~~~i~~~~~------~~~-~f~~~~~~~r~~~R~~~~v~~p~ 369 (379) T protein:vir:10 319 -----------K----YYVGDWTRVTKVTT-------EGLSLEFSEV------EGT-NFVKNNITARIEAQVALAVEQPA 369 (379) T ss_pred -----------c----eEEeecccEEEEEE-------eceEEEEeec------ccc-cccCCcEEEEEEEEeccEEecCc Confidence 1 35777666555443 1123333321 122 3457777777 36789999999 Q ss_pred ceEEEEEecC Q lcl|NC_020862. 395 RIAVAYSVIP 404 (405) Q Consensus 395 ~marie~~a~ 404 (405) -++.++..+= T Consensus 370 a~v~~~~~~~ 379 (379) T protein:vir:10 370 ALIFGDFTAV 379 (379) T ss_pred cEEEEEecCC Confidence 9999988877 No 115 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=97.88 E-value=1e-05 Score=47.99 Aligned_cols=283 Identities=11% Similarity=0.015 Sum_probs=145.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhh-hhh------hccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEE-MFF------SPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~-lv~------~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) |+ ++.++.-+++..| -.++....++ .-| ...++...+-..-|.+|.+=.|..|.-+....++ T Consensus 1 MA----------~T~lsd~i~PEvf-~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~ 69 (351) T protein:vir:15 1 MA----------ETHLSDLIVPEVF-GNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTD 69 (351) T ss_pred CC----------ceeeeeeechhHH-HHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCC Confidence 43 2344444444333 2233322222 223 3344444444467999999999988555555566 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) |-+-...++ +...-.+.++++|.=+++||.+...- -.|.++++.. T Consensus 70 ~~~i~~~ki----------------------------------tt~~~~a~i~~~~kg~~~tD~a~~~s-g~dp~~~i~~ 114 (351) T protein:vir:15 70 SDDIDVNNL----------------------------------TSGKQQGIKFYQTKAYGYTDLGTMIS-GAPVQETIGN 114 (351) T ss_pred Ccccchhee----------------------------------cccceeEEEEeeccceehhhhhHhhc-cchHHHHHHH Confidence 543222222 22344688999999999999876654 4477887666 Q ss_pred HHHHHHhhHHHHHHHHHHhccCceEEe----cCCCccceeeecccccccCCceecHHHHHHHHHHHHhc-cCccccceec Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADVKVF----TGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDN-YTPKKTTIIK 228 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~v~y----ag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~n-rApk~T~ii~ 228 (405) .+..--+ +..|.+|++-..-++. +...+.+. +.. +. ....++.+.|-++...|-.. ... T Consensus 115 q~a~~w~----~~~q~~lla~l~gv~~~~~~~~~~~~d~--t~~--~~-~~~~is~~~l~~A~~~~GD~~~~~------- 178 (351) T protein:vir:15 115 RFAAFWQ----RADQKTLLSVLKGVMGVTKIANSKVYDQ--TKV--SP-SEPMFGAKGFTGAIGLMGDLQDTA------- 178 (351) T ss_pred HHHHHHH----HHHHHHHHHHHHHHhhchhhcccceecc--ccc--cc-cccccCHHHHHHHHHHhccccccc------- Confidence 6554333 3444444432111111 11111111 111 11 23469999999999998664 332 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT 308 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~ 308 (405) -.+.+|||.....|+++ +++.-.+|.+. +.+||.+-| .|+|.++-| |..+.+ T Consensus 179 ------------~~~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~t~~G--~~VivdD~~-p~~~~~-- 230 (351) T protein:vir:15 179 ------------FGAIAVNSATYSLMKVQ-------GLIETIQPQNG----ATPFEAYNG--LRIVLDDDI-EIDLTD-- 230 (351) T ss_pred ------------eEEEEEChHHHHHHHhh-------hhhhhcccccc----Ccccceecc--eEEEEcCCC-ccccCC-- Confidence 26778899999999965 36666678776 457999966 688887643 222111 Q ss_pred ccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHH Q lcl|NC_020862. 309 ATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGF 388 (405) Q Consensus 309 ~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~ 388 (405) +..++|-..+||+.|++... + .-+.+ +.+.|... +..|=|-+|.. |.. T Consensus 231 ----------------~~~~~ytsyl~~~GAi~~~~--~-----~~~ve-~~rd~~~~--~g~d~l~~r~~------~~~ 278 (351) T protein:vir:15 231 ----------------KTKPVSTSYIFAPGAVRYST--N-----MRSTE-TKYDPLIN--GGQDVIVQKRV------GTI 278 (351) T ss_pred ----------------CCCceeEEEEEecceeeeec--C-----CcCcc-eeecccCC--CCceEEEEeee------eee Confidence 11358999999999988421 1 11122 22322211 11222222221 111 Q ss_pred hhccccceEE---EEEecCC Q lcl|NC_020862. 389 IKLRGERIAV---AYSVIPE 405 (405) Q Consensus 389 ~iL~~~~mar---ie~~a~~ 405 (405) -++=--|-.- -...+|- T Consensus 279 hp~G~s~~~~~~~~~~~sPt 298 (351) T protein:vir:15 279 HVAGTSIKASFSPSKASFPT 298 (351) T ss_pred eeeeeeecccccccCcCCcC Confidence 1111111000 0011233 No 116 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.88 E-value=8.3e-06 Score=48.44 Aligned_cols=292 Identities=13% Similarity=0.050 Sum_probs=146.6 Q ss_pred CCcccc-CcCCCc---ccccccceeehhhhhHHHHHhh-hhhhhhccccccccCcCCCCEEEEEecccCC--CCCCcccc Q lcl|NC_020862. 1 MPHIYN-DPAAGD---ASTVGPQFNVHYWDRKSLIDEA-EEMFFSPLADNKQMPKHFGKELKVFYYVPLL--DDLNVNDQ 73 (405) Q Consensus 1 ~~~~y~-~~~~t~---~~~v~~qm~t~y~~~k~L~~a~-p~lv~~~fA~~~~mPKn~GktIkfrry~pl~--~~~t~l~e 73 (405) ...... ....++ ...+.|+ . ..+.++... ...++.+++...++..+ ++++-+..... ....-+.- T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p~----~-~~~~i~~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~a 187 (419) T protein:vir:94 116 NRLLSRDAPAGTITNPNVPHLPQ----L-VPGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGTAGAGSTWNKA 187 (419) T ss_pred HHhhccccccccccCCcccccch----h-hhHHHHHHHhhhhhhhhcceeeeccCC---ceeeeeeccccccccccCccc Confidence 000000 111111 1122333 2 133333332 24566777777776543 34443322111 10000111 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSR 153 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ 153 (405) +..+. |+..+....++..|+.++++++.++.+|++++. |+. ++...|.. T Consensus 188 ~~v~E-----------------------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~i~~ 236 (419) T protein:vir:94 188 AVVPE-----------------------------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DNS-QLMGYIQG 236 (419) T ss_pred ceecC-----------------------------CccccccccceeeEEeeeeeEEEeehhhHHHHH-hHH-HHHHHHHH Confidence 11122 222333345777889999999999999999765 553 45555555 Q ss_pred HHHHHHhhHHHHHHHHHHhccCceEEecCCCccc--eeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 154 EMLRGANEITEDLLQADILASADVKVFTGAATSM--VTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 154 ell~~~~~~ted~l~~~ilag~~~v~yag~ats~--~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) ++.+.. ...+ -..+++|.++-.-.|-.+.. .+...............+++|.++...|.....+. T Consensus 237 ~la~a~-~~~~---d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~--------- 303 (419) T protein:vir:94 237 RLTYGL-RFLR---DRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP--------- 303 (419) T ss_pred HHHHHH-HHHH---HHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCC--------- Confidence 544322 2222 34556665554333322111 01111111222234567899999988887665531 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) + +.+|||.....|+.++|.-+.+-|... . .-.+-.+.|-| +.++.++.|. +| T Consensus 304 ---------~-~~v~n~~~~~~l~~~k~~~~~~~~~~~--~-----~~~~~~~~l~G--~pV~~~~~~~----~~----- 355 (419) T protein:vir:94 304 ---------D-GVVVHPQDWESIELDQAPGSGVFRVIA--N-----VQGEATPRIWG--LNVVSTVAIA----QG----- 355 (419) T ss_pred ---------C-EEEEcHHHHHHHHHHhhcCCCceeecC--C-----cccCCCccccc--eeeEEcCCCC----Cc----- Confidence 1 568999999999999875555443321 1 12333456655 5667766442 00 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHh Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFI 389 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~ 389 (405) + ++||.-...-..+.. ..+.+-+- +..+.+-+++...|+ +++.+. T Consensus 356 ----------------~----~~~gd~~~~~~~~~~------~~~~v~~~-------~~~~~~~~~~~~~~r~~~r~d~~ 402 (419) T protein:vir:94 356 ----------------T----ALVGGFRQGATLWSR------QGITVLMT-------DSHADFFTANTLVILAEFRANLA 402 (419) T ss_pred ----------------c----EEEeeccceEEEEEe------cceEEEEe-------ccccchhhcCcEEEEEEEeeccE Confidence 1 345654433222211 11333222 112334455666666 678899 Q ss_pred hccccceEEEEEecCC Q lcl|NC_020862. 390 KLRGERIAVAYSVIPE 405 (405) Q Consensus 390 iL~~~~marie~~a~~ 405 (405) +++++-++++++.+.= T Consensus 403 v~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 403 VYQPKAFVRVTFAAAT 418 (419) T ss_pred EeccccEEEEEeccCC Confidence 9999999999987766 No 117 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=97.86 E-value=1.4e-05 Score=47.17 Aligned_cols=283 Identities=12% Similarity=0.060 Sum_probs=144.4 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhh-hhhhhc------cccccccCc--CCCCEEEEEecccCCCCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAE-EMFFSP------LADNKQMPK--HFGKELKVFYYVPLLDDLNVN 71 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p-~lv~~~------fA~~~~mPK--n~GktIkfrry~pl~~~~t~l 71 (405) |+ ++.++.-+++..| -.++....+ ..-|-| .++...+-. -.|.+|.+=.|..|.-+.... T Consensus 1 MA----------~T~lsd~i~peVf-~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v 69 (324) T protein:vir:59 1 MA----------YTKISDVIVPELF-NPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVL 69 (324) T ss_pred CC----------ceeeeceechhHH-HHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCccccc Confidence 43 2344444444333 233333333 233433 344444332 248999988888884444444 Q ss_pred ccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHH Q lcl|NC_020862. 72 DQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHL 151 (405) Q Consensus 72 ~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~ 151 (405) ++|-+-+..++ +.....+.+.+.|.=++.||++... +-.+.++++ T Consensus 70 ~~~~~i~~~~l----------------------------------~t~~~~a~i~~~~k~~~~tD~a~~~-sg~dp~~~i 114 (324) T protein:vir:59 70 NDTDDLVPQKI----------------------------------NAGQDKAVLILRGNAWSSHDLAATL-SGSDPMQAI 114 (324) T ss_pred CCCcccchhhc----------------------------------ccceeeEEEEeecCceeehhhhhhh-ccchHHHHH Confidence 55543222222 2334468888899889999987665 444777776 Q ss_pred HHHHHHHHhhHHHHHHHHHHhccCce-EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_020862. 152 SREMLRGANEITEDLLQADILASADV-KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGS 230 (405) Q Consensus 152 ~~ell~~~~~~ted~l~~~ilag~~~-v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs 230 (405) ...+.. --....|.++++...- +-.+..++....+++. .+..+|.+.|-++...|-++... T Consensus 115 ~~q~a~----~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~-----~~~~~s~~~l~~A~~~~GD~~~~--------- 176 (324) T protein:vir:59 115 GSRVAA----YWAREMQKIVFAELAGVFSNDDMKDNKLDISGT-----ADGIYSAETFVDASYKLGDHESL--------- 176 (324) T ss_pred HHHHHH----HHHHHHHHHHHHHHHHhhhccccccceeeeecc-----ccceecHHHHHHHHHHhCCcccC--------- Confidence 555543 3344455555543221 1112223333333332 24679999999999998776543 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT 310 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~ 310 (405) -.+.++||....+||++. ++.-.+|.+. ..+||.+-| .|+|++.-|- -. T Consensus 177 ----------~~~ivmhS~v~~~L~~~~-------li~~~~~s~~----~~~i~~~~G--~~VivdD~~p-~~------- 225 (324) T protein:vir:59 177 ----------LTAIGMHSATMASAVKQD-------LIEFVKDSQS----GIRFPTYMN--KRVIVDDSMP-VE------- 225 (324) T ss_pred ----------cEEEEEchHHHHHHHHhh-------hhhhcccccc----Cceeeeecc--cEEEEeCCCC-cc------- Confidence 378999999999999752 4544567665 457899866 5777765332 10 Q ss_pred CCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhh Q lcl|NC_020862. 311 AANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIK 390 (405) Q Consensus 311 ~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~i 390 (405) .. .+..++|-.+++|+.|++...-+ -++.+- -..||.+-.-..--+..|..-+ T Consensus 226 ---------~~--~~~~~~y~s~l~~~GAi~~~~~~-------~~v~vE---------~dRd~~~g~~~l~~r~~~~~~p 278 (324) T protein:vir:59 226 ---------TL--EDGTKVFTSYLFGAGALGYAEGQ-------PEVPTE---------TARNALGSQDILINRKHFVLHP 278 (324) T ss_pred ---------cc--CCCCceEEEEEEecCeEEEeecC-------CCccee---------cccCccccceEEEEeeEEEeEe Confidence 01 12235999999999998775432 111110 1122321100000000000000 Q ss_pred ccccceE-EEEEecCC Q lcl|NC_020862. 391 LRGERIA-VAYSVIPE 405 (405) Q Consensus 391 L~~~~ma-rie~~a~~ 405 (405) +==-|-. -.-..+|- T Consensus 279 ~G~s~~~~~~~~~sPt 294 (324) T protein:vir:59 279 RGVKFTENAMAGTTPT 294 (324) T ss_pred eeEEecccccCCCCCC Confidence 0000000 00011233 No 118 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.85 E-value=6.6e-06 Score=48.97 Aligned_cols=297 Identities=12% Similarity=0.063 Sum_probs=137.2 Q ss_pred CCc-cccC--------cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCcc Q lcl|NC_020862. 1 MPH-IYND--------PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVN 71 (405) Q Consensus 1 ~~~-~y~~--------~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l 71 (405) |.. .+++ ..+++.+.+-|+ -+..+-+....+..++.+++ .+.+|...|+ +++-|...-+.+ .-. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~----~~~~~ii~~l~~~~~l~~~~-~~~~~~~~g~-~~~p~~~~~~~a-~~v 185 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQ----NIHSEVIELLRDRTIVRKLG-ARSIPLPNGN-MSLPRLAGGATA-SYT 185 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccch----hHHHHHHHHHhhhchhhhhc-ceeeecCCcc-eEEEEEeCCcce-eee Confidence 110 0111 011111112232 22334444444566777773 2346665665 333333211110 011 Q ss_pred ccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHH Q lcl|NC_020862. 72 DQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHL 151 (405) Q Consensus 72 ~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~ 151 (405) .| |+.......++..|+-+.++++.++.+|+++ +-|++.++.+.| T Consensus 186 ~E----------------------------------g~~~~~~~~~f~~i~~~~~k~~~~v~is~el-l~ds~~~l~~~i 230 (428) T protein:vir:10 186 GE----------------------------------NQDAKVSEARFDDVKLTAKTMIAMVPISNAL-IGRAGFNVEQLV 230 (428) T ss_pred cc----------------------------------CccccccccceeeEEeeeEEEEEeehhhHHH-HhhhhHHHHHHH Confidence 11 2222222346777889999999999999984 456676788887 Q ss_pred HHHHHHHHhhHHHHHHHHHHhccCceE-EecCC-----CccceeeecccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_020862. 152 SREMLRGANEITEDLLQADILASADVK-VFTGA-----ATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTT 225 (405) Q Consensus 152 ~~ell~~~~~~ted~l~~~ilag~~~v-~yag~-----ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ 225 (405) ..++.+..+. .++. .+++|.++- .-.|- .+..+..+. .....+++.+......|........ T Consensus 231 ~~~l~~ai~~-~~d~---~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-- 298 (428) T protein:vir:10 231 LQDILTAISV-REDK---AFMRDDGTGDTPIGMKARATQWNRLLPWA------ADAAVNLDTIDTYLDSIILMSMDGN-- 298 (428) T ss_pred HHHHHHHHHH-HHHH---HHhccCCCCcccccccccccccccccccc------ccccccHHHHHHHHHHHHHhhhccc-- Confidence 7777754443 3332 445664321 11111 111111111 1123455555444433322111100 Q ss_pred eeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 226 IIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 226 ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) .... .-+-++|+.....|+.|+|..+.|-|.+..+ |+|-| +.++.++.|..-.+. T Consensus 299 ----------~~~~-~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~------------g~l~G--~pv~~~~~~p~~~~~ 353 (428) T protein:vir:10 299 ----------SNMI-SSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQ------------GMLKG--YPIQRTSAIPANLGE 353 (428) T ss_pred ----------cccc-cCEEEEcHHHHHHHHHhhccCCceeccCCCC------------Ceeec--eeeEEeccccccccC Confidence 0000 1234679999999999999888888854311 46655 566666644311111 Q ss_pred CCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCC--CCCCCCccchhhhHHHH Q lcl|NC_020862. 306 GATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEA--TADRNDPYGKVGFSSIK 383 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~--tad~~DPlgQrg~~gwK 383 (405) +++ . +.++||+=++-.++..+ .+++.+..=+.- ....--.+-|+..+.|+ T Consensus 354 -----------------~~~---~-~~i~~gd~s~~~i~~~~-------~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R 405 (428) T protein:vir:10 354 -----------------GGK---E-SEIYFADFNDVVIGEDG-------NMKVDFSKEASYIDTDGKLVSAFSRNQSLIR 405 (428) T ss_pred -----------------CCc---c-ceEEEEecceEEEEEec-------ceEEEeecccccccccccccchhhcchhhee Confidence 111 1 23457776666666542 244433321100 00001134456666666 Q ss_pred --HHHHHhhccccceEEEEEecC Q lcl|NC_020862. 384 --FFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 384 --~~~~~~iL~~~~marie~~a~ 404 (405) +.+.+.+.+++-.+++--+.= T Consensus 406 ~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 406 VVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred eeeeeCceeeccceEEEEeccCC Confidence 344455565655555443333 No 119 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.84 E-value=3.3e-06 Score=50.67 Aligned_cols=294 Identities=14% Similarity=0.110 Sum_probs=146.3 Q ss_pred CC-cccc-Cc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MP-HIYN-DP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~-~~y~-~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) |. +.-. .. +.+.+++-+.-+-+..+..+-+....+..++.+++...++. | .+++-++..-+.+. .+.. T Consensus 131 l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~---~-~~~~p~~~~~~~a~-----~~~~ 201 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK---E-NIKYPVLVKKAEAQ-----GHKN 201 (434) T ss_pred hccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC---C-ceEEEEEecCCccc-----ceec Confidence 10 0000 00 01111111111222234455555555677888898876543 3 24443332111111 1000 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .+.|....-...++..|+.+.++++.++.+|+++ +-|+.-++.+.|..++.+ T Consensus 202 ---------------------------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~ 253 (434) T protein:vir:62 202 ---------------------------ERTNNEMPETDIEFDEIELSPTEFDALATVTKKL-LARTGLPIEQIVMDELKK 253 (434) T ss_pred ---------------------------ccccccccccccceeeEEeeheeeEeehhhHHHH-HhcchHHHHHHHHHHHHH Confidence 0111111112236677889999999999999985 446665777777777765 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) ..+. .++ ..+++|.++-...++......++ ....+.+++++|.++...|+....+ T Consensus 254 ~~~~-~~d---~~~l~G~G~~~~~~g~~~~~~~~-----~~~~~~~~~d~l~~l~~~l~~~~~~---------------- 308 (434) T protein:vir:62 254 AYVR-KET---QYMVNGDEANNINDGALAKKAVE-----FKTDEKNLYDALVKMKNTPVKEVRK---------------- 308 (434) T ss_pred HHHH-HHH---HHHhccCCCCccccceeeccccc-----ccccccchhhHHHHHHhhcchhhhc---------------- Confidence 4433 333 44667766443333322211111 1223568899999988888664332 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) .+ +-++|+.....|+.|+|-.+.|-|.|... +. .|--..|-| +++++++.|. .|.+ T Consensus 309 --~a-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-~~-----~g~~~tl~G--~pV~~~~~~~----~~~~--------- 364 (434) T protein:vir:62 309 --KA-RWVLNTAALTKIETMKTDDGFPLLRPFNQ-AE-----GGIGYTLLG--FPVEEEDAID----IPDS--------- 364 (434) T ss_pred --CC-EEEEcHHHHHHHHHhhccCCCEeeccCCC-cc-----CCCCceecc--eeeEEecCcc----CccC--------- Confidence 12 23789999999999999988998887421 11 122234544 5666666442 1110 Q ss_pred cccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHH--HHh-hcccc Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFY--GFI-KLRGE 394 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~--~~~-iL~~~ 394 (405) + +. +.+.||+-+...|.-+. +.+. ++. ..++|-.++.++++++. .++ |++++ T Consensus 365 ------~---~~-~~i~~Gdfs~~~i~~~~------g~~~--i~~-------~~~~~~~~~~v~~~~~~r~Dgk~i~~~~ 419 (434) T protein:vir:62 365 ------P---DT-PVFYFGDFSKFYIQDVI------GSLE--VQK-------LVELFSRTNRVGFRIWNLLDAQLIHSPF 419 (434) T ss_pred ------C---Cc-eEEEEeeccceEEEEee------ceeE--EEe-------ehhhhcccCceEEEEEeeecceeecCcc Confidence 0 12 45667877665554321 1111 221 14555455555544322 334 33355 Q ss_pred ceEEEEEe--cCC Q lcl|NC_020862. 395 RIAVAYSV--IPE 405 (405) Q Consensus 395 ~marie~~--a~~ 405 (405) =.+.+... +|= T Consensus 420 ~~~~~~~~~~~~~ 432 (434) T protein:vir:62 420 EVPVYKYVLKAPT 432 (434) T ss_pred cceEEEEEeccCC Confidence 44444333 122 No 120 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.84 E-value=2.6e-06 Score=51.16 Aligned_cols=272 Identities=11% Similarity=0.061 Sum_probs=140.8 Q ss_pred CCccccCcC-CCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPA-AGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~-~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) +.....+-. .++.+.+-| ..+....+....+..++.+++...+|+.+.++-.. +.-.+... .....| | T Consensus 109 ~~~~~ra~~t~~~gg~liP----~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~E-----~ 177 (421) T protein:vir:13 109 LSEEERDIMSSTNNGAVIP----QEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPV-RAGASVDK-LANLAK-----D 177 (421) T ss_pred hhHHHhhccccCCcceecc----hhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEE-eecCCccc-eeeccc-----c Confidence 222222211 111111223 23344444444456788889999888877653222 11111110 011112 1 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) .. .+...+++..|+-++++++.++.+|+++ +-|+..++...|..++.+.. T Consensus 178 ~~-----------------------------~~~s~~~f~~i~~~~~k~~~~v~iS~el-l~ds~~~l~~~i~~~la~~~ 227 (421) T protein:vir:13 178 TE-----------------------------LVKAMLKTQPMAYDIDDYGLLAPIDNSL-LEDSEINFLEFVNEEFAEFA 227 (421) T ss_pred cc-----------------------------ccccccceeEEEeeeeeeEeehhhhHHH-HhhhHHHHHHHHHHHHHHHH Confidence 11 1112236677889999999999999985 45666678888777776443 Q ss_pred hhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) ..+++......+.|.. + ..+..++++|.++...|+....+. T Consensus 228 -~~~~~~~i~~~~~g~~------------------~---~~~~~~~d~i~~~~~~l~~~~~~~----------------- 268 (421) T protein:vir:13 228 -VNTENAEIVKQAKAVL------------------A---EETINDYAGLVKTINSLVPNARKR----------------- 268 (421) T ss_pred -HHHhhhhHhhhhhhcc------------------c---cccccchHHHHHHHHHhhhhhcCC----------------- Confidence 3344332222222110 0 113467899999999887654331 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) =+.++|+.....|+.|+|-.+.|-|.++. .+.-++|-| +..++++.|- .+.+ T Consensus 269 --a~~v~n~~~~~~l~~lkd~~G~~i~~~~~---------~~~~~tl~G--~pV~~~~~~~--~~~~------------- 320 (421) T protein:vir:13 269 --AIIVTNSDGRAYLDGLMDKQGRPLLKELS---------DGGDLVFKG--RPVIELEESI--FDVG------------- 320 (421) T ss_pred --CEEEEcHHHHHHHHHhhcCCCceeecCcC---------CCCCceecc--eeeEEecccc--ccCC------------- Confidence 13478999999999999999999887752 333456655 4566665443 1111 Q ss_pred cccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhccccce Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERI 396 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~m 396 (405) +.+ .+++|+-+ +-.+..+ ..+++-+. .+++-+++...++ +.+.+.+.+++.. T Consensus 321 --------~~~-~~~~gd~~~~~~~~~~-------~~~~v~~~---------~~~~f~~~~~~~r~~~r~d~~~~~~~a~ 375 (421) T protein:vir:13 321 --------DET-KFIVSDFKTLIKFMDR-------KQYLIDQS---------KEAGYTKNETIARIIERFDVNSPLDKSS 375 (421) T ss_pred --------Cce-EEEEEeccccEEEEEe-------cceEEEee---------cccccccCeeEEEEEeeecceeecchhh Confidence 122 34566533 2333332 11333332 2334455555554 3344455555544 Q ss_pred EEEEEecCC Q lcl|NC_020862. 397 AVAYSVIPE 405 (405) Q Consensus 397 arie~~a~~ 405 (405) ..+.+..+- T Consensus 376 ~~~~~~~~~ 384 (421) T protein:vir:13 376 DAEKIRKFG 384 (421) T ss_pred heeeecccc Confidence 333333222 No 121 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.81 E-value=6.1e-06 Score=49.17 Aligned_cols=280 Identities=13% Similarity=0.048 Sum_probs=142.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +...=+. ..+.+++-+.-.-+-.|....+....+..++.+++...+||.+.++-..... -...... T Consensus 104 ~~~~~~~-~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~~~~~---------- 169 (394) T protein:vir:10 104 GKVIDNA-AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR---ATDRFSS---------- 169 (394) T ss_pred chhhhhh-hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec---CCCcccc---------- Confidence 1111111 1111111111122224455444445566788999999999887654332220 0000001 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ..|.|-.+.....++..|+-++++++.++.+|+++ +-|++.++...+..+|.+.-+ T Consensus 170 -----------------------~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~ 225 (394) T protein:vir:10 170 -----------------------VAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEA-IADSAVDLTSLVGQSINEKSV 225 (394) T ss_pred -----------------------ccccccccccccccceeEEeeeeeeEeeehhHHHH-HhhhhHHHHHHHHHHHHHHHH Confidence 11222222223346778889999999999999985 445666788877777664433 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) .+++ ..+++|.+. | +... .-+..++++|..+.. .|+.... T Consensus 226 -~~~~---~~il~g~g~----~--~~~~----------~~~~~~~d~l~~~~~~~~~~~~~------------------- 266 (394) T protein:vir:10 226 -NTYN---AMIAPVLQS----F--TAKA----------TTTDTLVDSLKHILNVDLDPAYS------------------- 266 (394) T ss_pred -HHHH---HHHhhcccc----c--cccc----------ccccccHHHHHHHHHhhhhhhcc------------------- Confidence 3333 345555432 1 1100 112467777776543 3332210 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + +-++|+.+..-|+.|+|-.+.|-|.|.-. +. .-.+.-+++-|. +++.++... . +. T Consensus 267 a--~~vmn~~~~~~l~~lkd~~G~~i~~~~~~--~~--~~~~~~~~L~G~--PV~~~~~~~--~--~~------------ 322 (394) T protein:vir:10 267 R--ALVVTQSLFNTLDTLKDKNGRYLLHDASD--SI--TDGTAKGTVLGV--PVYVVGDAL--L--GS------------ 322 (394) T ss_pred C--EEEecHHHHHHHHHhhccCCCeeeecccc--cc--ccCCcccccccc--eeEEecccc--c--CC------------ Confidence 1 35799999999999999888888876421 11 112334567664 444333221 1 10 Q ss_pred cccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) +.+. ..++||+-+ |-.+..+ ..+++-+.+ +....+++..+ +.+.+.+.++.=++. T Consensus 323 ---~~~~----~~i~~gd~s~~~~~~~~-------~~~~v~~~~---------~~~~~~~~~~~-~r~d~~~~~~~ai~~ 378 (394) T protein:vir:10 323 ---AAGD----QKAFVGDLKRGVLFADR-------QQVTLAWED---------SKIYGRYLGAA-FRFGVKQADSNAGYF 378 (394) T ss_pred ---CCCc----eEEEEeeccccEEEEee-------cceEEEEec---------ccccceeEEEE-EEeccEEeccccEEE Confidence 0111 245677533 2222221 113333322 11222333333 467888889998888 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) |+...+- T Consensus 379 ~~~~~~~ 385 (394) T protein:vir:10 379 VTNTDAA 385 (394) T ss_pred EEeeccc Confidence 8876655 No 122 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.81 E-value=5.3e-06 Score=49.52 Aligned_cols=281 Identities=15% Similarity=0.096 Sum_probs=147.9 Q ss_pred CC--ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MP--HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~--~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +. ...+....++++.-+.-+-+.+ .+.-+....+...+.+++...+++.+ .+++.+........... +. T Consensus 104 ~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~~~a~~v-----~E 174 (390) T protein:vir:10 104 MNIKAALNTASTDAAGSAGALTTPNR-LPGFITQPDARLTVRDLIGSGRTDSA---LIEYVQETGFVNNAAIV-----AE 174 (390) T ss_pred hHHHHHHHhhhcccccccccccchhH-HHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecCCcceeee-----cC Confidence 11 1111111111111111111112 34445555556677788888888754 34444332221111111 22 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |+++ .....++..|+.++++++.++.+|++++. |+. ++...+..++.+. T Consensus 175 g~~~-----------------------------~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~i~~~l~~~ 223 (390) T protein:vir:10 175 GALK-----------------------------PESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QLASYMNNRLIRG 223 (390) T ss_pred Cccc-----------------------------cccccceeEEEEeeEEEEEeehhhHHHHH-hHH-HHHHHHHHHHHHH Confidence 2222 22234677789999999999999998654 565 6777777777654 Q ss_pred HhhHHHHHHHHHHhccCceEE-ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKV-FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~-yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .+.. ++ ..+++|.+.-. -.|--+. +..... ........+++++..+...|+....+. T Consensus 224 ~~~~-~~---~~il~G~G~~~~p~Gi~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~l~~~~~~~--------------- 281 (390) T protein:vir:10 224 LKVK-ED---AEILRGTGANDGLLGLIPQ-ATTYAA--PTTIAGATRVDQLRLAMLQASLAEYPA--------------- 281 (390) T ss_pred HHHH-HH---HHHhhcCCCCccccccccc-cccccc--cccccccchHHHHHHHHHhhccccCCC--------------- Confidence 4433 22 34556543211 1111100 000000 001123456788888888887766541 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQ 317 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~ 317 (405) -..++||.....|+.|+|..+.|-|.+. ..+.-+.|-| +.++.++.|- +| T Consensus 282 ----~~~v~n~~~~~~L~~lkd~~g~~l~~~~---------~~~~~~~l~G--~pv~~~~~~p----~~----------- 331 (390) T protein:vir:10 282 ----SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWG--LPVVATQAMA----PG----------- 331 (390) T ss_pred ----CEEEEcHHHHHHHHHhhcCCCceeecCC---------cCcCCceecc--eeeEEcCCCC----CC----------- Confidence 1346899999999999988887777543 1233455645 5677776442 11 Q ss_pred cccccCCcceeeeEEEEEccccce-eecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcccc Q lcl|NC_020862. 318 VSDVAGTDKYDIAPLLVVGDQAFA-TIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLRGE 394 (405) Q Consensus 318 ~~~~~g~~~~DVYp~lV~G~~Afg-~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~ 394 (405) .+++|.-+.+ .+..+ ..+.+-+.. .+.+-+++.+.++ .++.+.+++++ T Consensus 332 --------------~~~~gdf~~~~~~~~~-------~~~~i~~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~~ 382 (390) T protein:vir:10 332 --------------EFLVGAFDLAAQIFDQ-------WDARVEIGY--------VNDDFQRNMVTVLAEERLALVVYRPE 382 (390) T ss_pred --------------cEEEEeccceEEEEEe-------cceEEEEee--------cccccccCcEEEEEEEeeccEEeccc Confidence 1345654322 12221 113332221 1123456667766 68899999999 Q ss_pred ceEEEEEe Q lcl|NC_020862. 395 RIAVAYSV 402 (405) Q Consensus 395 ~marie~~ 402 (405) -++.+..+ T Consensus 383 a~~~~~~a 390 (390) T protein:vir:10 383 ALISGSFA 390 (390) T ss_pred cEEEEEeC Confidence 99999999 No 123 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.76 E-value=2.1e-05 Score=46.22 Aligned_cols=310 Identities=10% Similarity=0.015 Sum_probs=157.1 Q ss_pred ccccCcCCCcccccc-c--ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 3 HIYNDPAAGDASTVG-P--QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 3 ~~y~~~~~t~~~~v~-~--qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |-|.|+...--.+-+ . .+...-|..+.+..-+..-++..+=.++.+ ..||+.+|.|---.-. .-++.|-.+.| T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti--~~GkS~qf~~iG~~~a--~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGETEL--QVLAPGQSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEEEEEeeeEE--eeeccccccCC Confidence 444454322111111 1 122122344444444444445555555654 3788888876422111 11222222233 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeee-eeEEecchhhhhhhccc-hHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYG-FFMEYTEDSLMFDTDSD-LYGHLSREMLR 157 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG-~~~e~Td~~~~~d~d~~-l~~~~~~ell~ 157 (405) +.+.+. | . .|+++=.-|- .| -++..+...|=| +-.+++.|++. T Consensus 77 ~~~~~~--------------k----------------~--~ItID~lL~a~~~---V~diDeaq~~yD~vRse~s~e~G~ 121 (402) T protein:vir:97 77 TPTQAD--------------K----------------N--QLVIDTTVIARNT---VAHIHDVQGDIDSLKPKLAMNQAK 121 (402) T ss_pred CCcccc--------------c----------------E--EEEeCceeechhh---hhhHHHHHhcccchhHHHHHHHHH Confidence 222111 1 1 1222211122 22 122223334444 45677777776 Q ss_pred HHhhHHHHHHHHHHhc-cCce---EEecC-CC--ccceeeecccccccCCceecH----HHHHHHHHHHHhccCccccce Q lcl|NC_020862. 158 GANEITEDLLQADILA-SADV---KVFTG-AA--TSMVTMTGEAADAEDDGLITL----KDLKRLSITLTDNYTPKKTTI 226 (405) Q Consensus 158 ~~~~~ted~l~~~ila-g~~~---v~yag-~a--ts~~~~t~~~~~~~~n~~it~----~~lr~~~~~Lk~nrApk~T~i 226 (405) .-+......+.+.+++ +... ....+ +. .+..+++...+ +-..+. +-++.+...|++...|. T Consensus 122 ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~----~a~~~~~~l~~ai~~a~~~LdEkdVP~---- 193 (402) T protein:vir:97 122 QLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTES----EALANPQYVMAAVEYALEQQLEQEVDI---- 193 (402) T ss_pred HHHHHHHHHHHHHHHHhhccccccccccCcccccccccccccccc----hhhcCHHHHHHHHHHHHHHHHhcCCCc---- Confidence 5555443334444433 3211 11111 00 01111111111 112344 44456778888888873 Q ss_pred eccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcC--CcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 227 IKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYA--DAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 227 i~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya--~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) .-+++++.|..-.-|.+ ++.|++. +|+ ....+.+|+|++|.| |++++++++---++ T Consensus 194 -------------~dRv~vv~P~~y~~Ll~------~~rl~n~-d~~~~~~g~~~~G~v~~v~G--v~Vv~SnnlP~~a~ 251 (402) T protein:vir:97 194 -------------SDVAIMMPWKFFNALRD------ADRIVDK-TYTISQSGATINGFVLSSYN--CPVIPSNRFPTFAQ 251 (402) T ss_pred -------------cccEEEeChHHHHHHhh------cccccch-hhccccCCccccceeEEEec--eEEEecCccccccc Confidence 23899999999888865 5778888 664 555688999999966 89999998753222 Q ss_pred CCCcccCCCcccccccccCCcceee------eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhh Q lcl|NC_020862. 305 AGATATAANRGYQVSDVAGTDKYDI------APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVG 378 (405) Q Consensus 305 aGa~~~~t~~~~~~~~~~g~~~~DV------Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg 378 (405) .+.+.. .+..-.+++||| --.++|=++|-+++-+....++ ---|+=-|.. T Consensus 252 ~it~~~-------ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~-----------------~~~d~r~~~~ 307 (402) T protein:vir:97 252 DQAHHL-------LSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGD-----------------IFYEKKEKTY 307 (402) T ss_pred cccccc-------cccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccc-----------------hhhchhHHHH Confidence 221111 111111233331 1345666677777666432211 1246777888 Q ss_pred hHHHHHHHHHhhccccceEEEEEec-------CC Q lcl|NC_020862. 379 FSSIKFFYGFIKLRGERIAVAYSVI-------PE 405 (405) Q Consensus 379 ~~gwK~~~~~~iL~~~~marie~~a-------~~ 405 (405) |+=-|+.|+...+|++.-+++++.= |+ T Consensus 308 ~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~ 341 (402) T protein:vir:97 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) T ss_pred HHHHHHHhCCcccCccceEEEEEecccccccCCc Confidence 9999999999999999999998765 44 No 124 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.63 E-value=3.5e-05 Score=45.03 Aligned_cols=297 Identities=12% Similarity=0.040 Sum_probs=144.7 Q ss_pred CCccccCcCCCcc----cccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDPAAGDA----STVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~~~t~~----~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) +...-.+.-.+.+ ..+-|+ -+.++.+....+..++.++. .+.+|-..|+ +++-+...-+.+ +.. T Consensus 124 ~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~l~~~~~i~~~~-~~~~~~~~~~-~~~p~~~~~~~a------~~v 191 (435) T protein:vir:14 124 FGEEVAMSLNTLSPGAGGVLVPE----NLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-ITIPRLKGGAIV------GYI 191 (435) T ss_pred hhhhhhhhcccCCcCCCccccch----hHHHHHHHHHhhhchhhhhc-ceeeecCCCc-eEEEEEeCCcce------eee Confidence 1111111111111 113333 23344444445566666662 3455655553 444333221111 111 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhh-ccchHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDT-DSDLYGHLSREM 155 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~-d~~l~~~~~~el 155 (405) +.|..+ .....++..|+.+.++++.++.+|++++.... +.+|.+.|..++ T Consensus 192 ~E~~~~-----------------------------~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l 242 (435) T protein:vir:14 192 GADTDI-----------------------------PTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDL 242 (435) T ss_pred ccCccc-----------------------------cccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHH Confidence 222221 11123566778999999999999998654432 334666666666 Q ss_pred HHHHhhHHHHHHHHHHhccCceE-EecCCC----ccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVK-VFTGAA----TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGS 230 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v-~yag~a----ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs 230 (405) .+.-+.. + -..+++|.++- .-.|-. ...+....... .......++.+++..|+.+.+-. T Consensus 243 ~~ai~~~-~---d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~~~~~~-------- 306 (435) T protein:vir:14 243 TAAIGAR-E---DKAFIRDDGTANTPKGLRFWALPSNVITASDAS----TLQKIETDLGKVILALENADANL-------- 306 (435) T ss_pred HHHHHHH-H---HHHhhccCCCCccccceeecccccceecccccc----chhhHHHHHHHHHHHhhhccccc-------- Confidence 5433322 2 23445654321 111110 01111111111 11234567778877777664421 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCccc Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATAT 310 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~ 310 (405) . . -+-++|+.....|+.|+|..+.|-|... . -|+|.| +.+++++.|-.-.+.+ T Consensus 307 ---~----~--~~~v~n~~~~~~L~~lkd~~G~~l~~~~---~---------~g~l~G--~Pv~~~~~~p~~~~~~---- 359 (435) T protein:vir:14 307 ---T----Q--PGWIMAPRTFRFLEGLRDGNGNKVYPEL---A---------NGMLKG--YPVGKTTQVPINLGET---- 359 (435) T ss_pred ---c----C--CEEEEcHHHHHHHHHhhccCCceeccCC---C---------CCeeec--ceeEeeccccccccCC---- Confidence 0 1 2347899999999999998888877322 1 145655 5777776543111110 Q ss_pred CCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCC--CCCCCCccchhhhHHHH--HHH Q lcl|NC_020862. 311 AANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEA--TADRNDPYGKVGFSSIK--FFY 386 (405) Q Consensus 311 ~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~--tad~~DPlgQrg~~gwK--~~~ 386 (405) +. .. .+++|+=+...++.. ..+++.+..-+.- ....--.|-|++.+.++ +++ T Consensus 360 -------------~~---~~-~i~~gd~s~~~i~~~-------~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~ 415 (435) T protein:vir:14 360 -------------GK---ES-EIYFTDFGDVFIGEE-------ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKN 415 (435) T ss_pred -------------Cc---cc-eEEEeecccEEEEEe-------cccEEEEeccccccccccchhhhhhcChhheeeeeee Confidence 11 11 255676555555544 2355555443210 00112255677777777 667 Q ss_pred HHhhccccceEEEEEecCC Q lcl|NC_020862. 387 GFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 387 ~~~iL~~~~marie~~a~~ 405 (405) .+.+.+++-++.+.-+.== T Consensus 416 d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 416 DFGPRHVESIAVLAGVAWG 434 (435) T ss_pred CceeecccceEEEecCCCC Confidence 7888888877776543322 No 125 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.62 E-value=7.9e-06 Score=48.56 Aligned_cols=281 Identities=12% Similarity=0.110 Sum_probs=137.9 Q ss_pred CCccc---cCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIY---NDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y---~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) ||+-. +.-..++.++-+.-+-+-+..++-+....+..++.+++ .+.+|-..|+ +++=+. +.|..+ T Consensus 347 ~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~-~~~~~~~~g~-~~ip~~----------~~~~~a 414 (632) T protein:vir:96 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMG-ARMLPGLVGD-VDIPKK----------TSGANF 414 (632) T ss_pred hhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhc-ceEeecCCcc-eEEEEE----------eCCcee Confidence 22211 01111111211111111111222233334566777773 3456666664 333211 111111 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) . ....|+.+....+++..++.+.++++.++.+|+++++ |++.++.+.|..+|.. T Consensus 415 --~-----------------------wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~-ds~~~~~~~i~~~l~~ 468 (632) T protein:vir:96 415 --Y-----------------------WIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRK-QSSIHVENLIREDLIE 468 (632) T ss_pred --E-----------------------eecCCccccccccceeeEEeeeeEEEEehhhHHHHHh-ccchHHHHHHHHHHHH Confidence 0 1112233333445777889999999999999998544 4555787877776664 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .- ...+|. .+++|.++- +.-.+-.+.++..+.......+++++|.++...|...++.. T Consensus 469 a~-~~~~d~---a~l~G~G~~---~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~--------------- 526 (632) T protein:vir:96 469 GI-GVALDL---AMLTGTGLA---NDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADA--------------- 526 (632) T ss_pred HH-HHHHHH---HhhcccCCC---CccceeeecccccceecccccCCHHHHHHHHHHHhhccccc--------------- Confidence 33 333433 245554310 00001111111111111123478889988888877665421 Q ss_pred ccceEEEEEcccchHHHHH--HhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITE--LVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~--l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) -. -+-++|+.....++. +.|..+.|-| .. |.+.| ++++.+..|. + T Consensus 527 ~~--~~~~~~~~~~~~l~~~~l~d~~G~~i~-------------~~--~~l~G--~pv~~s~~ip----~---------- 573 (632) T protein:vir:96 527 GR--LAYLTSVTQRGAAKKAQVFDNTGERIW-------------QN--NEVNG--YRAEASNQIP----A---------- 573 (632) T ss_pred Cc--cEEEEchhHHHHHHHHhccCCCCceee-------------cC--Ceecc--cceEeccccc----c---------- Confidence 01 223578877777765 3344444433 22 45644 4666654432 0 Q ss_pred cccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccc Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGER 395 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ 395 (405) + .+++|+-+...++..| .+.+.+..- ..---|+..|..| +.+.+.+.+++. T Consensus 574 ---------~------~~~~gd~s~~~i~~~~-------~~~i~~~~~------~~~~~~~v~~~~~-~~~d~~v~~~~a 624 (632) T protein:vir:96 574 ---------D------TWIFGDWSQIVIAMWG-------VLDLKVDPY------TKAASDGLVLRVF-QDVDAGVRRKEA 624 (632) T ss_pred ---------C------cEEEeecceEEEEEec-------ceEEEEccc------cccccCceEEEEE-eecCceeechhh Confidence 0 1567776665555542 355555431 1111244444433 457888999999 Q ss_pred eEEEEEec Q lcl|NC_020862. 396 IAVAYSVI 403 (405) Q Consensus 396 marie~~a 403 (405) ++.++..| T Consensus 625 f~~~k~~A 632 (632) T protein:vir:96 625 FCIAKKGA 632 (632) T ss_pred hhheeecC Confidence 99999999 No 126 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.58 E-value=5.3e-06 Score=49.49 Aligned_cols=269 Identities=14% Similarity=0.112 Sum_probs=139.5 Q ss_pred CCccccCc-----CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MPHIYNDP-----AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~~~y~~~-----~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) ..+...+. ...+.+.+-|+ -|....+....+.-.+.+++...+++.+.++-... ....+....+ T Consensus 125 ~~~~~~~~~~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~ 193 (400) T protein:vir:38 125 VPTDASDAVNAGVKAADAASTIPE----TISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTV-------ANATTKMVTV 193 (400) T ss_pred hhHHHHHHHhhcccccCCcccccH----HHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEE-------ecCCCccccc Confidence 11111111 11111122232 33444333344556788899999998775532221 1111101011 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) +.| |-.+.-.-.++..|+-+.++++.++.+|++ ++-|++.++...+..++ T Consensus 194 -~E~----------------------------~~~~~~~~~~f~~i~~~~~k~~~~~~is~e-ll~ds~~~~~~~i~~~l 243 (400) T protein:vir:38 194 -AEL----------------------------EKNPAMAKPEFKPVNWSVETYRQALPVSQE-SIDDSAIDLVGLIAQNG 243 (400) T ss_pred -ccc----------------------------ccccccccccceeeEeehhheeeehhhHHH-HHhhhHHHHHHHHHHHH Confidence 111 111111123566778899999999999998 45566667888877776 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHH-HHhccCccccceeccccccC Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSIT-LTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~-Lk~nrApk~T~ii~gs~~~g 234 (405) .+... .+++ ..++.|.+. . .+.+..++++|..+... ++... T Consensus 244 ~~~~~-~~~~---~~i~~~~~~------~-------------~~~~~~~~~~~~~~~~~~~~~~~--------------- 285 (400) T protein:vir:38 244 QQIKV-NTTN---GAVATLLKG------F-------------TAKTISSVDDLKHINNVDLDPAY--------------- 285 (400) T ss_pred HHHHH-HHHH---Hhhhhcccc------c-------------cccccccHHHHHHHHHhhhhhhh--------------- Confidence 64333 3333 334444321 0 11245677777766442 22111 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) . -+-++||.....|+.|+|-.+.|-|.| .+. .+--+.+-| +.+++++.+ ++.++| T Consensus 286 ----~--a~~v~~~~~~~~l~~lkd~~G~~i~~~--~~~------~~~~~~l~G--~pv~~~~~~-~~~~~g-------- 340 (400) T protein:vir:38 286 ----S--RVIIASQSFYNFLDTVKDGNGRYLLQD--SIL------TPSGKSVLG--MPIAVVSDD-TLGAAG-------- 340 (400) T ss_pred ----C--cEEEEcHHHHHHHHHhhccCCCeeeec--CcC------CCCcccccc--ceeEEeccc-ccCCCC-------- Confidence 1 234789999999999999888888865 222 222356766 455555533 322211 Q ss_pred ccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhcccc Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGE 394 (405) Q Consensus 315 ~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~ 394 (405) |+ .++||+-+-+.+.+.. ..+.+-+. .+.+.+.++.++ +.+.+.+++++ T Consensus 341 -------------~~--~~~~gd~s~~~~~~~~------~~~~~~~~---------~~~~~~~~~~~~-~r~d~~~~~~~ 389 (400) T protein:vir:38 341 -------------EA--HAFLGDIKRAILFANR------ADFMVRWV---------DDQIYGQFLQAG-MRFGVSVADEK 389 (400) T ss_pred -------------ce--EEEEEeccccEEEEee------cceEEEEe---------cccccceeEEEE-EEeccEEeccc Confidence 12 3567774422222211 11222221 122333454433 78899999999 Q ss_pred ceEEEEEecCC Q lcl|NC_020862. 395 RIAVAYSVIPE 405 (405) Q Consensus 395 ~marie~~a~~ 405 (405) -++.|+. +|+ T Consensus 390 a~~~l~~-~~~ 399 (400) T protein:vir:38 390 AGYFLTY-TPK 399 (400) T ss_pred ceEEEEe-ecC Confidence 9999887 555 No 127 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.58 E-value=1.2e-05 Score=47.62 Aligned_cols=280 Identities=13% Similarity=0.028 Sum_probs=141.4 Q ss_pred CCccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) ....++.. ..++++.-+. .-+.-+.+..+....+...+.+++.+.+++ |++ ++-+...-+ .-+..+.| T Consensus 130 ~~~~~~~~~~~~~~~~gg~-~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~---g~~-~ip~~~~~~------~a~~v~E~ 198 (425) T protein:vir:95 130 VVEFYEKFRNLRAVAGGEL-TIPEVVVNRIMDIMGDYTTLYPLVDKIRVK---GTT-RILVDTDTS------PATWIEQS 198 (425) T ss_pred HHHHHHHHHhhcccccCce-eccHHHHHHHHHHHHhhhhHHHhhceeecC---cee-EEEEecCCc------cccccccc Confidence 11111110 0111112111 112222333344444556677777777774 432 322221111 11222222 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) .++... .. .++.+|+-+.++++.++.+|++ ++-|+..++...+..++.+.. T Consensus 199 ~~~~~~--------------~~--------------~~f~~i~l~~~k~~~~~~iS~e-ll~ds~~~l~~~i~~~l~~~i 249 (425) T protein:vir:95 199 GALPTG--------------DV--------------GTIASIDFDGFKVGKVTFVDNY-LLQDSIINLDDYVTKKIARAI 249 (425) T ss_pred cccccc--------------cc--------------cccceeeeeheeeeeeehhhHH-HHhccHHHHHHHHHHHHHHHH Confidence 222111 11 2455678899999999999998 555566577777766666433 Q ss_pred hhHHHHHHHHHHhccCceE------EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 160 NEITEDLLQADILASADVK------VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v------~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) + ..+| ..+++|.+.- .+++.. ....++. ..+..++++|.++...+...... T Consensus 250 ~-~~~d---~~il~G~G~~~~~p~Gil~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~------------ 306 (425) T protein:vir:95 250 A-KALD---LAIVKGTGAANKQPLGIIPSLP-PENQVTV------EADNNLLKNLVKQIGLIDTGDDS------------ 306 (425) T ss_pred H-HHHH---HHhhccCCCCccccceeecccc-ccccccc------ccccchHHHHHHHHHhhhhhccc------------ Confidence 3 3333 3566765421 122211 1111111 12457889998887766544321 Q ss_pred CcccccceEEEEEcccc-hH---HHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcc Q lcl|NC_020862. 234 DTKTISASRIAYIGSEL-EI---YITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATA 309 (405) Q Consensus 234 gT~~I~~syv~~~h~dl-~~---dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~ 309 (405) ...+ +.++|+.. .. .++-++|..+.+-|.+ -.++.+.+-| .++|.++.|- + T Consensus 307 ----~~~~-~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~----------~~~~~~~l~G--~pvv~~~~~~----~---- 361 (425) T protein:vir:95 307 ----VGEI-VAVMKRSTYYNRLVEFSIQVDSNGNVVGKL----------PNLRTPDLLG--LRVVFNNFLD----D---- 361 (425) T ss_pred ----cCce-EEEEeChHHHHHHHHHHhhcCCCCceeecc----------CCCCCccccc--eeeEEcCcCC----C---- Confidence 1222 34666653 33 3444455444443331 1334455644 4666665442 0 Q ss_pred cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHH Q lcl|NC_020862. 310 TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYG 387 (405) Q Consensus 310 ~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~ 387 (405) + .++||.-++-.++.. +.+++-+. .|.+-.++..+++ .++. T Consensus 362 ---------------~------~i~~Gd~~~~~~~~~-------~~~~i~~~---------~~~~f~~~~~~~~~~~r~d 404 (425) T protein:vir:95 362 ---------------D------TVLFGEFEQYTLVER-------ENITIDSS---------THVKFTEDQTAFRGKGRFD 404 (425) T ss_pred ---------------c------cEEEEecccEEEEee-------cceEEEee---------cccccccCceEEEEEEeeC Confidence 0 156777665555543 22333332 3456666777777 4689 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +.+.+++=++++++..|+ T Consensus 405 ~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 405 GKPVKPEAFVLVTITDPV 422 (425) T ss_pred cEeecccceEEEEecCcC Confidence 999999999999999999 No 128 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.49 E-value=2e-05 Score=46.33 Aligned_cols=293 Identities=12% Similarity=0.108 Sum_probs=143.8 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccC Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAG 84 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~ 84 (405) -+ .+++++.+.-+-+.+ .++-+..+.+..++.+++...+|+.+ ++++-+...-+. ..-..||-. ..+ T Consensus 1 ma---~~t~~~gg~liP~~~-~~~Ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~-a~wv~E~~~-----~~~ 67 (305) T protein:vir:25 1 MA---DISRAEVASLIQEAY-SDTLLAAAKQGSTVLSAFQNVNMGTK---TTHLPVLATLPE-ADWVGESAT-----DPK 67 (305) T ss_pred CC---CccCCccceecCHHH-HHHHHHHHHhhchhhhhcceeeccCC---cEEEEEEeCCcc-eEEeecccc-----ccc Confidence 22 333333343343334 56666677777888999998888643 344433221111 111233322 111 Q ss_pred CcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHH Q lcl|NC_020862. 85 GNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITE 164 (405) Q Consensus 85 gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~te 164 (405) +. +..+ ..++..|+-+.++++.++.+|++++ -|+..++...+..++.+.-+.-.+ T Consensus 68 ~~----------~~~s--------------~~~f~~i~~~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~~~a~~~d 122 (305) T protein:vir:25 68 GV----------KPTS--------------KVTWANRTLVAEEIAVIIPVHENVI-DDATVAVLTEVAELGGQAIGKKLD 122 (305) T ss_pred cc----------cccc--------------ccceeeEEeeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHh Confidence 10 1111 1356677899999999999999854 566667777766666654443333 Q ss_pred HHHHHHHhccCceEEec---CCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccce Q lcl|NC_020862. 165 DLLQADILASADVKVFT---GAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISAS 241 (405) Q Consensus 165 d~l~~~ilag~~~v~ya---g~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~s 241 (405) . .+++|.+.-.-. +..+... .............+..++......+....... + . ... T Consensus 123 ~----a~~~G~g~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~------~-~~~- 182 (305) T protein:vir:25 123 Q----AVIFGTDKPASWVSPALIPAAV--TAGQAVEVVGGVANESDIVGATNRAAKAVASA------G------W-APD- 182 (305) T ss_pred h----hheeccCCCCCccccccccccc--cccccccccccchhhhHHHHHHHHHHHhhhhc------c------c-ccc- Confidence 2 334443311000 0000000 00011111223455555555554444333320 0 0 001 Q ss_pred EEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccc Q lcl|NC_020862. 242 RIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDV 321 (405) Q Consensus 242 yv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~ 321 (405) ..++|+.....|+.|+|--+.|-|.| +.+.| +..+.++. .++. T Consensus 183 -~~v~~~~~~~~l~~lkd~~G~~i~~~---------------~~l~G--~Pv~~~~~-~~~~------------------ 225 (305) T protein:vir:25 183 -TLLSSLALRYEVANIRDANGNPVFRD---------------DSFAG--FRTFFNRN-GAWD------------------ 225 (305) T ss_pred -eeEecHHHHHHHHHhhccCCceeecC---------------Ccccc--cceEEcCc-cCCC------------------ Confidence 25779999999999988666666643 34544 44555443 2210 Q ss_pred cCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEec-CCCCCCCCCCccchhhhHHHH--HHHHHhhccccceEE Q lcl|NC_020862. 322 AGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKK-PGEATADRNDPYGKVGFSSIK--FFYGFIKLRGERIAV 398 (405) Q Consensus 322 ~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~-pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~~~~mar 398 (405) .++. .+++|+=+...++.++ .+++-+.. -.....+..--+-|+....|| ..+++.++|++-++. T Consensus 226 --~~~~----~~~~gd~s~~~i~~~~-------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~ 292 (305) T protein:vir:25 226 --ADAA----IEVIADSSRVKIGVRQ-------DITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQG 292 (305) T ss_pred --CCcc----EEEEEecceEEEEEec-------CeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEE Confidence 0111 2456765555555541 23332211 111111122224577778887 347888999887777 Q ss_pred EEEe-----cCC Q lcl|NC_020862. 399 AYSV-----IPE 405 (405) Q Consensus 399 ie~~-----a~~ 405 (405) +..+ .|= T Consensus 293 ~~~~~~~~~~pa 304 (305) T protein:vir:25 293 ANKTPVAVVAPA 304 (305) T ss_pred EccccccccCCC Confidence 6542 222 No 129 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.46 E-value=4.7e-05 Score=44.31 Aligned_cols=290 Identities=16% Similarity=0.100 Sum_probs=133.8 Q ss_pred CCccccCc------C--CCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccc Q lcl|NC_020862. 1 MPHIYNDP------A--AGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVND 72 (405) Q Consensus 1 ~~~~y~~~------~--~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~ 72 (405) -...|.+. . +.+.+.+-|+ .+ ..+.+....+..++.+++ .+.+|-..|+ +++-+...-+.+ .... T Consensus 53 a~~~~~~~~~~~a~~~~~~~Gg~lvP~---~~-~~~ii~~l~~~s~l~~lg-~~~v~~~~g~-~~~p~~t~~~~a-~wv~ 125 (366) T protein:vir:57 53 AATELGDTGLSMAISTAAGSGGALIPQ---NM-QNEVIELLRDRTVVRILG-ARSIPLPNGN-LSMPRLSGGATA-GYVG 125 (366) T ss_pred HHHhhcchhhhhhccccccCCccccch---hH-HHHHHHHHhhhcchhhhc-eeeeecCCCc-eEEEEEeCCcce-eeec Confidence 00111111 0 1111111232 22 333333444556777773 3345555564 444443321111 1112 Q ss_pred cCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 73 QGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 73 eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) || . .+.....++..|+.+.++++.++.+|++++ -|+..++...+. T Consensus 126 E~-----~-----------------------------~~~~s~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~~~~~i~ 170 (366) T protein:vir:57 126 EG-----K-----------------------------DVVATGATFDDVKLSAKTMIALVPVSNQLI-GRAGFNVEQLLL 170 (366) T ss_pred cC-----c-----------------------------cccccccceeEEEEeeEEEEEeehhhHHHH-hhhhHHHHHHHH Confidence 22 1 111122356677899999999999999855 456667777777 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCce-------EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADV-------KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTT 225 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~-------v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ 225 (405) .++.+..+ ..+| ..++.|.++ ..+++..+..+..++ +..+..++......|....... .. T Consensus 171 ~~l~~a~~-~~~d---~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~--------t~~~~~~~~~~~~~~~~~~~~~-~~ 237 (366) T protein:vir:57 171 GDILSAIA-TRED---KAFLRDDGTGDTPKGMKAVATAANRLVAWTG--------TAINLTTIDEYLDSLILKHMDS-NS 237 (366) T ss_pred HHHHHHHH-HHHH---HHhhccCCCCccccceeeccccccceeeccc--------cccchhhHHHHHHHHHHhhhcc-cc Confidence 76665443 3333 345555432 233333222222111 2234443333332222211110 00 Q ss_pred eeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 226 IIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 226 ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) . .. .-.-++|+.....|+.|+|..+.|-|.+.. -|.+.| +.+++++.|---.+. T Consensus 238 ~-----------~~-~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~------------~g~l~G--~Pvv~s~~ip~~~~~ 291 (366) T protein:vir:57 238 N-----------MI-RCGWGLSNRTYMTLFGLRDGNGNKVYPEMS------------QGILKG--YPIQRTSAIPANLGD 291 (366) T ss_pred c-----------cc-cCEEEecHHHHHHHHhhhccCCceeccCCC------------CCeecc--eeeEEcccccccccc Confidence 0 00 112369999999999999988888885321 156644 688888765421111 Q ss_pred CCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEE-ecCCCCCCCCCCc------cchhh Q lcl|NC_020862. 306 GATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIV-KKPGEATADRNDP------YGKVG 378 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~iv-k~pG~~tad~~DP------lgQrg 378 (405) +.+ -. .++||+=+.-.++..+ .+++-+ ..+.. .|+ +-|+. T Consensus 292 -----------------~~~---~~-~i~~gdfs~~~i~~~~-------~i~i~~~~ea~~-----~~~~g~~~~~f~~~ 338 (366) T protein:vir:57 292 -----------------DGN---ES-EIYFCDFNDVVIGEDG-------MMKVDFSTEATY-----KDADGQLVSAFARN 338 (366) T ss_pred -----------------CCC---cc-EEEEEecceEEEEEec-------ceEEEEeecccc-----ccccccchhhhhcC Confidence 111 11 2445665554455442 233222 11111 122 22344 Q ss_pred hHHHH--HHHHHhhccccceEEEEEecC Q lcl|NC_020862. 379 FSSIK--FFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 379 ~~gwK--~~~~~~iL~~~~marie~~a~ 404 (405) ...+| +.+.+.+.+++-++++.-+-= T Consensus 339 ~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 339 QSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ceeEEeeeeeCcEeeccccEEEEecccC Confidence 45555 345556666665555543333 No 130 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.36 E-value=2.1e-05 Score=46.22 Aligned_cols=269 Identities=12% Similarity=0.024 Sum_probs=136.3 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +..........+..+ +.-+-+--|.+..+....+...+.+++...+++.+.++-..+..-. .......| |+ T Consensus 121 ~~~~~~~~~~~t~~~-gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~---~~~~~v~E-----~~ 191 (394) T protein:vir:97 121 TTPVEPQKDGIKKEN-AKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRAT---TKMVTVAE-----LE 191 (394) T ss_pred hhhhhhhcccccccc-ccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCC---Cccceecc-----cc Confidence 111111111111111 1111222334444444445678889999999988876533322100 00111112 11 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) .. +.....++..|+.+.++++.++.+|+++ +-|+..++...+..++.+..+ T Consensus 192 ~~----------------------------~~~~~~~~~~v~l~~~k~~~~i~is~el-l~ds~~~~~~~i~~~la~~~~ 242 (394) T protein:vir:97 192 KN----------------------------PALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVDLVGIVSESISQIKV 242 (394) T ss_pred cc----------------------------cccccccceeEEeehhheeeehhhHHHH-HhhhhHHHHHHHHHHHHHHHH Confidence 11 1111135667889999999999999985 445565777777666654333 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) .+++. .|++|.+. . ...+..++++|..+...+..... .+ T Consensus 243 -~~~~~---~i~~g~~~------~-------------~~~~~~~~~~~~~~~~~~~~~~~------------------~a 281 (394) T protein:vir:97 243 -NTTND---AIAKVLKS------F-------------TTKTVKNLDEIKALLNGGFDPAY------------------NV 281 (394) T ss_pred -HHHHH---HHhhcccc------c-------------cccccccHHHHHHHHHhhhhhhh------------------CC Confidence 34443 23443221 0 01234678888776654332211 12 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + -++||.+...|+.|+|..+.|-|.|- +-.+--+.|-|- .+++++.. .+|. T Consensus 282 ~--~v~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~--pv~~~~~~----~~~~------------- 332 (394) T protein:vir:97 282 S--LIVSQSFYQTLDTLKDGNGRYLLQDD--------ITAVSGKVLLGK--PVFVLSDE----VLGA------------- 332 (394) T ss_pred E--EEEcHHHHHHHHHhhccCCCeeeecC--------cCCCCCceeccc--eeEEeccc----ccCC------------- Confidence 3 35899999999999998888888652 123334567663 33433311 1111 Q ss_pred ccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) + .++||+ +.|.....++ +.+-+. .+.+.+.++.++ +.+.+.++++.-++. T Consensus 333 ----~------~~~~gd~~~~~~~~~~~~--------~~~~~~---------~~~~~~~~~~~~-~r~d~~v~~~~a~~~ 384 (394) T protein:vir:97 333 ----N------KAFIGDFKRGVLFADRKD--------LGLRWA---------DNEIYGQYLQAV-LRFGVSKVDDKAGYY 384 (394) T ss_pred ----c------cEEEeeccccEEEEEecc--------eEEEEe---------cccccceeEEEE-EEEccEEecccceEE Confidence 0 135665 2232222221 222111 233334444333 678888999999988 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++.-..- T Consensus 385 ~~~~~~~ 391 (394) T protein:vir:97 385 VTFTPEP 391 (394) T ss_pred EEecccc Confidence 8863222 No 131 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.33 E-value=1.1e-05 Score=47.72 Aligned_cols=269 Identities=12% Similarity=0.067 Sum_probs=136.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) .....+ .++..+-+.. -+-.+..+ +++......+.+++...+++.+.++..... ..+.. .+. T Consensus 128 ~~~~~~---~~~~~~~~~~-vp~~~~~~-i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-------~~~~~-~~~----- 189 (397) T protein:vir:96 128 GAEKRD---GFTSVEGGAL-IPQELLQP-QLEPKDIVDLSKYVRSVPVNSASGKFPVIS-------KSGSK-MAT----- 189 (397) T ss_pred hhhhhh---cccccccccc-hhHHHHHH-HHHhhhhhhHHHhhhhccccccceeEEEEe-------ccCCc-ccc----- Confidence 111111 1111111111 11122222 333344445567777777777665432111 11100 000 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ..|.|-.+.-...++..|+-++++++.++.+|+++ +-|+..++...+..++.+..+ T Consensus 190 -----------------------~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~el-l~ds~~~l~~~i~~~l~~~~~ 245 (397) T protein:vir:96 190 -----------------------VQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEM-IDDASYDVTGLIADEIQDQSL 245 (397) T ss_pred -----------------------ccccccccccccccccceeecHhHhhcchhhHHHH-HhhhHHHHHHHHHHHHHHHHH Confidence 11112222222235667788999999999999985 445555777777766654333 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) ..++ ..|++|.+.. .+.+.+++++|.++.-....... .+ T Consensus 246 -~~~~---~~i~~g~g~~-------------------~~~~~~~~d~~~~~~~~~~~~~~------------------~a 284 (397) T protein:vir:96 246 -NTKN---ADIAAVLKTA-------------------TAKSVVGVDGLKDLINKEIKKVY------------------DV 284 (397) T ss_pred -HHHH---HHHhhccccc-------------------ccccccchHHHHHHHHHhhhhhc------------------Cc Confidence 3333 3455554321 12346788888877644222111 11 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) +-++||....-|+.|+|-.+.|-|.|- +-.+..+.+-| ..+++++.+++ |.+ T Consensus 285 --~~v~n~~~~~~l~~lkd~~G~~~~~~~--------~~~~~~~~l~G--~pv~~~~~~~~----~~~------------ 336 (397) T protein:vir:96 285 --KLFISASMYSELDKLKDKNGRYLLQDS--------ITAASGKQLLG--KEVVVLDDDVI----GKS------------ 336 (397) T ss_pred --EEEEcHHHHHHHHHhhccCCCeEeccC--------ccCCCcccccc--cceEEeccccc----CCC------------ Confidence 358999999999999998888888652 22344456755 34555554332 110 Q ss_pred ccCCcceeeeEEEEEcccc-ceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQA-FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~A-fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mari 399 (405) .+ -+ .++||+-+ |-.+..+ ..+.+-+. .+.+.++++.++ +.+.+.+++++-++++ T Consensus 337 ---~~---~~-~~~~gd~~~~~~~~~~-------~~~~~~~~---------~~~~~~~~~~~~-~r~d~~~~~~~a~~~~ 392 (397) T protein:vir:96 337 ---VG---NV-VGFIGDAKAFASFFDR-------KQVSVSWV---------DNNIYGQLLAGI-IRYDVKATDKKAGFYV 392 (397) T ss_pred ---CC---ce-EEEEeehhcceEeEee-------cceEEEEe---------cccccceeEEEE-EEEccEEecccceEEE Confidence 01 12 24567544 1122222 11232222 122334444444 5788889999999999 Q ss_pred EEecC Q lcl|NC_020862. 400 YSVIP 404 (405) Q Consensus 400 e~~a~ 404 (405) ++-+- T Consensus 393 ~~~~a 397 (397) T protein:vir:96 393 TFTIG 397 (397) T ss_pred EeecC Confidence 84433 No 132 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.26 E-value=5.4e-05 Score=43.97 Aligned_cols=276 Identities=13% Similarity=0.064 Sum_probs=137.8 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) .....+ .++++.-+. .-+..|....+....+...+.+++.+.+|+.+.++-.... ... ++. +. T Consensus 105 ~~~~~~---~~t~~~gg~-~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~-----~~~--~~----- 167 (389) T protein:vir:10 105 VIDATS---KVTSTEAGV-LIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK-RAT-----DRF--SS----- 167 (389) T ss_pred hhhhhc---ccccCCcce-eehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEe-cCC-----Ccc--cc----- Confidence 010111 111121111 1222445555445556678888999999987765422222 110 000 00 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) ..|.|-++.....++..|+.++++++.++.+|+++ +-|++.++...|..++.+ +. T Consensus 168 -----------------------~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~-~~ 222 (389) T protein:vir:10 168 -----------------------VAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEA-IADSAVDLTALVGQSIKE-KS 222 (389) T ss_pred -----------------------ccccccccccccccceeeeeeheeeEeeehhhHHH-HhhhhHHHHHHHHHHHHH-HH Confidence 11112222222346677889999999999999984 556666788877666554 33 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) ..+++.. |++|.+... . . ..-+..++++|.++.. .|+... . T Consensus 223 ~~~~~~~---i~~g~~~~~----~-~-----------~~~~~~~~d~l~~~~~~~~~~~~-------------------~ 264 (389) T protein:vir:10 223 VNTYNAM---IAPVLQSFT----A-K-----------KTTTDTLVDSLKHILNVDLDPAY-------------------S 264 (389) T ss_pred HHHHHHH---Hhhhhcccc----c-c-----------cccccccHHHHHHHHHhhhhhhh-------------------C Confidence 3444443 343332110 0 0 0113467777766543 332211 1 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) + +.++|+.....|+.|+|..+.|-|.|- +.+.. -.+..++|-|-.++++.. .+.+.. T Consensus 265 a--~~~~n~~~~~~L~~lkd~~G~~i~~~~--~~~~~--~~~~~~~l~G~pV~~~~~-~~~~~~---------------- 321 (389) T protein:vir:10 265 R--ALVVTQSLFNTLDTLKDKNGRYLLHDA--SDSIT--DGTAKGTILGVPVYVVGD-TLLGSL---------------- 321 (389) T ss_pred c--EEEecHHHHHHHHHhhccCCCeeeecC--ccccc--ccccccccccceeEEecc-cccCCC---------------- Confidence 1 357999999999999998889888753 32221 124445677744443332 122100 Q ss_pred cccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIA 397 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ma 397 (405) ++ |+ .++||+ ++|-...-++ +++-+.. .+.+. .++... +.+.+.+++++=++ T Consensus 322 ----~~--~~--~~~~gd~~~~~~~~~~~~--------~~i~~~~--------~~~~~-~~~~~~-~r~d~~~~~~~a~~ 375 (389) T protein:vir:10 322 ----AG--DQ--KAFVGDLKRGVLFTDRQQ--------VTLAWED--------SKIYG-KYLGAA-FRFGVQKADSKAGY 375 (389) T ss_pred ----CC--ce--EEEEeeccccEEEEeecc--------eEEEeec--------ccccc-ceEEEE-EEeccEEecccceE Confidence 01 11 367775 3333222222 2222221 12222 222222 46677777887777 Q ss_pred EEEEecCC Q lcl|NC_020862. 398 VAYSVIPE 405 (405) Q Consensus 398 rie~~a~~ 405 (405) .++..... T Consensus 376 ~~~~~~~~ 383 (389) T protein:vir:10 376 FVTNTDVP 383 (389) T ss_pred EEEeeccC Confidence 77755444 No 133 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=97.02 E-value=0.00021 Score=40.80 Aligned_cols=282 Identities=11% Similarity=-0.031 Sum_probs=142.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhh-hh------hccccccccCcCCCCEEEEEecccCCCCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEM-FF------SPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQ 73 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~l-v~------~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~e 73 (405) |. .+++.++.-+++..| -.++....+++ -| .+.++...+=..-|.+|.+=.|.+|..+...+.+ T Consensus 1 Ma--------~~~T~l~d~i~pevf-~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~d 71 (330) T protein:vir:10 1 MA--------NELTKILDTITPQQY-NAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGN 71 (330) T ss_pred CC--------CCceEeeeeechhHH-HHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCC Confidence 32 234666665555554 34444444432 23 2233333332346999999999999656555655 Q ss_pred CCCc-ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 74 GLDA-TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 74 Gvtp-~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) |-++ ...++ +...-.+.+.++|.=+++||.+. .-+-.|.++++. T Consensus 72 g~~~i~~~ki----------------------------------~t~~~~a~i~~~~k~~~~tD~a~-~~~g~dp~~~i~ 116 (330) T protein:vir:10 72 GDKALETGKI----------------------------------TAGADIACVLYRGRGWAANELTG-VVAGSDPVRAIL 116 (330) T ss_pred Cccccchhhc----------------------------------ccceeEEEEEeecceeeehhhhh-hhcchhHHHHHH Confidence 5321 11111 22234689999999999999874 446668888877 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEE-ecCCCcc---ceeeecccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKV-FTGAATS---MVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIK 228 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~-yag~ats---~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~ 228 (405) ..+.+--+. ..|.+|++-..-++ ++-.+.+ ..+.....+. ....++++.|-++...|..+... T Consensus 117 ~q~a~~w~~----~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~--~~a~~s~~~l~~A~~~~GD~~~~------- 183 (330) T protein:vir:10 117 NRIGAYWLR----EDQKALIATLNGIFATGTAGEKGALEETHVSDQSK--ASTGIDAGMVLDAKQLLGDSADQ------- 183 (330) T ss_pred HHHHHHhhh----hHHHHHHHHHHhhhhhhhcccchhhhhhheecccc--cccccCHHHHHHHHHHhcccccc------- Confidence 666543333 33444433211111 1100101 0000110111 12358999999998888776553 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT 308 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~ 308 (405) -.+.++||....+||+. +++.-.+|.+. .+.||.+-| .|+|++.-+- T Consensus 184 ------------~~~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~~~~G--~~VivdD~~p-------- 230 (330) T protein:vir:10 184 ------------VTAIAMHSAVYTKLQKD-------NLIQYIQPTTA----TINIPTYLG--YRVIIDDGIA-------- 230 (330) T ss_pred ------------ceEEEEcHHHHHHHHHh-------hhhhhhccccc----Ccccccccc--eEEEEeCCCC-------- Confidence 27899999999999864 46766788776 467899866 5777775331 Q ss_pred ccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccc------hhh---- Q lcl|NC_020862. 309 ATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYG------KVG---- 378 (405) Q Consensus 309 ~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlg------Qrg---- 378 (405) +. ..+|-.++||..|++...=+ .+.+ +.+ . -..||.+ .|- T Consensus 231 ~~----------------~~~yt~yl~~~GAi~~~~~~-----~~~~--v~~-E------tdRd~~~g~~~l~~r~~~~~ 280 (330) T protein:vir:10 231 PT----------------GDIYTSYLFRTGSIGLNTGN-----PSGL--TTF-E------TSREAAKGNDMIYTRRALVM 280 (330) T ss_pred CC----------------CCceeEEEEecCceeeeccc-----CCcc--ccc-c------ccCCccccceEEEEeeEEEe Confidence 10 13788889999988754210 0000 000 0 0112221 000 Q ss_pred -hHHHHHHHHH---hhccccceEEEEEecCC Q lcl|NC_020862. 379 -FSSIKFFYGF---IKLRGERIAVAYSVIPE 405 (405) Q Consensus 379 -~~gwK~~~~~---~iL~~~~marie~~a~~ 405 (405) --|+||--.+ .-..+-+ .|-+.+- T Consensus 281 hp~G~s~~~~~~~~~~~sPt~---~~L~~~~ 308 (330) T protein:vir:10 281 HPYGVKWTGAEVDAGNITPSN---ADLAKFK 308 (330) T ss_pred eeeeeeecccccccCcCCcCh---HHhcCCc Confidence 0111111000 0000000 0000000 No 134 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=96.67 E-value=0.00019 Score=41.04 Aligned_cols=277 Identities=13% Similarity=0.044 Sum_probs=131.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) +..........+.+.-+.-+-..+ ...+++....-.+.+++.+.+++.+.++-.... +.++...-+ . T Consensus 149 ~~~e~~~~~~~~~~~~g~lvp~~~--~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~-~--- 215 (437) T protein:vir:10 149 KTGEVRDVTGIALKDGKVIIPETI--LTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFN-------NSTDLLTAH-T--- 215 (437) T ss_pred HhhhhhhhhhcccccccccchHHH--HHHHHHhhhhhhhhhcceeEeeccCceeeEEee-------ccccccccc-c--- Confidence 111111111121222111111112 223344333445667788888777665422221 111111111 1 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |.+-++.-...++..|+-+.++++.++.+|++++ -|+..++...|..++.+.. T Consensus 216 -------------------------e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~~~- 268 (437) T protein:vir:10 216 -------------------------EYGQTTKNATPVITPILWDLKTYTGGYVFSQELI-SDSSYDWQAELQSRLIELR- 268 (437) T ss_pred -------------------------ccccccccccccceeeeeehhheeeehhhhHHHH-hhhHHHHHHHHHHHHHHHH- Confidence 1111111122356667889999999999999854 4566577777777665433 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHH-HHHhccCccccceeccccccCccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSI-TLTDNYTPKKTTIIKGSRMTDTKTIS 239 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~-~Lk~nrApk~T~ii~gs~~~gT~~I~ 239 (405) ..+++ ..|++|.+...-. . ....+.++|.++.. .|+..... T Consensus 269 ~~~~~---~~i~~g~g~~~~~------~-----------~~~~~~~~~~~~~~~~l~~~~~~------------------ 310 (437) T protein:vir:10 269 DNTDD---SLIITALTDGIKK------T-----------TSTYLLGDLKKVLNVTLKPQDSA------------------ 310 (437) T ss_pred HHHHH---HHHhhhhcccccc------c-----------ccccchhhHHHHHHhhhhhhhhc------------------ Confidence 33333 3456665321111 0 01234455555432 44433321 Q ss_pred ceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccc Q lcl|NC_020862. 240 ASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVS 319 (405) Q Consensus 240 ~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~ 319 (405) .+ +-+||+....-|+.|+|-.+.|-|.|- +. .|.-++|-| +.++.++.|. ...++ T Consensus 311 ~~-~~~~~~~~~~~l~~lkd~~g~~~~~~~--~~------~~~~~~l~G--~pv~~~~~~~--~~~~~------------ 365 (437) T protein:vir:10 311 AA-SIVMSQSAYNLFDMATDAMGRPLLQPN--VT------AATGYTLLG--KTVVIVDDKL--FPSAS------------ 365 (437) T ss_pred CC-EEEEcHHHHHHHHHhhccCCCeeeccC--cc------CCCCccccc--ceeEEecccc--cCCcC------------ Confidence 12 348999999999999998889988763 22 233346655 3344443331 11111 Q ss_pred cccCCcceeeeEEEEEcc--ccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceE Q lcl|NC_020862. 320 DVAGTDKYDIAPLLVVGD--QAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIA 397 (405) Q Consensus 320 ~~~g~~~~DVYp~lV~G~--~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ma 397 (405) .+ -+ .++||+ ++|....-++ +.+-+. +..|.+.|...+- +.+.+.++++.-++ T Consensus 366 ----~~---~~-~~~~gd~~~~~~~~~r~~--------~~~~~~-------~~~~~~~~~~~~~--~r~d~~~~~~~a~~ 420 (437) T protein:vir:10 366 ----AG---DV-NIVVAPLKKAVINFKLTE--------ITGQFQ-------DTYDIWYKQLGIF--LRQNVVQASKDLIV 420 (437) T ss_pred ----CC---ce-EEEEeeccccEEEEeeec--------eEEEEe-------cccccccceeeEE--EEEccEEecccceE Confidence 11 12 245775 3343322222 222111 2345555533222 34688888888888 Q ss_pred EEEEecCC Q lcl|NC_020862. 398 VAYSVIPE 405 (405) Q Consensus 398 rie~~a~~ 405 (405) .|..-.|= T Consensus 421 ~l~~~~~~ 428 (437) T protein:vir:10 421 NLTGKLKA 428 (437) T ss_pred EEEeeccc Confidence 77532222 No 135 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.54 E-value=0.00041 Score=39.15 Aligned_cols=214 Identities=12% Similarity=0.081 Sum_probs=104.8 Q ss_pred EEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHHHHHHHHhccC-ceEEecC-CCccceeeeccccccc Q lcl|NC_020862. 121 RTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITEDLLQADILASA-DVKVFTG-AATSMVTMTGEAADAE 198 (405) Q Consensus 121 i~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted~l~~~ilag~-~~v~yag-~ats~~~~t~~~~~~~ 198 (405) |..-|- --+ ++-| .-..+.+-++..+.++|++..-+....--+-+.+..++ ...-..+ ...+.+++..+.+ .. T Consensus 1 iD~lL~-a~~--~VdD-iD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t-~~ 75 (221) T protein:vir:17 1 MDDLLV-ASQ--FVYD-LDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNT-NN 75 (221) T ss_pred CCcchh-HHH--HHHh-HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceecccccc-CC Confidence 111000 000 0111 12233445666776777765555443333333332222 1111110 0111222222211 11 Q ss_pred CCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccc Q lcl|NC_020862. 199 DDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATI 278 (405) Q Consensus 199 ~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i 278 (405) .+.+ ++-|+.+...|++++.|. .-+++++.|+.-+.|-.- .++-+......++.+.+ T Consensus 76 -~~~l-~dai~~a~~~LdekdVP~-----------------~gR~~vv~P~~y~~LL~~----~d~~~~n~d~~~s~g~~ 132 (221) T protein:vir:17 76 -AQAI-VDGFFEAAAVLDERSAPM-----------------DGRVAVLSPRQYYSLISS----VDTNILNREIGNTQGDM 132 (221) T ss_pred -HHHH-HHHHHHHHHHHhhcCCCC-----------------CCCEEEeCcHHHHHHHHh----cCcceeeeecccccccc Confidence 1223 588899999999999993 358999999888877421 25667766555666667 Q ss_pred cCc-ceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceee----eEEEEEccccceeecceeccCCCC Q lcl|NC_020862. 279 MNG-EIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDI----APLLVVGDQAFATIGLQGMSGKGK 353 (405) Q Consensus 279 ~~g-EIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DV----Yp~lV~G~~Afg~i~l~g~~~~g~ 353 (405) -+| |||+|.| |++++++++-. .+|.... .+++..+...+..++|.+ =--||+=.+|-|+|-|=| ++ T Consensus 133 ~~g~~i~~v~G--~~V~~SnnlP~--~~gt~~~-~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~----~~ 203 (221) T protein:vir:17 133 NTGKGLYVNAG--IRIYKSNVLAS--LYGTNLV-TDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLL----PP 203 (221) T ss_pred cccceeeeecC--cEEEEeccCCc--ccccccc-cCCccccccccccccccccccceEEEEEcchheeeeeeec----CC Confidence 777 8999976 89999997742 2222111 111111111111222221 236889999999999975 23 Q ss_pred CCceEEEecCCCCCCCCC Q lcl|NC_020862. 354 SKFRIIVKKPGEATADRN 371 (405) Q Consensus 354 ~~~~~ivk~pG~~tad~~ 371 (405) +...+++.+.--..+|+- T Consensus 204 ~~~~~~~~~~~~~~~~~~ 221 (221) T protein:vir:17 204 SRPPLVISMFSIRRPDRR 221 (221) T ss_pred CCCceeeeeeeccCCCCC Confidence 322222222111122443 No 136 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=96.16 E-value=0.00079 Score=37.58 Aligned_cols=303 Identities=12% Similarity=0.096 Sum_probs=138.3 Q ss_pred CCcccc--------------CcCCCcccccccc-eeehhhhhHHHHHhhhhhhhhccccccccCcCCC-CEEEEEecccC Q lcl|NC_020862. 1 MPHIYN--------------DPAAGDASTVGPQ-FNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFG-KELKVFYYVPL 64 (405) Q Consensus 1 ~~~~y~--------------~~~~t~~~~v~~q-m~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~G-ktIkfrry~pl 64 (405) .-..|+ ....++++.-+.- .-+.| .+..+....|..++.+++.....+-+.- -.+++ T Consensus 316 a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~-~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~i------ 388 (645) T protein:vir:93 316 ARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEY-AQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRV------ 388 (645) T ss_pred HHhhcccchhhhhhhhhhhhccccccccccCCccCchhh-HHHHHHhhhhhhhHHhhccccccccccccCceee------ Confidence 000010 0011111111221 22233 3333443446678888886542222210 01111 Q ss_pred CCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc Q lcl|NC_020862. 65 LDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD 144 (405) Q Consensus 65 ~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d 144 (405) |. .+.|.++. ....|+.......++..|+-+.++++.++.+|++++. |+. T Consensus 389 p~----~t~~~~a~-------------------------wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~-ds~ 438 (645) T protein:vir:93 389 HA----QVSGGAAG-------------------------WVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIR-FSS 438 (645) T ss_pred ee----eecCcceE-------------------------EeccCccccccccceeEEEEeeEEEEEeehhHHHHHh-hch Confidence 11 11122110 1122333344445778889999999999999998544 555 Q ss_pred cchHHHHHHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_020862. 145 SDLYGHLSREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKT 224 (405) Q Consensus 145 ~~l~~~~~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T 224 (405) .++.+.+...+.+..+ ..+|. .+++|.+.-.....-.+ ........ .....+..++..+...|..++... T Consensus 439 ~~~~~~i~~~l~~aia-~~~d~---a~l~g~g~~~~~~~p~g-i~~~~~~~---~~~~~~~~d~~~~~~~~~~a~~~~-- 508 (645) T protein:vir:93 439 PAADALVRNALAEAVV-ARLDT---DFVDPKKAAVADVSPAS-ITHDVKGT---ASSGNPDADAEAAFGQFVAANLQP-- 508 (645) T ss_pred HHHHHHHHHHHHHHHH-HHHHH---HhhcCCCcccCCccccc-eecccccc---ccccchHHHHHHHHHHHHhcCCCc-- Confidence 5777777777665444 33442 34444322110000000 10000111 112235567888777776665532 Q ss_pred ceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 225 TIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 225 ~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) -. =+-++||.+...|+.|+|..++|-|..+ +. .+ |.+-| ++++++..| + T Consensus 509 -------------~~--a~~vmn~~~~~~L~~lkd~~G~~~~~~~---~~-----~~--~tL~G--~PV~~s~~v-p--- 557 (645) T protein:vir:93 509 -------------TG--AVWLMSSTNALALSMRKNALGQKEYPDM---TL-----LG--GSFQG--LPVIVSQYV-G--- 557 (645) T ss_pred -------------cc--cEEEEcHHHHHHHHhccccCCceeecCC---CC-----CC--ceeec--eeeEEeccC-C--- Confidence 01 1345799999999999988777777432 11 11 45644 566666544 1 Q ss_pred CCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCC---ccchhhhHH Q lcl|NC_020862. 305 AGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRND---PYGKVGFSS 381 (405) Q Consensus 305 aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~D---PlgQrg~~g 381 (405) +.... ++.=++ ++|.+ +.+-+. ++.. ..+++....-|..+.+... -|-|+..++ T Consensus 558 --~~~~~------------gd~s~~----~ig~~--~~v~i~-~s~~--a~~~~~~~~~~~~~~~~~~~~v~lf~~d~va 614 (645) T protein:vir:93 558 --DQLVL------------VNAPDI----YLADD--GGVAVD-MSRE--ASLEMQSEPTGDSTTPSPVELVSMFQTGSVA 614 (645) T ss_pred --cceeE------------eccccE----EEEEe--cceEEE-eecc--eeEEEeecccccccccccccchhHhhcCceE Confidence 00000 000012 23322 222221 1111 1223222211111111111 136778888 Q ss_pred HH--HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 382 IK--FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 382 wK--~~~~~~iL~~~~marie~~a~~ 405 (405) +| +.+.+.+.+++-.++|.= +.= T Consensus 615 ira~~r~d~~~~~p~a~~~lt~-~~~ 639 (645) T protein:vir:93 615 IRAERWINWRRRRTAAVAVITG-VNY 639 (645) T ss_pred EEEEEEEcceeeCccceEEEec-ccC Confidence 88 667888888887776652 221 No 137 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=96.11 E-value=0.00099 Score=37.05 Aligned_cols=315 Identities=12% Similarity=0.033 Sum_probs=154.4 Q ss_pred CCccccCcCCCccccccc-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGP-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+-+=+.......++=.+ .+...-|.-+.+..-+..-++..+=.++. -..||+..|-|---. ...-++.|-.+.| T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRt--I~~gkS~qf~~lG~s--~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGET--ELQVLAPGQSPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeee--EEeeecCCCCcCC Confidence 554411111111111011 12222233444444444445555556665 467788888754111 1111333333333 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccc-hHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSD-LYGHLSREMLRG 158 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~-l~~~~~~ell~~ 158 (405) +.+.+ .| +.+-|..-|--.=+++.|-|-. .|=| +-.++++|++.. T Consensus 77 ~~~~~--------------dk----------------~~ItIDtLL~a~~~V~dlDd~q----~~yD~vRse~s~e~G~A 122 (400) T protein:vir:10 77 TSTQA--------------DK----------------NQLVIDATVIARNTVAHLHDVQ----GDIDSLKPKLATNQAKQ 122 (400) T ss_pred CCccc--------------Cc----------------EEEEeCceeeecchhhhHHHHh----hccccccHHHHHHHHHH Confidence 33211 11 2233344443333444444432 3323 345666666654 Q ss_pred HhhHH-HHHHHHHHhccCceE-E---ecCCC--ccceeeecccccccCCceecHHH----HHHHHHHHHhccCcccccee Q lcl|NC_020862. 159 ANEIT-EDLLQADILASADVK-V---FTGAA--TSMVTMTGEAADAEDDGLITLKD----LKRLSITLTDNYTPKKTTII 227 (405) Q Consensus 159 ~~~~t-ed~l~~~ilag~~~v-~---yag~a--ts~~~~t~~~~~~~~n~~it~~~----lr~~~~~Lk~nrApk~T~ii 227 (405) -+... +..||-.+++++... . ..|+. ....++++..+ ...++... ++.+...|.++..|. T Consensus 123 LA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~----~~~~~~~~l~~A~~~A~~~LdEkdVP~----- 193 (400) T protein:vir:10 123 LKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSVNVEVNEG----EALVNPQYVMAAVEFALEQQLEQEVDI----- 193 (400) T ss_pred HHHHHHHHHHHHHHHhcccccccccccCCccccccceeeccccc----ccccCHHHHHHHHHHHHHHHHhcCCCc----- Confidence 33332 234454555553221 1 11222 11222332222 22344444 456666677666652 Q ss_pred ccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcC--CcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 228 KGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYA--DAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 228 ~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya--~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) + -++.++.|+--.-|++ .+-+++. .|+ .....-.|+|.+|.| ||+++++++---++. T Consensus 194 -----------~-d~vvl~pp~~Ys~Ll~------~dkLvnr-df~~s~~g~~~~g~v~~v~G--v~Iv~Sn~lP~~a~~ 252 (400) T protein:vir:10 194 -----------S-DVAILMPWRYFNVLRD------ADRIVDK-SYTISQSGATIQGFVLSSYN--CPVIPSNRFPKYSQG 252 (400) T ss_pred -----------c-ceEEEcCHHHHHHHHh------CCcccch-hccccCCCccccceEEEEec--eEEEeeCcCCcccCc Confidence 1 1566666655555554 3456666 465 335567999999965 899999987532222 Q ss_pred CCcc----cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHH Q lcl|NC_020862. 306 GATA----TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSS 381 (405) Q Consensus 306 Ga~~----~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~g 381 (405) .... ++++..|.+..-.. ---.|+|=.+|-+++.+....++ ---||=-|..++= T Consensus 253 ~~~~~lS~a~~G~~y~~t~d~s-----~~~av~F~~sAv~tvk~~~lt~~-----------------~~~d~r~~~~~id 310 (400) T protein:vir:10 253 QKHHLLSNEDNGYRYDPIAEMN-----GAIAVLFTADALLVGRSIDVIGD-----------------IFYEKKEKTYYID 310 (400) T ss_pred ccccccccCCCCccCCcccccc-----ceeEEEEehhheEEEEeeccccc-----------------cccchhhHHHHHH Confidence 1111 11222222211111 12456777777777555422211 1257778999999 Q ss_pred HHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 382 IKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 382 wK~~~~~~iL~~~~marie~~a~~ 405 (405) -|+.|+...+|++.-++++++=.. T Consensus 311 ~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 311 TFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred HHHHhCCcccchhheEEEEecCCc Confidence 999999999999999999998666 No 138 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.08 E-value=0.00019 Score=40.96 Aligned_cols=271 Identities=13% Similarity=0.101 Sum_probs=131.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-+.-..++.+.-+. .-+.-+..+.+......-.+.+++.+.++.- .++ .+...+..+ .....| T Consensus 111 ~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~-a~~v~E------- 177 (387) T protein:vir:96 111 AQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDD-DDFITD------- 177 (387) T ss_pred HHHHHhhhccCCCCCCce-eechhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCc-cccccc------- Confidence 000000001111111111 1122233444444444555667777766542 122 111111111 111122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |...+..-.++..|+-+.++|+.|+.+|++ ++-|++.++.+.|..++.+..+ T Consensus 178 ---------------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~e-ll~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:96 178 ---------------------------VETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred ---------------------------cccccccccccceeeechheeeeechhhHH-HHhhhHHHHHHHHHHHHHHHHH Confidence 222222234566788999999999999998 5567777888888877776544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . +|+.. .+..|.++-.-.|..++.. . ..+....++++|.++...|+....+. + T Consensus 230 ~-~e~~~--~~~~g~g~g~~~g~~~~~~----~---~~~~~~~~~d~i~~~~~~l~~~y~~n-----------------a 282 (387) T protein:vir:96 230 A-KERKD--ALAVSPKSGLEHMSFYNGS----V---KEVEGADMYDAIINALADLHEDYRDN-----------------A 282 (387) T ss_pred H-HHHHh--HhhcCCCccccceeeeccc----c---ccccccchHHHHHHHHhccChhhhcC-----------------C Confidence 3 44322 1334433322222111110 0 01112346889999888887654321 1 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + .++|+.....+..+.+.-+.+-| .+.-+.|-|- ..+.+. T Consensus 283 ~--~imn~~t~~~~~~~~~~~~~~~~-------------~~~~~~llG~--PV~~~~----------------------- 322 (387) T protein:vir:96 283 T--IYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFGK--PVVFTD----------------------- 322 (387) T ss_pred E--EEEechHHHHHHHHHhcCCCccc-------------ccCCcccccc--ceEEec----------------------- Confidence 2 36787777777667665444433 2222333331 111111 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCc-cchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDP-YGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iL~~~~mari 399 (405) |. +-++||+=++.-+.++++. ++.- .|. -|+++|..+. ++.+.+.+++-++.+ T Consensus 323 -------~~-~~~~~GDf~~~~~~~~~~~----------~~~~-------~~~~~~~~~~~~~~-r~Dg~v~~~~A~~~l 376 (387) T protein:vir:96 323 -------AA-VKPIVGDFNYFGINYDGTT----------YDTD-------KDVKKGEYLFVLTA-WYDQQRTLDSAFRIA 376 (387) T ss_pred -------CC-Cceeeechhhhhhhhhhhh----------heec-------ccccCCceEEEEEE-EeCcEeechhheEEE Confidence 01 1246787665555554322 1111 111 2455555544 789999999999999 Q ss_pred EEecCC Q lcl|NC_020862. 400 YSVIPE 405 (405) Q Consensus 400 e~~a~~ 405 (405) +..+.. T Consensus 377 ~~ka~~ 382 (387) T protein:vir:96 377 KAKENT 382 (387) T ss_pred EeecCC Confidence 997666 No 139 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.08 E-value=0.00019 Score=40.96 Aligned_cols=271 Identities=13% Similarity=0.101 Sum_probs=131.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-+.-..++.+.-+. .-+.-+..+.+......-.+.+++.+.++.- .++ .+...+..+ .....| T Consensus 111 ~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~-a~~v~E------- 177 (387) T protein:vir:94 111 AQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDD-DDFITD------- 177 (387) T ss_pred HHHHHhhhccCCCCCCce-eechhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCc-cccccc------- Confidence 000000001111111111 1122233444444444555667777766542 122 111111111 111122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |...+..-.++..|+-+.++|+.|+.+|++ ++-|++.++.+.|..++.+..+ T Consensus 178 ---------------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~e-ll~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:94 178 ---------------------------VETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred ---------------------------cccccccccccceeeechheeeeechhhHH-HHhhhHHHHHHHHHHHHHHHHH Confidence 222222234566788999999999999998 5567777888888877776544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . +|+.. .+..|.++-.-.|..++.. . ..+....++++|.++...|+....+. + T Consensus 230 ~-~e~~~--~~~~g~g~g~~~g~~~~~~----~---~~~~~~~~~d~i~~~~~~l~~~y~~n-----------------a 282 (387) T protein:vir:94 230 A-KERKD--ALAVSPKSGLEHMSFYNGS----V---KEVEGADMYDAIINALADLHEDYRDN-----------------A 282 (387) T ss_pred H-HHHHh--HhhcCCCccccceeeeccc----c---ccccccchHHHHHHHHhccChhhhcC-----------------C Confidence 3 44322 1334433322222111110 0 01112346889999888887654321 1 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + .++|+.....+..+.+.-+.+-| .+.-+.|-|- ..+.+. T Consensus 283 ~--~imn~~t~~~~~~~~~~~~~~~~-------------~~~~~~llG~--PV~~~~----------------------- 322 (387) T protein:vir:94 283 T--IYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFGK--PVVFTD----------------------- 322 (387) T ss_pred E--EEEechHHHHHHHHHhcCCCccc-------------ccCCcccccc--ceEEec----------------------- Confidence 2 36787777777667665444433 2222333331 111111 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCc-cchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDP-YGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iL~~~~mari 399 (405) |. +-++||+=++.-+.++++. ++.- .|. -|+++|..+. ++.+.+.+++-++.+ T Consensus 323 -------~~-~~~~~GDf~~~~~~~~~~~----------~~~~-------~~~~~~~~~~~~~~-r~Dg~v~~~~A~~~l 376 (387) T protein:vir:94 323 -------AA-VKPIVGDFNYFGINYDGTT----------YDTD-------KDVKKGEYLFVLTA-WYDQQRTLDSAFRIA 376 (387) T ss_pred -------CC-Cceeeechhhhhhhhhhhh----------heec-------ccccCCceEEEEEE-EeCcEeechhheEEE Confidence 01 1246787665555554322 1111 111 2455555544 789999999999999 Q ss_pred EEecCC Q lcl|NC_020862. 400 YSVIPE 405 (405) Q Consensus 400 e~~a~~ 405 (405) +..+.. T Consensus 377 ~~ka~~ 382 (387) T protein:vir:94 377 KAKENT 382 (387) T ss_pred EeecCC Confidence 997666 No 140 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.08 E-value=0.00019 Score=40.96 Aligned_cols=271 Identities=13% Similarity=0.101 Sum_probs=131.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) ....-+.-..++.+.-+. .-+.-+..+.+......-.+.+++.+.++.- .++ .+...+..+ .....| T Consensus 111 ~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~-a~~v~E------- 177 (387) T protein:vir:26 111 AQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDD-DDFITD------- 177 (387) T ss_pred HHHHHhhhccCCCCCCce-eechhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCc-cccccc------- Confidence 000000001111111111 1122233444444444555667777766542 122 111111111 111122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |...+..-.++..|+-+.++|+.|+.+|++ ++-|++.++.+.|..++.+..+ T Consensus 178 ---------------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~e-ll~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:26 178 ---------------------------VETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred ---------------------------cccccccccccceeeechheeeeechhhHH-HHhhhHHHHHHHHHHHHHHHHH Confidence 222222234566788999999999999998 5567777888888877776544 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISA 240 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~ 240 (405) . +|+.. .+..|.++-.-.|..++.. . ..+....++++|.++...|+....+. + T Consensus 230 ~-~e~~~--~~~~g~g~g~~~g~~~~~~----~---~~~~~~~~~d~i~~~~~~l~~~y~~n-----------------a 282 (387) T protein:vir:26 230 A-KERKD--ALAVSPKSGLEHMSFYNGS----V---KEVEGADMYDAIINALADLHEDYRDN-----------------A 282 (387) T ss_pred H-HHHHh--HhhcCCCccccceeeeccc----c---ccccccchHHHHHHHHhccChhhhcC-----------------C Confidence 3 44322 1334433322222111110 0 01112346889999888887654321 1 Q ss_pred eEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccccc Q lcl|NC_020862. 241 SRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSD 320 (405) Q Consensus 241 syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~ 320 (405) + .++|+.....+..+.+.-+.+-| .+.-+.|-|- ..+.+. T Consensus 283 ~--~imn~~t~~~~~~~~~~~~~~~~-------------~~~~~~llG~--PV~~~~----------------------- 322 (387) T protein:vir:26 283 T--IYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFGK--PVVFTD----------------------- 322 (387) T ss_pred E--EEEechHHHHHHHHHhcCCCccc-------------ccCCcccccc--ceEEec----------------------- Confidence 2 36787777777667665444433 2222333331 111111 Q ss_pred ccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCc-cchhhhHHHHHHHHHhhccccceEEE Q lcl|NC_020862. 321 VAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDP-YGKVGFSSIKFFYGFIKLRGERIAVA 399 (405) Q Consensus 321 ~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iL~~~~mari 399 (405) |. +-++||+=++.-+.++++. ++.- .|. -|+++|..+. ++.+.+.+++-++.+ T Consensus 323 -------~~-~~~~~GDf~~~~~~~~~~~----------~~~~-------~~~~~~~~~~~~~~-r~Dg~v~~~~A~~~l 376 (387) T protein:vir:26 323 -------AA-VKPIVGDFNYFGINYDGTT----------YDTD-------KDVKKGEYLFVLTA-WYDQQRTLDSAFRIA 376 (387) T ss_pred -------CC-Cceeeechhhhhhhhhhhh----------heec-------ccccCCceEEEEEE-EeCcEeechhheEEE Confidence 01 1246787665555554322 1111 111 2455555544 789999999999999 Q ss_pred EEecCC Q lcl|NC_020862. 400 YSVIPE 405 (405) Q Consensus 400 e~~a~~ 405 (405) +..+.. T Consensus 377 ~~ka~~ 382 (387) T protein:vir:26 377 KAKENT 382 (387) T ss_pred EeecCC Confidence 997666 No 141 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=95.90 E-value=0.0011 Score=36.76 Aligned_cols=307 Identities=13% Similarity=0.104 Sum_probs=133.5 Q ss_pred cccccccceeehhhhhHHHHHhhhhhhhhcccccc-----ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCc Q lcl|NC_020862. 12 DASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-----QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGN 86 (405) Q Consensus 12 ~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-----~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gn 86 (405) =..+++..+.--+ |+.|.--+..+|+.+.+.+. .+ .+.|.||+. |++.+- +...|.+..++. + T Consensus 1 MAn~l~~~~~ii~--~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~----p~~~~~-~~~~G~~~t~~~--~-- 68 (430) T protein:vir:10 1 MALNEGQIVTLAV--DEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWM----PVEQES-PTQEGWDLTDKA--T-- 68 (430) T ss_pred CccchhhHHHHHH--HHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEe----cccccc-ccccCcccCCCC--C-- Confidence 0111222111122 55555555567777654322 22 267888743 333322 233466554431 1 Q ss_pred ccccccccccccccccccccccccccccceeeeeEEEEe-eeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHH Q lcl|NC_020862. 87 LYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTL-TEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITED 165 (405) Q Consensus 87 ly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l-~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted 165 (405) | +.|. -+.+++ +|.+..++|+++.+. .++.-.+.++..|..=++++..| T Consensus 69 ------~----------i~e~------------~v~~~v~~~k~V~~~~~~kel~--~~~~~~~~i~~Am~~LA~~Vd~d 118 (430) T protein:vir:10 69 ------G----------LLEL------------NVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELK 118 (430) T ss_pred ------c----------cccc------------eEEEEEeeeccceEEechhHhc--ChhHHHHHhHHHHHHHHHHHHHH Confidence 1 1111 123344 467889999987643 22223455555666666667766 Q ss_pred HHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEE Q lcl|NC_020862. 166 LLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAY 245 (405) Q Consensus 166 ~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~ 245 (405) ++.---+.+..++...-+.+. + ..-..+++-.+-+.|..|.+|+- --+-++ T Consensus 119 l~~~~~~~~~~v~~~~~~t~~-----------~--~~~~~~~~A~a~~~L~~~~vP~~----------------~~R~~v 169 (430) T protein:vir:10 119 VANMAAEMGSLVITSPDAIGT-----------N--TADAWNFVADAEELMFSRELNRD----------------MGTSYF 169 (430) T ss_pred HHHHhhhcccccccccccCCC-----------c--CCcchhhHHHHHHHHHHhcCCCC----------------CCcEEE Confidence 664333333333322111100 0 11234777888899999999951 026677 Q ss_pred EcccchHHH-HHHhcccCCCcceehhhcCCcccccCcceeE-ecCCcEE-EEeCcchhhhhcCCCcccCCCcc------- Q lcl|NC_020862. 246 IGSELEIYI-TELVDSLGNPAFVPVEKYADAATIMNGEIGA-IPGAHLR-IVVVPQMMHYAGAGATATAANRG------- 315 (405) Q Consensus 246 ~h~dl~~di-r~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGs-i~g~n~R-fv~~p~~~~~~~aGa~~~~t~~~------- 315 (405) +-|+....+ .++...++.. =...+.+.+||||+ +.| |+ +-.++..-+-.+.-+.....+.+ T Consensus 170 ldp~~~~~l~~~l~~l~~~~-------~~~~~A~r~g~i~~~~~G--fd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:10 170 FNPQDYKKAGYDLTKRDIFG-------RIPEEAYRDGTIQRQVAG--FDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred eChHHHHHHHhhhccccccc-------cchhHHHhhccccccchh--hhhhhhcCCcccccCccCcCceecccccccccc Confidence 888877777 3343321111 11334567899997 643 44 22333332222211111100000 Q ss_pred ------------------------------------------------------cccccccCCcceeeeEEEE------- Q lcl|NC_020862. 316 ------------------------------------------------------YQVSDVAGTDKYDIAPLLV------- 334 (405) Q Consensus 316 ------------------------------------------------------~~~~~~~g~~~~DVYp~lV------- 334 (405) +.++....++.+.|||-++ T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:10 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 1111111223345666553 Q ss_pred ----------------------Eccc-----------cce--eeccee-c---------cCCCCC-CceEEEecCCCCCC Q lcl|NC_020862. 335 ----------------------VGDQ-----------AFA--TIGLQG-M---------SGKGKS-KFRIIVKKPGEATA 368 (405) Q Consensus 335 ----------------------~G~~-----------Afg--~i~l~g-~---------~~~g~~-~~~~ivk~pG~~ta 368 (405) +|+. ||+ +.+|.- + .-..+. .+.+++-.- T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~----- 395 (430) T protein:vir:10 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ----- 395 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEe----- Confidence 2322 222 111100 0 000000 122222111 Q ss_pred CCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 369 DRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 369 d~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .|.-.....+.|=.+||+..|++||..++=....- T Consensus 396 --yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 396 --GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred --cccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 11112222234456788888888886544322222 No 142 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=95.90 E-value=0.0011 Score=36.76 Aligned_cols=307 Identities=13% Similarity=0.104 Sum_probs=133.5 Q ss_pred cccccccceeehhhhhHHHHHhhhhhhhhcccccc-----ccCcCCCCEEEEEecccCCCCCCccccCCCcccccccCCc Q lcl|NC_020862. 12 DASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNK-----QMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAGGN 86 (405) Q Consensus 12 ~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~-----~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~gn 86 (405) =..+++..+.--+ |+.|.--+..+|+.+.+.+. .+ .+.|.||+. |++.+- +...|.+..++. + T Consensus 1 MAn~l~~~~~ii~--~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~----p~~~~~-~~~~G~~~t~~~--~-- 68 (430) T protein:vir:92 1 MALNEGQIVTLAV--DEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWM----PVEQES-PTQEGWDLTDKA--T-- 68 (430) T ss_pred CccchhhHHHHHH--HHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEe----cccccc-ccccCcccCCCC--C-- Confidence 0111222111122 55555555567777654322 22 267888743 333322 233466554431 1 Q ss_pred ccccccccccccccccccccccccccccceeeeeEEEEe-eeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhHHHH Q lcl|NC_020862. 87 LYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTL-TEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEITED 165 (405) Q Consensus 87 ly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l-~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ted 165 (405) | +.|. -+.+++ +|.+..++|+++.+. .++.-.+.++..|..=++++..| T Consensus 69 ------~----------i~e~------------~v~~~v~~~k~V~~~~~~kel~--~~~~~~~~i~~Am~~LA~~Vd~d 118 (430) T protein:vir:92 69 ------G----------LLEL------------NVAVNMGEPDNDFFQLRADDLR--DETAYRHRIQSAARKLANNVELK 118 (430) T ss_pred ------c----------cccc------------eEEEEEeeeccceEEechhHhc--ChhHHHHHhHHHHHHHHHHHHHH Confidence 1 1111 123344 467889999987643 22223455555666666667766 Q ss_pred HHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceEEEE Q lcl|NC_020862. 166 LLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAY 245 (405) Q Consensus 166 ~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~ 245 (405) ++.---+.+..++...-+.+. + ..-..+++-.+-+.|..|.+|+- --+-++ T Consensus 119 l~~~~~~~~~~v~~~~~~t~~-----------~--~~~~~~~~A~a~~~L~~~~vP~~----------------~~R~~v 169 (430) T protein:vir:92 119 VANMAAEMGSLVITSPDAIGT-----------N--TADAWNFVADAEELMFSRELNRD----------------MGTSYF 169 (430) T ss_pred HHHHhhhcccccccccccCCC-----------c--CCcchhhHHHHHHHHHHhcCCCC----------------CCcEEE Confidence 664333333333322111100 0 11234777888899999999951 026677 Q ss_pred EcccchHHH-HHHhcccCCCcceehhhcCCcccccCcceeE-ecCCcEE-EEeCcchhhhhcCCCcccCCCcc------- Q lcl|NC_020862. 246 IGSELEIYI-TELVDSLGNPAFVPVEKYADAATIMNGEIGA-IPGAHLR-IVVVPQMMHYAGAGATATAANRG------- 315 (405) Q Consensus 246 ~h~dl~~di-r~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGs-i~g~n~R-fv~~p~~~~~~~aGa~~~~t~~~------- 315 (405) +-|+....+ .++...++.. =...+.+.+||||+ +.| |+ +-.++..-+-.+.-+.....+.+ T Consensus 170 ldp~~~~~l~~~l~~l~~~~-------~~~~~A~r~g~i~~~~~G--fd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~ 240 (430) T protein:vir:92 170 FNPQDYKKAGYDLTKRDIFG-------RIPEEAYRDGTIQRQVAG--FDDVLRSPKLPVLTKSTATGITVSGAQSFKPVA 240 (430) T ss_pred eChHHHHHHHhhhccccccc-------cchhHHHhhccccccchh--hhhhhhcCCcccccCccCcCceecccccccccc Confidence 888877777 3343321111 11334567899997 643 44 22333332222211111100000 Q ss_pred ------------------------------------------------------cccccccCCcceeeeEEEE------- Q lcl|NC_020862. 316 ------------------------------------------------------YQVSDVAGTDKYDIAPLLV------- 334 (405) Q Consensus 316 ------------------------------------------------------~~~~~~~g~~~~DVYp~lV------- 334 (405) +.++....++.+.|||-++ T Consensus 241 ~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~ 320 (430) T protein:vir:92 241 WQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSL 320 (430) T ss_pred ceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccc Confidence 1111111223345666553 Q ss_pred ----------------------Eccc-----------cce--eeccee-c---------cCCCCC-CceEEEecCCCCCC Q lcl|NC_020862. 335 ----------------------VGDQ-----------AFA--TIGLQG-M---------SGKGKS-KFRIIVKKPGEATA 368 (405) Q Consensus 335 ----------------------~G~~-----------Afg--~i~l~g-~---------~~~g~~-~~~~ivk~pG~~ta 368 (405) +|+. ||+ +.+|.- + .-..+. .+.+++-.- T Consensus 321 ~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~----- 395 (430) T protein:vir:92 321 SPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQ----- 395 (430) T ss_pred cccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEe----- Confidence 2322 222 111100 0 000000 122222111 Q ss_pred CCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 369 DRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 369 d~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .|.-.....+.|=.+||+..|++||..++=....- T Consensus 396 --yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 396 --GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred --cccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 11112222234456788888888886544322222 No 143 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=95.49 E-value=0.002 Score=35.42 Aligned_cols=315 Identities=13% Similarity=0.085 Sum_probs=140.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -+.....-..+++++-+.-+-+-+ ..+.+....+...+.+++...+++.+ .+++-+- ...++ .-+..+. T Consensus 144 ~~~~~~~~~~~~~~~gg~~vp~~~-~~~ii~~~~~~~~i~~l~~~~~~~~~---~~~~~~~----~~~~~-~a~wv~E-- 212 (497) T protein:vir:10 144 APAAIGQNPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRPVTSP---NLSYLTE----SAAHN-NAAAVAE-- 212 (497) T ss_pred hHHHHHhhhcccCcccccccchhh-hHHHHHHHHhhhhHHhhccccccCCC---ceEEEEE----cCCCC-cceeecc-- Confidence 000000111222222222233333 34444445567788899998888654 2443321 10000 0011122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |+.......++..|+.+.++++.++.+|++++. |.. .+...+..++.+ +- T Consensus 213 ---------------------------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~-d~~-~l~~~i~~~l~~-~i 262 (497) T protein:vir:10 213 ---------------------------AGTYPFSSEEFARVYEQVGKVANALTITDEGLR-DAP-ELFNFVQGRLLE-GI 262 (497) T ss_pred ---------------------------CcccccccccceeeEeeeeeeEeecHhHHHHHH-hHH-HHHHHHHHHHHH-HH Confidence 222223345677889999999999999999765 553 466665555554 23 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccc--eeeeccccccc---------------CC-ceecHHHHHHHHHHHHhccCcc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSM--VTMTGEAADAE---------------DD-GLITLKDLKRLSITLTDNYTPK 222 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~--~~~t~~~~~~~---------------~n-~~it~~~lr~~~~~Lk~nrApk 222 (405) ...+| ..+++|.++-.=.|--++. .+.+....... .+ ..+..+-+..+......+-.+. T Consensus 263 ~~~~d---~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 339 (497) T protein:vir:10 263 QRKEE---VQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG 339 (497) T ss_pred HHHHH---HHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhh Confidence 33333 4466665432211100000 00000000000 00 0001111111111111000000 Q ss_pred ccceeccccccCc----------ccc------cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEe Q lcl|NC_020862. 223 KTTIIKGSRMTDT----------KTI------SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAI 286 (405) Q Consensus 223 ~T~ii~gs~~~gT----------~~I------~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi 286 (405) .-..+.+....+. ..+ +++ ..++||....-|+.|+|-.+.|-|.+.-.-.... ..+.-+++ T Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~~~~~~l 416 (497) T protein:vir:10 340 SGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PVNGGKNI 416 (497) T ss_pred hccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cccCCcee Confidence 0000000000000 000 111 3568999999999999988888887642211221 23334466 Q ss_pred cCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCC Q lcl|NC_020862. 287 PGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEA 366 (405) Q Consensus 287 ~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~ 366 (405) -| .++++++.|. +| ++ +||.=..+.+.+ + ....+++.+..- T Consensus 417 ~G--~pV~~t~~~~----~~---------------------~~----~~Gd~~~~~~~i-~----~r~~~~v~~~~~--- 457 (497) T protein:vir:10 417 WG--VPVVTTPLIP----LG---------------------TI----LVGHFAPSVIQT-A----RREGVTMQMTNS--- 457 (497) T ss_pred ec--eeeEecCCCC----CC---------------------ce----EEeecccceEEE-E----EecccEEEeecc--- Confidence 44 5788887553 11 11 334322211111 0 001233333321 Q ss_pred CCCCCCccchhhhHHHHH--HHHHhhccccceEEEEEecCC Q lcl|NC_020862. 367 TADRNDPYGKVGFSSIKF--FYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 367 tad~~DPlgQrg~~gwK~--~~~~~iL~~~~marie~~a~~ 405 (405) ..++-+++.+++++ .+.+.+++++-+++++..++= T Consensus 458 ----~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 458 ----NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred ----cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 22345667777764 478899999999999988776 No 144 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=95.49 E-value=0.002 Score=35.42 Aligned_cols=315 Identities=13% Similarity=0.085 Sum_probs=140.9 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) -+.....-..+++++-+.-+-+-+ ..+.+....+...+.+++...+++.+ .+++-+- ...++ .-+..+. T Consensus 144 ~~~~~~~~~~~~~~~gg~~vp~~~-~~~ii~~~~~~~~i~~l~~~~~~~~~---~~~~~~~----~~~~~-~a~wv~E-- 212 (497) T protein:vir:78 144 APAAIGQNPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRPVTSP---NLSYLTE----SAAHN-NAAAVAE-- 212 (497) T ss_pred hHHHHHhhhcccCcccccccchhh-hHHHHHHHHhhhhHHhhccccccCCC---ceEEEEE----cCCCC-cceeecc-- Confidence 000000111222222222233333 34444445567788899998888654 2443321 10000 0011122 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) |+.......++..|+.+.++++.++.+|++++. |.. .+...+..++.+ +- T Consensus 213 ---------------------------~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~-d~~-~l~~~i~~~l~~-~i 262 (497) T protein:vir:78 213 ---------------------------AGTYPFSSEEFARVYEQVGKVANALTITDEGLR-DAP-ELFNFVQGRLLE-GI 262 (497) T ss_pred ---------------------------CcccccccccceeeEeeeeeeEeecHhHHHHHH-hHH-HHHHHHHHHHHH-HH Confidence 222223345677889999999999999999765 553 466665555554 23 Q ss_pred hHHHHHHHHHHhccCceEEecCCCccc--eeeeccccccc---------------CC-ceecHHHHHHHHHHHHhccCcc Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAATSM--VTMTGEAADAE---------------DD-GLITLKDLKRLSITLTDNYTPK 222 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~ats~--~~~t~~~~~~~---------------~n-~~it~~~lr~~~~~Lk~nrApk 222 (405) ...+| ..+++|.++-.=.|--++. .+.+....... .+ ..+..+-+..+......+-.+. T Consensus 263 ~~~~d---~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 339 (497) T protein:vir:78 263 QRKEE---VQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG 339 (497) T ss_pred HHHHH---HHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhh Confidence 33333 4466665432211100000 00000000000 00 0001111111111111000000 Q ss_pred ccceeccccccCc----------ccc------cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEe Q lcl|NC_020862. 223 KTTIIKGSRMTDT----------KTI------SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAI 286 (405) Q Consensus 223 ~T~ii~gs~~~gT----------~~I------~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi 286 (405) .-..+.+....+. ..+ +++ ..++||....-|+.|+|-.+.|-|.+.-.-.... ..+.-+++ T Consensus 340 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--~~~~~~~l 416 (497) T protein:vir:78 340 SGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN--PVNGGKNI 416 (497) T ss_pred hccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc--cccCCcee Confidence 0000000000000 000 111 3568999999999999988888887642211221 23334466 Q ss_pred cCCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCC Q lcl|NC_020862. 287 PGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEA 366 (405) Q Consensus 287 ~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~ 366 (405) -| .++++++.|. +| ++ +||.=..+.+.+ + ....+++.+..- T Consensus 417 ~G--~pV~~t~~~~----~~---------------------~~----~~Gd~~~~~~~i-~----~r~~~~v~~~~~--- 457 (497) T protein:vir:78 417 WG--VPVVTTPLIP----LG---------------------TI----LVGHFAPSVIQT-A----RREGVTMQMTNS--- 457 (497) T ss_pred ec--eeeEecCCCC----CC---------------------ce----EEeecccceEEE-E----EecccEEEeecc--- Confidence 44 5788887553 11 11 334322211111 0 001233333321 Q ss_pred CCCCCCccchhhhHHHHH--HHHHhhccccceEEEEEecCC Q lcl|NC_020862. 367 TADRNDPYGKVGFSSIKF--FYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 367 tad~~DPlgQrg~~gwK~--~~~~~iL~~~~marie~~a~~ 405 (405) ..++-+++.+++++ .+.+.+++++-+++++..++= T Consensus 458 ----~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 458 ----NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred ----cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 22345667777764 478899999999999988776 No 145 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=95.32 E-value=0.0023 Score=35.06 Aligned_cols=287 Identities=11% Similarity=-0.045 Sum_probs=124.5 Q ss_pred CCc----cccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MPH----IYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~~----~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) +.. .++.- ..++++.-+.-+-+ -+.++.+....+.-.+.+++...++..+..+ |..+ ..-+. ..-..|| T Consensus 72 l~~~~r~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~-i~~~--~~~~~-a~~~~E~- 145 (390) T protein:vir:40 72 LTSDESKYYNEVIAGNGFAGVTALLPP-TVFERVFEDLTVEHPLLSKINFVNTTATTEW-IISV--GDVAT-AWWGPLC- 145 (390) T ss_pred ccHHHHHHHHHHHhccCcccCcccccH-HHHHHHHHHHHhhhhhhhhceeeecCCceeE-EEEE--cCCcc-eeeeccc- Confidence 110 01110 11112222211111 2233333344445556677888777543322 2111 11111 0011111 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) |.++.-...++..|+-++++++.++.+|++++ -|+..++.+.|..++ T Consensus 146 --------------------------------~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell-~ds~~~l~~~i~~~l 192 (390) T protein:vir:40 146 --------------------------------AEIKEVLDNGFDKIQTGMYKLSAYIPVCNAML-DLGPSWLDQYVRTIL 192 (390) T ss_pred --------------------------------cccCccccccceeeEeeeeeEEEeehhhHHHH-hcchHHHHHHHHHHH Confidence 11111112356667889999999999999854 466667777766666 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDT 235 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT 235 (405) .+. ....++ ..+++|.++..=.|--++....+...........++..+.-.+...|+...-.. +.+ T Consensus 193 a~~-i~~~~~---~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~------~~~---- 258 (390) T protein:vir:40 193 GEA-MALGLE---AGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDN------GKK---- 258 (390) T ss_pred HHH-HHHHHH---hhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcc------hhh---- Confidence 643 333333 356666554221111111111111111111123455555544444443321100 000 Q ss_pred ccccceEEEEEcccchH-HHH---HHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 236 KTISASRIAYIGSELEI-YIT---ELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 236 ~~I~~syv~~~h~dl~~-dir---~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) ..+. =+-+||+.... -|+ .+.|..+.+-| ...+ -.+.+|.++.|- +| T Consensus 259 -~~~~-a~~i~n~~t~~~~l~~~~~~~d~~G~~v~--------~~~~----------~g~pvv~~~~~p----~~----- 309 (390) T protein:vir:40 259 -SVSD-AILVINPADYWSKIYAATSYMTPQGVWVT--------GILP----------VPLEIVQSVAVP----VG----- 309 (390) T ss_pred -hhcC-ceEEEcchhHHHHHHHHhhccCCCCcccc--------ccCC----------CceeEEEcCCCC----CC----- Confidence 0111 12468875532 233 34432222211 1111 125677776542 10 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHh Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFI 389 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~ 389 (405) + ++||.-+...++.+ +.+++-+.+ +.+-.++..+++ ..+.+. T Consensus 310 ----------------~----i~~Gd~s~~~i~~~-------~~~~v~~~~---------~~~f~~~~~~~r~~~r~dg~ 353 (390) T protein:vir:40 310 ----------------K----AVAGRAKDYFMGIG-------SEQVIRTST---------EYRLLDDETLYYAKQYANGR 353 (390) T ss_pred ----------------c----EEEEeeceEEEEee-------cceEEEecc---------hhhhhcCcEEEEEEEEeCCE Confidence 1 45676555555543 123433321 223333444444 567888 Q ss_pred hccccceEEEEEecCC Q lcl|NC_020862. 390 KLRGERIAVAYSVIPE 405 (405) Q Consensus 390 iL~~~~marie~~a~~ 405 (405) +.+++=++.+++.+.+ T Consensus 354 v~~~~A~~~l~~~~~~ 369 (390) T protein:vir:40 354 PKDNSSFLVFDITGLE 369 (390) T ss_pred EecccceEEEEeeccC Confidence 8888889999887776 No 146 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=95.26 E-value=0.0024 Score=34.93 Aligned_cols=308 Identities=11% Similarity=0.016 Sum_probs=141.0 Q ss_pred CCccccCcCCCcccc-cccceeehhhhhHHHHHhhhhhh----hhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MPHIYNDPAAGDAST-VGPQFNVHYWDRKSLIDEAEEMF----FSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~~~y~~~~~t~~~~-v~~qm~t~y~~~k~L~~a~p~lv----~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) ||. ||+ .|.-+. +-|+.=.-|=..+..+.. .|+ ...-++...+=...|.+|.+=.|.+|..+.-- T Consensus 1 M~~-~~~--~T~l~Dii~pEvF~~Yv~~~~~e~~--~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n----- 70 (367) T protein:vir:80 1 MPD-FNN--QVRLVDAVIPEVYTSYTAIDRPELT--AFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPN----- 70 (367) T ss_pred Ccc-hhh--hhhhhhccchhhhhHHHhhhhhhhh--hhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccc----- Confidence 887 444 111111 233321212111111000 011 11112222222356888888888888432211 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) |+++.|.+.+|.. .++.-...+.+..+|.=...+|-+.+..- +|.++++..++ T Consensus 71 ------------~~~d~~~~~~t~~--------------kittg~~~a~v~~r~kaw~~~Dla~~lsG-~dpm~~Ia~qv 123 (367) T protein:vir:80 71 ------------YGSDNPNVEAPID--------------GLGSGEMKTTKTWLNKAYGAMDLTAELAG-SNPMTRIRNRF 123 (367) T ss_pred ------------cCCCCCccccccc--------------ccccchheeeeehhcccchhhhHHHHhhC-chHHHHHHHHH Confidence 1122222212111 12223345788899999999997766654 58888877776 Q ss_pred HHHHhhHHHHHHHHHHhc-----cCceE----Ee-----cC---CCccceeeecccccccCCceecHHHHHHHHHHHHhc Q lcl|NC_020862. 156 LRGANEITEDLLQADILA-----SADVK----VF-----TG---AATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDN 218 (405) Q Consensus 156 l~~~~~~ted~l~~~ila-----g~~~v----~y-----ag---~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~n 218 (405) ..--....+..|...+.. .++.. .+ +. ...-...++++.. ..+..++.+.+-+|...|-++ T Consensus 124 a~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~--~~~~~~s~~~~~~A~~~lGD~ 201 (367) T protein:vir:80 124 GVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTN--PADAVFNREAFVDAAFTMGDH 201 (367) T ss_pred HHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhccccccccccCceeeeeeccCC--CccceecHHHHHHHHHHhccc Confidence 644444433333322210 11110 00 00 0011112233222 234689999999998888887 Q ss_pred cCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcc Q lcl|NC_020862. 219 YTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQ 298 (405) Q Consensus 219 rApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~ 298 (405) ...- =++++||.....|+.+ ..+.-.+|.+. +.+|+...| .|.|+.. T Consensus 202 ~~~l-------------------~~i~mHS~V~~~L~~~-------~li~~i~~sd~----~~~i~ty~G--~~VIvDD- 248 (367) T protein:vir:80 202 VGSI-------------------AAIAVHSMVYKRMTNN-------DEIEFIPDSKG----QLTIPTYMG--KVVIVDD- 248 (367) T ss_pred cccc-------------------cEEEEchHHHHHHHhc-------cccccccCCCC----ccccceecc--eeEEEeC- Confidence 6641 4679999999999875 36666688876 457999866 4666554 Q ss_pred hhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchh- Q lcl|NC_020862. 299 MMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKV- 377 (405) Q Consensus 299 ~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQr- 377 (405) -+|-.+.| + ..+|-..+||..||+--... ++.+.+. .+.|-...-+.-|=|=.| T Consensus 249 ~~Pv~~~~-----------------a--~~~yttYlfg~GAi~~~~~~-----~~~~~E~-~Rd~~~~~~gG~d~L~~Rr 303 (367) T protein:vir:80 249 GMPVFGTG-----------------A--DKTYLSILFGGAAFGYADGA-----PQVPVAV-GRRELRGNGSGLEYILERK 303 (367) T ss_pred CCcccccC-----------------C--CceEEEEEEecceeeecccC-----Cccceec-ccchhhhcCCceEEEEeee Confidence 44433222 1 13899999999998743321 1111111 111100000011111111 Q ss_pred ----hhHHHHHHHHHhhccccce----EEEEEecCC Q lcl|NC_020862. 378 ----GFSSIKFFYGFIKLRGERI----AVAYSVIPE 405 (405) Q Consensus 378 ----g~~gwK~~~~~~iL~~~~m----arie~~a~~ 405 (405) .-.|.||--++.+--.-.. ..-+.-+|. T Consensus 304 ~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eEEeecceeeecccccccccccccccccccccCCCC Confidence 0023333222211000000 000111232 No 147 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=95.13 E-value=0.0018 Score=35.68 Aligned_cols=272 Identities=13% Similarity=0.127 Sum_probs=125.5 Q ss_pred CC--ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MP--HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~--~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +. ..-+.-..++.+.-|.-+ +.-+.++-+......-.+.+++.+.++. |.++ .+...+...+ ....|| T Consensus 109 ~~~~~~~~al~~~t~s~gG~~I-P~~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~-p~~~~~~~~a-~~v~E~---- 178 (387) T protein:vir:93 109 MEAQRLLHALPTGNDSGGDKLL-PKTLSKEIVSEPFAKNQLREKARLTNIK---GLEI-PRVSYTLDDD-DFITDV---- 178 (387) T ss_pred hhhHHHHHhhccCcCCCCceee-chhHHHHHHHHHHhhchhhhheeeeecC---CceE-EEEeecCCcc-ccccCc---- Confidence 00 000000011111111111 1112233333333334455667766553 2222 1222222211 111222 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) ...+..-.++..|+-+.++|+.++.+|++ ++-|++.++...|..++.+. T Consensus 179 ------------------------------~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll~Ds~~~l~~~i~~~la~~ 227 (387) T protein:vir:93 179 ------------------------------ETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSG 227 (387) T ss_pred ------------------------------ccccccccccceeeeeheeeeeechhhHH-HHhhhHHHHHHHHHHHHHHH Confidence 11122223556678899999999999998 55677778888887777654 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) -+ .+|+.. -+.+|.++-.-.|.-++.. + ..+....++++|.++...|+...... T Consensus 228 ~~-~~e~~~--~~~~g~g~g~p~g~l~~~~-~------~~v~~~~~~d~i~~~~~~l~~~~~~~---------------- 281 (387) T protein:vir:93 228 LA-AKERKD--ALAVSPKSGLDHMSFYNGS-V------KEVEGADMYDAIINALADLHEDYRDN---------------- 281 (387) T ss_pred HH-HHHHHh--HhhcCCCccccceeeeccc-c------ccccccchHHHHHHHHhccChhhhcC---------------- Confidence 33 344322 2344443322222111110 0 01112345788888888887654421 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + +.++|+.....+..+.+.-+.+-| .|.=.+|-|- ..+.+. T Consensus 282 --a-~~~mn~~t~~~~~~~~~d~~~~~~-------------~~~~~~llG~--PV~~~~--------------------- 322 (387) T protein:vir:93 282 --A-TIYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFGK--PVVFTD--------------------- 322 (387) T ss_pred --C-EEEEechHHHHHHHHHhcCCCccc-------------ccCCcccccc--ceEEec--------------------- Confidence 1 236777666666656554333333 2222233331 211110 Q ss_pred ccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEE Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAV 398 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~mar 398 (405) |.. -++||+=+++-+.++++. ++.- +.-=-|+++|..+ .++.+.+.+++-+.. T Consensus 323 ---------~~~-~~~~GDf~~~~~~~~~~~----------~~~~------~~~~~~~~~~~~~-~r~d~~v~~~eA~~~ 375 (387) T protein:vir:93 323 ---------AAV-KPIVGDFNYFGINYDGTT----------YDTD------KDVKKGEYLFVLT-AWYDQQRTLDSAFRI 375 (387) T ss_pred ---------CCC-ceeeeehhhhheehhhhe----------eeec------ccccCCceeEEEE-eeeCceeechhheEE Confidence 011 246787666655554322 1111 0000245555544 488999999999988 Q ss_pred EEEecCC Q lcl|NC_020862. 399 AYSVIPE 405 (405) Q Consensus 399 ie~~a~~ 405 (405) ++..++= T Consensus 376 l~~k~~~ 382 (387) T protein:vir:93 376 AKAKENT 382 (387) T ss_pred EEeecCC Confidence 8874444 No 148 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=94.88 E-value=0.0033 Score=34.21 Aligned_cols=287 Identities=10% Similarity=0.081 Sum_probs=132.5 Q ss_pred ccccCcCCCcccccccceeehhhhhHHHHHhhhh--hhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 3 HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEE--MFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 3 ~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~--lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) |+-+ +..+ ..+.+.+ +|.|.++-.. --+.+||.. .|. -.|.-..-+... T Consensus 1 m~it------~~~l-~~l~~~~--~~~~~~~y~~a~~~~~~~a~~--~~s-df~~~~~~~lg~----------------- 51 (302) T protein:vir:10 1 MLIN------KQSL-NAAFVAI--KTIFNNAFAAAPTTWQKIAME--VPS-NTSSNDYKWLST----------------- 51 (302) T ss_pred Cccc------HHHH-HHHHHHH--HHHHHHHHHhhhhhhhceeee--cCC-CcceeeceecCC----------------- Confidence 2211 1112 2233333 5555544321 245778743 453 344444333333 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) +|.|.|..|...--.+.-...+..+..||.-+.+|.+++.-|.- .++..+..+|++.++ T Consensus 52 --------------------~p~l~e~~Ge~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdl-g~~~~~~~~~G~aaa 110 (302) T protein:vir:10 52 --------------------FPKMRRWIGAKVVKNLKAYKYVVENEDFEATVEVDRNDIEDDQI-GIYSPQAKMAGYSAA 110 (302) T ss_pred --------------------CCCccccccceeeccccccceeEEeecccceecccHHhhccccc-chhHHHHHHHHHHHH Confidence 33444432222223345556678999999999999986544433 668887777776665 Q ss_pred hHHHHHHHHHHhccCceEEecCCC----------ccceeeecccccccCCceecH---HHHHHHHHHHHhccCcccccee Q lcl|NC_020862. 161 EITEDLLQADILASADVKVFTGAA----------TSMVTMTGEAADAEDDGLITL---KDLKRLSITLTDNYTPKKTTII 227 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~yag~a----------ts~~~~t~~~~~~~~n~~it~---~~lr~~~~~Lk~nrApk~T~ii 227 (405) +.-.++++.-|.+|-+.+.|-|.. .+..+...+. ..-....++. ...|.+.+.++...-+ T Consensus 111 ~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~-~~~~~~~l~~~~~~aa~~am~~~k~~~G~------ 183 (302) T protein:vir:10 111 QLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAP-LSNASQAAAKAGYGAARTAMKKFKDEEGR------ 183 (302) T ss_pred hhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchh-hhhcccccchHHHHHHHHHHHHHhhhccc------ Confidence 555555555555544444444322 0001110000 0000123444 4444444454444332 Q ss_pred ccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCC Q lcl|NC_020862. 228 KGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGA 307 (405) Q Consensus 228 ~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa 307 (405) . --|.|.+ .+|+|+++..-+.+... .++. .++.=-+. +.+..|++|++. +|. T Consensus 184 ----~---L~i~P~~-LiVp~~le~~A~~ll~~---------~~~~------~g~~Np~~-g~~~~vv~p~L~----s~~ 235 (302) T protein:vir:10 184 ----S---LNVSPNV-LLVGPALEDVAKMLLTN---------PKLA------DNTPNPYV-GTAELVVDGRIE----SDT 235 (302) T ss_pred ----c---cccCCCE-EEecchhHHHHHHHhhc---------cccC------CCCcceec-cceEEEEeeccC----CCC Confidence 1 3455666 68999999999886421 1111 12211122 346888888763 221 Q ss_pred cccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHH Q lcl|NC_020862. 308 TATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYG 387 (405) Q Consensus 308 ~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~ 387 (405) +-| +...+ ..+++. ..=|.+..-..-..+.+.+|- .+++...= | .|-=+-.|+.-|.++|+ T Consensus 236 ------aWy-L~a~~--~~i~~~--~l~g~~~P~~~~~~~~~~dgv-~~k~~~d~-G------vd~R~~~G~~~wq~a~~ 296 (302) T protein:vir:10 236 ------AWF-LLDTT--KPVKPF--IFQPRKQPEFVSQVNLDSDDV-FNLRKLKF-G------AEARAAAGYGFWQLAYG 296 (302) T ss_pred ------ceE-EEecC--CccceE--EEcCccccEEEeccCCCCCce-EEEEEEEE-e------eeeeeecchhhhhhhhc Confidence 111 11111 223332 222444333332322222211 12222221 2 25555667777777777 Q ss_pred HhhccccceEEEEEec Q lcl|NC_020862. 388 FIKLRGERIAVAYSVI 403 (405) Q Consensus 388 ~~iL~~~~marie~~a 403 (405) +.=- +| T Consensus 297 s~g~----------~~ 302 (302) T protein:vir:10 297 STGT----------GA 302 (302) T ss_pred cCcc----------CC Confidence 6531 11 No 149 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=94.32 E-value=0.0017 Score=35.73 Aligned_cols=271 Identities=13% Similarity=0.101 Sum_probs=125.7 Q ss_pred CC-ccccC-cCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MP-HIYND-PAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~-~~y~~-~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +. ....+ -..++.+.-|. .-+.-+.++.+......-.+.+++.+.++. |.++- +...+..+ .....| T Consensus 124 ~~~~~~~~a~~~~t~~~GG~-lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p-~~~~~~~~-a~~v~E----- 192 (402) T protein:vir:93 124 MEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQLREKARLTNIK---GLEIP-RVSYTLDD-DDFITD----- 192 (402) T ss_pred HhHHHHHhhhccCCCcCCcc-ccchhHHHHHHHhHHhhhhhhhhceeeecC---Cceee-eeeccCCc-cccccc----- Confidence 00 00000 01111111111 112222333333333444556677766553 22221 11111111 111111 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |...+..-.++..|+-+.++++.|+.+|++ ++-|++-++.+.|..++.+. T Consensus 193 -----------------------------g~~~~~~~~~f~~i~~~~~k~~~~i~iS~e-ll~Ds~~~l~~~i~~~la~~ 242 (402) T protein:vir:93 193 -----------------------------VETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSG 242 (402) T ss_pred -----------------------------cccccccccccceeeecceeeeeechhhHH-HHhhhHHHHHHHHHHHHHHH Confidence 222222234566788999999999999998 55577767888877777765 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTI 238 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I 238 (405) .+ .+|+.. .+..|.++-.-.|..++.. ..+ +....++++|.++...|+...... T Consensus 243 ~~-~~e~~~--~~~~g~g~g~p~g~~~~~~----~~~---~~~~~~~d~l~~~~~~l~~~y~~n---------------- 296 (402) T protein:vir:93 243 LA-AKERKD--ALAVSPKSGLEHMSFYNGS----VKE---VEGADMYDAIINALADLHEDYRDN---------------- 296 (402) T ss_pred HH-HHHHHh--HhhcCCCccccceeeeccc----ccc---ccccchHHHHHHHHhccChhhhcC---------------- Confidence 44 344332 2344443322222111110 000 112245788888888887644321 Q ss_pred cceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCccccc Q lcl|NC_020862. 239 SASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQV 318 (405) Q Consensus 239 ~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~ 318 (405) + +.++|+.....++.+.+.-+.+-| .+.-+.|-|- -.+.+. T Consensus 297 --a-~~imn~~t~~~~~~~~~d~~~~~~-------------~~~~~~llG~--PV~~t~--------------------- 337 (402) T protein:vir:93 297 --A-TIYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFGK--PVVFTD--------------------- 337 (402) T ss_pred --C-EEEEechHHHHHHHHHhcCCCccc-------------ccCCcccccc--ceEEec--------------------- Confidence 1 236777766677666654333322 2222233221 111110 Q ss_pred ccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCc-cchhhhHHHHHHHHHhhccccceE Q lcl|NC_020862. 319 SDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDP-YGKVGFSSIKFFYGFIKLRGERIA 397 (405) Q Consensus 319 ~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iL~~~~ma 397 (405) |.. .++||+=++.-+.+.++. ++.- .|+ -|+++|..+. ++.+.+.+++=+. T Consensus 338 ---------~~~-~i~~GDf~~~~~~~~~~~----------~~~~-------~~~~~~~~~~~~~~-r~Dg~v~~~~A~~ 389 (402) T protein:vir:93 338 ---------AAV-KPIVGDFNYFGINYDGTT----------YDTD-------KDVKKGEYLFVLTA-WYDQQRTLDSAFR 389 (402) T ss_pred ---------CCC-ceeeechhhhhhhhhhhh----------hhhh-------hcccCCceEEEEEE-EeCcEEechhheE Confidence 111 246787555444443211 1111 222 2566666554 6788899999888 Q ss_pred EEEEecCC Q lcl|NC_020862. 398 VAYSVIPE 405 (405) Q Consensus 398 rie~~a~~ 405 (405) .++.-+.. T Consensus 390 ~l~ik~~~ 397 (402) T protein:vir:93 390 IAKAKENT 397 (402) T ss_pred EEEeecCC Confidence 88775544 No 150 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=93.86 E-value=0.0051 Score=33.15 Aligned_cols=268 Identities=13% Similarity=0.141 Sum_probs=124.9 Q ss_pred CC----------ccc-cCcCCCcc---cccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCC Q lcl|NC_020862. 1 MP----------HIY-NDPAAGDA---STVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLD 66 (405) Q Consensus 1 ~~----------~~y-~~~~~t~~---~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~ 66 (405) +. +.. +.-..++. .-+-|+ -+.++.+........+.+++.+.++. |.++ .+.....+. T Consensus 65 ~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~----~~~~~Ii~~l~~~s~l~~~~~v~~~~---~~~~-p~~~~~~~~ 136 (352) T protein:vir:78 65 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEI-PRVSYTLDD 136 (352) T ss_pred hhhHHHHHHhhHHHHHHHhccCCCCCCceeccH----hHHHHHHHHHHhhcchhhheeeEecC---CceE-EEEecCCCc Confidence 00 000 00000111 112221 11222222222233445556554442 2222 111111111 Q ss_pred CCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccc Q lcl|NC_020862. 67 DLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSD 146 (405) Q Consensus 67 ~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~ 146 (405) + . ..+.|...+....++..|+-+.++|+.++++|++ ++-|++.+ T Consensus 137 a-~----------------------------------~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~e-ll~Ds~~~ 180 (352) T protein:vir:78 137 D-D----------------------------------FITDVETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHGSDVD 180 (352) T ss_pred c-c----------------------------------ccccccccccccccceeeeecceeEEeechhhHH-HHhhhhHH Confidence 1 0 1112333333345677788999999999999998 67777778 Q ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccce Q lcl|NC_020862. 147 LYGHLSREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTI 226 (405) Q Consensus 147 l~~~~~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~i 226 (405) +.+.|..++.+..+. .|+.. -+.+|.++-.-.|.-+... ..+ +...-++++|.++...|+..... T Consensus 181 l~~~i~~~la~~~~~-~e~~~--~~~~g~g~~~~~g~l~~~~----~~~---~t~~~~~d~i~~~~~~l~~~~~~----- 245 (352) T protein:vir:78 181 LVNWVENALQSGLAA-KERKD--ALAVSPKSGLEHMSFYNGS----VKE---VEGANMYDAIINALADLHEDYRD----- 245 (352) T ss_pred HHHHHHHHHHHHHHH-HHHHh--hhhcCCCCcccccceeccc----ccc---ccccchHHHHHHHHhccChhhhc----- Confidence 888888877765543 33332 2234433222222111111 000 11123478888888888655432 Q ss_pred eccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 227 IKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 227 i~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aG 306 (405) .+ +.++++.....++.+.|.-+.|-| .+.-..|-| ..++++. T Consensus 246 ------------~a--~~~mn~~t~~~l~~~~~~~~~~~~-------------~~~~~~llG--~PV~~~~--------- 287 (352) T protein:vir:78 246 ------------NA--TIYMRYADYVKIISVLSNGTTNFF-------------DTPAEKVFG--KPVVFTD--------- 287 (352) T ss_pred ------------CC--EEEEehHHHHHHHHHHhccCCccc-------------ccCCccccc--cceEEec--------- Confidence 11 246688777788877775555444 222123322 1222211 Q ss_pred CcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCcc-chhhhHHHHHH Q lcl|NC_020862. 307 ATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPY-GKVGFSSIKFF 385 (405) Q Consensus 307 a~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPl-gQrg~~gwK~~ 385 (405) |. +-++||+=++.-+...++ +++.. .|++ |+.+|.+. .+ T Consensus 288 ---------------------~~-~~~~~Gdf~~~~~~~~~~----------~~~~~-------~~~~~g~~~f~~~-~r 327 (352) T protein:vir:78 288 ---------------------AA-VKPIVGDFNYFGINYDGT----------TYDTD-------KDVKKGEYLFVLT-AW 327 (352) T ss_pred ---------------------CC-CceeEeehhhhhhhhhhh----------eeeee-------ccccCCeeEEEEE-ee Confidence 01 113578766665555421 12211 1221 23343333 47 Q ss_pred HHHhhccccceEEEEEecCC Q lcl|NC_020862. 386 YGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 386 ~~~~iL~~~~marie~~a~~ 405 (405) +.+.+.+++=++.+++.+.- T Consensus 328 ~Dg~~~~~eA~~~l~~~a~~ 347 (352) T protein:vir:78 328 YDQQRTLDSAFRIAKAKEST 347 (352) T ss_pred eCceeechhheEEEEeeccc Confidence 78888888888888776654 No 151 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=93.49 E-value=0.0074 Score=32.28 Aligned_cols=313 Identities=12% Similarity=0.047 Sum_probs=146.6 Q ss_pred CCccccCcCCCccccccc-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGP-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |+-+=+.......++=.+ .+...-|.-+.+..-+..-++..+=.+|. -..||+..|-|---.- ..-++.|-.+.| T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRt--i~~gkS~qf~~~G~s~--~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGETE--LQVLAPGQSPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeeeE--eeeecCCCCcCC Confidence 554422111111111111 12222223444444444444555556665 3567888877542110 011222333333 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccc-hHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSD-LYGHLSREMLRG 158 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~-l~~~~~~ell~~ 158 (405) +.+.+. | +.+-|..-|--.=+++.|- +...|=| +-.+++++++.. T Consensus 77 ~~~~~d--------------K----------------~~ItID~lL~a~~~V~dlD----e~q~~yD~vRse~s~e~G~A 122 (401) T protein:vir:70 77 TSTQAD--------------K----------------NQLVIDATVIARNTVAHLH----DVQGDIDSLKPKLATNQAKQ 122 (401) T ss_pred CCcccc--------------c----------------EEEEeCceeehhhhhhhHH----HHHhcccccchHHHHHHHHH Confidence 322111 1 1112222221111122222 2233333 345666776655 Q ss_pred HhhHHHHHHHHHH-hccCceEEec---C---CCccceeeecccccccCCceecHH----HHHHHHHHHHhccCcccccee Q lcl|NC_020862. 159 ANEITEDLLQADI-LASADVKVFT---G---AATSMVTMTGEAADAEDDGLITLK----DLKRLSITLTDNYTPKKTTII 227 (405) Q Consensus 159 ~~~~ted~l~~~i-lag~~~v~ya---g---~ats~~~~t~~~~~~~~n~~it~~----~lr~~~~~Lk~nrApk~T~ii 227 (405) -+......+.+.+ ++|..+.... . .--...++.+. +.+..++.. .++.+...|.++..|. T Consensus 123 LA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~i~v~~~----~~~~~~~~~~l~~ai~dA~~~LdEkdVP~----- 193 (401) T protein:vir:70 123 LKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFSINVEVA----EGEALVNPQYVMAAVEFALEQQLEQEVDI----- 193 (401) T ss_pred HHHHHHHHHHHHHHHhccccccccccCCCcCCCceEEecccc----ccccccCHHHHHHHHHHHHHHHHhcCCCc----- Confidence 4443333333344 4543221111 0 00111222222 223345443 4556777777777762 Q ss_pred ccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcC--CcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 228 KGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYA--DAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 228 ~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya--~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) + =|+.++.|.--.-|.+ .|.+++. .|+ .....-+|+|++|.| ||++.++++---.+. T Consensus 194 -----------~-r~vvl~pp~~Ys~Ll~------~d~L~nr-d~~~s~~g~~~~G~v~~vaG--v~Vv~SnnlP~~a~~ 252 (401) T protein:vir:70 194 -----------S-DVAILMPWRYFNVLRD------ADRIVDK-TYTISQSGATIQGFTLSSYN--CPVIPSNRFPKYSQG 252 (401) T ss_pred -----------c-ceEEEcCHHHHHHHHh------cCcccch-hhccccCCccccceEEEEec--eEEEeeccccccccc Confidence 1 1666665555555543 3556665 554 445677999999955 899999987531111 Q ss_pred CCcccCCCcccccccccCCcceee------eEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhh Q lcl|NC_020862. 306 GATATAANRGYQVSDVAGTDKYDI------APLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGF 379 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~~g~~~~DV------Yp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~ 379 (405) -. +-..++...+++||| --.|+|=.+|-+++.+....++ ---|+=-|..+ T Consensus 253 it-------~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~-----------------~~~d~r~~~~~ 308 (401) T protein:vir:70 253 QT-------HHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGD-----------------IFYEKKEKTYY 308 (401) T ss_pred cc-------cccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccc-----------------hhhhhhhhHHH Confidence 00 011111112333331 2356666676666555321111 11466678888 Q ss_pred HHHHHHHHHhhccccceEEEEEecC----C Q lcl|NC_020862. 380 SSIKFFYGFIKLRGERIAVAYSVIP----E 405 (405) Q Consensus 380 ~gwK~~~~~~iL~~~~marie~~a~----~ 405 (405) +=-|..|+...+|++.-+++++.=. + T Consensus 309 id~~~a~g~g~~RPeaa~vv~~k~~~~~~~ 338 (401) T protein:vir:70 309 IDTFMAEGAIPDRWEAVSVVTTKRNTTTGA 338 (401) T ss_pred HHHHHHhCCcccchhheEEEeecCcccccc Confidence 8899999999999999999977644 2 No 152 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=93.03 E-value=0.0091 Score=31.78 Aligned_cols=273 Identities=15% Similarity=0.084 Sum_probs=123.7 Q ss_pred CCc-------------------------cccCcCCCcccccccceeehhhhhH---HHHHhhhhhhhhc--cccccccCc Q lcl|NC_020862. 1 MPH-------------------------IYNDPAAGDASTVGPQFNVHYWDRK---SLIDEAEEMFFSP--LADNKQMPK 50 (405) Q Consensus 1 ~~~-------------------------~y~~~~~t~~~~v~~qm~t~y~~~k---~L~~a~p~lv~~~--fA~~~~mPK 50 (405) |-- -|++-+ -+-|+.=..+| .|.+......+.. .+. ...=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N-~~~e~ 71 (329) T protein:vir:10 1 MDGIFITGVKTMNKEIKNATGKLKLNLQHFANKS--------VEPGDTLLKNKHVGILEKVTAANSYSAPAVIS-NDAIF 71 (329) T ss_pred CCceEEechhhhhhhhhcccceeEEehhhhcCCc--------cCCchhHHHHHHHHHHHHHHHhhceeeeeecc-cceee Confidence 222 222211 11122111111 2222211111111 122 22336 Q ss_pred CCCCEEEEEecccC-CCCCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeee Q lcl|NC_020862. 51 HFGKELKVFYYVPL-LDDLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYG 129 (405) Q Consensus 51 n~GktIkfrry~pl-~~~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG 129 (405) +.|++|++.+-.-- ..|-+- ..|.++ ++ ++....+-+|.| . T Consensus 72 ~~g~tVkIp~i~~~gl~DY~R-~~g~~~-----------------g~-------------------vt~~~~t~tidq-d 113 (329) T protein:vir:10 72 MQGRSFTVIKGDVTELKDYKR-NATNEF-----------------DH-------------------PQIQETTYFLDQ-E 113 (329) T ss_pred ccCcEEEEeeecccccccccC-CCCccc-----------------cc-------------------cccceeEEEeec-c Confidence 78999999865321 000000 012211 11 122334456666 3 Q ss_pred eeEEecchhhhhhhccchH--HHHHHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHH Q lcl|NC_020862. 130 FFMEYTEDSLMFDTDSDLY--GHLSREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKD 207 (405) Q Consensus 130 ~~~e~Td~~~~~d~d~~l~--~~~~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~ 207 (405) -+..+.=+.+|-++-+..+ ..+.++-...+..-..|...-..+++. +++. +.... ++ .=.++. T Consensus 114 R~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~-----a~~~-~~~~~-----t~----~nay~~ 178 (329) T protein:vir:10 114 KYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARN-----KAKH-LTVGS-----GA----DAQYDA 178 (329) T ss_pred cceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhh-----cccc-ccccc-----CH----HHHHHH Confidence 4444443346666654433 233333222222222333333333321 1111 11111 11 124889 Q ss_pred HHHHHHHHHhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEec Q lcl|NC_020862. 208 LKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIP 287 (405) Q Consensus 208 lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~ 287 (405) |+.+...|+++..| ..+++||.|+.-..|+. ++.|+......+. ...+|.||+|. T Consensus 179 i~~a~~~Lde~~vp------------------~~Rvl~VtP~~~~~Lk~------~~~f~~~~~~~~~-~~~~g~Vg~id 233 (329) T protein:vir:10 179 VLDVSVELDEIGAG------------------ASRILFVTPKFYKGIKK------FVIELPQGDNRQQ-VLGKGVQGELD 233 (329) T ss_pred HHHHHHHHHhcCCC------------------CCcEEEeCHHHHHHHHh------hhhhhcccccccc-ceeeeeeeeec Confidence 99999999998775 25899999888888864 5789887666654 56799999996 Q ss_pred CCcEEEEeCcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCC Q lcl|NC_020862. 288 GAHLRIVVVPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEAT 367 (405) Q Consensus 288 g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~t 367 (405) | |.++++|.-. + + + +.+|++-+.|...+ .|= .++++.--.|+. T Consensus 234 G--~~Ii~vps~~-----~-------------------k-~-in~ii~~~~A~~~~-~K~------~~~~~~~p~~~~-- 276 (329) T protein:vir:10 234 G--FTIVKVPSKM-----L-------------------Q-G-VEAMAVIGEVMASP-IQA------NEAKLNSNVPGM-- 276 (329) T ss_pred C--eEEEEecCCc-----c-------------------c-c-eeEEEEcCCceeee-eee------eeeeeeCCCCcc-- Confidence 6 7777776321 0 0 1 23566556555432 110 011111111221 Q ss_pred CCCCCccchhhhHHHHHHHHHhhccccceEE--EEEecCC Q lcl|NC_020862. 368 ADRNDPYGKVGFSSIKFFYGFIKLRGERIAV--AYSVIPE 405 (405) Q Consensus 368 ad~~DPlgQrg~~gwK~~~~~~iL~~~~mar--ie~~a~~ 405 (405) .+| .+--..||++.++++.-..+ ....+++ T Consensus 277 --~a~------~v~gr~yyd~~V~~~k~~~I~~~~~~a~~ 308 (329) T protein:vir:10 277 --FGT------LAEQMLYTGAFVPEHLQKYIFTIGGKEVE 308 (329) T ss_pred --chh------eeeeeeeeeeEEEccccCEEEEecccCcc Confidence 111 01113789999998875443 2233333 No 153 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=92.22 E-value=0.012 Score=31.05 Aligned_cols=308 Identities=13% Similarity=0.081 Sum_probs=134.6 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc-cccCc---CCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN-KQMPK---HFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~-~~mPK---n~GktIkfrry~pl~~~~t~l~eGvt 76 (405) |--- ... +.+.= -|+.|..-+-.+|+.+.+++ ++.-. +.|.||..+ ++.+. +...|.+ T Consensus 1 Ma~~-----------~~~-~lti~-~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip----~p~~~-~~~~G~~ 62 (430) T protein:vir:21 1 MALN-----------EGQ-IVTLA-VDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMP----VEQES-PTQEGWD 62 (430) T ss_pred Cccc-----------cch-hhHHH-HHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEee----ccccc-ccccccc Confidence 2111 111 11211 16666666667788876443 22222 788897544 33222 2223432 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEe-eeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTL-TEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l-~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) -.++ .+-+.| .-+.+++ +|.+..++++++.+....+ ..+-++..| T Consensus 63 ~t~~--------------------~~~~~e------------~~v~~~~~~~~~V~~~~~~kEl~~~~~--~er~l~pAm 108 (430) T protein:vir:21 63 LTDK--------------------ATGLLE------------LNVAVNMGEPDNDFFQLRADDLRDETA--YRRRIQSAA 108 (430) T ss_pred ccCC--------------------Ccccee------------eeEeEEEeeeccceEEeehhHhcChhh--HHHHHHHHH Confidence 2211 111111 1233444 3567788999876432222 233444455 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDT 235 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT 235 (405) ..=++++..|++.--.+.+..++..+-+.... ..=..+++-.+-+.|..|.+|+- T Consensus 109 ~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~-------------~~~~~~~~A~a~~~L~~~~vP~~------------ 163 (430) T protein:vir:21 109 RKLANNVELKVANMAAEMGSLVITSPDAIGTN-------------TADAWNFVADAEEIMFSRELNRD------------ 163 (430) T ss_pred HHHHHHHHHHHHHHhhhhhhccccccCCCCCC-------------CCcchhhHHHHHHHHHHhcCCCC------------ Confidence 55566666666655445544444332111110 01125777777788999999951 Q ss_pred ccccceEEEEEcccchHHH-HHHhcccCCCcceehhhcCCcccccCcceeE-ecCCcEE-EEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 236 KTISASRIAYIGSELEIYI-TELVDSLGNPAFVPVEKYADAATIMNGEIGA-IPGAHLR-IVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 236 ~~I~~syv~~~h~dl~~di-r~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGs-i~g~n~R-fv~~p~~~~~~~aGa~~~~t 312 (405) --+-+++-|+....+ +.+...++. +=...+.+.+|+||+ +.| |+ +..++..-+-...-+..... T Consensus 164 ----~~R~~~~~p~~~~~l~~~l~~~~~~-------~~~~~~A~r~g~i~r~~~G--fd~~~~s~~~~~~t~gt~t~~tv 230 (430) T protein:vir:21 164 ----MGTSYFFNPQDYKKAGYDLTKRDIF-------GRIPEEAYRDGTIQRQVAG--FDDVLRSPKLPVLTKSTATGITV 230 (430) T ss_pred ----CCcEEEeChHHHHHHhhhhcccccc-------ccchhHHHhhcccccccch--hhhhhhcCCcccccCccCcCcee Confidence 126677788777766 334321111 111334567899997 644 55 33443333322211111100 Q ss_pred Ccc-------------------------------------------------------------cccccccCCcceeeeE Q lcl|NC_020862. 313 NRG-------------------------------------------------------------YQVSDVAGTDKYDIAP 331 (405) Q Consensus 313 ~~~-------------------------------------------------------------~~~~~~~g~~~~DVYp 331 (405) +.+ +.|+....+..+.||| T Consensus 231 ~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~P 310 (430) T protein:vir:21 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITP 310 (430) T ss_pred ccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEee Confidence 000 0111111223456777 Q ss_pred EEE-----------------------------Eccc-----------cce--eecce-ecc----------CCCCCCceE Q lcl|NC_020862. 332 LLV-----------------------------VGDQ-----------AFA--TIGLQ-GMS----------GKGKSKFRI 358 (405) Q Consensus 332 ~lV-----------------------------~G~~-----------Afg--~i~l~-g~~----------~~g~~~~~~ 358 (405) -|| +|+. ||+ +.+|. +++ .....++.+ T Consensus 311 ai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsi 390 (430) T protein:vir:21 311 KPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNG 390 (430) T ss_pred cccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEE Confidence 653 2222 222 11110 000 000001222 Q ss_pred EEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 359 IVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 359 ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) ++-.- .|.-.....+.|=.+||+..|++||..++=....- T Consensus 391 rv~~~-------yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 391 IFATQ-------GDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEc-------cccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 22211 12222223344557888888888886544322222 No 154 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=91.09 E-value=0.018 Score=30.21 Aligned_cols=271 Identities=14% Similarity=0.075 Sum_probs=126.2 Q ss_pred CCccccCcCCCcccccccceeehhh---hhHHHHHhhhhhhhhc--cccccccCcCCCCEEEEEecccC-CCCCCccccC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYW---DRKSLIDEAEEMFFSP--LADNKQMPKHFGKELKVFYYVPL-LDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~---~~k~L~~a~p~lv~~~--fA~~~~mPKn~GktIkfrry~pl-~~~~t~l~eG 74 (405) --|-|++- +++-++... .-..|.+.....++.. ++... .=-+.|++|++.+-.-- ..|-+- ..| T Consensus 15 ~~~~~~~~--------~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i~~~gl~DY~R-~~g 84 (319) T protein:vir:94 15 NLQHFANK--------SVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKGDTTELKDYKR-NAT 84 (319) T ss_pred ehhhhhcc--------CCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeeecccccccccC-CCC Confidence 11223221 111122111 1345555555555543 23322 22357899999865421 001110 012 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchH--HHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLY--GHLS 152 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~--~~~~ 152 (405) .++ ++ ++....+-+|.|- -+..+.=+.+|-++-+..+ ..+. T Consensus 85 ~~~-----------------g~-------------------vt~~~~t~tidqd-R~~~F~VD~~D~~Etn~~l~a~~i~ 127 (319) T protein:vir:94 85 NEF-----------------DH-------------------PKIEETTYFLDQE-KYWGRFVDALDRKDTEGNIDINYVV 127 (319) T ss_pred ccc-----------------CC-------------------cccceeEEEeecc-cccccccchhhHhhhhchhhHHHHH Confidence 211 11 1223334566552 2333332236666654433 2222 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) ++-...+..-..|...-..+++. ++.. +.... ++ .=.++.|+.+...|++++.|- T Consensus 128 ~~~~~~~v~PEiDay~~skla~~-----a~~~-~~~~~-----t~----~n~y~~i~~a~~~Lde~~VP~---------- 182 (319) T protein:vir:94 128 ARQGAEVVAPYLDNLRFATLARN-----KAKH-LTVGT-----GS----DAQYDAVLDVSVELDEIKAPE---------- 182 (319) T ss_pred HHHHHHHhhhhhhHHHHHHHHhh-----cccc-ccccc-----CH----HHHHHHHHHHHHHHHhcCCCC---------- Confidence 22222111112233333333321 1111 11111 11 124899999999999998862 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) .+|+||.|+.-..|+. ++.|+.....++ +.+.+|-||+|.| |.++++|.-. + T Consensus 183 --------~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG--~~Vi~vps~~-----~------ 234 (319) T protein:vir:94 183 --------NRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDG--FVIVKVPTKL-----L------ 234 (319) T ss_pred --------CcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecC--eEEEEecccc-----c------ Confidence 5899999988888864 478988777765 4567999999966 7777776311 0 Q ss_pred CcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 390 (405) + | +.++++-+.|... ..|= ..+++.--.||. +| -.|| .||++.+ T Consensus 235 -------------k-~-in~i~~h~~A~~~-~~k~------~~~~~~~p~~~~-----------~a-~~v~gr~y~d~~V 280 (319) T protein:vir:94 235 -------------Q-G-LQAIAVVGEVLAS-PIQA------DLAKTNSNIPGM-----------FG-TLAEQLLYTGAFV 280 (319) T ss_pred -------------c-c-ceEEEEcCCeeee-eeee------eeeeccCCCccc-----------cc-eeeeeeeeeeeEE Confidence 0 1 2344444444321 1210 011111111221 11 1233 7899999 Q ss_pred ccccceEE--EEEecCC Q lcl|NC_020862. 391 LRGERIAV--AYSVIPE 405 (405) Q Consensus 391 L~~~~mar--ie~~a~~ 405 (405) +++.-.++ +....|+ T Consensus 281 ~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 281 PEHLQKYIFTIGGTEVA 297 (319) T ss_pred eccccceEEEeecCCcc Confidence 98885444 3344444 No 155 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=91.09 E-value=0.018 Score=30.21 Aligned_cols=271 Identities=14% Similarity=0.075 Sum_probs=126.2 Q ss_pred CCccccCcCCCcccccccceeehhh---hhHHHHHhhhhhhhhc--cccccccCcCCCCEEEEEecccC-CCCCCccccC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYW---DRKSLIDEAEEMFFSP--LADNKQMPKHFGKELKVFYYVPL-LDDLNVNDQG 74 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~---~~k~L~~a~p~lv~~~--fA~~~~mPKn~GktIkfrry~pl-~~~~t~l~eG 74 (405) --|-|++- +++-++... .-..|.+.....++.. ++... .=-+.|++|++.+-.-- ..|-+- ..| T Consensus 15 ~~~~~~~~--------~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i~~~gl~DY~R-~~g 84 (319) T protein:vir:97 15 NLQHFANK--------SVEPGQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKGDTTELKDYKR-NAT 84 (319) T ss_pred ehhhhhcc--------CCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeeecccccccccC-CCC Confidence 11223221 111122111 1345555555555543 23322 22357899999865421 001110 012 Q ss_pred CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchH--HHHH Q lcl|NC_020862. 75 LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLY--GHLS 152 (405) Q Consensus 75 vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~--~~~~ 152 (405) .++ ++ ++....+-+|.|- -+..+.=+.+|-++-+..+ ..+. T Consensus 85 ~~~-----------------g~-------------------vt~~~~t~tidqd-R~~~F~VD~~D~~Etn~~l~a~~i~ 127 (319) T protein:vir:97 85 NEF-----------------DH-------------------PKIEETTYFLDQE-KYWGRFVDALDRKDTEGNIDINYVV 127 (319) T ss_pred ccc-----------------CC-------------------cccceeEEEeecc-cccccccchhhHhhhhchhhHHHHH Confidence 211 11 1223334566552 2333332236666654433 2222 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) ++-...+..-..|...-..+++. ++.. +.... ++ .=.++.|+.+...|++++.|- T Consensus 128 ~~~~~~~v~PEiDay~~skla~~-----a~~~-~~~~~-----t~----~n~y~~i~~a~~~Lde~~VP~---------- 182 (319) T protein:vir:97 128 ARQGAEVVAPYLDNLRFATLARN-----KAKH-LTVGT-----GS----DAQYDAVLDVSVELDEIKAPE---------- 182 (319) T ss_pred HHHHHHHhhhhhhHHHHHHHHhh-----cccc-ccccc-----CH----HHHHHHHHHHHHHHHhcCCCC---------- Confidence 22222111112233333333321 1111 11111 11 124899999999999998862 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAA 312 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t 312 (405) .+|+||.|+.-..|+. ++.|+.....++ +.+.+|-||+|.| |.++++|.-. + T Consensus 183 --------~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG--~~Vi~vps~~-----~------ 234 (319) T protein:vir:97 183 --------NRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDG--FVIVKVPTKL-----L------ 234 (319) T ss_pred --------CcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecC--eEEEEecccc-----c------ Confidence 5899999988888864 478988777765 4567999999966 7777776311 0 Q ss_pred CcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhh Q lcl|NC_020862. 313 NRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIK 390 (405) Q Consensus 313 ~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 390 (405) + | +.++++-+.|... ..|= ..+++.--.||. +| -.|| .||++.+ T Consensus 235 -------------k-~-in~i~~h~~A~~~-~~k~------~~~~~~~p~~~~-----------~a-~~v~gr~y~d~~V 280 (319) T protein:vir:97 235 -------------Q-G-LQAIAVVGEVLAS-PIQA------DLAKTNSNIPGM-----------FG-TLAEQLLYTGAFV 280 (319) T ss_pred -------------c-c-ceEEEEcCCeeee-eeee------eeeeccCCCccc-----------cc-eeeeeeeeeeeEE Confidence 0 1 2344444444321 1210 011111111221 11 1233 7899999 Q ss_pred ccccceEE--EEEecCC Q lcl|NC_020862. 391 LRGERIAV--AYSVIPE 405 (405) Q Consensus 391 L~~~~mar--ie~~a~~ 405 (405) +++.-.++ +....|+ T Consensus 281 ~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 281 PEHLQKYIFTIGGTEVA 297 (319) T ss_pred eccccceEEEeecCCcc Confidence 98885444 3344444 No 156 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=90.55 E-value=0.02 Score=29.87 Aligned_cols=288 Identities=12% Similarity=0.017 Sum_probs=129.6 Q ss_pred CC----ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MP----HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~----~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) ++ ..||.-...+.++-+. +-+-.+.++.+....+.-.+.+++...++. |+ +++-+...-+.+.- + T Consensus 75 l~~ee~~~~~~~~~~t~~~gG~-liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~---~~-~~i~~~~~~~~a~w-----~- 143 (395) T protein:vir:95 75 LTSEERKFFNDINYDVGYTDEK-ILPETVVERVFDDLQKDHPLLSKINFQNAG---IK-TRVIKADPAGQAVW-----G- 143 (395) T ss_pred cchHHHHHHHHHhhccCCCCce-eccHHHHHHHHHHHHhhhhhhhhceeEecC---Cc-eEEEEecCCcceEE-----e- Confidence 11 1122222222222111 111122444455555566777788887773 33 22211111111000 0 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) .|.|.+.+-...++..|+-+.++++.+..+|+++ +-|+..++...|..++. T Consensus 144 ----------------------------~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el-l~ds~~~ie~~i~~~la 194 (395) T protein:vir:95 144 ----------------------------KVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL-STFGPAWIERFVRTQIQ 194 (395) T ss_pred ----------------------------ecccccCccccccceeeeeceeeEEEeecccHHH-HhcchhHHHHHHHHHHH Confidence 0111122222346667788999999999999985 55566677777666655 Q ss_pred HHHhhHHHHHHHHHHhccCceE--------EecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVK--------VFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIK 228 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v--------~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~ 228 (405) + +....++ ..+++|.++- .+....+..++ .+ .....++.+++..+...|......- ....+ T Consensus 195 ~-~ia~~~~---~a~i~G~G~~~~qP~Gil~~~~~~~~~~~--~~----~~~~~~t~~~~~~~~~~l~~~~~~~-~~~~~ 263 (395) T protein:vir:95 195 E-AISVALE---SAIINGGGAAKTQPVGLMKDVNTNSGAVT--DK----ASSGTLTFADADTTILELNDVLKNL-SVDEK 263 (395) T ss_pred H-HHHHHHh---hheeeccCCCCcCceeeeecccccccccc--cc----cccchhhhhhhHhhHHHHHHHHHhh-ccccc Confidence 3 3333333 4667776542 22222221111 00 1123345555554444443322210 00001 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCc Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGAT 308 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~ 308 (405) +... ..+.. -..++|+....|++ +.+-|.|. .|+..++-|-.+.++++..|- + T Consensus 264 ~~~~---~~~~~-~~~~mn~~t~~~~~------g~~~~~~~----------~G~~~~~lg~g~~v~~~~~~p----~--- 316 (395) T protein:vir:95 264 GKEL---KIDGK-VALVVNPRDSWDVQ------ARYTYLTA----------NGGFVTVLPYNVTIITSEFVP----E--- 316 (395) T ss_pred cchh---hhcCc-eEEEEcchhhhhcC------CcceeccC----------CCcceeccCCcceEEEcCCCC----C--- Confidence 1100 00111 12357765544443 34556552 233333322234555554332 0 Q ss_pred ccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCcc---chhhhHHHHHH Q lcl|NC_020862. 309 ATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPY---GKVGFSSIKFF 385 (405) Q Consensus 309 ~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPl---gQrg~~gwK~~ 385 (405) + + ++||+=++-.++.+ +.+++-+. .+.+ +|.+|.++ ++ T Consensus 317 ----------------~--~----i~fgdfs~y~i~~r-------~~~~i~~~---------~~~~~~~d~~~f~~~-~r 357 (395) T protein:vir:95 317 ----------------G--K----LVAFVTDRYNAVRG-------GGLTVKKF---------DQTLALEDAVLFTAK-TF 357 (395) T ss_pred ----------------C--c----EEEEecccEEEEEe-------cceEEEec---------cchhhhCCcEEEEEE-EE Confidence 0 1 46776555455543 12333222 2233 34444443 46 Q ss_pred HHHhhccccceEEEEEecCC Q lcl|NC_020862. 386 YGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 386 ~~~~iL~~~~marie~~a~~ 405 (405) +.+++.+++=++++++...| T Consensus 358 ~dg~~~~~~A~~~l~i~~~~ 377 (395) T protein:vir:95 358 AYGQPDDNKASAVYDLKVAS 377 (395) T ss_pred ECCEEeccccEEEEEeeccC Confidence 68899999988999888777 No 157 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=88.76 E-value=0.03 Score=28.91 Aligned_cols=292 Identities=11% Similarity=0.017 Sum_probs=126.9 Q ss_pred CCccccCcCCCccccccc-ceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGP-QFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) |-..++.....+.++.+. -+.+.-+ +|.+...++.-.+-++|.+.+.-++....|-..-.. . ....|.+-.| T Consensus 4 ~~~~~~~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g-----~-~~~~~~~~~~ 76 (314) T protein:vir:41 4 LNKPFQITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLG-----V-ELEPGRNTSG 76 (314) T ss_pred hhhHHHhhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccC-----c-cccccccccc Confidence 555555554444333322 2333333 355544455667888887654433332333211110 0 0111111100 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc-cchHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD-SDLYGHLSREMLRG 158 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d-~~l~~~~~~ell~~ 158 (405) +-++++...|+ +..++-..+++..++++|++++..... .++.+.+..++.+ T Consensus 77 -------------~~~~~~~~~~t--------------f~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae- 128 (314) T protein:vir:41 77 -------------TKVAPTADEVT--------------VSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLAS- 128 (314) T ss_pred -------------CCccCCccccc--------------ccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHH- Confidence 11122222233 333456779999999999986554332 3677776666654 Q ss_pred HhhHHHHHHHHHHhccCceE--------EecCC---CccceeeecccccccCCceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_020862. 159 ANEITEDLLQADILASADVK--------VFTGA---ATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTII 227 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v--------~yag~---ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii 227 (405) +.+..|.. -+++|.+.. .-.|= +++.++... ......+.+.|.++...|+.-..+ T Consensus 129 ~~g~~~~~---~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~-----~~~~~~~~~~~~~l~~sl~~~yr~------ 194 (314) T protein:vir:41 129 GVTYDLEC---FFLHADSSLTTGRELYRINDGWMKLAGNQYTDAE-----PEDENWPLNLFDGMMDELDTRYLQ------ 194 (314) T ss_pred HHHHHHHH---HhhccccCCcCcccchhcchhhhhhcccceeecC-----ccccccHHHHHHHHHHhcCchhhc------ Confidence 33333332 334553321 11110 111111111 112346778889999999763221 Q ss_pred ccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCC Q lcl|NC_020862. 228 KGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGA 307 (405) Q Consensus 228 ~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa 307 (405) +++ -.+-++|++....+|.+.+.-+.+.|-+.-.=|... .+.| +..+.+|.|. =.++++ T Consensus 195 ~~~----------~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~--------~l~G--~PV~~~~~~~-~~~~~~ 253 (314) T protein:vir:41 195 LKP----------RMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGL--------QYDG--IPIQYVPALD-ALGDDK 253 (314) T ss_pred CCC----------ceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCc--------eecc--eeeEeccccc-ccCCCC Confidence 001 267889999999999998877778777664333333 3433 4666666552 233333 Q ss_pred cccCC-CcccccccccCCcceeeeEEEEEccc-----cceeecceeccCCCCCCceEEEecCCCCCCC Q lcl|NC_020862. 308 TATAA-NRGYQVSDVAGTDKYDIAPLLVVGDQ-----AFATIGLQGMSGKGKSKFRIIVKKPGEATAD 369 (405) Q Consensus 308 ~~~~t-~~~~~~~~~~g~~~~DVYp~lV~G~~-----Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad 369 (405) ..... ....-+....-.-++.++.-.=-++- .+..+.++=.. .-...++++. .++ T Consensus 254 ~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~----aa~~~~~~~~---~~~ 314 (314) T protein:vir:41 254 ARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDEN----AAVAAVIDMS---SGG 314 (314) T ss_pred ceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcC----cEEEEEeecc---CCC Confidence 22221 11110000000001111110000011 11122222111 1233344433 222 No 158 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=88.17 E-value=0.034 Score=28.64 Aligned_cols=298 Identities=15% Similarity=0.095 Sum_probs=135.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc----cccC--cCCCCEEEEEecccCCCCC-Ccccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN----KQMP--KHFGKELKVFYYVPLLDDL-NVNDQ 73 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~----~~mP--Kn~GktIkfrry~pl~~~~-t~l~e 73 (405) ++|-|..-+.+.- ++ +|.. ++....| ...+...|++. ..|| +-.|....+.|=.-||.+. -.+-+ T Consensus 16 ~~~~~p~l~m~al-TL-aea~--~l~~d~~----~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~ 87 (330) T protein:vir:94 16 LTHQFPELKMPTV-TL-AESA--KLSQDHL----VSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGG 87 (330) T ss_pred hhccccccchhhh-hh-hHHh--hcCchhh----HHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeeccc Confidence 3333333222211 11 1110 0000111 11122222221 1122 1233333333322233211 11233 Q ss_pred CCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhc-cchHHHHH Q lcl|NC_020862. 74 GLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTD-SDLYGHLS 152 (405) Q Consensus 74 Gvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d-~~l~~~~~ 152 (405) |++|+. --|+.+++.++.-++.+.++.....++..+ .+...+.. T Consensus 88 ~~~~~~-----------------------------------~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~ 132 (330) T protein:vir:94 88 TITAKN-----------------------------------PATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQV 132 (330) T ss_pred cccccC-----------------------------------cceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHH Confidence 333321 125667788899999999999887777664 23444445 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEecCCC---ccceeeecccccccCCceecHHHHHHHHHHHHhcc-Cccccceec Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVFTGAA---TSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNY-TPKKTTIIK 228 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~yag~a---ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nr-Apk~T~ii~ 228 (405) ++..++..+..|+.+..- .+++ ..+.|=. .....+..++.. ..+|+++|..+......-+ .| T Consensus 133 ~~~ieal~~~~e~~linG--Ds~~-~~F~GL~~~~~~~q~i~tg~~g----g~~T~d~LDeLl~~v~~~~g~~------- 198 (330) T protein:vir:94 133 ASKAKSIGRQYQASMITG--DGTG-NSFQGMMGLVAASQTISAGANG----GTLTFELLDQLLDLVKDKDGQV------- 198 (330) T ss_pred HHHHHHHHHHHHHHhhcc--CCCC-ccccchhhcCCcccEEecCCCC----CCCCHHHHHHHHHHhcCCCCCC------- Confidence 666667777666666431 0111 1222211 111122222111 3488999888776653322 22 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEe--cCCcEEEEeCcchhhhhcCC Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAI--PGAHLRIVVVPQMMHYAGAG 306 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi--~g~n~Rfv~~p~~~~~~~aG 306 (405) -+-+++.-...-|+++.+-.+ .|+-..+ .....|+- ....+-++.+..... +++ T Consensus 199 -------------~~~l~n~a~~r~I~a~~R~~~--------~~~v~~~-~~~~~G~~v~~~~GvPi~~~d~ip~--~~~ 254 (330) T protein:vir:94 199 -------------DYLMSSFAMRRKYFSLLRALG--------GAAIGEV-MTLPSGRQIPTYRGVPWFVNDFIPS--NMT 254 (330) T ss_pred -------------cEEEechhHHHHHHHHHHhcc--------CCCCCCc-ccccCCCEEeeeCCeEEEecccccC--CCC Confidence 133445555666776654222 2332222 12233431 001123333322221 111 Q ss_pred CcccCCCcccccccccCCcceeeeEEEEEccc--cceeecceeccCCCCCCceEEEecCCCCCCCCCCccc-hhhhHHHH Q lcl|NC_020862. 307 ATATAANRGYQVSDVAGTDKYDIAPLLVVGDQ--AFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYG-KVGFSSIK 383 (405) Q Consensus 307 a~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~--Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlg-Qrg~~gwK 383 (405) + .+.++.-.||.+- +|.+ --|.+||+.-+ . --+-|.-+|+- |.-+ -|+.+ + T Consensus 255 ~-------------~~~~~ttsIyav~-~G~~~~~qgV~Gl~~~g---~--~glsVr~~G~~-----~~k~v~~~~v--~ 308 (330) T protein:vir:94 255 Q-------------GTATNATAIFAGT-FDDGSNKYGIAGLTARG---S--AGLRVQNVGAK-----ENADETITRV--K 308 (330) T ss_pred c-------------ccCCCceeEEEEe-ecccccccceEeecCCC---C--CcceeeeCCCc-----cccceeeEEE--E Confidence 1 0111122588665 5644 35889997421 1 12556777731 2211 22333 4 Q ss_pred HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 384 FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 384 ~~~~~~iL~~~~marie~~a~~ 405 (405) ||+++.+|++.-++++|-+.+= T Consensus 309 ~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 309 MYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred EeeeeEEechhheeeeccccCC Confidence 7999999999999999999999 No 159 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=86.89 E-value=0.042 Score=28.11 Aligned_cols=290 Identities=10% Similarity=-0.037 Sum_probs=119.7 Q ss_pred CCccccC-cCCCccccccc-ceeehhhhhHHHHHhhhhhhhhcccccc-ccCcCCCCEEEEEecccCCCCCCccccCCCc Q lcl|NC_020862. 1 MPHIYND-PAAGDASTVGP-QFNVHYWDRKSLIDEAEEMFFSPLADNK-QMPKHFGKELKVFYYVPLLDDLNVNDQGLDA 77 (405) Q Consensus 1 ~~~~y~~-~~~t~~~~v~~-qm~t~y~~~k~L~~a~p~lv~~~fA~~~-~mPKn~GktIkfrry~pl~~~~t~l~eGvtp 77 (405) +-+..++ ..+.+.++.+. -+.+..++ +.+.+..+.-.+.+.|.+. +|=.+ +.++- -+... .....|.+. T Consensus 8 ~~~~~~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~---~~~i~---~~g~~-~~~~~g~~~ 79 (315) T protein:vir:41 8 RGGKPFEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSY---EKDIS---RLSLV-LDVGPGRDE 79 (315) T ss_pred hcCChhhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccc---ccccc---ccccC-ccccccccc Confidence 1111111 12222222211 23333433 4444444455677777753 33111 11110 01000 011112222 Q ss_pred ccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhh-ccchHHHHHHHHH Q lcl|NC_020862. 78 TGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDT-DSDLYGHLSREML 156 (405) Q Consensus 78 ~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~-d~~l~~~~~~ell 156 (405) .|. +.+++...-++..++-..+++..+.++|+++++... ..++.+.|..++. T Consensus 80 ~~~---------------------------~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a 132 (315) T protein:vir:41 80 TGQ---------------------------KLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLG 132 (315) T ss_pred ccC---------------------------cCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHH Confidence 211 112222222344456788999999999999766433 2467777776666 Q ss_pred HHHhhHHHHHHHHHHhccCc------------eEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCcccc Q lcl|NC_020862. 157 RGANEITEDLLQADILASAD------------VKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKT 224 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~------------~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T 224 (405) +.-+.--+ ..+++|.+ ...+++.. +........ ...++.+.|.+++..|+...-.- T Consensus 133 ~~~a~~~~----~~~~nGdg~s~~p~~~~~~G~l~~a~~~---~~~~~~~~~---a~~~~~d~l~~l~~sl~~~yr~~-- 200 (315) T protein:vir:41 133 EGISYVLE----KYYLHGDTSSSDPLLRMSDGWLKLASEK---LTESDVDPE---AEDWPMNLFDTMIESLPTPYRNN-- 200 (315) T ss_pred HHHHHHHH----HHhhccCCcCcCccccccccceeccccc---ccccccccc---cccccHHHHHHHHHhcChHHhhc-- Confidence 53333222 23456522 22233222 111111111 12466788889998887632110 Q ss_pred ceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhc Q lcl|NC_020862. 225 TIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAG 304 (405) Q Consensus 225 ~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~ 304 (405) + ..-+-+++.+....||.+++..+++.|-|...=|...++ .| +.++.+|.|-. .+ T Consensus 201 ----~----------~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl--------~G--~PV~~~~~m~~-~~ 255 (315) T protein:vir:41 201 ----L----------PNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILY--------DG--RPVQYVPALEA-LN 255 (315) T ss_pred ----C----------CceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCcee--------cc--cceEecccccc-cC Confidence 0 025678999999999999999999999888655555544 23 45566665532 11 Q ss_pred CCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCC-CceEEE-ecCCCCCCCCCCccchhhhHHH Q lcl|NC_020862. 305 AGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKS-KFRIIV-KKPGEATADRNDPYGKVGFSSI 382 (405) Q Consensus 305 aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~-~~~~iv-k~pG~~tad~~DPlgQrg~~gw 382 (405) +++.....+. .+-| ++++.. .+.++= ..+.+. .+.+.. +..|-. ..| T Consensus 256 ~~~~~ilf~d------------~~nl-~~~~~~----~i~i~~-~~~a~~~~~~~~~~~r~d~~-------------~~~ 304 (315) T protein:vir:41 256 DGKSRALFVV------------PTQL-VYGFWR----NIKVVP-DYDAEMRLTKYVASLRTDNH-------------YED 304 (315) T ss_pred CCCccEEEec------------ccce-EEEecc----ccEEEe-eecCCCCceEEEEEEEecee-------------EEe Confidence 1111111000 0000 001110 011100 000011 111111 111100 000 Q ss_pred HHHHHHhhccc Q lcl|NC_020862. 383 KFFYGFIKLRG 393 (405) Q Consensus 383 K~~~~~~iL~~ 393 (405) .=+=+.++++- T Consensus 305 ~~~~a~~~~~v 315 (315) T protein:vir:41 305 EEGAVSATITV 315 (315) T ss_pred ccceeEeeeeC Confidence 00001111111 No 160 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=86.19 E-value=0.047 Score=27.85 Aligned_cols=286 Identities=16% Similarity=0.084 Sum_probs=128.2 Q ss_pred CCcc-ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccc----cC--cCCCCEEEEEecccCCCCCCc--- Q lcl|NC_020862. 1 MPHI-YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQ----MP--KHFGKELKVFYYVPLLDDLNV--- 70 (405) Q Consensus 1 ~~~~-y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~----mP--Kn~GktIkfrry~pl~~~~t~--- 70 (405) ||-+ -.+-.--.+ +.....|...|++.-+ || .=.|....+.|=.-+++..-. T Consensus 1 mpaltLaea~k~~~------------------d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~ 62 (310) T protein:vir:97 1 MASVTLAESAKLAQ------------------DELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVG 62 (310) T ss_pred CcccchHHHhhcCc------------------chHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccc Confidence 5522 111000000 0111122222222111 11 122445444444333332211 Q ss_pred ---cccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhcc-- Q lcl|NC_020862. 71 ---NDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDS-- 145 (405) Q Consensus 71 ---l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~-- 145 (405) +.+|+.+ ..+|+..++.+|.=.|...++-....+++..+ T Consensus 63 ~~~~~~g~~~------------------------------------~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~ 106 (310) T protein:vir:97 63 TTFSGAGAGK------------------------------------AAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGN 106 (310) T ss_pred ccccCCCccc------------------------------------cccccceeeeeeeeeeehhhhhhHHHhhhcCChH Confidence 1122222 22344556666766666666554333433111 Q ss_pred chHHHHHHHHHHHHhhHHHHHHHHHHhccCc-eEEecCCC---ccceeeecccccccCCceecHHHHHHHHHHHH-hccC Q lcl|NC_020862. 146 DLYGHLSREMLRGANEITEDLLQADILASAD-VKVFTGAA---TSMVTMTGEAADAEDDGLITLKDLKRLSITLT-DNYT 220 (405) Q Consensus 146 ~l~~~~~~ell~~~~~~ted~l~~~ilag~~-~v~yag~a---ts~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk-~nrA 220 (405) +...+.-+...+...+..|+++. +|-. +.-+.|=. ++.-.+..... -..+|+++|..+....- ..+. T Consensus 107 dq~~~Ql~~~iea~~~~~e~~lI----NGD~a~n~F~GL~~~~~~~q~i~~~~~----gg~~t~d~LDeLl~~v~~~~g~ 178 (310) T protein:vir:97 107 DQTAVQIASKAKSAGRKYQDQLI----NGNGAGNEFAGLIQLCASGQKATTGAT----GSAISFAILDELMDLVVDKDGQ 178 (310) T ss_pred HHHHHHHHHHHHHHHHHHHHHhh----ccccCCCcccchhhcCCccceeecCCC----CCCCCHHHHHHHHHHHhcCCCC Confidence 22222234444555555555554 3211 11121110 11111211111 12478999988876653 3344 Q ss_pred ccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehh--hcCCcccccCcceeEecCCcEEEEeCcc Q lcl|NC_020862. 221 PKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVE--KYADAATIMNGEIGAIPGAHLRIVVVPQ 298 (405) Q Consensus 221 pk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~--~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~ 298 (405) | -+-++||-+..-|+++.--.+.-+-.|+. .+|.+=. +. ..+-++.+.. T Consensus 179 p--------------------~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~-------~~--~GiPi~~~d~ 229 (310) T protein:vir:97 179 V--------------------DYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVP-------AY--SGTPIFRNDY 229 (310) T ss_pred C--------------------CEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEe-------ee--CCeEEEEeCc Confidence 4 24789998888888766533333433321 1222211 11 1134444432 Q ss_pred hhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEcccc--ceeecceeccCCCCCCceEEEecCCCCCCCCCCccch Q lcl|NC_020862. 299 MMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQA--FATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGK 376 (405) Q Consensus 299 ~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~A--fg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQ 376 (405) +..=...| ++++.-.||. +-+|.++ .|.+||.. .+. --+-|.-+|+- .|.--- T Consensus 230 ip~~~~~~---------------~~~gtTsIya-~r~Ge~~~~~Gv~Gl~~---~~~--~glsVr~~G~~----~~~~v~ 284 (310) T protein:vir:97 230 IPTNQTKG---------------GTTGCTTIFA-GTLDDGSRTHGIAGLTA---TQA--AGIQVVDVGES----EDSDEH 284 (310) T ss_pred cCCCcccc---------------ccCCceeEEE-EeeCccccccceecccc---CCc--cceeEEeCCcc----cCCcce Confidence 22100000 1111224775 5689886 68888863 111 12457777631 344444 Q ss_pred hhhHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 377 VGFSSIKFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 377 rg~~gwK~~~~~~iL~~~~marie~~a~ 404 (405) +..+ +||+++.+|++.=++++|-+.- T Consensus 285 ~~~V--~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 285 IWRV--KWYCGLALFSEKGLACADGITN 310 (310) T ss_pred eEEE--EEeeeEEEecccceeeeccccC Confidence 4555 4699999999999999999998 No 161 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=85.75 E-value=0.051 Score=27.69 Aligned_cols=285 Identities=15% Similarity=0.121 Sum_probs=133.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccc---cccCcCCCCEEEEEecccC-CCCCCccccCCC Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADN---KQMPKHFGKELKVFYYVPL-LDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~---~~mPKn~GktIkfrry~pl-~~~~t~l~eGvt 76 (405) |. ..| .-+ -|+..++....+.++++.+-.. ..+--+.|++||+.+-.-- ..|-+-...|-+ T Consensus 1 MA-~~n-------------~a~-~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~ 65 (299) T protein:vir:79 1 MA-ALN-------------YAK-EYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVA 65 (299) T ss_pred Cc-cch-------------hHH-HHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCccc Confidence 33 111 111 3666777777778888875332 2233366899998854311 011111111111 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchH--HHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLY--GHLSRE 154 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~--~~~~~e 154 (405) + +++ +....+-+|.|-=.|...=| .+|-|+-...+ ..+..+ T Consensus 66 ~-----------------g~~-------------------~~~~~t~~ldqdr~~~f~vD-~~Dvdet~~~~~~a~v~~~ 108 (299) T protein:vir:79 66 Q-----------------RNY-------------------DNAWEPKVLTNQRKWSTLVH-PADINQTNYVASIGNITKV 108 (299) T ss_pred c-----------------ccc-------------------CcceeEEEeeccccceeccc-hhhHHHHhhhhHHHHHHHH Confidence 1 011 11122334444322222222 24444432222 222222 Q ss_pred HHHHHhhHHHHHH-HHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_020862. 155 MLRGANEITEDLL-QADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMT 233 (405) Q Consensus 155 ll~~~~~~ted~l-~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~ 233 (405) -...+..-..|.. ...|-+++..+ |...+..++++. =-++.|+.+...|++++.|. T Consensus 109 ~~~~~v~pEiDay~~skl~~~a~~~---g~~~~~~~~T~~---------n~y~~i~~~~~~lde~~vP~----------- 165 (299) T protein:vir:79 109 YNEEQKFPEMDAYCISKIYADWTAL---GNTADTTVLTTT---------NVLEVFDKLMEKMTEARVPE----------- 165 (299) T ss_pred HHHHHhhhHhhHHHHHHHHHhhhhc---CCcccccccCHH---------HHHHHHHHHHHHHHhcCCCC----------- Confidence 2211111111222 22222333211 211121212211 12688999999999999983 Q ss_pred CcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcc--hhhhh--cCCCcc Q lcl|NC_020862. 234 DTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQ--MMHYA--GAGATA 309 (405) Q Consensus 234 gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~--~~~~~--~aGa~~ 309 (405) ..+|+||.|+.-..|+. .+.|.......+.....++-||+|.| |.++++|. |.-=. -.|... T Consensus 166 ------~~rvl~vtp~~~~~L~~------~~~f~k~~~~~~~~~~~~g~Vg~idG--~~Ii~Vps~r~~t~~~~~~G~~~ 231 (299) T protein:vir:79 166 ------NGRILYVTPVVNTLIKN------AKEIQRTVNIKDAGTSLNRQTTDIDT--VKIIKVPSNLMKTAYDFTTGWKV 231 (299) T ss_pred ------CCeEEEeCHHHHHHHhh------chhhhcccccccccceeeeeeeeecc--eEEEEechhhcCccceeccCccc Confidence 35999999999888864 47898877788777778999999966 77777765 33100 012211 Q ss_pred cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHh Q lcl|NC_020862. 310 TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFI 389 (405) Q Consensus 310 ~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~ 389 (405) .++ +| -..++|+ |.-+++.+.... .++ +-.||.. .++|=|-| .+.||-.. T Consensus 232 -----------~~~-ak--~in~ii~--~~~a~~~~~K~~-----~~~--~~~P~~~--~~~~~~~~-----~r~y~d~~ 281 (299) T protein:vir:79 232 -----------GAG-AK--QIFMSLV--HPSAIITPVSYQ-----FSK--LDEPTAV--TEGKYFYF-----EESFEDVF 281 (299) T ss_pred -----------cCc-cc--ccceEEE--cCCeeeeeEeee-----eEE--eecCCCC--Cccceeee-----eeeeeeee Confidence 111 12 2455555 334455554211 222 3468754 44443311 25555555 Q ss_pred hccccceEEEEEecCC Q lcl|NC_020862. 390 KLRGERIAVAYSVIPE 405 (405) Q Consensus 390 iL~~~~marie~~a~~ 405 (405) +|...--. |.+..-. T Consensus 282 v~~nk~~~-i~~~~~~ 296 (299) T protein:vir:79 282 ILNKKADA-IQFVVEG 296 (299) T ss_pred eeccccCe-EEEEeee Confidence 55443322 2222222 No 162 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=84.82 E-value=0.057 Score=27.39 Aligned_cols=290 Identities=13% Similarity=0.139 Sum_probs=151.5 Q ss_pred CCccccCcCCCccccc-------ccc-eeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTV-------GPQ-FNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVND 72 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v-------~~q-m~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~ 72 (405) |- .+.++-. ..+ +++ ..+.--+--+++-|+|....+++-|.-. -|+|+-+.|-++.-+ . T Consensus 59 m~---G~~p~~e-V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~--L~~Grsm~F~~~g~~-------R 125 (393) T protein:vir:79 59 ME---GETPTNE-VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIR--LKSGQSMIFPSIGIM-------R 125 (393) T ss_pred hc---CCCchhh-eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHh--hhcCcceeccchhee-------e Confidence 21 2221110 111 111 1111222335555666655566655544 377888877765522 1 Q ss_pred cCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 73 QGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 73 eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) +===+.|+++-+-+| |- +|..-|+.+..+||-.++|||+ ++-|+-=|+++.+- T Consensus 126 a~~IgEGgE~~~~sl-----d~---------------------~T~dsv~~~~gK~G~~Ia~SqE-mIsDSg~Dvin~~l 178 (393) T protein:vir:79 126 AYDVAEGQEIPEDSI-----DW---------------------QTHESPEIRVGKSGIRLRFTDE-MISDSQWDLMSMMI 178 (393) T ss_pred eccccccccccccch-----hh---------------------hcCCceeEEechhhhhhhhHHH-HhhcchHHHHHHHH Confidence 111234466644433 11 2444677888999999999998 56666667788878 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) +.+.+.++.-.+....+.+.+-+.+++=+=.++-....+|.+-..+.|+.+|++||.+...+...++=- + T Consensus 179 ~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt-------~--- 248 (393) T protein:vir:79 179 KQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYT-------P--- 248 (393) T ss_pred HHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCC-------c--- Confidence 888888888778778888888777777765555666777777777889999999999998887776642 1 Q ss_pred cCcccccceEEEEEcccchHHHH--HHhcccCCCcceehhhcCCcccccCcce--------eEecCCcEEEEeCcchhhh Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYIT--ELVDSLGNPAFVPVEKYADAATIMNGEI--------GAIPGAHLRIVVVPQMMHY 302 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir--~l~d~~~~p~fi~v~~Ya~~~~i~~gEI--------Gsi~g~n~Rfv~~p~~~~~ 302 (405) =+-+.||-|-.-+- ++-+..---+|- .|+.+. ++-|- |++++ ||-++.+|..- + T Consensus 249 ---------svi~MHPLAWnv~AKna~me~~~~na~g---N~~~~~--~~ts~algp~~i~~~~~~-nlnv~~sPfvp-~ 312 (393) T protein:vir:79 249 ---------SDLMMHPLAWTVFAKNELMGSLQANPYG---NYPAKG--APSSMALGPDSIQGRLPF-NFNVNLSPFIP-L 312 (393) T ss_pred ---------ceEEEcCchhhhhhhhhhhcceeecccc---ccCccc--cchhhhhchhhhcccccc-ceeEEEecccc-c Confidence 36789998877662 111111111122 333332 23221 34432 78999999543 3 Q ss_pred hcCCCcccCCCcccccccccCCcceeeeEE-------EEEcc--------------------ccceeecceeccCCCCCC Q lcl|NC_020862. 303 AGAGATATAANRGYQVSDVAGTDKYDIAPL-------LVVGD--------------------QAFATIGLQGMSGKGKSK 355 (405) Q Consensus 303 ~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~-------lV~G~--------------------~Afg~i~l~g~~~~g~~~ 355 (405) --. +.++|+|.+ |.|-+ |-||.==| ..||. T Consensus 313 d~k------------------~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvL----n~gka- 369 (393) T protein:vir:79 313 DKK------------------SRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGIL----NEGKA- 369 (393) T ss_pred ccc------------------cceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeee----eCCce- Confidence 211 234455543 22211 22222111 11111 Q ss_pred ceEEEecCCCCCCCCCCccchhhhHH Q lcl|NC_020862. 356 FRIIVKKPGEATADRNDPYGKVGFSS 381 (405) Q Consensus 356 ~~~ivk~pG~~tad~~DPlgQrg~~g 381 (405) ....|+--. +-+=.||+--.-.-. T Consensus 370 -iavakNI~~-~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 370 -IAVAKNISM-DKSYAEPMLIKNVGN 393 (393) T ss_pred -EEEEeccee-ecccccchhhhccCC Confidence 111111000 011223321110000 No 163 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=79.39 E-value=0.096 Score=26.17 Aligned_cols=293 Identities=10% Similarity=-0.061 Sum_probs=118.9 Q ss_pred CC----ccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCC Q lcl|NC_020862. 1 MP----HIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGL 75 (405) Q Consensus 1 ~~----~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGv 75 (405) ++ ..|++- ..+.++.-+.-+-+ -+.++.+++....=.+-+++.+.++. |. +++-+ .+ ..+-..- T Consensus 67 lt~ee~~~~~~~~~~~~~~~gg~~vP~-~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~~~~----~~-~~~~a~w- 135 (377) T protein:vir:98 67 LTAEEIKFFNDIDKNVGGKDKFKLLPE-ETMVQVFDDLVAEHPLLKVINFKNTS---LR-LKALT----AE-TSGTAVW- 135 (377) T ss_pred cCHHHHHHHHHHHhccCCCCCccccCH-HHHHHHHHHHHHhhhhhhheeeEecC---cc-eEEEE----ec-CCcceeE- Confidence 00 111111 01111111111111 22344444444433444556665553 33 22211 11 1110000 Q ss_pred CcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHH Q lcl|NC_020862. 76 DATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREM 155 (405) Q Consensus 76 tp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~el 155 (405) -.|.+.+.....-++..++-+.++++.+..+|++++ -|+.-++.+.+..++ T Consensus 136 ----------------------------~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL-~ds~~~ie~~i~~~l 186 (377) T protein:vir:98 136 ----------------------------GDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDAL-KFGPKWIKQFITEQL 186 (377) T ss_pred ----------------------------eecccccCcccCccceeEeecceeEEeeecccHHhh-hccHhHHHHHHHHHH Confidence 011122222233356678899999999999999854 445557777766666 Q ss_pred HHHHhhHHHHHHHHHHhccCceEEecCC------CccceeeecccccccCCcee-cHHHHHHHHHHHHhccCccccceec Q lcl|NC_020862. 156 LRGANEITEDLLQADILASADVKVFTGA------ATSMVTMTGEAADAEDDGLI-TLKDLKRLSITLTDNYTPKKTTIIK 228 (405) Q Consensus 156 l~~~~~~ted~l~~~ilag~~~v~yag~------ats~~~~t~~~~~~~~n~~i-t~~~lr~~~~~Lk~nrApk~T~ii~ 228 (405) .+ +....++ ..+++|.++..=-|= .+.... ++..+ .+.. ..+.+-++.-.|+....+ T Consensus 187 a~-~~a~~~~---~a~i~G~G~~qP~Gil~~~~~~~~~~~-~~~~~----~~~~~~~~~~~~l~~~~~~~~~~------- 250 (377) T protein:vir:98 187 KE-AIAVALE---LAIVKGDGLLQPVGLLKDLSQPTVDQS-TGRDI----TTYKTDKEAIADLSDLTPDNAPK------- 250 (377) T ss_pred HH-HHHHHHh---hceEeccCCCcceeeeecccccccccc-ccccc----ccccchhhhHhhhhhhchhHHHH------- Confidence 53 3333333 456777664321111 111110 00000 1111 112233332222222111 Q ss_pred cccccCcccccceEEEEEcccchHHHHHHhcccCCCcceeh-hhcCCcccc-----cCcceeEecCCcEEEEeCcchhhh Q lcl|NC_020862. 229 GSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPV-EKYADAATI-----MNGEIGAIPGAHLRIVVVPQMMHY 302 (405) Q Consensus 229 gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v-~~Ya~~~~i-----~~gEIGsi~g~n~Rfv~~p~~~~~ 302 (405) ..+-+++......+|.|+|..+++-|+.= ..|-.-.+. -+|....+-|-.+++|+++.|. T Consensus 251 ------------~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p-- 316 (377) T protein:vir:98 251 ------------KLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVE-- 316 (377) T ss_pred ------------HHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCC-- Confidence 01122233333444445554444444310 001000000 1334334434445677665332 Q ss_pred hcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHH Q lcl|NC_020862. 303 AGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSI 382 (405) Q Consensus 303 ~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gw 382 (405) + + + ++||.-++-.+..+ +.+++-+... ..=-.+|++|.++ T Consensus 317 --~-------------------~--~----i~fgdf~~Y~i~~r-------~~~~i~~~~~------~~~~~d~~~f~~~ 356 (377) T protein:vir:98 317 --T-------------------G--K----AIAFVANRYDAFMA-------TASTIEEYDQ------TFAMEDLQLYLTK 356 (377) T ss_pred --c-------------------c--c----EEEEEecceeEEee-------cceEEEeech------hhhhcCceEEEEE Confidence 0 1 1 35666555455543 1233322211 0001244444433 Q ss_pred HHHHHHhhccccceEEEEEecC Q lcl|NC_020862. 383 KFFYGFIKLRGERIAVAYSVIP 404 (405) Q Consensus 383 K~~~~~~iL~~~~marie~~a~ 404 (405) +...+++.+++=++.+....= T Consensus 357 -~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 357 -NYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred -EEEcCEEeccCcEEEEEEecC Confidence 456778888888888888888 No 164 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=78.43 E-value=0.11 Score=25.74 Aligned_cols=288 Identities=10% Similarity=-0.040 Sum_probs=134.6 Q ss_pred CCc----cccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPH----IYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~----~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) ++. .||+-..++.+.-+--+-+ .+.++.+.+....-.+.+++.+.+++ |. .++-+. + ..+...-+ T Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~i~~~----~-~~~~a~w~- 133 (381) T protein:vir:95 65 LSANQRSFFMDINKNVNYKEEKLLPE-ETIDRIFEDLTTNHPLLADLGIKNAG---LR-LKFLKS----E-TSGVAVWG- 133 (381) T ss_pred ccHHHHHHHHHHhcccCCCCceecCH-HHHHHHHHHHHhhccceeheeeEecC---cc-eEEEEe----c-CCcceeee- Confidence 111 2333222222222212222 23344444444444566778877774 32 222211 1 11111111 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) .|.+.+......++..|+-+.++++.++.+|+++ +-|+.-++...|..++. T Consensus 134 ----------------------------~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L~Ds~~~ie~~i~~~la 184 (381) T protein:vir:95 134 ----------------------------KIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-NDFGPAWIERFVRVQIE 184 (381) T ss_pred ----------------------------cccccccccccccceeeeecceeEEeechhhHHH-hhcCHHHHHHHHHHHHH Confidence 0111122222346667789999999999999985 45555577777666665 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCc---cceeeecc--------cccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAAT---SMVTMTGE--------AADAEDDGLITLKDLKRLSITLTDNYTPKKTT 225 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~at---s~~~~t~~--------~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ 225 (405) + +....++ ..+++|.++..=-|--+ ..+..+++ ......+....++.|..+.+.|..+...+ T Consensus 185 ~-~~a~~~~---~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~--- 257 (381) T protein:vir:95 185 E-AFAVALE---TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGK--- 257 (381) T ss_pred H-HHHHHhh---heeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccc--- Confidence 4 3333343 34677765422111100 00111110 01111223344566777777776553321 Q ss_pred eeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 226 IIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 226 ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) .+......+.+||+....+|+.+.+..+ ++...+ ...+-...+++++.|. + T Consensus 258 ---------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v------~~l~~g~~vv~s~~~p----~ 308 (381) T protein:vir:95 258 ---------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV------TALPFNLNVIESTVQE----A 308 (381) T ss_pred ---------cccccCceEEEEccccHHhhccccccCC----------CCCcee------ecCCCCceEEecCCCC----c Confidence 0111224677899999889986653211 111111 0111234556665332 0 Q ss_pred CCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH-- Q lcl|NC_020862. 306 GATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK-- 383 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK-- 383 (405) + | ++||.-++-.+..++ .+++-+. ...+-..+-.+++ T Consensus 309 -------------------~--~----iifgDfs~Y~i~~r~-------~~~i~~~---------~~~~~~~d~~~f~a~ 347 (381) T protein:vir:95 309 -------------------G--K----VLTYVKGLYDGYLAG-------GINVQKF---------KETLALDDMDLYTAK 347 (381) T ss_pred -------------------C--c----EEEEecccEEEEEec-------ccEEEee---------chhHhhcCCeEEEEE Confidence 1 1 567776665665541 2332221 1223333333333 Q ss_pred HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 384 FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 384 ~~~~~~iL~~~~marie~~a~~ 405 (405) ..+.+++++++=++.++...=| T Consensus 348 ~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:95 348 QFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred EEEcCEEecCceEEEEEEEecC Confidence 5667788888888887776644 No 165 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=78.43 E-value=0.11 Score=25.74 Aligned_cols=288 Identities=10% Similarity=-0.040 Sum_probs=134.6 Q ss_pred CCc----cccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MPH----IYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~~----~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) ++. .||+-..++.+.-+--+-+ .+.++.+.+....-.+.+++.+.+++ |. .++-+. + ..+...-+ T Consensus 65 lt~~e~~~~~~~~~~~~~~gg~lvP~-~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~i~~~----~-~~~~a~w~- 133 (381) T protein:vir:10 65 LSANQRSFFMDINKNVNYKEEKLLPE-ETIDRIFEDLTTNHPLLADLGIKNAG---LR-LKFLKS----E-TSGVAVWG- 133 (381) T ss_pred ccHHHHHHHHHHhcccCCCCceecCH-HHHHHHHHHHHhhccceeheeeEecC---cc-eEEEEe----c-CCcceeee- Confidence 111 2333222222222212222 23344444444444566778877774 32 222211 1 11111111 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) .|.+.+......++..|+-+.++++.++.+|+++ +-|+.-++...|..++. T Consensus 134 ----------------------------~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L~Ds~~~ie~~i~~~la 184 (381) T protein:vir:10 134 ----------------------------KIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-NDFGPAWIERFVRVQIE 184 (381) T ss_pred ----------------------------cccccccccccccceeeeecceeEEeechhhHHH-hhcCHHHHHHHHHHHHH Confidence 0111122222346667789999999999999985 45555577777666665 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCc---cceeeecc--------cccccCCceecHHHHHHHHHHHHhccCccccc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAAT---SMVTMTGE--------AADAEDDGLITLKDLKRLSITLTDNYTPKKTT 225 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~at---s~~~~t~~--------~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ 225 (405) + +....++ ..+++|.++..=-|--+ ..+..+++ ......+....++.|..+.+.|..+...+ T Consensus 185 ~-~~a~~~~---~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~--- 257 (381) T protein:vir:10 185 E-AFAVALE---TAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGK--- 257 (381) T ss_pred H-HHHHHhh---heeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccc--- Confidence 4 3333343 34677765422111100 00111110 01111223344566777777776553321 Q ss_pred eeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcC Q lcl|NC_020862. 226 IIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGA 305 (405) Q Consensus 226 ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~a 305 (405) .+......+.+||+....+|+.+.+..+ ++...+ ...+-...+++++.|. + T Consensus 258 ---------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v------~~l~~g~~vv~s~~~p----~ 308 (381) T protein:vir:10 258 ---------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV------TALPFNLNVIESTVQE----A 308 (381) T ss_pred ---------cccccCceEEEEccccHHhhccccccCC----------CCCcee------ecCCCCceEEecCCCC----c Confidence 0111224677899999889986653211 111111 0111234556665332 0 Q ss_pred CCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH-- Q lcl|NC_020862. 306 GATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK-- 383 (405) Q Consensus 306 Ga~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK-- 383 (405) + | ++||.-++-.+..++ .+++-+. ...+-..+-.+++ T Consensus 309 -------------------~--~----iifgDfs~Y~i~~r~-------~~~i~~~---------~~~~~~~d~~~f~a~ 347 (381) T protein:vir:10 309 -------------------G--K----VLTYVKGLYDGYLAG-------GINVQKF---------KETLALDDMDLYTAK 347 (381) T ss_pred -------------------C--c----EEEEecccEEEEEec-------ccEEEee---------chhHhhcCCeEEEEE Confidence 1 1 567776665665541 2332221 1223333333333 Q ss_pred HHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 384 FFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 384 ~~~~~~iL~~~~marie~~a~~ 405 (405) ..+.+++++++=++.++...=| T Consensus 348 ~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 348 QFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred EEEcCEEecCceEEEEEEEecC Confidence 5667788888888887776644 No 166 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=78.24 E-value=0.12 Score=25.70 Aligned_cols=291 Identities=17% Similarity=0.216 Sum_probs=131.4 Q ss_pred CC--ccccCcCCCcccccccceeehhhhhHHHHHhhh--hhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MP--HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAE--EMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~--~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p--~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) |. ++..---..++++... +-.-- -.|.|+++-. +--|.+|.-...++.=+- .++.+...+++=. .. T Consensus 386 ~~~~~~~~~a~~htTSDFp~-IL~~~-~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~--~~~~~lg~~~~L~-~V----- 455 (693) T protein:vir:95 386 LNAPQMVGLAFTHTSSDFGL-ILLDV-ANKSVLAGWEEAEETFPLWTKSGILTDFKP--ARRVGLGEFSSLR-QV----- 455 (693) T ss_pred CCHHHHHHHHHhcCcchhHH-HHHHH-HHHHHHHHHHhhhhHHHHHhccCCCCcccc--cceeecCCCCChh-hc----- Confidence 11 1111000011222211 11111 1455555332 345666666666665432 2333333333211 22 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) |+|.+|..|.+ .|- ..+..|..||--+.+|.+.++= -|=+.+..+.+.++ T Consensus 456 ~E~gEyk~~t~-----------------~e~------------~e~~~l~tyG~~~~iTRqaiIN-DDLga~~~ip~~~g 505 (693) T protein:vir:95 456 REGAEYKYVTL-----------------GER------------GEQIILATYGELFSITRQAIIN-DDLQMLSDIPFKLG 505 (693) T ss_pred CCCCceeeeec-----------------CCc------------cceeehhhcCCeeeecHHhhhc-cchHHHHHHHHHHH Confidence 34445543322 221 1246799999999999986443 23345888777777 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCC---ccceee-ecccccccCCceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAA---TSMVTM-TGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRM 232 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~a---ts~~~~-t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~ 232 (405) +.+..+-.|+.+ .+|++ +-+.+-|.+ ..-.|+ ++. ...+|++.|-.+...+..++.... .-.|. T Consensus 506 ~aA~~~~~~~vy-~~L~~-Np~m~DGk~LFhadH~Nl~tga------~sals~~sl~~a~~am~~qk~~~~--~~~g~-- 573 (693) T protein:vir:95 506 QAAKATIGDLVY-AVLTG-NPAMSDGKTLFHADHSNLLTGA------ASALSIDSLSKAKTQMATQKAQVE--KGKGR-- 573 (693) T ss_pred HHHHHHHHHHHH-HHHhc-CccccCCcceeecccccccccc------ccccChHHHHHHHHHHHHhhcchh--ccCCc-- Confidence 655555555544 66653 333344433 122232 211 135888888888888877776411 00111 Q ss_pred cCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchh-----hhhcCCC Q lcl|NC_020862. 233 TDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMM-----HYAGAGA 307 (405) Q Consensus 233 ~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~-----~~~~aGa 307 (405) .--|.|.|+ +|+|+++-..+.+.. ...+|.. .+-.|.+--+.+ .+-.|++|++. .|.=+ + T Consensus 574 --~L~i~P~~l-lvP~~le~~a~~l~~----s~~~~~a------~~~~~~~NP~~~-~~~vi~~prL~~~s~~~Wyl~-a 638 (693) T protein:vir:95 574 --TLNIRPGFV-LTPVALEDKANQIIN----SESVPGA------DVNSGIVNPIRA-FAQVIGEPRLDDASATAWYMA-A 638 (693) T ss_pred --eeecccceE-EecchHHHHHHHHhc----ccccccc------ccccccccchhc-cccccccceecCCCCCceEEe-c Confidence 234566665 569999999988753 3334321 112233333322 23456677773 34421 1 Q ss_pred cccCCCcccccccccCCcceeeeEEEEEccccceeecceec----cCCCCCCceEEEecCCC Q lcl|NC_020862. 308 TATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGM----SGKGKSKFRIIVKKPGE 365 (405) Q Consensus 308 ~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~----~~~g~~~~~~ivk~pG~ 365 (405) +... +.-.+.-..|. -=|.| -=++.|.+-+++-. -+-+---++=++|+||- T Consensus 639 ~~~~--dtie~~yL~G~----~~P~i-e~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 639 KKGS--DTIEVAYLDGV----DTPYL-EQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred CCCC--CeEEEEEecCC----CCCeE-eecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 1110 00000000000 00222 12334555444311 01111234457889983 No 167 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=74.68 E-value=0.16 Score=25.02 Aligned_cols=283 Identities=11% Similarity=-0.015 Sum_probs=130.4 Q ss_pred CCccccCc--------CCCcc------cccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCC Q lcl|NC_020862. 1 MPHIYNDP--------AAGDA------STVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLD 66 (405) Q Consensus 1 ~~~~y~~~--------~~t~~------~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~ 66 (405) |+.++=+- ...+. .+|.|++. .+.+.+..+.-.+.+.+.+.++-...|+..++= +.+-. T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~-----~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~-~~~~~- 73 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLW-----DEFWTDMIEETPLLDAIRTETVGAKKTRIPTLN-IGERH- 73 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHH-----HHHHHHHHHhhhhhhhceeeeccCcceeeeeec-cCCcc- Confidence 44332111 11111 23444432 333444444446778888888877776544331 11110 Q ss_pred CCCccccCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhh-cc Q lcl|NC_020862. 67 DLNVNDQGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDT-DS 145 (405) Q Consensus 67 ~~t~l~eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~-d~ 145 (405) .++ .++|.++ -++-..++..++-+++|+..+.++|++.++... .. T Consensus 74 ~~~-~~e~~~~---------------------------------~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~ 119 (321) T protein:vir:31 74 RRP-QDEGEWN---------------------------------ENESDVSTGTIDISTEKATVAWDLPREVVQENPEGE 119 (321) T ss_pred ccc-ccccccc---------------------------------cccccceeeeeeeeeEEEEeehhccHHHHHhhhcch Confidence 111 1222211 122223444557789999999999999765432 24 Q ss_pred chHHHHHHHHHHHHhhHHHHHHHHHHhccCceE------EecCC---Cccce-eeecccccccCCceecHHHHHHHHHHH Q lcl|NC_020862. 146 DLYGHLSREMLRGANEITEDLLQADILASADVK------VFTGA---ATSMV-TMTGEAADAEDDGLITLKDLKRLSITL 215 (405) Q Consensus 146 ~l~~~~~~ell~~~~~~ted~l~~~ilag~~~v------~yag~---ats~~-~~t~~~~~~~~n~~it~~~lr~~~~~L 215 (405) ++.+.+...+.+ +....++ ..+++|.++- .-.|- ++..+ ++.. ....++.+.|.++...| T Consensus 120 d~e~~i~~~ia~-~~a~~~~---~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~------~~~~~~~d~l~~l~~~l 189 (321) T protein:vir:31 120 ALADRILNLMTD-AWSADVE---DLAANGDEDAEDSFENQNDGFITVAEGDVETIDA------ADDILDNDLVIRTIAGL 189 (321) T ss_pred hHHHHHHHHHHH-HHHHHHH---hheeeccccCCCcccccchhhhhhhccccccccc------cccccCHHHHHHHHHhc Confidence 666665554443 3333332 2334553320 00110 00111 1111 12347888898888888 Q ss_pred HhccCccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEe Q lcl|NC_020862. 216 TDNYTPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVV 295 (405) Q Consensus 216 k~nrApk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~ 295 (405) +..... ....+.+|++++..+++.....-+.|.|.+. +..+.-.++-| +..+. T Consensus 190 ~~~yr~-----------------~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~--------l~~~~~~tl~G--~pvv~ 242 (321) T protein:vir:31 190 DSKYRA-----------------RMNPALIVSEDQLLSYHYTLTDRDTPLGDNV--------IMGEADVNPFS--FPIIG 242 (321) T ss_pred cHhHhc-----------------CCCeEEEechHHHHHHHHHHhcCCCccccch--------hhccccccccc--eeEEE Confidence 654321 0136899999999988775444444555443 22344445533 67788 Q ss_pred CcchhhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccc Q lcl|NC_020862. 296 VPQMMHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYG 375 (405) Q Consensus 296 ~p~~~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlg 375 (405) +|.|- .+. ++++.=.--.+++.. + ..++.... .|+.- T Consensus 243 ~~~mP----~~~-------------------------il~t~~~nl~~~~~~---~--~~~~~~~~---------~~~~~ 279 (321) T protein:vir:31 243 SGLWP----DDK-------------------------AMFTDPQNLIYALYR---D--LEIDVLTE---------SDKVS 279 (321) T ss_pred cCCCC----CCc-------------------------EEEeccccEEEEEee---c--cEEEEeec---------Ccccc Confidence 88653 111 111110000111210 0 01111111 11110 Q ss_pred -h--hhhHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 376 -K--VGFSSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 376 -Q--rg~~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) + |-..-...-+.+.|.+.+..+.+| -.|| T Consensus 280 ~~~~~~~~~~~~~~~~~ve~~~a~a~~~-~i~~ 311 (321) T protein:vir:31 280 ERDLHARYFMRGDDDFAIENTEAVVLAE-GLGD 311 (321) T ss_pred ccceeeEeeeeeecceeEeccccEEEEe-cCCc Confidence 0 000011133567788888888888 2233 No 168 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=69.67 E-value=0.22 Score=24.19 Aligned_cols=296 Identities=11% Similarity=0.049 Sum_probs=129.0 Q ss_pred CCccccCcCCCccc------ccccceeehhhhhHHHHHhhh-hhhhhc-cccccccCcCCCCEEEEEecccCCCCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDAS------TVGPQFNVHYWDRKSLIDEAE-EMFFSP-LADNKQMPKHFGKELKVFYYVPLLDDLNVND 72 (405) Q Consensus 1 ~~~~y~~~~~t~~~------~v~~qm~t~y~~~k~L~~a~p-~lv~~~-fA~~~~mPKn~GktIkfrry~pl~~~~t~l~ 72 (405) |+. |..-... +|+.=|++-=+-...+++.++ +++-.. |.+.. -+.+..++|++-.|.-.+- -. T Consensus 1 ~~~----~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~---a~~~~~v~f~~~~p~~~~~--d~ 71 (318) T protein:vir:10 1 MTA----PTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG---ANPNGVVAYNEGNPSFLED--DV 71 (318) T ss_pred CCC----CCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc---ccccceeEEEecccccccC--cH Confidence 332 2111111 122223311223444555554 444333 44433 4567799999988875433 34 Q ss_pred cCCCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHH Q lcl|NC_020862. 73 QGLDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLS 152 (405) Q Consensus 73 eGvtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~ 152 (405) |.| ++|.++---.. + .| ...-+..++||-=..+|||...-... +.++..- T Consensus 72 e~V-aEggEiP~~~~----------~---------------~G---~~~ia~~~K~G~~~~vS~Em~~~n~~-~~v~r~~ 121 (318) T protein:vir:10 72 ADV-AEFGEIPVSAG----------A---------------RG---LPRTAFAVKKALGVRVSKEMIDENRV-GAVNDQM 121 (318) T ss_pred hhc-cCcccccccCC----------C---------------CC---chhhhhhehhccceeccHHHHhhcCh-hHHHHHH Confidence 666 66666521100 0 01 11234667999999999996544443 4444444 Q ss_pred HHHHHHHhhHHHHHHHHHHhccCce-EEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccc-cceeccc Q lcl|NC_020862. 153 REMLRGANEITEDLLQADILASADV-KVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKK-TTIIKGS 230 (405) Q Consensus 153 ~ell~~~~~~ted~l~~~ilag~~~-v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~-T~ii~gs 230 (405) +.+.+.... ..|...-+.|..+.+ -..+.++... .+ -...++..+....+.-++-.. -.+..-. T Consensus 122 ~~l~Nti~r-~~d~~a~dal~sa~t~~~~~s~~w~~------~~-------~~~~d~~~A~e~v~~a~~~~~~a~~~~~~ 187 (318) T protein:vir:10 122 LQLRNTFIR-ANDRSAKALLQSPIVPTLAVPTAWDN------GG-------KVRTDIAIAIEQISTAAPTAYPAGVGSSD 187 (318) T ss_pred HHHHHHHHH-HHHHHHHHHHhccccccccCCcCCCC------cc-------cccccchhhhhhhhhhhhhhhhhhhhhhh Confidence 444443333 233333344443322 1111111000 00 111244444444433333000 0000000 Q ss_pred cccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcC-Ccccc-----cCcce-eEecCCcEEEEeCcchhhhh Q lcl|NC_020862. 231 RMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYA-DAATI-----MNGEI-GAIPGAHLRIVVVPQMMHYA 303 (405) Q Consensus 231 ~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya-~~~~i-----~~gEI-Gsi~g~n~Rfv~~p~~~~~~ 303 (405) ..+|=. -=..++||.+.+-|++ |+.|.++ |- +..++ +.|.+ |++-| ++.|.+|.+-. T Consensus 188 ~~~GY~----pdtIVlhP~~~~~l~~------n~~~~~~--y~~~a~~~~~~~~~tg~~~g~~lG--l~vi~s~~~p~-- 251 (318) T protein:vir:10 188 EYFGFI----PDTIVMHYALLPILMD------NENFMKV--YERNANYVSTAPDWTGNFPGSVMG--LNVIRSRTFPI-- 251 (318) T ss_pred hccCcc----ceeeEECHHHHHHHhc------chhhhhh--hhccchhhhhcccccccccceeec--eEEeecCccCC-- Confidence 112200 1257899999999964 6777754 31 11212 23444 45534 89999985541 Q ss_pred cCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH Q lcl|NC_020862. 304 GAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK 383 (405) Q Consensus 304 ~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK 383 (405) | + .+||=+...|.+... .+++.-- .+ .+.+||. +--..+|. T Consensus 252 --~-------------------~-----alvlq~g~vG~~~d~-------~pl~~t~---~~--~egg~~~-g~~~~s~~ 292 (318) T protein:vir:10 252 --D-------------------R-----VLIMERGTVGFYSDT-------RPLQFTA---LY--PEGNGPN-GGPTESYR 292 (318) T ss_pred --C-------------------e-----eEEEecCCcceeecc-------ccceeee---cc--cCCCCCC-CCcchhhh Confidence 1 1 356666555544321 2233222 22 2567885 55566665 Q ss_pred H----HHHHhhccccceEEEE-EecC Q lcl|NC_020862. 384 F----FYGFIKLRGERIAVAY-SVIP 404 (405) Q Consensus 384 ~----~~~~~iL~~~~marie-~~a~ 404 (405) . +.+..+-++.=...|+ .+.| T Consensus 293 ~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 293 ADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eehheeeeeeeeCcceeEEEeeccCC Confidence 1 1122222222111111 2333 No 169 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=64.91 E-value=0.29 Score=23.51 Aligned_cols=285 Identities=13% Similarity=0.072 Sum_probs=113.5 Q ss_pred CCccccCc-CCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDP-AAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~-~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g 79 (405) ..+.-..- ..+..+.+-|+ .+ ....+........+.+.+.+.++. |. +++......+. + T Consensus 144 ~~~~~~~~~~~~g~~~~vP~---~~-~~~i~~~l~~~~~l~~~~~v~~~~---g~-~~~~~~~~~~~------------a 203 (466) T protein:vir:80 144 VRTLAQQKRAVSGAELTIPD---VM-LELLRDNMHRYSKLISKVRLRPLK---GT-ARQNIAGAIPE------------G 203 (466) T ss_pred HHHHhhhhhhhccccccccH---HH-HHHHHHhhhhhhhhhhheeeeecC---ce-eEeeeecCCcc------------e Confidence 00000000 00001112222 12 112222222223334455444442 21 12211111110 0 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGA 159 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~ 159 (405) ...+.|+.....-.++..|+-++++|+.|+.+|++ ++-|+..++.+.|...+.+ + T Consensus 204 -----------------------~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~e-ll~ds~~~l~~~i~~~la~-~ 258 (466) T protein:vir:80 204 -----------------------VWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNS-TLEDSDLNLADEILDAIGQ-A 258 (466) T ss_pred -----------------------eecccccccccccccccceeecceeeeeehhhhHH-HHhcchHHHHHHHHHHHHH-H Confidence 11222333333344666788999999999999998 4567776788877666654 3 Q ss_pred hhHHHHHHHHHHhccCceEE------ecCCCccceeeecccccccCCceecHHHHHHH--------------HHHHHhcc Q lcl|NC_020862. 160 NEITEDLLQADILASADVKV------FTGAATSMVTMTGEAADAEDDGLITLKDLKRL--------------SITLTDNY 219 (405) Q Consensus 160 ~~~ted~l~~~ilag~~~v~------yag~ats~~~~t~~~~~~~~n~~it~~~lr~~--------------~~~Lk~nr 219 (405) ...+++ ..+++|.++-. +.+..+.... + ...+...+.++..++-.+ +..+.... T Consensus 259 ~~~~~~---~ail~G~G~~~P~Gil~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (466) T protein:vir:80 259 IGFALD---KAILYGTGTKMPVGIVTRLAQTTQPPN-W--GTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKAR 332 (466) T ss_pred HHHHHh---hheeeccCCCCcceeeecccccccccc-c--ccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhh Confidence 334444 45667655421 1111111000 0 011111222333322221 11111111 Q ss_pred CccccceeccccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcch Q lcl|NC_020862. 220 TPKKTTIIKGSRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQM 299 (405) Q Consensus 220 Apk~T~ii~gs~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~ 299 (405) ++ ...+.++..+++++.+.++.+.... ++....+..=++.. .|-| ..+|+++.| T Consensus 333 ~~---------------~~~~~~~w~~~~~~~~~l~~~~~~~-~~~g~~~~~~~~~~--------~i~G--~pvv~s~~~ 386 (466) T protein:vir:80 333 AN---------------YSNGMKFWAMSSNTHAVLMSKAITF-NSAGALVASLNNTM--------PIVG--GDIVILDFI 386 (466) T ss_pred cc---------------ccCCceeEEecchhHHHhhcccccc-cCCccccccCCCcc--------cccc--cceeecCcc Confidence 11 1122456678999888887664211 11111111101111 1222 345555533 Q ss_pred hhhhcCCCcccCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhh Q lcl|NC_020862. 300 MHYAGAGATATAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGF 379 (405) Q Consensus 300 ~~~~~aGa~~~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~ 379 (405) - .| + +++|.-..-.+..+ ..+++.+.. +..==.+|.+| T Consensus 387 ~----~~---------------------~----~~~g~~~~y~i~~r-------~~~~i~~~~------~~~f~~d~~~~ 424 (466) T protein:vir:80 387 P----DN---------------------D----IIGGYGSLYLLAER-------ADIKLAQSE------HVRFIEDQTVF 424 (466) T ss_pred C----cc---------------------c----eeeeccccEEEEee-------cceEEEech------hhhhhcCcEEE Confidence 1 00 1 45555444444443 123333321 00101244444 Q ss_pred HHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_020862. 380 SSIKFFYGFIKLRGERIAVAYSVIPE 405 (405) Q Consensus 380 ~gwK~~~~~~iL~~~~marie~~a~~ 405 (405) .+ .+++.+.+.+++=++.++..... T Consensus 425 r~-~~r~dg~~~~~~afv~~~~~~~~ 449 (466) T protein:vir:80 425 KG-TARYDGKPVFGEGFVAVNIANAN 449 (466) T ss_pred EE-EEEEccEEeccCceEEEEecCCC Confidence 44 24456666666666666544333 No 170 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=63.61 E-value=0.31 Score=23.34 Aligned_cols=280 Identities=13% Similarity=0.144 Sum_probs=122.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhh--hhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAE--EMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDAT 78 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p--~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~ 78 (405) +...|. -+|++...=+.. - -.|.|+++-. +--|.+|+-..+++.=+- .++.|..-++.=. -| |. T Consensus 357 v~~A~~----hsTsDFp~IL~~-~-~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~--~~~~~lg~~~~L~-----~V-~E 422 (652) T protein:vir:79 357 VGAAFT----HSTSDFGNILLD-V-ANKAILQGWEDAPETYEQWTRKGQLSDFKI--AHRVGMGGFSALR-----QV-RE 422 (652) T ss_pred HHHHhh----cCcchHHHHHHH-H-HHHHHHHHHhhhHHHHHHHhccCCCccccc--cceeecCCCCCcc-----cc-CC Confidence 111111 112222211111 0 1455555544 346778887777765433 2333333322211 12 34 Q ss_pred cccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHH Q lcl|NC_020862. 79 GASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRG 158 (405) Q Consensus 79 g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~ 158 (405) |.+|..|.+ .| ...+..|..||--+.+|.+.++=| |=+.+..+.+.+++. T Consensus 423 ~gEyk~~t~-----------------~e------------~~e~~~l~tyG~~~~iTRqaiIND-DL~a~~~ip~~~g~a 472 (652) T protein:vir:79 423 GAEYKYVTT-----------------GD------------KQATIALATYGELFSITRQAIIND-DLNMLTDVPMKLGRA 472 (652) T ss_pred CCccceeee-----------------cC------------ccceeeeecccCeeeeehheeecc-chhHHHHHHHHHHHH Confidence 555543322 22 234679999999999999864333 334588878777755 Q ss_pred HhhHHHHHHHHHHhccCceEEecCCCc----cceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 159 ANEITEDLLQADILASADVKVFTGAAT----SMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 159 ~~~~ted~l~~~ilag~~~v~yag~at----s~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) +..+-.++.+ .+|.+-..+.+-|.+= .-.|+.+ ...+++.-|-.+...+..+. +|.+. T Consensus 473 A~~~~~~~vy-~~l~~Np~~~~DGk~LF~hA~H~Nl~~-------~aa~~~~~l~~ar~aM~~Qk--------~g~~~-- 534 (652) T protein:vir:79 473 AKSTIADLVY-AILTSNPKISTDNVSLFDKAKHANVLE-------SAAMDVASLDKARQLMRVQK--------EGERH-- 534 (652) T ss_pred HHHHHHHHHH-HHHhcCcccccCCceeecccccccccc-------cccCCHHHHHHHHHHHHHhc--------cCCcc-- Confidence 5555444445 5554332222222110 0112111 12367777766666665443 23222 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) --|.|.| .+|+|+++-.-+.|.. ....-+ .....|.+--+. +.+..|+.|++.- . +. . T Consensus 535 -l~i~P~~-llvp~~le~~a~~ll~--------s~~v~~--a~~~~~~~Np~~-~~~~~i~eprL~~-----~--s~--~ 592 (652) T protein:vir:79 535 -LNIRPAF-VLVPTAMESVANQVIR--------SSSVKG--ADINAGIINPVK-DFATVIAEPRLDD-----N--SQ--T 592 (652) T ss_pred -ccccccE-EEecchhHHHHHHHhc--------cCCCcc--cccccccccccc-cccccccccccCC-----C--Cc--c Confidence 3466777 6899999999987742 111111 112334444442 2357788888741 0 00 0 Q ss_pred ccccccccCCcceeee----------EEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHH Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIA----------PLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKF 384 (405) Q Consensus 315 ~~~~~~~~g~~~~DVY----------p~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~ 384 (405) .+.+.... . .|.. |.|=- .+.|.+-|++ ||. T Consensus 593 ~wylaa~~--~-~dtiev~yL~G~~~P~ie~-~~gf~~dG~~-----------------------------------~kv 633 (652) T protein:vir:79 593 TFYLAASK--G-SDTIEVAYLNGVDTPYIDQ-MEGFSVDGVT-----------------------------------TKV 633 (652) T ss_pred cEEEecCC--C-CCeEEEEEecCCCCCeeee-cCCCCcceEE-----------------------------------EEE Confidence 00000000 0 0110 11111 1223333332 332 Q ss_pred HHHHhhccccceEEEEEec Q lcl|NC_020862. 385 FYGFIKLRGERIAVAYSVI 403 (405) Q Consensus 385 ~~~~~iL~~~~marie~~a 403 (405) +.=+.+=-=.|--..++-| T Consensus 634 rlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 634 RIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred EEeccCceeeccceeeecC Confidence 2111111112222333333 No 171 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=57.70 E-value=0.43 Score=22.60 Aligned_cols=293 Identities=11% Similarity=-0.019 Sum_probs=124.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) =...|+.-..++++.-+--+ +--+.++.+++..+.-.+-+++.+.++. |. .++-+...-+.+ . T Consensus 76 e~~~~~~~~~~~~~~gg~lv-P~~~~~~I~~~l~~~s~l~~~~~v~~~~---~~-~~i~~~~~~~~a-----~------- 138 (383) T protein:vir:78 76 EIKFFNDINKEVGYKEETLL-PQTVVDEIFEDLTTEHPFLASIGMRTTG---LR-TKFLKSETSGVA-----V------- 138 (383) T ss_pred HHHHHHHHhccCCCCCcccc-CHHHHHHHHHHHHhhccceeeeeeEecC---Cc-eEEEEEcCCcce-----E------- Confidence 01122322222222212111 2233455554454444566677766653 33 232221111110 0 Q ss_pred cccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHh Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGAN 160 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~ 160 (405) - -.|.|.+++....++..++-+.++++.++.+|++++ -|+.-+|...|..++.+ +. T Consensus 139 w----------------------~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell-~Ds~~~ie~~i~~~l~~-~~ 194 (383) T protein:vir:78 139 W----------------------GKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLE-KFGPAWVKRFVVTQIEE-AF 194 (383) T ss_pred E----------------------eecccccccccCcceeeEeecceeeEeeccchHHHh-hccHHHHHHHHHHHHHH-HH Confidence 0 011122223333467777889999999999999854 44555777776666664 33 Q ss_pred hHHHHHHHHHHhccCceEE------ecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_020862. 161 EITEDLLQADILASADVKV------FTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTD 234 (405) Q Consensus 161 ~~ted~l~~~ilag~~~v~------yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~g 234 (405) ...++ ..+++|.++.. ..+..++.+ .+.......-..++..++..+...|+.-+.- .+-+-++. . T Consensus 195 a~~~~---~a~i~G~G~~qP~Gil~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~---~ 265 (383) T protein:vir:78 195 AVALE---SAYIVGDGNDKPIGLNRKVGKGSTVV--DGVYAEKAATGTLTFANPKTTVNELTDVYKY-HSVKENGH---P 265 (383) T ss_pred HHHHh---hheEeccCCCCceeeeeccCCccccc--ccccccccccchhhhhhhHHHHHHHHHHHhc-cchhcccc---h Confidence 33343 44677766432 111111111 1110111111334555555554444432110 00000000 0 Q ss_pred cccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCc Q lcl|NC_020862. 235 TKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANR 314 (405) Q Consensus 235 T~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~ 314 (405) =+.++ .-+.++++...+++.-. . ...+. +|+.-.+-+-.+++|+++.|. + T Consensus 266 ~~~~~-~~~~~~n~~~~~~~~~~--------~----~~~~~----~G~~~t~l~~~~~iv~s~~~p----~--------- 315 (383) T protein:vir:78 266 LNVAG-KVTLLVNPTDAWDVKKQ--------Y----TSLNA----NGVYVTALPFNLNIIESLFVP----E--------- 315 (383) T ss_pred hhhcC-ceEEEEcCcchhhhccc--------h----hccCC----CCceeeecCCCceEEecCCCC----c--------- Confidence 01111 12356666444444211 1 11111 223223323345566665432 0 Q ss_pred ccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHHHhhcc Q lcl|NC_020862. 315 GYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYGFIKLR 392 (405) Q Consensus 315 ~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL~ 392 (405) + + ++||.-..=.+..+ +.+++-+. .+-+-.++-.+++ +++.+++++ T Consensus 316 ----------~--~----iifgdfs~Y~i~~r-------~~~~i~~~---------~~~~f~~d~~~f~~~~r~dG~~~~ 363 (383) T protein:vir:78 316 ----------K--K----AISYVAERYDALIG-------GPLDIGTY---------DQTLAIEDLNLYAAKQFAYGKAKD 363 (383) T ss_pred ----------c--c----EEEeeccceEEEec-------ccceEEec---------chhhhhcCceEEEEEEEEcCEEec Confidence 1 1 45665444344443 11222111 2222233333333 567889999 Q ss_pred ccceEEEEEecCC Q lcl|NC_020862. 393 GERIAVAYSVIPE 405 (405) Q Consensus 393 ~~~marie~~a~~ 405 (405) ++=+++++...-| T Consensus 364 ~~A~~vl~~~~~~ 376 (383) T protein:vir:78 364 DKAAAVWTLNINP 376 (383) T ss_pred CCeEEEEEEEecC Confidence 9999998876555 No 172 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=48.02 E-value=0.68 Score=21.49 Aligned_cols=289 Identities=12% Similarity=0.143 Sum_probs=139.5 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGA 80 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~ 80 (405) |+ .+...++...+++.+..-| -..+.+...+| +.-|+-.|.+|...|.+||-+++.... -..+-|--++|. T Consensus 1 M~---~e~nl~~~~dL~~a~siDF--~~~f~~~i~~L-~~~LGv~r~~pla~Gt~iktyK~~~~~---y~gda~dVaEGe 71 (303) T protein:vir:10 1 MS---AENNLINVEALGKAKSIDF--ANKLGVGLNKL-FEALAIQNKIPMNVGSALKQYRFKVED---SEKPNGDVAEGD 71 (303) T ss_pred CC---CCcCCcchhhcccceeehh--hhhhhhhHHHH-HHHhhhhccccccCCceeeeeeeecee---eccccccccCCc Confidence 55 4455677788887766666 33333333322 223566788888899999887764331 123333344554 Q ss_pred cccCCcccccccccccccccccccccccccccccceeee---eEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHH Q lcl|NC_020862. 81 SYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRL---ERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLR 157 (405) Q Consensus 81 ~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~---di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~ 157 (405) .| . |+. ++|+ ..+.++.+|+--+ |||++-..-..+.+++.-.+|+. T Consensus 72 ~I----------p----------lsk---------vt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~ 120 (303) T protein:vir:10 72 VI----------P----------LTK---------VTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIK 120 (303) T ss_pred cc----------c----------hhh---------heeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHH Confidence 44 1 222 2332 5788999999855 99987677777777876666665 Q ss_pred HHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_020862. 158 GANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKT 237 (405) Q Consensus 158 ~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~ 237 (405) .-+.-....+...++.+++. .++ ...+.++.+-|-++......+-.-. +-. T Consensus 121 ~Iq~kIdnd~~~~lktaT~t------~~~-----------t~~t~~s~~glq~Al~~~~~kl~~~-----------~ed- 171 (303) T protein:vir:10 121 YVQKKFRAKFFETLKSAIEN------GKR-----------TNKTKLSAENLQGALSKGRANLSVL-----------LDD- 171 (303) T ss_pred HHHhhhhHHHHHHHhhcccc------ccc-----------ccceeecHHHHHHHHHhhhhhcccc-----------ccc- Confidence 33332222222233332211 101 1235678888888776554443321 111 Q ss_pred ccceEEEEEcccchHHHHHHhcccCCCccee-h-hhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 238 ISASRIAYIGSELEIYITELVDSLGNPAFVP-V-EKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 238 I~~syv~~~h~dl~~dir~l~d~~~~p~fi~-v-~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) ...+|.||+|.=..++|.= +.+. . .++|.. ...+=.|. -+|.++.+. +|...... T Consensus 172 -~~~~V~FvNP~Daa~yl~~-------A~i~~~~t~fG~n--~L~nfLG~------~II~S~kv~----~G~~~~T~--- 228 (303) T protein:vir:10 172 -EITPIAFVNPNDTAEYLAN-------GFINSTGAQFGVN--LLTPYVGV------KIVEFADVP----QGEVWMTV--- 228 (303) T ss_pred -cccEEEEEchHHHHHHhhc-------CCcchhhhhhhhh--hhhhhhcc------eEEEeccCC----CceEEEee--- Confidence 1247999999988888742 2222 1 345544 23332232 234444433 23222111 Q ss_pred cccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccc Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGER 395 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ 395 (405) ..-+-+|.+-+-|+-+ ...+|- .+ ...+.=+...+=. ++=-+-- .++.+..|-+|+ T Consensus 229 --------~~Ni~~ay~~~~g~l~-~~f~~t---~D-~tglIGv~h~~~~----~~~t~eT-------~~~~~~~lfpE~ 284 (303) T protein:vir:10 229 --------AENLNVAYANPRGELS-RAFAFA---TD-ATGFVGVLHDIQP----QRLTSDT-------IYASAISMFPEN 284 (303) T ss_pred --------ccceEEEEecCchhhh-hhhhhc---cc-cccceEEEecccc----ceeeehh-------HhHhHHHhcccc Confidence 1112344555555333 222221 11 2223333333211 1111122 333444444554 Q ss_pred e---EEEEEec---CC Q lcl|NC_020862. 396 I---AVAYSVI---PE 405 (405) Q Consensus 396 m---arie~~a---~~ 405 (405) . +..+.-+ +| T Consensus 285 ~dgiv~~ti~~~e~~~ 300 (303) T protein:vir:10 285 IDAVIKVTIKKDEAGE 300 (303) T ss_pred cceEEEEEEeccccCC Confidence 3 4444423 33 No 173 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=42.96 E-value=0.86 Score=20.93 Aligned_cols=286 Identities=11% Similarity=0.075 Sum_probs=126.7 Q ss_pred ccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccccC Q lcl|NC_020862. 5 YNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASYAG 84 (405) Q Consensus 5 y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~~~ 84 (405) .+++..|..+.+.+.+..-|-. .+.+-..+ .+.-|+-.+.+|...|.||++++|.=..++.+ . ++|..| T Consensus 1 mAe~nlt~~~dL~~~~sidfv~--~f~~~i~~-L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~d-V-----aEGe~I-- 69 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVN--KFSKNIND-LLKLLGVTRRETLTNDLKIQTYKWEVTLDQTD-P-----GEGETI-- 69 (295) T ss_pred CCCcccccHhhccCceeehhhH--HhhhhHHH-HHHHhccccccccccCCeEEeeeeeeeccccc-c-----cCCccc-- Confidence 5677778788887544444411 11111111 12236677889999999999999887766654 2 333333 Q ss_pred Ccccccccccccccccccccccccccccccceeee---eEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhh Q lcl|NC_020862. 85 GNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRL---ERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANE 161 (405) Q Consensus 85 gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~---di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~ 161 (405) + |++ ++++ ..+.++.+|+-- .|||++-..-..+..++...+|+..-+. T Consensus 70 --------p----------lsk---------vt~~~~~t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~ 120 (295) T protein:vir:99 70 --------P----------LSK---------VTRTKDKDYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQN 120 (295) T ss_pred --------c----------hhh---------heeeeeeeeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHH Confidence 1 111 2333 467889999984 5999875667777788766666654333 Q ss_pred HHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcccccce Q lcl|NC_020862. 162 ITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISAS 241 (405) Q Consensus 162 ~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~s 241 (405) -..+.+...++++++.+ ++. +-...++.+. ..|....-- .| .. T Consensus 121 kId~D~~~~lktat~t~------------tg~------~lq~a~a~~~---~al~~f~Ee-----------~~-----~~ 163 (295) T protein:vir:99 121 GIKDAFFTFLKTKPTKV------------KGV------GLQKALSASW---AKLATFNEF-----------EG-----SP 163 (295) T ss_pred hhhHHHHHHhccCceee------------ehh------hHHHHHHHhh---hhhhhcccc-----------cC-----Cc Confidence 22222222223322111 100 0011222221 122222211 11 24 Q ss_pred EEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcccccccc Q lcl|NC_020862. 242 RIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRGYQVSDV 321 (405) Q Consensus 242 yv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~~~~~~~ 321 (405) +|.|++|.-..++|.-+. -.|-....+|.. +..+=.|- ..=||.=..|+-+.|+-+ . T Consensus 164 ~V~FVnP~D~a~yl~~A~----~~~~~a~~fG~~--~L~nfLG~--q~II~S~kv~~G~~~aT~-------~-------- 220 (295) T protein:vir:99 164 LVSFVSPLDVANYLGDTK----VGADASNVFGMT--LLKNFLGM--QNVIVMPSVPEGKIYSTA-------V-------- 220 (295) T ss_pred eEEEEehHHHHHHHhccc----cccchhhhhhhh--hhhhhhcc--ceEEEcccCCCceEEEee-------c-------- Confidence 899999999999885331 123333335443 22332231 001343344443333321 0 Q ss_pred cCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccceEEEEE Q lcl|NC_020862. 322 AGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGERIAVAYS 401 (405) Q Consensus 322 ~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~marie~ 401 (405) .-+-+|.+-+=|-|-=....+- .....+.=+...+= .++=-+--.-++||++| .-+..-++.... T Consensus 221 ---~Ni~~ay~~~~~g~l~~~f~~~----~D~tglIg~~h~~~----~~~~t~et~~~~~~~lf----pE~~dgiv~~tI 285 (295) T protein:vir:99 221 ---ENLVFASLNVKGGDLGGLFADF----TDETGLIAAARNRQ----LSNLTYESVFFGANVLF----AEIPEGVVEATI 285 (295) T ss_pred ---cceEEEEecCCchhhhhhhhhc----cCcccceEEEeccc----cceeeehhhhHhHHHhc----ccccceEEEEEE Confidence 0011222222211100111110 01112222222221 11111222233444443 122334566666 Q ss_pred ecCC Q lcl|NC_020862. 402 VIPE 405 (405) Q Consensus 402 ~a~~ 405 (405) -+|| T Consensus 286 ~~~~ 289 (295) T protein:vir:99 286 EAAA 289 (295) T ss_pred ecCc Confidence 6677 No 174 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=40.77 E-value=0.96 Score=20.69 Aligned_cols=284 Identities=10% Similarity=0.065 Sum_probs=126.0 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEe-cccCCCCCCccccCCCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFY-YVPLLDDLNVNDQGLDATG 79 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrr-y~pl~~~~t~l~eGvtp~g 79 (405) -..-|.++..+..+.+++.+..-| -..+.+-..+| +.-|+-.|.+|...|.|||.+. |.-..++.+ =++| T Consensus 3 ~~~~~~e~nlt~~~dl~~~~siDf--~~~f~~~i~~L-~~~LGv~r~~pla~GstIkt~k~~~y~gda~d------VaEG 73 (296) T protein:vir:98 3 TSRTYPEENLIKSTDLKYPITIDV--TNKFQENISKL-LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGN------VPEG 73 (296) T ss_pred CccccCcCCCcchhhhhhhhhhhh--HHHHhhhHHHH-HHHhhhcccccccCCCEEeeccceeeeecccc------ccCC Confidence 356788888888888988766665 22232222222 2336777889999999999874 333333322 2344 Q ss_pred ccccCCcccccccccccccccccccccccccccccceeee---eEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 80 ASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRL---ERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 80 ~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~---di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) ..| . |++ ++++ ..++++.+|+--+ |||++-..-..+..++...+|+ T Consensus 74 e~I----------p----------lsk---------vt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~ 122 (296) T protein:vir:98 74 EVI----------P----------LSK---------VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALV 122 (296) T ss_pred ccc----------c----------hhh---------heeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHH Confidence 343 1 122 2443 4678999999875 9998766677777887666666 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTK 236 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~ 236 (405) ..-+.-....+...++.++..+. .+.+.|-.+....-.+-.-+ +.-. T Consensus 123 ~~iq~kId~d~~t~LktaT~t~~-----------------------~t~~~lQ~Ala~~~~~l~~~----------fede 169 (296) T protein:vir:98 123 RQLQKKIRTDFVTALKTGTGTQD-----------------------ALGAGLQGALASAWGKLQVL----------FEDY 169 (296) T ss_pred HHHHHhhhHHHHHHHhcccceee-----------------------echhhHHHHHHHHhhhhhhh----------cccc Confidence 54333222222223333322111 22333333332111111100 1100 Q ss_pred cccceEEEEEcccchHHHHHHhcccCCCcceehh-hcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccCCCcc Q lcl|NC_020862. 237 TISASRIAYIGSELEIYITELVDSLGNPAFVPVE-KYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATAANRG 315 (405) Q Consensus 237 ~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~-~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~t~~~ 315 (405) ....+|.|++|.=..+.+ ++..+ ..+ .|| .+..++-.|.. =||.=..|+-+.|+-+ . T Consensus 170 -d~~~~V~FVnP~D~a~yl------g~a~i-t~qt~fG--~tyl~nfLG~~---II~S~kV~~G~~~~T~-------~-- 227 (296) T protein:vir:98 170 -GSERAIVFANSLDVAEYI------AKAGI-TTQTAFG--LTYLVDFTGTV---IISTNDVTKGEIWATV-------P-- 227 (296) T ss_pred -CCCceEEEEehHHHHHHh------cCCcc-chhheec--hhhhhhccccE---EEEcCcCCCceEEEee-------e-- Confidence 012499999998777765 34433 332 222 22334455531 1343333333333311 0 Q ss_pred cccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHHHHHHHhhccccc Q lcl|NC_020862. 316 YQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIKFFYGFIKLRGER 395 (405) Q Consensus 316 ~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL~~~~ 395 (405) .-+-+|.+=+=|-|-=....|- .....+.=+...+=. ++=-+-- .++.+..|-+|+ T Consensus 228 ---------~Ni~~ay~~~~~~~l~~~f~~~----~d~tglIGv~h~~~~----~~~t~eT-------~~~~~~~lfpE~ 283 (296) T protein:vir:98 228 ---------ENIIFAYINPNNSELAKEFNLY----GDPTGYIGMNHFQEN----TTLTIQT-------LLVSGMLMYPER 283 (296) T ss_pred ---------cceEEEeecccccchhhhhccc----cccccceEEEecccc----ceeeehh-------HhHhHHHhcccc Confidence 0011222222111100011110 011122222222210 1111112 334444455555 Q ss_pred eEE-----EEEec Q lcl|NC_020862. 396 IAV-----AYSVI 403 (405) Q Consensus 396 mar-----ie~~a 403 (405) ..= |+.+. T Consensus 284 ~dgiv~~tI~~~~ 296 (296) T protein:vir:98 284 IDGIVKVTLTPGV 296 (296) T ss_pred cceEEEEEecCCC Confidence 432 22222 No 175 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=34.15 E-value=1.3 Score=19.94 Aligned_cols=292 Identities=10% Similarity=-0.024 Sum_probs=125.8 Q ss_pred CC----ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCC Q lcl|NC_020862. 1 MP----HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLD 76 (405) Q Consensus 1 ~~----~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvt 76 (405) ++ ..|++-..++.+.-+ -.-+--+.++.+.+....=.+-++|.+.++. |.+ ++-+...-+. ..- T Consensus 65 l~~~e~~~~~~~~~~t~~~Gg-~lvP~~~~~~I~~~l~~~spir~~a~v~~~~---~~~-~i~~~~~~~~-----a~W-- 132 (381) T protein:vir:10 65 LSANQRNFFMDINKSVGYKEE-KLLPEETIDRIFEDLTTNHPLLADLGIKNAG---LRL-KFLKSETSGV-----AVW-- 132 (381) T ss_pred cCHHHHHHHHHHhhcCCCCCc-eecCHHHHHHHHHHHHhhcceeeeeeeEecC---cce-EEEeecCCcc-----eEE-- Confidence 11 112222222222111 1122223455554444444566678777763 322 2221111110 000 Q ss_pred cccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHH Q lcl|NC_020862. 77 ATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREML 156 (405) Q Consensus 77 p~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell 156 (405) -.|.|.++.....++..|+-+.++++.+..+|++++ -|+.-+|...|..++. T Consensus 133 ---------------------------~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL-~Ds~~~le~~i~~~la 184 (381) T protein:vir:10 133 ---------------------------GKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLN-DFGPAWIERFVRVQIE 184 (381) T ss_pred ---------------------------eecccccccccCccceeEeecceeEEeeccccHHHH-hccHHHHHHHHHHHHH Confidence 011122233333466778899999999999999864 4444467777666665 Q ss_pred HHHhhHHHHHHHHHHhccCceEEecCCCc---cceeeeccc-ccccCCceecHHHHHHHHHHHHh---ccCccccceecc Q lcl|NC_020862. 157 RGANEITEDLLQADILASADVKVFTGAAT---SMVTMTGEA-ADAEDDGLITLKDLKRLSITLTD---NYTPKKTTIIKG 229 (405) Q Consensus 157 ~~~~~~ted~l~~~ilag~~~v~yag~at---s~~~~t~~~-~~~~~n~~it~~~lr~~~~~Lk~---nrApk~T~ii~g 229 (405) + +....|+ ..+++|.++..=-|--+ +....+.+. ........++..++......|.. +.+.. .. T Consensus 185 ~-~~a~~~~---~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~-----~~ 255 (381) T protein:vir:10 185 E-AFAVALE---TAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTN-----EK 255 (381) T ss_pred H-HHHHHhh---ceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhh-----hc Confidence 4 3333343 45677766533222100 111111110 00011123444443333332221 11100 00 Q ss_pred ccccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcc Q lcl|NC_020862. 230 SRMTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATA 309 (405) Q Consensus 230 s~~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~ 309 (405) . ..+......+..+++....+|+.+.. +. -++...+ ..+ +-++++++++.|- + T Consensus 256 ~---~~~~~~~~~~~vmn~~t~~~l~~~~~------~~----~~~G~~v-----~~l-p~g~~vv~~~~~p----~---- 308 (381) T protein:vir:10 256 G---KSVAVKGNVTMVVNPSDAFEVQAQYT------HL----NANGVYV-----TAL-PFNLNVIESTVQE----A---- 308 (381) T ss_pred c---ccccccCceEEEEchhhHHhhccccc------cC----CCCCcee-----ecC-CCCceeEEcCCCC----c---- Confidence 0 00111223456789888888875432 11 1111111 011 1124566665432 0 Q ss_pred cCCCcccccccccCCcceeeeEEEEEccccceeecceeccCCCCCCceEEEecCCCCCCCCCCccchhhhHHHH--HHHH Q lcl|NC_020862. 310 TAANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQGMSGKGKSKFRIIVKKPGEATADRNDPYGKVGFSSIK--FFYG 387 (405) Q Consensus 310 ~~t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~g~~~~g~~~~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~ 387 (405) + + ++||.-++-.|..+ ..+++-+. .+-+-..+-.+++ +++. T Consensus 309 ---------------~--~----i~fGDfs~Y~i~~r-------~~~~i~~~---------~~~~~~~d~~~f~a~~r~d 351 (381) T protein:vir:10 309 ---------------G--K----VLTYVKGLYDGYLA-------GGINVQKF---------KETLALDDMDLYTAKQFAY 351 (381) T ss_pred ---------------C--c----EEEEEcccEEEEEe-------cccEEEee---------chhhhhcCceEEEEEEEEc Confidence 1 1 56777666555554 12332221 1223333333333 5667 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_020862. 388 FIKLRGERIAVAYSVIPE 405 (405) Q Consensus 388 ~~iL~~~~marie~~a~~ 405 (405) +++.+++=++++...+-| T Consensus 352 G~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 352 GKAKDNKVAAVWKLDLKG 369 (381) T ss_pred CEEecCCcEEEEEEeecC Confidence 778888888887665444 No 176 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=25.81 E-value=2 Score=18.92 Aligned_cols=225 Identities=17% Similarity=0.212 Sum_probs=117.4 Q ss_pred ccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhccccccccCcCCCCEEEEEecccCCCCCCccccCCCcccccc Q lcl|NC_020862. 3 HIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPLADNKQMPKHFGKELKVFYYVPLLDDLNVNDQGLDATGASY 82 (405) Q Consensus 3 ~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~fA~~~~mPKn~GktIkfrry~pl~~~~t~l~eGvtp~g~~~ 82 (405) |+-| +..+ +.|-.+....+..-|+.. |.+. +.|-.+ .+.+..+++-| T Consensus 1 M~i~------~~~l-----------~~l~~~~~~~f~~~~~~a---~~~~-~~iA~~----vpSt~~~~tY~-------- 47 (305) T protein:vir:19 1 MIVT------PASI-----------KALMTSWRKDFQGGLEDA---PSQY-NKIAMV----VNSSTRSNTYG-------- 47 (305) T ss_pred CccC------HHHH-----------HHHHHHHHHHHHHHHhhc---Cccc-ceEEeE----ecCCCCccccc-------- Confidence 2211 1122 222222222222223322 2222 222222 11222222111 Q ss_pred cCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHHHHHHHHHHhhH Q lcl|NC_020862. 83 AGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHLSREMLRGANEI 162 (405) Q Consensus 83 ~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~~~ell~~~~~~ 162 (405) =.+.+|.|+|-=|...-..+.-+.-+..-+.|..=+.+..+.+.-|. -.++..+..+|+..+++. T Consensus 48 --------------wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~-lG~y~p~~~~~G~~aa~~ 112 (305) T protein:vir:19 48 --------------WLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFEGTVGISRDDFEDDN-LGIYAPIFQEMGRSAAVQ 112 (305) T ss_pred --------------ccccCCccchhhcceeeeeccccceeEeeccccceeccchhhccccc-cCchHHHHHHHHHHHhhc Confidence 23677888884355555556666677778888888888886543333 257888888888888777 Q ss_pred HHHHHHHHHhccCceEEecC------------------CCccceeee---cccccc------------------------ Q lcl|NC_020862. 163 TEDLLQADILASADVKVFTG------------------AATSMVTMT---GEAADA------------------------ 197 (405) Q Consensus 163 ted~l~~~ilag~~~v~yag------------------~ats~~~~t---~~~~~~------------------------ 197 (405) -.+++..-|++|-+..-|-| .+++-+++. +....+ T Consensus 113 pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~ 192 (305) T protein:vir:19 113 PDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELV 192 (305) T ss_pred hhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEeccccccee Confidence 77777777777766655543 233333321 111000 Q ss_pred -----c------------------------------CCceecHHHHHHHHHHHHhccCccccceeccccccCcccccceE Q lcl|NC_020862. 198 -----E------------------------------DDGLITLKDLKRLSITLTDNYTPKKTTIIKGSRMTDTKTISASR 242 (405) Q Consensus 198 -----~------------------------------~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~~~gT~~I~~sy 242 (405) . ....||.+.|..+...+..++... | +. .-|.|.| T Consensus 193 ~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~aar~aM~~qk~d~------G-~p---L~I~P~~ 262 (305) T protein:vir:19 193 ARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRSFEGDG------G-KK---LGLKPTH 262 (305) T ss_pred eccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHHHHHHHHhhcCCC------C-ce---eeeecCe Confidence 0 024577777777777777666632 2 22 3466777 Q ss_pred EEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcch Q lcl|NC_020862. 243 IAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQM 299 (405) Q Consensus 243 v~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~ 299 (405) .+|+|.++-.-+.|.. -.+++ +..+ +++=-. -+.+-.|++|++ T Consensus 263 -LvVPp~LE~~A~qll~----s~~i~-----~g~~---~~~Np~-~g~~eliV~P~L 305 (305) T protein:vir:19 263 -IVVPVGLEKAAEQLLN----RELFA-----DGNT---TVSNEM-KGKLQLVVADYL 305 (305) T ss_pred -EEeCchhHHHHHHHHh----hcccC-----Cccc---ccccee-cceEEEEecccC Confidence 4899999999988753 12221 1110 111111 134578888888 No 177 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=20.81 E-value=2.7 Score=18.21 Aligned_cols=283 Identities=11% Similarity=-0.006 Sum_probs=109.7 Q ss_pred CCccccCcCCCcccccccceeehhhhhHHHHHhhhhhhhhcc--cc--ccccCcCCCCEEEEEecccCCCC----CCccc Q lcl|NC_020862. 1 MPHIYNDPAAGDASTVGPQFNVHYWDRKSLIDEAEEMFFSPL--AD--NKQMPKHFGKELKVFYYVPLLDD----LNVND 72 (405) Q Consensus 1 ~~~~y~~~~~t~~~~v~~qm~t~y~~~k~L~~a~p~lv~~~f--A~--~~~mPKn~GktIkfrry~pl~~~----~t~l~ 72 (405) |- .||+-.-+....--+|. ..+|... +- ..+-|- .|+.++.=.|.+|..+ .+.++ T Consensus 6 ~~-vfN~~~~~a~~e~~~q~---------------~~~fn~as~gai~l~~~~~-~Gd~~~~pf~~~l~g~~~~~~~~~~ 68 (325) T protein:vir:95 6 LA-VYSEYAYSAFSETLRQQ---------------VDLFNTATGGAIMLQSAAH-QGDFSDVAFFAKVTGGLVRRRNAYG 68 (325) T ss_pred hh-hhhhhhhhhhhhhhhhh---------------HhhhhhcccceeEeccccc-cCceeeccccccccccccccccCCC Confidence 22 24432111111111110 1122111 00 001110 3666666566666432 12222 Q ss_pred cC-CCcccccccCCcccccccccccccccccccccccccccccceeeeeEEEEeeeeeeeEEecchhhhhhhccchHHHH Q lcl|NC_020862. 73 QG-LDATGASYAGGNLYGGSRDIGTVTGKMPTLTETGGRVNRVGYTRLERTGTLTEYGFFMEYTEDSLMFDTDSDLYGHL 151 (405) Q Consensus 73 eG-vtp~g~~~~~gnly~ss~di~~it~k~ptl~e~g~r~~~~~~t~~di~~~l~qyG~~~e~Td~~~~~d~d~~l~~~~ 151 (405) ++ |+|. + || +.+++.+.+..=-.|++.+- ..+-...+.+.++ T Consensus 69 ~~~vt~~--k---------------it------------------t~~~~av~~~r~~g~~~~d~--~~~~~g~~~~~~~ 111 (325) T protein:vir:95 69 SGTVAEK--V---------------LK------------------HLVDTSVKVAAGTPPVRLDP--GQFRWIQQNPEVA 111 (325) T ss_pred Cceeccc--e---------------ec------------------cccceeeEEecccCcccccH--HHHhhcCCCHHHH Confidence 22 2111 1 11 23344444443333444443 3344445666666 Q ss_pred HHHHHHHHhhHHHHHHHHHHhccCceEEecCCCccceeeecccccccCCceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_020862. 152 SREMLRGANEITEDLLQADILASADVKVFTGAATSMVTMTGEAADAEDDGLITLKDLKRLSITLTDNYTPKKTTIIKGSR 231 (405) Q Consensus 152 ~~ell~~~~~~ted~l~~~ilag~~~v~yag~ats~~~~t~~~~~~~~n~~it~~~lr~~~~~Lk~nrApk~T~ii~gs~ 231 (405) ..++...-+......++..++++..-. ..+ .+..+ .....++...+..+|...|-++...|-++...- T Consensus 112 ~~~Ig~~~a~~~~~~~l~~~~~~l~~a-~~~-~~~~v-~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l--------- 179 (325) T protein:vir:95 112 GAAMGQQLAVDTMADMLNVGLGSVYSA-LSQ-VSDVV-YDATANTDAADKLPTWNNLNNGQAKFGDQSSQI--------- 179 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh-hcc-cccce-eeeecccCcccccccHHHHHHHHHHhcccccce--------- Confidence 666655444443333344444332111 111 11111 111112222345689999999999997766541 Q ss_pred ccCcccccceEEEEEcccchHHHHHHhcccCCCcceehhhcCCcccccCcceeEecCCcEEEEeCcchhhhhcCCCcccC Q lcl|NC_020862. 232 MTDTKTISASRIAYIGSELEIYITELVDSLGNPAFVPVEKYADAATIMNGEIGAIPGAHLRIVVVPQMMHYAGAGATATA 311 (405) Q Consensus 232 ~~gT~~I~~syv~~~h~dl~~dir~l~d~~~~p~fi~v~~Ya~~~~i~~gEIGsi~g~n~Rfv~~p~~~~~~~aGa~~~~ 311 (405) =+.++||....+|++.+ -+.-+++-+.+.+. .|+..-| -|+|+.. -+|-.+.|+ T Consensus 180 ----------~~~~MHS~v~~~L~~~~-------L~~~~~~~~~~g~~--~i~t~~G--~~VIVdD-~~p~~~~g~---- 233 (325) T protein:vir:95 180 ----------AAWIMHSTPMHKLYGSN-------LTNGERLFTYGTVN--VVRDPFG--KLLVMTD-SPNLFAAGT---- 233 (325) T ss_pred ----------eEEEEchHHHHHHHHhh-------ccccccccccCCcc--cccccCC--cEEEEeC-CCCCCCccC---- Confidence 46789999999998753 33223332222221 2444333 3666654 233222211 Q ss_pred CCcccccccccCCcceeeeEEEEEccccceeecce-----eccCCCCCCceEEEecCCCCCCCCC---CccchhhhHHHH Q lcl|NC_020862. 312 ANRGYQVSDVAGTDKYDIAPLLVVGDQAFATIGLQ-----GMSGKGKSKFRIIVKKPGEATADRN---DPYGKVGFSSIK 383 (405) Q Consensus 312 t~~~~~~~~~~g~~~~DVYp~lV~G~~Afg~i~l~-----g~~~~g~~~~~~ivk~pG~~tad~~---DPlgQrg~~gwK 383 (405) + .+|-+++||..|++.-.=+ ....++...+..-.. ++.. .|+ |+| T Consensus 234 -------------~--~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~tf~lhp~------G~s 286 (325) T protein:vir:95 234 -------------P--NVYHILGLVPGGVLIGQNNDFDANEETKNGDENIIRTYQ------AEWSYNIGVK------GFA 286 (325) T ss_pred -------------c--eeEEEEEEecCeEEecCCCCccccccccCcccceeeeee------eeeeEEeecc------eee Confidence 1 3899999999886532211 000111110000000 0000 122 222 Q ss_pred HHHHHhhccccc--------------------eEEEEEe Q lcl|NC_020862. 384 FFYGFIKLRGER--------------------IAVAYSV 402 (405) Q Consensus 384 ~~~~~~iL~~~~--------------------marie~~ 402 (405) |--+..-.++-. .+.+|+- T Consensus 287 w~~s~~g~sPt~aeL~~~~NW~rv~~~~K~tagv~~~~~ 325 (325) T protein:vir:95 287 WDKANGGKSPTDAALFTSTNWDKYATSHKDLAGVVVKTN 325 (325) T ss_pred eecccccCCcChHhhcCCcCcceecCCCccccceeEeeC Confidence 211111111100 1111111 Done!