Query         lcl|NC_019402.1_cdsid_YP_006987657.1 [gene=D858_gp107] [protein=major head protein] [protein_id=YP_006987657.1] [location=8547..9503]
Match_columns 318
No_of_seqs    79 out of 89
Neff          6.6
Searched_HMMs 1612
Date          Thu Nov 7 17:21:23 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr

 No Hit                             Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:8843 Length: 317 # 100.0 6E-115 4E-118 646.8 31.4 305 1-318 1-316 (317)
  2 protein:vir:96442 Length: 418  100.0 1E-69 6.4E-73 398.8 21.7 297 1-318 61-407 (418)
  3 protein:vir:103370 Length: 418 100.0 1E-63 6.5E-67 365.9 21.3 296 1-318 61-407 (418)
  4 protein:vir:97255 Length: 310   99.4 1.9E-14 1.2E-17 95.8 16.5 287 1-317 1-310 (310)
  5 protein:vir:94933 Length: 330   99.4 2.1E-14 1.3E-17 95.5 13.9 282 1-318 25-330 (330)
  6 protein:vir:80835 Length: 464   99.0 1.1E-11 6.8E-15 80.7 13.4 285 1-318 29-331 (464)
  7 protein:vir:191 Length: 385 #   98.9 6.3E-10 3.9E-13 71.0 18.5 275 1-318 105-385 (385)
  8 protein:vir:1886 Length: 385 #  98.9 6.3E-10 3.9E-13 71.0 18.5 275 1-318 105-385 (385)
  9 protein:vir:99311 Length: 463   98.9 1.6E-10 1E-13 74.2 15.2 284 1-318 26-338 (463)
 10 protein:vir:95603 Length: 463   98.9 1.6E-10 1E-13 74.2 15.2 284 1-318 26-338 (463)
 11 protein:vir:100851 Length: 514  98.9 3.1E-10 1.9E-13 72.7 16.5 291 1-318 52-370 (514)
 12 protein:vir:96666 Length: 462   98.8 6.6E-10 4.1E-13 70.9 16.6 286 1-318 26-338 (462)
 13 protein:vir:102823 Length: 470  98.8 4.4E-10 2.7E-13 71.9 15.0 294 1-318 1-338 (470)
 14 protein:vir:41 Length: 299 # N  98.8 1.7E-09 1E-12 68.7 17.8 272 1-318 6-299 (299)
 15 protein:vir:100135 Length: 418  98.8 1.7E-09 1.1E-12 68.6 17.9 275 1-318 136-416 (418)
 16 protein:vir:7771 Length: 330 #  98.7 5.3E-09 3.3E-12 65.9 17.1 288 1-318 1-324 (330)
 17 protein:vir:81070 Length: 390   98.6 9.9E-09 6.2E-12 64.5 17.1 272 1-315 113-390 (390)
 18 protein:vir:103955 Length: 324  98.6 1.8E-08 1.1E-11 63.1 18.3 270 1-318 21-316 (324)
 19 protein:vir:8187 Length: 311 #  98.6 1.4E-08 8.7E-12 63.6 17.6 280 1-318 1-311 (311)
 20 protein:vir:97148 Length: 324   98.6 2.4E-08 1.5E-11 62.4 18.4 267 1-318 26-316 (324)
 21 protein:vir:4339 Length: 395 #  98.6 1.8E-08 1.1E-11 63.0 17.7 276 1-317 111-395 (395)
 22 protein:vir:99749 Length: 324   98.6 3.7E-08 2.3E-11 61.3 18.2 270 1-318 21-316 (324)
 23 protein:vir:9820 Length: 272 #  98.5 8.9E-08 5.5E-11 59.2 20.0 260 1-318 1-270 (272)
 24 protein:vir:3033 Length: 272 #  98.5 8.9E-08 5.5E-11 59.2 20.0 260 1-318 1-270 (272)
 25 protein:vir:9309 Length: 324 #  98.5 5.1E-08 3.2E-11 60.5 18.4 272 1-318 26-316 (324)
 26 protein:vir:96223 Length: 324   98.5 4.2E-08 2.6E-11 61.0 17.9 269 1-318 21-316 (324)
 27 protein:vir:10364 Length: 390   98.5 3.9E-08 2.4E-11 61.2 17.4 272 1-315 113-390 (390)
 28 protein:vir:9574 Length: 300 #  98.5 2.4E-08 1.5E-11 62.4 15.9 279 1-317 1-300 (300)
 29 protein:vir:63741 Length: 468   98.5 3.4E-09 2.1E-12 67.0 11.2 291 1-318 33-338 (468)
 30 protein:vir:80491 Length: 467   98.5 3.6E-09 2.2E-12 66.9 11.1 291 1-318 32-337 (467)
 31 protein:vir:9759 Length: 303 #  98.5 2.4E-08 1.5E-11 62.4 15.6 280 1-317 1-303 (303)
 32 protein:vir:96123 Length: 274   98.5 1.7E-07 1.1E-10 57.6 20.1 257 1-318 1-271 (274)
 33 protein:vir:96392 Length: 324   98.5 6.9E-08 4.3E-11 59.8 17.8 269 1-318 27-316 (324)
 34 protein:vir:78830 Length: 324   98.5 6.9E-08 4.3E-11 59.8 17.8 269 1-318 27-316 (324)
 35 protein:vir:95763 Length: 297   98.5 8.9E-08 5.5E-11 59.2 18.3 268 1-318 9-297 (297)
 36 protein:vir:93742 Length: 274   98.5 3E-07 1.9E-10 56.4 20.7 258 1-318 1-271 (274)
 37 protein:vir:8102 Length: 543 #  98.5 8.4E-08 5.2E-11 59.4 17.6 274 1-318 249-543 (543)
 38 protein:vir:94771 Length: 298   98.4 6.9E-08 4.3E-11 59.8 16.7 280 1-316 1-298 (298)
 39 protein:vir:94142 Length: 304   98.4 1.5E-07 9.4E-11 58.0 18.1 272 1-316 1-304 (304)
 40 protein:vir:105905 Length: 304  98.4 1.5E-07 9.4E-11 58.0 18.1 272 1-316 1-304 (304)
 41 protein:vir:348 Length: 321 #   98.4 1.2E-08 7.2E-12 64.1 11.8 269 1-315 1-321 (321)
 42 protein:vir:81227 Length: 413   98.4 7.6E-08 4.7E-11 59.6 15.9 285 1-318 118-411 (413)
 43 protein:vir:97053 Length: 390   98.4 1.7E-07 1.1E-10 57.7 17.3 272 1-315 113-390 (390)
 44 protein:vir:2504 Length: 305 #  98.4 1.3E-07 7.9E-11 58.4 16.4 273 1-318 1-299 (305)
 45 protein:vir:104085 Length: 320  98.3 2.6E-07 1.6E-10 56.7 17.4 283 1-318 14-318 (320)
 46 protein:vir:97433 Length: 274   98.3 1E-06 6.4E-10 53.4 20.0 258 1-318 1-271 (274)
 47 protein:vir:94494 Length: 274   98.3 1E-06 6.4E-10 53.4 20.0 258 1-318 1-271 (274)
 48 protein:vir:94673 Length: 419   98.3 5.2E-07 3.3E-10 55.0 18.2 288 1-318 121-418 (419)
 49 protein:vir:1638 Length: 298 #  98.3 3.9E-07 2.4E-10 55.7 17.0 277 1-316 1-298 (298)
 50 protein:vir:485 Length: 407 #   98.2 1.7E-07 1.1E-10 57.7 14.5 280 1-318 106-401 (407)
 51 protein:vir:2430 Length: 318 #  98.2 4.5E-07 2.8E-10 55.4 16.5 278 1-318 14-314 (318)
 52 protein:vir:98339 Length: 415   98.2 5.1E-07 3.2E-10 55.1 16.4 276 1-318 120-405 (415)
 53 protein:vir:81100 Length: 415   98.2 5.1E-07 3.2E-10 55.1 16.4 276 1-318 120-405 (415)
 54 protein:vir:79987 Length: 415   98.2 5.1E-07 3.2E-10 55.1 16.4 276 1-318 120-405 (415)
 55 protein:vir:78523 Length: 338   98.2 8.6E-07 5.4E-10 53.8 17.6 291 1-318 1-336 (338)
 56 protein:vir:1328 Length: 392 #  98.2 1.2E-06 7.3E-10 53.1 17.7 277 1-318 111-392 (392)
 57 protein:vir:105334 Length: 276  98.1 1.6E-06 9.7E-10 52.4 18.0 258 1-318 1-271 (276)
 58 protein:vir:9410 Length: 415 #  98.1 5.4E-07 3.4E-10 54.9 15.3 275 1-318 120-405 (415)
 59 protein:vir:3158 Length: 321 #  98.1 6E-07 3.7E-10 54.7 15.4 281 1-318 1-312 (321)
 60 protein:vir:96833 Length: 275   98.1 1.7E-06 1E-09 52.3 17.4 257 1-318 1-272 (275)
 61 protein:vir:4600 Length: 415 #  98.1 1.8E-06 1.1E-09 52.1 17.6 276 1-318 120-405 (415)
 62 protein:vir:4700 Length: 415 #  98.1 1.8E-06 1.1E-09 52.1 17.6 276 1-318 120-405 (415)
 63 protein:vir:4226 Length: 326 #  98.0 1.7E-06 1.1E-09 52.2 16.5 283 1-318 20-324 (326)
 64 protein:vir:6242 Length: 390 #  98.0 3E-06 1.8E-09 50.9 16.9 273 1-318 110-390 (390)
 65 protein:vir:2344 Length: 397 #  98.0 1.4E-06 8.4E-10 52.8 14.7 276 1-318 10-307 (397)
 66 protein:vir:101607 Length: 379  98.0 3.2E-06 2E-09 50.7 16.5 266 1-317 106-379 (379)
 67 protein:vir:99920 Length: 311   97.9 2.8E-06 1.8E-09 51.0 15.4 280 1-317 1-311 (311)
 68 protein:vir:3613 Length: 272 #  97.9 1.2E-05 7.6E-09 47.5 19.3 254 1-317 1-272 (272)
 69 protein:vir:9704 Length: 394 #  97.9 2.8E-06 1.7E-09 51.0 15.3 262 1-318 128-391 (394)
 70 protein:vir:4856 Length: 293 #  97.9 6.4E-06 3.9E-09 49.1 17.1 266 1-318 5-282 (293)
 71 protein:vir:4511 Length: 409 #  97.9 2.2E-06 1.3E-09 51.6 14.4 279 1-318 115-407 (409)
 72 protein:vir:95898 Length: 274   97.9 1.4E-05 8.6E-09 47.2 19.3 257 1-318 1-271 (274)
 73 protein:vir:96262 Length: 274   97.9 1.4E-05 8.6E-09 47.2 19.3 257 1-318 1-271 (274)
 74 protein:vir:1239 Length: 274 #  97.8 1.7E-05 1E-08 46.8 19.6 256 1-318 1-271 (274)
 75 protein:vir:80684 Length: 315   97.8 4E-06 2.5E-09 50.2 14.6 276 1-318 1-307 (315)
 76 protein:vir:100247 Length: 425  97.8 3.9E-06 2.4E-09 50.2 14.4 280 1-318 130-425 (425)
 77 protein:vir:78223 Length: 333   97.8 1.6E-05 9.8E-09 46.9 17.3 291 1-318 1-333 (333)
 78 protein:vir:4953 Length: 397 #  97.7 2E-05 1.3E-08 46.3 17.2 261 1-318 109-386 (397)
 79 protein:vir:4456 Length: 401 #  97.7 7.2E-06 4.5E-09 48.8 14.6 280 1-317 107-401 (401)
 80 protein:vir:1433 Length: 435 #  97.7 1.9E-05 1.2E-08 46.4 16.6 273 1-318 130-434 (435)
 81 protein:vir:4159 Length: 315 #  97.6 9.6E-06 6E-09 48.1 14.5 281 1-314 16-315 (315)
 82 protein:vir:4092 Length: 390 #  97.6 8.6E-06 5.4E-09 48.3 14.0 270 1-318 84-369 (390)
 83 protein:vir:95376 Length: 425   97.6 9.3E-06 5.8E-09 48.2 14.2 275 1-318 138-422 (425)
 84 protein:vir:4997 Length: 397 #  97.6 1.8E-05 1.1E-08 46.6 15.7 262 1-318 109-386 (397)
 85 protein:vir:80930 Length: 278   97.6 2.6E-05 1.6E-08 45.7 16.4 264 1-318 1-278 (278)
 86 protein:vir:104256 Length: 458  97.6 1.8E-05 1.1E-08 46.5 15.4 281 1-318 161-458 (458)
 87 protein:vir:7855 Length: 497 #  97.6 3E-05 1.9E-08 45.4 16.6 289 1-318 151-494 (497)
 88 protein:vir:101650 Length: 497  97.6 3E-05 1.9E-08 45.4 16.6 289 1-318 151-494 (497)
 89 protein:vir:4197 Length: 314 #  97.6 2.8E-05 1.7E-08 45.6 16.1 283 1-318 11-314 (314)
 90 protein:vir:99424 Length: 360   97.5 2.6E-05 1.6E-08 45.7 15.8 295 1-318 20-358 (360)
 91 protein:vir:100172 Length: 394  97.5 2.4E-05 1.5E-08 45.9 15.5 269 1-318 111-385 (394)
 92 protein:vir:4830 Length: 397 #  97.5 1.5E-05 9.1E-09 47.1 14.3 263 1-318 109-386 (397)
 93 protein:vir:80376 Length: 435   97.5 5.2E-05 3.2E-08 44.1 17.2 276 1-318 130-434 (435)
 94 protein:vir:100884 Length: 389  97.4 3.6E-05 2.2E-08 44.9 15.2 269 1-318 109-383 (389)
 95 protein:vir:102119 Length: 404  97.4 7.1E-05 4.4E-08 43.3 16.6 273 1-318 110-401 (404)
 96 protein:vir:5739 Length: 366 #  97.2 0.00012 7.5E-08 42.1 16.6 271 1-317 64-366 (366)
 97 protein:vir:1025 Length: 408 #  97.1 0.00017 1.1E-07 41.2 17.0 263 1-318 116-394 (408)
 98 protein:vir:95107 Length: 270   97.1 0.00016 1E-07 41.3 15.6 253 1-318 1-266 (270)
 99 protein:vir:3991 Length: 404 #  97.0 0.0002 1.2E-07 40.9 17.8 263 1-318 116-394 (404)
100 protein:vir:105038 Length: 428  96.9 0.00025 1.5E-07 40.4 16.7 270 1-317 125-428 (428)
101 protein:vir:3870 Length: 400 #  96.9 0.00019 1.1E-07 41.0 14.6 259 1-318 133-400 (400)
102 protein:vir:739 Length: 231 #   96.9 0.00029 1.8E-07 40.0 18.6 224 31-317 1-231 (231)
103 protein:vir:7409 Length: 408 #  96.7 0.00041 2.5E-07 39.2 18.7 263 1-318 116-394 (408)
104 protein:vir:8420 Length: 477 #  96.5 0.00041 2.5E-07 39.2 13.9 291 1-318 154-472 (477)
105 protein:vir:102082 Length: 392  96.5 0.00054 3.3E-07 38.5 16.5 260 1-318 106-385 (392)
106 protein:vir:105004 Length: 392  96.5 0.00054 3.3E-07 38.5 16.5 260 1-318 106-385 (392)
107 protein:vir:102873 Length: 392  96.5 0.00054 3.3E-07 38.5 16.5 260 1-318 106-385 (392)
108 protein:vir:107593 Length: 392  96.5 0.00054 3.3E-07 38.5 16.5 260 1-318 106-385 (392)
109 protein:vir:1084 Length: 437 #  96.2 0.00078 4.8E-07 37.6 13.6 258 1-318 156-428 (437)
110 protein:vir:81160 Length: 371   96.1 0.00099 6.1E-07 37.1 16.1 260 1-317 91-371 (371)
111 protein:vir:96762 Length: 632   95.9 0.0013 7.8E-07 36.5 15.5 263 1-316 355-632 (632)
112 protein:vir:1268 Length: 397 #  95.8 0.0014 8.8E-07 36.2 15.4 257 1-317 123-397 (397)
113 protein:vir:9643 Length: 377 #  95.5 0.002 1.3E-06 35.4 14.0 275 1-317 79-377 (377)
114 protein:vir:962 Length: 397 #   95.3 0.0019 1.2E-06 35.5 12.4 258 1-317 132-397 (397)
115 protein:vir:3845 Length: 395 #  95.1 0.0027 1.7E-06 34.7 18.2 262 1-318 105-384 (395)
116 protein:vir:101291 Length: 381  94.8 0.0034 2.1E-06 34.1 13.7 276 1-318 76-369 (381)
117 protein:vir:9509 Length: 381 #  94.8 0.0034 2.1E-06 34.1 13.7 276 1-318 76-369 (381)
118 protein:vir:78350 Length: 383   94.4 0.0033 2E-06 34.2 11.3 272 1-318 83-376 (383)
119 protein:vir:95963 Length: 395   94.2 0.0052 3.2E-06 33.1 13.5 270 1-318 86-377 (395)
120 protein:vir:80068 Length: 301   94.0 0.0029 1.8E-06 34.5 10.3 273 1-315 1-301 (301)
121 protein:vir:100632 Length: 381  93.5 0.0072 4.5E-06 32.3 13.9 273 1-318 74-368 (381)
122 protein:vir:2685 Length: 387 #  93.5 0.0072 4.5E-06 32.3 12.9 262 1-318 116-382 (387)
123 protein:vir:94424 Length: 387   93.5 0.0072 4.5E-06 32.3 12.9 262 1-318 116-382 (387)
124 protein:vir:96978 Length: 387   93.5 0.0072 4.5E-06 32.3 12.9 262 1-318 116-382 (387)
125 protein:vir:78640 Length: 352   93.5 0.0073 4.5E-06 32.3 14.0 262 1-318 83-347 (352)
126 protein:vir:9361 Length: 402 #  93.4 0.0075 4.7E-06 32.2 13.3 262 1-318 131-397 (402)
127 protein:vir:6212 Length: 434 #  93.1 0.0088 5.5E-06 31.8 16.1 272 1-318 141-432 (434)
128 protein:vir:103886 Length: 302  92.9 0.0096 6E-06 31.6 11.6 274 3-318 1-302 (302)
129 protein:vir:79928 Length: 393   92.1 0.011 6.9E-06 31.3 10.6 284 1-318 59-379 (393)
130 protein:vir:80128 Length: 466   88.9 0.03 1.8E-05 29.0 12.2 276 1-318 148-449 (466)
131 protein:vir:93881 Length: 387   88.8 0.03 1.9E-05 28.9 13.9 262 1-318 116-382 (387)
132 protein:vir:93616 Length: 645   88.3 0.033 2.1E-05 28.7 16.8 268 1-318 334-640 (645)
133 protein:vir:98635 Length: 377   87.1 0.041 2.5E-05 28.2 10.6 271 1-317 79-377 (377)
134 protein:vir:9927 Length: 295 #  86.4 0.027 1.7E-05 29.2 8.2 256 1-318 1-287 (295)
135 protein:vir:102944 Length: 330  85.8 0.05 3.1E-05 27.7 14.4 271 1-318 1-307 (330)
136 protein:vir:1383 Length: 421 #  83.2 0.071 4.4E-05 26.9 16.6 260 1-318 114-384 (421)
137 protein:vir:79642 Length: 329   81.3 0.087 5.4E-05 26.4 12.4 274 1-318 26-329 (329)
138 protein:vir:108211 Length: 318  81.3 0.087 5.4E-05 26.4 15.5 284 1-318 1-318 (318)
139 protein:vir:104479 Length: 310  77.6 0.0087 5.4E-06 31.9 2.1 89 1-103 213-310 (310)
140 protein:vir:95318 Length: 328   77.2 0.13 7.9E-05 25.5 9.9 223 1-318 1-242 (328)
141 protein:vir:106647 Length: 303  73.1 0.17 0.00011 24.7 11.2 248 1-318 1-276 (303)
142 protein:vir:5974 Length: 324 #  70.5 0.21 0.00013 24.3 15.2 264 1-318 1-301 (324)
143 protein:vir:103285 Length: 296  69.9 0.22 0.00013 24.2 14.5 266 1-318 1-296 (296)
144 protein:vir:8324 Length: 410 #  60.6 0.14 8.7E-05 25.3 4.8 265 1-318 127-410 (410)
145 protein:vir:107687 Length: 319  54.3 0.51 0.00031 22.2 11.1 269 1-315 19-319 (319)
146 protein:vir:1583 Length: 351 #  49.9 0.63 0.00039 21.7 14.8 260 1-318 1-305 (351)
147 protein:vir:4074 Length: 480 #  48.7 0.66 0.00041 21.6 9.8 255 1-318 198-478 (480)
148 protein:vir:94622 Length: 341   43.0 0.86 0.00054 20.9 13.4 268 1-318 1-340 (341)
149 protein:vir:104342 Length: 314  40.6 0.96 0.0006 20.7 11.1 268 1-318 17-314 (314)
150 protein:vir:7324 Length: 335 #  24.5 2.2 0.0013 18.7 9.0 225 1-318 1-245 (335)
151 protein:vir:9875 Length: 296 #  24.0 2.2 0.0014 18.7 12.6 255 1-318 10-275 (296)
152 protein:vir:96490 Length: 348   20.6 2.7 0.0017 18.2 7.6 290 1-318 1-348 (348)

No 1
>protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597
Probab=100.00 E-value=6e-115 Score=646.78 Aligned_cols=305 Identities=28% Similarity=0.387 Sum_probs=284.1 Q ss_pred CC----ceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MA----TLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma----~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) || +|+||+++|+||||+|+|++|+|+||||+|+|+++++++++|+|+||+|++++ .|+++||+|+++++ T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~-------~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPG-------KNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCcc-------ccccccCccccccc Confidence 88 49999999999999999999999999999999999999999999999999753 58999999999999 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCcccc-CCCCCccchhhhHHHHH Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKV-DGSATVARQTAGFSALV 155 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~-~gs~t~~r~m~Gi~~~i 155 (318) .++|++++||||||+|+++||||+||++.+|+++|++||++||++|||||||++||+|++.. ++++++||+|+||++|| T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i 153 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYY 153 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHh Confidence 99999999999999999999999999999999999999999999999999999999998654 56678899999999999 Q ss_pred hcCCcccCccc------cceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 
156 AAKDAADPDTG------AIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 156 ~~~~~~~~~~g------~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) .+++.+.++++ ...++++++++|||++|++++|+||++||+++++||+|.+|++|++|+++.. .+..++ T Consensus 154 ~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~-----~~i~~~ 228 (317) T protein:vir:88 154 KTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRA-----TEITLD 228 (317) T ss_pred ccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCc-----eeEEEc Confidence 99876544322 2345678999999999999999999999999999999999999999987532 234567 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEEEEEEeEEEecccce Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMIEMEVGLRHRNPYAS 309 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~ 309 (318) +.++++|..|++|+|||| .|+||||||||++++++|||++|+++|||||+.|+|||+||++|+||++|+||||+||+|| T Consensus 229 ~~~~~~g~~v~~~~tdfG-~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKtGd~~k~~i~~E~tLe~~N~~a~ 307 (317) T protein:vir:88 229 ASDNRIAQTVDVYESDFG-KYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKTGDSEKRQLLVEYTFRVNNEKSG 307 (317) T ss_pred ccCeEEEEEEEEEEeCCe-EEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCCcccceeEEEEEEEEEEcCccce Confidence 789999999999999999 5999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeccC Q lcl|NC_019402. 
310 GILEVKAGA 318 (318) Q Consensus 310 g~i~~lt~a 318 (318) |+|++|+++ T Consensus 308 a~i~~l~~~ 316 (317) T protein:vir:88 308 ALIRDVVAQ 316 (317) T ss_pred eEEEEeccc Confidence 999999999 No 2 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=100.00 E-value=1e-69 Score=398.81 Aligned_cols=297 Identities=11% Similarity=0.028 Sum_probs=246.3 Q ss_pred CC--c------eeeeeeeeecccce----eeeEecC-CcccceeeeeccccccceEEEeeeeeccccCCcccccccccee Q lcl|NC_019402. 1 MA--T------LVSYDLNGKKLSFA----NWISNLS-PTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma--~------~~t~~~~~~~~dl~----d~I~~i~-p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~ 67 (318) |. + .+...+....+|=. +.|+.+. -.+--.++.|.+.+++..+..-.|.+-+.++.......+.+++ T Consensus 61 l~~~~~~~ta~~~a~~T~i~V~~~~~f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~e 140 (418) T protein:vir:96 61 MVFASAVVTAEALADATVLTVENSDGLTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFE 140 (418) T ss_pred eeeeeEEEEEEEecCceEEEecCCcccccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcc Confidence 11 1 11111222222211 1222111 2455667788888999999888887777887777777889999 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehhHHHH-hhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCC--- Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSDTANV-LANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSAT--- 143 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a-~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t--- 143 (318) ||+|+++++...+++++||||||++.|+||||||| +.++|+.+++++| +||++++|+++|++++.|++.+++++. 
T Consensus 141 EGsd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e-~d~l~~~kv~iE~ali~g~~~~~~~ng~p~ 219 (418) T protein:vir:96 141 EGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTYNGQPL 219 (418) T ss_pred cccccCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHH-HHHHHHHHHHHHHhhhccccccCCCCCccc Confidence 99999999999999999999999999999999999 5889999988888 799999999999999999999988764 Q ss_pred --ccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCC----CCc----CEEEEcchHHhhhhhh Q lcl|NC_019402. 144 --VARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSG----SEA----NIIMFHPKHAAFFSSL 213 (318) Q Consensus 144 --~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G----~~~----~~l~~~~~~k~~is~~ 213 (318) ++|+|+||++|+.+|. ..+++++++|+++|.++++++|..| ++. -.++|++.+|+.|++| T Consensus 220 ~~t~R~m~gI~~f~~~Nv----------i~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~ 289 (418) T protein:vir:96 220 HTTQGIVDAIRQYAPDNV----------NAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRF 289 (418) T ss_pred ccccchhHHHHhhccccc----------cccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhh Confidence 5699999999997762 3356678899999999999999833 322 2378999999999999 Q ss_pred hhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeec--CcccceecCC Q lcl|NC_019402. 214 METSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVL--RAPERTKLAK 286 (318) Q Consensus 214 ~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~L--r~~~~e~lak 286 (318) .+++ +....++++|..|++|+||||. 
++|++|||||+|+ |||||+++|+++|| |++++|.|+| T Consensus 290 ~~~I---------~~~~~en~~G~vv~~~~Td~G~-v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k 359 (418) T protein:vir:96 290 FGEV---------TVTQRETSYGMVFTEWKFFKGR-LIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQ 359 (418) T ss_pred hcee---------EeccccceeceEEEEEEeeccE-EEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhccc Confidence 6432 3567889999999999999995 9999999998777 69999999999999 9999999999 Q ss_pred Cc----------------cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 287 DG----------------SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 287 tG----------------d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) +| |++||+|++|+|||++||++||+|++|..| T Consensus 360 ~G~~~~~~~~~~~~~~~~D~~~G~l~~Eltle~~N~~a~a~itgl~~~ 407 (418) T protein:vir:96 360 GGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred CCCcccccccccccccccccccCEEEEEEEEEeecccccEEeeccccc Confidence 99 999999999999999999999999999999 No 3 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=100.00 E-value=1e-63 Score=365.86 Aligned_cols=296 Identities=12% Similarity=0.059 Sum_probs=239.6 Q ss_pred CC--c------eeeeeeeeecccce----eeeEecC-CcccceeeeeccccccceEEEeeeeeccccCCcccccccccee Q lcl|NC_019402. 1 MA--T------LVSYDLNGKKLSFA----NWISNLS-PTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma--~------~~t~~~~~~~~dl~----d~I~~i~-p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~ 67 (318) |. + .+...+....+|=. +.|+.+. 
-.|--.++.|.+.+++..+..-.+++-+.+++......+.+++ T Consensus 61 ~~~~~~~~ta~a~a~~T~l~ve~~~~f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~e 140 (418) T protein:vir:10 61 MVFASAVVTAEAAADATVLTVENSDGLTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFE 140 (418) T ss_pred EeeeeEEEEEEEecCceEEEEcCcceeccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccc Confidence 11 0 11111222233211 1222111 2455667788888999999999998887777777777889999 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehhHHHHh-hhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCcc- Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVL-ANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVA- 145 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~-~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~- 145 (318) ||+|+++++...+++++||||||++.|+||||+||+ ..+|.+|++++|..+++++ ++|||+++|+|++++++++++| T Consensus 141 EGsd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~-av~iEkalI~G~~~~~~~~~g~~ 219 (418) T protein:vir:10 141 EGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPL 219 (418) T ss_pred cccccCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHH-HHHHHHHHhcccccCCCcCCcch Confidence 999999999999999999999999999999999995 7789999999995555555 6799999999999999999886 Q ss_pred chhhhHHHHHhc---CCcccCccccceeeccCccccCHHHHHHHHHHHHhCC----CCc----CEEEEcchHHhhhhhhh Q lcl|NC_019402. 146 RQTAGFSALVAA---KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSG----SEA----NIIMFHPKHAAFFSSLM 214 (318) Q Consensus 146 r~m~Gi~~~i~~---~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G----~~~----~~l~~~~~~k~~is~~~ 214 (318) |+|+||+.++++ .+. ..++++.++|.++|.++++.+|..| ++- -.++|++.+|+.|++|. 
T Consensus 220 R~m~GIl~~vr~~~~gnV---------v~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~ 290 (418) T protein:vir:10 220 HTTQGIVDAVRQYAPDNV---------NAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF 290 (418) T ss_pred hhHHHHHHHHhhhcccce---------eccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh Confidence 999999988865 222 3344556899999999999997633 222 24789999999999997 Q ss_pred hhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCC-------CCCceEEEEehhhcceeec--CcccceecC Q lcl|NC_019402. 215 ETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRW-------MPENAVYFFTPSDWTQMVL--RAPERTKLA 285 (318) Q Consensus 215 ~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~-------m~~~~~~~~D~~~~~~~~L--r~~~~e~la 285 (318) +++ +....++.+|..|+++ +|| ..+|+|||| ||+|+|+++|+++++++|| |.+++|.|+ T Consensus 291 ~~I---------~~~~~e~~~G~vv~~~--~~~-~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~ 358 (418) T protein:vir:10 291 GEV---------TVTQRETSYGMVFTEW--KFF-KGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYG 358 (418) T ss_pred hhe---------eecccceeeeEEEEEE--Ecc-eEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcc Confidence 542 4456789999999999 666 367888888 9999999999999999999 999999999 Q ss_pred CCc----------------cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
286 KDG----------------SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 286 ktG----------------d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) |+| |++|++|++|++||++||++||+|++|..+ T Consensus 359 k~G~~~~~~~~~~~~~~~~D~~kG~iv~E~tLe~~N~~a~avitgl~~~ 407 (418) T protein:vir:10 359 QGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKA 407 (418) T ss_pred cCCCcccccccccccccccccccceEEEEeeeeeecccceEEeecccee Confidence 999 999999999999999999999999999888 No 4 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.41 E-value=1.9e-14 Score=95.78 Aligned_cols=287 Identities=13% Similarity=0.126 Sum_probs=174.1 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccc-cceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAI-NQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~-~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) ||.+|=-. .++-.-.+...|+..-..+-++|..+-=..+ .+.++-=++.++...+- .+......-+|..-.. . T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~--~~v~~~~~~~g~~~~~---~ 75 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIM--AGVGTTFSGAGAGKAA---A 75 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccc--ccccccccCCCccccc---c Confidence 88654322 1222222222222222333333332221111 12233333333322110 0000111112221111 2 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) +.....--+-|+...+.|-.--.....-...++.++|+..+.+.+++..|..||+|... .+...||...++.. 
T Consensus 76 t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a-------~n~F~GL~~~~~~~ 148 (310) T protein:vir:97 76 TFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA-------GNEFAGLIQLCASG 148 (310) T ss_pred ccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC-------CCcccchhhcCCcc Confidence 22344455677777777764433332222457999999999999999999999998532 13488999888654 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) +..+ +.+++.++|.++|++++-.+|..++.+..|++||..+++|..|..... +..+++.....+|.. T Consensus 149 q~i~--------~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~-----~~g~~~~~~~~~G~~ 215 (310) T protein:vir:97 149 QKAT--------TGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALG-----GASINEVVELPSGAE 215 (310) T ss_pred ceee--------cCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhc-----CCCCCCccccCCCCE Confidence 4433 234567789999999999999999999999999999999999876432 112333334456776 Q ss_pred EEEEEcCCCcEEEEEecCCCCCc----------eEEEEehhh--cceee--cC-----cccceecC--CCccceeeEEEE Q lcl|NC_019402. 239 VSSIVDPLGCQYKLVPNRWMPEN----------AVYFFTPSD--WTQMV--LR-----APERTKLA--KDGSYEKWMIEM 297 (318) Q Consensus 239 v~~~~tdfG~~v~iv~nr~m~~~----------~~~~~D~~~--~~~~~--Lr-----~~~~e~la--ktGd~~k~~i~~ 297 (318) |..| +- |.|+++.++|.+ .++++.+.. +...+ |. ....+.+. ...+..++.+.+ T Consensus 216 v~~~----~G-iPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~ 290 (310) T protein:vir:97 216 VPAY----SG-TPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKW 290 (310) T ss_pred Eeee----CC-eEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEE Confidence 6544 32 899999999853 477777664 33332 21 23456666 366888999999 Q ss_pred EEeEEEecccceeEEEEecc Q lcl|NC_019402. 
298 EVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 298 E~tLe~~N~~a~g~i~~lt~ 317 (318) -+++-+.+|+|.|++++.+- T Consensus 291 Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 291 YCGLALFSEKGLACADGITN 310 (310) T ss_pred eeeEEEecccceeeeccccC Confidence 99999999999999999999 No 5 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.36 E-value=2.1e-14 Score=95.53 Aligned_cols=282 Identities=13% Similarity=0.080 Sum_probs=173.0 Q ss_pred CCceeeeee-eeecccceeeeEecCCcccceeeeeccccc-cceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAI-NQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~-~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |+++|--++ .+-...+...|+..=.++.|++..+-=..+ .+.++-=++..+..++. ...+...+..... T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~---------r~~n~~~~~~~~~ 95 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQF---------LAVGGTITAKNPA 95 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCccee---------eeccccccccCcc Confidence 665544332 111122222233333334555554431112 11222223444432211 1112222221112 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcC-ccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYG-RGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G-~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) +.+.+.--|-++.-.+.|..--. +.+| ..+..++|+..+.+.++..+|..||+|. +.+++..||...+.. 
T Consensus 96 Tf~q~t~~l~~l~~~~~Vd~~ia--dl~g~~~d~~~~q~~~~ieal~~~~e~~linGD-------s~~~~F~GL~~~~~~ 166 (330) T protein:vir:94 96 TFTKVTSELTTLIGDAEVNGLIQ--ATRSDFMDQTSVQVASKAKSIGRQYQASMITGD-------GTGNSFQGMMGLVAA 166 (330) T ss_pred eeeeeeechhhhhhhHHHHHHHH--HhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccC-------CCCccccchhhcCCc Confidence 22444444666666666663332 2344 2478889999999999999999999983 124678899876654 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .+.. .+.+++.++|.++|++++-.+|..+|.+..|+++....++|..|.... .++.+++.....+|. T Consensus 167 ~q~i--------~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~-----~~~~v~~~~~~~~G~ 233 (330) T protein:vir:94 167 SQTI--------SAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRAL-----GGAAIGEVMTLPSGR 233 (330) T ss_pred ccEE--------ecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhc-----cCCCCCCcccccCCC Confidence 3332 233567889999999999999999999999999999999999996543 222233333344566 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCc----------eEEEEehh--hcceee--cC-----cccceecC--CCccceeeEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPEN----------AVYFFTPS--DWTQMV--LR-----APERTKLA--KDGSYEKWMIE 296 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~----------~~~~~D~~--~~~~~~--Lr-----~~~~e~la--ktGd~~k~~i~ 296 (318) .|..| .| +.|+++.++|.+ .++++.+. .++..+ |. ....+.+. .+.+..++.+. T Consensus 234 ~v~~~---~G--vPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~ 308 (330) T protein:vir:94 234 QIPTY---RG--VPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVK 308 (330) T ss_pred EEeee---CC--eEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEE Confidence 55444 23 789999988863 57777754 222222 22 33456766 56788999999 Q ss_pred EEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
297 MEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 297 ~E~tLe~~N~~a~g~i~~lt~a 318 (318) +-+++-+++|+|.|++++..-- T Consensus 309 ~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 309 MYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred EeeeeEEechhheeeeccccCC Confidence 9999999999999999988777 No 6 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=99.03 E-value=1.1e-11 Score=80.66 Aligned_cols=285 Identities=14% Similarity=0.084 Sum_probs=176.8 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeeccccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~da~~~~~ 77 (318) =+.-.+...-.-||+|.+.|.+++-.+.+|. .-|.+.+++|+.++|-... ++-. ....+ .-||..++.... T Consensus 29 ~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~-~~G~-----~g~~~f~~E~g~~~~~d~ 102 (464) T protein:vir:80 29 TPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYL-AHGR-----VGHTRFTREIGVAPISDP 102 (464) T ss_pred CcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheee-ccCc-----cccccccccccccccCCC Confidence 1122333466789999999999999888874 5677889999999998865 2211 11111 236666555543 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHh-----hhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCC-CC-ccchhhh Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVL-----ANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGS-AT-VARQTAG 150 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~-----~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs-~t-~~r~m~G 150 (318) .-+.+.-++- -+|+|.+.+ .+.+ .+-++.|...++.-+...+|++++.|-.+..-. 
.+ .-=+.+| T Consensus 103 ~~~Rr~~~~K-------fl~~~r~vsia~~lvn~~-~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDG 174 (464) T protein:vir:80 103 NLRQKTVNMK-------YVSDTKNMSIATGLVNNI-EDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDG 174 (464) T ss_pred ceEEEEEEee-------eeecceeeeeehhhhcch-hhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhh Confidence 3333333222 233333321 2223 478899999999999999999999986554211 11 1246999 Q ss_pred HHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhh-hhhhhhhcccccceEEEec Q lcl|NC_019402. 151 FSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFF-SSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 151 i~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~i-s~~~~~~~~~~~~r~~~~~ 229 (318) |..+|+.++..|. -+..||++.|+.+-..+=.+=|+++.++++...|-.| +.+... ..+....+ T Consensus 175 l~~lI~~~NViDa----------rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~-----q~~~~~~n 239 (464) T protein:vir:80 175 LAKLIDKHNVLDA----------KGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDR-----QVQVISDN 239 (464) T ss_pred hHhhcCCCceeec----------CCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCc-----eeEEEcCC Confidence 9999988876554 4677999999999888855447888899998888665 444321 22333445 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcc--eeecCccc-ceecCCC--c--cceeeEEEEEEeEE Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWT--QMVLRAPE-RTKLAKD--G--SYEKWMIEMEVGLR 302 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~--~~~Lr~~~-~e~lakt--G--d~~k~~i~~E~tLe 302 (318) +.....|..|..|.|..| .+++.++-+|....+ +|++..+ =++- +|. .-.++.+ | ..+..-...+|-+. T Consensus 240 ~~~~~~G~~v~~f~sa~G-~i~L~~s~~m~~~~~--ld~~~~~~~~apa-apsvt~tv~~~~~g~f~~~~~~~~~~Ykv~ 315 (464) T protein:vir:80 240 GQNATMGFNVKGFNSARG-FIRLHGSTVMELEQI--LDENRMQLPNAPQ-KATVKATLEAGTKGKFRDEDLTIDTEYKVV 315 (464) T ss_pred CCcceeeeeccccccccc-ceeccCccccCcccc--cccccccCCCCcC-CceeEEEecCCcccCCccccccceeEEEEE Confidence 556689999999999999 588888887754444 4444432 2222 222 1111211 1 12222233467888 Q ss_pred EecccceeEEEEeccC Q lcl|NC_019402. 
303 HRNPYASGILEVKAGA 318 (318) Q Consensus 303 ~~N~~a~g~i~~lt~a 318 (318) +.|..+-........+ T Consensus 316 ~vn~~GeS~ps~~~~~ 331 (464) T protein:vir:80 316 VVSDDAESAPSDVASV 331 (464) T ss_pred EECCCCccccceeeee Confidence 8887764333221111 No 7 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.90 E-value=6.3e-10 Score=71.03 Aligned_cols=275 Identities=12% Similarity=0.036 Sum_probs=158.6 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) |.+-.+.......+.+.+.|...--...|+++++......+..+++.......+. ..-.-||...+....+ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------a~~v~E~~~~~~~~~~-- 175 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNN-------ADVVAEKALKPESDIT-- 175 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcc-------eeeeccCccccccccc-- Confidence 3333333333456677777887778899999988776665555555543321110 1112366655544322 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCc Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDA 160 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~ 160 (318) .......+++-...+.=|.+.+... .+...|=...-...+.+-+|.++|+|. |++. ...||....... 
T Consensus 176 -~~~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~d~~~l~G~----g~~~---~~~Gi~~~~~~~-- 243 (385) T protein:vir:19 176 -FSKQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKEEGQLLNGD----GTGD---NLEGLNKVATAY-- 243 (385) T ss_pred -eeEEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCC---cccccccccccc-- Confidence 1222223333233333333444322 233444444556668888999999873 3332 245665422111 Q ss_pred ccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEE Q lcl|NC_019402. 161 ADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVS 240 (318) Q Consensus 161 ~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~ 240 (318) .....+....+.++|.+++.++-.++.....++|+|.....+..+. +.+ .++...+. . ..... T Consensus 244 --------~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk-d~~----G~~l~~~~--~--~~~~~ 306 (385) T protein:vir:19 244 --------DTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLK-DNE----GRYIFGGP--Q--AFTSN 306 (385) T ss_pred --------cccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cCC----CceeccCc--c--cCCCc Confidence 1122334556789999999999888888889999999888777654 321 23222111 1 01111 Q ss_pred EEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCCCc-----cceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 241 SIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAKDG-----SYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 241 ~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~laktG-----d~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) +=+| +.++.+++||++.+++.|++..-..+.| .+..+.....+ +.........++..+++|.|..+++. T Consensus 307 ---~l~G--~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~ 381 (385) T protein:vir:19 307 ---IMWG--LPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTF 381 (385) T ss_pred ---eecc--eeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEe Confidence 1268 6899999999999999999763332322 22222211111 23344555668999999999999999 Q ss_pred eccC Q lcl|NC_019402. 
315 KAGA 318 (318) Q Consensus 315 lt~a 318 (318) .++| T Consensus 382 ~aa~ 385 (385) T protein:vir:19 382 SSGS 385 (385) T ss_pred ccCC Confidence 9999 No 8 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.90 E-value=6.3e-10 Score=71.03 Aligned_cols=275 Identities=12% Similarity=0.036 Sum_probs=158.6 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) |.+-.+.......+.+.+.|...--...|+++++......+..+++.......+. ..-.-||...+....+ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------a~~v~E~~~~~~~~~~-- 175 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNN-------ADVVAEKALKPESDIT-- 175 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcc-------eeeeccCccccccccc-- Confidence 3333333333456677777887778899999988776665555555543321110 1112366655544322 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCc Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDA 160 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~ 160 (318) .......+++-...+.=|.+.+... .+...|=...-...+.+-+|.++|+|. |++. ...||....... 
T Consensus 176 -~~~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~d~~~l~G~----g~~~---~~~Gi~~~~~~~-- 243 (385) T protein:vir:18 176 -FSKQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKEEGQLLNGD----GTGD---NLEGLNKVATAY-- 243 (385) T ss_pred -eeEEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCC---cccccccccccc-- Confidence 1222223333233333333444322 233444444556668888999999873 3332 245665422111 Q ss_pred ccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEE Q lcl|NC_019402. 161 ADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVS 240 (318) Q Consensus 161 ~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~ 240 (318) .....+....+.++|.+++.++-.++.....++|+|.....+..+. +.+ .++...+. . ..... T Consensus 244 --------~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk-d~~----G~~l~~~~--~--~~~~~ 306 (385) T protein:vir:18 244 --------DTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLK-DNE----GRYIFGGP--Q--AFTSN 306 (385) T ss_pred --------cccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cCC----CceeccCc--c--cCCCc Confidence 1122334556789999999999888888889999999888777654 321 23222111 1 01111 Q ss_pred EEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCCCc-----cceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 241 SIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAKDG-----SYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 241 ~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~laktG-----d~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) +=+| +.++.+++||++.+++.|++..-..+.| .+..+.....+ +.........++..+++|.|..+++. T Consensus 307 ---~l~G--~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~ 381 (385) T protein:vir:18 307 ---IMWG--LPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTF 381 (385) T ss_pred ---eecc--eeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEe Confidence 1268 6899999999999999999763332322 22222211111 23344555668999999999999999 Q ss_pred eccC Q lcl|NC_019402. 
315 KAGA 318 (318) Q Consensus 315 lt~a 318 (318) .++| T Consensus 382 ~aa~ 385 (385) T protein:vir:18 382 SSGS 385 (385) T ss_pred ccCC Confidence 9999 No 9 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=98.90 E-value=1.6e-10 Score=74.22 Aligned_cols=284 Identities=15% Similarity=0.063 Sum_probs=179.0 Q ss_pred CCc-------eeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeecc Q lcl|NC_019402. 1 MAT-------LVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGS 70 (318) Q Consensus 1 Ma~-------~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~ 70 (318) |.+ -.+...-.-+|+|...|.+++-.+.+|. .-|.+.+++|+.|+|-... ++-. ....+ .-||. T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~-~~G~-----~g~~~f~~E~g 99 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYL-RHGN-----VGHSRFVKEIG 99 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeee-ccCc-----ccccccccccc Confidence 221 1223355679999999999999888874 4677889999999998865 2211 01111 23666 Q ss_pred ccccccccCcEEecceEEEEeeeeeehhHHHHhhhc----CccchHHHHHHHHHHHHHHHHHHHHhcCcccc-CCCCCcc Q lcl|NC_019402. 71 AAVDGERASTTVINNVTQILRKVVKVSDTANVLANY----GRGKELQYQMEKAGKEIKRDLEVALLRNGAKV-DGSATVA 145 (318) Q Consensus 71 da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~----G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~-~gs~t~~ 145 (318) .+.... .||.|+-...==++.|.+.+... ++.+-++.|.+.++.-+...+|++++.|-.+. 
++....- T Consensus 100 ~~~~~d-------~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~g 172 (463) T protein:vir:99 100 VAPVSD-------PNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEG 172 (463) T ss_pred ccccCC-------CceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccc Confidence 554444 44544433333366666654333 34467888999999999999999999986544 2222233 Q ss_pred chhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhh-hhhhhhcccccce Q lcl|NC_019402. 146 RQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFS-SLMETSGVTNGQR 224 (318) Q Consensus 146 r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is-~~~~~~~~~~~~r 224 (318) =+.+||...|+.++..|. -+..|+++.|+.+...+=.+=|+++.++++...|-.|. .|+. .+| T Consensus 173 leFDGl~~lId~enviDa----------rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~------~qr 236 (463) T protein:vir:99 173 LEFDGLAKLIDKNNVINA----------KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG------RQM 236 (463) T ss_pred cchhhhhhhcCCCCeeec----------CCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC------ceE Confidence 579999999998876554 46789999999998888554478889999999888886 4443 234 Q ss_pred EEEecCC-ceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhc--ceeecCcc---cceecCCCc---cceeeEE Q lcl|NC_019402. 225 MKMFDGQ-DTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDW--TQMVLRAP---ERTKLAKDG---SYEKWMI 295 (318) Q Consensus 225 ~~~~~~~-~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~--~~~~Lr~~---~~e~laktG---d~~k~~i 295 (318) ..+.... ....|..|..|.|..| .+++.++++|.... ++|.+.- -=++-.|- ..+.-.|.| +.+... T Consensus 237 v~~~~N~~~~~~G~~v~~f~s~~G-~I~L~~s~~m~~~~--il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~- 312 (463) T protein:vir:99 237 QLMQDNSGNVNTGYSVNGFYSSRG-FIKLHGSTVMENEL--ILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAG- 312 (463) T ss_pred EEEcCCCCceeeeeeccceeeeee-eeeeCCceecCCcc--cccchhhcCCCCccCceeEEEEeeccCCCCCCcccccc- Confidence 3333322 2489999999999999 69999999997555 4444432 11111111 112212222 222222 Q ss_pred EEEEeEEEeccccee----EEEEeccC Q lcl|NC_019402. 
296 EMEVGLRHRNPYASG----ILEVKAGA 318 (318) Q Consensus 296 ~~E~tLe~~N~~a~g----~i~~lt~a 318 (318) -.|-+.+.|-.+-. +++...++ T Consensus 313 -~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:99 313 -LSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred -eEEEEEEECCCCCcccchheeeeeee Confidence 24556666655432 33333222 No 10 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=98.90 E-value=1.6e-10 Score=74.22 Aligned_cols=284 Identities=15% Similarity=0.063 Sum_probs=179.0 Q ss_pred CCc-------eeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeecc Q lcl|NC_019402. 1 MAT-------LVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGS 70 (318) Q Consensus 1 Ma~-------~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~ 70 (318) |.+ -.+...-.-+|+|...|.+++-.+.+|. .-|.+.+++|+.|+|-... ++-. ....+ .-||. T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~-~~G~-----~g~~~f~~E~g 99 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYL-RHGN-----VGHSRFVKEIG 99 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeee-ccCc-----ccccccccccc Confidence 221 1223355679999999999999888874 4677889999999998865 2211 01111 23666 Q ss_pred ccccccccCcEEecceEEEEeeeeeehhHHHHhhhc----CccchHHHHHHHHHHHHHHHHHHHHhcCcccc-CCCCCcc Q lcl|NC_019402. 71 AAVDGERASTTVINNVTQILRKVVKVSDTANVLANY----GRGKELQYQMEKAGKEIKRDLEVALLRNGAKV-DGSATVA 145 (318) Q Consensus 71 da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~----G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~-~gs~t~~ 145 (318) .+.... .||.|+-...==++.|.+.+... ++.+-++.|.+.++.-+...+|++++.|-.+. 
++....- T Consensus 100 ~~~~~d-------~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~g 172 (463) T protein:vir:95 100 VAPVSD-------PNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEG 172 (463) T ss_pred ccccCC-------CceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccc Confidence 554444 44544433333366666654333 34467888999999999999999999986544 2222233 Q ss_pred chhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhh-hhhhhhcccccce Q lcl|NC_019402. 146 RQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFS-SLMETSGVTNGQR 224 (318) Q Consensus 146 r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is-~~~~~~~~~~~~r 224 (318) =+.+||...|+.++..|. -+..|+++.|+.+...+=.+=|+++.++++...|-.|. .|+. .+| T Consensus 173 leFDGl~~lId~enviDa----------rG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~------~qr 236 (463) T protein:vir:95 173 LEFDGLAKLIDKNNVINA----------KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG------RQM 236 (463) T ss_pred cchhhhhhhcCCCCeeec----------CCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC------ceE Confidence 579999999998876554 46789999999998888554478889999999888886 4443 234 Q ss_pred EEEecCC-ceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhc--ceeecCcc---cceecCCCc---cceeeEE Q lcl|NC_019402. 225 MKMFDGQ-DTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDW--TQMVLRAP---ERTKLAKDG---SYEKWMI 295 (318) Q Consensus 225 ~~~~~~~-~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~--~~~~Lr~~---~~e~laktG---d~~k~~i 295 (318) ..+.... ....|..|..|.|..| .+++.++++|.... ++|.+.- -=++-.|- ..+.-.|.| +.+... T Consensus 237 v~~~~N~~~~~~G~~v~~f~s~~G-~I~L~~s~~m~~~~--il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~- 312 (463) T protein:vir:95 237 QLMQDNSGNVNTGYSVNGFYSSRG-FIKLHGSTVMENEL--ILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAG- 312 (463) T ss_pred EEEcCCCCceeeeeeccceeeeee-eeeeCCceecCCcc--cccchhhcCCCCccCceeEEEEeeccCCCCCCcccccc- Confidence 3333322 2489999999999999 69999999997555 4444432 11111111 112212222 222222 Q ss_pred EEEEeEEEeccccee----EEEEeccC Q lcl|NC_019402. 
296 EMEVGLRHRNPYASG----ILEVKAGA 318 (318) Q Consensus 296 ~~E~tLe~~N~~a~g----~i~~lt~a 318 (318) -.|-+.+.|-.+-. +++...++ T Consensus 313 -~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:95 313 -LSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred -eEEEEEEECCCCCcccchheeeeeee Confidence 24556666655432 33333222 No 11 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=98.90 E-value=3.1e-10 Score=72.73 Aligned_cols=291 Identities=13% Similarity=0.047 Sum_probs=175.5 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeeccccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~da~~~~~ 77 (318) =+.-.+...-..+|||.++|.+++-.+.+|+ .-|.+.+++|+.|+|-... ++-. ....+ .-||..+..... T Consensus 52 ~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~-~~G~-----~G~~~f~~E~gi~~~~d~ 125 (514) T protein:vir:10 52 TPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYY-SHGR-----TGHSLFQPEIGIGDVNNP 125 (514) T ss_pred CCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhc-ccCc-----ccccccccccccCcCCCc Confidence 1112334456789999999999999999985 5788999999999998865 2211 11111 236665444443 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCcccc-CCCCCccchhhhHHHHHh Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKV-DGSATVARQTAGFSALVA 156 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~-~gs~t~~r~m~Gi~~~i~ 156 (318) .-+.+.-+.-=+. ....||--+.= ..|+-+-++.|.+.++.-+...+|++++.|-... 
++....+-+.+||...|+ T Consensus 126 ~~~rk~~~~k~l~-~~~~vS~~~~l--~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~ 202 (514) T protein:vir:10 126 NERQRTINIKYIV-DTHVTSIALQR--ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIA 202 (514) T ss_pred ceEEEEEeeeeee-eeeeeeehhhh--ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhc Confidence 3333333322111 12222222222 2256678889999999999999999999886543 233334578999999998 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhh-hhhhhhhcccccceEEEecC--Cce Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFF-SSLMETSGVTNGQRMKMFDG--QDT 233 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~i-s~~~~~~~~~~~~r~~~~~~--~~~ 233 (318) ..|..| .-+..|+++.|+.+...+=.+=|+++.++++...|-.| ..|+ ..+|. +..+ ... T Consensus 203 ~~NvID----------arG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~------~~qRV-~~~~n~~~~ 265 (514) T protein:vir:10 203 PENHID----------LRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHL------NGQRV-MLPGQTGGM 265 (514) T ss_pred CCCeEe----------cCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhccc------CcceE-EeecCccce Confidence 876554 45678999999999977766657888888888776644 3343 23444 3332 344 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcc---cceec-------CCCcccee-eEE------E Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAP---ERTKL-------AKDGSYEK-WMI------E 296 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~---~~e~l-------aktGd~~k-~~i------~ 296 (318) ..|..|..|.|-.| .+++.++-.|.....+=++....-.++--|- ..++. +.++++.. 
..+ + T Consensus 266 ~~G~~v~~f~s~~G-~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~ 344 (514) T protein:vir:10 266 TTGLDIDKFLSAHG-SIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVE 344 (514) T ss_pred eeeeeccceeEecc-ceeecCCeeecccccCccCCccCCcCCCCCcceEEEecCcccccCccccccccccccccccccee Confidence 88999999999999 5998888877655544333222222221111 11111 11111111 011 2 Q ss_pred EEEeEEEecccc----eeEEEEeccC Q lcl|NC_019402. 297 MEVGLRHRNPYA----SGILEVKAGA 318 (318) Q Consensus 297 ~E~tLe~~N~~a----~g~i~~lt~a 318 (318) -.|.+...|..+ +.+|+-..++ T Consensus 345 ~sYaVv~~n~~GeS~ps~~vtaT~a~ 370 (514) T protein:vir:10 345 QSYVAVMVSRHGDSRPSLVQTATPTK 370 (514) T ss_pred EEEEEEEECCCCcccccceeeeeeec Confidence 235566666554 2333333333 No 12 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=98.84 E-value=6.6e-10 Score=70.91 Aligned_cols=286 Identities=15% Similarity=0.066 Sum_probs=175.2 Q ss_pred CCc-------eeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeecc Q lcl|NC_019402. 1 MAT-------LVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGS 70 (318) Q Consensus 1 Ma~-------~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~ 70 (318) |.+ -.+...-.-||+|.+.|.+++-.+.+|. .-|.+.+++|+.++|-... ++-. ....+ .-||. T Consensus 26 ~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~-~~G~-----~g~~~f~~E~g 99 (462) T protein:vir:96 26 YQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYL-RHGN-----VGHSRFVREVG 99 (462) T ss_pred HhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeee-ccCc-----ccccccccccc Confidence 221 1122345678999999999999888874 5677899999999998865 2211 11111 23666 Q ss_pred ccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCcccc-CCCCCccchhh Q lcl|NC_019402. 
71 AAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKV-DGSATVARQTA 149 (318) Q Consensus 71 da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~-~gs~t~~r~m~ 149 (318) .++.....-+.+.-+.-=+. .+-.||-- +.-..++.+-++.|.+.++.-+...+|++++.|-.+. ++....-=+.+ T Consensus 100 ~~~~~d~~~~R~~~~~k~l~-~t~~vsi~--~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFD 176 (462) T protein:vir:96 100 VAPVSDPNIRQKTVEMKYVS-DTKNLSIA--STLVNNIQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFD 176 (462) T ss_pred ccccCCCceEEEEEEEEEEe-eeeeechh--hhhccchhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchh Confidence 65555443333333322111 12223211 2223467788999999999999999999999986544 22222236699 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhh-hhhhhhcccccceEEEe Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFS-SLMETSGVTNGQRMKMF 228 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is-~~~~~~~~~~~~r~~~~ 228 (318) ||..+|+.++..| .-+..||+++|+.....+=.+=|+++.++++...|-.|. .|+. .+|..+. T Consensus 177 Gl~~lI~~~NViD----------arG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~------~qrv~~~ 240 (462) T protein:vir:96 177 GLAKLIDKDNVID----------AKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLG------RQMQLMQ 240 (462) T ss_pred hhhhhcCCCceee----------cCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcC------ceEEEEc Confidence 9999998887554 456899999999999888433378888999999888886 4443 2443333 Q ss_pred cCC-ceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccce------ecCCCc--cceeeEEEEE- Q lcl|NC_019402. 229 DGQ-DTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERT------KLAKDG--SYEKWMIEME- 298 (318) Q Consensus 229 ~~~-~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e------~laktG--d~~k~~i~~E- 298 (318) ... ....|..|..|.|..| .+++.++++|....+ +|.+.-.+.- .|..- ..++.| -.+.. 
.+| T Consensus 241 ~n~g~~~~G~~v~~f~s~~G-~I~L~~s~~m~~~~i--~~~~~~~~p~--ap~~~~vsaTv~t~~~g~f~~~~d--~~~y 313 (462) T protein:vir:96 241 DNSGNVNAGYNVQGFYSSRG-FIKLHGSTVMENELI--LDESLQPLPN--APQPATVKATVETGKKGLFTDEHD--RAEL 313 (462) T ss_pred CCCCceeeeeeccceeeeee-eeeeCCceecCcccc--cccccccCCC--CCCCCceeEEEEeCCCCCCCCccC--ceeE Confidence 332 2489999999999999 699999999975444 5544432221 22111 122222 11111 244 Q ss_pred -EeEEEecccc----eeEEEEeccC Q lcl|NC_019402. 299 -VGLRHRNPYA----SGILEVKAGA 318 (318) Q Consensus 299 -~tLe~~N~~a----~g~i~~lt~a 318 (318) |-+...|..+ +.+++...++ T Consensus 314 ~Y~V~avs~dgeS~PS~~VtaTva~ 338 (462) T protein:vir:96 314 TYKVVVNSDDAQSAPSEAVTATVNN 338 (462) T ss_pred EEEEEEECCCCccccceeeEeeeec Confidence 3445555443 2223333222 No 13 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=98.82 E-value=4.4e-10 Score=71.89 Aligned_cols=294 Identities=10% Similarity=0.051 Sum_probs=173.9 Q ss_pred CC----------------ceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCcccccc Q lcl|NC_019402. 1 MA----------------TLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQK 62 (318) Q Consensus 1 Ma----------------~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~ 62 (318) |+ .-..+.....+|||.++|.+++-.+.+|+ .-|.+.+++|+.|+|-....++-. .. T Consensus 1 ~~~~~~~~~~~a~~~al~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~-----~g 75 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNAAGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDK-----IG 75 (470) T ss_pred CChhHhhhhhHHHHHHHHHhhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhcccccc-----cc Confidence 11 01111233689999999999999999985 578889999999999653333211 01 Q ss_pred ccceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCC Q lcl|NC_019402. 
63 RNAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA 142 (318) Q Consensus 63 ~na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~ 142 (318) ..+.=||..+......-+.+.- ..-.+..+..||.-+-.....|+.+.++.+...++.-+..-+|++++.|-.+.. |+ T Consensus 76 ~s~~~E~~l~~~~d~~~~Rr~v-~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~-s~ 153 (470) T protein:vir:10 76 YAAFREGGLPRTVEVNVVRRRI-RPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLG-DD 153 (470) T ss_pred ceeecccccCccCCCceEEEEE-EEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccc-cc Confidence 1122366665544322222222 222333344444333223445888999999999999999999999999865432 21 Q ss_pred ----CccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHh--CCCCcCEEEEcchHHhhhh-hhhh Q lcl|NC_019402. 143 ----TVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYL--SGSEANIIMFHPKHAAFFS-SLME 215 (318) Q Consensus 143 ----t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~--~G~~~~~l~~~~~~k~~is-~~~~ 215 (318) ..+=+.+||...|+.++.. -+.+.-++.|+++.|+.....|-. +=|+++.++++...|-.|. .|+ T Consensus 154 ~~g~~~gleFDGl~~lId~~~~~-------NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~- 225 (470) T protein:vir:10 154 VPGSPNNLQQDGIINIIKRGAPQ-------NVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFY- 225 (470) T ss_pred cCcccCceeccchhhhccCCCCc-------cccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhc- Confidence 1334799999999754211 134566788999999999998854 3368888999888877764 343 Q ss_pred hhcccccceEEEecC-CceEEEEEEEEEEcCCCcEEEEEecCCCCCceEE---EEehhhcceeecCcccc--------ee Q lcl|NC_019402. 216 TSGVTNGQRMKMFDG-QDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVY---FFTPSDWTQMVLRAPER--------TK 283 (318) Q Consensus 216 ~~~~~~~~r~~~~~~-~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~---~~D~~~~~~~~Lr~~~~--------e~ 283 (318) ..+|...... .....|..|..|.|-.| .+++.++-+|+....+ .||-+-=+++ .|.. .. 
T Consensus 226 -----~~qRv~~~~N~~~~~~G~~v~~f~sa~G-~I~L~~s~~m~~~~k~~p~~l~~~v~~~a---AP~~~~tv~~t~~~ 296 (470) T protein:vir:10 226 -----QISRVMTTADRRAGLLGADAQSYIGVRG-EHSLYPSQFLGDFHKFNPARFGAEVGDFA---APSNSWTVSTTDNF 296 (470) T ss_pred -----CceEEEEecCCCceeeeeeccceeeeee-eeeecccccccchhhcCcccCCcccCCcc---cCceeEEeecCCCc Confidence 2345444422 23579999999999999 6999888888753322 2444321111 1110 00 Q ss_pred cCCCccceee----EEEEEEeEEEecccc---eeEEEEeccC Q lcl|NC_019402. 284 LAKDGSYEKW----MIEMEVGLRHRNPYA---SGILEVKAGA 318 (318) Q Consensus 284 laktGd~~k~----~i~~E~tLe~~N~~a---~g~i~~lt~a 318 (318) .+-.-++.++ -=+++|..++.|-.+ +-+|...+++ T Consensus 297 ~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds~s~~v~vt~t~ 338 (470) T protein:vir:10 297 VTLPYNSGLGDPANTTVYSYAFKAANFYGESAAKYIDVYIDS 338 (470) T ss_pred eeecccCCCCcccCcceeEEEEEEEEecCCCCcceEEEEEee Confidence 0000012212 112244444444332 2233222222 No 14 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.81 E-value=1.7e-09 Score=68.72 Aligned_cols=272 Identities=11% Similarity=-0.021 Sum_probs=159.3 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |...++.+ ....-+.+++.|...-....|+.++....+.++....+...+...+ ...-||.+.+...... 
T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~~~f 76 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSGVGA---------FWVDEAERIQTSKPTF 76 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcCCce---------eeeecCccccccccce Confidence 55444433 3345566778887777788888877655555554444444332111 1234777665433221 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) . .-...+.+-...+.=|.+.+.-.. .+...+=+..-...+.+.+|.++|+|. |++. --||..-.... T Consensus 77 ~---~v~l~~~k~~~~~~is~ell~ds~-~~~~~~i~~~l~~a~~~~~d~a~l~G~----g~~~----~~gil~~~~~~- 143 (299) T protein:vir:41 77 T---KAKMRSKKMGVIIPTTKENLNYSV-TNFFSLMQAEIVEAFYKKFDQAVFTGV----ESPY----NWNILKSATDA- 143 (299) T ss_pred e---EEEEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHHhhcc----cCcc----ccccccccccc- Confidence 1 112223333333444455554322 233333444555668999999999874 2221 12443321111 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) .........+.++|.+++.++-.++..+..++|++.....+.++. +.+ .++-..+.. ..+. 
T Consensus 144 ----------~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~--~~~~-- 204 (299) T protein:vir:41 144 ----------SNLVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTK-DGN----GMPIFNTAT--SNGV-- 204 (299) T ss_pred ----------ceeeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhh-ccC----CceeecCCc--CCCC-- Confidence 112233456889999999999988888888999999988888764 221 222222111 1111 Q ss_pred EEEEcCCCcEEEEEecCCCCCce----EEEEehhhcceeecCcccceecC-----------------CCccceeeEEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENA----VYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIEME 298 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~----~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~~E 298 (318) . +=+| +.++.+.+||++. +++.|++++-+..-+++..+.+- ..-+....+.+.. T Consensus 205 ~---~l~G--~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 279 (299) T protein:vir:41 205 D---DVLG--LPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFE 279 (299) T ss_pred c---eecc--eeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 1 2367 6899999999887 99999998755444454433221 1223445566778 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +++.+++|+|..+|+..++. T Consensus 280 ~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 280 VGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred eccEEecccceEEEEeccCC Confidence 99999999999999999888 No 15 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.81 E-value=1.7e-09 Score=68.63 Aligned_cols=275 Identities=15% Similarity=0.028 Sum_probs=156.9 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) +.+-++..-....+.+...|+..-....|++.++......+...+|.......+ ...-.-||.+.+....... T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-------~a~~v~E~~~~~~~~~~f~ 208 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTN-------NAAAVAEGAQKPTSDLKFN 208 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCC-------ceeeeccCcccccccccee Confidence 111122223345667777788878888999888877666555555555432111 0112347776554432211 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCc Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDA 160 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~ 160 (318) ...-.... +...+.||. +.+... .+..+|=...-...+.+-+|.+||+|. |++.. ..||+..... T Consensus 209 ~v~~~~~k-~~~~~~is~--ell~ds--~~l~~~i~~~l~~a~~~~~d~a~l~G~----g~~~~---p~Gi~~~~~~--- 273 (418) T protein:vir:10 209 LKNQPVRT-IAHLFKASR--QILDDA--PALQSYIDGRARYGLQLTEEGQILKGD----GTGAN---ILGILPQASA--- 273 (418) T ss_pred eEEEeeee-EEEeehhhH--HHHHhH--HHHHHHHHHHHHHHHHHHHHHHHhccC----CCCcc---cccccccccc--- Confidence 11111111 222233443 333322 244444444556678999999999873 33322 3466643211 Q ss_pred ccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEE Q lcl|NC_019402. 161 ADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVS 240 (318) Q Consensus 161 ~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~ 240 (318) .....++....+.++|.+++.++...+.....++|+|.....+..+. +. ..++.. +.... +. . 
T Consensus 274 -------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~----~G~~i~-~~~~~--~~-~- 336 (418) T protein:vir:10 274 -------FMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTK-DS----QGRYIV-GNPVN--GT-T- 336 (418) T ss_pred -------ccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cC----CCceec-ccccc--CC-C- Confidence 11122333445678888888888888888888999999877776654 22 123222 11100 10 0 Q ss_pred EEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCCC-----ccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 241 SIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAKD-----GSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 241 ~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~lakt-----Gd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) -+=+| +.++.+.+||++.+++.|++..-+.+.| .+..+-..-. -+....+.+..+...+++|+|...++. T Consensus 337 --~~l~G--~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~ 412 (418) T protein:vir:10 337 --PRLWN--LPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGAL 412 (418) T ss_pred --ceecc--eeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEe Confidence 12368 6899999999999999999864333322 2222221112 244456667779999999999999999 Q ss_pred eccC Q lcl|NC_019402. 315 KAGA 318 (318) Q Consensus 315 lt~a 318 (318) .+++ T Consensus 413 ~~~~ 416 (418) T protein:vir:10 413 VEQA 416 (418) T ss_pred ccCC Confidence 9999 No 16 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.69 E-value=5.3e-09 Score=65.95 Aligned_cols=288 Identities=10% Similarity=-0.094 Sum_probs=158.0 Q ss_pred CCc---------eeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccc Q lcl|NC_019402. 
1 MAT---------LVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSA 71 (318) Q Consensus 1 Ma~---------~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~d 71 (318) |++ .+........+.+.++|...-....|+..+.-.....+....+....-... ..-.-||.. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVS--------ASWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcc--------eeEecCCCc Confidence 554 223333445567777777777788888887665555444444443221111 111337777 Q ss_pred cccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhH Q lcl|NC_019402. 72 AVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGF 151 (318) Q Consensus 72 a~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi 151 (318) .+....+..... ..+.+=...|.=|.+.+.... .+-.++=...-.+.+.+.+|+++|+|. |++ ....|| T Consensus 73 ~~~~~~~f~~i~---~~~~k~~~~~~is~ell~ds~-~~~~~~i~~~l~~ai~~~~~~~~l~G~----g~~---~~~~g~ 141 (330) T protein:vir:77 73 KPITKGSFGKQE---LEPVKITTIFAESAEVVRLNP-LNYLNTMRTKIAEAIALKFDAAAIHGI----DKP---SAFKGY 141 (330) T ss_pred cccccceeeEEE---EeEEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhccc----CCC---Cccccc Confidence 665543322221 222222222233334433222 233444445556678899999999884 222 224566 Q ss_pred HHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCC Q lcl|NC_019402. 152 SALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQ 231 (318) Q Consensus 152 ~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~ 231 (318) +..+....... +....+..+......++|.+++.+++.++.....++||+.....+..+. +.+ .|+...+.. 
T Consensus 142 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~ 213 (330) T protein:vir:77 142 LAETTKVVSLA---DTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAV-DGN----GRPLFVEST 213 (330) T ss_pred cccccccceee---cccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHh-ccC----CceeecCcc Confidence 55443221111 1111222333445567888888999998888888999999988887654 321 233222111 Q ss_pred ceEEEEEEEEEEcCCCcEEEEEecCCCCCce------EEEEehhhcceeecCcccceec----CCC-------------- Q lcl|NC_019402. 232 DTRLNVYVSSIVDPLGCQYKLVPNRWMPENA------VYFFTPSDWTQMVLRAPERTKL----AKD-------------- 287 (318) Q Consensus 232 ~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~------~~~~D~~~~~~~~Lr~~~~e~l----akt-------------- 287 (318) .. .+...-.-.+=+| +.++.+.+||++. +++.|++..-+..-.++..+-+ .+. T Consensus 214 ~~-~~~~~~~~~~l~G--~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~ 290 (330) T protein:vir:77 214 YT-EQVGAIREGRILG--RPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLIS 290 (330) T ss_pred cc-ccccccCCceecc--eeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccc Confidence 00 0000000112357 6888999998754 7888988865443333332211 111 Q ss_pred ---ccceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 288 ---GSYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 288 ---Gd~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) -|....+....+...+++|+|..+|+..++. T Consensus 291 ~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 291 LWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 1455667777899999999999999998877 No 17 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.64 E-value=9.9e-09 Score=64.46 Aligned_cols=272 Identities=11% Similarity=0.028 Sum_probs=154.6 Q ss_pred CCce-eeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATL-VSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~-~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +.+- ++..-....+.+...|...-....|+.+++......+..+.|....-.... ....-||...+...... T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------a~~v~Eg~~~~~~~~~~ 185 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNN-------AAIVAEGALKPESSLKF 185 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcc-------eeeecCCccccccccee Confidence 2211 112222344455555655556667888877665555555566554322110 11234777666543322 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) ....-+.. -+...+.||. +.+.... +...+-...-...+.+-+|.+||+|. |.+.. ..||....... T Consensus 186 ~~i~~~~~-k~~~~~~is~--ell~d~~--~~~~~i~~~l~~~~~~~~d~a~l~G~----g~~~~---~~Gi~~~~~~~- 252 (390) T protein:vir:81 186 AKKTDTTH-VIAHTMKATR--QILSDAP--QLASYMNNRLIRGLKVKEDAEILRGT----GANDG---LLGLIPQATTY- 252 (390) T ss_pred eEEEEeee-EEEEeehhhH--HHHHhHH--HHHHHHHHHHHHHHHHHHHHHHHhcC----CCCCc---ccceeeccccc- Confidence 22222222 2222344554 3443332 44555555667778899999999873 33322 34665322111 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ......+...+.++|.+++-++-.++.....++|||.....|..+. +. ..++...+.... +. 
T Consensus 253 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk-d~----~G~~l~~~~~~~--~~-- 314 (390) T protein:vir:81 253 ---------AAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAK-DA----NNQYLIGNARGT--LT-- 314 (390) T ss_pred ---------ccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cC----CCceeecCcccc--cC-- Confidence 1122334455678899999999888888888999999888887664 22 123322221111 11 Q ss_pred EEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-ccccee----cCCCccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTK----LAKDGSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~----laktGd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) -+=+| ++++.+.+||++.+++.|++..-..+.| .+..+. .--+-+....+++..+...+++|.|..+++. T Consensus 315 ---~~l~G--~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~ 389 (390) T protein:vir:81 315 ---PTLWG--LPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSF 389 (390) T ss_pred ---ceecc--eeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEe Confidence 12378 5899999999999999999874333333 222211 1112345667778889999999999999988 Q ss_pred e Q lcl|NC_019402. 315 K 315 (318) Q Consensus 315 l 315 (318) . T Consensus 390 a 390 (390) T protein:vir:81 390 A 390 (390) T ss_pred C Confidence 8 No 18 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.63 E-value=1.8e-08 Score=63.07 Aligned_cols=270 Identities=11% Similarity=0.031 Sum_probs=153.3 Q ss_pred CC-----ceeeee--eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccc Q lcl|NC_019402. 
1 MA-----TLVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAV 73 (318) Q Consensus 1 Ma-----~~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~ 73 (318) ++ +.++.+ ....-+.+.+.|...-....|++++....+..+..+++..-+.... ..-.-||++.+ T Consensus 21 ~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~ 92 (324) T protein:vir:10 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIE 92 (324) T ss_pred cceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcc--------eeEeccCcccc Confidence 11 111111 2234455667776666778888887665555544444444221111 11133888777 Q ss_pred cccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 74 DGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 74 ~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) ..........-+... +...+.||. +.+.... .+...|=...-.+.+.+-+|.++|+|. |++. ...|+.. T Consensus 93 ~~~~~~~~v~~~~~k-~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~ai~~~~d~a~l~G~----g~~~---~~~~i~~ 161 (324) T protein:vir:10 93 TSKATWVNATMRAFK-LGVILPVTK--EFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP---FGKSIAQ 161 (324) T ss_pred ccccceeEEEEeeEE-EEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhhcC----CCCc---cCccccc Confidence 654332222222222 223344443 3333222 233444444555778899999999874 2222 2234444 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCce Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~ 233 (318) .+... ...+..+++.++|.+++.++-.++..+..++++|.....+..+. +. ..+.....+.. 
T Consensus 162 ~~~~~------------~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~-d~----~g~~~~~~~~~- 223 (324) T protein:vir:10 162 SIEKT------------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DP----ETKERIYDRNS- 223 (324) T ss_pred ccccc------------ceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-cc----CCceeecCCCC- Confidence 33221 12344578999999999999888888888999999888887663 22 12222222211 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCC--CCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeE Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRW--MPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWM 294 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~--m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~ 294 (318) .+-+|. .++.+.. .+.+.+++.|++.+-+..-+++..+-+- ..-|....+ T Consensus 224 ---------~~l~G~--PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 292 (324) T protein:vir:10 224 ---------DTLDGL--PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ---------ccccce--eEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 123673 4444444 4566789999998776665555443221 112456666 Q ss_pred EEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 295 IEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 295 i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ....++..+.+|.|..+|++.++. T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 293 ATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEEEEccEEecccceEEEEeccCC Confidence 777799999999999999998776 No 19 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.62 E-value=1.4e-08 Score=63.63 Aligned_cols=280 Identities=12% Similarity=0.005 Sum_probs=151.0 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) ||+..+.. ...-+.+.+.|...-..+.|+..+.......+-..++...+.... ..-..||.+.+....+.. T Consensus 1 mat~~~gg-~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~--------a~wv~Eg~~~~~~~~~f~ 71 (311) T protein:vir:81 1 MVALATGT-FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPR--------GEVVGEGAQKSESTATFA 71 (311) T ss_pred CceecCCc-eEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCce--------eEEeecCcccccccceee Confidence 98766643 345566778777777778888776554433333334433222111 111347777665432211 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCc--cchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGR--GKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~--~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) ...=+. .-+..-+.||. +-...... .+...+=..+....+.+.+|.++++|... +. .-...|+...+... T Consensus 72 ~v~l~~-~kl~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~--~~---~~~~~gi~~~~~~~ 143 (311) T protein:vir:81 72 PVTAIP-RKVQVTQRFSQ--EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINP--LT---GAALSGSPAKILDT 143 (311) T ss_pred EEEEee-EEEEEeehhhH--HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccC--CC---Cccccccccccccc Confidence 111111 11122223332 22211110 11223333445566889999999988521 11 12244555443221 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) . .......+.......++..++.++...+.+++.+++||.....+.++. +.+ .++...+... +.. 
T Consensus 144 ~-------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~~---~~~ 208 (311) T protein:vir:81 144 T-------NIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR-DSQ----GRKLYPELGF---GTD 208 (311) T ss_pred c-------eeeeecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhh-ccC----CCeeecCccc---cCC Confidence 1 111222333334456788888888788888888999999988887764 322 2221111100 000 Q ss_pred EEEEEcCCCcEEEEEecCCCC------------------CceEEEEehhhcceeecCcccceecCCCc-----------c Q lcl|NC_019402. 239 VSSIVDPLGCQYKLVPNRWMP------------------ENAVYFFTPSDWTQMVLRAPERTKLAKDG-----------S 289 (318) Q Consensus 239 v~~~~tdfG~~v~iv~nr~m~------------------~~~~~~~D~~~~~~~~Lr~~~~e~laktG-----------d 289 (318) .. +=+| +.++.+..|| ...+++.|++.+-+...+++..+-+ ..+ | T Consensus 209 ~~---tl~G--~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~ 282 (311) T protein:vir:81 209 VA---SFAG--LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELI-EFGDPDGLGDLKRQN 282 (311) T ss_pred Cc---eecc--eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEe-ccCCCCcchhhhhcC Confidence 11 1145 4566666555 3457899999877777666554332 222 2 Q ss_pred ceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 290 YEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 290 ~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) -...+.+..++..+++|+|..+|++.+.| T Consensus 283 ~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 283 QIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cEEEEEEEEeccEeecccceEEEEeeccC Confidence 24455567799999999999999999999 No 20 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.61 E-value=2.4e-08 Score=62.41 Aligned_cols=267 Identities=12% Similarity=0.027 Sum_probs=154.1 Q ss_pred CCce--eeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 
1 MATL--VSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~--~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) +.+. ++......-+.+.+.|...=....|+..+.-.....+..+.+...+.... ..-.-||...+..... T Consensus 26 a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~--------a~~v~Eg~~~~~~~~~ 97 (324) T protein:vir:97 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIETSKAT 97 (324) T ss_pred cccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcc--------eeEeccCccccccccc Confidence 2112 22223334566777777777888888887665555444444443221111 1123377776654332 Q ss_pred CcEEecceEEEEe---eeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 79 STTVINNVTQILR---KVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 79 ~~~~~~N~tQIf~---~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) +.+++--.+ ..+.||. +.+.... -+...+=...-...+.+.+|.++|+|. |++. .-.||...+ T Consensus 98 ----f~~v~~~~~k~~~~~~is~--ell~ds~-~~l~~~i~~~l~~aia~~~d~a~l~G~----g~~~---~~~gi~~~~ 163 (324) T protein:vir:97 98 ----WVNATMRAFKLGVILPVTK--EFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP---FGKSIAQSI 163 (324) T ss_pred ----eeEEEEeeEEEEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhccC----CCCc---cCccccccc Confidence 333332222 3333443 3333222 233444445566778899999999884 2222 234554433 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) ... ...+..+++.++|.++..++-.++..+..++|+|.....+..+. +.+ .++....+... 
T Consensus 164 ~~~------------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk-d~~----g~~~~~~~~~~-- 224 (324) T protein:vir:97 164 EKT------------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDRNSD-- 224 (324) T ss_pred ccc------------ceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-cCC----CceeecCCCCc-- Confidence 221 22344668999999999999888888888999999887776653 221 23322222211 Q ss_pred EEEEEEEEcCCCcEEEEEecC--CCCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeEEE Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNR--WMPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIE 296 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr--~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~ 296 (318) +=+|. .++.+. ..+.+.+++.|++++-+..-.++..+-.- ..-|....++. T Consensus 225 --------tl~G~--PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~ 294 (324) T protein:vir:97 225 --------TLDGL--PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) T ss_pred --------cccce--eeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEE Confidence 23663 444433 35567799999988766655554432221 11255666677 Q ss_pred EEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 297 MEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 297 ~E~tLe~~N~~a~g~i~~lt~a 318 (318) ..+...+.+|+|..+|++.+.. T Consensus 295 ~r~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 295 MHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEeccEEecccceEEEEeccCC Confidence 7899999999999999998886 No 21 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.61 E-value=1.8e-08 Score=62.98 Aligned_cols=276 Identities=13% Similarity=0.013 Sum_probs=155.9 Q ss_pred CCceeeee---eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 
1 MATLVSYD---LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~---~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) -...++.. -....++++..|...-....|+.+++......+..++|.......+. ..-.-||...+.... T Consensus 111 ~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~-------a~~v~E~~~~~~~~~ 183 (395) T protein:vir:43 111 RSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNN-------AAPVSEGTQKPYSDL 183 (395) T ss_pred hhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCc-------eeeecCCcccccccc Confidence 01112222 22345667777777777889999998887777766777665432211 111347776665443 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) .. ..+.--.+-+...+.||.. .+... .+..+|=...-...+.+-+|.++|+|. |.+.. ..||...... T Consensus 184 ~~-~~i~~~~~k~~~~~~is~e--ll~d~--~~l~~~v~~~la~a~~~~~d~~~l~G~----g~~~~---~~Gi~~~~~~ 251 (395) T protein:vir:43 184 TF-ELENAPVRTIAHLFKASRQ--ILDDA--SALQSYIDARARYGLMLVEECQLLYGN----GTGAN---LHGIIPQAQA 251 (395) T ss_pred ce-eEEEEeeeeEEEeehhhHH--HHHhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCc---cccccccccc Confidence 22 2222222233334445543 33322 233344444555677889999999873 33221 4566543211 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .. +....+.......++|.+++.++-.++.....+++||.....+..+. +. +.++...+... 
+ T Consensus 252 ~~--------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~----~G~~i~~~~~~---~- 314 (395) T protein:vir:43 252 YA--------PPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNK-DA----ENRYIIGSPQN---G- 314 (395) T ss_pred cc--------cccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhh-cc----CCceecccccc---C- Confidence 10 11112333445677777777777777777778999999877776654 22 12332211100 1 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCCCc-----cceeeEEEEEEeEEEecccceeE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAKDG-----SYEKWMIEMEVGLRHRNPYASGI 311 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~laktG-----d~~k~~i~~E~tLe~~N~~a~g~ 311 (318) ... +=|| ++|+.+.+||++.+++.|++..-+.+.| .+..+-....+ +...+.....+...+++|.|... T Consensus 315 ~~~---~l~G--~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~ 389 (395) T protein:vir:43 315 TTP---TLWR--LPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVT 389 (395) T ss_pred CCc---eecc--eeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEE Confidence 111 2368 6899999999999999999875444443 33333222222 33345556678999999999999 Q ss_pred EEEecc Q lcl|NC_019402. 312 LEVKAG 317 (318) Q Consensus 312 i~~lt~ 317 (318) ++..++ T Consensus 390 ~~~taa 395 (395) T protein:vir:43 390 GSLTAS 395 (395) T ss_pred EEeccC Confidence 976666 No 22 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.56 E-value=3.7e-08 Score=61.34 Aligned_cols=270 Identities=11% Similarity=0.025 Sum_probs=154.1 Q ss_pred CCc-----ee--eeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccc Q lcl|NC_019402. 
1 MAT-----LV--SYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAV 73 (318) Q Consensus 1 Ma~-----~~--t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~ 73 (318) +++ .+ +......-+.+.+.|...-....|+.++....+..+..+.+...+-... ..-.-||...+ T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~ 92 (324) T protein:vir:99 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIE 92 (324) T ss_pred hhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eeEeccCcccc Confidence 111 11 1112234556777777777888888887766555444444443321111 11234888777 Q ss_pred cccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 74 DGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 74 ~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) ..........-+... +...+.||. +.+.... .+...|=...-.+.+.+-+|.++|+|. |++. ...|+.. T Consensus 93 ~~~~~~~~v~~~~~k-~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~ai~~~~d~~~l~G~----g~~~---~~~~~~~ 161 (324) T protein:vir:99 93 TSKATWVNATMRAFK-LGVILPVTK--EFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP---FGKSIAQ 161 (324) T ss_pred ccccceeEEEEeeEE-EEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHHHhhhcC----CCCc---cCccccc Confidence 655433222222222 223444554 3332221 233444445556668899999999874 2221 2234433 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCce Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~ 233 (318) .+.. ....+..+++.++|.+++.++-.++..+..++|+|.....+..+. +.+ .+.....+.. 
T Consensus 162 ~~~~------------~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~-d~~----g~~~~~~~~~- 223 (324) T protein:vir:99 162 SIEK------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDRNS- 223 (324) T ss_pred cccc------------cceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-cCC----CceeecCCCC- Confidence 2221 122344568999999999999888888888999999888877653 221 2222222211 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCC--CceEEEEehhhcceeecCcccceecC-----------------CCccceeeE Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMP--ENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWM 294 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~--~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~ 294 (318) .+-+| +.++.+..++ .+.+++.|++.+-+..-.++..+..- -.-|....+ T Consensus 224 ---------~~l~G--~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r 292 (324) T protein:vir:99 224 ---------DTLDG--LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ---------ccccc--eeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 12367 3555555544 45688889998766655555433221 112455666 Q ss_pred EEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 295 IEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 295 i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ....++..+.+|.|..+|++.+.. T Consensus 293 ~~~r~d~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 293 ATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEEEEccEEecccceEEEEeccCC Confidence 677799999999999999998877 No 23 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=98.55 E-value=8.9e-08 Score=59.23 Aligned_cols=260 Identities=15% Similarity=0.132 Sum_probs=148.1 Q ss_pred CCceeeee-eeeecccceeeeEecCCc-------ccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccc Q lcl|NC_019402. 
1 MATLVSYD-LNGKKLSFANWISNLSPT-------DTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAA 72 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~-------~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da 72 (318) ||..+|.. ....-+-+++.|..--+. -+.-.+.-+..--+-..+.|. .+..+.+ .-||.+. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~--~~~~a~~---------v~eg~~i 69 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD--YIGDAED---------VAEGEAI 69 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec--CCCCccc---------ccCCCcc Confidence 99654433 444444555544211111 111011111100112234563 2222222 2378777 Q ss_pred ccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHH Q lcl|NC_019402. 73 VDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFS 152 (318) Q Consensus 73 ~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~ 152 (318) +....+.....-.+.+ +.+.+.||+-..... ..+-+++-..+....+.|.+|..++..-.. . T Consensus 70 ~~~~~~~~~~~~~~~~-~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---a----------- 131 (272) T protein:vir:98 70 PMTQLGFKKTTMTIKK-AGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVLDALSK---S----------- 131 (272) T ss_pred cccccccceEEEEeee-eeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHhcc---c----------- Confidence 7666665555555555 356788886654332 245566666677778889999998842110 0 Q ss_pred HHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 153 ALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 153 ~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ........+.+.|.++++++=+++...+.++|||.....+.+..... ...... .+.. 
T Consensus 132 ------------------~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~----~~~~~~-~~~~ 188 (272) T protein:vir:98 132 ------------------TQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE----WLGATE-VGAN 188 (272) T ss_pred ------------------ccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc----cccccc-cccc Confidence 00112335788999999998888888899999999876664321000 000000 0000 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) -..... +-+=.| ++|+.+.+||.++++++++..+.+..-+++..|.. ++. ..+......-|++.+.||.+.- T Consensus 189 ~~~~g~---ig~i~G--~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~-~~~~i~~~~~~~~~v~~~~~vv 262 (272) T protein:vir:98 189 RVVSGV---YGEVLG--VQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDITK-AINQIVANKHYGVYLYKAEKAV 262 (272) T ss_pred cccccc---chhhcC--eeEEEcCCCCcceEEEEcCCeEEEEecCCceeeecccccc-ceeEEEEEEEEEEEEEcCCceE Confidence 000001 112246 68999999999999999999888876666554322 222 2344444556899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) .++.-+++ T Consensus 263 ~~t~~~a~ 270 (272) T protein:vir:98 263 KITLKDAA 270 (272) T ss_pred EEEecccc Confidence 99998888 No 24 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=98.55 E-value=8.9e-08 Score=59.23 Aligned_cols=260 Identities=15% Similarity=0.132 Sum_probs=148.1 Q ss_pred CCceeeee-eeeecccceeeeEecCCc-------ccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccc Q lcl|NC_019402. 
1 MATLVSYD-LNGKKLSFANWISNLSPT-------DTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAA 72 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~-------~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da 72 (318) ||..+|.. ....-+-+++.|..--+. -+.-.+.-+..--+-..+.|. .+..+.+ .-||.+. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~--~~~~a~~---------v~eg~~i 69 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD--YIGDAED---------VAEGEAI 69 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec--CCCCccc---------ccCCCcc Confidence 99654433 444444555544211111 111011111100112234563 2222222 2378777 Q ss_pred ccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHH Q lcl|NC_019402. 73 VDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFS 152 (318) Q Consensus 73 ~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~ 152 (318) +....+.....-.+.+ +.+.+.||+-..... ..+-+++-..+....+.|.+|..++..-.. . T Consensus 70 ~~~~~~~~~~~~~~~~-~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~---a----------- 131 (272) T protein:vir:30 70 PMTQLGFKKTTMTIKK-AGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVLDALSK---S----------- 131 (272) T ss_pred cccccccceEEEEeee-eeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHhcc---c----------- Confidence 7666665555555555 356788886654332 245566666677778889999998842110 0 Q ss_pred HHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 153 ALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 153 ~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ........+.+.|.++++++=+++...+.++|||.....+.+..... ...... .+.. 
T Consensus 132 ------------------~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~----~~~~~~-~~~~ 188 (272) T protein:vir:30 132 ------------------TQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE----WLGATE-VGAN 188 (272) T ss_pred ------------------ccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc----cccccc-cccc Confidence 00112335788999999998888888899999999876664321000 000000 0000 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) -..... +-+=.| ++|+.+.+||.++++++++..+.+..-+++..|.. ++. ..+......-|++.+.||.+.- T Consensus 189 ~~~~g~---ig~i~G--~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~-~~~~i~~~~~~~~~v~~~~~vv 262 (272) T protein:vir:30 189 RVVSGV---YGEVLG--VQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDITK-AINQIVANKHYGVYLYKAEKAV 262 (272) T ss_pred cccccc---chhhcC--eeEEEcCCCCcceEEEEcCCeEEEEecCCceeeecccccc-ceeEEEEEEEEEEEEEcCCceE Confidence 000001 112246 68999999999999999999888876666554322 222 2344444556899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) .++.-+++ T Consensus 263 ~~t~~~a~ 270 (272) T protein:vir:30 263 KITLKDAA 270 (272) T ss_pred EEEecccc Confidence 99998888 No 25 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.54 E-value=5.1e-08 Score=60.54 Aligned_cols=272 Identities=11% Similarity=0.019 Sum_probs=151.0 Q ss_pred CCceeeee--eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 
1 MATLVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) +-+.++.+ ....-+.+.+.|...-....|+.++.......+..+.+...+-... ..-.-||.+.+..... T Consensus 26 a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~--------a~~v~Eg~~~~~~~~~ 97 (324) T protein:vir:93 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIETSKAT 97 (324) T ss_pred cccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eeeecCCccccccccc Confidence 11122222 2234566778777777888898888766555444444333221111 1123488777654432 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) .....-+. .-+...+.||.- .+.... .+...+=...-.+.+.+.+|.++|.|. |++. ...|+...+... T Consensus 98 f~~i~~~~-~k~~~~~~iS~e--ll~ds~-~~l~~~i~~~l~~aia~~~d~a~l~G~----g~~~---~~~~~~~~~~~~ 166 (324) T protein:vir:93 98 WVNATMRA-FKLGVILPVTKE--FLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP---FGKSIAQSIEKT 166 (324) T ss_pred eeEEEEEe-EEEEEeehhhHH--HHhcch-HHHHHHHHHHHHHHHHHHHHHHHhcCC----CCCC---cCcccccccccc Confidence 21111111 122233444432 222111 122233333444678999999999884 2221 123444332221 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) ...+...++.++|.+++.++-.++..+..++|++.....+..+. +.+ .++....+... 
T Consensus 167 ------------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~-d~~----G~~~~~~~~~~----- 224 (324) T protein:vir:93 167 ------------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDRNSD----- 224 (324) T ss_pred ------------ceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-CCC----CCeeecCCCCC----- Confidence 12234567899999999999888888889999999988887653 322 23322222211 Q ss_pred EEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeEEEEEEeE Q lcl|NC_019402. 239 VSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIEMEVGL 301 (318) Q Consensus 239 v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~~E~tL 301 (318) +=+|..|.+.++...+.+.+++.|++++-+..-.++..+..- ..-|....+....+++ T Consensus 225 -----~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~ 299 (324) T protein:vir:93 225 -----SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred -----cccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 125633222234456677899999998766554454332211 1225567777888999 Q ss_pred EEecccceeEEEEeccC Q lcl|NC_019402. 302 RHRNPYASGILEVKAGA 318 (318) Q Consensus 302 e~~N~~a~g~i~~lt~a 318 (318) .+.+|.|..+|++..+- T Consensus 300 ~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:93 300 HIADDKAFAKLVPADKR 316 (324) T ss_pred EEecccceEEEeccccc Confidence 99999999999877665 No 26 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.53 E-value=4.2e-08 Score=61.03 Aligned_cols=269 Identities=12% Similarity=0.031 Sum_probs=152.6 Q ss_pred CCc-----ee--eeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccc Q lcl|NC_019402. 
1 MAT-----LV--SYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAV 73 (318) Q Consensus 1 Ma~-----~~--t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~ 73 (318) ++. .+ +......-+.+.++|...=....|++++.......+..+.|...+.... ..-.-||...+ T Consensus 21 ~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~ 92 (324) T protein:vir:96 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIE 92 (324) T ss_pred hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eeeecCCcccc Confidence 111 11 1112233466777777777888888888776666555556655332211 11234787766 Q ss_pred cccccCcEEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhh Q lcl|NC_019402. 74 DGERASTTVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAG 150 (318) Q Consensus 74 ~~~~~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~G 150 (318) ..... +.+++ .-+...+.||.-.. .. ...+...|=...-.+.+.+.+|.++|.|. |++.. -.| T Consensus 93 ~~~~~----f~~v~~~~~k~~~~~~is~ell--~d-s~~~l~~~i~~~l~~aia~~~d~~~l~G~----g~~~~---~~~ 158 (324) T protein:vir:96 93 TSKAT----WVNATMRAFKLGVILPVTKEFL--NY-TYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNPF---GKS 158 (324) T ss_pred ccccc----eeEEEEEeEEEEEeehhhHHHH--hc-chHHHHHHHHHHHHHHHHHHHHHHhhhcC----CCCCc---Ccc Confidence 54332 33333 22334444554322 21 11233334444555668999999999874 22222 223 Q ss_pred HHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecC Q lcl|NC_019402. 151 FSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDG 230 (318) Q Consensus 151 i~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~ 230 (318) +...+.. ....+...++.++|.+++.++-.++..+..++|++.....+..+. 
+.+ .++....+ T Consensus 159 ~~~~~~~------------~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lk-d~~----G~~~~~~~ 221 (324) T protein:vir:96 159 IAQSIKK------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDR 221 (324) T ss_pred ccccccc------------cceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-CCC----CCeeecCC Confidence 3322211 112334567899999999999888888889999999888877653 322 23222222 Q ss_pred CceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CC---------------Cccceee Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AK---------------DGSYEKW 293 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--ak---------------tGd~~k~ 293 (318) ... +=+|..|.+.++-.++.+.+++.|++.+-+..-.++..+-. +. .-|.... T Consensus 222 ~~~----------~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~ 291 (324) T protein:vir:96 222 NSD----------SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) T ss_pred CCC----------cccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEE Confidence 211 22563332333444566779999999876665555443222 11 1234556 Q ss_pred EEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 294 MIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 294 ~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ++...+.+.+++|.|..+|++.... T Consensus 292 r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 292 RATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred EEEEEeccEEecccceEEEeccccc Confidence 6677799999999999999877666 No 27 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.52 E-value=3.9e-08 Score=61.21 Aligned_cols=272 Identities=11% Similarity=0.029 Sum_probs=147.9 Q ss_pred CCce-eeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATL-VSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~-~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +.+. ++..-....+.+.+.|...--...|+..++...+..+....|......... ..-.-||...+...... T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------a~~v~Eg~~~~~~~~~~ 185 (390) T protein:vir:10 113 ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNN-------AAIVAEGALKPESSLKF 185 (390) T ss_pred hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcc-------eeeecCCccccccccce Confidence 1111 111111233344444444444566888776665555544555543322110 01123777655543221 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) ....-+. .-+...+.||... +... .+..+|-..+-...+.+-+|.++|+|. |++.. ..||+...... T Consensus 186 ~~i~~~~-~k~~~~~~is~el--l~d~--~~l~~~i~~~l~~~~~~~~~~~il~G~----G~~~~---p~Gi~~~~~~~- 252 (390) T protein:vir:10 186 AKKTDTT-HVIAHTMKATRQI--LSDA--PQLASYMNNRLIRGLKVKEDAEILRGT----GANDG---LLGLIPQATTY- 252 (390) T ss_pred eEEEEee-EEEEEeehhhHHH--HHhH--HHHHHHHHHHHHHHHHHHHHHHHhhcC----CCCcc---ccccccccccc- Confidence 1111111 1222334455433 3322 243444444555678889999999873 33322 34665432111 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ....+.....+.+++.+++.++-.++.....+++||.....+..+. +. ..++...+.... +. 
T Consensus 253 ---------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk-d~----~g~~l~~~~~~~--~~-- 314 (390) T protein:vir:10 253 ---------AAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAK-DA----NNQYLIGNARGT--LT-- 314 (390) T ss_pred ---------cccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cC----CCceeecCCcCc--CC-- Confidence 1112233345678888998888888888888999999887777664 22 123322221111 10 Q ss_pred EEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccce----ecCCCccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERT----KLAKDGSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e----~laktGd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) -+=+| +.++.+++||++++++.|++..-..+.| .+..+ .-.-+.+....+....+...+++|.|..+++. T Consensus 315 ---~~l~G--~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~ 389 (390) T protein:vir:10 315 ---PTLWG--LPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSF 389 (390) T ss_pred ---ceecc--eeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEe Confidence 12278 5899999999999999999863222222 22221 11123466677777889999999999998888 Q ss_pred e Q lcl|NC_019402. 315 K 315 (318) Q Consensus 315 l 315 (318) . T Consensus 390 a 390 (390) T protein:vir:10 390 A 390 (390) T ss_pred C Confidence 7 No 28 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.51 E-value=2.4e-08 Score=62.40 Aligned_cols=279 Identities=10% Similarity=-0.015 Sum_probs=151.7 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) ||.-++..-....+.+.++|...-....|+..+.......+....+...+..+.+ .-.-||.+.+..... T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a--------~wv~Eg~~~~~s~~~-- 70 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDI--------DIVAENGKKTHGGVS-- 70 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcce--------EEeeCCccccccccc-- Confidence 9987777666778888888887777788887654443333222333332211111 123377766543322 Q ss_pred EEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 81 TVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 81 ~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) +.+++ .-+..-+.||.-...-......+...+=..+-.+.+.+-+|.++|.|.....|.+..+. |+..+-.. T Consensus 71 --f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~---~~~~~~~~ 145 (300) T protein:vir:95 71 --LDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTII---GDNCFDKK 145 (300) T ss_pred --ceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccc---cccccccc Confidence 22222 12333444554332100111112233333445677899999999988644334332221 11111000 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) ...........+.++|.+++.++-..+.++..+++||.....+..+.... .++.. . .... +. 
T Consensus 146 ----------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~-----G~~i~-~-~~~~-~~ 207 (300) T protein:vir:95 146 ----------VTQTVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAE-----GGKLY-P-ELAW-GG 207 (300) T ss_pred ----------cceeecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccC-----CCeec-c-Cccc-cC Confidence 00111223345678899999988888888888999999988887775221 22211 1 1111 11 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCce------EEEEehhh-cceeecCcccceec--C-CC--------ccceeeEEEEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENA------VYFFTPSD-WTQMVLRAPERTKL--A-KD--------GSYEKWMIEMEV 299 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~------~~~~D~~~-~~~~~Lr~~~~e~l--a-kt--------Gd~~k~~i~~E~ 299 (318) .. -+=+| +.++.+..+|.+. +|+.|++. +.+.+-+.+..+-. + .+ -|....+.+..+ T Consensus 208 ~~---~~l~G--~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~ 282 (300) T protein:vir:95 208 VP---DAING--LAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI 282 (300) T ss_pred CC---ceecc--eeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEee Confidence 11 12367 5788888887543 67789874 33333334332221 1 11 122344556678 Q ss_pred eEEEecccceeEEEEecc Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAG 317 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~ 317 (318) ++.+++|.|..+|++.++ T Consensus 283 d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 283 GWGIMDAASFARIVKTGG 300 (300) T ss_pred cceeecccceEEEecCCC Confidence 999999999999999999 No 29 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=98.50 E-value=3.4e-09 Score=67.01 Aligned_cols=291 Identities=13% Similarity=0.075 Sum_probs=170.7 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeeccccccccc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~da~~~~~ 77 (318) =+.-.+...-.-+|+|...|.+++-.+.+|. .-|.+.+++|+.|+|-... ++-. ....+ .-||..+..... T Consensus 33 ~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~-~~G~-----~g~~~f~~E~g~~~~~~~ 106 (468) T protein:vir:63 33 TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYM-QHGK-----VGHTRFTREIGVAPVSDP 106 (468) T ss_pred CCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhheeee-ccCc-----cccccccccccccccCCC Confidence 1111223456789999999999999888874 4567889999999998865 2211 11111 236666555544 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCC-CC-ccchhhhHHHHH Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGS-AT-VARQTAGFSALV 155 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs-~t-~~r~m~Gi~~~i 155 (318) .-+.+.-++- -+..+-.||.-+.- ..++.+-++.|.+.++.-+...+|++++.|-....-+ .+ ..=+-+||...| T Consensus 107 ~~~r~~~~~k-~l~~~~~vs~~~~l--~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li 183 (468) T protein:vir:63 107 NIRQKTVNMK-FASDTKNISIAAGL--VNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI 183 (468) T ss_pred ceEEEEEEee-eeeeeeeehhhhhh--hcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEe Confidence 3333333222 12223334433333 3456688899999999999999999999987655211 11 124689999999 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhh-hhhhhhhcccccceEEEecCCceE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFF-SSLMETSGVTNGQRMKMFDGQDTR 234 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~i-s~~~~~~~~~~~~r~~~~~~~~~~ 234 (318) +.++..| .-+..||+++|+.+...+=..=|.+..++++...|-.| +.+..- ..+...-++.... 
T Consensus 184 ~~enviD----------a~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~-----q~~v~~~n~~~~~ 248 (468) T protein:vir:63 184 NQDNVHD----------ARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-----QTQLVRDNGNNVS 248 (468) T ss_pred cCCceec----------cCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCc-----eEEEEcCCCCcee Confidence 8876554 45677999999999877765437778889988888665 443211 1222223345567 Q ss_pred EEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeec-Cccc---ceecCCCcc-ceeeEEEEEEeEEEecccce Q lcl|NC_019402. 235 LNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVL-RAPE---RTKLAKDGS-YEKWMIEMEVGLRHRNPYAS 309 (318) Q Consensus 235 ~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~L-r~~~---~e~laktGd-~~k~~i~~E~tLe~~N~~a~ 309 (318) .|..|..|.|.-| .+++..+-+|....++ |++......= -|+. .-...++|- +...--.-+|.+.+.|..+- T Consensus 249 ~G~~v~g~~sa~G-~I~l~gs~il~~~~~l--~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GE 325 (468) T protein:vir:63 249 VGFNIQGFHSARG-FIKLHGSTVMENEQIL--DERILALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAE 325 (468) T ss_pred eeecccceeccee-eeeecCceeeccccCC--CcccccccccccCCccceeeecccCCcccCCCcceEEEEEEEECCCCc Confidence 8999999999999 6998877666444443 3333222210 0111 011111111 11111113455556555432 Q ss_pred eE----EEEeccC Q lcl|NC_019402. 310 GI----LEVKAGA 318 (318) Q Consensus 310 g~----i~~lt~a 318 (318) .. ++..-+| T Consensus 326 S~pS~~vtvTVaa 338 (468) T protein:vir:63 326 SIASEVATATVTA 338 (468) T ss_pred cccccceEEEecC Confidence 22 2222222 No 30 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=98.50 E-value=3.6e-09 Score=66.89 Aligned_cols=291 Identities=13% Similarity=0.075 Sum_probs=170.9 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeeeccccCCccccccccc-eeeccccccccc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDALAPVADPSDAQKRNA-VIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na-~~EG~da~~~~~ 77 (318) =+.-.+...-.-+|+|...|.+++-.+.+|. .-|.+.+++|+.|+|-... ++-. ....+ .-||..+..... T Consensus 32 ~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~-~~G~-----~g~~~f~~E~g~~~~~~~ 105 (467) T protein:vir:80 32 TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYM-QHGK-----VGHTRFTREIGVAPVSDP 105 (467) T ss_pred CCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeee-ccCc-----cccccccccccccccCCC Confidence 1111233456789999999999999888874 4667889999999998865 2211 11111 236666555544 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCC-CC-ccchhhhHHHHH Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGS-AT-VARQTAGFSALV 155 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs-~t-~~r~m~Gi~~~i 155 (318) .-+.+.-++- -+..+-.||.-+.- ..++.+-++.|.+.++.-+...+|++++.|-....-+ .+ ..=+-+||...| T Consensus 106 ~~~r~~~~~k-~l~~~~~vs~~~~l--~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li 182 (467) T protein:vir:80 106 NIRQKTVNMK-FASDTKNISIAAGL--VNNIQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLI 182 (467) T ss_pred ceEEEEEEee-eeeeeeeehhhhhh--hcchhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEe Confidence 3333333222 12223334433333 3456688899999999999999999999987655211 11 124689999999 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhh-hhhhhhhcccccceEEEecCCceE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFF-SSLMETSGVTNGQRMKMFDGQDTR 234 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~i-s~~~~~~~~~~~~r~~~~~~~~~~ 234 (318) +.++..| .-+..||+++|+.+...+=..=|.+..++++...|-.| +.+..- ..+...-++.... 
T Consensus 183 ~~enviD----------a~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~-----q~~v~~~n~~~~~ 247 (467) T protein:vir:80 183 NQDNVHD----------ARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSK-----QTQLVRDNGNNVS 247 (467) T ss_pred cCCceec----------cCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCc-----eEEEEcCCCCcee Confidence 8876554 45677999999999877765437778889988888665 443211 1222223345567 Q ss_pred EEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeec-Cccc---ceecCCCcc-ceeeEEEEEEeEEEecccce Q lcl|NC_019402. 235 LNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVL-RAPE---RTKLAKDGS-YEKWMIEMEVGLRHRNPYAS 309 (318) Q Consensus 235 ~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~L-r~~~---~e~laktGd-~~k~~i~~E~tLe~~N~~a~ 309 (318) .|..|..|.|.-| .+++..+-+|....++ |++......= -|+. .-...++|- +...--.-+|.+.+.|..+- T Consensus 248 ~G~~v~g~~sa~G-~I~l~gs~il~~~~~l--~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GE 324 (467) T protein:vir:80 248 VGFNIQGFHSARG-FIKLHGSTVMENEQIL--DERILALPTAPQPAKVTATQEAGKKGQFRAEDLAAHEYKVVVSSDDAE 324 (467) T ss_pred eeecccceeccee-eeeecCceeeccccCC--CcccccccccccCCccceeeecccCCcccCCCcceEEEEEEEECCCCc Confidence 8999999999999 6998877666444443 3333222210 0111 011111111 11111113455556555432 Q ss_pred eE----EEEeccC Q lcl|NC_019402. 310 GI----LEVKAGA 318 (318) Q Consensus 310 g~----i~~lt~a 318 (318) .. ++..-+| T Consensus 325 S~pS~~vtvTVaa 337 (467) T protein:vir:80 325 SIASEVATATVTA 337 (467) T ss_pred cccccceEEEecC Confidence 22 2222222 No 31 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.49 E-value=2.4e-08 Score=62.36 Aligned_cols=280 Identities=11% Similarity=-0.010 Sum_probs=148.2 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccce--EEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT--LFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~--~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |++.++.. ...-+.+++.|...-..+.|+..+......++- .+-+.+.... + --..||...+..... T Consensus 1 m~t~t~gg-~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~-a---------~wv~E~~~~~~s~~~ 69 (303) T protein:vir:97 1 MGTETSKA-SLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSD-I---------DVVAENGKKTHGGLS 69 (303) T ss_pred CcccCCCC-eEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcc-e---------EEeecCccccccccc Confidence 99766544 456677878887777778888877654443322 2223222111 1 123477665544322 Q ss_pred CcEEecceEEEEeeeeeehhHHH-HhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTAN-VLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~-a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) .....-+ ..-+..-+.||.-.. +.... ..+..++=..+-.+.+.+-+|.++|+|.....+....+.-..+... T Consensus 70 f~~v~l~-~~kl~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~---- 143 (303) T protein:vir:97 70 LEPVTIV-PIKVEYGARLSDEFLYATEEE-KIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDS---- 143 (303) T ss_pred eeeEEee-eEEEEEeehhhHHHhhcCccc-hHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccc---- Confidence 1111111 111222233333211 10100 1122333334455667889999999885332222211111111100 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) ..+ .....+....+.++|.+++.++..++..+..+++||.....+..+. +.+ .++..++.- ..+. 
T Consensus 144 ------~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~~----g~~~~~~~~--~~~~ 208 (303) T protein:vir:97 144 ------KVT--QVVKFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVT-NGE----MGPKMYPEL--AWGA 208 (303) T ss_pred ------ccc--cccccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhh-ccC----CCeEEecCc--cCCC Confidence 000 0112223345678999999999888888888999999888876653 332 233332211 1111 Q ss_pred EEEEEEcCCCcEEEEEecCCCCC--------ceEEEEehhh-cceeecCcccceecC---CCc--------cceeeEEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPE--------NAVYFFTPSD-WTQMVLRAPERTKLA---KDG--------SYEKWMIEM 297 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~--------~~~~~~D~~~-~~~~~Lr~~~~e~la---ktG--------d~~k~~i~~ 297 (318) ... +=+| ++++.+++||. ..+++.|++. +.+...+.+..+-.. .+| |-...+.+. T Consensus 209 ~~~---~l~G--~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~ 283 (303) T protein:vir:97 209 NPD---SING--LKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEA 283 (303) T ss_pred CCc---eecc--eeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEE Confidence 122 2357 68888889875 3378889754 445555555443322 111 222344566 Q ss_pred EEeEEEecccceeEEEEecc Q lcl|NC_019402. 298 EVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 298 E~tLe~~N~~a~g~i~~lt~ 317 (318) .+...+++|+|..+|++..- T Consensus 284 r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 284 YIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EeccEeecccceEEeeCCCC Confidence 78999999999999999888 No 32 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=98.48 E-value=1.7e-07 Score=57.63 Aligned_cols=257 Identities=12% Similarity=0.057 Sum_probs=149.1 Q ss_pred CCceeeeeeeeecccc-ee----------eeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSF-AN----------WISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl-~d----------~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|....-..|.+ ++ .+..+.+.++.+-. -++.+ .+.+.|. .+..+.+ --|| T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g-~~G~t--v~ip~~~--~~g~~~~---------~~~g 66 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVG-QPGDT--LTFPAFT--YSGDAQV---------IAEG 66 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccC-CCCCE--EEEEeec--cCCCccc---------cCCC Confidence 9976665433333322 21 22333334433221 11222 3345674 2222221 2267 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|+ .+.|++++-+.+.. .++.+..-..+....+.+.++..++.--. +.. T Consensus 67 ~~i~~~~it~~~~~~~i~~~-~~~~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~~d~~i~~~l~---~a~------- 132 (274) T protein:vir:96 67 EKIPVDQIGTSKREAKVRKI-GKGTELTDEAVLSG---FGDPQGEAVRQHGLAIANKVDNDVLEALK---GAT------- 132 (274) T ss_pred CcCchhhcccceeEEEEEee-eceeeecHHHHHhh---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---cCC------- Confidence 77666666655555556664 57789988765543 34667776677777888999988873210 100 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) ......+++.+.|.++++++=+++..++.++|||.+...+.+..... ..+... 
T Consensus 133 ---------------------~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~------f~~~~~ 185 (274) T protein:vir:96 133 ---------------------LTVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDN------FTRPTQ 185 (274) T ss_pred ---------------------CCcCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhccccc------cccccc Confidence 01122457889999999999888889999999999877664431100 000000 Q ss_pred -CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEecc Q lcl|NC_019402. 230 -GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNP 306 (318) Q Consensus 230 -~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~ 306 (318) ++.-.....+-.| .| ++|+.++.+|.++.|++.+..+.+..-+++..|. .++.+ ++......=|+..+.|| T Consensus 186 ~g~~~~~~g~ig~~---~G--~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~-~d~i~~~~~yg~~~~~~ 259 (274) T protein:vir:96 186 LGDNIIVKGAFGEA---LG--AVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDASRK-STALYSDKHYVAYLYDE 259 (274) T ss_pred ccccceeeccccee---cC--eeEEEcCCCCcceEEEEeCcceeeeecCCcccccccchhhc-ccEEEEeeEEEEEEEcC Confidence 1100001112222 46 6899999999999999999999987666654432 22222 22233333489999999 Q ss_pred cceeEEEEeccC Q lcl|NC_019402. 307 YASGILEVKAGA 318 (318) Q Consensus 307 ~a~g~i~~lt~a 318 (318) .+..+|+-.++= T Consensus 260 ~~vv~~t~~~~~ 271 (274) T protein:vir:96 260 SKVVKITKGAGD 271 (274) T ss_pred ccEEEEEcCccc Confidence 999888877666 No 33 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.48 E-value=6.9e-08 Score=59.84 Aligned_cols=269 Identities=12% Similarity=0.034 Sum_probs=149.6 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +....+ ......-+.+.+.|...-....|+++++-..+..+..+.+..-+.... ..-.-||...+..... T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~~~~~~- 97 (324) T protein:vir:96 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIETSKAT- 97 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eeEecCCccccccccc- Confidence 221111 222233455667676666788888887766555544344333221111 1123477776654332 Q ss_pred cEEecceEEE---EeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 80 TTVINNVTQI---LRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 80 ~~~~~N~tQI---f~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +.+++-- +...+.||. +.+.... .+..+|=...-.+.+.+.+|.++|+|. |++.. -.||...+. T Consensus 98 ---~~~v~~~~~k~~~~~~is~--ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~~~---~~gi~~~~~ 164 (324) T protein:vir:96 98 ---WVNATMRAFKLGVILPVTK--EFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNPF---GKSIAQSIE 164 (324) T ss_pred ---eeEEEEeeEEEEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHHHHhccC----CCCCc---Ccccccccc Confidence 2222211 223333443 3333221 233333334555778899999999874 22222 234443222 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ....+..+++.++|.++..++-.++.++..++|++.....+..+. +.+ .++....+... 
T Consensus 165 ~------------~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~-d~~----G~~~~~~~~~~--- 224 (324) T protein:vir:96 165 K------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDRNSD--- 224 (324) T ss_pred c------------cceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-ccC----CCeeecCCCCC--- Confidence 1 122344567999999999999888888889999999888877654 221 23322222211 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeEEEEEE Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIEMEV 299 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~~E~ 299 (318) +=+|..|-+.+.-.++.+.+++.|++++-+..-.++..+-.- -.-|....+....+ T Consensus 225 -------~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~ 297 (324) T protein:vir:96 225 -------SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) T ss_pred -------cccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE Confidence 235633222233445667799999988766554554432221 11245566667779 Q ss_pred eEEEecccceeEEEEeccC Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~a 318 (318) ++.+++|+|..+|++.... T Consensus 298 d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 298 ALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccEEecccceEEEeccccc Confidence 9999999999999886555 No 34 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.48 E-value=6.9e-08 Score=59.84 Aligned_cols=269 Identities=12% Similarity=0.034 Sum_probs=149.6 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +....+ ......-+.+.+.|...-....|+++++-..+..+..+.+..-+.... ..-.-||...+..... T Consensus 27 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~~~~~~- 97 (324) T protein:vir:78 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG--------AYWVGEGQKIETSKAT- 97 (324) T ss_pred ccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eeEecCCccccccccc- Confidence 221111 222233455667676666788888887766555544344333221111 1123477776654332 Q ss_pred cEEecceEEE---EeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 80 TTVINNVTQI---LRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 80 ~~~~~N~tQI---f~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +.+++-- +...+.||. +.+.... .+..+|=...-.+.+.+.+|.++|+|. |++.. -.||...+. T Consensus 98 ---~~~v~~~~~k~~~~~~is~--ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~~~---~~gi~~~~~ 164 (324) T protein:vir:78 98 ---WVNATMRAFKLGVILPVTK--EFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNPF---GKSIAQSIE 164 (324) T ss_pred ---eeEEEEeeEEEEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHHHHhccC----CCCCc---Ccccccccc Confidence 2222211 223333443 3333221 233333334555778899999999874 22222 234443222 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ....+..+++.++|.++..++-.++.++..++|++.....+..+. +.+ .++....+... 
T Consensus 165 ~------------~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~-d~~----G~~~~~~~~~~--- 224 (324) T protein:vir:78 165 K------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPE----TKERIYDRNSD--- 224 (324) T ss_pred c------------cceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-ccC----CCeeecCCCCC--- Confidence 1 122344567999999999999888888889999999888877654 221 23322222211 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeEEEEEE Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIEMEV 299 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~~E~ 299 (318) +=+|..|-+.+.-.++.+.+++.|++++-+..-.++..+-.- -.-|....+....+ T Consensus 225 -------~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~ 297 (324) T protein:vir:78 225 -------SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) T ss_pred -------cccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE Confidence 235633222233445667799999988766554554432221 11245566667779 Q ss_pred eEEEecccceeEEEEeccC Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~a 318 (318) ++.+++|+|..+|++.... T Consensus 298 d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 298 ALHIADDKAFAKLVPADKR 316 (324) T ss_pred ccEEecccceEEEeccccc Confidence 9999999999999886555 No 35 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.48 E-value=8.9e-08 Score=59.23 Aligned_cols=268 Identities=10% Similarity=0.012 Sum_probs=152.2 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccce---EEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 
1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT---LFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~---~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++ ......-+.+.+.|...-..+.|++++.......+. .+.+.+.... ..-.-||++.+... T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~----------a~~v~Eg~~~~~~~ 78 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGIS----------AYWVNETEKIKTDK 78 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCce----------eEEeecCccccccc Confidence 332222 223345677778888888888898888665444322 2323332211 11234787766544 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...... -..+.+=...+.=|.+.+.... .+...|=...-...+.+.+|.++|+|. |++ .| .||..-+. T Consensus 79 ~~f~~v---~l~~~k~~~~~~is~ell~ds~-~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~-~~---~gi~~~~~ 146 (297) T protein:vir:95 79 PEVVPV---TLKAHKLGIILVTSREALNYTW-KKFFEDMKPQIVEAFYKKIDEAGLLGH----DTP-FA---NSVAKAAK 146 (297) T ss_pred cceeEE---EEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHHhccc----CCc-cc---cccccccc Confidence 222111 1222233333333344443222 232333334445559999999999874 221 11 23332211 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . .....+..++.++|.+++.++..++.....++|++.....+.++.. . ..++. +.+.. 
T Consensus 147 ~------------~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d-~----~G~~i-~~~~~---- 204 (297) T protein:vir:95 147 D------------ANKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARD-G----NKVSI-YDKAA---- 204 (297) T ss_pred c------------cceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhc-c----CCcee-ecCCC---- Confidence 1 1223445688999999999999998888889999998888876642 2 12222 22221 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC-----------------CCccceeeEEEEEE Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA-----------------KDGSYEKWMIEMEV 299 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----------------ktGd~~k~~i~~E~ 299 (318) .+-+|..+.+.++-.++.+.+++.|++.+-+..-.++..+-+- -+-|....+++..+ T Consensus 205 ------~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~ 278 (297) T protein:vir:95 205 ------NTIDGITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDI 278 (297) T ss_pred ------CcccceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEe Confidence 1245643333345567889999999998765544444332211 12245566777789 Q ss_pred eEEEecccceeEEEEeccC Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~a 318 (318) ...+.||+|..+|+..+.- T Consensus 279 d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 279 AVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccEeecccceEEEeecCCC Confidence 9999999999998877666 No 36 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=98.46 E-value=3e-07 Score=56.35 Aligned_cols=258 Identities=12% Similarity=0.064 Sum_probs=151.7 Q ss_pred CCceeeeeeeeeccc-ceeee----------EecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 
1 MATLVSYDLNGKKLS-FANWI----------SNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d~I----------~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|....-..|. +++.+ ..+.+.++. +-|+.--+.+.+.|. .+..+.+ --|| T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~---l~g~~G~tv~ip~~~--~~g~~~~---------~~eg 66 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDST---LQGQPGDTLTFPAFV--YSGDAQV---------VAEG 66 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhccccccccc---ccCCCCCEEEEEeec--cCCCccc---------ccCC Confidence 997555444433332 22221 222222222 222111123345674 2222222 2267 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.| ..+.|.|++-+.+.. .++.++....+....+.+.+++.++.--. +.. T Consensus 67 ~~i~~~~it~~~~~~~i~~-~~~~~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~~d~~~~~~~~---~a~------- 132 (274) T protein:vir:93 67 EKIPTDILETKKREAKIRK-IAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---GAK------- 132 (274) T ss_pred CcccccccccceeEEEeee-ecccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------- Confidence 7766666555555555666 356788888765543 24677777778888999999988873210 000 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) ......+++.+.|.++++++=+++...+.++|||.+...+-+ +.... ..+. ... 
T Consensus 133 ---------------------~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k---~~~~~-f~~~-s~~ 186 (274) T protein:vir:93 133 ---------------------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRG---DASTN-FTRA-TEL 186 (274) T ss_pred ---------------------ccccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHh---hhhhc-cccc-ccc Confidence 011224578899999999999988899999999997765532 21000 0000 000 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEeccc Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNPY 307 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~~ 307 (318) ++.-.....+-.| .| ++|+.++.+|.++.+++.+..+.+..-+++..|. .++. .++......-|+..+.||. T Consensus 187 g~~~~~~G~ig~~---~G--~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~~-~~d~i~~~~~y~~~~~~~~ 260 (274) T protein:vir:93 187 GDDIIVKGAFGEA---LG--AIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDAST-KTTALYSDKHYVAYLYDES 260 (274) T ss_pred cccceeeccccee---cC--eeEEEcCCCCcceEEEEeCCeEEEEecCCcccccccchhh-cccEEEEEEEEEEEEEcCC Confidence 1110001112222 46 6899999999999999999999987666654432 2232 2344445555999999999 Q ss_pred ceeEEEEeccC Q lcl|NC_019402. 308 ASGILEVKAGA 318 (318) Q Consensus 308 a~g~i~~lt~a 318 (318) +..+++-.++| T Consensus 261 ~~v~~t~~~~s 271 (274) T protein:vir:93 261 KAVKITKGSGS 271 (274) T ss_pred ceEEEeeCccc Confidence 99999999999 No 37 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.45 E-value=8.4e-08 Score=59.36 Aligned_cols=274 Identities=12% Similarity=0.017 Sum_probs=143.5 Q ss_pred CCceeeeeeee--ecccceeeeE-ecCCcccceeeeeccccccceE-EEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 
1 MATLVSYDLNG--KKLSFANWIS-NLSPTDTPFVSMTGKEAINQTL-FQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~~~~--~~~dl~d~I~-~i~p~~TP~~s~i~~~~~~~~~-~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ++.-++....| .-.++.+.|+ ..-....|+..+.-....++.. +-+. .....+ .-.-||...+... T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~~~~~-~~~~~a---------~~v~Eg~~~~~~~ 318 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVS-SAAVQW---------SWDAEFEEVSDDS 318 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceEEEEe-cCCcce---------eecccCccccccc Confidence 22222222222 2345555544 3333445555544332222222 2222 221111 1234777766543 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .+.....-+.. -+...+.||. +.+... . +..+|=...-...+.+-++.+||+|. |++ .+.-||..... T Consensus 319 ~~~~~i~~~~~-k~~~~~~is~--ell~d~-~-~~~~~i~~~l~~~~~~~~d~ail~G~----Gt~---~~p~Gi~~~~~ 386 (543) T protein:vir:81 319 PEFGQPEIPVK-KAQGFVPISI--EALQDE-A-NVTETVALLFAEGKDELEAVTLTTGT----GQG---NQPTGIVTALA 386 (543) T ss_pred cccceeeeeee-eeEeeehhhH--HHHhcc-H-HHHHHHHHHHHHHHHHHHHHHHhccC----CCC---cccccchhhcc Confidence 32211111111 1222334443 444332 2 55555555566778889999999883 322 34556664432 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) ... ...+.++...++.+++.+++..+=.+......++|||.....+..+. +.+ .++...+... 
| T Consensus 387 ~~~--------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lk-d~~----G~~l~~~~~~---g 450 (543) T protein:vir:81 387 GTA--------AEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFD-TQG----GAGLWTTIGN---G 450 (543) T ss_pred ccc--------ccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhh-cCC----CceeccCcCC---C Confidence 211 12334556678999999988887544444446889999888887764 221 2322211111 1 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCce----------EEEEehhhcceeecCcccceecCCC-cc------ceeeEEEEEE Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENA----------VYFFTPSDWTQMVLRAPERTKLAKD-GS------YEKWMIEMEV 299 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~----------~~~~D~~~~~~~~Lr~~~~e~lakt-Gd------~~k~~i~~E~ 299 (318) . . -+-+| +.|+.+.+||.+. +++.|++.+-+..-..+..+-..-. ++ ...+..+.-+ T Consensus 451 ~-~---~~l~G--~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~ 524 (543) T protein:vir:81 451 E-P---SQLLG--RPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRM 524 (543) T ss_pred C-C---ccccc--eeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEee Confidence 0 0 12378 5888888888643 7788998766654444443322211 11 2344555668 Q ss_pred eEEEecccceeEEEEeccC Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~a 318 (318) ++.+.||.|..++...++| T Consensus 525 d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 525 GADVVNPNAFRLLNVETAS 543 (543) T ss_pred ccEeecccceEEEEecccC Confidence 9999999999999999999 No 38 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.44 E-value=6.9e-08 Score=59.83 Aligned_cols=280 Identities=9% Similarity=-0.049 Sum_probs=146.8 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) |++-.++ ..-+.+.++|...-..+.|+..+....+.++-...+...+-.+. ..-.-||.+.+.....-. T Consensus 1 ma~~gG~---lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~Eg~~~~~~~~~f~ 69 (298) T protein:vir:94 1 MVLNKGT---LFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE--------IDVVAESGKKTHGGVTLA 69 (298) T ss_pred Ceecccc---ccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc--------eEEeeCCcccccccccee Confidence 7753322 34456677777777777888777654444333233322221111 112347877665433222 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCc Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDA 160 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~ 160 (318) ...-+.. -+...+.||.-...-..--..+..++-..+-...+.+.+|.++++|...-.|.... -.|+..+.... T Consensus 70 ~v~l~~~-k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~---~~~~~~~~~~~-- 143 (298) T protein:vir:94 70 PQTMVPI-KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASA---VIGTNHFDSKV-- 143 (298) T ss_pred EEEEeee-EEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccc---ccccccccccc-- Confidence 2111222 22334555544321111111223344445666778999999999885332222211 11111111110 Q ss_pred ccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEE Q lcl|NC_019402. 161 ADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVS 240 (318) Q Consensus 161 ~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~ 240 (318) .+ ....++......++|.+++.++..++.....++|||....++.++. +.+ .++-..+.. ..+ ... 
T Consensus 144 -----~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~--~~~-~~~ 209 (298) T protein:vir:94 144 -----TQ-KVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQ----GNALFPELK--WGA-TPD 209 (298) T ss_pred -----cc-ccccccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhh-ccC----CCeeecCcc--cCC-CCc Confidence 00 0112233344567899999999988888888999999988887764 221 222211111 101 111 Q ss_pred EEEcCCCcEEEEEecCCCCC------ceEEEEehhhc-ceeecCcccceecC-----------CCccceeeEEEEEEeEE Q lcl|NC_019402. 241 SIVDPLGCQYKLVPNRWMPE------NAVYFFTPSDW-TQMVLRAPERTKLA-----------KDGSYEKWMIEMEVGLR 302 (318) Q Consensus 241 ~~~tdfG~~v~iv~nr~m~~------~~~~~~D~~~~-~~~~Lr~~~~e~la-----------ktGd~~k~~i~~E~tLe 302 (318) +=+| +.++.+..+|. +.+|+.|++.+ .+..-.....+-.. ..-|....+....+++. T Consensus 210 ---tl~G--~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~ 284 (298) T protein:vir:94 210 ---TING--LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWG 284 (298) T ss_pred ---eecc--eeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccE Confidence 2256 57788888874 46888899864 33333333332211 01223334556678899 Q ss_pred EecccceeEEEEec Q lcl|NC_019402. 303 HRNPYASGILEVKA 316 (318) Q Consensus 303 ~~N~~a~g~i~~lt 316 (318) +++|+|..+|++.+ T Consensus 285 ~~~~~a~~~l~~~t 298 (298) T protein:vir:94 285 ILDATKFARVTEAN 298 (298) T ss_pred eecccceEEEEecC Confidence 99999999999999 No 39 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.41 E-value=1.5e-07 Score=57.96 Aligned_cols=272 Identities=8% Similarity=-0.036 Sum_probs=145.5 Q ss_pred CCc-------eeeee--eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccc Q lcl|NC_019402. 
1 MAT-------LVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSA 71 (318) Q Consensus 1 Ma~-------~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~d 71 (318) ||. ++++. ....-+.+.+.|...-....|++++.-....++..+.+....-... ..-.-||.. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~--------a~~v~E~~~ 72 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVG--------AYWVSETER 72 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcc--------eEEeecCcc Confidence 663 22222 2234556666666666677888887655444443333222111110 011236666 Q ss_pred cccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhH Q lcl|NC_019402. 72 AVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGF 151 (318) Q Consensus 72 a~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi 151 (318) .++.........-+... +..-+.||. +.+.... -+...|=...-...+.+.+|.++|+|.......+ ....|+ T Consensus 73 ~~~~~~~~~~i~~~~~k-~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~---~~~~~~ 145 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKK-IGVIIPLSK--EFLKWTA-KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTS---TSGKPL 145 (304) T ss_pred cccccceeeEEEEEEEE-EEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccc---cccccc Confidence 55443222221112222 222334443 3333221 2223333334456688999999998843211110 111111 Q ss_pred HHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCC Q lcl|NC_019402. 152 SALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQ 231 (318) Q Consensus 152 ~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~ 231 (318) ..-. .......+....+-++|.+++.++-.++.....++|++.....+..+. +. ..|+.... . 
T Consensus 146 ~~~~-----------~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lk-d~----~G~~l~~~-~ 208 (304) T protein:vir:94 146 VEGA-----------EEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNAL-DA----NDRPLFDA-N 208 (304) T ss_pred cccc-----------cccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhh-cc----CCcEeecC-C Confidence 1100 012233445567888999999999888888888999999888887653 32 12332221 1 Q ss_pred ceEEEEEEEEEEcCCCcEEEEEecCCCCCc----eEEEEehhhcceeecCcccceecC-------------------CCc Q lcl|NC_019402. 232 DTRLNVYVSSIVDPLGCQYKLVPNRWMPEN----AVYFFTPSDWTQMVLRAPERTKLA-------------------KDG 288 (318) Q Consensus 232 ~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~~~~~~~Lr~~~~e~la-------------------ktG 288 (318) . -+=+| +.++.+.+||.+ .+++.|++++-+..-.++..+.+- -.- T Consensus 209 ~----------~~l~G--~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~ 276 (304) T protein:vir:94 209 G----------NEIMG--LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFER 276 (304) T ss_pred C----------ccccc--eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhc Confidence 1 12367 578888889854 488889988655443343322111 011 Q ss_pred cceeeEEEEEEeEEEecccceeEEEEec Q lcl|NC_019402. 289 SYEKWMIEMEVGLRHRNPYASGILEVKA 316 (318) Q Consensus 289 d~~k~~i~~E~tLe~~N~~a~g~i~~lt 316 (318) |-...+++..+++.+++|+|..+|+..- T Consensus 277 ~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 277 DMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEEEEeccEeecccceEEEEecC Confidence 3355567778999999999999988877 No 40 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.41 E-value=1.5e-07 Score=57.96 Aligned_cols=272 Identities=8% Similarity=-0.036 Sum_probs=145.5 Q ss_pred CCc-------eeeee--eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccc Q lcl|NC_019402. 
1 MAT-------LVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSA 71 (318) Q Consensus 1 Ma~-------~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~d 71 (318) ||. ++++. ....-+.+.+.|...-....|++++.-....++..+.+....-... ..-.-||.. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~--------a~~v~E~~~ 72 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVG--------AYWVSETER 72 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcc--------eEEeecCcc Confidence 663 22222 2234556666666666677888887655444443333222111110 011236666 Q ss_pred cccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhH Q lcl|NC_019402. 72 AVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGF 151 (318) Q Consensus 72 a~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi 151 (318) .++.........-+... +..-+.||. +.+.... -+...|=...-...+.+.+|.++|+|.......+ ....|+ T Consensus 73 ~~~~~~~~~~i~~~~~k-~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~---~~~~~~ 145 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKK-IGVIIPLSK--EFLKWTA-KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTS---TSGKPL 145 (304) T ss_pred cccccceeeEEEEEEEE-EEEeehhhH--HHHhcch-HHHHHHHHHHHHHHHHHHHHhhheeccCCCcccc---cccccc Confidence 55443222221112222 222334443 3333221 2223333334456688999999998843211110 111111 Q ss_pred HHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCC Q lcl|NC_019402. 152 SALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQ 231 (318) Q Consensus 152 ~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~ 231 (318) ..-. .......+....+-++|.+++.++-.++.....++|++.....+..+. +. ..|+.... . 
T Consensus 146 ~~~~-----------~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lk-d~----~G~~l~~~-~ 208 (304) T protein:vir:10 146 VEGA-----------EEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNAL-DA----NDRPLFDA-N 208 (304) T ss_pred cccc-----------cccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhh-cc----CCcEeecC-C Confidence 1100 012233445567888999999999888888888999999888887653 32 12332221 1 Q ss_pred ceEEEEEEEEEEcCCCcEEEEEecCCCCCc----eEEEEehhhcceeecCcccceecC-------------------CCc Q lcl|NC_019402. 232 DTRLNVYVSSIVDPLGCQYKLVPNRWMPEN----AVYFFTPSDWTQMVLRAPERTKLA-------------------KDG 288 (318) Q Consensus 232 ~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~~~~~~~Lr~~~~e~la-------------------ktG 288 (318) . -+=+| +.++.+.+||.+ .+++.|++++-+..-.++..+.+- -.- T Consensus 209 ~----------~~l~G--~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~ 276 (304) T protein:vir:10 209 G----------NEIMG--LPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFER 276 (304) T ss_pred C----------ccccc--eeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhc Confidence 1 12367 578888889854 488889988655443343322111 011 Q ss_pred cceeeEEEEEEeEEEecccceeEEEEec Q lcl|NC_019402. 289 SYEKWMIEMEVGLRHRNPYASGILEVKA 316 (318) Q Consensus 289 d~~k~~i~~E~tLe~~N~~a~g~i~~lt 316 (318) |-...+++..+++.+++|+|..+|+..- T Consensus 277 ~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 277 DMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEEEEeccEeecccceEEEEecC Confidence 3355567778999999999999988877 No 41 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=98.41 E-value=1.2e-08 Score=64.10 Aligned_cols=269 Identities=18% Similarity=0.171 Sum_probs=154.8 Q ss_pred CCceeeeeeeee---cccceeeeEecCCcccceeeeeccc-----------------cccceEEEeee--eeccccCCcc Q lcl|NC_019402. 
1 MATLVSYDLNGK---KLSFANWISNLSPTDTPFVSMTGKE-----------------AINQTLFQWQT--DALAPVADPS 58 (318) Q Consensus 1 Ma~~~t~~~~~~---~~dl~d~I~~i~p~~TP~~s~i~~~-----------------~~~~~~~~W~t--d~l~~~~~~~ 58 (318) ||- .+.+... ....+.++-.+=-.++|||-.++.. ......+.|-+ |.|.. T Consensus 1 mp~--~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~----- 73 (321) T protein:vir:34 1 MPF--PNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPT----- 73 (321) T ss_pred CCC--chHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeecc----- Confidence 773 2211111 2222334444445677887665541 12344466765 44532 Q ss_pred ccccccceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCc---cchHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_019402. 59 DAQKRNAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGR---GKELQYQMEKAGKEIKRDLEVALLRNG 135 (318) Q Consensus 59 ~~~~~na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~---~~e~a~q~~k~~~eikrd~E~a~i~g~ 135 (318) .....++.+.++ .+....| +.|||+-+= .+.|. -+.+..+++-+-+.++-+++..|-+ T Consensus 74 ---~p~d~~~~Aef~-----wk~aa~~--------~~isg~e~l-~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s-- 134 (321) T protein:vir:34 74 ---APQDVISSAEYA-----LKQYAVP--------VVISGLEML-QNSGKEAQLDLLEARMNVAEATMANDISAALYG-- 134 (321) T ss_pred ---chhhhccccccc-----hhheeEe--------eEEehhHHh-hccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc-- Confidence 122234444443 3334444 345666442 22232 1334444444444444444444443 Q ss_pred cccCCCCCccchhhhHHHHHhcCCcccCccccce-------------eeccCccccCHHHHHHHHHHHHh---CCC-CcC Q lcl|NC_019402. 136 AKVDGSATVARQTAGFSALVAAKDAADPDTGAIV-------------HFETAAAALTEAEIFKVTYNLYL---SGS-EAN 198 (318) Q Consensus 136 ~~~~gs~t~~r~m~Gi~~~i~~~~~~~~~~g~~~-------------~~~~t~~~lTe~~l~~~~~~i~~---~G~-~~~ 198 (318) +|++-..|++.||...|... +.+|.+. .++.+ .+.|-.-+...|.++|. 
+|+ .|+ T Consensus 135 ---dGTa~g~~~i~GL~~lv~~~----p~tGtvGGIdra~~~~WRn~~~d~~-~~~t~~tl~~~m~~~w~~~~Rg~~~PD 206 (321) T protein:vir:34 135 ---DGTAFGGRAINGLDGAVPVD----PTVGTYGGINRALWPFWRSQVEDMA-AVATINTIQPAMTKLWSRCVRGADMPD 206 (321) T ss_pred ---cccccccchhhhhhhhcccC----CCCceeccccccchhhhhhhhhhhh-hcccHHHHHHHHHHHHHhhccCCCCcc Confidence 34444569999999998643 1111111 22333 34789999999999996 344 888 Q ss_pred EEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecC----CCCCceEEEEehhhccee Q lcl|NC_019402. 199 IIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNR----WMPENAVYFFTPSDWTQM 274 (318) Q Consensus 199 ~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr----~m~~~~~~~~D~~~~~~~ 274 (318) -|+|+...-.+-+.-. ..-+|+ ...+.-..|+.=-.|.+ +.||.+. +||++.+|++|.++++++ T Consensus 207 lii~~~~~y~~y~~s~-----q~~qR~--~~~~~a~~Gf~~Lky~~-----~div~D~~~g~~~pan~~yfiNT~yl~~r 274 (321) T protein:vir:34 207 LIMSGNDAWTTYSNSL-----QVLQRF--TSAEEANLGFRSLKFLS-----TDVVLDGGIGGFAGANTMYFLNTKYLHFR 274 (321) T ss_pred EEEechHHHHHHHHhh-----heeeee--cccccccccceeeeeee-----EEEEEeCCCCCCccccceeeeecceEEEE Confidence 8888876544433311 112333 34455667775555544 6888887 799999999999999998 Q ss_pred ecCcccceecCCC------ccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 275 VLRAPERTKLAKD------GSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 275 ~Lr~~~~e~lakt------Gd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) +-..-..-++.+. 
=|+.-..|..-.-|-|.|+.+|+++.+- T Consensus 275 ~h~~~~~~pi~p~r~~~~NqdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 275 PHKDRNMVPLSPSRRAAFNQDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred EcCCCceeecCcccccccchhHHhhhhhhhheeeeecccceeEEeeC Confidence 7643333344433 3666678888889999999999999988 No 42 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.39 E-value=7.6e-08 Score=59.62 Aligned_cols=285 Identities=13% Similarity=0.089 Sum_probs=147.0 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +++-++.+ ....-+.+++.|+..-....|+.+++.....++....|.........+... .-.-||...+...... T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~Eg~~~~~~~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGF----KTVAEGGKKPYMRFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEecccccccccc----ceecCcccccccCccc Confidence 22111111 222345566777777777788888877766666655665443322111100 1123666544332111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.=-..-+...+.||...- .... ...+|=..+-...+.+-+|++||+|. |.+. ...||... 
.+ T Consensus 194 f~~i~~~~~k~~~~~~iS~ell--~ds~--~l~~~i~~~la~~~~~~~d~~~l~G~----G~~~---~~~Gi~~~---~~ 259 (413) T protein:vir:81 194 FDIVTESLSKIAGLTKITDEMI--EDYD--FLVSYINARLLEELAIEEERQLLLGD----GTGN---NLTGLLKR---DG 259 (413) T ss_pred ceeeEeeeeeEEEeehhhHHHH--HHHH--HHHHHHHHHHHHHHHHHHHHHHhccC----CCCC---cccccccc---cc Confidence 1111111112223355665432 2221 22233333445678889999999873 3221 14466431 11 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhC-CCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE- Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLS-GSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV- 237 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~-G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~- 237 (318) .. ... .....-..+++.+++-.+-.+ +..++.++||+.....|..+. +.+ .|+...+......+. T Consensus 260 ~~-------~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~~~~~~~~ 326 (413) T protein:vir:81 260 IQ-------TLA-VSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAK-DAN----GQYYGGGVFQGQYGSG 326 (413) T ss_pred cc-------ccc-ccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhh-ccC----Cceecccccccccccc Confidence 11 011 111222345555565555544 445667888998877777664 221 222211111111110 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCCC-----ccceeeEEEEEEeEEEecccceeE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAKD-----GSYEKWMIEMEVGLRHRNPYASGI 311 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~lakt-----Gd~~k~~i~~E~tLe~~N~~a~g~ 311 (318) ....-.+-|| +.++.+.+||++.+++.|++..-+.+.| ++..+-..-. -+...+..+..+.+.+.+|.|..+ T Consensus 327 ~~~~~~~l~G--~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 404 (413) T protein:vir:81 327 GIMLDPAPWG--LRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQ 404 (413) T ss_pred ccccCceecc--eeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEE Confidence 0111124568 5899999999999999999874444444 3333322222 244566666779999999999999 Q ss_pred EEEeccC Q lcl|NC_019402. 
312 LEVKAGA 318 (318) Q Consensus 312 i~~lt~a 318 (318) ++..+++ T Consensus 405 l~~~~~~ 411 (413) T protein:vir:81 405 LDVAEVV 411 (413) T ss_pred EEecCCC Confidence 9987777 No 43 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.36 E-value=1.7e-07 Score=57.65 Aligned_cols=272 Identities=11% Similarity=0.017 Sum_probs=149.9 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +.+-++.. -...-+.+.+.|...--...|+..++......+..+.|....-.... ..-.-||.+.+...... T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-------a~~v~Eg~~~~~~~~~~ 185 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNN-------AAIVAEGALKPESSLKF 185 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcc-------eeeecCCccccccccce Confidence 22222222 22234455555555555667787776666555555555554321110 11234777766554332 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) ....-+... +...+.||. +.++.. .+..++=...-...+.+-+|.+||+| +|++.. ..||....... 
T Consensus 186 ~~i~~~~~k-~~~~~~is~--ell~ds--~~l~~~i~~~la~a~~~~~d~a~l~G----~g~~~~---p~Gi~~~~~~~- 252 (390) T protein:vir:97 186 AKKTDTTHV-IAHTMKATR--QILSDA--PQLASYMNNRLIRGLKVKEDAEILRG----TGANDG---LLGLIPQATTY- 252 (390) T ss_pred eEEEEeeee-EEEeehhhH--HHHHhH--HHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCcc---ccceeeccccc- Confidence 222222222 222344554 334322 24444444456777888999999987 233332 34665432111 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ..........+.++|.+++.++-..+.....++|||.....+..+. +.+ .++...+.. ..+. T Consensus 253 ---------~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~--~~~~-- 314 (390) T protein:vir:97 253 ---------AAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAK-DAN----NQYLIGNAR--GTLT-- 314 (390) T ss_pred ---------cccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-cCC----CceeecCcc--CCCC-- Confidence 1112233455677888888888777777888999999888887764 322 232222211 1111 Q ss_pred EEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeec-Ccccceec----CCCccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVL-RAPERTKL----AKDGSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~L-r~~~~e~l----aktGd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) . +=+| ++++.+.+||++.+++.|++..-+.+. ..+..+.. --+-+...+++..-+.+.+++|.|..+++. T Consensus 315 ~---~l~G--~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~ 389 (390) T protein:vir:97 315 P---TLWG--LPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSF 389 (390) T ss_pred c---eecc--eeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEe Confidence 1 1257 688999999999999999986322222 22222111 112355566777789999999999999988 Q ss_pred e Q lcl|NC_019402. 
315 K 315 (318) Q Consensus 315 l 315 (318) . T Consensus 390 a 390 (390) T protein:vir:97 390 A 390 (390) T ss_pred C Confidence 7 No 44 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.35 E-value=1.3e-07 Score=58.39 Aligned_cols=273 Identities=13% Similarity=0.057 Sum_probs=144.4 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) ||..++.+ -...-+++.+.|...--..+|+.++.......+....|...+....+ .-.-||+..+...... T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a--------~wv~E~~~~~~~~~~~ 72 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEA--------DWVGESATDPKGVKPT 72 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcce--------EEeecccccccccccc Confidence 99888876 44567778888888778889999887666665555555544432211 1123665544432111 Q ss_pred cEEecceEEE----EeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 80 TTVINNVTQI----LRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 80 ~~~~~N~tQI----f~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) ....+.+| ++=...+.=|.+.+.-.. .+..++=...-.+.+.+.+|.++|+|.- +... ...-++.... 
T Consensus 73 --s~~~f~~i~~~~~k~~~~~~is~ell~ds~-~~~~~~i~~~l~~~~a~~~d~a~~~G~g----~~~~-~~~~~~~~~~ 144 (305) T protein:vir:25 73 --SKVTWANRTLVAEEIAVIIPVHENVIDDAT-VAVLTEVAELGGQAIGKKLDQAVIFGTD----KPAS-WVSPALIPAA 144 (305) T ss_pred --cccceeeEEeeeEEEEEeehhhHHHHhcch-HHHHHHHHHHHHHHHHHHHhhhheeccC----CCCC-cccccccccc Confidence 01122222 221222223333333222 2334444455568889999999998742 2111 1111111111 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHH----HHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCC Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYN----LYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQ 231 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~----i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~ 231 (318) ... +. ....+....+.+++.+.+.+ +-.++...+.+++++.....+.++. +.+ .++...+ T Consensus 145 ~~~-------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk-d~~----G~~i~~~-- 208 (305) T protein:vir:25 145 VTA-------GQ--AVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIR-DAN----GNPVFRD-- 208 (305) T ss_pred ccc-------cc--cccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhh-ccC----CceeecC-- Confidence 000 00 11122223334444444443 3344555667889998888877763 322 2332211 Q ss_pred ceEEEEEEEEEEcCCCcEEEEEecCCCCCc----eEEEEehhhcceeecCcccceec--C--CCc---------cceeeE Q lcl|NC_019402. 232 DTRLNVYVSSIVDPLGCQYKLVPNRWMPEN----AVYFFTPSDWTQMVLRAPERTKL--A--KDG---------SYEKWM 294 (318) Q Consensus 232 ~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~~~~~~~Lr~~~~e~l--a--ktG---------d~~k~~ 294 (318) + +=+| +.++.+.++|.+ .+++.|++.+.+..-.++..+.. + +++ |....+ T Consensus 209 ~-----------~l~G--~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R 275 (305) T protein:vir:25 209 D-----------SFAG--FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALR 275 (305) T ss_pred C-----------cccc--cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEE Confidence 1 2356 466666777643 58888988765554444432211 1 111 334445 Q ss_pred EEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
295 IEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 295 i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) .+..+++.+.||+|..+++.+..+ T Consensus 276 ~~~r~~~~v~~p~a~v~~~~~~~~ 299 (305) T protein:vir:25 276 LKARFAYVLGVSATAQGANKTPVA 299 (305) T ss_pred EEEeecceeeCcccEEEEcccccc Confidence 566799999999999999998776 No 45 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.32 E-value=2.6e-07 Score=56.66 Aligned_cols=283 Identities=10% Similarity=-0.083 Sum_probs=149.3 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |+..++.. .-..-+.+.+.|...-....|+++++......+....|...+-... ..-..||.+.+...... T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~E~~~~~~~~~~f 85 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVS--------AQWIGEGDMKPITKGNM 85 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcc--------eEEecCCccccccccce Confidence 44333222 2234566677777776778888887766555444444443322111 11234787776544322 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) .. .-..+++=...+.=|.+.+.... .+...+=..+-.+.+.+.+|.++|+|... + .+....|+.... . 
T Consensus 86 ~~---v~~~~~k~~~~~~is~ell~ds~-~~l~~~i~~~l~~a~a~~~d~a~l~G~g~----~-~~~~~~~~~~~~---~ 153 (320) T protein:vir:10 86 TS---QNIAPHKIATIFVASAETVRANP-ANYLGTMRTKVATAFAMAFDSAALNGTDS----P-FPTYLAQTTKSV---S 153 (320) T ss_pred eE---EEEeeEEEEEeehhhHHHHhcCh-HHHHHHHHHHHHHHHHHHHHHHhhcccCC----C-CCcccccccccc---c Confidence 11 12223333344444455544322 23344444566688899999999988532 1 121122221111 1 Q ss_pred cccCccccceeeccCcccc--CHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAAL--TEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~l--Te~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .. .....+...+ .++.+.+++..+...+..+..++|||.....+..+. +.+ .++...+.... ... T Consensus 154 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~~~-~~~ 220 (320) T protein:vir:10 154 LA-------DPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAK-DKN----GRPLFIESTYT-DEN 220 (320) T ss_pred ce-------ecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhh-ccC----Cceeecccccc-Ccc Confidence 00 0111122222 234567777777777777888999999988887653 221 12222111100 000 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCce--EEEEehhhcceeecCcccce----ecCCCc-------------cceeeEEEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENA--VYFFTPSDWTQMVLRAPERT----KLAKDG-------------SYEKWMIEME 298 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~--~~~~D~~~~~~~~Lr~~~~e----~laktG-------------d~~k~~i~~E 298 (318) ..-.-.+-+| +.++++.+||+++ +++.|++.+-+..-.++..+ ...+.| +....+.+.. T Consensus 221 ~~~~~~~i~g--~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~ 298 (320) T protein:vir:10 221 SPFRAGRIVS--RPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAE 298 (320) T ss_pred ccccCceeee--eeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEe Confidence 0000012356 6889999999987 45678877655444444322 111222 3345566677 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +.+.+.+|+|..+|++.++= T Consensus 299 ~d~~v~~~~a~~~l~~~~ap 318 (320) T protein:vir:10 299 YAFHNNDKDAFVKLTNVVTP 318 (320) T ss_pred eccEEecccceEEEEeccCC Confidence 99999999999999977644 No 46 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.29 E-value=1e-06 Score=53.42 Aligned_cols=258 Identities=11% Similarity=0.045 Sum_probs=149.5 Q ss_pred CCceeeeeee-eecccceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLN-GKKLSFANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~-~~~~dl~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|.... -+-|=+++.+. .+-..++.+ +--++.+ .+.+.|. .+..+.+ --|| T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l-~g~~G~t--v~iP~~~--~~g~a~~---------~~~g 66 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTL-QGQPGDT--LTFPAFV--YSGDAQV---------VAEG 66 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccc-cCCCCCE--EEEeeec--CCCcccc---------ccCC Confidence 8874443333 23222332221 222222221 1111222 2345674 2222222 2367 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|+ .+.|+|++-+.+.. .++-+.........-+.+.++..++.--. +.. 
T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---~a~------- 132 (274) T protein:vir:97 67 EKIPTDILETKKREAKIRKI-AKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---GAK------- 132 (274) T ss_pred CcccccccccceeEEEeeee-cceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---ccC------- Confidence 77666666666665566664 46899998776543 24677777777778899999988873210 100 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) ....+.+++.+.|.++++++=+++..++.++|||.+...+-+ +....+ .+. ... T Consensus 133 ---------------------~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k---~~~~~f-~~~-s~~ 186 (274) T protein:vir:97 133 ---------------------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG---DASTNF-TRA-TEL 186 (274) T ss_pred ---------------------ccccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh---hhhhhc-ccc-Ccc Confidence 001224578999999999998888899999999997766532 210000 000 000 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEeccc Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNPY 307 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~~ 307 (318) ++.-.....+-.| .| ++|+.++.+|.++.+++.+..+.+..-+++..|. -++.+ ++......-|+..+.||. T Consensus 187 g~~~~~~G~ig~~---~G--~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~-~d~i~~~~~y~~~~~~~~ 260 (274) T protein:vir:97 187 GDDIIVKGAFGEA---LG--AIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTK-TTALYSDKHYVAYLYDES 260 (274) T ss_pred cccceecccccee---cC--eeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhhc-ccEEEEEEEEEEEEEcCC Confidence 1110001112222 35 6899999999999999999999986656654322 12222 233333344899999999 Q ss_pred ceeEEEEeccC Q lcl|NC_019402. 
308 ASGILEVKAGA 318 (318) Q Consensus 308 a~g~i~~lt~a 318 (318) +..+++-..+| T Consensus 261 ~vv~~t~~~~~ 271 (274) T protein:vir:97 261 KAVKITKGSGS 271 (274) T ss_pred ceEEEecCccc Confidence 99999998888 No 47 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.29 E-value=1e-06 Score=53.42 Aligned_cols=258 Identities=11% Similarity=0.045 Sum_probs=149.5 Q ss_pred CCceeeeeee-eecccceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLN-GKKLSFANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~-~~~~dl~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|.... -+-|=+++.+. .+-..++.+ +--++.+ .+.+.|. .+..+.+ --|| T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l-~g~~G~t--v~iP~~~--~~g~a~~---------~~~g 66 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTL-QGQPGDT--LTFPAFV--YSGDAQV---------VAEG 66 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccc-cCCCCCE--EEEeeec--CCCcccc---------ccCC Confidence 8874443333 23222332221 222222221 1111222 2345674 2222222 2367 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|+ .+.|+|++-+.+.. .++-+.........-+.+.++..++.--. +.. 
T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~---~a~------- 132 (274) T protein:vir:94 67 EKIPTDILETKKREAKIRKI-AKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---GAK------- 132 (274) T ss_pred CcccccccccceeEEEeeee-cceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHh---ccC------- Confidence 77666666666665566664 46899998776543 24677777777778899999988873210 100 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) ....+.+++.+.|.++++++=+++..++.++|||.+...+-+ +....+ .+. ... T Consensus 133 ---------------------~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k---~~~~~f-~~~-s~~ 186 (274) T protein:vir:94 133 ---------------------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG---DASTNF-TRA-TEL 186 (274) T ss_pred ---------------------ccccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh---hhhhhc-ccc-Ccc Confidence 001224578999999999998888899999999997766532 210000 000 000 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEeccc Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNPY 307 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~~ 307 (318) ++.-.....+-.| .| ++|+.++.+|.++.+++.+..+.+..-+++..|. -++.+ ++......-|+..+.||. T Consensus 187 g~~~~~~G~ig~~---~G--~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~-~d~i~~~~~y~~~~~~~~ 260 (274) T protein:vir:94 187 GDDIIVKGAFGEA---LG--AIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTK-TTALYSDKHYVAYLYDES 260 (274) T ss_pred cccceecccccee---cC--eeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhhc-ccEEEEEEEEEEEEEcCC Confidence 1110001112222 35 6899999999999999999999986656654322 12222 233333344899999999 Q ss_pred ceeEEEEeccC Q lcl|NC_019402. 
308 ASGILEVKAGA 318 (318) Q Consensus 308 a~g~i~~lt~a 318 (318) +..+++-..+| T Consensus 261 ~vv~~t~~~~~ 271 (274) T protein:vir:94 261 KAVKITKGSGS 271 (274) T ss_pred ceEEEecCccc Confidence 99999998888 No 48 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.28 E-value=5.2e-07 Score=55.01 Aligned_cols=288 Identities=9% Similarity=0.045 Sum_probs=141.4 Q ss_pred CCceeeee----eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD----LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~----~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) +....... .....+.+.+.|...-.....+..++......+..+.|..............+...-.-||+..+... T Consensus 121 ~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 200 (419) T protein:vir:94 121 RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQST 200 (419) T ss_pred cccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccc Confidence 22111111 11122233333332222223333344433333333444333221110000001111233777766543 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .+.....-+. .-+...+.||.- .+... .+..+|=...-...+.+-+|.++|+|. |++ +.-||+..-. 
T Consensus 201 ~~~~~i~~~~-~k~~~~~~is~e--ll~d~--~~l~~~i~~~la~a~~~~~d~aii~G~----G~~----~p~Gi~~~~~ 267 (419) T protein:vir:94 201 LSFDTITTTL-KTVAHWLPITRQ--AADDN--SQLMGYIQGRLTYGLRFLRDRQLLNGN----GST----EMQGILTTPG 267 (419) T ss_pred cceeeEEeee-eeEEEeehhhHH--HHHhH--HHHHHHHHHHHHHHHHHHHHHHHHhcc----Ccc----cccceecccc Confidence 3222211111 223334455543 33322 232333333457788899999999873 222 2335543111 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) -.... ..............++|.+++.++-.++..+..++|+|.....+..+.... +.++..+... . + T Consensus 268 ~~~~~-----~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~----~~~~~~~~~~-~--~ 335 (419) T protein:vir:94 268 IGTYQ-----QPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPG----SGVFRVIANV-Q--G 335 (419) T ss_pred ccccc-----ccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcC----CCceeecCCc-c--c Confidence 00000 001112233444567777777777777777778999999888887664322 1222111110 0 1 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC-cccceecCC-----CccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR-APERTKLAK-----DGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr-~~~~e~lak-----tGd~~k~~i~~E~tLe~~N~~a~g 310 (318) .... +=+| +.++.+.+||++.+++.|+++.-..+.| .+..+...- +-+...+.++.-+.+.+.+|.|.. T Consensus 336 ~~~~---~l~G--~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~ 410 (419) T protein:vir:94 336 EATP---RIWG--LNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFV 410 (419) T ss_pred CCCc---cccc--eeeEEcCCCCCccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEE Confidence 0011 2368 5899999999999999999976444443 333322222 234556677777999999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..++= T Consensus 411 ~~~~~aa~ 418 (419) T protein:vir:94 411 RVTFAAAT 418 (419) T ss_pred EEEeccCC Confidence 98876655 No 49 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.25 E-value=3.9e-07 Score=55.70 Aligned_cols=277 Identities=9% Similarity=-0.034 Sum_probs=142.1 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccce--EEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT--LFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~--~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) ||+-.++ ..-+++..+|+..-....++.++......++. .+-+++.... + .-.-||.+.+..... T Consensus 1 ma~~gG~---lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~-a---------~~v~E~~~~~~~~~~ 67 (298) T protein:vir:16 1 MVLNKGT---LFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE-I---------DVVAESGKKTHGGVT 67 (298) T ss_pred CcccCcc---eechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcc-e---------EEecCCccccccccc Confidence 8854443 23345667777766777888887665444322 2333332211 1 123377766655432 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) -...--+... +..-+.||........--..+..++=..+-.+.+.+-+|.++++|...-.|.. ...-|+....... 
T Consensus 68 f~~v~l~~~k-~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~---~~~~~~~~~~~~~ 143 (298) T protein:vir:16 68 LAPQTMVPIK-VEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTA---SAVIGTNHFDSKV 143 (298) T ss_pred eeEEEEeeee-EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcc---ccccccccccccc Confidence 2211112222 22334455443211111111222333345556678899999998843222221 1112211111000 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) .. .....+...-..++|.+++.++..++.....+++||.....+..+. +.+ .|+...+... .+ . T Consensus 144 -------~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~i~~~~~~--~~-~ 207 (298) T protein:vir:16 144 -------TQ-KVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQ----DNALFPELKW--GA-T 207 (298) T ss_pred -------cc-ccccccccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhh-ccC----CCeeecCccc--CC-C Confidence 00 0111111222356888999999888888888999999988887763 332 2332222110 01 0 Q ss_pred EEEEEcCCCcEEEEEecCCCCC------ceEEEEehhhc-ceeecCcccceecCCCc------------cceeeEEEEEE Q lcl|NC_019402. 239 VSSIVDPLGCQYKLVPNRWMPE------NAVYFFTPSDW-TQMVLRAPERTKLAKDG------------SYEKWMIEMEV 299 (318) Q Consensus 239 v~~~~tdfG~~v~iv~nr~m~~------~~~~~~D~~~~-~~~~Lr~~~~e~laktG------------d~~k~~i~~E~ 299 (318) ... =+| +.++.+..+|. +.+|+.|++.. .+..-..+..+-. ..+ |-...+....+ T Consensus 208 ~~~---l~G--~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~-~~~~~~~~~~~~f~~~~v~~ra~~r~ 281 (298) T protein:vir:16 208 PDT---ING--LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVI-QYGDPDNSGLDLKGYNQVYIRAELFL 281 (298) T ss_pred Cce---ecc--eeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEe-eccCCcCcchhhhhcCcEEEEEEEEE Confidence 111 257 57888888875 45888898753 3333333332211 111 22344556668 Q ss_pred eEEEecccceeEEEEec Q lcl|NC_019402. 
300 GLRHRNPYASGILEVKA 316 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt 316 (318) ...+++|+|...|++.+ T Consensus 282 d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 282 GWGILDATKFARVTEAN 298 (298) T ss_pred ccEeecccceEEEeecC Confidence 89999999999999999 No 50 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.23 E-value=1.7e-07 Score=57.68 Aligned_cols=280 Identities=15% Similarity=0.079 Sum_probs=145.9 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceE-EEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTL-FQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~-~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |.+-++.+ -...-+++..+|...-....|+.++...-+..+.. .-|.......+ .-.-||...+.... T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~~- 175 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTS---------GWVGETDARPETAT- 175 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcce---------eeeccccccccccc- Confidence 33222211 11234566777777777778887766554443332 22332222111 12336665443221 Q ss_pred CcEEecceEEEEeeeee----ehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHH Q lcl|NC_019402. 79 STTVINNVTQILRKVVK----VSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSAL 154 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~----VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~ 154 (318) ..+.||--.... |.=|.+.+.... .+..+|=..+-...+.+-+|.+||+|. |. .+.-||+.. 
T Consensus 176 -----~~f~~i~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~l~~~i~~~~~~a~l~G~----G~----~~p~Gil~~ 241 (407) T protein:vir:48 176 -----SKLGLIEPFMGEIYGNPQATQKMLDDAF-FNVEDWINSELALEFAEQEEIAFTSGD----GS----KKPKGFLAY 241 (407) T ss_pred -----ccceeEEeeeeeeEeehhhHHHHHhcch-HHHHHHHHHHHHHHHHHHHHhhhhccC----CC----Cccceeeec Confidence 223333222222 222333333222 122333334444557788999999873 32 234477654 Q ss_pred HhcCC--cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 155 VAAKD--AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 155 i~~~~--~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ..... .....+.....+.++...++.++|.+++.++..+......++||+.....+..+. +.+ .|+...+.-. T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lk-D~~----Gr~l~~~~~~ 316 (407) T protein:vir:48 242 ESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLK-DND----GNYLWRPGIE 316 (407) T ss_pred ccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcC Confidence 32211 1111111122344566778999999998888765544456789998877776653 322 2332222111 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhh-cceeecCcc--cceecCCCccceeeEEEEEEeEEEe Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSD-WTQMVLRAP--ERTKLAKDGSYEKWMIEMEVGLRHR 304 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~-~~~~~Lr~~--~~e~laktGd~~k~~i~~E~tLe~~ 304 (318) -|. -.+=+| +.++.+.+||. ..+++.|++. +.+.---.+ ..++.+. -+...+..+..+...+. T Consensus 317 --~g~----~~~l~G--~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~-~~~~~~~~~~r~d~~v~ 387 (407) T protein:vir:48 317 --LGQ----PSSLAG--YGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTN-KPFVGFYTTKRTGGMLV 387 (407) T ss_pred --CCC----Cceecc--eeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeecccc-CCcEEEEEEEEeccEEe Confidence 111 112367 57888899985 3367788875 222111111 1233432 35556666677999999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|+|..++...+++ T Consensus 388 ~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 388 DSQAIKLMKIGAAT 401 (407) T ss_pred cccceEEEEeeccC Confidence 99999999999888 No 51 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.21 E-value=4.5e-07 Score=55.40 Aligned_cols=278 Identities=10% Similarity=-0.038 Sum_probs=142.3 Q ss_pred CCceeeeeeee-ecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYDLNG-KKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~~~~-~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |+..++.+..+ .-+.+.+.|...-....|+..+.......+....+....-... . .-.-||.+.+..... T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~----a----~~v~Eg~~~~~~~~~- 84 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVS----A----QWIGEGDMKPITKGN- 84 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcc----e----EEecCCccccccccc- Confidence 55444444333 3445556665555666788777655554444333322221110 0 112377765543322 Q ss_pred cEEecceEEEEeee-eeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 80 TTVINNVTQILRKV-VKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 80 ~~~~~N~tQIf~~~-~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) +..++--.+|. ..+.=|.+.+.... .+..++=.....+.+.+.+|.++|+|.- ++. -.|+..-... 
T Consensus 85 ---f~~i~~~~~k~~~~~~iS~e~l~ds~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g----~~~----~~~~~~~~~~- 151 (318) T protein:vir:24 85 ---MTSQTIAPHKIATIFVASAETVRANP-ANYLGTMRTKVATAFAMAFDGAAMHGTD----SPF----PTYIGQTTKA- 151 (318) T ss_pred ---eeEEEEeeEEEEEeehhhHHHhhcCh-HHHHHHHHHHHHHHHHHHHHHhhhcccC----CCC----Cccccccccc- Confidence 22222111221 11222233333222 2333444455566689999999998852 211 1122211110 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) .......+......+.+.+++-.+-..+.....++|||.....+..+. +.+ .++...+.. . +.. T Consensus 152 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~--~-~~~ 215 (318) T protein:vir:24 152 --------ISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAK-DQN----GRPLFIEST--Y-GEA 215 (318) T ss_pred --------ccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-ccC----CceeecCcc--c-cCc Confidence 001111223334445566666666666667778899999988887654 322 222211111 0 111 Q ss_pred EEEE--EcCCCcEEEEEecCCCCCce--EEEEehhhcceeecCcccceec--------C---------CCccceeeEEEE Q lcl|NC_019402. 239 VSSI--VDPLGCQYKLVPNRWMPENA--VYFFTPSDWTQMVLRAPERTKL--------A---------KDGSYEKWMIEM 297 (318) Q Consensus 239 v~~~--~tdfG~~v~iv~nr~m~~~~--~~~~D~~~~~~~~Lr~~~~e~l--------a---------ktGd~~k~~i~~ 297 (318) ...+ ..-+| +.++.++++|.++ +++.|++.+-+..-.++..+.. . .+-|....+... T Consensus 216 ~~~~~~~~i~g--~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~ 293 (318) T protein:vir:24 216 ASPFRSGRIVA--RPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEA 293 (318) T ss_pred cccccCceEEE--EeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEE Confidence 1111 12345 5778888998776 4667888765554343322211 1 111445566778 Q ss_pred EEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
298 EVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 298 E~tLe~~N~~a~g~i~~lt~a 318 (318) .+...+++|+|..+|+..+++ T Consensus 294 r~d~~v~~~~a~~~i~~~~a~ 314 (318) T protein:vir:24 294 EYAFHCNDAEAFVALTNVVSG 314 (318) T ss_pred EEccEEecccceEEEEeeccC Confidence 899999999999999999988 No 52 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.19 E-value=5.1e-07 Score=55.06 Aligned_cols=276 Identities=10% Similarity=-0.079 Sum_probs=141.3 Q ss_pred CCc-eeeeee-eeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MAT-LVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~-~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ++. .++.+. ...-.++.+.|...-...+|+..++..-..++..+. |....-..+ .....-||++.+.... T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA-------ALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCc-------cceeeccccccCcccc Confidence 222 222222 123346777777777788898888776555433322 222111110 0112347776664321 Q ss_pred -cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 78 -ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 78 -~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .-....-+... +...+.||.-. +.... .+..+|=..+-...+.+-+|.+++.|.- .++.+. ++..... 
T Consensus 193 ~~~~~v~~~~~k-~~~~~~iS~el--l~ds~-~~l~~~i~~~l~~~~~~~~~~~il~g~g----~g~~~~---~~~~~~~ 261 (415) T protein:vir:98 193 KPFFQLAYDINT-HRGYFRISREA--IEDAK-VNVLQELKLWMARTIAATRNKAIIDVIT----KGSTGS---TSSGFEK 261 (415) T ss_pred cceeeEEeeeee-eEeeehhhHHH--Hhhch-HHHHHHHHHHHHHHHHHHHHHHHhhccc----cCcccc---ccccccc Confidence 11111111111 22234444333 22222 2333444445556678889999997742 221111 1111110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ..........+-++|.+++.++-..+.....++||+.....+..+. +.+ .++...+.... | T Consensus 262 ~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk-d~~----G~~l~~~~~~~--~ 323 (415) T protein:vir:98 262 E-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE--K 323 (415) T ss_pred c-----------ccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcCC--C Confidence 0 1122334557888899999998887777778999999888887653 332 22222211100 0 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeecCc-ccceecCCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVLRA-PERTKLAKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~Lr~-~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) .. -+=+| ++|+....||... +++.|++..-+.+.|. +..+...-..+......+..+...+.+|.|.. T Consensus 324 -~~---~~l~G--~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:98 324 -TQ---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred -CC---ceecc--eeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEE Confidence 00 12256 3566656666432 7888887532223222 22222222233445566777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+++ T Consensus 398 ~~~~~~~~ 405 (415) T protein:vir:98 398 VIEYDDSE 405 (415) T ss_pred EEEEeccC Confidence 99999988 No 53 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.19 E-value=5.1e-07 Score=55.06 Aligned_cols=276 Identities=10% Similarity=-0.079 Sum_probs=141.3 Q ss_pred CCc-eeeeee-eeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MAT-LVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~-~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ++. .++.+. ...-.++.+.|...-...+|+..++..-..++..+. |....-..+ .....-||++.+.... T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA-------ALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCc-------cceeeccccccCcccc Confidence 222 222222 123346777777777788898888776555433322 222111110 0112347776664321 Q ss_pred -cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 78 -ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 78 -~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .-....-+... +...+.||.-. +.... .+..+|=..+-...+.+-+|.+++.|.- .++.+. ++..... 
T Consensus 193 ~~~~~v~~~~~k-~~~~~~iS~el--l~ds~-~~l~~~i~~~l~~~~~~~~~~~il~g~g----~g~~~~---~~~~~~~ 261 (415) T protein:vir:81 193 KPFFQLAYDINT-HRGYFRISREA--IEDAK-VNVLQELKLWMARTIAATRNKAIIDVIT----KGSTGS---TSSGFEK 261 (415) T ss_pred cceeeEEeeeee-eEeeehhhHHH--Hhhch-HHHHHHHHHHHHHHHHHHHHHHHhhccc----cCcccc---ccccccc Confidence 11111111111 22234444333 22222 2333444445556678889999997742 221111 1111110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ..........+-++|.+++.++-..+.....++||+.....+..+. +.+ .++...+.... | T Consensus 262 ~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk-d~~----G~~l~~~~~~~--~ 323 (415) T protein:vir:81 262 E-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE--K 323 (415) T ss_pred c-----------ccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcCC--C Confidence 0 1122334557888899999998887777778999999888887653 332 22222211100 0 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeecCc-ccceecCCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVLRA-PERTKLAKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~Lr~-~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) .. -+=+| ++|+....||... +++.|++..-+.+.|. +..+...-..+......+..+...+.+|.|.. T Consensus 324 -~~---~~l~G--~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:81 324 -TQ---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred -CC---ceecc--eeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEE Confidence 00 12256 3566656666432 7888887532223222 22222222233445566777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+++ T Consensus 398 ~~~~~~~~ 405 (415) T protein:vir:81 398 VIEYDDSE 405 (415) T ss_pred EEEEeccC Confidence 99999988 No 54 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.19 E-value=5.1e-07 Score=55.06 Aligned_cols=276 Identities=10% Similarity=-0.079 Sum_probs=141.3 Q ss_pred CCc-eeeeee-eeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MAT-LVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~-~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ++. .++.+. ...-.++.+.|...-...+|+..++..-..++..+. |....-..+ .....-||++.+.... T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~E~~~~~~~~~ 192 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVA-------ALEKVEELEENPELAV 192 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCc-------cceeeccccccCcccc Confidence 222 222222 123346777777777788898888776555433322 222111110 0112347776664321 Q ss_pred -cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 78 -ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 78 -~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .-....-+... +...+.||.-. +.... .+..+|=..+-...+.+-+|.+++.|.- .++.+. ++..... 
T Consensus 193 ~~~~~v~~~~~k-~~~~~~iS~el--l~ds~-~~l~~~i~~~l~~~~~~~~~~~il~g~g----~g~~~~---~~~~~~~ 261 (415) T protein:vir:79 193 KPFFQLAYDINT-HRGYFRISREA--IEDAK-VNVLQELKLWMARTIAATRNKAIIDVIT----KGSTGS---TSSGFEK 261 (415) T ss_pred cceeeEEeeeee-eEeeehhhHHH--Hhhch-HHHHHHHHHHHHHHHHHHHHHHHhhccc----cCcccc---ccccccc Confidence 11111111111 22234444333 22222 2333444445556678889999997742 221111 1111110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ..........+-++|.+++.++-..+.....++||+.....+..+. +.+ .++...+.... | T Consensus 262 ~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lk-d~~----G~~l~~~~~~~--~ 323 (415) T protein:vir:79 262 E-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE--K 323 (415) T ss_pred c-----------ccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcCC--C Confidence 0 1122334557888899999998887777778999999888887653 332 22222211100 0 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeecCc-ccceecCCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVLRA-PERTKLAKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~Lr~-~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) .. -+=+| ++|+....||... +++.|++..-+.+.|. +..+...-..+......+..+...+.+|.|.. T Consensus 324 -~~---~~l~G--~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:79 324 -TQ---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred -CC---ceecc--eeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccEE Confidence 00 12256 3566656666432 7888887532223222 22222222233445566777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+++ T Consensus 398 ~~~~~~~~ 405 (415) T protein:vir:79 398 VIEYDDSE 405 (415) T ss_pred EEEEeccC Confidence 99999988 No 55 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.19 E-value=8.6e-07 Score=53.83 Aligned_cols=291 Identities=12% Similarity=-0.016 Sum_probs=146.8 Q ss_pred CCc----------------eeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCcccccccc Q lcl|NC_019402. 1 MAT----------------LVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRN 64 (318) Q Consensus 1 Ma~----------------~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~n 64 (318) ||+ .++......-..+.++|...-....|+..+.......+..+.....+..+.+..-...... T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 332 1222222355556677777778888888877665555544443333322211000000111 Q ss_pred ceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCc Q lcl|NC_019402. 65 AVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATV 144 (318) Q Consensus 65 a~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~ 144 (318) ..-||.+.+..........-+... +..-+.||. +.+.... .+...+=..+-.+.+.+.+|.++|+|.-.. 
+ T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k-~~~~~~is~--ell~ds~-~~~~~~i~~~la~a~~~~~d~~~l~G~g~~-----~ 151 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIK-LATIVTVSE--EFARMNP-SGLYTKLQADLAYAIGRGIDLAVFHGKSPL-----T 151 (338) T ss_pred cccccccccccccceeEEEEEEEE-EEEeehhhH--HHHhcCH-HHHHHHHHHHHHHHHHHHHHHHhhcccCCC-----c Confidence 233666655443221111111111 222233333 3322211 233344445556678999999999885321 1 Q ss_pred cchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCC-CCcCEEEEcchHHhhhhhhhh--hhcccc Q lcl|NC_019402. 145 ARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSG-SEANIIMFHPKHAAFFSSLME--TSGVTN 221 (318) Q Consensus 145 ~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G-~~~~~l~~~~~~k~~is~~~~--~~~~~~ 221 (318) +-...||.......+. .......+......+.|.+++.++-.+. .....++|+|.....+..+.. +.+ T Consensus 152 ~~~~~gi~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~--- 222 (338) T protein:vir:78 152 GSALQGIDTNNVIVNT------TNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDAN--- 222 (338) T ss_pred cccccccccccccccc------cccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCC--- Confidence 2223454432222111 0111123334456677777777765433 345568899987776655432 221 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCC---------ceEEEEehhhcceeecCcccceec--C----- Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPE---------NAVYFFTPSDWTQMVLRAPERTKL--A----- 285 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~---------~~~~~~D~~~~~~~~Lr~~~~e~l--a----- 285 (318) .++.. . .....+. . .+=+| +.++.+.+||. ..+++.|+++..+..-.++..+-. + T Consensus 223 -g~~l~-~-~~~~~~~-~---~~l~G--~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~ 293 (338) T protein:vir:78 223 -GNVDP-T-RINLAAS-A---GDLLG--LPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDN 293 (338) T ss_pred -Cceee-c-ccccCCC-C---ceeee--eeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeeccccccc Confidence 12211 1 1111111 1 12257 58888888875 337888999876665544433211 1 Q ss_pred ----------CCccceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
286 ----------KDGSYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 286 ----------ktGd~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ..-|-...+.+..+...+.+|.|..+|...+++ T Consensus 294 ~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 294 TSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCC Confidence 111334556677789999999999999998888 No 56 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.16 E-value=1.2e-06 Score=53.10 Aligned_cols=277 Identities=9% Similarity=0.000 Sum_probs=137.2 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeecc-ccc-cceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGK-EAI-NQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~-~~~-~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) .....+-.-....+.+.+.|+.--..+.+++..+.. ... ....+.|....-.+.+ .-.-||...+..... T Consensus 111 ~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a--------~~v~E~~~~~~~~~~ 182 (392) T protein:vir:13 111 RDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATA--------GIVGETAEIPESYPA 182 (392) T ss_pred hcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcce--------eeecccccccccccc Confidence 111111111112222333322222333444433332 122 2234566554432211 113377665553322 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) ..... .-.++=...|-=|.+.+..... +..++=...-...+.+-+|.+||+|. |++ . --||+...... 
T Consensus 183 f~~v~---~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~l~G~----Gt~-~---p~Gil~~~~~~ 250 (392) T protein:vir:13 183 TTQRS---MGGFKYGFASVVSYEFATDQVL-DLVGFLVSDAGPAIGDAMGRHFLTGT----GTG-Q---PRGILTDATGA 250 (392) T ss_pred eeeEE---eeeeeEEeeehhHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhccc----CCc-c---ccccccccccc Confidence 11111 1122222222223334332221 22233333444666778999999873 322 2 23666433211 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) .......+...++.++|.+++..+-..-.....++||+.....+..+. +.+ .++-..+. ...|.. T Consensus 251 --------~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk-d~~----G~~l~~~~--~~~g~~ 315 (392) T protein:vir:13 251 --------NAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLK-DAN----GQYLWQSA--LTVGAP 315 (392) T ss_pred --------cccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhh-ccC----CceeecCC--cCCCCC Confidence 112233455668888888887766433223345788998888777653 321 22222111 111111 Q ss_pred EEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec---CCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 239 VSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL---AKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 239 v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l---aktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) . +=+| +.++.+.+||++.+++.|++.+.+..-..+..+.. .-.-|......+.-+..++.+|.|..++... T Consensus 316 -~---~l~G--~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 316 -D---TFNG--KVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVT 389 (392) T ss_pred -c---eecc--eeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEee Confidence 1 1257 68899999999999999998765554444433221 1122444556666688999999999999999 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) ++| T Consensus 390 ~aa 392 (392) T protein:vir:13 390 PAA 392 (392) T ss_pred ccC Confidence 888 No 57 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=98.14 E-value=1.6e-06 Score=52.41 Aligned_cols=258 Identities=14% Similarity=0.068 Sum_probs=148.5 Q ss_pred CCceeeeeeeeeccc-cee----------eeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-FAN----------WISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d----------~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||..+|....-..|. +++ .+..+...++-+- --++.++ +.+.| ..+..+.+ ..|| T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~ti--~iP~~--~~igda~~---------~~eg 66 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLV-GQPGDTL--TFPAF--VYSGDATV---------VPEG 66 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceeccccc-CCCCCEE--Eeeee--cCCCcccc---------ccCC Confidence 996555443333332 222 2223333333221 1122333 34556 23333222 2377 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.| ..+.|++++-+.... .+|-+..-......-+.|.+...++.-- + +. 
T Consensus 67 ~~i~~~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~---~~dp~~~~~~~~~~~~a~~~d~~~~~~l-~--~~-------- 131 (276) T protein:vir:10 67 QKIPVDKIETNRREAKIHK-IGKGTDITDEALLSG---YGDPQGEAVRQHGLAIANKVDNDVLEAL-R--GT-------- 131 (276) T ss_pred CccCccccccceeeEEeeh-ccccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHH-h--cc-------- Confidence 7777666665555544545 357788887766543 2466666666667777888887776311 0 00 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) .. ..+..++|.+.|.++++++=+++...+.++|||.+.-.+-+..... ..+. ... T Consensus 132 --------~~------------~~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~----f~~~-s~~ 186 (276) T protein:vir:10 132 --------KL------------TVSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDN----FTRA-TEL 186 (276) T ss_pred --------cc------------cccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccc----cccc-ccc Confidence 00 0123457899999999999888888899999999866654321100 0000 000 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEeccc Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNPY 307 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~~ 307 (318) +..-.... .|-+..| ++|+.++.+|.++.+++.+..+.+..-+++..|. ..+.+ ++......=|+..+.||. T Consensus 187 g~~~~~~G---~ig~~~G--~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~~-~d~i~~~~~y~~~~~~~~ 260 (276) T protein:vir:10 187 GDNIIVKG---AFGEALG--AVIVRSKKLDEGEAILAKRGAVKLITKRDFFLETDRDPSTK-TTALYSDKHYVAYLYDES 260 (276) T ss_pred cccceecc---ccceecc--eeEEEcCCCCcceEEEEeccceeeeecCCceeecccchhhc-ccEEEEeeEEEEEEEcCc Confidence 11100011 1333467 6899999999999999999999987777765432 22222 222333333899999999 Q ss_pred ceeEEEEeccC Q lcl|NC_019402. 
308 ASGILEVKAGA 318 (318) Q Consensus 308 a~g~i~~lt~a 318 (318) +..+++-.+.| T Consensus 261 ~vv~~t~~~~~ 271 (276) T protein:vir:10 261 KAVKVTKGAGT 271 (276) T ss_pred ceEEEecCCcC Confidence 99999988888 No 58 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.13 E-value=5.4e-07 Score=54.94 Aligned_cols=275 Identities=9% Similarity=-0.061 Sum_probs=140.5 Q ss_pred CCceeee--eeeeecccceeeeEecCCcccceeeeeccccccceEEEee-ee-eccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSY--DLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQ-TD-ALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~--~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~-td-~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) .+..++. .....-+++.+.|...-...+|+..++.....++..+.+. .. .-... ....-||++.+... T Consensus 120 ~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~v~Eg~~~~~~~ 191 (415) T protein:vir:94 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA--------LEKVEELEENPELA 191 (415) T ss_pred hhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCcc--------ceeccccccccccc Confidence 2222221 1222334677777777778888888877655544433322 11 11111 11234777655322 Q ss_pred c-cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 77 R-ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 77 ~-~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) . ......-+.-. +...+.||. +.+.... .+...|=...-...+.+-+|.++|+|.- .++.+. ++..+. 
T Consensus 192 ~~~~~~i~~~~~k-~~~~~~is~--ell~ds~-~~~~~~i~~~l~~~~~~~~~~~il~g~g----~g~~~~---~~~~~~ 260 (415) T protein:vir:94 192 VKPFFQLAYDINT-HRGYFRISR--EAIEDAK-VNVLQELKLWMARTIAATRNKAIIDVIT----KGSTGS---TSSGFE 260 (415) T ss_pred cccceeeEeehee-eeeechhhH--HHHhhch-HHHHHHHHHHHHHHHHHHHHHHHhhccc----cCcccc---cccccc Confidence 1 11111111111 122233443 3333222 2333344445556778889999998732 221111 111111 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .. ..........+-++|.+++.++-..+.....++|||.....+..+. +.+ .++...+.-.. T Consensus 261 ~~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~~~-- 322 (415) T protein:vir:94 261 KE-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE-- 322 (415) T ss_pred cc-----------ccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CCeeeccCcCC-- Confidence 11 0112233456788899999988887777788999999888887653 322 12211111000 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeecCc-ccceecCCCccceeeEEEEEEeEEEecccce Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVLRA-PERTKLAKDGSYEKWMIEMEVGLRHRNPYAS 309 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~Lr~-~~~e~laktGd~~k~~i~~E~tLe~~N~~a~ 309 (318) + .. -+=+| +.|+....||.+. +++.|++..-+.+.|. +..+..--..+....+.+..+.+.+.+|.|. T Consensus 323 ~-~~---~~l~G--~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~ 396 (415) T protein:vir:94 323 K-TQ---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSA 396 (415) T ss_pred C-CC---ceecc--eeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccccE Confidence 0 00 01246 4566666666443 7788887533233332 2222222233445567777899999999999 Q ss_pred eEEEEeccC Q lcl|NC_019402. 
310 GILEVKAGA 318 (318) Q Consensus 310 g~i~~lt~a 318 (318) .+++..+++ T Consensus 397 ~~~~~~~~~ 405 (415) T protein:vir:94 397 IVIEYDDSE 405 (415) T ss_pred EEEEEeccC Confidence 999998888 No 59 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.12 E-value=6e-07 Score=54.71 Aligned_cols=281 Identities=13% Similarity=0.127 Sum_probs=141.0 Q ss_pred CC---------------ceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCcccccc Q lcl|NC_019402. 1 MA---------------TLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQK 62 (318) Q Consensus 1 Ma---------------~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~ 62 (318) || ++++.+. -...+++.+.|..---+.+||++++.-..+++...+-..-...... T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~------- 73 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERH------- 73 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcc------- Confidence 21 1221111 1234566666666666778999887765554443321111100000 Q ss_pred ccceeeccccccccccCcEEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccC Q lcl|NC_019402. 63 RNAVIEGSAAVDGERASTTVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVD 139 (318) Q Consensus 63 ~na~~EG~da~~~~~~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~ 139 (318) ..-..||.... ....+ .++.+. .=..--+.||.-.-.-...+ .+--++=..+-.+.+.+|++.++++|.. 
T Consensus 74 ~~~~~e~~~~~-~~~~~--~~~~~~~~~~k~~~~~~it~e~L~d~a~~-~d~e~~i~~~ia~~~a~~~~~~~~nGd~--- 146 (321) T protein:vir:31 74 RRPQDEGEWNE-NESDV--STGTIDISTEKATVAWDLPREVVQENPEG-EALADRILNLMTDAWSADVEDLAANGDE--- 146 (321) T ss_pred ccccccccccc-ccccc--eeeeeeeeeEEEEeehhccHHHHHhhhcc-hhHHHHHHHHHHHHHHHHHHhheeeccc--- Confidence 00011222111 11111 111111 11111233443322111112 2333333445566799999999999852 Q ss_pred CCCCcc--chhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCC-CCcC-EEEEcchHHhhhhhhhh Q lcl|NC_019402. 140 GSATVA--RQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSG-SEAN-IIMFHPKHAAFFSSLME 215 (318) Q Consensus 140 gs~t~~--r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G-~~~~-~l~~~~~~k~~is~~~~ 215 (318) .+..| ....|++..+..+. ...+.++..++.+.|.+++..|=..= ..++ ..+|++......-.... T Consensus 147 -~~~~~~~~~n~G~l~~a~~~~---------~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~ 216 (321) T protein:vir:31 147 -DAEDSFENQNDGFITVAEGDV---------ETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLT 216 (321) T ss_pred -cCCCcccccchhhhhhhcccc---------ccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHh Confidence 22222 33457665443321 12234456688888888888763210 1123 45678776554433322 Q ss_pred hhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCc------c Q lcl|NC_019402. 216 TSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDG------S 289 (318) Q Consensus 216 ~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktG------d 289 (318) +.+.- . ....-.+. .-.+-+| +.++..++||.+.+++.|++.+.+.+-+....+.....- + T Consensus 217 ~~~~~-------~-~~~~l~~~---~~~tl~G--~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~ 283 (321) T protein:vir:31 217 DRDTP-------L-GDNVIMGE---ADVNPFS--FPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDL 283 (321) T ss_pred cCCCc-------c-ccchhhcc---ccccccc--eeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccce Confidence 22110 0 00000111 1223356 789999999999999999999988777766554443321 2 Q ss_pred ceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
290 YEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 290 ~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) .....+.....-.+.++.|+++++++.-. T Consensus 284 ~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 284 HARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred eeEeeeeeecceeEeccccEEEEecCCcc Confidence 22333455577788999999999998876 No 60 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=98.09 E-value=1.7e-06 Score=52.26 Aligned_cols=257 Identities=12% Similarity=0.043 Sum_probs=141.7 Q ss_pred CCc--eeeeeeeeecccceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceee Q lcl|NC_019402. 1 MAT--LVSYDLNGKKLSFANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIE 68 (318) Q Consensus 1 Ma~--~~t~~~~~~~~dl~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~E 68 (318) ||| .|.....-+-|=+++.+. .+...++- +-|..--+-+.+.|. .+..+.+ -.| T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~---l~g~~G~tv~iP~~~--~ig~a~~---------~~~ 66 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNT---LVGQPGNTITFPAFV--YSGDAKV---------VPE 66 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceeccc---ccCCCCCEEEeeeec--cCCcccc---------ccC Confidence 655 433333333333333322 22222221 112111122345663 2333222 236 Q ss_pred ccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchh Q lcl|NC_019402. 69 GSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQT 148 (318) Q Consensus 69 G~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m 148 (318) |.+.+....+.....--+.|. .+.|++++-+.... .+|.+..-......-+.+.++..++.--. +.. 
T Consensus 67 g~~i~~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~ll~~l~---~a~------ 133 (275) T protein:vir:96 67 GEEIPIDLIETKKRQATIRKI-GKGTVLTDEALLSG---YGDPKGEAVRQHGLAIANKVDNDVLEALQ---GAT------ 133 (275) T ss_pred CCCcchhhcccceeeEEeehh-cccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------ Confidence 777666665555554444453 66678877654433 24666666666777788888888772210 000 Q ss_pred hhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEe Q lcl|NC_019402. 149 AGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMF 228 (318) Q Consensus 149 ~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~ 228 (318) . .....+++.+.|.++++++-+++..++.++|||.+...+-+..... ..+.. T Consensus 134 ---------~-------------~~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~------f~~~~ 185 (275) T protein:vir:96 134 ---------L-------------KVEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDN------FTRAT 185 (275) T ss_pred ---------c-------------cccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccccc------ccccc Confidence 0 0122457899999999999888888999999999876653321100 00000 Q ss_pred cCCceEE-EEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccce--ecCCCccceeeEEEEEEeEEEec Q lcl|NC_019402. 229 DGQDTRL-NVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERT--KLAKDGSYEKWMIEMEVGLRHRN 305 (318) Q Consensus 229 ~~~~~~~-g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e--~laktGd~~k~~i~~E~tLe~~N 305 (318) ...++.+ ... |-+-.| ++|+.++.+|.++.+++.+..+.+..-+++..| ..++.+ ++......=|++.+.| T Consensus 186 ~~g~~~~~~G~---ig~~~G--~~Vi~s~~~p~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~-~d~i~~~~~y~~~~~~ 259 (275) T protein:vir:96 186 LLGDNVIVKGA---FGEALG--AIIVRSNKIKEGEAILAKRGAVKLITKRDFFLETERHASHK-STALFSDKHYVAYLYD 259 (275) T ss_pred cccccceeccc---cceecC--eeEEEeCCCCcceEEEEeccceeeeecCCcccccccchhhc-CcEEEEeEEEEEEEEc Confidence 0011110 011 222256 689999999999999999999988766655433 222222 2223333337999999 Q ss_pred ccceeEEEEeccC Q lcl|NC_019402. 
306 PYASGILEVKAGA 318 (318) Q Consensus 306 ~~a~g~i~~lt~a 318 (318) |.+..+++-.++- T Consensus 260 ~~~vv~~t~~~~~ 272 (275) T protein:vir:96 260 ESKVVKITKSASG 272 (275) T ss_pred CccEEEEEecccc Confidence 9999998764444 No 61 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.09 E-value=1.8e-06 Score=52.08 Aligned_cols=276 Identities=10% Similarity=-0.066 Sum_probs=145.1 Q ss_pred CCc-eeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEee--eeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MAT-LVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQ--TDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~-~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~--td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ++. .++.. ....-+++.+.|...-...+|++.++.....++..+.+. ...-... ..-.-||++.+... T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~v~Eg~~~~~~~ 191 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA--------LEKVEELEENPELA 191 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcc--------eeeccccccccccc Confidence 222 22222 223445777888887788899988877655544433322 2111110 01133676655322 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-.+-+...+.||. +.+.... .+...|=..+-...+.+-+|.++|+|. |++..+. ++..+.. 
T Consensus 192 ~~~~~~v~~~~~k~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~~i~~~~d~~il~g~----g~g~~~~---~~~~~~~ 261 (415) T protein:vir:46 192 VKPFFQLAYDINTHRGYFRISR--EAIEDAK-VNVLQELKLWMARTIAATRNKAIIDVI----TKGSTGS---TSSGFEK 261 (415) T ss_pred ccceeeEEeeeeeeEeeehhhH--HHHhhch-HHHHHHHHHHHHHHHHHHHHHHHhhcc----ccCCccc---ccccccc Confidence 1111111111111222334443 3333222 233445555666677889999999873 2221111 1111111 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ..........+.++|.+++.++-..+.....+++|+.....+..+. +.+ .++...+.-.. + T Consensus 262 ~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~i~~~~~~~--~ 323 (415) T protein:vir:46 262 E-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE--K 323 (415) T ss_pred c-----------cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CCeeeccCcCC--C Confidence 1 1122344557888999999998888777788999999888887664 322 22222211100 1 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhhcceeecC-cccceecCCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSDWTQMVLR-APERTKLAKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~~~~~~Lr-~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) . . -+=+| ++++....||.. .+++.|++..-..+.| .+..+...-..+....+.+..+...+.+|.|.. T Consensus 324 ~-~---~~l~G--~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:46 324 T-Q---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred C-C---ccccc--eeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEE Confidence 0 0 12257 466666677643 3788888853222322 222222222334455566777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+++ T Consensus 398 ~~~~~~~~ 405 (415) T protein:vir:46 398 VIEYDDSE 405 (415) T ss_pred EEEeeccC Confidence 99998888 No 62 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.09 E-value=1.8e-06 Score=52.08 Aligned_cols=276 Identities=10% Similarity=-0.066 Sum_probs=145.1 Q ss_pred CCc-eeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEee--eeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MAT-LVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQ--TDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~-~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~--td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ++. .++.. ....-+++.+.|...-...+|++.++.....++..+.+. ...-... ..-.-||++.+... T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~v~Eg~~~~~~~ 191 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAA--------LEKVEELEENPELA 191 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcc--------eeeccccccccccc Confidence 222 22222 223445777888887788899988877655544433322 2111110 01133676655322 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-.+-+...+.||. +.+.... .+...|=..+-...+.+-+|.++|+|. |++..+. ++..+.. 
T Consensus 192 ~~~~~~v~~~~~k~~~~~~iS~--ell~ds~-~~l~~~i~~~l~~~i~~~~d~~il~g~----g~g~~~~---~~~~~~~ 261 (415) T protein:vir:47 192 VKPFFQLAYDINTHRGYFRISR--EAIEDAK-VNVLQELKLWMARTIAATRNKAIIDVI----TKGSTGS---TSSGFEK 261 (415) T ss_pred ccceeeEEeeeeeeEeeehhhH--HHHhhch-HHHHHHHHHHHHHHHHHHHHHHHhhcc----ccCCccc---ccccccc Confidence 1111111111111222334443 3333222 233445555666677889999999873 2221111 1111111 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) . ..........+.++|.+++.++-..+.....+++|+.....+..+. +.+ .++...+.-.. + T Consensus 262 ~-----------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~i~~~~~~~--~ 323 (415) T protein:vir:47 262 E-----------GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMK-DKL----GNYLIQPDVKE--K 323 (415) T ss_pred c-----------cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhh-ccC----CCeeeccCcCC--C Confidence 1 1122344557888999999998888777788999999888887664 322 22222211100 1 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhhcceeecC-cccceecCCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSDWTQMVLR-APERTKLAKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~~~~~~Lr-~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) . . -+=+| ++++....||.. .+++.|++..-..+.| .+..+...-..+....+.+..+...+.+|.|.. T Consensus 324 ~-~---~~l~G--~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:47 324 T-Q---QRLLG--AKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred C-C---ccccc--eeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccccEE Confidence 0 0 12257 466666677643 3788888853222322 222222222334455566777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+++ T Consensus 398 ~~~~~~~~ 405 (415) T protein:vir:47 398 VIEYDDSE 405 (415) T ss_pred EEEeeccC Confidence 99998888 No 63 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.04 E-value=1.7e-06 Score=52.20 Aligned_cols=283 Identities=9% Similarity=-0.085 Sum_probs=142.2 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccCc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAST 80 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~ 80 (318) |.+-++.......+.+.+.|+..-..+.|++.+.......+....+....-... . .-.-||.+.+....... T Consensus 20 ~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~----a----~~v~Eg~~~~~~~~~f~ 91 (326) T protein:vir:42 20 AQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVS----A----SWIGEGDMKPITKGNMT 91 (326) T ss_pred eeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcc----e----EEecCCcccccccccee Confidence 322222223346677777777777778888877655554444444433322111 1 11238887776543322 Q ss_pred EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCc Q lcl|NC_019402. 81 TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDA 160 (318) Q Consensus 81 ~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~ 160 (318) . +.=-..-+...+.||. +.+.... .+..+|=..+-.+.+.+.+|.++|+|. |++ .| .||..-...... 
T Consensus 92 ~-i~~~~~k~~~~v~iS~--ell~~s~-~~~~~~i~~~l~~a~~~~~d~a~l~G~----gs~-~p---~gi~~~~~~~~~ 159 (326) T protein:vir:42 92 S-QTIAPHKIATIFVASA--ETVRANP-ANYLGTMRTKVATAFAMAFDNAAINGT----DSP-FP---TFLAQTTKEVSL 159 (326) T ss_pred E-EEEeeEEEEEeehhhH--HHHhcCH-HHHHHHHHHHHHHHHHHHHHHHhhccc----CCC-cc---ccccccccccce Confidence 2 2211222333444443 4433222 233444444555678999999999884 322 22 223211110000 Q ss_pred ccCccccceeeccCccccCHHHH--HHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE- Q lcl|NC_019402. 161 ADPDTGAIVHFETAAAALTEAEI--FKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV- 237 (318) Q Consensus 161 ~~~~~g~~~~~~~t~~~lTe~~l--~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~- 237 (318) . .....+....++.+++ .+.+..+-..+.....+++++.....+.++. +.+ .++...+. ...+. T Consensus 160 ~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lk-d~~----G~~l~~~~--~~~~~~ 226 (326) T protein:vir:42 160 V------DPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAK-DKS----GRPLFIES--TYTEEN 226 (326) T ss_pred e------ecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhh-ccC----Cceeeccc--cccCcc Confidence 0 0011122223333332 2333333333445567889999888887764 321 12221111 11010 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEE--EehhhcceeecCcccce----ecCCCc-------------cceeeEEEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYF--FTPSDWTQMVLRAPERT----KLAKDG-------------SYEKWMIEME 298 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~--~D~~~~~~~~Lr~~~~e----~laktG-------------d~~k~~i~~E 298 (318) ..-..-+-+| +.++.+.++|+++.++ .|++.+-+..-.++..+ ..-++| |....+.... T Consensus 227 ~~~~~~~l~G--~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~ 304 (326) T protein:vir:42 227 SPFRLGRIVA--RPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAE 304 (326) T ss_pred ccccCceeee--eeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEE Confidence 0001223466 6888999999987644 47766544322233221 111222 3455566778 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +..++.+|+|...|++.+++ T Consensus 305 ~d~~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 305 YAFHCNDKDAFVKLTNVDAT 324 (326) T ss_pred eccEEecccceEEEeecccc Confidence 99999999999999999999 No 64 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.99 E-value=3e-06 Score=50.90 Aligned_cols=273 Identities=9% Similarity=0.012 Sum_probs=133.6 Q ss_pred CCceeeee--eeeecccceeeeEecCCcccceeeeeccc-cc-cceEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKE-AI-NQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~-~~-~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ....++.. .....+...+.|..+-.. .+++..+... .. ......|...+-...+ .-.-||...+... T Consensus 110 ~~~~t~~~~g~~~~~~~~~~~i~~~~~~-~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a--------~wv~E~~~~~~~~ 180 (390) T protein:vir:62 110 KRDGTKAGNPNVLSRTLYGQLIAQAVER-SAIMRGGATTFTTSDANPLDFTVITGRSSA--------SIVGETAEIPESY 180 (390) T ss_pred hhcccccCCCccccccchHHHHHHHHhh-hhhhhhcceeeecCCCceeEEEEEcCCcce--------eeecccccccccc Confidence 11111111 111222223333333333 3333322221 11 1122345443322111 1133777665533 Q ss_pred ccCcEEecceEEEEeee-eeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 77 RASTTVINNVTQILRKV-VKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~-~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) .. +.+++=-.++- ..+.=|.+.+..... +..++=...-...+.+-+|.+||+|. | +| -||+... 
T Consensus 181 ~~----f~~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~i~~~~d~~~l~G~----G---~p---~Gi~~~~ 245 (390) T protein:vir:62 181 PA----TAQRSMGGFKYGFASVVSYEFATDQVL-DLVGFLVSDAGPAIGDAMGRHFITGT----G---QP---RGILTDA 245 (390) T ss_pred cc----eeeeEeeeeeEEeehHHHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHhhhhccC----C---cc---ccccccc Confidence 22 23332222221 122223444433221 22222223334556777999999773 2 23 3565432 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) ... ......++...++.++|.++..++..+-.....++||+.....+..+. +.+ .++-..++- .. T Consensus 246 ~~~--------~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lk-d~~----g~~l~~~~~--~~ 310 (390) T protein:vir:62 246 SPA--------TATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLK-DAN----GQYLWQSGL--TV 310 (390) T ss_pred ccc--------ccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhh-ccC----CCeeecCCc--CC Confidence 111 111233444568888888888777543322335788999888887664 322 233222211 11 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec---CCCccceeeEEEEEEeEEEecccceeEE Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL---AKDGSYEKWMIEMEVGLRHRNPYASGIL 312 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l---aktGd~~k~~i~~E~tLe~~N~~a~g~i 312 (318) |. --+=+| +.|+.+.+||++.+++.|++++-+..-.++..+.. .-+-|........-+...+.+|+|..+| T Consensus 311 g~----~~~l~G--~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l 384 (390) T protein:vir:62 311 GA----PSLFNG--KVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVL 384 (390) T ss_pred Cc----cceecc--cceEEecCCCCccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEE Confidence 11 112357 57888999999999999998765544444433221 1122334444555588999999999999 Q ss_pred EEeccC Q lcl|NC_019402. 
313 EVKAGA 318 (318) Q Consensus 313 ~~lt~a 318 (318) +..++| T Consensus 385 ~~~~~a 390 (390) T protein:vir:62 385 TVTPGA 390 (390) T ss_pred EeecCC Confidence 999999 No 65 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.97 E-value=1.4e-06 Score=52.76 Aligned_cols=276 Identities=11% Similarity=-0.023 Sum_probs=145.2 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |+.-++.. .-...+.+.+.|+..-..+.|++++.......+.........-... ..-.-||...+..... T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~--------a~wv~Eg~~~~~s~~~- 80 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVS--------AQWIGEGDMKPITKGN- 80 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcc--------eEEecCCccccccccc- Confidence 44333322 2234555666665555677888887665554443333322221111 1112377666554322 Q ss_pred cEEecceE-EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 80 TTVINNVT-QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 80 ~~~~~N~t-QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) +.+++ .+.+-...|.=|.+.+... ..+...+=...-.+.+.+.+|.++|+|. |+ +....|+. ... 
T Consensus 81 ---f~~v~l~~~k~~~~v~iS~ell~ds-~~~l~~~i~~~l~~aia~~~d~a~l~G~----gt---~~~~~~~~---~~~ 146 (397) T protein:vir:23 81 ---MTKRDVHPAKIATIFVASAETVRAN-PANYLGTMRTKVATAIAMAFDNAALHGT----NA---PSAFQGYL---DQS 146 (397) T ss_pred ---eeEEEEeeEEEEEeehhhHHHHhcc-hHHHHHHHHHHHHHHHHHHHHHHHhhcc----cC---Cccccccc---ccc Confidence 22222 2222222233333333322 1233444445666678899999999874 22 12222322 111 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVY 238 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~ 238 (318) + .........+.+++.+++.++..++.....+++++.....+.++.. . +.|+...+. ...+.. T Consensus 147 ~----------~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd-~----~G~~i~~~~--~~~~~~ 209 (397) T protein:vir:23 147 N----------KTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVD-A----NGRPLFVES--TYESLT 209 (397) T ss_pred c----------ceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhc-c----CCceeeccc--cccccc Confidence 1 1112234456777888888999988888889999998888877642 2 123322221 111111 Q ss_pred EEE-EEcCCCcEEEEEecCCCCCceE--EEEehhhcceeecCcccceec----CCCc-------------cceeeEEEEE Q lcl|NC_019402. 239 VSS-IVDPLGCQYKLVPNRWMPENAV--YFFTPSDWTQMVLRAPERTKL----AKDG-------------SYEKWMIEME 298 (318) Q Consensus 239 v~~-~~tdfG~~v~iv~nr~m~~~~~--~~~D~~~~~~~~Lr~~~~e~l----aktG-------------d~~k~~i~~E 298 (318) ... ..+-+| +.++.+.+||+++. ++-|++.+-+..-+++..+-. -+.| |-...+.+.. T Consensus 210 ~~~~~~tl~G--~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r 287 (397) T protein:vir:23 210 TPFREGRILG--RPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAE 287 (397) T ss_pred ccccCceeee--eeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEee Confidence 100 012256 58889999998875 445777654443333322211 1111 3345566677 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +...+++|.|...+...+.+ T Consensus 288 ~d~~v~~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 288 YGLLINDVNAFVKLTFDPVL 307 (397) T ss_pred eccceecccceEEEeecccc Confidence 89999999999999987776 No 66 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.95 E-value=3.2e-06 Score=50.69 Aligned_cols=266 Identities=13% Similarity=0.025 Sum_probs=138.5 Q ss_pred CCc--eeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MAT--LVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~--~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) .+. ..+......-+++...|+..-....|+.+++...+.++....+........+. ..-..||.+.+..... T Consensus 106 ~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~v~Eg~~~~~~~~~ 179 (379) T protein:vir:10 106 VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGA------IGAQVEGATKGQKDYD 179 (379) T ss_pred hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcc------cccccCCccccccccc Confidence 222 22222445667777777777778888887776655555555554433211100 0113477766654322 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHH-HHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEK-AGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k-~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) . ..+.--..-+...+.||... +.... + +...+.. -...+.+-++.+|++|.. 
.+.+ .+ T Consensus 180 f-~~i~~~~~k~~~~~~iS~el--l~D~~--~-l~~~i~~~la~~~~~~~~~~~~~g~~----~~~~----~~------- 238 (379) T protein:vir:10 180 I-SMIDVNTDFIAGFTRYSKKM--ANNLP--F-LTSFIPNALRRDYAKAENAAFNAVLA----ANAT----AS------- 238 (379) T ss_pred e-eeeEeeeeeEEeeehhhHHH--HhhHH--H-HHHHHHHHHHHHHHHHHHHHHhcccc----cccc----cc------- Confidence 1 11111112222234455433 22221 1 2222222 223345556777765421 1100 00 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) ..+.....+.++|.+++.++-.++...+.++|||..-..+..+. +.+ .++.........-|. T Consensus 239 -------------~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~~~~~~~ 300 (379) T protein:vir:10 239 -------------TEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQ-KSV----GAGYGLPGVVTQDNG 300 (379) T ss_pred -------------cccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh-ccC----CceeccCCccCCCCC Confidence 00111223456777777777777878888999998777777664 222 233222111110011 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC-----CCccceeeEEEEEEeEEEecccceeEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA-----KDGSYEKWMIEMEVGLRHRNPYASGIL 312 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la-----ktGd~~k~~i~~E~tLe~~N~~a~g~i 312 (318) ..+=+| +.++.+.+||++++++.|++.+.+.+-+....+--. -.-+......+..+++.+++|.|...+ T Consensus 301 ----~~~l~G--~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~ 374 (379) T protein:vir:10 301 ----VLRING--IPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFG 374 (379) T ss_pred ----cceecc--eeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEE Confidence 013357 688999999999999999999766543333221111 122344555556799999999998887 Q ss_pred EEecc Q lcl|NC_019402. 
313 EVKAG 317 (318) Q Consensus 313 ~~lt~ 317 (318) +..+- T Consensus 375 ~~~~~ 379 (379) T protein:vir:10 375 DFTAV 379 (379) T ss_pred EecCC Confidence 76666 No 67 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.90 E-value=2.8e-06 Score=51.00 Aligned_cols=280 Identities=14% Similarity=0.040 Sum_probs=139.9 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceeeeeccccccc--eEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQ--TLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~--~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) ||+.++..-...-+.+.++|...-....|+..+.......+ ..+-+.+.... + .-.-||.+.+..... T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~-a---------~wv~Eg~~~~~~~~~ 70 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPK-A---------EFVGEGQQKSSTTGE 70 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCce-e---------EEeecCcccccccce Confidence 99888776666677888888888888888877665543332 23333332211 1 112377776654332 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) .....-+... +.--+.||.-..........+...+-..+-.+.+.+.+|.++|+|.-. +.++.+. |+..++... 
T Consensus 71 f~~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~--~~g~~~~---g~~~~~~~~ 144 (311) T protein:vir:99 71 FDFVTSTPKK-AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINP--LTGTVIP---GWSNYLGAA 144 (311) T ss_pred eeEEEEeeEE-EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCc--ccCcccc---ccccccccc Confidence 2222222222 222344443322111111123344555566677899999999988431 1122222 232222111 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCc--CEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEA--NIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~--~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) ...+............++.+++..+-.++... +-+++|+.....+..+. +.+ .|+...+.. . + T Consensus 145 -------~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~~----G~~l~~~~~-~--~ 209 (311) T protein:vir:99 145 -------SKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTAR-YTD----GRKKFPELG-L--G 209 (311) T ss_pred -------cceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhh-ccC----CCeeecCcc-c--C Confidence 00111122233344567788887777665443 44889999988887764 322 232221110 0 0 Q ss_pred EEEEEEEcCCCcEEEEEecCCCC----------------CceEEEEehhh-cceeecCcccceecCC-----Cc-----c Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMP----------------ENAVYFFTPSD-WTQMVLRAPERTKLAK-----DG-----S 289 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~----------------~~~~~~~D~~~-~~~~~Lr~~~~e~lak-----tG-----d 289 (318) ... -+=+| +.++.+.++| .+.+++.|.+. +.+...+....+-+.. ++ | T Consensus 210 ~~~---~~l~G--~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 284 (311) T protein:vir:99 210 IGV---SSFEG--IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHN 284 (311) T ss_pred CCC---ceecc--eeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcC Confidence 000 01134 2344433332 23466677764 4454444443322211 11 2 Q ss_pred ceeeEEEEEEeEEEecccceeEEEEecc Q lcl|NC_019402. 
290 YEKWMIEMEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 290 ~~k~~i~~E~tLe~~N~~a~g~i~~lt~ 317 (318) -...+....++..+++| ++.++...++ T Consensus 285 ~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 285 QIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred cEEEEEEEeecceecCh-hHeeeecccC Confidence 23455567789999997 5667777777 No 68 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=97.89 E-value=1.2e-05 Score=47.52 Aligned_cols=254 Identities=13% Similarity=0.099 Sum_probs=138.0 Q ss_pred CCceeeeeeeeeccc-ceeee----------EecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-FANWI----------SNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d~I----------~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-.|.......|. +++.+ ..+.+.+..+ +.-.+.++ ..+.|. .+..+.+ ..|| T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l-~g~~G~ti--~iP~~~--~~gda~~---------~~eg 66 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTL-QGQPGNTL--KFPAFT--YIGDAAD---------VAEG 66 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhcccccccccc-ccCCCCEE--EEeeec--cCccccc---------cCCC Confidence 997555544443333 22222 3333444432 21123333 345573 3332221 2367 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|. .+.|+|++-+.... .++.+.+-..+....+.|.+++.++..-. |. 
T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~i~~~l~---~~-------- 131 (272) T protein:vir:36 67 GEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSG---YGDPIGESNKQLGLSLANKVDDDLLSAAK---TT-------- 131 (272) T ss_pred CccChhhcCCcceeEeeehh-hccccccHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHhc---cc-------- Confidence 76666655555554455564 67888888666543 24555555556666678888887763210 10 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) .......++.+.|.++++++=+++...+.++|||.+...+.+..... ...... T Consensus 132 ---------------------~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~------~~~~~~ 184 (272) T protein:vir:36 132 ---------------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAK------NIGSEV 184 (272) T ss_pred ---------------------cccccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccc------cccccc Confidence 01122357889999999999999988999999999776664432110 000111 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEE----EEehhhcceeecCcccce--ecCCC-ccceeeEEEEEEeEE Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVY----FFTPSDWTQMVLRAPERT--KLAKD-GSYEKWMIEMEVGLR 302 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~----~~D~~~~~~~~Lr~~~~e--~lakt-Gd~~k~~i~~E~tLe 302 (318) +..-.....+-.| .| ++|+.++.||.++.+ ++=+..+.+...+++..| ..++. .|.-... -=|++. T Consensus 185 ~~~~~~~G~ig~~---~G--~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~--~~y~~~ 257 (272) T protein:vir:36 185 GANALINGTYADV---LG--AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITAD--EHYAAY 257 (272) T ss_pred cccceeeecccee---cC--eeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchhhcCcEEEEE--EEEEEE Confidence 1111111112222 46 689999999998864 333555554434554433 22222 2332222 238999 Q ss_pred EecccceeEEEEecc Q lcl|NC_019402. 
303 HRNPYASGILEVKAG 317 (318) Q Consensus 303 ~~N~~a~g~i~~lt~ 317 (318) +.||.+..+++..-- T Consensus 258 v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 258 LYDLTKVVNITFTGV 272 (272) T ss_pred EEcCccEEEEeecCC Confidence 999999888765555 No 69 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.89 E-value=2.8e-06 Score=51.03 Aligned_cols=262 Identities=8% Similarity=-0.082 Sum_probs=132.4 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) ...+++.+ -...-+++.+.|...-...+|+..++.....++..+.|..-..... ...-.-||...+....-. T Consensus 128 ~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~E~~~~~~~~~~~ 200 (394) T protein:vir:97 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATT-------KMVTVAELEKNPALAKPD 200 (394) T ss_pred ccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCC-------ccceeccccccccccccc Confidence 22233222 1123355666676666677888877666555554555554322111 011234776655321111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.=-..=+..-+.||.-.-.-......+.+..++ .+.+.+-++.++|+|.. +. 
T Consensus 201 ~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~l---a~~~~~~~~~~i~~g~~----~~----------------- 256 (394) T protein:vir:97 201 FKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI---SQIKVNTTNDAIAKVLK----SF----------------- 256 (394) T ss_pred ceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHH---HHHHHHHHHHHHhhccc----cc----------------- Confidence 1111111112223345554322211111222333333 34455667788886521 10 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) .+....+.++|.+++........+ ..++|||.....|..+. +.+ .++.....-.. |. - T Consensus 257 -------------~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~n~~~~~~l~~lk-d~~----G~~i~~~~~~~--~~-~ 314 (394) T protein:vir:97 257 -------------TTKTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLK-DGN----GRYLLQDDITA--VS-G 314 (394) T ss_pred -------------cccccccHHHHHHHHHhhhhhhhC-CEEEEcHHHHHHHHHhh-ccC----CCeeeecCcCC--CC-C Confidence 011224556777777766544333 35788998887777654 322 23322111100 00 0 Q ss_pred EEEEcCCCcEEEEEecCCCCCceEEEEehhh-cceeecCcccceecCCCccceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENAVYFFTPSD-WTQMVLRAPERTKLAKDGSYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~-~~~~~Lr~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) -+=+|..|.++.+..++++.+++.|++. 
+.+........+-.--..+......+..+...+.+|.|...|+..+++ T Consensus 315 ---~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 315 ---KVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) T ss_pred ---ceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecccccceeEEEEEEEccEEecccceEEEEecccc Confidence 0226755555668888999999999875 333323333332222233344567778899999999999999998888 No 70 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.89 E-value=6.4e-06 Score=49.08 Aligned_cols=266 Identities=11% Similarity=0.003 Sum_probs=139.8 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |++.++.+ -...-+++.+.|...=....|+.++......+.....|..-...... ....-.-||.+.++...-. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~-----~~a~~v~Eg~~~~~~~~~~ 79 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDIT-----GLANIDDEAGKIADIDDPK 79 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCC-----cceeeecCCcccccccccc Confidence 77766655 23456777788877777888887775543333222222211110000 0001133676655422111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.=-..=+...+.||.-...-......+.+..++. +.+.+-+++++++|... 
.+ T Consensus 80 ~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la---~~~~~~~~~~i~~g~~~----~~---------------- 136 (293) T protein:vir:48 80 LSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIA---KKVVVTRNKAILGVVDK----LP---------------- 136 (293) T ss_pred eeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHH---HHHHHHHHhHHhhcccc----cc---------------- Confidence 12222222222333445543322222222334444443 44556677777765211 10 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) .....++-++|.+++.++-.+......++||+.....+..+. +. ..|+...+.-.. +. T Consensus 137 -------------~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lk-d~----~g~~l~~~~~~~--~~-- 194 (293) T protein:vir:48 137 -------------TKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVK-NA----LGDYLMERDVKS--PT-- 194 (293) T ss_pred -------------ccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-cc----CCceEeecCcCC--CC-- Confidence 112346788899888888666555567889999887777654 32 223322221111 00 Q ss_pred EEEEcCCCcEEEEEecCCCCCc-----eEEEEehhh-cceeecCcccceecCC-----CccceeeEEEEEEeEEEecccc Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSD-WTQMVLRAPERTKLAK-----DGSYEKWMIEMEVGLRHRNPYA 308 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~-~~~~~Lr~~~~e~lak-----tGd~~k~~i~~E~tLe~~N~~a 308 (318) --+=+|..|.++.+..+|.+ .+++.|++. +.+....++..+...- .-+.....++..+...+++|+| T Consensus 195 --~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a 272 (293) T protein:vir:48 195 --GYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEA 272 (293) T ss_pred --CceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccc Confidence 11236755555667777753 378888774 4444444444332221 2345666677889999999999 Q ss_pred eeEEEEeccC Q lcl|NC_019402. 
309 SGILEVKAGA 318 (318) Q Consensus 309 ~g~i~~lt~a 318 (318) ...++..+++ T Consensus 273 ~~~l~~~~~~ 282 (293) T protein:vir:48 273 FVPASFKAIA 282 (293) T ss_pred eEEEEeeccc Confidence 9999977766 No 71 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.88 E-value=2.2e-06 Score=51.63 Aligned_cols=279 Identities=13% Similarity=0.075 Sum_probs=140.5 Q ss_pred CCceeeeeee---eecccceeeeEecCCcccceeeeeccccccc-eEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYDLN---GKKLSFANWISNLSPTDTPFVSMTGKEAINQ-TLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~~~---~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~-~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) -++-++.+.. ..-+.+.+.|...=....|+.+++...+.++ ....|....-.... .--.-||...+... T Consensus 115 ~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~v~E~~~~~~~~ 187 (409) T protein:vir:45 115 RAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEV-------GVLLGENEEAGEED 187 (409) T ss_pred hhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccc-------cccccccccccccc Confidence 1222222221 2345566666666667778876655444433 34677765432211 11234666655433 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .......-.--.+....+.|| .+.+.-.. -+...|=...-...+.+-+|.+||+|.-. ++ +.+.-||+.... 
T Consensus 188 ~~f~~~~l~~~k~~~~~i~is--~ell~ds~-~~l~~~i~~~la~a~~~~~~~a~l~G~G~--~~---~~~p~Gil~~~~ 259 (409) T protein:vir:45 188 TDFGMGSLGALKMTSKIIRVS--NELLQDSA-IDMEAYLARRIAERIGRGEARYLIQGTGA--GT---PKQPKGLAASVT 259 (409) T ss_pred cccceeeeeeeeeeeeehhhh--HHHHhccH-HHHHHHHHHHHHHHHHHHHHHHhhccCCC--CC---ccccceeeeccc Confidence 221111101111111223344 44433221 13333333344566778899999987421 11 123346653221 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcC-EEEEcchHHhhhhhhhhhhcccccceEEEecCCceE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EAN-IIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTR 234 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~-~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~ 234 (318) . .....+..+++.++|.+++..+-.+.. ++. .++||+.....+..+. +. ..++...... . T Consensus 260 ~-----------~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lk-d~----~G~~i~~~~~--~ 321 (409) T protein:vir:45 260 G-----------TTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEME-DG----QGRPLWLPDI--V 321 (409) T ss_pred c-----------ccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhh-cC----CCceeeccCc--C Confidence 1 012234456888899998887755422 223 3457888777776664 32 2233222111 1 Q ss_pred EEEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhhcceeecCcccceecC---CCccceeeEEEEEEeEEEecc Q lcl|NC_019402. 235 LNVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIEMEVGLRHRNP 306 (318) Q Consensus 235 ~g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~~E~tLe~~N~ 306 (318) -|. -.+=+| +.|+.+.+||. ..+++.|++.+-+..-.+...+.+. -.-+......+.-+...+.+| T Consensus 322 ~~~----~~~l~G--~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~ 395 (409) T protein:vir:45 322 GVA----PASVLN--VPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDT 395 (409) T ss_pred CCC----Cceecc--eeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeech Confidence 111 113368 58899999984 3466679987544332232222111 111333445555688999999 Q ss_pred cceeEEEEeccC Q lcl|NC_019402. 
307 YASGILEVKAGA 318 (318) Q Consensus 307 ~a~g~i~~lt~a 318 (318) .|..+++..+++ T Consensus 396 ~A~~~l~~k~s~ 407 (409) T protein:vir:45 396 SAIKALVGKGSV 407 (409) T ss_pred hheEEEEeccCC Confidence 999999998887 No 72 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=97.87 E-value=1.4e-05 Score=47.22 Aligned_cols=257 Identities=13% Similarity=0.055 Sum_probs=147.3 Q ss_pred CCceeeeeeeeeccc-ceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-FANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|.-..-..|. +++.+. .+...+.- +-|+.--+.+.+.|. .+..+.+ --|| T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~---l~g~~G~tv~iP~~~--~ig~a~~---------~~~g 66 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNT---LVGQPGDTLTFPAFI--YSGDAKV---------VAEG 66 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccc---ccCCCCCEEEeeeec--CCCcccc---------ccCC Confidence 998666554444443 222221 22112221 112111122345664 2332222 2367 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....-.+.|. .+.|++++-+.... .++-++.-......-+.+.++..++.--.. 
+ T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~----a------- 131 (274) T protein:vir:95 67 EKIPTDILETKKREAKIRKI-AKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALKS----A------- 131 (274) T ss_pred CccchhhcccceeEEEeeee-ecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhc----c------- Confidence 77777777777777777775 67899998665543 245566666666677778888877632110 0 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) + .. ....+++.+.|.++++++=+++..++.++|||.+...+-+. ... +..+... T Consensus 132 -------~-~~------------~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~---~~~---~f~~~s~ 185 (274) T protein:vir:95 132 -------K-LT------------VEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD---ATT---NFTRATE 185 (274) T ss_pred -------c-cc------------ccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh---ccc---ccccccc Confidence 0 00 01235789999999999988888889999999987665432 100 0000000 Q ss_pred CCceEEEEEEEE-EEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEEE--EEEeEEEecc Q lcl|NC_019402. 230 GQDTRLNVYVSS-IVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMIE--MEVGLRHRNP 306 (318) Q Consensus 230 ~~~~~~g~~v~~-~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~--~E~tLe~~N~ 306 (318) .... ....- |-+=.| +.|+.+..+|.++.+++-...+.+..-+++..|.. ..-+...-.|+ -=|+..+.|| T Consensus 186 ~g~~---~~~~G~ig~~~G--~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~-Rd~~~~~d~i~~~~~y~~~~~~~ 259 (274) T protein:vir:95 186 LGDD---VIVKGAFGEALG--AVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETD-RDPSTKTTALYSDKHYVAYLYDE 259 (274) T ss_pred cccc---ceeccccceecC--eEEEEeCCCCCceEEEEeccceeeeecCCcccccc-cccccccCEEEEeEEEEEEEEcC Confidence 0001 11111 111235 68999999999999999999998866566554322 12222222222 2389999999 Q ss_pred cceeEEEEeccC Q lcl|NC_019402. 
307 YASGILEVKAGA 318 (318) Q Consensus 307 ~a~g~i~~lt~a 318 (318) .+..+++--+.| T Consensus 260 ~~~v~~tk~~~~ 271 (274) T protein:vir:95 260 SKAVKITKGSGS 271 (274) T ss_pred CcEEEEEcCCcc Confidence 999999988888 No 73 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=97.87 E-value=1.4e-05 Score=47.22 Aligned_cols=257 Identities=13% Similarity=0.055 Sum_probs=147.3 Q ss_pred CCceeeeeeeeeccc-ceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-FANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|.-..-..|. +++.+. .+...+.- +-|+.--+.+.+.|. .+..+.+ --|| T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~---l~g~~G~tv~iP~~~--~ig~a~~---------~~~g 66 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNT---LVGQPGDTLTFPAFI--YSGDAKV---------VAEG 66 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccc---ccCCCCCEEEeeeec--CCCcccc---------ccCC Confidence 998666554444443 222221 22112221 112111122345664 2332222 2367 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....-.+.|. .+.|++++-+.... .++-++.-......-+.+.++..++.--.. 
+ T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~----a------- 131 (274) T protein:vir:96 67 EKIPTDILETKKREAKIRKI-AKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALKS----A------- 131 (274) T ss_pred CccchhhcccceeEEEeeee-ecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhc----c------- Confidence 77777777777777777775 67899998665543 245566666666677778888877632110 0 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) + .. ....+++.+.|.++++++=+++..++.++|||.+...+-+. ... +..+... T Consensus 132 -------~-~~------------~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~---~~~---~f~~~s~ 185 (274) T protein:vir:96 132 -------K-LT------------VEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD---ATT---NFTRATE 185 (274) T ss_pred -------c-cc------------ccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh---ccc---ccccccc Confidence 0 00 01235789999999999988888889999999987665432 100 0000000 Q ss_pred CCceEEEEEEEE-EEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEEE--EEEeEEEecc Q lcl|NC_019402. 230 GQDTRLNVYVSS-IVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMIE--MEVGLRHRNP 306 (318) Q Consensus 230 ~~~~~~g~~v~~-~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~--~E~tLe~~N~ 306 (318) .... ....- |-+=.| +.|+.+..+|.++.+++-...+.+..-+++..|.. ..-+...-.|+ -=|+..+.|| T Consensus 186 ~g~~---~~~~G~ig~~~G--~~Vi~s~~~~~~t~~l~~~gA~~~~~~~~~~vE~~-Rd~~~~~d~i~~~~~y~~~~~~~ 259 (274) T protein:vir:96 186 LGDD---VIVKGAFGEALG--AVIVRSNKLEAGTAILAKKGAVKLITKRDFFLETD-RDPSTKTTALYSDKHYVAYLYDE 259 (274) T ss_pred cccc---ceeccccceecC--eEEEEeCCCCCceEEEEeccceeeeecCCcccccc-cccccccCEEEEeEEEEEEEEcC Confidence 0001 11111 111235 68999999999999999999998866566554322 12222222222 2389999999 Q ss_pred cceeEEEEeccC Q lcl|NC_019402. 
307 YASGILEVKAGA 318 (318) Q Consensus 307 ~a~g~i~~lt~a 318 (318) .+..+++--+.| T Consensus 260 ~~~v~~tk~~~~ 271 (274) T protein:vir:96 260 SKAVKITKGSGS 271 (274) T ss_pred CcEEEEEcCCcc Confidence 999999988888 No 74 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=97.82 E-value=1.7e-05 Score=46.80 Aligned_cols=256 Identities=12% Similarity=0.067 Sum_probs=146.7 Q ss_pred CCceeeeeeeeeccc-ceee----------eEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-FANW----------ISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-l~d~----------I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||.-+|....-..|. +++. ++.+-..++-+- --++.++ +.+.|. .+..+.+ --|| T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~-g~~G~tv--~iP~~~--~ig~a~~---------~~~g 66 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTL--TFPAFV--YSGDAQV---------VAEG 66 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceeccccc-CCCCCEE--EEeeec--CCCcccc---------ccCC Confidence 887555444333332 2111 122333333221 1112233 344564 2222222 2266 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|. .+.|+|++-+.... .+|-+.........-+.+.++..++.--. ++. 
T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~---~~d~~~~~~~q~~~~~a~~vd~~~l~~~~---~a~------- 132 (274) T protein:vir:12 67 EKIPTDILETKKREAKIRKI-AKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALM---GAK------- 132 (274) T ss_pred CccchhhcccceeeEEeeee-cceeeecHHHHHhc---ccchHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------- Confidence 66666666666655556674 67899998765543 24666666666667788888887773211 000 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) ......+++.+.|.++++++=+++...+.++|||.+...+-+.... . ..+... T Consensus 133 ---------------------~~~~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~---~---fv~~s~ 185 (274) T protein:vir:12 133 ---------------------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDAST---N---FTRATE 185 (274) T ss_pred ---------------------ccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhh---h---cccccc Confidence 0012245889999999999988888889999999987655332100 0 000000 Q ss_pred CCceEEEEEEE-EEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCc-cceeeEEEEEEeEEEec Q lcl|NC_019402. 230 GQDTRLNVYVS-SIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDG-SYEKWMIEMEVGLRHRN 305 (318) Q Consensus 230 ~~~~~~g~~v~-~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktG-d~~k~~i~~E~tLe~~N 305 (318) ... +.... .|-+-.| +.|+.++.||.++.+++-...+.+..-+++..|.. ++.+ |.-. ...=|+..+.| T Consensus 186 ~g~---~~~~~G~ig~~~G--~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~--~~~~y~~~~~~ 258 (274) T protein:vir:12 186 LGD---DIIVKGAFGEALG--AIIVRSNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTKTTALY--SDKHYVAYLYD 258 (274) T ss_pred ccc---cceecccceeecC--eeEEEeCCCCcceEEEEeccceeeeecCCceeccccchhhcccEEE--eeeEEEEEEEc Confidence 000 11111 1222246 68999999999999999999988866566554322 2222 2222 22238999999 Q ss_pred ccceeEEEEeccC Q lcl|NC_019402. 
306 PYASGILEVKAGA 318 (318) Q Consensus 306 ~~a~g~i~~lt~a 318 (318) |.+..+|+--++| T Consensus 259 ~~~vv~~t~~~~~ 271 (274) T protein:vir:12 259 ESKAVKITKGSGS 271 (274) T ss_pred CCceEEEEcCCcc Confidence 9999999999888 No 75 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.79 E-value=4e-06 Score=50.20 Aligned_cols=276 Identities=12% Similarity=0.015 Sum_probs=140.3 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeecccccc--ceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAIN--QTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~--~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ||.-++ ......-+.+.++|...-....|+..+.-.-... ...+-.++.... + .-.-||...+.... T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~-a---------~wv~Eg~~~~~s~~ 70 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPR-A---------KIVGEGEVKPSASV 70 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcc-e---------EEeeCCcccccccc Confidence 996444 3345566677787777777777776643222221 112222221111 1 11337766555432 Q ss_pred cCcEEecceE---EEEeeeeeehhHHHHhhhcC--ccchHHHHH-HHHHHHHHHHHHHHHhcCccccCCCCCccchhhhH Q lcl|NC_019402. 78 ASTTVINNVT---QILRKVVKVSDTANVLANYG--RGKELQYQM-EKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGF 151 (318) Q Consensus 78 ~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G--~~~e~a~q~-~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi 151 (318) . ++.++ .-+.--+.|| .+.+..-. .-+++...+ +.-...+.+-+|.++++|.-.. . 
.....|+ T Consensus 71 ~----f~~v~l~~~kl~~~~~iS--~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~--~---~~~~~~~ 139 (315) T protein:vir:80 71 D----VSAFTAQPIKVVTQQRVS--DEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPA--T---GKAASAV 139 (315) T ss_pred c----eeeeEeeeeeEEeeehhh--HHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCC--C---Ccccccc Confidence 2 22111 1122223333 22221111 112233333 3445567888999999884211 1 1224444 Q ss_pred HHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcc-cccceEEEec Q lcl|NC_019402. 152 SALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGV-TNGQRMKMFD 229 (318) Q Consensus 152 ~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~-~~~~r~~~~~ 229 (318) ...+.... ........+.++|.+++.++=.++. ..+-+++||.....+.++.-.... ..++. .++ T Consensus 140 ~~~~~~~~-----------~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~--~~~ 206 (315) T protein:vir:80 140 HTSLNKTK-----------NIVDATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQP--MYP 206 (315) T ss_pred cccccccc-----------ceeeccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccc--ccc Confidence 43332210 0111122345677777766644433 334578899998888776421110 01111 111 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCc---------eEEEEehhhcceeecCcccceecCC---C--------cc Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN---------AVYFFTPSDWTQMVLRAPERTKLAK---D--------GS 289 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~---------~~~~~D~~~~~~~~Lr~~~~e~lak---t--------Gd 289 (318) . -..|. .-+=+| +.++.+.+||++ .+|+.|++++.+.+.+.+..+-+.. + -| T Consensus 207 ~--~~~g~----~~tl~G--~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~ 278 (315) T protein:vir:80 207 A--AGFAG----LDNWRG--LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHN 278 (315) T ss_pred c--cccCC----Cceecc--eeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcC Confidence 0 00111 112357 588888999864 3778899998887777665543311 1 12 Q ss_pred ceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
290 YEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 290 ~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ....+....++..+++|.|..+|++.++. T Consensus 279 ~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 279 EVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred cEEEEEEEEecceeecccceEEEeeccCC Confidence 34445566799999999999999999888 No 76 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=97.78 E-value=3.9e-06 Score=50.22 Aligned_cols=280 Identities=15% Similarity=0.101 Sum_probs=142.9 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |..-++.+ -...-+++.+.|...-...+|+.++....+..+.... |.......+ .-.-||...+..... T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a---------~wv~E~~~~~~~~~~ 200 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTS---------GWVGEASQRPQTNAA 200 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcce---------eeecccccccccccc Confidence 33222211 1133466666676666677888887655444333222 222211111 112366654432211 Q ss_pred CcEEecceEEEEee----eeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHH Q lcl|NC_019402. 79 STTVINNVTQILRK----VVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSAL 154 (318) Q Consensus 79 ~~~~~~N~tQIf~~----~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~ 154 (318) .+.||--+ ..-|.=|.+.+.... -+...+=...-...+.+-+|.+||+|. |++ +-.||+.. 
T Consensus 201 ------~f~~v~~~~~k~~~~i~iS~ell~ds~-~~l~~~i~~~la~ai~~~~d~~~l~G~----G~~----~p~Gil~~ 265 (425) T protein:vir:10 201 ------TFQPLSFASGEIYANPAATQQILDDAE-IDLESWLATEVQTEFAKQEGKAFLAGD----GTN----KPNGLLTY 265 (425) T ss_pred ------ccceeeeeheeeEeehHhHHHHHhcch-hHHHHHHHHHHHHHHHHHHHhhhhccc----CCC----Ccceeeec Confidence 12222222 222333444444322 133344444555667788999999884 322 34467655 Q ss_pred HhcCCc-ccCcccc-ceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 155 VAAKDA-ADPDTGA-IVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 155 i~~~~~-~~~~~g~-~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) +..... .....+. ......+...++.++|.++..++-........+++|+.....+..+. +.+ .++...+. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lk-D~~----G~~l~~~~-- 338 (425) T protein:vir:10 266 IAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLK-DGQ----GNYLWQPS-- 338 (425) T ss_pred cccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhh-cCC----CceeeccC-- Confidence 432211 1111111 12334566778889999988877655444456889998888877664 322 23322221 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhhcceeecC-ccc--ceecCCCccceeeEEEEEEeEEEe Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSDWTQMVLR-APE--RTKLAKDGSYEKWMIEMEVGLRHR 304 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~~~~~~Lr-~~~--~e~laktGd~~k~~i~~E~tLe~~ 304 (318) -.-|. --+=|| ++|+.+.+||. ..+++.|++..-+.+-| .+. .++... -+...+..+.-+...+. T Consensus 339 ~~~g~----~~~l~G--~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~-~~~~~~~~~~r~d~~v~ 411 (425) T protein:vir:10 339 YVAGQ----PATLAG--YPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTA-KPYVLFYTTKRVGGGLL 411 (425) T ss_pred ccCCC----Cceecc--eeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEeccccc-CCcEEEEEEEEeccEee Confidence 11111 112367 57888899984 44788898753111112 111 233332 34455566666999999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|+|..++...+.= T Consensus 412 ~~~A~~~l~~~as~ 425 (425) T protein:vir:10 412 NPEPMRAMKVAASE 425 (425) T ss_pred cccceEEEEeeccC Confidence 99999776654433 No 77 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.75 E-value=1.6e-05 Score=46.91 Aligned_cols=291 Identities=12% Similarity=-0.029 Sum_probs=139.1 Q ss_pred CCceeee-------eee---------eecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccC-Cccccccc Q lcl|NC_019402. 1 MATLVSY-------DLN---------GKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVA-DPSDAQKR 63 (318) Q Consensus 1 Ma~~~t~-------~~~---------~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~-~~~~~~~~ 63 (318) ||++... ... ..-+.+.+.|...-..+.|+.++.-....++-...|....-...+ .... -.. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e-g~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGV-GTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecC-ccc Confidence 4432211 111 122334455555555666777766555444443444443322111 1000 000 Q ss_pred cceeeccccccccccCcEEecceEEEEee-eeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCC Q lcl|NC_019402. 64 NAVIEGSAAVDGERASTTVINNVTQILRK-VVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA 142 (318) Q Consensus 64 na~~EG~da~~~~~~~~~~~~N~tQIf~~-~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~ 142 (318) ....||...+.. ...+.+++=-.+| ...+.=|.+.+.... .+..+|=...-.+.+.+-+|.++|+|.-.. 
T Consensus 80 ~~~~e~~~~~~~----~~~f~~i~l~~~kl~~~~~is~ell~~s~-~~~~~~i~~~la~ai~~~~d~~~l~G~g~~---- 150 (333) T protein:vir:78 80 NEQREGGLKPLS----GTAWDTRSVSPIKLATIVTVSEEFARMNP-SGLYTKLQGDLAYAIGRGIDLAVFHGKSPL---- 150 (333) T ss_pred cccccccccccc----ccceeEEEEeeEEEEEeehhhHHHHhcCH-HHHHHHHHHHHHHHHHHHHHHHHhcccCCC---- Confidence 011122222211 1122222221222 122222333333221 233444445556778899999999875322 Q ss_pred CccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCC-CCcCEEEEcchHHhhhhhhhhhhcccc Q lcl|NC_019402. 143 TVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSG-SEANIIMFHPKHAAFFSSLMETSGVTN 221 (318) Q Consensus 143 t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G-~~~~~l~~~~~~k~~is~~~~~~~~~~ 221 (318) ++-...|+.... ... ........++...++.++|.+++..+-.++ .+++.++++|.....+.++..-.+ . T Consensus 151 -~~~~~~g~~~~~---~~~---~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d--~ 221 (333) T protein:vir:78 151 -TGSALQGIDTDN---VIA---NTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRD--A 221 (333) T ss_pred -CCcccccccccc---ccc---ccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcC--C Confidence 122234444211 000 001112334555677888888887775543 355578889987666654422110 0 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCc---------eEEEEehhhcceeecCcccceec--CC---- Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN---------AVYFFTPSDWTQMVLRAPERTKL--AK---- 286 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~---------~~~~~D~~~~~~~~Lr~~~~e~l--ak---- 286 (318) +.++. +.. ....+. . -+-+| +.++.+.+||.+ .+++.|++.+-+..-.++..+-. +. 
T Consensus 222 ~G~~i-~~~-~~~~~~-~---~~l~G--~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~ 293 (333) T protein:vir:78 222 NGNVD-PSR-INLAAQ-T---GDVLG--LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDS 293 (333) T ss_pred CCcee-ecC-ccccCC-C---ceeec--eeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEecccccccc Confidence 11211 111 010011 1 23367 588999999865 48899999876654444432211 11 Q ss_pred --------CccceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 287 --------DGSYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 287 --------tGd~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) .-|....+.+..+.+.+++|+|..+|+..++= T Consensus 294 ~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 294 GSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred ccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 11233455667789999999999999877777 No 78 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.70 E-value=2e-05 Score=46.33 Aligned_cols=261 Identities=12% Similarity=0.042 Sum_probs=137.8 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceE-----EEeeeeeccccCCccccccccceeecccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTL-----FQWQTDALAPVADPSDAQKRNAVIEGSAAVD 74 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~-----~~W~td~l~~~~~~~~~~~~na~~EG~da~~ 74 (318) |++.++.+ -...-+.+.+.|...-....|+.+++.....+... ..|.+... . ..-.-||.+.+. 
T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~---------a~~v~E~~~~~~ 178 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITG-L---------ANIDDEAGKIAD 178 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCc-c---------eeeecCcccccc Confidence 55444333 22234567788888888889998887665443322 22222111 0 111336666553 Q ss_pred ccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHH Q lcl|NC_019402. 75 GERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSAL 154 (318) Q Consensus 75 ~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~ 154 (318) ...-.-..+.=-.+-+...+.||... +.... -+...|=...-...+.+-+|.++|+|. |.++. T Consensus 179 ~~~~~~~~i~~~~~k~~~~~~iS~el--l~ds~-~~l~~~i~~~l~~~~~~~~d~ai~~G~----g~~~~---------- 241 (397) T protein:vir:49 179 VDDPKLSLIKYTIKRYAGISTVTNSL--LADSA-ENILAWLSGWIAKKVVVTRNKAILEAI----AALPT---------- 241 (397) T ss_pred ccccceeeEEeeeeeEEeeehhHHHH--HhhhH-HHHHHHHHHHHHHHHHHHHHHHHHhhc----ccccc---------- Confidence 21111111111122223344444433 32221 133334444455666788999999873 21111 Q ss_pred HhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceE Q lcl|NC_019402. 155 VAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTR 234 (318) Q Consensus 155 i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~ 234 (318) .+...+.+.|.+++.++-.+......+++|+.....+..+. +.+ .++.....-.. 
T Consensus 242 -------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lk-d~~----G~~l~~~~~~~- 296 (397) T protein:vir:49 242 -------------------KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVK-NAL----GDYLMERDVKS- 296 (397) T ss_pred -------------------ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-cCC----CceeeccCcCC- Confidence 01124667788888888776666678899999988887764 322 23322211000 Q ss_pred EEEEEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEeEEE Q lcl|NC_019402. 235 LNVYVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVGLRH 303 (318) Q Consensus 235 ~g~~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~tLe~ 303 (318) +. --+=+|..|.++.+.++|.+ .+++.|++. +.+..-.+...+--.-++ +......+..+...+ T Consensus 297 -~~----~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:49 297 -PT----GYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVA 371 (397) T ss_pred -CC----CceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEE Confidence 10 01225744444557777753 377788885 333333333333222222 334455666789999 Q ss_pred ecccceeEEEEeccC Q lcl|NC_019402. 304 RNPYASGILEVKAGA 318 (318) Q Consensus 304 ~N~~a~g~i~~lt~a 318 (318) .+|.|..+++..+++ T Consensus 372 ~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 372 TDTEAFVPASFKAIA 386 (397) T ss_pred ecccceEEEEeeccc Confidence 999999999988877 No 79 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=97.69 E-value=7.2e-06 Score=48.77 Aligned_cols=280 Identities=14% Similarity=0.097 Sum_probs=147.8 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |.+-+..+ -...-+.+.++|...-....|+..+......++..+.|....-... ..-.-||...+... T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--------a~wv~E~~~~~~~~--- 175 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTA--------SGWVGETDTRSQTA--- 175 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcc--------ceeeccccccCccc--- Confidence 43322211 2233457777777777788888887666666565555655432211 11133665443221 Q ss_pred cEEecceEEEEeeeeee----hhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 80 TTVINNVTQILRKVVKV----SDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~V----S~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) ...+.||.-..-.+ .=|.+.+.... -+...|=..+-...+.+-+|.+||+|. |+ .+..||+... T Consensus 176 ---~~~~~~v~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~la~ai~~~~~~~~l~G~----G~----~~p~Gil~~~ 243 (401) T protein:vir:44 176 ---TSRLGLIEPFMGEIYGNPQATQKMLDDAF-FNVEAWINSELATEFAEQEEIAFTTGD----GT----KKPKGFLAYE 243 (401) T ss_pred ---cccceeeeeehhheeeehhhhHHHHhcch-HHHHHHHHHHHHHHHHHHHHhhhhccC----CC----Cccceeeccc Confidence 12344443322222 22333333222 122233333334557788999999873 22 2345776543 Q ss_pred hcCC--cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCce Q lcl|NC_019402. 156 AAKD--AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 156 ~~~~--~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~ 233 (318) .... ...+.........++...++.++|.+++..+-.+......++|++.....+..+. 
+.+ .|+-..+.- T Consensus 244 ~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~-- 316 (401) T protein:vir:44 244 STEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLK-DTE----GNYLWRPGL-- 316 (401) T ss_pred cccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhh-ccC----CceeecCCc-- Confidence 3211 1111111122334566778899998888776544333446789998877777653 322 233222211 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhh-cceeecCcc--cceecCCCccceeeEEEEEEeEEEec Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSD-WTQMVLRAP--ERTKLAKDGSYEKWMIEMEVGLRHRN 305 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~-~~~~~Lr~~--~~e~laktGd~~k~~i~~E~tLe~~N 305 (318) .-|. -.+=+| ++|+.+++||. ..+++.|++. +.+.--..+ ..+++.. -+...++.+..++..+.+ T Consensus 317 ~~g~----~~~l~G--~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~-~~~v~~~a~~r~d~~~~~ 389 (401) T protein:vir:44 317 ELGQ----PSSLAG--YGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTN-KPFVGFYTTKRTGGMLVD 389 (401) T ss_pred CCCC----Cceecc--eeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeecccc-CCcEEEEEEEEeccEEec Confidence 1111 112367 57888899884 2267789865 222111111 2234433 355566666779999999 Q ss_pred ccceeEEEEecc Q lcl|NC_019402. 306 PYASGILEVKAG 317 (318) Q Consensus 306 ~~a~g~i~~lt~ 317 (318) +.|..+|...++ T Consensus 390 ~~a~~~l~~~aa 401 (401) T protein:vir:44 390 SQAIKLLKIAAA 401 (401) T ss_pred ccceEEEEeecC Confidence 999999998888 No 80 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.67 E-value=1.9e-05 Score=46.42 Aligned_cols=273 Identities=11% Similarity=0.071 Sum_probs=133.6 Q ss_pred CCceeeeeee---eecccceeeeEecCCcccceeeeecc-ccccceEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 
1 MATLVSYDLN---GKKLSFANWISNLSPTDTPFVSMTGK-EAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~~~---~~~~dl~d~I~~i~p~~TP~~s~i~~-~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) +++-+++... ..-+++.+.|...-...+|+..+... -+..+-...|...+-... ..-.-||...+... T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~--------a~~v~E~~~~~~~~ 201 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAI--------VGYIGADTDIPTTQ 201 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcc--------eeeeccCccccccc Confidence 2222222221 12344445454444455666554211 122222233333221110 01133666655432 Q ss_pred ccCcEEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHH-HHHHHHHHHHHHHHhcCccccCCCCCccchhhhHH Q lcl|NC_019402. 77 RASTTVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQME-KAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFS 152 (318) Q Consensus 77 ~~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~-k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~ 152 (318) .. +.+++ .-+...+.|| .+.+.-.+..-.+...+. .-...+.+-+|.+|++|. |.+. ...||. T Consensus 202 ~~----f~~i~~~~~k~~~~~~iS--~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~----G~~~---~p~Gi~ 268 (435) T protein:vir:14 202 QQ----FDDLKLTAKKMAALVPIA--NDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDD----GTAN---TPKGLR 268 (435) T ss_pred cc----eeEEEeeeEEEEEeehhh--HHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccC----CCCc---ccccee Confidence 11 12111 1122233344 333443332212333333 445568888999999873 3332 345666 Q ss_pred HHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCC--cCEEEEcchHHhhhhhhhhhhcccccceEEEecC Q lcl|NC_019402. 153 ALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSE--ANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDG 230 (318) Q Consensus 153 ~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~--~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~ 230 (318) .+....... ...+++.......++.+++..+..+... ...+++|+.....+..+. +.+ .++..... 
T Consensus 269 ~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~ 336 (435) T protein:vir:14 269 FWALPSNVI-------TASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLR-DGN----GNKVYPEL 336 (435) T ss_pred eccccccee-------ccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhh-ccC----CceeccCC Confidence 443222111 1122333444556788888887765443 346788998877775553 322 22222111 Q ss_pred CceEEEEEEEEEEcCCCcEEEEEecCCCCCc--------eEEEEehhhcceeecCcccceec--C--C----------Cc Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN--------AVYFFTPSDWTQMVLRAPERTKL--A--K----------DG 288 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~--------~~~~~D~~~~~~~~Lr~~~~e~l--a--k----------tG 288 (318) .. | +-+| +.++.+.+||.+ .+++.|++.+-+..-.++..+-. + + .- T Consensus 337 ~~---g-------~l~G--~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~ 404 (435) T protein:vir:14 337 AN---G-------MLKG--YPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQR 404 (435) T ss_pred CC---C-------eeec--ceeEeeccccccccCCCccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhc Confidence 11 1 2357 578888888763 58889988754322122222111 1 1 11 Q ss_pred cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
289 SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 289 d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) |....+.+.-+.+.+.+|+|..++++++.- T Consensus 405 ~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 405 DQTLIRVIAKNDFGPRHVESIAVLAGVAWG 434 (435) T ss_pred ChhheeeeeeeCceeecccceEEEecCCCC Confidence 445666777789999999999999998876 No 81 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=97.63 E-value=9.6e-06 Score=48.09 Aligned_cols=281 Identities=11% Similarity=-0.013 Sum_probs=129.4 Q ss_pred CCceeeee--eeeecccceeeeEecCCcccceeeeecccc-ccceEEEeeeeeccccCCccccccccceeecccccccc- Q lcl|NC_019402. 1 MATLVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKEA-INQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE- 76 (318) Q Consensus 1 Ma~~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~-~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~- 76 (318) -.++++.+ --.-.++..+.+...--+.+||++.+---+ .++..++.. .+..... ........|...+..+ T Consensus 16 ~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~--~~g~~~~----~~~g~~~~~~~~~~~~~ 89 (315) T protein:vir:41 16 VPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDIS--RLSLVLD----VGPGRDETGQKLAPPES 89 (315) T ss_pred hhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeecccccccccc--ccccCcc----cccccccccCcCCCCCC Confidence 11111111 112234444555555556778887754311 112112211 1100000 0000111121111111 Q ss_pred -ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 77 -RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 77 -~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) .+.....-+. .-+.--+.||.-.--=+..|. +.-++=..+-.+++.+++|.+|++|.-.. ..+...+..|++... 
T Consensus 90 ~~~f~~~~l~~-~~l~~~~~it~elL~D~~~~~-~~e~~l~~~~a~~~a~~~~~~~~nGdg~s--~~p~~~~~~G~l~~a 165 (315) T protein:vir:41 90 TAEVKTNTLYM-REMVTKVVIHEDAIEDNIEGK-AFEQKIVTLLGEGISYVLEKYYLHGDTSS--SDPLLRMSDGWLKLA 165 (315) T ss_pred ccccceeeece-eeeeeeccccHHHHHhhhccc-cHHHHHHHHHHHHHHHHHHHHhhccCCcC--cCccccccccceecc Confidence 1111111111 122222455544332222232 33344444667789999999999983210 111224556776544 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHH---HhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNL---YLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i---~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ...- . ....+++...++.+.|.+++..+ |-+.+..-.++++......+-++..... .+- ... T Consensus 166 ~~~~-~------~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g-----~~l---w~~ 230 (315) T protein:vir:41 166 SEKL-T------ESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRE-----TGL---GDQ 230 (315) T ss_pred cccc-c------ccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCC-----Ccc---ccc Confidence 3211 0 01223455667888888888876 5444433356778877777777653321 110 000 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCC-----CCceEEEEehhhcceeecCcccce--ecCCCccceeeEEEE----EEeE Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWM-----PENAVYFFTPSDWTQMVLRAPERT--KLAKDGSYEKWMIEM----EVGL 301 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m-----~~~~~~~~D~~~~~~~~Lr~~~~e--~laktGd~~k~~i~~----E~tL 301 (318) -..+.... +=+| +.|+....| |+..+++.|++.+-++.-|.+..+ ..++. .+.+++. -+.+ T Consensus 231 ~~~~g~~~---tl~G--~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~---~~~~~~~~~r~d~~~ 302 (315) T protein:vir:41 231 ALTGANSI---LYDG--RPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEM---RLTKYVASLRTDNHY 302 (315) T ss_pred hhhcCCCc---eecc--cceEecccccccCCCCccEEEecccceEEEeccccEEEeeecCCC---CceEEEEEEEeceeE Confidence 00111111 2235 344444444 678899999998776666666543 33332 2233333 3455 Q ss_pred EEecccceeEEEE Q lcl|NC_019402. 
302 RHRNPYASGILEV 314 (318) Q Consensus 302 e~~N~~a~g~i~~ 314 (318) .+.|-.+.+++.. T Consensus 303 ~~~~~~a~~~~~v 315 (315) T protein:vir:41 303 EDEEGAVSATITV 315 (315) T ss_pred EeccceeEeeeeC Confidence 6778777888888 No 82 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=97.61 E-value=8.6e-06 Score=48.35 Aligned_cols=270 Identities=11% Similarity=-0.006 Sum_probs=131.4 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceE--EEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTL--FQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~--~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) +++.++.+ -...-+++.+.|...-....|+++++...+.++.. +-+.+..-. ..-.-||+..+.... T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~----------a~~~~E~~~~~~~~~ 153 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVAT----------AWWGPLCAEIKEVLD 153 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcc----------eeeeccccccCcccc Confidence 33222222 22334666777777767777888876655443322 222221111 111225544332211 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ..-..+.=-..-+.--+.||.-...-... +..+|=...-...+.+-+|.+||+|. |++ .| -||+.-+.. 
T Consensus 154 ~~f~~i~l~~~k~~~~i~iS~ell~ds~~---~l~~~i~~~la~~i~~~~~~a~l~G~----G~~-~P---~Gil~~~~~ 222 (390) T protein:vir:40 154 NGFDKIQTGMYKLSAYIPVCNAMLDLGPS---WLDQYVRTILGEAMALGLEAGIVNGS----GKD-QP---IGMMRDLNN 222 (390) T ss_pred ccceeeEeeeeeEEEeehhhHHHHhcchH---HHHHHHHHHHHHHHHHHHHhhhhccc----CCC-cc---ceeeecccc Confidence 01011110111122223444333322222 34455555666778899999999874 322 22 255532211 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHH---HhCCC----CcCEEEEcchHH-hhhhhh--hhhhcccccceEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNL---YLSGS----EANIIMFHPKHA-AFFSSL--METSGVTNGQRMKM 227 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i---~~~G~----~~~~l~~~~~~k-~~is~~--~~~~~~~~~~r~~~ 227 (318) . . . ......+...++..+..+++..+ |...+ ....++||+... ..+-.+ ..+.+ .++ T Consensus 223 ~--~-~----~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~----G~~-- 289 (390) T protein:vir:40 223 V--T-A----GEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQ----GVW-- 289 (390) T ss_pred c--c-c----cccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCC----Ccc-- Confidence 1 0 0 01112233445655554444333 32222 123467887542 111111 11111 111 Q ss_pred ecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCC---CccceeeEEEEEEeEEEe Q lcl|NC_019402. 228 FDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAK---DGSYEKWMIEMEVGLRHR 304 (318) Q Consensus 228 ~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~lak---tGd~~k~~i~~E~tLe~~ 304 (318) +. =...|| +.++.+.+||++++++.|++.+-+.--..+..+...- .-|....+.+.-+...++ T Consensus 290 -----------v~-~~~~~g--~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~ 355 (390) T protein:vir:40 290 -----------VT-GILPVP--LEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPK 355 (390) T ss_pred -----------cc-ccCCCc--eeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEe Confidence 10 113478 6899999999999999999986543223443333221 225556667777899999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) ++.|..++...+.+ T Consensus 356 ~~~A~~~l~~~~~~ 369 (390) T protein:vir:40 356 DNSSFLVFDITGLE 369 (390) T ss_pred cccceEEEEeeccC Confidence 99999988877775 No 83 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.61 E-value=9.3e-06 Score=48.16 Aligned_cols=275 Identities=12% Similarity=0.064 Sum_probs=134.7 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) ++.-++.. -...-+.+.+.|+..=....|++.++-....++ .+.|....-.. ...-..||.+.+...... T Consensus 138 ~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~~~--------~a~~v~E~~~~~~~~~~~ 208 (425) T protein:vir:95 138 RNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTDTS--------PATWIEQSGALPTGDVGT 208 (425) T ss_pred HhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecCCc--------cccccccccccccccccc Confidence 22211111 112234567777666566778887765444332 23343322111 112234666544332111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHH-HHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKA-GKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~-~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) -..+.=-..-+...+.||.- .+..... + +...+... ...+.+-+|.++|+|. |+++ .+--||+.-+... 
T Consensus 209 f~~i~l~~~k~~~~~~iS~e--ll~ds~~-~-l~~~i~~~l~~~i~~~~d~~il~G~----G~~~--~~p~Gil~~~~~~ 278 (425) T protein:vir:95 209 IASIDFDGFKVGKVTFVDNY--LLQDSII-N-LDDYVTKKIARAIAKALDLAIVKGT----GAAN--KQPLGIIPSLPPE 278 (425) T ss_pred cceeeeeheeeeeeehhhHH--HHhccHH-H-HHHHHHHHHHHHHHHHHHHHhhccC----CCCc--cccceeecccccc Confidence 00110001113334444443 3332221 2 33334433 4448899999999884 3221 1223665433322 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCc-C-EEEEcc-hHHhhhhhh--hhhhcccccceEEEecCCce Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEA-N-IIMFHP-KHAAFFSSL--METSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~-~-~l~~~~-~~k~~is~~--~~~~~~~~~~r~~~~~~~~~ 233 (318) .. ....+.+.+.+++.++...+-.+.... + .+++++ .....+..+ .++.+ .++........ T Consensus 279 ~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~----g~~i~~~~~~~ 344 (425) T protein:vir:95 279 NQ----------VTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSN----GNVVGKLPNLR 344 (425) T ss_pred cc----------cccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCC----CceeeccCCCC Confidence 11 122344567777888776655432221 2 234454 333333332 22222 23322111110 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec---CCCccceeeEEEEEEeEEEeccccee Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL---AKDGSYEKWMIEMEVGLRHRNPYASG 310 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l---aktGd~~k~~i~~E~tLe~~N~~a~g 310 (318) --+=|| +.++.+.+||++.+++.|++++-+..-.++..+.. .-+-|....+.+..+...+++|.|.. T Consensus 345 --------~~~l~G--~pvv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~ 414 (425) T protein:vir:95 345 --------TPDLLG--LRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFV 414 (425) T ss_pred --------Cccccc--eeeEEcCcCCCccEEEEecccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceE Confidence 112368 58999999999999999999855444333332221 12235666677777899999999999 Q ss_pred EEEEeccC Q lcl|NC_019402. 
311 ILEVKAGA 318 (318) Q Consensus 311 ~i~~lt~a 318 (318) +++..+-. T Consensus 415 ~~~i~~~~ 422 (425) T protein:vir:95 415 LVTITDPV 422 (425) T ss_pred EEEecCcC Confidence 99877733 No 84 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.60 E-value=1.8e-05 Score=46.56 Aligned_cols=262 Identities=11% Similarity=0.006 Sum_probs=140.1 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccce--EEEeeee-e-ccccCCccccccccceeeccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT--LFQWQTD-A-LAPVADPSDAQKRNAVIEGSAAVDG 75 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~--~~~W~td-~-l~~~~~~~~~~~~na~~EG~da~~~ 75 (318) |...++.+ -...-+.+.+.|...-....|++.+......+.. ...|... . ...+ .-.-||...++. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~ 179 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLA---------KLDDEGGQIGQN 179 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcce---------eeeccccccccc Confidence 44333332 2224456677777777888888776654333221 2223221 1 1111 112367665543 Q ss_pred cccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 76 ERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 76 ~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) ....-..+.=-..-+...+.||.-...-... +...|=...-...+.+-+|.++|+|. 
|+++ + T Consensus 180 ~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~---~l~~~i~~~l~~~~~~~~d~ail~G~----g~~~-~---------- 241 (397) T protein:vir:49 180 DDPKLSLIRYAIKRYAGISTVTNSLLADSAE---NILAWLSGWIAKKVVVTRNKAILEAI----GTLP-N---------- 241 (397) T ss_pred cccceeeeEeeeeeeEeehhhHHHHHhhhhH---HHHHHHHHHHHHHHHHHHHHHHHhcc----cccc-c---------- Confidence 3211122222222233445555443321211 33334444555667788999999773 2211 0 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .+..++.+.|.+++-++-.++.....++|||.....+..+. +.+ .++...+.-.. T Consensus 242 ------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk-d~~----g~~l~~~~~~~-- 296 (397) T protein:vir:49 242 ------------------KPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVK-NAM----GDYLMERDVKS-- 296 (397) T ss_pred ------------------cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-ccC----CceeecccccC-- Confidence 11235677888888888777777778999999988888774 322 23222111000 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEeEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVGLRHR 304 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~tLe~~ 304 (318) |. -.+=+|..|.++.+..||. ..+++.|++. +.+....++..+...-.+ +.....++.-+...++ T Consensus 297 g~----~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~ 372 (397) T protein:vir:49 297 PT----GYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVST 372 (397) T ss_pred CC----CceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEe Confidence 10 0123674444455666664 3478889874 555444555444333222 2334455666889999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|.|..+++..+++ T Consensus 373 ~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 373 DTEAFVPASFKAIA 386 (397) T ss_pred cccceEEEEecccc Confidence 99999999988877 No 85 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=97.59 E-value=2.6e-05 Score=45.72 Aligned_cols=264 Identities=14% Similarity=0.124 Sum_probs=135.5 Q ss_pred CCc-eeeeeeeeecccceeeeE----------ecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MAT-LVSYDLNGKKLSFANWIS----------NLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~-~~t~~~~~~~~dl~d~I~----------~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ||. .|.-....+-|-+++.+. .+...+..+-. -++.++ +.+.|. .+..+.+ -.|| T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g-~~G~tv--~ip~~~--~~g~a~~---------~~~g 66 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEG-QPGSEI--TVPKYK--YIGDAQD---------VAEG 66 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccC-CCCCEE--EEeeec--cCCccee---------ecCC Confidence 995 344434444444444331 22222222211 112233 344563 2322221 2256 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .+.+....+.....--+.|. .+.|+|++-..... .++.+++-......-+.|.+++.++..-...... 
T Consensus 67 ~~i~~~~lt~~~~~~~i~~~-~~a~~v~D~~~~~~---~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~-------- 134 (278) T protein:vir:80 67 AAIDYSALETESVKHGIKKA-GKGVKLTDESVLSG---YGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE-------- 134 (278) T ss_pred CcCcccccccceeeEeeehh-hccccccHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Confidence 55555444444444444553 45788887654433 2466777777778888999999888432111000 Q ss_pred hHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEe Q lcl|NC_019402. 150 GFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMF 228 (318) Q Consensus 150 Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~ 228 (318) .... .......-..+.|.++..++=.++. ....++|+|.+...+.+...... .+. .. T Consensus 135 -------~~~~----------~t~~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~----~~~-~~ 192 (278) T protein:vir:80 135 -------VKGA----------INIGLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSW----TKA-SQ 192 (278) T ss_pred -------cccc----------cccchhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhc----ccc-cc Confidence 0000 0001111234567777776654443 34468899988766544321100 000 00 Q ss_pred cCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEE--EEEEeEEEecc Q lcl|NC_019402. 229 DGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMI--EMEVGLRHRNP 306 (318) Q Consensus 229 ~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i--~~E~tLe~~N~ 306 (318) .++.-.....+-.| .| +.|+.+.+||.++.+++.+..+.+..-+++..|.. ...++..-.| ..=|++.+.|| T Consensus 193 ~g~~~~~~G~ig~~---~G--~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE~~-Rd~~~~~d~i~~~~~yg~~v~~~ 266 (278) T protein:vir:80 193 LGDDLLVKGAFGEL---LG--WEIVRTKKLADGNALAVKAGALKTFLKRNLLAESG-RDMDHKLTKFNADQHYAVALVDE 266 (278) T ss_pred ccccceeeccceee---cc--eeEEEcCCCCcceEEEEeccceeeeecCCcccccc-cchhhccceeeeeeEEEEEEEcC Confidence 01110001111122 46 68999999999999999999877665566554322 2222222222 33389999999 Q ss_pred cceeEEEEeccC Q lcl|NC_019402. 
307 YASGILEVKAGA 318 (318) Q Consensus 307 ~a~g~i~~lt~a 318 (318) .+..+|+--++- T Consensus 267 ~~~v~it~~a~~ 278 (278) T protein:vir:80 267 TKAVKVVPVAGN 278 (278) T ss_pred cceEEEeeccCC Confidence 999998877777 No 86 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.58 E-value=1.8e-05 Score=46.55 Aligned_cols=281 Identities=11% Similarity=0.067 Sum_probs=130.7 Q ss_pred CCceee--eeeeeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVS--YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t--~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) +.+-++ ......-..+.+.|...--..+|+..+.......+..+. |++.....+.. .-||...+.... T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~---------v~e~~~~~~~~~ 231 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATW---------VAASTYGTDTTT 231 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceee---------cccccccccccc Confidence 111111 112223345556565555567777766555444444333 33322211111 124433332211 Q ss_pred c--CcEEecceE-EEEe--eeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHH Q lcl|NC_019402. 78 A--STTVINNVT-QILR--KVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFS 152 (318) Q Consensus 78 ~--~~~~~~N~t-QIf~--~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~ 152 (318) . ....+..++ -.++ ..+.||.- .+.-.. .+...|=...-...+.+-+|.+||+|. 
|++ +.-||+ T Consensus 232 ~~~~~~~~~~i~~~~~k~~~~v~is~e--ll~ds~-~~~~~~i~~~l~~~i~~~~d~~~l~G~----G~~----~p~Gi~ 300 (458) T protein:vir:10 232 GEEVKGALKEIHFSTYKLAAKSFITDE--TEEDAI-FSLLPLLRKRLIEAHAVSIEEAFMTGD----GSG----KPKGLL 300 (458) T ss_pred cccccccceeeEeeeeeEEeeehhhHH--HHhcch-HHHHHHHHHHHHHHHHHHHHHHhhcCC----CCC----ccceee Confidence 0 001111111 1111 22344433 332221 233344444556667788999999873 332 345666 Q ss_pred HHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec--C Q lcl|NC_019402. 153 ALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD--G 230 (318) Q Consensus 153 ~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~--~ 230 (318) ......... ............++.++|.+++.++-.++.....++||+.....|..+. +.+ .++.... . T Consensus 301 ~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lk-d~~----G~~i~~~~~~ 371 (458) T protein:vir:10 301 TLASEDSAK----VVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLE-DEE----WQDVAQVGND 371 (458) T ss_pred ecccccccc----eeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhc-ccC----Cceeeccccc Confidence 543222110 0112234455678889999998888777666677899998777776553 221 1111110 0 Q ss_pred CceEEEEEEEEEEcCCCcEEEEEecCCCCCc----eEEEEehh-hcceeecCccc--ceecCCCccceeeEEEEEEeEEE Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN----AVYFFTPS-DWTQMVLRAPE--RTKLAKDGSYEKWMIEMEVGLRH 303 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~-~~~~~~Lr~~~--~e~laktGd~~k~~i~~E~tLe~ 303 (318) .....|. -.+=+| +.|+.+.+||++ .+++.|+. ++.+..-..+. 
.++.+.++ ........-+++-+ T Consensus 372 ~~~~~~~----~~~l~G--~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~-~~~~~~~~r~~~~v 444 (458) T protein:vir:10 372 SVKLQGQ----VGRIYG--LPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQ-RDAYYVTQRVNLQR 444 (458) T ss_pred cccccCc----Cceecc--eeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeecccCCC-ceEEEEEEEecceE Confidence 0010111 113367 688999999874 36677774 23222111222 23333322 23333333366888 Q ss_pred ecccceeEEEEeccC Q lcl|NC_019402. 304 RNPYASGILEVKAGA 318 (318) Q Consensus 304 ~N~~a~g~i~~lt~a 318 (318) ..|.|+..++ .++| T Consensus 445 ~~~~a~v~~~-~aa~ 458 (458) T protein:vir:10 445 YFANGVVSGT-YAAS 458 (458) T ss_pred ecccceEEEe-eccC Confidence 9997775533 3334 No 87 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.58 E-value=3e-05 Score=45.37 Aligned_cols=289 Identities=15% Similarity=0.098 Sum_probs=144.5 Q ss_pred CCc-eeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MAT-LVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~-~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |.+ -++......-+++...|+..-....|+..++.....++....|.......+. ..-.-||...+..... T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~-------a~wv~E~~~~~~s~~~- 222 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN-------AAAVAEAGTYPFSSEE- 222 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCc-------ceeeccCccccccccc- Confidence 332 2222344566777777777667778888887766665555666544322110 0123377766553321 Q ss_pred cEEecceEEEEeeeeee-hhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 
80 TTVINNVTQILRKVVKV-SDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~V-S~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) +..++-=.++-..+ .=|.+.+.-. .+.-+|=..+-...+.+-+|.+||+|.- ++ +-.||+...... T Consensus 223 ---f~~i~~~~~k~a~~~~iS~ell~d~--~~l~~~i~~~l~~~i~~~~d~~~l~G~G----~~----~p~Gil~~~~~~ 289 (497) T protein:vir:78 223 ---FARVYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGG----YP----GVNGLLQRSTGF 289 (497) T ss_pred ---ceeeEeeeeeeEeecHhHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHhhcCCC----cc----cccccccccccc Confidence 22222222222111 1222222221 1223333345556788889999998741 11 111222110000 Q ss_pred -------------------Cc-ccCc-------------------------cccceeeccCccccCHHHHHHHHHHHHhC Q lcl|NC_019402. 159 -------------------DA-ADPD-------------------------TGAIVHFETAAAALTEAEIFKVTYNLYLS 193 (318) Q Consensus 159 -------------------~~-~~~~-------------------------~g~~~~~~~t~~~lTe~~l~~~~~~i~~~ 193 (318) .. ++.. .+.. .............+...+..++.. T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) T protein:vir:78 290 TASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV-AGSYPTAAEIAENVFDAFVDIQLT 368 (497) T ss_pred cccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccch-hccccchhhhhhHHHHHHhhhhhh Confidence 00 0000 0000 000011122344455556666655 Q ss_pred CC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcc Q lcl|NC_019402. 194 GS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWT 272 (318) Q Consensus 194 G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~ 272 (318) +. .++.+++||..-..+..+. +.+ .++-..+......+..+..-.+=+| +.++.+..||++++++.|++... 
T Consensus 369 ~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~i~~~~~~~~~~~~~~~~~~l~G--~pV~~t~~~~~~~~~~Gd~~~~~ 441 (497) T protein:vir:78 369 LFQTPNAVVMNPRDWELLRLTK-DAN----GQYMGGNFFGNAYGNPVNGGKNIWG--VPVVTTPLIPLGTILVGHFAPSV 441 (497) T ss_pred cccCCCeEEEchHHHHHHHHhh-cCC----CceeccCcccccccccccCCceeec--eeeEecCCCCCCceEEeecccce Confidence 44 5567889998877776664 322 2222211111112222222224567 68899999999999999988654 Q ss_pred eeec-C-cccceecCCCc-----cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 273 QMVL-R-APERTKLAKDG-----SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 273 ~~~L-r-~~~~e~laktG-----d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) +... | .+..+-....+ +....+.+.++++.|++|.|+.+++..+++ T Consensus 442 ~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 442 IQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 4433 2 12112111222 344555567799999999999999998888 No 88 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.58 E-value=3e-05 Score=45.37 Aligned_cols=289 Identities=15% Similarity=0.098 Sum_probs=144.5 Q ss_pred CCc-eeeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MAT-LVSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~-~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |.+ -++......-+++...|+..-....|+..++.....++....|.......+. ..-.-||...+..... 
T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~-------a~wv~E~~~~~~s~~~- 222 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNN-------AAAVAEAGTYPFSSEE- 222 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCc-------ceeeccCccccccccc- Confidence 332 2222344566777777777667778888887766665555666544322110 0123377766553321 Q ss_pred cEEecceEEEEeeeeee-hhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKV-SDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~V-S~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) +..++-=.++-..+ .=|.+.+.-. .+.-+|=..+-...+.+-+|.+||+|.- ++ +-.||+...... T Consensus 223 ---f~~i~~~~~k~a~~~~iS~ell~d~--~~l~~~i~~~l~~~i~~~~d~~~l~G~G----~~----~p~Gil~~~~~~ 289 (497) T protein:vir:10 223 ---FARVYEQVGKVANALTITDEGLRDA--PELFNFVQGRLLEGIQRKEEVQLLAGGG----YP----GVNGLLQRSTGF 289 (497) T ss_pred ---ceeeEeeeeeeEeecHhHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHhhcCCC----cc----cccccccccccc Confidence 22222222222111 1222222221 1223333345556788889999998741 11 111222110000 Q ss_pred -------------------Cc-ccCc-------------------------cccceeeccCccccCHHHHHHHHHHHHhC Q lcl|NC_019402. 159 -------------------DA-ADPD-------------------------TGAIVHFETAAAALTEAEIFKVTYNLYLS 193 (318) Q Consensus 159 -------------------~~-~~~~-------------------------~g~~~~~~~t~~~lTe~~l~~~~~~i~~~ 193 (318) .. ++.. .+.. .............+...+..++.. T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 368 (497) T protein:vir:10 290 TASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGV-AGSYPTAAEIAENVFDAFVDIQLT 368 (497) T ss_pred cccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccch-hccccchhhhhhHHHHHHhhhhhh Confidence 00 0000 0000 000011122344455556666655 Q ss_pred CC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcc Q lcl|NC_019402. 
194 GS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWT 272 (318) Q Consensus 194 G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~ 272 (318) +. .++.+++||..-..+..+. +.+ .++-..+......+..+..-.+=+| +.++.+..||++++++.|++... T Consensus 369 ~~~~~~~~vmn~~~~~~l~~lk-d~~----G~~i~~~~~~~~~~~~~~~~~~l~G--~pV~~t~~~~~~~~~~Gd~~~~~ 441 (497) T protein:vir:10 369 LFQTPNAVVMNPRDWELLRLTK-DAN----GQYMGGNFFGNAYGNPVNGGKNIWG--VPVVTTPLIPLGTILVGHFAPSV 441 (497) T ss_pred cccCCCeEEEchHHHHHHHHhh-cCC----CceeccCcccccccccccCCceeec--eeeEecCCCCCCceEEeecccce Confidence 44 5567889998877776664 322 2222211111112222222224567 68899999999999999988654 Q ss_pred eeec-C-cccceecCCCc-----cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 273 QMVL-R-APERTKLAKDG-----SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 273 ~~~L-r-~~~~e~laktG-----d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) +... | .+..+-....+ +....+.+.++++.|++|.|+.+++..+++ T Consensus 442 ~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 442 IQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred EEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 4433 2 12112111222 344555567799999999999999998888 No 89 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=97.55 E-value=2.8e-05 Score=45.57 Aligned_cols=283 Identities=8% Similarity=-0.026 Sum_probs=128.5 Q ss_pred CCceeeee--eeeecccceeeeEecCCcccceeeeeccc-cccc-eEEEeeeeeccccCCccccccccceeeccc---cc Q lcl|NC_019402. 1 MATLVSYD--LNGKKLSFANWISNLSPTDTPFVSMTGKE-AINQ-TLFQWQTDALAPVADPSDAQKRNAVIEGSA---AV 73 (318) Q Consensus 1 Ma~~~t~~--~~~~~~dl~d~I~~i~p~~TP~~s~i~~~-~~~~-~~~~W~td~l~~~~~~~~~~~~na~~EG~d---a~ 73 (318) -..++..+ --.-.++-.+++...=-+.+||++.+--- +..+ ...-|....-... 
.+...|+.+ .+ T Consensus 11 ~k~it~~d~~gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~--------~~~~~~~~~~~~~~ 82 (314) T protein:vir:41 11 TPKIDVPDLGKGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVEL--------EPGRNTSGTKVAPT 82 (314) T ss_pred hcccccccCCCceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCccc--------ccccccccCCccCC Confidence 11122222 22233444455555555778888766532 1122 1222222211000 011112111 12 Q ss_pred cccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 74 DGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 74 ~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) ....+.....=+.-+.-. -+.||.-.-.=+..| .+.-.|-+.+-.+.+.+|+|.++++|......++....+..|++. T Consensus 83 ~~~~tf~~~~l~~~kl~~-~v~is~e~L~D~a~~-~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~ 160 (314) T protein:vir:41 83 ADEVTVSTNTLEMKELVT-KVVLEDEALEDNIEQ-SAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMK 160 (314) T ss_pred cccccccceeeeeEEEEE-eecccHHHHHhhhch-hhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhh Confidence 222222223333333333 466776554423333 243444445667789999999999985322222222346778765 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHh---CCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecC Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYL---SGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDG 230 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~---~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~ 230 (318) .... + +...+++..+++.+.|.+++..+-. +.+..-.++|++....++-++..+....-.+.. 
T Consensus 161 ~a~~-~--------~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~----- 226 (314) T protein:vir:41 161 LAGN-Q--------YTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSA----- 226 (314) T ss_pred hccc-c--------eeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchh----- Confidence 4311 1 1122344556888999999888843 322222456687777777766543321111111 Q ss_pred CceEEEEEEEEEEcCCCcEEEEEecCC-----CCCceEEEEehhhcceeecCcccc--eecCCCccceeeEEEEE----E Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYKLVPNRW-----MPENAVYFFTPSDWTQMVLRAPER--TKLAKDGSYEKWMIEME----V 299 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~iv~nr~-----m~~~~~~~~D~~~~~~~~Lr~~~~--e~laktGd~~k~~i~~E----~ 299 (318) ..+.... +-+| +.|+.... +|+..+++.|++.+-++.-+.... +..++ ..+..++.. + T Consensus 227 ---~~~~~~~---~l~G--~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~---~~~~~~~~~~r~d~ 295 (314) T protein:vir:41 227 ---LIGATGL---QYDG--IPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAA---MRRTEYIASLRADC 295 (314) T ss_pred ---hhCCCCc---eecc--eeeEecccccccCCCCceEEEechhheEEEeeceeEEeecccCc---CCeEEEEEEEEece Confidence 1111111 1235 35554444 467899999999986655555443 33332 112222222 2 Q ss_pred eEEEecccceeEEEEeccC Q lcl|NC_019402. 300 GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~a 318 (318) .++.-+..+.++|.-.++- T Consensus 296 ~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 296 NYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEEcCcEEEEEeeccCCC Confidence 3333222222333332222 No 90 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=97.54 E-value=2.6e-05 Score=45.72 Aligned_cols=295 Identities=13% Similarity=0.088 Sum_probs=152.8 Q ss_pred CCceee--eeeeeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceee-cccccccc Q lcl|NC_019402. 
1 MATLVS--YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIE-GSAAVDGE 76 (318) Q Consensus 1 Ma~~~t--~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~E-G~da~~~~ 76 (318) =++++. .+-..-+++..+.+.+.--+.+|||+.+...+..+...+ |.- .-.- -...++-| |.+...+. T Consensus 20 k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~ki---g~G~-----r~~r~~~e~~~~~~~~~ 91 (360) T protein:vir:99 20 QKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQF---GVPR-----LSGHTRDEEGSRTENSE 91 (360) T ss_pred hhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeeccccccccccc---ccce-----eeccccccCCCCCcCCc Confidence 111221 123334566667777777789999999987766665443 322 1100 00111112 22211122 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHh--hhcCccchHHHHH-HHHHHHHHHHHHHHHhcCcccc-----CCC-CCccch Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVL--ANYGRGKELQYQM-EKAGKEIKRDLEVALLRNGAKV-----DGS-ATVARQ 147 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~--~~~G~~~e~a~q~-~k~~~eikrd~E~a~i~g~~~~-----~gs-~t~~r~ 147 (318) ...+...-|.+.=++-.+.+. .+.+ +..-.+.++..-+ ..-.+.+.+|||...++|.... +|+ ++.-+. T Consensus 92 ~~~~~v~~~~~~~~~~~~~i~--~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~ 169 (360) T protein:vir:99 92 AESGSVKFNATDKSYYILVEP--KRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNT 169 (360) T ss_pred CccccCccccccceeeEeech--HHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhh Confidence 222222222222222223332 2222 1121122222222 3344678999999999887543 222 234478 Q ss_pred hhhHHHHHhcCC--cccCc--------------cc-cceee----ccCccccCHHHHHHHHHHHHhC---CCCcC-EEEE Q lcl|NC_019402. 148 TAGFSALVAAKD--AADPD--------------TG-AIVHF----ETAAAALTEAEIFKVTYNLYLS---GSEAN-IIMF 202 (318) Q Consensus 148 m~Gi~~~i~~~~--~~~~~--------------~g-~~~~~----~~t~~~lTe~~l~~~~~~i~~~---G~~~~-~l~~ 202 (318) +.|++..+...- ..+++ +. .+... .+...++.+++|.++++.+=.. 
+-..+ .+++ T Consensus 170 ~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~ 249 (360) T protein:vir:99 170 FKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMT 249 (360) T ss_pred hHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEc Confidence 999888774331 11111 00 11111 1233668899999999988543 11111 3455 Q ss_pred cchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccce Q lcl|NC_019402. 203 HPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERT 282 (318) Q Consensus 203 ~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e 282 (318) ++.....-.....+ |... -++.--+| +...+.+| |.|+.-++||.+.+++-+|..+.+..-|..+.+ T Consensus 250 s~~~~~~yr~~L~~-------R~t~-LGd~~l~g---~~~~~~~G--ipi~~v~~~pd~~~mlT~p~NLi~g~~~~iri~ 316 (360) T protein:vir:99 250 SPNQVQSYTMSLTE-------REDP-LGSAVIFG---DSDITPFS--YDLVGVNGFPDEYMMFTDPNNLAFGLYEEMELD 316 (360) T ss_pred cCchHHHHHHHHhc-------cCcc-cchhheec---ccccccce--eeeEEcCCCCCCceEEeccCceeEEeeeeeEEe Confidence 65544443333221 1100 01111111 12345788 788888999999999999999988887888765 Q ss_pred ecCCC------ccceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 283 KLAKD------GSYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 283 ~lakt------Gd~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ..... --+-+..+.+-+-..+.++.|+++++++... 
T Consensus 317 ~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~ 358 (360) T protein:vir:99 317 QSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETP 358 (360) T ss_pred ecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCC Confidence 33221 1123333445566777899999999999988 No 91 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.53 E-value=2.4e-05 Score=45.88 Aligned_cols=269 Identities=8% Similarity=-0.045 Sum_probs=127.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +.+.++.+ -...-.++...|...-...+|+.+++.....++....|..-...... ..-..||.+.+....-. T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~E~~~~~~~~~~~ 183 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDR-------FSSVAELAENPALAEPE 183 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCc-------ccccccccccccccccc Confidence 33333322 12234466677777777788888887766665554454433221110 11233665554322111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.---.-+..-+.||.. .+.... .+...|=...-.+.+.+-+|.++++|.. 
.+ ++ T Consensus 184 ~~~v~l~~~k~~~~~~iS~e--ll~ds~-~~l~~~i~~~la~~~~~~~~~~il~g~g----~~-~~-------------- 241 (394) T protein:vir:10 184 FEQVDWSVSTYRGAIPLSEE--AIADSA-VDLTSLVGQSINEKSVNTYNAMIAPVLQ----SF-TA-------------- 241 (394) T ss_pred ceeEEeeeeeeEeeehhHHH--HHhhhh-HHHHHHHHHHHHHHHHHHHHHHHhhccc----cc-cc-------------- Confidence 11111111122233444443 322211 1222222223334566678888886642 11 00 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) .+.....+.+.|.+++...-....+ ..+++|+.....+..+. +.+ .++......... ... T Consensus 242 ------------~~~~~~~~~d~l~~~~~~~~~~~~~-a~~vmn~~~~~~l~~lk-d~~----G~~i~~~~~~~~--~~~ 301 (394) T protein:vir:10 242 ------------KATTTDTLVDSLKHILNVDLDPAYS-RALVVTQSLFNTLDTLK-DKN----GRYLLHDASDSI--TDG 301 (394) T ss_pred ------------ccccccccHHHHHHHHHhhhhhhcc-CEEEecHHHHHHHHHhh-ccC----CCeeeecccccc--ccC Confidence 0111224556677766543333322 46788998877777764 221 233222211110 000 Q ss_pred EEEEcCCCcEEEEEecCCCCCc----eEEEEehhhcceeecC-cccceecCCCccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPEN----AVYFFTPSDWTQMVLR-APERTKLAKDGSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~~~~~~~Lr-~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) ..-.+=+|..|.++.+..++.+ .+++.|++..-+.+.| .+..+..--....+....+..+...+.+|+|...|+. T Consensus 302 ~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~ 381 (394) T protein:vir:10 302 TAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGYFVTN 381 (394) T ss_pred CcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccceeEEEEEEeccEEeccccEEEEEe Confidence 1112336754444445566642 2677788753223332 2221111111223345667778999999999999998 Q ss_pred eccC Q lcl|NC_019402. 
315 KAGA 318 (318) Q Consensus 315 lt~a 318 (318) .+++ T Consensus 382 ~~~~ 385 (394) T protein:vir:10 382 TDAA 385 (394) T ss_pred eccc Confidence 8888 No 92 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.53 E-value=1.5e-05 Score=47.09 Aligned_cols=263 Identities=10% Similarity=-0.017 Sum_probs=138.0 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccce---EEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT---LFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~---~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |++.++.+ -...-+++.+.|...-....|+++++.....++. .+-|...+-... ..-.-||...++.. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--------a~~v~E~~~~~~~~ 180 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGL--------AKLDDEAGSIGTND 180 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcc--------eeeecccccccccc Confidence 55444433 2234568888888888888898887665433322 222222211110 11123565544322 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .-.-..+.=-.+-+...+.||.....-... +...|=...-...+.+-+|.++++|. |.+.. 
T Consensus 181 ~~~~~~v~~~~~k~~~~~~iS~ell~ds~~---~l~~~v~~~l~~~~~~~~d~~il~G~----g~~~~------------ 241 (397) T protein:vir:48 181 DPKLYPIRYAIKRYAGISTVTNSLLADSAE---NILAWLSGWIAKKVVVTRNKAILEAI----ATLPT------------ 241 (397) T ss_pred ccceeeEEeeheeeeeehhhHHHHHhhchH---HHHHHHHHHHHHHHHHHHHHHHhhcc----ccccc------------ Confidence 111111111112223334555443322222 33334444455566778999999773 11100 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLN 236 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g 236 (318) .+...+.+.|.+++.++-.+......++|||.....+..+. +.+ .++.....-.. | T Consensus 242 -----------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~~----G~~i~~~~~~~--~ 297 (397) T protein:vir:48 242 -----------------KPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVK-NAF----GDYLMERDVKS--P 297 (397) T ss_pred -----------------ccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhh-cCC----CceeeccCcCC--C Confidence 11234667788888777766666678899999888887764 222 22222111000 0 Q ss_pred EEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhh-cceeecCcccceecC-----CCccceeeEEEEEEeEEEec Q lcl|NC_019402. 237 VYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSD-WTQMVLRAPERTKLA-----KDGSYEKWMIEMEVGLRHRN 305 (318) Q Consensus 237 ~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~-~~~~~Lr~~~~e~la-----ktGd~~k~~i~~E~tLe~~N 305 (318) . --+=+|..|.++.+.++|. ..+++.|++. +.+..-.....+... -.-+......+..+...+++ T Consensus 298 ~----~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~ 373 (397) T protein:vir:48 298 T----GYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATD 373 (397) T ss_pred C----CceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEec Confidence 0 0122675555556667763 4478889874 333333333322211 12244556667778999999 Q ss_pred ccceeEEEEeccC Q lcl|NC_019402. 
306 PYASGILEVKAGA 318 (318) Q Consensus 306 ~~a~g~i~~lt~a 318 (318) |.+..+++..+++ T Consensus 374 ~~a~~~~~~~~~~ 386 (397) T protein:vir:48 374 TESFVPASFKAIA 386 (397) T ss_pred ccceEEEEecccc Confidence 9999999988887 No 93 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=276 Identities=11% Similarity=0.056 Sum_probs=133.1 Q ss_pred CCceeeeeeee---ecccceeeeEecCCcccceeeeec-cccccceEEEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYDLNG---KKLSFANWISNLSPTDTPFVSMTG-KEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~~~~---~~~dl~d~I~~i~p~~TP~~s~i~-~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ++.-++....| .-+.+.+.|...-...+|+..+-. .-+..+..+.|..-+-.+. ..-.-||...+... T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~--------a~~v~E~~~~~~~~ 201 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAI--------VGYIGADTDIPTTQ 201 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcc--------eeeeccCccccccc Confidence 22222222221 223444444443345566655311 1112222234432221111 01123666655432 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCcc-chHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRG-KELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~-~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) ... ..+.=-..-+...+.|| .+.+...+.. +..++=...-...+.+.+|.+||+|. |++.. ..||..+. 
T Consensus 202 ~~f-~~i~~~~~k~~~~~~is--~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~----G~~~~---p~Gi~~~~ 271 (435) T protein:vir:80 202 QQF-DDLKLTAKKMAALVPIA--NDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDD----GTANT---PKGLRFWA 271 (435) T ss_pred cce-eeEEEeeEEEEEeehhh--HHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccC----CCCCc---ccceeecc Confidence 111 11111112223334444 3333333322 22333334445568899999999873 43322 34666443 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC--CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCce Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS--EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~--~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~ 233 (318) ...+.. ...+++....+..++.+++-.+..+.. ....+++|+.....+..+. +.+ .++........ T Consensus 272 ~~~~~~-------~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~~----G~~l~~~~~~~ 339 (435) T protein:vir:80 272 LPGNVI-------TASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR-DGN----GNKVYPELANG 339 (435) T ss_pred ccccee-------ecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh-ccC----CceeccCCCCC Confidence 222211 112233333445667777777765533 3346788999888877664 322 22222111111 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCCCc--------eEEEEehhhcceeecCcccceecC----CCc----------cce Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMPEN--------AVYFFTPSDWTQMVLRAPERTKLA----KDG----------SYE 291 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~~~--------~~~~~D~~~~~~~~Lr~~~~e~la----ktG----------d~~ 291 (318) +-+| +.++.+.+||.+ .+++.|++++-+.--.++..+... +++ |.. T Consensus 340 ----------~l~G--~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~ 407 (435) T protein:vir:80 340 ----------MLKG--YPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQT 407 (435) T ss_pred ----------eEee--eeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcc Confidence 2356 577888888763 588888887544322333222221 111 334 Q ss_pred eeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
292 KWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 292 k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ..+++..+.+.+.+|+|..+|+++.-. T Consensus 408 ~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 408 LIRVIAKNDFGPRHVESIAVLSGVAWG 434 (435) T ss_pred eeeeeeeeCcEeecccceEEEeccCCC Confidence 556777788999999999999999877 No 94 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.42 E-value=3.6e-05 Score=44.94 Aligned_cols=269 Identities=7% Similarity=-0.067 Sum_probs=126.6 Q ss_pred CCceeeeee-eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |+..++.+. ...-..+...|...-....|+..++....+++...+|..-.-.... .....||...+...... T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~E~~~~~~~~~~~ 181 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDR-------FSSVAELAENPKLAEPE 181 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCc-------ccccccccccccccccc Confidence 554444332 2234466677777777888888887666655554444432211110 01123665544322111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.--..-+..-+.||.. .+..... +...|=...-...+.+-++.++++|... +. 
T Consensus 182 ~~~i~~~~~k~~~~~~iS~e--ll~ds~~-~l~~~i~~~la~~~~~~~~~~i~~g~~~----~~---------------- 238 (389) T protein:vir:10 182 FNKVDWSVATYRGAIPLSEE--AIADSAV-DLTALVGQSIKEKSVNTYNAMIAPVLQS----FT---------------- 238 (389) T ss_pred ceeeeeeheeeEeeehhhHH--HHhhhhH-HHHHHHHHHHHHHHHHHHHHHHhhhhcc----cc---------------- Confidence 12222122223344444443 3322211 2222222233344556677777755311 00 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ..+.....+.++|.+++...-+.+. ...++||+.....+..+. +.+ .++...++-... ... T Consensus 239 -----------~~~~~~~~~~d~l~~~~~~~~~~~~-~a~~~~n~~~~~~L~~lk-d~~----G~~i~~~~~~~~--~~~ 299 (389) T protein:vir:10 239 -----------AKKTTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLK-DKN----GRYLLHDASDSI--TDG 299 (389) T ss_pred -----------cccccccccHHHHHHHHHhhhhhhh-CcEEEecHHHHHHHHHhh-ccC----CCeeeecCcccc--ccc Confidence 0112234567778877765444333 246889999888887764 221 233222221110 000 Q ss_pred EEEEcCCCcEEEEEecCCCCCc----eEEEEehhh-cceeecCcccceecCCCccceeeEEEEEEeEEEecccceeEEEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPEN----AVYFFTPSD-WTQMVLRAPERTKLAKDGSYEKWMIEMEVGLRHRNPYASGILEV 314 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~----~~~~~D~~~-~~~~~Lr~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g~i~~ 314 (318) ..-.+=+|..|.++.+..++.+ .+++.|++. +.+.--..+..+-.--....+..+++..+.+.+.+|.|+..++. T Consensus 300 ~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~ 379 (389) T protein:vir:10 300 TAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYGKYLGAAFRFGVQKADSKAGYFVTN 379 (389) T ss_pred ccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccccccceEEEEEEeccEEecccceEEEEe Confidence 1111236743433444444432 267778775 23221112222111112223345666679999999999877776 Q ss_pred eccC Q lcl|NC_019402. 
315 KAGA 318 (318) Q Consensus 315 lt~a 318 (318) .+++ T Consensus 380 ~~~~ 383 (389) T protein:vir:10 380 TDVP 383 (389) T ss_pred eccC Confidence 5555 No 95 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.41 E-value=7.1e-05 Score=43.34 Aligned_cols=273 Identities=11% Similarity=0.012 Sum_probs=136.2 Q ss_pred CCceeee-eeeeecccceeeeEecCCcccceeeeeccccccceEE--E-eeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSY-DLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLF--Q-WQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~-~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~--~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |.+-++. .-...-+++...|...--..+|++.++....+..... . |.......+. ...||...+... T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~---------~v~e~~~~~~~~ 180 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMK---------PLSENQQIPTNG 180 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCccee---------eccccccccccc Confidence 3322221 1222346777778777778899999888766543322 2 2222221111 133665544433 Q ss_pred ccCcEEecceEE---EEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 77 RASTTVINNVTQ---ILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 77 ~~~~~~~~N~tQ---If~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) .++. +.+++- -+...+.||. +.+.... .+...|=...-...+.+-+|.++|.|. |.+.. .+||.. 
T Consensus 181 ~~~~--f~~i~~~~~k~~~~~~iS~--ell~ds~-~~l~~~i~~~la~~~~~~~~~~il~G~----g~~~~---~~gi~~ 248 (404) T protein:vir:10 181 DNGK--LERFNFKLKDLADFMSIPN--DLLKFAD-KSLEDWIINWFVDKVRITRNAEILYGA----GGDEH---ATGIMT 248 (404) T ss_pred cccc--eeeeEeeheeeEeeehhhH--HHHhhcH-HHHHHHHHHHHHHHHHHHHHHHHhhcC----CCCCc---ccceee Confidence 2221 122211 1122334443 3333222 233444444556667789999999873 32222 335442 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcC-EEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEAN-IIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~-~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) -... .+.......+.+++.+++...-..+...+ .++|||.....+..+. +.+ .++...+.-. T Consensus 249 ~~~~------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~~ 311 (404) T protein:vir:10 249 ANKF------------KKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLE-DKT----GRPYLQPDPK 311 (404) T ss_pred cccc------------ceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcC Confidence 1100 11122334577888887775555554443 5788998877777653 221 2332221100 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCC-----ceEEEEehhhc-ceeecCcccceecCC-C----ccceeeEEEEEEeE Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPE-----NAVYFFTPSDW-TQMVLRAPERTKLAK-D----GSYEKWMIEMEVGL 301 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~-----~~~~~~D~~~~-~~~~Lr~~~~e~lak-t----Gd~~k~~i~~E~tL 301 (318) . .... +=+|..|.++++ .|+. ..+++.|++.. .+..-.....+.... . -+....+.+..+.. T Consensus 312 ~---~~~~---~l~G~PV~~~~~-~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~ 384 (404) T protein:vir:10 312 D---PTQY---RFLGLPVIELPN-DLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDG 384 (404) T ss_pred C---CCCc---cccceeeEEecc-cccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeecc Confidence 0 0001 124633322333 3443 23778888753 332222333222211 1 23344666677999 Q ss_pred EEecccceeEEEEeccC Q lcl|NC_019402. 
302 RHRNPYASGILEVKAGA 318 (318) Q Consensus 302 e~~N~~a~g~i~~lt~a 318 (318) .+++|.|+.+++..++| T Consensus 385 ~v~~~~a~~~~~~~~aa 401 (404) T protein:vir:10 385 NVKDSEALLIAEIPVES 401 (404) T ss_pred EEecccceEEEEeeccc Confidence 99999999999999999 No 96 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=271 Identities=14% Similarity=0.039 Sum_probs=129.0 Q ss_pred CCceeeeeeee--ecccceeeeEecCCcccceeee-eccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDLNG--KKLSFANWISNLSPTDTPFVSM-TGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~~~--~~~dl~d~I~~i~p~~TP~~s~-i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) |+.-++.+.-| .-+.+.++|...-...+|+..+ .-.-+..+-...|...+-... ..-.-||.+.+.... T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~--------a~wv~E~~~~~~s~~ 135 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGAT--------AGYVGEGKDVVATGA 135 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcc--------eeeeccCcccccccc Confidence 44333333222 3456677776666667777544 111112222233333221110 011237766654332 Q ss_pred cCcEEecceEE-EEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 78 ASTTVINNVTQ-ILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 78 ~~~~~~~N~tQ-If~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) + +++++- .++=...+.=|.+.+..... +..++=..+-.+.+.+-+|.+||+|. |.+..| .||..... 
T Consensus 136 ~----f~~i~~~~~k~~~~~~iS~ell~ds~~-~~~~~i~~~l~~a~~~~~d~a~l~G~----G~~~~p---~Gi~~~~~ 203 (366) T protein:vir:57 136 T----FDDVKLSAKTMIALVPVSNQLIGRAGF-NVEQLLLGDILSAIATREDKAFLRDD----GTGDTP---KGMKAVAT 203 (366) T ss_pred c----eeEEEEeeEEEEEeehhhHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHHhhccC----CCCccc---cceeeccc Confidence 2 122211 11111122223333332221 22233334555567888999999883 333233 35654332 Q ss_pred cCCcccCccccceeeccCccccCHHHHH---HHHHHHHh-C--CCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecC Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIF---KVTYNLYL-S--GSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDG 230 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~---~~~~~i~~-~--G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~ 230 (318) ..+. ....++...+.+.++ +.+...+. . .......++++.....+..+. +.+ .++...+. T Consensus 204 ~~~~---------~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk-d~~----G~~l~~~~ 269 (366) T protein:vir:57 204 AANR---------LVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR-DGN----GNKVYPEM 269 (366) T ss_pred cccc---------eeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh-ccC----CceeccCC Confidence 2111 111222233444343 33333332 1 123345678998877777664 221 22222111 Q ss_pred CceEEEEEEEEEEcCCCcEEEEEecCCCCCc--------eEEEEehhhcceeecCcccceec--C------------CCc Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN--------AVYFFTPSDWTQMVLRAPERTKL--A------------KDG 288 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~--------~~~~~D~~~~~~~~Lr~~~~e~l--a------------ktG 288 (318) .. | +=+| +.++.+..||++ .+++.|++.+-+..-.++..+.. + -.- T Consensus 270 ~~---g-------~l~G--~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~ 337 (366) T protein:vir:57 270 SQ---G-------ILKG--YPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFAR 337 (366) T ss_pred CC---C-------eecc--eeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhc Confidence 11 1 1257 688889999863 37888988765443233222211 1 112 Q ss_pred cceeeEEEEEEeEEEecccceeEEEEecc Q lcl|NC_019402. 
289 SYEKWMIEMEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 289 d~~k~~i~~E~tLe~~N~~a~g~i~~lt~ 317 (318) |....+.+..+.+.+++|+|..++++..= T Consensus 338 ~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 338 NQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CceeEEeeeeeCcEeeccccEEEEecccC Confidence 44566777789999999999999988888 No 97 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.09 E-value=0.00017 Score=41.20 Aligned_cols=263 Identities=10% Similarity=0.013 Sum_probs=131.1 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEE---EeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLF---QWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~---~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |..-++.+ -...-+.+.+.|...-....|+++++.....++... -+...+-.. ...-.-||.+.+... T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--------~a~~v~E~~~~~~~~ 187 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTP--------LTVMDAEDGKIPDLD 187 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecccccc--------ceeeecCcccccccc Confidence 22212111 122456778888888888899988877655443221 122111100 011123666554322 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.--..-+..-+.||...-.-... +...|=...-...+.+-+|.+|++|.. 
+++ + T Consensus 188 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~---~l~~~i~~~l~~~~~~~~~~~il~g~g----~~~-~----------- 248 (408) T protein:vir:10 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAE---NILAWLSSWIAKKVVVTRNQAIIEVMK----AAP-K----------- 248 (408) T ss_pred CcceeeEEeeeeeEEeeehhHHHHHhhchH---HHHHHHHHHHHHHHHHHHHHHHhhccc----ccc-c----------- Confidence 111122222222233444555443221112 223333333445566778888887632 110 0 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .....+.++|.+++...-..+. ....++||+.....+..+. +.+ .++...+.-.. T Consensus 249 -----------------~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lk-d~~----G~~i~~~~~~~-- 304 (408) T protein:vir:10 249 -----------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVK-TAE----GKYLLEPDPTK-- 304 (408) T ss_pred -----------------ccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CceEeccCcCC-- Confidence 0112355667666643333332 2336789999888877764 322 23322221111 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEeEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVGLRHR 304 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~tLe~~ 304 (318) +. -.+=+|..|.++.+..||+.. +++.|++. +.+..-..+..+-..-.+ +......+..+.+.+. T Consensus 305 ~~----~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) T protein:vir:10 305 PN----SYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) T ss_pred CC----CceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEe Confidence 10 012268544444577788633 78889885 444433444433332222 3345556666999999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|.|...++..+.+ T Consensus 381 ~~~a~~~~~~~~~~ 394 (408) T protein:vir:10 381 DSEALVAGSFSAIA 394 (408) T ss_pred ccccEEEEEeeccc Confidence 99999999988877 No 98 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=97.09 E-value=0.00016 Score=41.33 Aligned_cols=253 Identities=15% Similarity=0.134 Sum_probs=136.1 Q ss_pred CCceeeeeeeeeccccee----------eeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFAN----------WISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGS 70 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d----------~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~ 70 (318) ||- |.....-+-|=+.+ .++.+.+.++- |+--++.++ +.+.|. .+..+.+ -.||. T Consensus 1 Ma~-T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~-L~g~~G~ti--~~P~~~--~igdae~---------~~eg~ 65 (270) T protein:vir:95 1 MTQ-TKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDT-LVGQPGDTI--TRPKYA--YIGAAED---------LQEGV 65 (270) T ss_pred CCc-eehhhhcchHHHHHHHHHHHHhHHhhccccccccc-cCCCCCCEE--Eeeeec--CCCcccc---------ccCCC Confidence 662 22211111111111 12233333331 121223344 345673 2332222 33677 Q ss_pred ccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhh Q lcl|NC_019402. 71 AAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAG 150 (318) Q Consensus 71 da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~G 150 (318) +.+....+.....-=+.|. .+.|++++-+..+. + +|-+..-..+...-+.|.+++.+|.--..... 
T Consensus 66 ~i~~~~lt~~~~~a~i~~~-gk~~~itD~a~~~~--~-~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~---------- 131 (270) T protein:vir:95 66 AMDTTQMSMTTTKVTVKET-GKAVEVTQTAIITN--V-NGTLQEASRQLAMSLADKVEIDYIAELNKSKQ---------- 131 (270) T ss_pred ccchhhcccchheeeeehh-hCcceecHHHHhhh--c-cchHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 7665555444444334443 57788888766544 2 35455555556666778888877732111000 Q ss_pred HHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecC Q lcl|NC_019402. 151 FSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDG 230 (318) Q Consensus 151 i~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~ 230 (318) .....+|.+.|.+.++++=+++.....++|||.+...+.+ +.... ... .+ T Consensus 132 ----------------------~~~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk---~~~~~-~~~----~~ 181 (270) T protein:vir:95 132 ----------------------TATVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVK---SLFKV-GGN----VQ 181 (270) T ss_pred ----------------------ccccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHh---hhccc-ccc----cc Confidence 0113478899999999998888888999999987655432 22110 000 00 Q ss_pred CceEEEEEEEEEEcCCCcEEE-EEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEEEE--EEeEEEeccc Q lcl|NC_019402. 231 QDTRLNVYVSSIVDPLGCQYK-LVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMIEM--EVGLRHRNPY 307 (318) Q Consensus 231 ~~~~~g~~v~~~~tdfG~~v~-iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~~--E~tLe~~N~~ 307 (318) ..-.... .|-+-.| ++ ||-++..+..+.+++.+..+.+...+++..|. ...-+...-.|++ -|++.+.||- T Consensus 182 ~~~~~~G---~ig~~~G--~~Viv~s~~~~~~~~~l~~~gAi~~~~~~~~~vEt-dRd~~~~~d~i~~~~~y~v~~~~~s 255 (270) T protein:vir:95 182 DRAISKG---DLVEIVG--VSDIVKSKRVSENTAFLQRYGAMEIVNKKKPEAYT-DFDILKRTHLLSTNYHYSVNLKDET 255 (270) T ss_pred cchhccc---ccceecc--eeEEEeCCCCCceeEEEEeccceeeeecCCceeee-ccchhhcccEEEeeeEEEEEEEccc Confidence 1000001 1222246 45 46688888999999999999988888766432 2222222223333 2899999999 Q ss_pred ceeEEEEeccC Q lcl|NC_019402. 
308 ASGILEVKAGA 318 (318) Q Consensus 308 a~g~i~~lt~a 318 (318) ...+++--.+. T Consensus 256 kvv~~t~~~a~ 266 (270) T protein:vir:95 256 GVVKVTFKPSG 266 (270) T ss_pred eEEEEEecCCC Confidence 99888765555 No 99 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.04 E-value=0.0002 Score=40.91 Aligned_cols=263 Identities=10% Similarity=0.030 Sum_probs=131.9 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceE---EEeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTL---FQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~---~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++.+ -...-+.+.+.|...-...+|+++++.....++.. .-|...+-...+ .-.-||...++.. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a--------~~v~Eg~~~~~~~ 187 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLT--------VMDAEDGKIPDLD 187 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccce--------eeecCcccccccc Confidence 32222211 22356678888888888899999988765544332 333332221110 1123665544321 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-..-+...+.||..... -.. .+...|=...-...+.+-+|.++|+|. |+++. 
T Consensus 188 ~~~f~~i~~~~~k~~~~~~iS~ell~--ds~-~~l~~~i~~~l~~~~~~~~d~~il~g~----g~~~~------------ 248 (404) T protein:vir:39 188 NPRLTIIKYLIKRYAGIITATNTLLK--DTA-ENILAWLSSWIAKKVVVTRNQAIIAAM----GTVPK------------ 248 (404) T ss_pred ccceeeEEeeeeeEEeeehhHHHHHh--hch-HHHHHHHHHHHHHHHHHHHHHHHHhcc----ccccc------------ Confidence 11111111111222333445544332 111 233444444555666778899999773 22100 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhC-CCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLS-GSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~-G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .+...+.+++.+++...-.. ......++|||.....+..+. +.+ .++...+.... T Consensus 249 -----------------~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lk-d~~----G~~l~~~~~~~-- 304 (404) T protein:vir:39 249 -----------------KPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVK-TAE----GKYLLEPDPTK-- 304 (404) T ss_pred -----------------ccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CceeeccCcCC-- Confidence 01124556666666533322 223346889999887777653 221 23222111000 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEeEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVGLRHR 304 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~tLe~~ 304 (318) + .. .+=+|..|.+..+..+|.. .+++.|++. +.+..-.+...+-....+ +.....++..+...+. T Consensus 305 ~-~~---~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~ 380 (404) T protein:vir:39 305 P-NS---YLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTT 380 (404) T ss_pred C-Cc---ceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEe Confidence 0 00 1226744444446667653 388889885 333322333333222222 3344556677899999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|.|..+++..+.+ T Consensus 381 ~~~a~~~~~~~~~a 394 (404) T protein:vir:39 381 DSEALVAGSFTAIA 394 (404) T ss_pred cccceEEEEeeccc Confidence 99999999988877 No 100 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.94 E-value=0.00025 Score=40.36 Aligned_cols=270 Identities=13% Similarity=0.024 Sum_probs=133.5 Q ss_pred CCceeeeee--eeecccceeeeEecCCcccceeeeecc-ccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL--NGKKLSFANWISNLSPTDTPFVSMTGK-EAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~--~~~~~dl~d~I~~i~p~~TP~~s~i~~-~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ++.-++... ...-.++.+.|...=...+|+..+.-. .+..+-.+.|...+-.+. ..-.-||...+.... T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~--------a~~v~Eg~~~~~~~~ 196 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGAT--------ASYTGENQDAKVSEA 196 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcc--------eeeeccCcccccccc Confidence 222222222 223456666666666677887666211 122222344544321111 111337776665332 Q ss_pred cCcEEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHH Q lcl|NC_019402. 78 ASTTVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSAL 154 (318) Q Consensus 78 ~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~ 154 (318) . +.+++ .-+...+.||.- .+.... .+...|=...-...+.+-+|.++|+|. 
|++..| -||+.- T Consensus 197 ~----f~~i~~~~~k~~~~v~is~e--ll~ds~-~~l~~~i~~~l~~ai~~~~d~~~l~G~----G~~~~p---~Gi~~~ 262 (428) T protein:vir:10 197 R----FDDVKLTAKTMIAMVPISNA--LIGRAG-FNVEQLVLQDILTAISVREDKAFMRDD----GTGDTP---IGMKAR 262 (428) T ss_pred c----eeeEEeeeEEEEEeehhhHH--HHhhhh-HHHHHHHHHHHHHHHHHHHHHHHhccC----CCCccc---cccccc Confidence 2 22222 222233444433 332221 233344445566668899999999873 333333 366532 Q ss_pred HhcCCcccCccccceeeccCccccCHHHHHHHHHHHH---hCCC---CcCEEEEcchHHhhhhhhhhhhcccccceEEEe Q lcl|NC_019402. 155 VAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLY---LSGS---EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMF 228 (318) Q Consensus 155 i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~---~~G~---~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~ 228 (318) ... ++......+....+.+.++.++..+. ..+. .....++++.....+..+. +.+ .++... T Consensus 263 ~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~----G~~i~~ 329 (428) T protein:vir:10 263 ATQ--------WNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR-DGN----GNKVYP 329 (428) T ss_pred ccc--------ccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh-ccC----Cceecc Confidence 211 11112222334445555555544433 2221 2235678988877776664 322 222221 Q ss_pred cCCceEEEEEEEEEEcCCCcEEEEEecCCCCCc--------eEEEEehhhcceeecCcccceecC--------------C Q lcl|NC_019402. 229 DGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN--------AVYFFTPSDWTQMVLRAPERTKLA--------------K 286 (318) Q Consensus 229 ~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~--------~~~~~D~~~~~~~~Lr~~~~e~la--------------k 286 (318) .... | +=+| +.++.+.+||++ .+++.|++.+-+..-.++..+... - T Consensus 330 ~~~~---g-------~l~G--~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f 397 (428) T protein:vir:10 330 EMAQ---G-------MLKG--YPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAF 397 (428) T ss_pred CCCC---C-------eeec--eeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEeecccccccccccccchh Confidence 1111 1 1246 578888888764 378889887655433222221111 1 Q ss_pred CccceeeEEEEEEeEEEecccceeEEEEecc Q lcl|NC_019402. 
287 DGSYEKWMIEMEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 287 tGd~~k~~i~~E~tLe~~N~~a~g~i~~lt~ 317 (318) .-|-...+.+..+.+.+.+|.|..++++..= T Consensus 398 ~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 398 SRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred hcchhheeeeeeeCceeeccceEEEEeccCC Confidence 1133455677778899999999999999888 No 101 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=96.92 E-value=0.00019 Score=41.05 Aligned_cols=259 Identities=12% Similarity=-0.052 Sum_probs=124.9 Q ss_pred CCc-eeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc- Q lcl|NC_019402. 1 MAT-LVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER- 77 (318) Q Consensus 1 Ma~-~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~- 77 (318) |.. .++.+ -...-+++.+.|...-....++..++...+.++....|..-...... ..-..||...+.... T Consensus 133 ~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~E~~~~~~~~~~ 205 (400) T protein:vir:38 133 VNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTK-------MVTVAELEKNPAMAKP 205 (400) T ss_pred HhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCc-------cccccccccccccccc Confidence 222 22222 12233456777777777788888887766655544444432211110 112336665543211 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ......-+. .-+...+.||.....-......+.+..++. ..+.+-++.+++.|... . 
T Consensus 206 ~f~~i~~~~-~k~~~~~~is~ell~ds~~~~~~~i~~~l~---~~~~~~~~~~i~~~~~~----~--------------- 262 (400) T protein:vir:38 206 EFKPVNWSV-ETYRQALPVSQESIDDSAIDLVGLIAQNGQ---QIKVNTTNGAVATLLKG----F--------------- 262 (400) T ss_pred cceeeEeeh-hheeeehhhHHHHHhhhHHHHHHHHHHHHH---HHHHHHHHHhhhhcccc----c--------------- Confidence 111111111 122234444443222111222223333333 33445566666655311 0 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) ......+.++|.+++...-+...+ ..+++||.....|..+. +.+ .++...+.-.. +. T Consensus 263 ---------------~~~~~~~~~~~~~~~~~~~~~~~~-a~~v~~~~~~~~l~~lk-d~~----G~~i~~~~~~~--~~ 319 (400) T protein:vir:38 263 ---------------TAKTISSVDDLKHINNVDLDPAYS-RVIIASQSFYNFLDTVK-DGN----GRYLLQDSILT--PS 319 (400) T ss_pred ---------------cccccccHHHHHHHHHhhhhhhhC-cEEEEcHHHHHHHHHhh-ccC----CCeeeecCcCC--CC Confidence 011224566677776655444333 35678998877777653 322 23322221100 00 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhhcceeecC-cccceecCCCccceeeEEEEEEeEEEecccceeE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSDWTQMVLR-APERTKLAKDGSYEKWMIEMEVGLRHRNPYASGI 311 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~~~~~~Lr-~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g~ 311 (318) - -+=+| ++++.+..||.. .+++.|++..-+.+-| .+..+-.--.........+..++..+.+|.|+.. T Consensus 320 -~---~~l~G--~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~ 393 (400) T protein:vir:38 320 -G---KSVLG--MPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYF 393 (400) T ss_pred -c---ccccc--ceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEE Confidence 0 12257 466666666642 2677788753222222 2222111122334567778889999999999999 Q ss_pred EEEeccC Q lcl|NC_019402. 
312 LEVKAGA 318 (318) Q Consensus 312 i~~lt~a 318 (318) |+..++| T Consensus 394 l~~~~~a 400 (400) T protein:vir:38 394 LTYTPKA 400 (400) T ss_pred EEeecCC Confidence 9999999 No 102 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=96.87 E-value=0.00029 Score=40.01 Aligned_cols=224 Identities=14% Similarity=0.147 Sum_probs=129.1 Q ss_pred eeeec-cccccceEEEeeeeeccccCCccccccccceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCcc Q lcl|NC_019402. 31 VSMTG-KEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRG 109 (318) Q Consensus 31 ~s~i~-~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~ 109 (318) -..+. +.+++-+ .| +.++.+ ..||...+....+.....-=+.|+ .|.|+|++.++.+. + + T Consensus 1 ~~~~~~Gdtit~P--~~----iGda~~---------v~eG~~i~~~~l~~t~~~atIk~~-gk~~~itD~a~l~~-~--g 61 (231) T protein:vir:73 1 ENGINLANLCEYP--ND----IGDAAD---------VAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSG-Y--G 61 (231) T ss_pred CccccCCceEEec--cc----ccchhh---------hcCCCcCChhhccccceeeeEeee-ccceeeeHHHHhhc-c--C Confidence 00111 1233333 56 233222 347888777766666666666776 88999999988763 3 3 Q ss_pred chHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHH Q lcl|NC_019402. 110 KELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYN 189 (318) Q Consensus 110 ~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~ 189 (318) |-+.+-..+-...|++-+.+-++.- .+ ++. 
-....++|.+.|.+.+++ T Consensus 62 Dp~~ea~~Q~~~~iA~kvD~di~~~-~~--~a~-----------------------------l~~~~~~t~d~i~~A~~~ 109 (231) T protein:vir:73 62 DPIGESNKQLGLSLANKVDDDLLKA-AK--TTS-----------------------------QTVSTKANVDGVQAALDI 109 (231) T ss_pred chHHHHHHHHHHHHHHhhhHHHHHh-hc--ccc-----------------------------ccccccccHHHHHHHHHH Confidence 5444444444445555555544421 00 000 001235899999999999 Q ss_pred HHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEE---- Q lcl|NC_019402. 190 LYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYF---- 265 (318) Q Consensus 190 i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~---- 265 (318) +-+.+..+..++|||.....+-+|..... ....-+..-.....+-+| .| ++|+.++.+|.+..+. T Consensus 110 fgde~~~~~vivv~p~~~~~Lrk~~~~~~------~~~~~g~~i~~~G~iG~i---~G--~~Vi~S~~~~~~~~~~~~~i 178 (231) T protein:vir:73 110 FNDEDAQAYVLIVNPKDAAKIRKDANAKN------IGSEVGANALINGTYADV---LG--AQIVRSKKLAEGSALMFKIV 178 (231) T ss_pred hccccccceEEEEcchHHHhhhhccchhh------hhhhhccceeeecccceE---cc--eEEEEcCCCCCCceeeeeEE Confidence 98888888899999986555444321110 000001111111112222 46 7999999999988754 Q ss_pred EehhhcceeecCcccceecCCCccceeeEEEEE--EeEEEecccceeEEEEecc Q lcl|NC_019402. 266 FTPSDWTQMVLRAPERTKLAKDGSYEKWMIEME--VGLRHRNPYASGILEVKAG 317 (318) Q Consensus 266 ~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~~E--~tLe~~N~~a~g~i~~lt~ 317 (318) .-+..+.+..-|.+..|. 
....+.+.-.|++- |+..+-||....+|+..-- T Consensus 179 ~~~gAl~~~~k~~~~vEt-dRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 179 SNSPALKLVLKRGVQVET-DRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred eeccceeeeecccceeec-cccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 336777777778776552 23444444444443 8899999999887765444 No 103 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=96.69 E-value=0.00041 Score=39.18 Aligned_cols=263 Identities=10% Similarity=0.016 Sum_probs=130.2 Q ss_pred CCceeee-eeeeecccceeeeEecCCcccceeeeeccccccce--EEEeeeeeccccCCccccccccceeecccccccc- Q lcl|NC_019402. 1 MATLVSY-DLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQT--LFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE- 76 (318) Q Consensus 1 Ma~~~t~-~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~--~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~- 76 (318) |..-++. .-...-+++...|...-...+|+..++.....++. .+.|....-..+ ...-..||++.++.. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~v~E~~~~~~~~~ 188 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTP-------LKAMDEEDGKIPDLDN 188 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcc-------cccccccccccccccc Confidence 2211111 12224567778888888888999888776554332 233333221110 011123666554322 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .......-+ ..-+..-+.||.. .+.... .+...|=...-...+.+-+|.++|+|. |++.. 
T Consensus 189 ~~~~~i~~~-~~k~~~~~~iS~e--ll~ds~-~~l~~~i~~~l~~~~~~~~d~~il~G~----G~~~~------------ 248 (408) T protein:vir:74 189 PRLTIIKYL-IKRYAGIITATNT--LLKDTA-ENILAWLSSWIAKKVVVTRNQAIIAAM----GTVPK------------ 248 (408) T ss_pred cceeeEEee-eeeEEeeehhHHH--HHhhch-HHHHHHHHHHHHHHHHHHHHHHHhhcc----ccccc------------ Confidence 111111111 1111222334433 332211 233334444445566777889999773 22110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHH-HHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTY-NLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~-~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .+..++.++|.+++. .+-.+......++|||.....+..+. +.+ .++...+.- .- T Consensus 249 -----------------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk-d~~----G~~l~~~~~--~~ 304 (408) T protein:vir:74 249 -----------------KPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVK-TAE----GKYLLEPDP--TK 304 (408) T ss_pred -----------------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-cCC----CceEeccCc--CC Confidence 011245666766664 44333333446888999988887764 221 233222211 00 Q ss_pred EEEEEEEEcCCCcEEEEEecCCCCCc-----eEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEeEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVGLRHR 304 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~tLe~~ 304 (318) |. -.. =+|..|.+..|.+||.. .+++.|++. +.+..-.....+-....+ +.....++.-+...+. T Consensus 305 ~~-~~~---l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~ 380 (408) T protein:vir:74 305 PN-SYL---IKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) T ss_pred CC-Cce---ecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEe Confidence 11 011 26755555567788853 277889885 333333344433333332 3333445566889999 Q ss_pred cccceeEEEEeccC Q lcl|NC_019402. 
305 NPYASGILEVKAGA 318 (318) Q Consensus 305 N~~a~g~i~~lt~a 318 (318) +|.|..+++..+.+ T Consensus 381 ~~~a~~~~~~~~~~ 394 (408) T protein:vir:74 381 DSEALVAGSFTAIA 394 (408) T ss_pred cccceEEEEeeccc Confidence 99999999887766 No 104 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.54 E-value=0.00041 Score=39.18 Aligned_cols=291 Identities=11% Similarity=0.078 Sum_probs=126.9 Q ss_pred CCceeeeeee---eeccc-ceeeeEecCCcccceeeeeccccccce--EEEeeeeeccccCCccccccccceeecccccc Q lcl|NC_019402. 1 MATLVSYDLN---GKKLS-FANWISNLSPTDTPFVSMTGKEAINQT--LFQWQTDALAPVADPSDAQKRNAVIEGSAAVD 74 (318) Q Consensus 1 Ma~~~t~~~~---~~~~d-l~d~I~~i~p~~TP~~s~i~~~~~~~~--~~~W~td~l~~~~~~~~~~~~na~~EG~da~~ 74 (318) -..+++.... ..-++ +.++|...--..+|+.++++....+.. .+.|....-.... .--..||+..+. T Consensus 154 ~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~-------a~~~~Eg~~~~~ 226 (477) T protein:vir:84 154 YRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTST-------AIQAADNAALTA 226 (477) T ss_pred hccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcce-------eeeeccCccccc Confidence 0012222222 22233 345555544556777776665443322 3455543221110 001234543322 Q ss_pred cccc-CcEEecceEEEEee-eeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHH Q lcl|NC_019402. 75 GERA-STTVINNVTQILRK-VVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFS 152 (318) Q Consensus 75 ~~~~-~~~~~~N~tQIf~~-~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~ 152 (318) .... ....+..++--.++ ..-|.=|.+.+..... +..+|=..+-...+.+-+|.+||+|. |.+.. 
..||+ T Consensus 227 ~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~l~~~~~~~~d~~~l~G~----Gt~~~---p~Gi~ 298 (477) T protein:vir:84 227 PSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAV-SVDEFVFRDLAADYANKLNVQVISGT----GSNNQ---VVGVR 298 (477) T ss_pred ccccccccceeeEEEeeeeEEeeeHHHHHHHhccch-hHHHHHHHHHHHHHHHHHHHHHhccC----CCCCc---cceee Confidence 1110 01112222111111 2222234444443332 33444444556678888999999873 33323 35776 Q ss_pred HHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCC Q lcl|NC_019402. 153 ALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQ 231 (318) Q Consensus 153 ~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~ 231 (318) .+-..+...... .....+......+.|.+++..+..+.. .+..++++|.....+..+. +. +.++...+.. T Consensus 299 ~~~~~~~~~~~~----~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lk-d~----~G~~l~~~~~ 369 (477) T protein:vir:84 299 ATAGITQVTATS----AGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIF-AG----DDRPLIVPSG 369 (477) T ss_pred eccccccccccc----cccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhh-cc----CCCeeeecCc Confidence 432221110000 000001111223334444444444322 3445778888777776664 22 1233222110 Q ss_pred --ceEEEEEEEEEE-----cCCCcEEEEEecCCCCCc--------eEEEEehhhcceeecCcccceecCCCc---cceee Q lcl|NC_019402. 232 --DTRLNVYVSSIV-----DPLGCQYKLVPNRWMPEN--------AVYFFTPSDWTQMVLRAPERTKLAKDG---SYEKW 293 (318) Q Consensus 232 --~~~~g~~v~~~~-----tdfG~~v~iv~nr~m~~~--------~~~~~D~~~~~~~~Lr~~~~e~laktG---d~~k~ 293 (318) ....+.....+. +=+| +.++.+..||++ .+++.|++.+-+.. +....+-+..+. ..... T Consensus 370 ~~~~~~~~~~~~~~~~~~~~l~G--~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~ 446 (477) T protein:vir:84 370 PGFNNLGVLTEVASQRVVGQMHG--LPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLL 446 (477) T ss_pred ccccccccccccccccccchhcc--cceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeee Confidence 011111111110 2246 588889999964 57888887753322 333333332221 22223 Q ss_pred EEEEEEe-EEEecccceeEEEEeccC Q lcl|NC_019402. 
294 MIEMEVG-LRHRNPYASGILEVKAGA 318 (318) Q Consensus 294 ~i~~E~t-Le~~N~~a~g~i~~lt~a 318 (318) ++++.+. +-+|.|+|..+|++.+.. T Consensus 447 ~v~~~~~~~~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 447 QVYGYLAFTAARFPQSVVEIGGTALT 472 (477) T ss_pred eehhhhhhhhhccccceEEeeccccc Confidence 3444433 455789999999998877 No 105 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=96.53 E-value=0.00054 Score=38.50 Aligned_cols=260 Identities=12% Similarity=0.013 Sum_probs=124.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE---eeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ---WQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~---W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++.+ -...-..+.+.|...-....|+.+++....+++.... +.......+ .-.-||+..+... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~ 176 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF---------AEITEMGEIPETD 176 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc---------eeecccccccccc Confidence 44444332 2224456667777766778888887766555433222 222221111 1234776655332 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-..-+...+.||.....-......+.+.. .-...+.+-++.++++|.. 
..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~---~l~~~i~~~~d~~~~~g~g----~~~------------- 236 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTK---WLGKKSKVTRNVLILGVIE----KLT------------- 236 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHH---HHHHHHHHHHHHHHhhccc----ccc------------- Confidence 11112222222334445556654332112222223333 3344566677888886531 110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .....+.++|.+++...-..+. ....++|||.....+..+. +.+ .++...+.-.. T Consensus 237 -----------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lk-d~~----G~~l~~~~~~~-- 292 (392) T protein:vir:10 237 -----------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK-DKD----GKYILQSDPTQ-- 292 (392) T ss_pred -----------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CCeEeecCccC-- Confidence 0112456677776643333332 3346889999888887763 322 23322211100 Q ss_pred EEEEEEEEcCCCcEEEEEe--cCCCC-------CceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVP--NRWMP-------ENAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVG 300 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~--nr~m~-------~~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~t 300 (318) +. --+=+|..+ ++. +..++ ...+++.|++. +.+..-.++..+...-++ +......+..++ T Consensus 293 ~~----~~tllG~~~-v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 293 KN----KKLFAGTNP-VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred Cc----cccccCccc-EEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 00 011145311 221 22221 12266678775 333333333333322222 223355556688 Q ss_pred EEEecccceeEEEEeccC Q lcl|NC_019402. 
301 LRHRNPYASGILEVKAGA 318 (318) Q Consensus 301 Le~~N~~a~g~i~~lt~a 318 (318) ..+++|.|..+++..+++ T Consensus 368 ~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSA 385 (392) T ss_pred cEEecccceEEEEecccc Confidence 999999999999998887 No 106 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=96.53 E-value=0.00054 Score=38.50 Aligned_cols=260 Identities=12% Similarity=0.013 Sum_probs=124.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE---eeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ---WQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~---W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++.+ -...-..+.+.|...-....|+.+++....+++.... +.......+ .-.-||+..+... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~ 176 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF---------AEITEMGEIPETD 176 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc---------eeecccccccccc Confidence 44444332 2224456667777766778888887766555433222 222221111 1234776655332 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-..-+...+.||.....-......+.+.. .-...+.+-++.++++|.. 
..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~---~l~~~i~~~~d~~~~~g~g----~~~------------- 236 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTK---WLGKKSKVTRNVLILGVIE----KLT------------- 236 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHH---HHHHHHHHHHHHHHhhccc----ccc------------- Confidence 11112222222334445556654332112222223333 3344566677888886531 110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .....+.++|.+++...-..+. ....++|||.....+..+. +.+ .++...+.-.. T Consensus 237 -----------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lk-d~~----G~~l~~~~~~~-- 292 (392) T protein:vir:10 237 -----------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK-DKD----GKYILQSDPTQ-- 292 (392) T ss_pred -----------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CCeEeecCccC-- Confidence 0112456677776643333332 3346889999888887763 322 23322211100 Q ss_pred EEEEEEEEcCCCcEEEEEe--cCCCC-------CceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVP--NRWMP-------ENAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVG 300 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~--nr~m~-------~~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~t 300 (318) +. --+=+|..+ ++. +..++ ...+++.|++. +.+..-.++..+...-++ +......+..++ T Consensus 293 ~~----~~tllG~~~-v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 293 KN----KKLFAGTNP-VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred Cc----cccccCccc-EEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 00 011145311 221 22221 12266678775 333333333333322222 223355556688 Q ss_pred EEEecccceeEEEEeccC Q lcl|NC_019402. 
301 LRHRNPYASGILEVKAGA 318 (318) Q Consensus 301 Le~~N~~a~g~i~~lt~a 318 (318) ..+++|.|..+++..+++ T Consensus 368 ~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSA 385 (392) T ss_pred cEEecccceEEEEecccc Confidence 999999999999998887 No 107 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=96.53 E-value=0.00054 Score=38.50 Aligned_cols=260 Identities=12% Similarity=0.013 Sum_probs=124.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE---eeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ---WQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~---W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++.+ -...-..+.+.|...-....|+.+++....+++.... +.......+ .-.-||+..+... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~ 176 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF---------AEITEMGEIPETD 176 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc---------eeecccccccccc Confidence 44444332 2224456667777766778888887766555433222 222221111 1234776655332 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-..-+...+.||.....-......+.+.. .-...+.+-++.++++|.. 
..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~---~l~~~i~~~~d~~~~~g~g----~~~------------- 236 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTK---WLGKKSKVTRNVLILGVIE----KLT------------- 236 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHH---HHHHHHHHHHHHHHhhccc----ccc------------- Confidence 11112222222334445556654332112222223333 3344566677888886531 110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .....+.++|.+++...-..+. ....++|||.....+..+. +.+ .++...+.-.. T Consensus 237 -----------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lk-d~~----G~~l~~~~~~~-- 292 (392) T protein:vir:10 237 -----------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK-DKD----GKYILQSDPTQ-- 292 (392) T ss_pred -----------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CCeEeecCccC-- Confidence 0112456677776643333332 3346889999888887763 322 23322211100 Q ss_pred EEEEEEEEcCCCcEEEEEe--cCCCC-------CceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVP--NRWMP-------ENAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVG 300 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~--nr~m~-------~~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~t 300 (318) +. --+=+|..+ ++. +..++ ...+++.|++. +.+..-.++..+...-++ +......+..++ T Consensus 293 ~~----~~tllG~~~-v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 293 KN----KKLFAGTNP-VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred Cc----cccccCccc-EEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 00 011145311 221 22221 12266678775 333333333333322222 223355556688 Q ss_pred EEEecccceeEEEEeccC Q lcl|NC_019402. 
301 LRHRNPYASGILEVKAGA 318 (318) Q Consensus 301 Le~~N~~a~g~i~~lt~a 318 (318) ..+++|.|..+++..+++ T Consensus 368 ~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSA 385 (392) T ss_pred cEEecccceEEEEecccc Confidence 999999999999998887 No 108 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=96.53 E-value=0.00054 Score=38.50 Aligned_cols=260 Identities=12% Similarity=0.013 Sum_probs=124.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE---eeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ---WQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~---W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |...++.+ -...-..+.+.|...-....|+.+++....+++.... +.......+ .-.-||+..+... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a---------~~v~E~~~~~~~~ 176 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF---------AEITEMGEIPETD 176 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc---------eeecccccccccc Confidence 44444332 2224456667777766778888887766555433222 222221111 1234776655332 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ...-..+.=-..-+...+.||.....-......+.+.. .-...+.+-++.++++|.. 
..+ T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~---~l~~~i~~~~d~~~~~g~g----~~~------------- 236 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTK---WLGKKSKVTRNVLILGVIE----KLT------------- 236 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHH---HHHHHHHHHHHHHHhhccc----ccc------------- Confidence 11112222222334445556654332112222223333 3344566677888886531 110 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEE Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRL 235 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~ 235 (318) .....+.++|.+++...-..+. ....++|||.....+..+. +.+ .++...+.-.. T Consensus 237 -----------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lk-d~~----G~~l~~~~~~~-- 292 (392) T protein:vir:10 237 -----------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLK-DKD----GKYILQSDPTQ-- 292 (392) T ss_pred -----------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccC----CCeEeecCccC-- Confidence 0112456677776643333332 3346889999888887763 322 23322211100 Q ss_pred EEEEEEEEcCCCcEEEEEe--cCCCC-------CceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEEe Q lcl|NC_019402. 236 NVYVSSIVDPLGCQYKLVP--NRWMP-------ENAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEVG 300 (318) Q Consensus 236 g~~v~~~~tdfG~~v~iv~--nr~m~-------~~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~t 300 (318) +. --+=+|..+ ++. +..++ ...+++.|++. +.+..-.++..+...-++ +......+..++ T Consensus 293 ~~----~~tllG~~~-v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 293 KN----KKLFAGTNP-VVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred Cc----cccccCccc-EEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 00 011145311 221 22221 12266678775 333333333333322222 223355556688 Q ss_pred EEEecccceeEEEEeccC Q lcl|NC_019402. 
301 LRHRNPYASGILEVKAGA 318 (318) Q Consensus 301 Le~~N~~a~g~i~~lt~a 318 (318) ..+++|.|..+++..+++ T Consensus 368 ~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSA 385 (392) T ss_pred cEEecccceEEEEecccc Confidence 999999999999998887 No 109 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=96.19 E-value=0.00078 Score=37.63 Aligned_cols=258 Identities=9% Similarity=-0.058 Sum_probs=111.5 Q ss_pred CCceeeeeee-eecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYDLN-GKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~~~-~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +...++.+.. ..-..+.+.|... ...+++..++.....++....|....-..+ ...-..||...+.... + T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~e~~~~~e~~~-~ 226 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTTTGKLPIFNNSTD-------LLTAHTEYGQTTKNAT-P 226 (437) T ss_pred hhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccCceeeEEeecccc-------cccccccccccccccc-c Confidence 2222222211 1223344444333 345555555544444443333333211110 0111234444332111 1 Q ss_pred cEEecceEEE---EeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 80 TTVINNVTQI---LRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 80 ~~~~~N~tQI---f~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) .++.++-- +..-+.||.- .+.-... +..+|=...-...+.+-+|.+||+|. |++. 
+ T Consensus 227 --~~~~v~~~~~k~~~~~~is~e--ll~ds~~-~~~~~i~~~l~~~~~~~~~~~i~~g~----g~~~-~----------- 285 (437) T protein:vir:10 227 --VITPILWDLKTYTGGYVFSQE--LISDSSY-DWQAELQSRLIELRDNTDDSLIITAL----TDGI-K----------- 285 (437) T ss_pred --cceeeeeehhheeeehhhhHH--HHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhhh----cccc-c----------- Confidence 11222211 1222333433 2222111 22222222333455667788888763 2110 0 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHHH----HHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVTY----NLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~----~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ......+.++|.+++. ..|.++ ..++||+.....|..+. +. +.++...+.-. T Consensus 286 ----------------~~~~~~~~~~~~~~~~~~l~~~~~~~---~~~~~~~~~~~~l~~lk-d~----~g~~~~~~~~~ 341 (437) T protein:vir:10 286 ----------------KTTSTYLLGDLKKVLNVTLKPQDSAA---ASIVMSQSAYNLFDMAT-DA----MGRPLLQPNVT 341 (437) T ss_pred ----------------ccccccchhhHHHHHHhhhhhhhhcC---CEEEEcHHHHHHHHHhh-cc----CCCeeeccCcc Confidence 0011123344555443 233322 25788998888887764 22 22332222111 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCCce-----EEEEehhhcceeecCc-ccceec-CCCccceeeEEEEEEeEEEec Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPENA-----VYFFTPSDWTQMVLRA-PERTKL-AKDGSYEKWMIEMEVGLRHRN 305 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~-----~~~~D~~~~~~~~Lr~-~~~e~l-aktGd~~k~~i~~E~tLe~~N 305 (318) . |. --+=+|..|.++.+..+|... +++.|++..-..+.|. +..+.. .-.-+.+..+++..+...+.+ T Consensus 342 ~--~~----~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~ 415 (437) T protein:vir:10 342 A--AT----GYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQAS 415 (437) T ss_pred C--CC----CcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEec Confidence 1 11 112367444444455566432 7888987532233332 222111 111234555667778999999 Q ss_pred ccceeEEEEeccC Q lcl|NC_019402. 
306 PYASGILEVKAGA 318 (318) Q Consensus 306 ~~a~g~i~~lt~a 318 (318) |.|+.+|++-..+ T Consensus 416 ~~a~~~l~~~~~~ 428 (437) T protein:vir:10 416 KDLIVNLTGKLKA 428 (437) T ss_pred ccceEEEEeeccc Confidence 9999999987666 No 110 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.11 E-value=0.00099 Score=37.05 Aligned_cols=260 Identities=10% Similarity=0.008 Sum_probs=123.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEE-eeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQ-WQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~-W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |.+-++.+ -...-+.+...|...-....|++.++.....++.... |....-..+ ...-..||.+.++...- T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~-------~a~~v~Eg~~~~~~~~~ 163 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQT-------GFVEVAEGAAIGEKATP 163 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCc-------ceeeecccccccccccc Confidence 44333222 2224446677777777788888888776555433222 222211110 01123467665532211 Q ss_pred CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcC Q lcl|NC_019402. 79 STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAK 158 (318) Q Consensus 79 ~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~ 158 (318) .-..+.=-..-+...+.||.- .+.... .+..+|=...-...+.+-+|.++++|.. ++. 
| T Consensus 164 ~f~~i~~~~~k~~~~~~iS~e--ll~ds~-~~l~~~i~~~l~~a~~~~~~~~i~~g~g----~~~-~------------- 222 (371) T protein:vir:81 164 QFTLLQYQVKKYAGFFRVTNE--LLNDST-EAIVNTLVRWIGDESRVTRNGLIINVLN----TKA-K------------- 222 (371) T ss_pred ceeeEEeeeeEEEEeehhhHH--HHhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccc-c------------- Confidence 111111111112222334432 222211 1223333334445567788999998742 110 0 Q ss_pred CcccCccccceeeccCccccCHHHHHHHHHH-HHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 159 DAADPDTGAIVHFETAAAALTEAEIFKVTYN-LYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 159 ~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~-i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) ....+.+.+.+++.. +-..-.....+++||.....+..+. +.+ .++...+.-.. + T Consensus 223 ----------------~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lk-d~~----g~~l~~~~~~~--~- 278 (371) T protein:vir:81 223 ----------------TAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLK-DQN----GQYLLQPSISS--P- 278 (371) T ss_pred ----------------cccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhh-ccC----CCeeeecccCC--C- Confidence 011344455554432 2122223346788998888777664 221 22222211000 0 Q ss_pred EEEEEEcCCCcEEEEEecCCCC------------CceEEEEehhh-cceeecCcccceecCCCc-----cceeeEEEEEE Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMP------------ENAVYFFTPSD-WTQMVLRAPERTKLAKDG-----SYEKWMIEMEV 299 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~------------~~~~~~~D~~~-~~~~~Lr~~~~e~laktG-----d~~k~~i~~E~ 299 (318) . .-+=+| +.++.+.+|| ...+++.|+.. +.+..-.++..+-..-.+ +......+..+ T Consensus 279 ~---~~~l~G--~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~ 353 (371) T protein:vir:81 279 T---GRQLLG--LPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERM 353 (371) T ss_pred C---Cceecc--eeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee Confidence 0 111246 4666666665 34578888875 444333444333332222 34455666668 Q ss_pred eEEEecccceeEEEEecc Q lcl|NC_019402. 
300 GLRHRNPYASGILEVKAG 317 (318) Q Consensus 300 tLe~~N~~a~g~i~~lt~ 317 (318) .+.+++|.|..+++..++ T Consensus 354 d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 354 DVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ccEEecccceEEEEEecC Confidence 999999999999996666 No 111 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=263 Identities=11% Similarity=0.010 Sum_probs=125.3 Q ss_pred CCceeeeeeee---eccc-ceeeeEecCCcccceeeeeccc--cccceEEEeeeeeccccCCccccccccceeecccccc Q lcl|NC_019402. 1 MATLVSYDLNG---KKLS-FANWISNLSPTDTPFVSMTGKE--AINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVD 74 (318) Q Consensus 1 Ma~~~t~~~~~---~~~d-l~d~I~~i~p~~TP~~s~i~~~--~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~ 74 (318) .+..+++...| .-.+ +.+.|...=-..+++..+ +-. +..+-.+.|...+-.+. ..-.-||...+. T Consensus 355 ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~~~~~~~g~~~ip~~~~~~~--------a~wv~E~~~~~~ 425 (632) T protein:vir:96 355 RQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDVDIPKKTSGAN--------FYWIGEDEDVQD 425 (632) T ss_pred hhhhcccccccccccccccchHHHHHHHhhcchhhhh-cceEeecCCcceEEEEEeCCce--------eEeecCCccccc Confidence 22222222211 1112 223322221113333222 211 11122234443321110 011226655544 Q ss_pred ccccCcEEecceE---EEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhH Q lcl|NC_019402. 75 GERASTTVINNVT---QILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGF 151 (318) Q Consensus 75 ~~~~~~~~~~N~t---QIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi 151 (318) .... ++.++ .=+.--+.|| .+.+.... .+..++=...-...+.+-++.++|+|. |.+.. 
.-|| T Consensus 426 s~~~----f~~i~l~~~k~~~~v~iS--~ell~ds~-~~~~~~i~~~l~~a~~~~~d~a~l~G~----G~~~~---p~Gi 491 (632) T protein:vir:96 426 SDFD----FTTLSFSPKTIAGAVPVT--RKLRKQSS-IHVENLIREDLIEGIGVALDLAMLTGT----GLAND---PVGL 491 (632) T ss_pred cccc----eeeEEeeeeEEEEehhhH--HHHHhccc-hHHHHHHHHHHHHHHHHHHHHHhhccc----CCCCc---ccee Confidence 3211 11111 1122222333 33333222 122233334566778889999999874 22222 3466 Q ss_pred HHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcC--EEEEcchHHhhhhh-hhhhhcccccceEEEe Q lcl|NC_019402. 152 SALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEAN--IIMFHPKHAAFFSS-LMETSGVTNGQRMKMF 228 (318) Q Consensus 152 ~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~--~l~~~~~~k~~is~-~~~~~~~~~~~r~~~~ 228 (318) +.....+ ....+...++.+.|.++..++-..+...+ .++|+|..+..+.. ...+. ..++... T Consensus 492 ~~~~~~~-----------~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~----~G~~i~~ 556 (632) T protein:vir:96 492 LNMTGVP-----------ALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDN----TGERIWQ 556 (632) T ss_pred eeccccc-----------ceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCC----CCceeec Confidence 5332111 11233445788899999888876554332 46788876654432 11121 1222211 Q ss_pred cCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCC---ccceeeEEEEEEeEEEec Q lcl|NC_019402. 229 DGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKD---GSYEKWMIEMEVGLRHRN 305 (318) Q Consensus 229 ~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~lakt---Gd~~k~~i~~E~tLe~~N 305 (318) + + +-+| +.++.+..||++.+++.|++.+-+...-+...+-...+ -+.........+.+.++. T Consensus 557 ~---~----------~l~G--~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~ 621 (632) T protein:vir:96 557 N---N----------EVNG--YRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRR 621 (632) T ss_pred C---C----------eecc--cceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCceeec Confidence 1 1 2257 68888999999999999998865554333332222111 234455566778999999 Q ss_pred ccceeEEEEec Q lcl|NC_019402. 
306 PYASGILEVKA 316 (318) Q Consensus 306 ~~a~g~i~~lt 316 (318) |++..++.-.+ T Consensus 622 ~~af~~~k~~A 632 (632) T protein:vir:96 622 KEAFCIAKKGA 632 (632) T ss_pred hhhhhheeecC Confidence 99999888777 No 112 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=95.81 E-value=0.0014 Score=36.20 Aligned_cols=257 Identities=11% Similarity=-0.009 Sum_probs=123.7 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEE---EeeeeeccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLF---QWQTDALAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~---~W~td~l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) |.+.++.+ -...-+.+.+.|...-....|++.++.....++... -++...-.. ..-.-||.+.+... T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---------a~~v~Eg~~~~~~~ 193 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVP---------FSPVEELGNLPEID 193 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcc---------eeeecccccccccc Confidence 44333332 222346666777777778888888876654433221 122111111 12234676655322 Q ss_pred -ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 77 -RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 77 -~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) .......-+.. -+...+.||. +.+.-.+. +...|=...-...+.+-+|.++++|. 
|++ .| T Consensus 194 ~~~~~~v~~~~~-k~~~~~~is~--e~l~ds~~-~l~~~i~~~l~~~~~~~~d~~il~G~----g~~-~~---------- 254 (397) T protein:vir:12 194 QPRFTKVSYSII-DYGGIMTLSN--SMLNDSDQ-AIMTYVAKWFAKKSVVTRNNLILAAI----ASL-KK---------- 254 (397) T ss_pred cccceeEEeehe-eeEeeehhhH--HHHhhchH-HHHHHHHHHHHHHHHHHHHHHHHhcc----ccc-cc---------- Confidence 11111111111 1112233443 33332221 22333333445566778899999774 211 11 Q ss_pred hcCCcccCccccceeeccCccccCHHHHHHHHH-HHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceE Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIFKVTY-NLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTR 234 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~-~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~ 234 (318) ...++.++|.+++. .+-..-.....++|||.....+..+. +.+ .++...+.... T Consensus 255 -------------------~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lk-d~~----G~~l~~~~~~~- 309 (397) T protein:vir:12 255 -------------------VDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLK-DGT----GRYLLQPDPTN- 309 (397) T ss_pred -------------------cccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhh-ccC----CceeecccccC- Confidence 01234556666554 33222223346889998887777663 221 23322211100 Q ss_pred EEEEEEEEEcCCCcEEEEEe-cCCCCC-----ceEEEEehhh-cceeecCcccceecC-----CCccceeeEEEEEEeEE Q lcl|NC_019402. 235 LNVYVSSIVDPLGCQYKLVP-NRWMPE-----NAVYFFTPSD-WTQMVLRAPERTKLA-----KDGSYEKWMIEMEVGLR 302 (318) Q Consensus 235 ~g~~v~~~~tdfG~~v~iv~-nr~m~~-----~~~~~~D~~~-~~~~~Lr~~~~e~la-----ktGd~~k~~i~~E~tLe 302 (318) |.. -+=+|. .+++ +..||. ..+++.|++. +.+..-.+...+-.. ..-+......+..+... T Consensus 310 -g~~----~~l~G~--pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 382 (397) T protein:vir:12 310 -PTK----KLLDGR--PVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVR 382 (397) T ss_pred -CCC----ccccce--eeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 100 112673 4443 334442 2378889875 444433333322111 11244556677779999 Q ss_pred EecccceeEEEEecc Q lcl|NC_019402. 
303 HRNPYASGILEVKAG 317 (318) Q Consensus 303 ~~N~~a~g~i~~lt~ 317 (318) +.+|.|..+++.... T Consensus 383 ~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 383 KWDEDAVVFGQITVE 397 (397) T ss_pred EecccceEEEEEeeC Confidence 999999999988888 No 113 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=95.46 E-value=0.002 Score=35.36 Aligned_cols=275 Identities=9% Similarity=0.080 Sum_probs=126.4 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeec-cccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDAL-APVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l-~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) +..-...+ -...-+++.+.|+.-=...-|+++++.-.+..+. ..|....- ..+.+ .-|++..+. ... T Consensus 79 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w---------v~e~~~~~~-~~~ 147 (377) T protein:vir:96 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTAVW---------GDIFGEIKG-QLK 147 (377) T ss_pred HhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEecCCcceeE---------eeccccccc-ccC Confidence 11100100 1123344555555444556677766544333332 23333221 11111 123333221 111 Q ss_pred Cc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 79 ST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 79 ~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +. ...=+...+ ..-+.||.- -+. -..-+.-.|=..+-...+.+-+|.+||+|. |++ +--||+..+. 
T Consensus 148 ~~f~~i~l~~~kl-~~~~~is~~--ll~-ds~~~le~~i~~~l~~~~~~~~~~a~i~G~----G~~----~P~Gil~~~~ 215 (377) T protein:vir:96 148 QAFKEQDFSQFKL-TAFVVIPKD--ALK-FGPKWLKQFITEQLKEAIAVALELAIVKGN----GLL----QPVGLLKDLS 215 (377) T ss_pred ccceeEeeeeeeE-EeechhhHH--Hhh-cchhhHHHHHHHHHHHHHHHHHhhceEecc----CCC----cceeeeeccc Confidence 11 111111111 111233322 222 222344555556666778899999999874 322 3347776553 Q ss_pred cCCcccC-ccccce-----eeccCccccCHHHHHHHHHHH---HhCCCC-------cC-EEEEcchHHhhhhhhhhhhcc Q lcl|NC_019402. 157 AKDAADP-DTGAIV-----HFETAAAALTEAEIFKVTYNL---YLSGSE-------AN-IIMFHPKHAAFFSSLMETSGV 219 (318) Q Consensus 157 ~~~~~~~-~~g~~~-----~~~~t~~~lTe~~l~~~~~~i---~~~G~~-------~~-~l~~~~~~k~~is~~~~~~~~ 219 (318) ....... +.+... ...+....++.+.+.+++..+ |...+. .+ .++||+.....+-. T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~------- 288 (377) T protein:vir:96 216 QPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEA------- 288 (377) T ss_pred cccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccc------- Confidence 3211100 000000 111222345555565555444 432221 12 35678754332211 Q ss_pred cccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCccceeeEEE Q lcl|NC_019402. 220 TNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIE 296 (318) Q Consensus 220 ~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~ 296 (318) .+...++. |..+.. -+|| +.++.+.+||++++++.|+++..+.-=.++..+.+- -.-|..-+... T Consensus 289 ----~~~~~~~~----G~~~~~--l~~p--~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~ 356 (377) T protein:vir:96 289 ----KFTSRNQF----GEYVTV--LPHG--ITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTK 356 (377) T ss_pred ----cccccCCC----CCceec--cCCC--ceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEE Confidence 11111211 222211 1455 678889999999999999998544332233333322 22366667777 Q ss_pred EEEeEEEecccceeEEEEecc Q lcl|NC_019402. 
297 MEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 297 ~E~tLe~~N~~a~g~i~~lt~ 317 (318) .-+.-++.++.|..++...=+ T Consensus 357 ~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 357 NYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEcCEEecCCcEEEEEEecC Confidence 778899999999888777666 No 114 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=95.29 E-value=0.0019 Score=35.47 Aligned_cols=258 Identities=7% Similarity=-0.043 Sum_probs=112.3 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) ++..++.+ ....-.++...|.... ...++..++.....++....+..-...... .....||...+...... T Consensus 132 ~~~~~~~~~~~~vp~~~~~~i~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~E~~~~~~~~~~~ 203 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELLQPQLEPK-DIVDLSKYVRSVPVNSASGKFPVISKSGSK-------MATVQQLEKNPQLANPK 203 (397) T ss_pred hhcccccccccchhHHHHHHHHHhh-hhhhHHHhhhhccccccceeEEEEeccCCc-------ccccccccccccccccc Confidence 33333322 2222245555555432 344555555554454444444432221111 01133555443321111 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) -..+.--..-+...+.||. +.+..... 
+..+|=..+-...+.+-++..+++|.- .+ T Consensus 204 ~~~i~~~~~~~~~~~~~s~--ell~ds~~-~l~~~i~~~l~~~~~~~~~~~i~~g~g----~~----------------- 259 (397) T protein:vir:96 204 MVEIDYSVATRRGYIPISQ--EMIDDASY-DVTGLIADEIQDQSLNTKNADIAAVLK----TA----------------- 259 (397) T ss_pred ccceeecHhHhhcchhhHH--HHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHhhccc----cc----------------- Confidence 1111111111112223332 22222211 222222223334455566777775521 11 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ......+.++|.+++...-....+ ..+++||.....+..+. +.+ .++...+.-.. + T Consensus 260 -------------~~~~~~~~d~~~~~~~~~~~~~~~-a~~v~n~~~~~~l~~lk-d~~----G~~~~~~~~~~--~--- 315 (397) T protein:vir:96 260 -------------TAKSVVGVDGLKDLINKEIKKVYD-VKLFISASMYSELDKLK-DKN----GRYLLQDSITA--A--- 315 (397) T ss_pred -------------ccccccchHHHHHHHHHhhhhhcC-cEEEEcHHHHHHHHHhh-ccC----CCeEeccCccC--C--- Confidence 011235667777777666554433 35788998888887764 221 23322211100 0 Q ss_pred EEEEcCCCcEEEEEe-cCCCCC-----ceEEEEehhhcceeecC-cccceecCCCccceeeEEEEEEeEEEecccceeEE Q lcl|NC_019402. 240 SSIVDPLGCQYKLVP-NRWMPE-----NAVYFFTPSDWTQMVLR-APERTKLAKDGSYEKWMIEMEVGLRHRNPYASGIL 312 (318) Q Consensus 240 ~~~~tdfG~~v~iv~-nr~m~~-----~~~~~~D~~~~~~~~Lr-~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~g~i 312 (318) .--+=+|. .|+. +..++. ..+++.|++..-..+.| .+.....--..+......+..+...+++|.|+.++ T Consensus 316 -~~~~l~G~--pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~ 392 (397) T protein:vir:96 316 -SGKQLLGK--EVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIYGQLLAGIIRYDVKATDKKAGFYV 392 (397) T ss_pred -Cccccccc--ceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecccccceeEEEEEEEccEEecccceEEE Confidence 00123463 3332 223332 23777788853222222 22222221122233345566788999999999999 Q ss_pred EEecc Q lcl|NC_019402. 
313 EVKAG 317 (318) Q Consensus 313 ~~lt~ 317 (318) +..++ T Consensus 393 ~~~~a 397 (397) T protein:vir:96 393 TFTIG 397 (397) T ss_pred EeecC Confidence 98888 No 115 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=95.14 E-value=0.0027 Score=34.68 Aligned_cols=262 Identities=13% Similarity=0.044 Sum_probs=125.2 Q ss_pred CCceeeee---eeeecccceeeeEecCCcccceeeeecccccc---ceEEEeeeeeccccCCccccccccceeecccccc Q lcl|NC_019402. 1 MATLVSYD---LNGKKLSFANWISNLSPTDTPFVSMTGKEAIN---QTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVD 74 (318) Q Consensus 1 Ma~~~t~~---~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~---~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~ 74 (318) |+.-++.. ....-.++...|...-....|+..+......+ ....-|...+....+ .-..||...+. T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a--------~~v~E~~~~~~ 176 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLK--------DLDDESALIGD 176 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccc--------ccccccccccc Confidence 33222222 22234577788888888889988886654433 233445554432211 11235554432 Q ss_pred ccccCcEEecce-EEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 75 GERASTTVINNV-TQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 75 ~~~~~~~~~~N~-tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) ... ..+.++ ...++-..-+-=|.+.+...+ .+...+=...-.+.+.+-+|.++++|. |+++. +. 
T Consensus 177 ~~~---~~f~~v~~~~~k~~~~~~iS~ell~ds~-~~l~~~i~~~la~~~~~~~~~~il~g~----g~~~~---~~---- 241 (395) T protein:vir:38 177 NDD---PELTVVKYLIHRYAGITTVTNTLLKDTV-DNIIQWLVNWAAKKDVVTRNAKILEVM----GKAPK---KP---- 241 (395) T ss_pred ccc---cceeeEEeeeeeeEeehhhHHHHHhhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcc----ccccc---cc---- Confidence 211 111121 112222222222333333222 122333333445566778889999763 22110 00 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHH-HHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCc Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYN-LYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQD 232 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~-i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~ 232 (318) ...+.+.|.+++.. +-........++|||.....+..+. +. +.++...+.-. T Consensus 242 ----------------------~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~----~G~~l~~~~~~ 294 (395) T protein:vir:38 242 ----------------------TISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVK-DA----DGRYLMQPDVT 294 (395) T ss_pred ----------------------ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-cc----CCceeeccCcC Confidence 11234455555542 2222223346789998877776653 22 12222211100 Q ss_pred eEEEEEEEEEEcCCCcEEEEEecCCCCC----ceEEEEehhh-cceeecCcccceecCCC-----ccceeeEEEEEEeEE Q lcl|NC_019402. 233 TRLNVYVSSIVDPLGCQYKLVPNRWMPE----NAVYFFTPSD-WTQMVLRAPERTKLAKD-----GSYEKWMIEMEVGLR 302 (318) Q Consensus 233 ~~~g~~v~~~~tdfG~~v~iv~nr~m~~----~~~~~~D~~~-~~~~~Lr~~~~e~lakt-----Gd~~k~~i~~E~tLe 302 (318) . +. .. +=+|..|.+..+..+|. ..+++.|++. +.+....+...+-.... -+......+..+.+. T Consensus 295 ~--~~-~~---~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~ 368 (395) T protein:vir:38 295 S--PD-KY---LIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQ 368 (395) T ss_pred C--CC-cc---eeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccE Confidence 0 00 00 11564333333333442 3378889885 44443344333222222 234556677779999 Q ss_pred EecccceeEEEEeccC Q lcl|NC_019402. 
303 HRNPYASGILEVKAGA 318 (318) Q Consensus 303 ~~N~~a~g~i~~lt~a 318 (318) +.+|.|..+++..+++ T Consensus 369 ~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 369 LIDDGAFAAASFKTVA 384 (395) T ss_pred EecccceEEEEeeccc Confidence 9999999999998887 No 116 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=94.84 E-value=0.0034 Score=34.15 Aligned_cols=276 Identities=10% Similarity=0.068 Sum_probs=124.5 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeecc-ccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALA-PVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~-~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |.+-++ ..-...-+.+.+.|+..=....|+.+++.-....+. ..+...+-. .+.+ .-|++..+. ... T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w---------~~e~~~~~~-~~~ 144 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVW---------GKIYGEIKG-QLD 144 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceee---------ecccccccc-ccc Confidence 221111 112224456667776666667777776543333222 222222111 1111 112222111 111 Q ss_pred Cc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 79 ST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 79 ~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +. ...=+...+ ..-+.||.....- ..-+.-+|=..+-...+.+-+|.+||+|. |.+ .| -||+..+. 
T Consensus 145 ~~f~~i~l~~~kl-~~~~~is~elL~D---s~~~ie~~i~~~la~~~a~~~~~a~i~G~----G~~-qP---~Gil~~~~ 212 (381) T protein:vir:10 145 AAFSEETAIQNKL-TAFVVLPKDLNDF---GPAWIERFVRVQIEEAFAVALETAFLKGT----GKD-QP---IGLNRQVQ 212 (381) T ss_pred ccceeeeecceeE-EeechhhHHHhhc---CHHHHHHHHHHHHHHHHHHHhhheeEecc----CCC-Cc---eeeeeccC Confidence 11 111111111 2233444333221 11233444455556678888999999884 322 33 36665443 Q ss_pred cCCcccCccc----cceeeccCccccCHHHHHHHHHHHHhCCC------CcC-EEEEcchHHhhhhhhhhhhcccccceE Q lcl|NC_019402. 157 AKDAADPDTG----AIVHFETAAAALTEAEIFKVTYNLYLSGS------EAN-IIMFHPKHAAFFSSLMETSGVTNGQRM 225 (318) Q Consensus 157 ~~~~~~~~~g----~~~~~~~t~~~lTe~~l~~~~~~i~~~G~------~~~-~l~~~~~~k~~is~~~~~~~~~~~~r~ 225 (318) ....+..+.. ..............+.|.++++++-..+. ..+ .++||+...-.+-.+.. T Consensus 213 ~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~---------- 282 (381) T protein:vir:10 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---------- 282 (381) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc---------- Confidence 2211111100 00000111112223455666555533221 112 34678764333322211 Q ss_pred EEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCccceeeEEEEEEeEE Q lcl|NC_019402. 226 KMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIEMEVGLR 302 (318) Q Consensus 226 ~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~~E~tLe 302 (318) ..++. |..+. .-+|| +.|+.+.+||++++++.|+++..+.-=.++..+.+- -.-|...+....-+.-+ T Consensus 283 -~~~~~----G~~v~--~l~~g--~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~ 353 (381) T protein:vir:10 283 -HLNAN----GVYVT--ALPFN--LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) T ss_pred -cCCCC----Cceee--cCCCC--ceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCE Confidence 11111 22221 24677 578899999999999999998544322233333322 22355667777778889 Q ss_pred EecccceeEEEEeccC Q lcl|NC_019402. 
303 HRNPYASGILEVKAGA 318 (318) Q Consensus 303 ~~N~~a~g~i~~lt~a 318 (318) +.+++|..+++..-.. T Consensus 354 ~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 354 AKDNKVAAVWKLDLKG 369 (381) T ss_pred EecCceEEEEEEEecC Confidence 9999998886633222 No 117 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=94.84 E-value=0.0034 Score=34.15 Aligned_cols=276 Identities=10% Similarity=0.068 Sum_probs=124.5 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeecc-ccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALA-PVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~-~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |.+-++ ..-...-+.+.+.|+..=....|+.+++.-....+. ..+...+-. .+.+ .-|++..+. ... T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a~w---------~~e~~~~~~-~~~ 144 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLR-LKFLKSETSGVAVW---------GKIYGEIKG-QLD 144 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcc-eEEEEecCCcceee---------ecccccccc-ccc Confidence 221111 112224456667776666667777776543333222 222222111 1111 112222111 111 Q ss_pred Cc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 79 ST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 79 ~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +. ...=+...+ ..-+.||.....- ..-+.-+|=..+-...+.+-+|.+||+|. |.+ .| -||+..+. 
T Consensus 145 ~~f~~i~l~~~kl-~~~~~is~elL~D---s~~~ie~~i~~~la~~~a~~~~~a~i~G~----G~~-qP---~Gil~~~~ 212 (381) T protein:vir:95 145 AAFSEETAIQNKL-TAFVVLPKDLNDF---GPAWIERFVRVQIEEAFAVALETAFLKGT----GKD-QP---IGLNRQVQ 212 (381) T ss_pred ccceeeeecceeE-EeechhhHHHhhc---CHHHHHHHHHHHHHHHHHHHhhheeEecc----CCC-Cc---eeeeeccC Confidence 11 111111111 2233444333221 11233444455556678888999999884 322 33 36665443 Q ss_pred cCCcccCccc----cceeeccCccccCHHHHHHHHHHHHhCCC------CcC-EEEEcchHHhhhhhhhhhhcccccceE Q lcl|NC_019402. 157 AKDAADPDTG----AIVHFETAAAALTEAEIFKVTYNLYLSGS------EAN-IIMFHPKHAAFFSSLMETSGVTNGQRM 225 (318) Q Consensus 157 ~~~~~~~~~g----~~~~~~~t~~~lTe~~l~~~~~~i~~~G~------~~~-~l~~~~~~k~~is~~~~~~~~~~~~r~ 225 (318) ....+..+.. ..............+.|.++++++-..+. ..+ .++||+...-.+-.+.. T Consensus 213 ~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~---------- 282 (381) T protein:vir:95 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT---------- 282 (381) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc---------- Confidence 2211111100 00000111112223455666555533221 112 34678764333322211 Q ss_pred EEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCccceeeEEEEEEeEE Q lcl|NC_019402. 226 KMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIEMEVGLR 302 (318) Q Consensus 226 ~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~~E~tLe 302 (318) ..++. |..+. .-+|| +.|+.+.+||++++++.|+++..+.-=.++..+.+- -.-|...+....-+.-+ T Consensus 283 -~~~~~----G~~v~--~l~~g--~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~ 353 (381) T protein:vir:95 283 -HLNAN----GVYVT--ALPFN--LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGK 353 (381) T ss_pred -cCCCC----Cceee--cCCCC--ceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCE Confidence 11111 22221 24677 578899999999999999998544322233333322 22355667777778889 Q ss_pred EecccceeEEEEeccC Q lcl|NC_019402. 
303 HRNPYASGILEVKAGA 318 (318) Q Consensus 303 ~~N~~a~g~i~~lt~a 318 (318) +.+++|..+++..-.. T Consensus 354 ~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:95 354 AKDNKVAAVWKLDLKG 369 (381) T ss_pred EecCceEEEEEEEecC Confidence 9999998886633222 No 118 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=94.38 E-value=0.0033 Score=34.19 Aligned_cols=272 Identities=12% Similarity=0.112 Sum_probs=119.2 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccc-cCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAP-VADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~-~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) |..-++.+ -...-+.+.+.|+..=....|+++++--.+..+. +.|....-.. +.+ .-|++..+ .... T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~-~~i~~~~~~~~a~w---------~~e~~~~~-~~~~ 151 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLR-TKFLKSETSGVAVW---------GKIFGEIK-GQLD 151 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCc-eEEEEEcCCcceEE---------eecccccc-cccC Confidence 22211111 1223345666666555566788777654333332 3343322111 111 11222211 1111 Q ss_pred Cc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 79 ST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 79 ~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) +. ..-=+...+ ..-+.|| .+-+.-.. -+.-+|=..+-...+.+-+|.+||+|. |.+ +--||+..+. 
T Consensus 152 ~~f~~i~l~~~kl-~~~i~is--~ell~Ds~-~~ie~~i~~~l~~~~a~~~~~a~i~G~----G~~----qP~Gil~~~~ 219 (383) T protein:vir:78 152 ATFSDEESIQNKL-TAFVVVP--KDLEKFGP-AWVKRFVVTQIEEAFAVALESAYIVGD----GND----KPIGLNRKVG 219 (383) T ss_pred cceeeEeecceee-Eeeccch--HHHhhccH-HHHHHHHHHHHHHHHHHHHhhheEecc----CCC----CceeeeeccC Confidence 11 111111111 1223333 22222222 233444445556678888999999884 321 2346665443 Q ss_pred cCCcccCccccceeeccCccccCHHHHHHHH---HHHHhCCC----------CcC-EEEEcchHHhh-hhhhhhhhcccc Q lcl|NC_019402. 157 AKDAADPDTGAIVHFETAAAALTEAEIFKVT---YNLYLSGS----------EAN-IIMFHPKHAAF-FSSLMETSGVTN 221 (318) Q Consensus 157 ~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~---~~i~~~G~----------~~~-~l~~~~~~k~~-is~~~~~~~~~~ 221 (318) ..+.+..+ +.....+...++.+++..+. ..++.+.. ..+ ..++++..... +..+ T Consensus 220 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~-------- 288 (383) T protein:vir:78 220 KGSTVVDG---VYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQY-------- 288 (383) T ss_pred Cccccccc---ccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccch-------- Confidence 33222111 11111223334444444433 33332221 011 12334421100 0000 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec---CCCccceeeEEEEE Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL---AKDGSYEKWMIEME 298 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l---aktGd~~k~~i~~E 298 (318) ...++ -|..+. .-+|| +.|+.+.+||++.+++.|+++.-+.-=.++..+.+ .-.-|..-+....- T Consensus 289 ----~~~~~----~G~~~t--~l~~~--~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r 356 (383) T protein:vir:78 289 ----TSLNA----NGVYVT--ALPFN--LNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQF 356 (383) T ss_pred ----hccCC----CCceee--ecCCC--ceEEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEE Confidence 00011 122222 22566 57888999999999999999854432223333222 12235556666677 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +.-++.|++|..++...-+. T Consensus 357 ~dG~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 357 AYGKAKDDKAAAVWTLNINP 376 (383) T ss_pred EcCEEecCCeEEEEEEEecC Confidence 88899999999887754333 No 119 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=94.16 E-value=0.0052 Score=33.11 Aligned_cols=270 Identities=15% Similarity=0.116 Sum_probs=113.3 Q ss_pred CCceee-eeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCcccccccccee-ecccccccccc Q lcl|NC_019402. 1 MATLVS-YDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI-EGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t-~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~-EG~da~~~~~~ 78 (318) |..-+. ..-...-+++.+.|...-....|+.+++--..+.+. ..|....-...+ ... |+...+ .... T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~-~~i~~~~~~~~a---------~w~~e~~~~~-~~~~ 154 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIK-TRVIKADPAGQA---------VWGKVFGEIK-GQLD 154 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCc-eEEEEecCCcce---------EEeecccccC-cccc Confidence 111011 011123344555555444455566665543333332 222222211110 011 111111 1111 Q ss_pred Cc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCC-CccchhhhHHHHH Q lcl|NC_019402. 79 ST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA-TVARQTAGFSALV 155 (318) Q Consensus 79 ~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~-t~~r~m~Gi~~~i 155 (318) +. ..--+...+ ..-+.||.... .-.. -+..+|=..+-...+.+-+|.+||+|. |.+ ..|. 
||+..+ T Consensus 155 ~~f~~i~l~~~kl-~~~~~iS~ell--~ds~-~~ie~~i~~~la~~ia~~~~~a~i~G~----G~~~~qP~---Gil~~~ 223 (395) T protein:vir:95 155 AAFREENFTQYKL-TCFVVLPDDLS--TFGP-AWIERFVRTQIQEAISVALESAIINGG----GAAKTQPV---GLMKDV 223 (395) T ss_pred ccceeeeeceeeE-EEeecccHHHH--hcch-hHHHHHHHHHHHHHHHHHHhhheeecc----CCCCcCce---eeeecc Confidence 11 111111111 23334443332 2222 133444445556677888899999874 222 2233 776544 Q ss_pred hcCCcccCccccceeeccCccccCHHHHH-------HHHHHH--HhCCC----CcC-EEEEcchHHhhhhhhhhhhcccc Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAEIF-------KVTYNL--YLSGS----EAN-IIMFHPKHAAFFSSLMETSGVTN 221 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~l~-------~~~~~i--~~~G~----~~~-~l~~~~~~k~~is~~~~~~~~~~ 221 (318) ...... .........++.+++. ++.+.+ |.++. ..+ .+++|+.... +.. T Consensus 224 ~~~~~~-------~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~---~~~------- 286 (395) T protein:vir:95 224 NTNSGA-------VTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW---DVQ------- 286 (395) T ss_pred cccccc-------cccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh---hcC------- Confidence 322111 0011112223333333 222221 10110 111 3455654221 111 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCccceeeEEEEE Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIEME 298 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~~E 298 (318) .++...+.. |..+.. -+|| +.++.+.+||++++++.|+++..+.-=..+..+.+. -+-|...+..+.- T Consensus 287 -g~~~~~~~~----G~~~~~--lg~g--~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r 357 (395) T protein:vir:95 287 -ARYTYLTAN----GGFVTV--LPYN--VTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTF 357 (395) T ss_pred -CcceeccCC----Ccceec--cCCc--ceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEE Confidence 111122211 221111 1356 578899999999999999997544321223222221 1225556666777 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +.-++.++.|.-+++..-.. T Consensus 358 ~dg~~~~~~A~~~l~i~~~~ 377 (395) T protein:vir:95 358 AYGQPDDNKASAVYDLKVAS 377 (395) T ss_pred ECCEEeccccEEEEEeeccC Confidence 88999999999766554333 No 120 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.03 E-value=0.0029 Score=34.46 Aligned_cols=273 Identities=11% Similarity=-0.008 Sum_probs=120.3 Q ss_pred CCc-----eeeeeeeeecccceeeeEecCCcccceeeeec---cccccceEEEeeeeeccccCCcccccccccee--e-c Q lcl|NC_019402. 1 MAT-----LVSYDLNGKKLSFANWISNLSPTDTPFVSMTG---KEAINQTLFQWQTDALAPVADPSDAQKRNAVI--E-G 69 (318) Q Consensus 1 Ma~-----~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~---~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~--E-G 69 (318) |.+ |...+..... ..|+.+--.++=.-.+++ ........+.+..-+.. +-+.. . + T Consensus 1 ~~~~~~g~f~~~~l~~id----~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~----------G~~~~~~~~~ 66 (301) T protein:vir:80 1 MQGKITATIEARDLQAID----NVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRS----------GAAKIIANGA 66 (301) T ss_pred CCccccchhhHHHHHHHH----HHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccc----------eeEEEecCcc Confidence 222 2221111111 111111111111111111 11111112222221111 00111 1 1 Q ss_pred cccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTA 149 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~ 149 (318) .|.|.... ...+..-..-.|...+.++---.+.+....-+.-......+...+.+.+...++.|... 
...- T Consensus 67 ~dip~~~~-~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~--------~g~~ 137 (301) T protein:vir:80 67 DDLPLVDV-DMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK--------YAIK 137 (301) T ss_pred cccccccc-cceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc--------ccce Confidence 11221111 11122223334445566664444444434345556666677788888888888877432 2234 Q ss_pred hHHHHHhcCCc---ccCccccceeeccCccccC--HHHHHHHHHHHHhC-CC--CcCEEEEcchHHhhhhhhhhhhcccc Q lcl|NC_019402. 150 GFSALVAAKDA---ADPDTGAIVHFETAAAALT--EAEIFKVTYNLYLS-GS--EANIIMFHPKHAAFFSSLMETSGVTN 221 (318) Q Consensus 150 Gi~~~i~~~~~---~~~~~g~~~~~~~t~~~lT--e~~l~~~~~~i~~~-G~--~~~~l~~~~~~k~~is~~~~~~~~~~ 221 (318) ||+. ..+. ..+.++.....+-+..+.. .++|..++.++|.. |+ .+.+|+++|..-..++.-. T Consensus 138 GLlN---~p~~~~~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~------- 207 (301) T protein:vir:80 138 GAFE---ATGIQIDVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKR------- 207 (301) T ss_pred eeec---CCCcccccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhcc------- Confidence 4442 2221 1111111111111122222 37788889999974 33 5578999998766654310 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCC------ceEEEE--ehhhcceeecCcccceecCCCccceee Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPE------NAVYFF--TPSDWTQMVLRAPERTKLAKDGSYEKW 293 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~------~~~~~~--D~~~~~~~~Lr~~~~e~laktGd~~k~ 293 (318) ..++.+ ....+.+...|.. .+|+.-+.+.. +.++++ +++.+++++=.|+...++-+.+-..+- T Consensus 208 -----~~~~~~---~tvl~~l~~~~~~-~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~ 278 (301) T protein:vir:80 208 -----YSNEDS---RSVLKVLQDNAWF-SAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKV 278 (301) T ss_pred -----ccCCCC---eeHHHHHHHHcCc-ceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEe Confidence 000000 0111222222331 45555555432 335555 466777766556666566666644444 Q ss_pred EEEEE-EeEEEecccceeEEEEe Q lcl|NC_019402. 
294 MIEME-VGLRHRNPYASGILEVK 315 (318) Q Consensus 294 ~i~~E-~tLe~~N~~a~g~i~~l 315 (318) -.+.. .+++++-|.|...+.|+ T Consensus 279 ~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 279 PFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeeeeEEEEEEccceEEEEecC Confidence 34444 47999999999999999 No 121 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=93.55 E-value=0.0072 Score=32.34 Aligned_cols=273 Identities=11% Similarity=0.055 Sum_probs=124.3 Q ss_pred CCceeeee---eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYD---LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~---~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) .+...++. -...-+.+.+.|+..=....|+.+++--.+..+ ...|...+-...+. =..|+++.+ .+. T Consensus 74 ~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~--------W~~e~~~~~-~~~ 143 (381) T protein:vir:10 74 MDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAV--------WGKIYGEIK-GQL 143 (381) T ss_pred HHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEeecCCcceE--------Eeecccccc-ccc Confidence 11112222 223445677777766667778888765444333 23444332211110 011222211 111 Q ss_pred cCc--EEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 78 AST--TVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 78 ~~~--~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) .+. ...=+...+. .-+.||... +.-.. -+.-+|=..+-...+.+-+|.+||+|. 
|++ .| -||+..+ T Consensus 144 ~~~f~~i~l~~~kl~-a~i~is~el--L~Ds~-~~le~~i~~~la~~~a~~~~~afi~Gd----G~~-qP---~Gil~~~ 211 (381) T protein:vir:10 144 DAAFSEETAIQNKLT-AFVVLPKDL--NDFGP-AWIERFVRVQIEEAFAVALETAFLKGT----GKD-QP---IGLNRQV 211 (381) T ss_pred CccceeEeecceeEE-eeccccHHH--HhccH-HHHHHHHHHHHHHHHHHHhhceeEecc----cCC-Cc---eeeeecC Confidence 111 1111222221 233444222 22111 233444455556677888999999884 322 34 3776544 Q ss_pred hcCCcccCccccceeeccCccccCHH-------HHHHHHHHHHhCCC------CcC-EEEEcchHHhhhhhhhhhhcccc Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEA-------EIFKVTYNLYLSGS------EAN-IIMFHPKHAAFFSSLMETSGVTN 221 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~-------~l~~~~~~i~~~G~------~~~-~l~~~~~~k~~is~~~~~~~~~~ 221 (318) .....+..+. .. ...+...+|.. .+.++++.+-..+. ..+ .+++|+...-.+-... T Consensus 212 ~~~~~~~~g~--~~-~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~------- 281 (381) T protein:vir:10 212 QKGVSVTDGA--YP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY------- 281 (381) T ss_pred Cccccccccc--cc-cccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcccc------- Confidence 3321111110 00 00111122322 33344433322211 111 3556775433332211 Q ss_pred cceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCccceeeEEEEE Q lcl|NC_019402. 222 GQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEKWMIEME 298 (318) Q Consensus 222 ~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k~~i~~E 298 (318) ...++. |..+. .-+|| +.|+.+.+||++++++.|+++.-+.-=.++..+.+. -.-|...+....- T Consensus 282 ----~~~~~~----G~~v~--~lp~g--~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r 349 (381) T protein:vir:10 282 ----THLNAN----GVYVT--ALPFN--LNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQF 349 (381) T ss_pred ----ccCCCC----Cceee--cCCCC--ceeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEE Confidence 011111 22221 23678 478889999999999999998544321223332222 2236666777777 Q ss_pred EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
299 VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 299 ~tLe~~N~~a~g~i~~lt~a 318 (318) +.-++.+++|..+++. +.+ T Consensus 350 ~dG~~~~~~A~~v~~l-~~~ 368 (381) T protein:vir:10 350 AYGKAKDNKVAAVWKL-DLK 368 (381) T ss_pred EcCEEecCCcEEEEEE-eec Confidence 8889999999888665 333 No 122 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=93.53 E-value=0.0072 Score=32.32 Aligned_cols=262 Identities=10% Similarity=-0.047 Sum_probs=110.9 Q ss_pred CCceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) -+.-.+..+ ...-+++.+.|...-....|+..++.-.++++...-+.......+.. .-||...+.... T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~p~~~~~~~~a~~---------v~Eg~~~~~~~~ 186 (387) T protein:vir:26 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDF---------ITDVETAKELKA 186 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCceeeeeeccCCcccc---------cccccccccccc Confidence 111112222 22345666777766666678877665545444433333333222221 236666554332 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ......-+...+ .--+.||.....-......+.+..++..++.. .+.+.+|..|. |++ .| .|++ .. 
T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~--~e~~~~~~~g~----g~g-~~---~g~~---~~ 252 (387) T protein:vir:26 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAA--KERKDALAVSP----KSG-LE---HMSF---YN 252 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHH--HHHHhHhhcCC----Ccc-cc---ceee---ec Confidence 221111111111 11234443322211111223333333333221 12233444332 111 11 1111 00 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .+ .+ ....+.+.+.|.+++-.+...-.....++|++.....+-++..+. ++. .+.+.. T Consensus 253 ~~----------~~-~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~-----~~~-~~~~~~----- 310 (387) T protein:vir:26 253 GS----------VK-EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG-----TTN-FFDTPA----- 310 (387) T ss_pred cc----------cc-cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC-----CCc-ccccCC----- Confidence 00 00 011122456666666555432112224566654332233333332 121 112221 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) .+=||. .|+....++ .+++.|+++.-..+ +.+..... +++ +-...++..-+..++.+|.|..++... T Consensus 311 -----~~llG~--PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:26 311 -----EKVFGK--PVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKK-GEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred -----cccccc--ceEEecCCC--ceeeechhhhhhhh-hhhhheecccccC-CceEEEEEEEeCcEeechhheEEEEee Confidence 133673 555544543 47888987642222 22222111 222 345566666789999999999998887 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) +++ T Consensus 380 a~~ 382 (387) T protein:vir:26 380 ENT 382 (387) T ss_pred cCC Confidence 777 No 123 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=93.53 E-value=0.0072 Score=32.32 Aligned_cols=262 Identities=10% Similarity=-0.047 Sum_probs=110.9 Q ss_pred CCceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) -+.-.+..+ ...-+++.+.|...-....|+..++.-.++++...-+.......+.. .-||...+.... T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~p~~~~~~~~a~~---------v~Eg~~~~~~~~ 186 (387) T protein:vir:94 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDF---------ITDVETAKELKA 186 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCceeeeeeccCCcccc---------cccccccccccc Confidence 111112222 22345666777766666678877665545444433333333222221 236666554332 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ......-+...+ .--+.||.....-......+.+..++..++.. .+.+.+|..|. |++ .| .|++ .. 
T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~--~e~~~~~~~g~----g~g-~~---~g~~---~~ 252 (387) T protein:vir:94 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAA--KERKDALAVSP----KSG-LE---HMSF---YN 252 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHH--HHHHhHhhcCC----Ccc-cc---ceee---ec Confidence 221111111111 11234443322211111223333333333221 12233444332 111 11 1111 00 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .+ .+ ....+.+.+.|.+++-.+...-.....++|++.....+-++..+. ++. .+.+.. T Consensus 253 ~~----------~~-~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~-----~~~-~~~~~~----- 310 (387) T protein:vir:94 253 GS----------VK-EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG-----TTN-FFDTPA----- 310 (387) T ss_pred cc----------cc-cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC-----CCc-ccccCC----- Confidence 00 00 011122456666666555432112224566654332233333332 121 112221 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) .+=||. .|+....++ .+++.|+++.-..+ +.+..... +++ +-...++..-+..++.+|.|..++... T Consensus 311 -----~~llG~--PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:94 311 -----EKVFGK--PVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKK-GEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred -----cccccc--ceEEecCCC--ceeeechhhhhhhh-hhhhheecccccC-CceEEEEEEEeCcEeechhheEEEEee Confidence 133673 555544543 47888987642222 22222111 222 345566666789999999999998887 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) +++ T Consensus 380 a~~ 382 (387) T protein:vir:94 380 ENT 382 (387) T ss_pred cCC Confidence 777 No 124 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=93.53 E-value=0.0072 Score=32.32 Aligned_cols=262 Identities=10% Similarity=-0.047 Sum_probs=110.9 Q ss_pred CCceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) -+.-.+..+ ...-+++.+.|...-....|+..++.-.++++...-+.......+.. .-||...+.... T Consensus 116 ~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~p~~~~~~~~a~~---------v~Eg~~~~~~~~ 186 (387) T protein:vir:96 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDF---------ITDVETAKELKA 186 (387) T ss_pred hhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCceeeeeeccCCcccc---------cccccccccccc Confidence 111112222 22345666777766666678877665545444433333333222221 236666554332 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ......-+...+ .--+.||.....-......+.+..++..++.. .+.+.+|..|. |++ .| .|++ .. 
T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~--~e~~~~~~~g~----g~g-~~---~g~~---~~ 252 (387) T protein:vir:96 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAA--KERKDALAVSP----KSG-LE---HMSF---YN 252 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHH--HHHHhHhhcCC----Ccc-cc---ceee---ec Confidence 221111111111 11234443322211111223333333333221 12233444332 111 11 1111 00 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .+ .+ ....+.+.+.|.+++-.+...-.....++|++.....+-++..+. ++. .+.+.. T Consensus 253 ~~----------~~-~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~-----~~~-~~~~~~----- 310 (387) T protein:vir:96 253 GS----------VK-EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG-----TTN-FFDTPA----- 310 (387) T ss_pred cc----------cc-cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC-----CCc-ccccCC----- Confidence 00 00 011122456666666555432112224566654332233333332 121 112221 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) .+=||. .|+....++ .+++.|+++.-..+ +.+..... +++ +-...++..-+..++.+|.|..++... T Consensus 311 -----~~llG~--PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:96 311 -----EKVFGK--PVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKK-GEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred -----cccccc--ceEEecCCC--ceeeechhhhhhhh-hhhhheecccccC-CceEEEEEEEeCcEeechhheEEEEee Confidence 133673 555544543 47888987642222 22222111 222 345566666789999999999998887 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) +++ T Consensus 380 a~~ 382 (387) T protein:vir:96 380 ENT 382 (387) T ss_pred cCC Confidence 777 No 125 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=93.52 E-value=0.0073 Score=32.30 Aligned_cols=262 Identities=10% Similarity=-0.044 Sum_probs=114.5 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) |..-++.+ -...-+++..+|...-....|+..+....++.+....+.......+.. .-||.+.+...... T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~~~p~~~~~~~~a~~---------v~E~~~~~~~~~~f 153 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDF---------ITDVETAKELKLKG 153 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCceEEEEecCCCcccc---------cccccccccccccc Confidence 22111111 112334666666666666677766655444444444444433322221 23666655543222 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) ...--+...+ .--+.||...-.-......+.+..++.+++. +.+.+.+|..|. |.+ .| . |++. +. 
T Consensus 154 ~~v~~~~~k~-~~~i~is~ell~Ds~~~l~~~i~~~la~~~~--~~e~~~~~~~g~----g~~-~~--~-g~l~----~~ 218 (352) T protein:vir:78 154 DTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLA--AKERKDALAVSP----KSG-LE--H-MSFY----NG 218 (352) T ss_pred eeeeecceeE-EeechhhHHHHhhhhHHHHHHHHHHHHHHHH--HHHHHhhhhcCC----CCc-cc--c-ccee----cc Confidence 2111111111 1123344332221111223344445555443 123444454442 111 11 1 2210 10 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) . ....++. -+.++|.+++..+-..-.+...+++++.....+-++..+. ++. .+.+... T Consensus 219 ----~-----~~~~t~~-~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~-----~~~-~~~~~~~------ 276 (352) T protein:vir:78 219 ----S-----VKEVEGA-NMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG-----TTN-FFDTPAE------ 276 (352) T ss_pred ----c-----ccccccc-chHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhcc-----CCc-ccccCCc------ Confidence 0 0011111 1356666666655433223345677776544444544432 222 1222211 Q ss_pred EEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEecccceeEEEEecc Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g~i~~lt~ 317 (318) +=|| +.|+....++ .+++.|+++.-..+ ++...+.+ +++ +-..+....-+...+.+|.|..++...++ T Consensus 277 ----~llG--~PV~~~~~~~--~~~~Gdf~~~~~~~-~~~~~~~~~~~~~-g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 277 ----KVFG--KPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKK-GEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred ----cccc--cceEEecCCC--ceeEeehhhhhhhh-hhheeeeeccccC-CeeEEEEEeeeCceeechhheEEEEeecc Confidence 2356 3555544443 47788988653221 22222222 222 23444445568899999999999988888 Q ss_pred C Q lcl|NC_019402. 
318 A 318 (318) Q Consensus 318 a 318 (318) | T Consensus 347 ~ 347 (352) T protein:vir:78 347 T 347 (352) T ss_pred c Confidence 8 No 126 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=93.45 E-value=0.0075 Score=32.22 Aligned_cols=262 Identities=11% Similarity=-0.042 Sum_probs=109.7 Q ss_pred CCceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) .++-+++.+ ...-+++...|...-....|+..++.-.++++...-+.......+.. ..||...+.... T Consensus 131 ~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~p~~~~~~~~a~~---------v~Eg~~~~~~~~ 201 (402) T protein:vir:93 131 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDF---------ITDVETAKELKA 201 (402) T ss_pred hhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCCceeeeeeccCCcccc---------cccccccccccc Confidence 222222222 22455666767766666677776665544444333333322222211 236666555432 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) .....--+...+ .-.+.||.....-........+..++++++.. ...+.+|+.|. |.+ .| .|++. . 
T Consensus 202 ~f~~i~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~--~e~~~~~~~g~----g~g-~p---~g~~~---~ 267 (402) T protein:vir:93 202 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAA--KERKDALAVSP----KSG-LE---HMSFY---N 267 (402) T ss_pred ccceeeecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHH--HHHHhHhhcCC----Ccc-cc---ceeee---c Confidence 222222222222 12234443322211111223344444443321 12233444332 111 11 12210 1 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .+ .+..+ .+-+.+.|.+++-++...-.....++|++.....+-++..+.+ +. ...+... T Consensus 268 ~~----------~~~~~-~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~-----~~-~~~~~~~---- 326 (402) T protein:vir:93 268 GS----------VKEVE-GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-----TN-FFDTPAE---- 326 (402) T ss_pred cc----------ccccc-ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-----Cc-ccccCCc---- Confidence 00 00111 1223456666655554321122245666544333333333321 11 1122211 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCccccee--cCCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTK--LAKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~--laktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) +=||. .|+....++ .+++.|+++.-..+ ++..... -+++ +-...+...-+..++.||.|..++... T Consensus 327 ------~llG~--PV~~t~~~~--~i~~GDf~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~r~Dg~v~~~~A~~~l~ik 394 (402) T protein:vir:93 327 ------KVFGK--PVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKK-GEYLFVLTAWYDQQRTLDSAFRIAKAK 394 (402) T ss_pred ------ccccc--ceEEecCCC--ceeeechhhhhhhh-hhhhhhhhhcccC-CceEEEEEEEeCcEEechhheEEEEee Confidence 23663 455444443 47888887632211 2221111 1233 345556666688999999999888887 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) +++ T Consensus 395 ~~~ 397 (402) T protein:vir:93 395 ENT 397 (402) T ss_pred cCC Confidence 666 No 127 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=93.09 E-value=0.0088 Score=31.84 Aligned_cols=272 Identities=10% Similarity=-0.006 Sum_probs=124.2 Q ss_pred CCceeeeee--eeecccceeeeEecCCcccceeeeeccccccc-eEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL--NGKKLSFANWISNLSPTDTPFVSMTGKEAINQ-TLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~--~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~-~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) ++.-+++.. ...-+++.+.|...-....|+..+.-.....+ ..+-+.+....+ .. .-...||.+.+.... T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a-~~------~~~~~e~~~~~~~~~ 213 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEA-QG------HKNERTNNEMPETDI 213 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcc-cc------eeccccccccccccc Confidence 222122221 22345666667766667777755433322222 222222222111 00 000123444443221 Q ss_pred cCcEEecceEEEEeeee----eehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHH Q lcl|NC_019402. 78 ASTTVINNVTQILRKVV----KVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSA 153 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~----~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~ 153 (318) .+.||--... -+.=|.+.+..... +...|=...-...+.+-+|.+||+|. |.+.. .+|+.. 
T Consensus 214 -------~f~~v~~~~~k~~~~~~iS~ell~ds~~-~l~~~i~~~la~~~~~~~d~~~l~G~----G~~~~---~~g~~~ 278 (434) T protein:vir:62 214 -------EFDEIELSPTEFDALATVTKKLLARTGL-PIEQIVMDELKKAYVRKETQYMVNGD----EANNI---NDGALA 278 (434) T ss_pred -------ceeeEEeeheeeEeehhhHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhccC----CCCcc---ccceee Confidence 2333322222 22233344333321 33344444556667788999999874 22211 334431 Q ss_pred HHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCce Q lcl|NC_019402. 154 LVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDT 233 (318) Q Consensus 154 ~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~ 233 (318) .. .....+..+.+.++|.++..++..+......++||+.....+..+. +. +.++...+...- T Consensus 279 ---~~----------~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~----~G~~l~~~~~~~ 340 (434) T protein:vir:62 279 ---KK----------AVEFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMK-TD----DGFPLLRPFNQA 340 (434) T ss_pred ---cc----------cccccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhh-cc----CCCEeeccCCCc Confidence 11 1122334456788888888888766544446788998777776653 33 234433221111 Q ss_pred EEEEEEEEEEcCCCcEEEEEecCCCCCce------EEEEehhhcceeecC-cccceecCC---CccceeeEEEEEEeEE- Q lcl|NC_019402. 234 RLNVYVSSIVDPLGCQYKLVPNRWMPENA------VYFFTPSDWTQMVLR-APERTKLAK---DGSYEKWMIEMEVGLR- 302 (318) Q Consensus 234 ~~g~~v~~~~tdfG~~v~iv~nr~m~~~~------~~~~D~~~~~~~~Lr-~~~~e~lak---tGd~~k~~i~~E~tLe- 302 (318) ..|.. -+=+| +.|+.+.+||... +++.|++..-+..-. +...+.+-. +-+...+....-+..+ T Consensus 341 ~~g~~----~tl~G--~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~ 414 (434) T protein:vir:62 341 EGGIG----YTLLG--FPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQL 414 (434) T ss_pred cCCCC----ceecc--eeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeeccee Confidence 11110 11257 5777778887433 666799876443322 222222211 1122223333334333 Q ss_pred EecccceeEEEEe--ccC Q lcl|NC_019402. 
303 HRNPYASGILEVK--AGA 318 (318) Q Consensus 303 ~~N~~a~g~i~~l--t~a 318 (318) ++.|.+..++..- +++ T Consensus 415 i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 415 IHSPFEVPVYKYVLKAPT 432 (434) T ss_pred ecCcccceEEEEEeccCC Confidence 4558888777433 222 No 128 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=92.88 E-value=0.0096 Score=31.64 Aligned_cols=274 Identities=14% Similarity=0.135 Sum_probs=137.1 Q ss_pred ceeeee-eeee----cccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccc- Q lcl|NC_019402. 3 TLVSYD-LNGK----KLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGE- 76 (318) Q Consensus 3 ~~~t~~-~~~~----~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~- 76 (318) |++|.. .... +-.|.+......|+=..|. +....+.....+-|.-+--.-.. -..++.... T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a-~~~~sdf~~~~~~~lg~~p~l~e------------~~Ge~~~~~l 67 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIA-MEVPSNTSSNDYKWLSTFPKMRR------------WIGAKVVKNL 67 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhcee-eecCCCcceeeceecCCCCCccc------------cccceeeccc Confidence 333321 1110 1111111111111111111 11112233344555543211000 012222111 Q ss_pred ccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHh Q lcl|NC_019402. 77 RASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVA 156 (318) Q Consensus 77 ~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~ 156 (318) ......+.|-+ |.+.++||..+--=...|.-..+..++..+..++--++=+.+|.+.....-.+ |-. 
|-+ T Consensus 68 ~~~~~~i~~~~--~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~D-------G~~-fF~ 137 (302) T protein:vir:10 68 KAYKYVVENED--FEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFD-------GQY-FID 137 (302) T ss_pred cccceeEEeec--ccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccC-------Ccc-eec Confidence 11112233333 77888888776666778888888899999999988888888886532111111 111 223 Q ss_pred cCCccc-Cccccc--eeeccCccccCHHHHHH---HHHHHHhCCC-----CcCEEEEcchHHhhhhhhhhhhcccccceE Q lcl|NC_019402. 157 AKDAAD-PDTGAI--VHFETAAAALTEAEIFK---VTYNLYLSGS-----EANIIMFHPKHAAFFSSLMETSGVTNGQRM 225 (318) Q Consensus 157 ~~~~~~-~~~g~~--~~~~~t~~~lTe~~l~~---~~~~i~~~G~-----~~~~l~~~~~~k~~is~~~~~~~~~~~~r~ 225 (318) +.+.+- ...++. .....+..+++++.|.. +|++.-+.+| .|+.|+|+|.+......+.... T Consensus 138 ~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~-------- 209 (302) T protein:vir:10 138 TDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNP-------- 209 (302) T ss_pred ccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcc-------- Confidence 322211 111111 11123345666666654 4556666666 5578999999888877765432 Q ss_pred EEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCce--EEEEehhhcceeecCc---ccce-ecCCCccceeeEEEEEE Q lcl|NC_019402. 226 KMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENA--VYFFTPSDWTQMVLRA---PERT-KLAKDGSYEKWMIEMEV 299 (318) Q Consensus 226 ~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~--~~~~D~~~~~~~~Lr~---~~~e-~laktGd~~k~~i~~E~ 299 (318) ...++..|. -+| ++++|.++++.+.+ .++-|+..++..||.+ |..+ .-.-+-|..++.++-.+ T Consensus 210 ~~~~g~~Np----------~~g-~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~ 278 (302) T protein:vir:10 210 KLADNTPNP----------YVG-TAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKF 278 (302) T ss_pred ccCCCCcce----------ecc-ceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEE Confidence 111222222 237 48999999996544 3444889999999943 3322 22345588888889999 Q ss_pred eEEEecccceeEE-----EEeccC Q lcl|NC_019402. 
300 GLRHRNPYASGIL-----EVKAGA 318 (318) Q Consensus 300 tLe~~N~~a~g~i-----~~lt~a 318 (318) +..+|-..+.|-= ..-++| T Consensus 279 Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 279 GAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred eeeeeeecchhhhhhhhccCccCC Confidence 9988876655321 112222 No 129 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=92.14 E-value=0.011 Score=31.30 Aligned_cols=284 Identities=12% Similarity=0.122 Sum_probs=145.3 Q ss_pred CCceeeeeeeeeccc-------------ceeeeEecCCcccceeeeeccccc---cceEEEeeeeeccccCCcccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLS-------------FANWISNLSPTDTPFVSMTGKEAI---NQTLFQWQTDALAPVADPSDAQKRN 64 (318) Q Consensus 1 Ma~~~t~~~~~~~~d-------------l~d~I~~i~p~~TP~~s~i~~~~~---~~~~~~W~td~l~~~~~~~~~~~~n 64 (318) |+.-+-..-+..+|- ++..|-.....-|--..|+-+-.. .+..|.-. -.+|+.. T Consensus 59 m~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~-g~~Ra~~--------- 128 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI-GIMRAYD--------- 128 (393) T ss_pred hcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch-heeeecc--------- Confidence 443222222222221 112221111111100112222111 11111000 0233221 Q ss_pred ceeecccccccccc------CcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCcccc Q lcl|NC_019402. 65 AVIEGSAAVDGERA------STTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKV 138 (318) Q Consensus 65 a~~EG~da~~~~~~------~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~ 138 (318) .-||.++++...+ +.+++.=+ .+.|-.|++++..-|. +.+.+-+..+.--|+|-.|-..+++-++. 
T Consensus 129 -IgEGgE~~~~sld~~T~dsv~~~~gK~------G~~Ia~SqEmIsDSg~-Dvin~~l~aA~RaMaRkKee~a~n~fk~~ 200 (393) T protein:vir:79 129 -VAEGQEIPEDSIDWQTHESPEIRVGKS------GIRLRFTDEMISDSQW-DLMSMMIKQAGRAMGRHKEQKAYHQFRSH 200 (393) T ss_pred -ccccccccccchhhhcCCceeEEechh------hhhhhhHHHHhhcchH-HHHHHHHHHHHHHHHhhhHHHHHhhhhcc Confidence 1255555544433 33333333 3677788999888775 78999999999999999999999775443 Q ss_pred CCCCCccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhh--hhhh Q lcl|NC_019402. 139 DGSATVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSS--LMET 216 (318) Q Consensus 139 ~gs~t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~--~~~~ 216 (318) .. ++ ..| +.++-.. -++|. .-.+--+.+|+.++|.|++-++-.++-++++||++|---..|.+ .++. T Consensus 201 gh---tv--fDa----~st~t~a-hptGr-~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~ 269 (393) T protein:vir:79 201 GH---TV--FDN----YSTNKLA-HTTGL-DKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGS 269 (393) T ss_pred cc---ee--eec----cccCccc-eeecC-CccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcc Confidence 21 10 122 1111000 00000 00113456799999999999999999999999999975444433 2211 Q ss_pred hcccccceEEEecC----CceEEEEEEEEEEcCCCcEEEEEecCCCCCce------EEEEehhhcceeecC-cccceec- Q lcl|NC_019402. 217 SGVTNGQRMKMFDG----QDTRLNVYVSSIVDPLGCQYKLVPNRWMPENA------VYFFTPSDWTQMVLR-APERTKL- 284 (318) Q Consensus 217 ~~~~~~~r~~~~~~----~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~------~~~~D~~~~~~~~Lr-~~~~e~l- 284 (318) .+.+-+.+++. ..+-.|- +.|...+-..++|++.++.|=++ .+.+|-+.+.+...| .+..+.. T Consensus 270 ---~~~na~gN~~~~~~~ts~algp--~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~d 344 (393) T protein:vir:79 270 ---LQANPYGNYPAKGAPSSMALGP--DSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWD 344 (393) T ss_pred ---eeeccccccCccccchhhhhch--hhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccc Confidence 01111111111 1111221 11222111136777777776433 778888877776654 3333332 Q ss_pred CCCccceeeEEEEEEeEEEeccc-ceeEEEEeccC Q lcl|NC_019402. 
285 AKDGSYEKWMIEMEVGLRHRNPY-ASGILEVKAGA 318 (318) Q Consensus 285 aktGd~~k~~i~~E~tLe~~N~~-a~g~i~~lt~a 318 (318) -|+-|-++.-+.--||+-++|+- |.++...++-+ T Consensus 345 dk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~ 379 (393) T protein:vir:79 345 EKARGLQNIKMIERYGIGILNEGKAIAVAKNISMD 379 (393) T ss_pred cccccceeeeeeeeeceeeeeCCceEEEEecceee Confidence 36778888889999999888864 66666666665 No 130 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=88.86 E-value=0.03 Score=28.95 Aligned_cols=276 Identities=8% Similarity=-0.021 Sum_probs=117.2 Q ss_pred CCceeeee-e-eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeecccccccccc Q lcl|NC_019402. 1 MATLVSYD-L-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERA 78 (318) Q Consensus 1 Ma~~~t~~-~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~ 78 (318) +..-.+.. . .-.-+.+.+.|...-....|+++++--..++ ..+.|..+.-... ..-.-||.+.+.. . T Consensus 148 ~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~-g~~~~~~~~~~~~--------a~wv~E~~~~~~~--~ 216 (466) T protein:vir:80 148 AQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLK-GTARQNIAGAIPE--------GVWTEAVANLNEL--S 216 (466) T ss_pred hhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecC-ceeEeeeecCCcc--------eeecccccccccc--c Confidence 11111111 0 1122334444544444556776655433332 2345655431111 0112366665532 2 Q ss_pred CcEEecceEEEE---eeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHH Q lcl|NC_019402. 79 STTVINNVTQIL---RKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALV 155 (318) Q Consensus 79 ~~~~~~N~tQIf---~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i 155 (318) +. +++++==. ..-+.||.-. +.-.+ .+..+|=...-...+.+-+|.+||+|. 
|.+ .| + ||+..+ T Consensus 217 ~~--f~~i~~~~~k~~~~~~iS~el--l~ds~-~~l~~~i~~~la~~~~~~~~~ail~G~----G~~-~P--~-Gil~~~ 283 (466) T protein:vir:80 217 LS--FSQIEVDGYKVGGFIPIPNST--LEDSD-LNLADEILDAIGQAIGFALDKAILYGT----GTK-MP--V-GIVTRL 283 (466) T ss_pred cc--ccceeecceeeeeehhhhHHH--Hhcch-HHHHHHHHHHHHHHHHHHHhhheeecc----CCC-Cc--c-eeeecc Confidence 11 22222111 1223333322 22222 234444455556667888999999874 222 23 2 666443 Q ss_pred hcCCcccCccccceeeccCccccCHHH--------------HHHHHHHH---HhCCCCcCEEE-EcchHHhhhhhhhhhh Q lcl|NC_019402. 156 AAKDAADPDTGAIVHFETAAAALTEAE--------------IFKVTYNL---YLSGSEANIIM-FHPKHAAFFSSLMETS 217 (318) Q Consensus 156 ~~~~~~~~~~g~~~~~~~t~~~lTe~~--------------l~~~~~~i---~~~G~~~~~l~-~~~~~k~~is~~~~~~ 217 (318) ...... . .....+.+...++... +.+++..+ -..-.++..+| +++.....+-++.-.. T Consensus 284 ~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~ 359 (466) T protein:vir:80 284 AQTTQP--P--NWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF 359 (466) T ss_pred cccccc--c--ccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc Confidence 211000 0 0000111111122111 22222222 12223444443 4544333332221000 Q ss_pred cccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCC---ccceeeE Q lcl|NC_019402. 218 GVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKD---GSYEKWM 294 (318) Q Consensus 218 ~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~lakt---Gd~~k~~ 294 (318) + ....+.... .+. ..=|| +.|+.+.+||.+.+++-|.+...+..=..+..+...-. -|..... T Consensus 360 ~--~~g~~~~~~--~~~--------~~i~G--~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r 425 (466) T protein:vir:80 360 N--SAGALVASL--NNT--------MPIVG--GDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFK 425 (466) T ss_pred c--CCccccccC--CCc--------ccccc--cceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEE Confidence 0 000010000 010 01257 58899999999999999988865432223222222111 2444455 Q ss_pred EEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
295 IEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 295 i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) .+.-+..+++++.|.-+++....+ T Consensus 426 ~~~r~dg~~~~~~afv~~~~~~~~ 449 (466) T protein:vir:80 426 GTARYDGKPVFGEGFVAVNIANAN 449 (466) T ss_pred EEEEEccEEeccCceEEEEecCCC Confidence 555578999999999888766666 No 131 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=88.79 E-value=0.03 Score=28.92 Aligned_cols=262 Identities=11% Similarity=-0.033 Sum_probs=104.8 Q ss_pred CCceeeeee---eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccc Q lcl|NC_019402. 1 MATLVSYDL---NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGER 77 (318) Q Consensus 1 Ma~~~t~~~---~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~ 77 (318) .++-+++.. ...-.++.+.|...-....|+..++.-.+..+....+.......+. -.-||.+.+.... T Consensus 116 ~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~p~~~~~~~~a~---------~v~E~~~~~~~~~ 186 (387) T protein:vir:93 116 HALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDD---------FITDVETAKELKL 186 (387) T ss_pred HhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCceEEEEeecCCccc---------cccCccccccccc Confidence 222222222 2234566676766666667776655544444433333332222211 1236665554432 Q ss_pred cCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 78 ASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 78 ~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ......-+...+ .-.+.||.....-........+..++.+++.. ...+.+|..|. |.+ .| .|++. . 
T Consensus 187 ~f~~v~~~~~k~-~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~--~e~~~~~~~g~----g~g-~p---~g~l~---~ 252 (387) T protein:vir:93 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAA--KERKDALAVSP----KSG-LD---HMSFY---N 252 (387) T ss_pred ccceeeeeheee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHH--HHHHhHhhcCC----Ccc-cc---ceeee---c Confidence 222211122222 22244443322211111222333333333221 12233444332 111 11 12210 1 Q ss_pred CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEE Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNV 237 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~ 237 (318) .. .+..+ .+.+.+.|.+++-++=..-.....++|++.....+-++..+.+ +. .+.+... T Consensus 253 ~~----------~~~v~-~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~-----~~-~~~~~~~---- 311 (387) T protein:vir:93 253 GS----------VKEVE-GADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGT-----TN-FFDTPAE---- 311 (387) T ss_pred cc----------ccccc-ccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCC-----Cc-ccccCCc---- Confidence 00 00111 1223455555544432211112245677654333333333321 11 1122221 Q ss_pred EEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceec--CCCccceeeEEEEEEeEEEecccceeEEEEe Q lcl|NC_019402. 238 YVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKL--AKDGSYEKWMIEMEVGLRHRNPYASGILEVK 315 (318) Q Consensus 238 ~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~l--aktGd~~k~~i~~E~tLe~~N~~a~g~i~~l 315 (318) +=|| +.|+....++ .+++.|+++.-..+ +.+....+ +++| ....+...-+..++.+|.|.-++... T Consensus 312 ------~llG--~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~r~d~~v~~~eA~~~l~~k 379 (387) T protein:vir:93 312 ------KVFG--KPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTDKDVKKG-EYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred ------cccc--cceEEecCCC--ceeeeehhhhheeh-hhheeeecccccCC-ceeEEEEeeeCceeechhheEEEEee Confidence 2356 3455444443 47788887753322 22222221 2222 22333444588999999999888777 Q ss_pred ccC Q lcl|NC_019402. 
316 AGA 318 (318) Q Consensus 316 t~a 318 (318) +++ T Consensus 380 ~~~ 382 (387) T protein:vir:93 380 ENT 382 (387) T ss_pred cCC Confidence 666 No 132 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=88.31 E-value=0.033 Score=28.70 Aligned_cols=268 Identities=10% Similarity=0.022 Sum_probs=122.4 Q ss_pred CCceeeee--eee---ecccceeeeEecCCcccceeeeec----cc-cc-cceEEEeeeeeccccCCccccccccceeec Q lcl|NC_019402. 1 MATLVSYD--LNG---KKLSFANWISNLSPTDTPFVSMTG----KE-AI-NQTLFQWQTDALAPVADPSDAQKRNAVIEG 69 (318) Q Consensus 1 Ma~~~t~~--~~~---~~~dl~d~I~~i~p~~TP~~s~i~----~~-~~-~~~~~~W~td~l~~~~~~~~~~~~na~~EG 69 (318) ++.-++++ ..| .-..+.+.|...-..++++..+-. +. .. .....-|++.... +. -.-|| T Consensus 334 ~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~-a~---------wv~Eg 403 (645) T protein:vir:93 334 VGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGA-AG---------WVGEG 403 (645) T ss_pred hhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcc-eE---------EeccC Confidence 11111111 111 223334444433333444332211 00 00 1122333332211 11 12367 Q ss_pred cccccccccCcEEecceEEEEee-eeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchh Q lcl|NC_019402. 70 SAAVDGERASTTVINNVTQILRK-VVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQT 148 (318) Q Consensus 70 ~da~~~~~~~~~~~~N~tQIf~~-~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m 148 (318) .+.+..... ++.++-=-+| ..-|.=|.+-+..... 
+.-+|=..+-...+.+-++.+||+|...- +.+..| T Consensus 404 ~~~~~s~~~----f~~v~l~~~kla~~~~iS~ell~ds~~-~~~~~i~~~l~~aia~~~d~a~l~g~g~~-~~~~~p--- 474 (645) T protein:vir:93 404 KTKPLTKFD----FESITFSHAKVSAIAVLTEELIRFSSP-AADALVRNALAEAVVARLDTDFVDPKKAA-VADVSP--- 474 (645) T ss_pred ccccccccc----eeEEEEeeEEEEEeehhHHHHHhhchH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcc-cCCccc--- Confidence 665543322 2222211122 2222233333333321 22233334566677788899999774321 111111 Q ss_pred hhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcC--EEEEcchHHhhhhhhhhhhcccccceEE Q lcl|NC_019402. 149 AGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEAN--IIMFHPKHAAFFSSLMETSGVTNGQRMK 226 (318) Q Consensus 149 ~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~--~l~~~~~~k~~is~~~~~~~~~~~~r~~ 226 (318) .||.. +. ...+....+.+++..++.++..++..+. ..+++|..+..+..+. +.+ + ++ T Consensus 475 ~gi~~-----~~----------~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lk-d~~---G-~~- 533 (645) T protein:vir:93 475 ASITH-----DV----------KGTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRK-NAL---G-QK- 533 (645) T ss_pred cceec-----cc----------cccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhcc-ccC---C-ce- Confidence 23321 10 0111122355678888888887776654 4678999888887764 221 1 11 Q ss_pred EecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC--------cccceecC-CCc--------- Q lcl|NC_019402. 227 MFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR--------APERTKLA-KDG--------- 288 (318) Q Consensus 227 ~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr--------~~~~e~la-ktG--------- 288 (318) .++......+ +=+| +.++.+.+||++ +++.|++.+-+...- ....+... 
.+| T Consensus 534 ~~~~~~~~~~-------tL~G--~PV~~s~~vp~~-~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~ 603 (645) T protein:vir:93 534 EYPDMTLLGG-------SFQG--LPVIVSQYVGDQ-LVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVE 603 (645) T ss_pred eecCCCCCCc-------eeec--eeeEEeccCCcc-eeEeccccEEEEEecceEEEeecceeEEEeeccccccccccccc Confidence 1221111111 1256 688888999865 556677765544322 11111111 111 Q ss_pred -------cceeeEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 289 -------SYEKWMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 289 -------d~~k~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) |..-...+..+.+.++.|.|.++|++..=- T Consensus 604 ~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g 640 (645) T protein:vir:93 604 LVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYG 640 (645) T ss_pred chhHhhcCceEEEEEEEEcceeeCccceEEEecccCC Confidence 334456777789999999999999987522 No 133 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=87.14 E-value=0.041 Score=28.20 Aligned_cols=271 Identities=10% Similarity=0.050 Sum_probs=115.3 Q ss_pred CCceeeee-eeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 1 MATLVSYD-LNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~-~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) +.+-...+ -...-+++.+.|+..=.+.-|+++++.-.+..+. ..|..++-...+ .=.-|++..+. 
+..+ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~-~~~~~~~~~~~a--------~w~~e~~~~~~-~~~~ 148 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLR-LKALTAETSGTA--------VWGDIFGEIKG-QLKQ 148 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcc-eEEEEecCCcce--------eEeecccccCc-ccCc Confidence 22111111 1112244555555444455566665543333222 233332211110 00113322221 1111 Q ss_pred --cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhc Q lcl|NC_019402. 80 --TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAA 157 (318) Q Consensus 80 --~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~ 157 (318) ....=+...+.. -+.||. +-+.-.. -+..+|=..+-...+.+-+|.+||+|. |+ .+--||+..+.. T Consensus 149 ~f~~i~l~~~kl~a-~~~is~--elL~ds~-~~ie~~i~~~la~~~a~~~~~a~i~G~----G~----~qP~Gil~~~~~ 216 (377) T protein:vir:98 149 AFKEQDFSQFKLTA-FVVIPK--DALKFGP-KWIKQFITEQLKEAIAVALELAIVKGD----GL----LQPVGLLKDLSQ 216 (377) T ss_pred cceeEeecceeEEe-eecccH--HhhhccH-hHHHHHHHHHHHHHHHHHHhhceEecc----CC----Ccceeeeecccc Confidence 111111122211 133432 2222222 244555555666778888999999874 32 234566654422 Q ss_pred CCcccCccccceeeccCccccCHHHHHHH---HHHHHhC------------------CCCcCEEE-EcchHHhhhhhhhh Q lcl|NC_019402. 158 KDAADPDTGAIVHFETAAAALTEAEIFKV---TYNLYLS------------------GSEANIIM-FHPKHAAFFSSLME 215 (318) Q Consensus 158 ~~~~~~~~g~~~~~~~t~~~lTe~~l~~~---~~~i~~~------------------G~~~~~l~-~~~~~k~~is~~~~ 215 (318) ... ...++ ...++.....+.+.++ +..-|-. .++.+.+| |+|..... +.. 
T Consensus 217 ~~~-~~~~~----~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~---~~p 288 (377) T protein:vir:98 217 PTV-DQSTG----RDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWA---LEA 288 (377) T ss_pred ccc-ccccc----cccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhh---ccc Confidence 111 00000 0000000011111111 1111111 12233322 34432110 000 Q ss_pred hhcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecC---CCcccee Q lcl|NC_019402. 216 TSGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLA---KDGSYEK 292 (318) Q Consensus 216 ~~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~la---ktGd~~k 292 (318) . +.. .+..|. +.+-+|..+.++-+.+||++++++.|+++.-+.-=..+..+.+. -.-|..- T Consensus 289 ~--------~~~----~~~~G~----~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:98 289 Q--------FTS----RNQFGE----YVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred c--------ccc----cCCCCc----cccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceE Confidence 0 000 001122 33445544678889999999999999998554332343333322 1235566 Q ss_pred eEEEEEEeEEEecccceeEEEEecc Q lcl|NC_019402. 293 WMIEMEVGLRHRNPYASGILEVKAG 317 (318) Q Consensus 293 ~~i~~E~tLe~~N~~a~g~i~~lt~ 317 (318) +....-+.=++.++.|..++....+ T Consensus 353 f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEEEEcCEEeccCcEEEEEEecC Confidence 6677778889999999999988888 No 134 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=86.35 E-value=0.027 Score=29.18 Aligned_cols=256 Identities=15% Similarity=0.133 Sum_probs=115.4 Q ss_pred CCc---eeeeeee-eecccceeeeEecCCcccceeeee---------ccccccceEEEeeeeeccccCCcccccccccee Q lcl|NC_019402. 
1 MAT---LVSYDLN-GKKLSFANWISNLSPTDTPFVSMT---------GKEAINQTLFQWQTDALAPVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma~---~~t~~~~-~~~~dl~d~I~~i~p~~TP~~s~i---------~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~ 67 (318) ||- .++.+.+ -+.+|+...+. --=.-|+.++ -+.+.+-+.++..-|+- ... T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~---~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~-------------dVa 64 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFS---KNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQT-------------DPG 64 (295) T ss_pred CCCcccccHhhccCceeehhhHHhh---hhHHHHHHHhccccccccccCCeEEeeeeeeecccc-------------ccc Confidence 884 2223332 44444433321 1001111112 23444444433222221 245 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccch Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQ 147 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~ 147 (318) ||...|.........-.=-.+|. |.-... |.+|++-.|-++-.-.--.+-+.+|.+++-+-|+.-- + + ++..-. T Consensus 65 EGe~Iplskvt~~~~~t~t~kik-K~rK~t-TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~l-k--t-at~t~t 138 (295) T protein:vir:99 65 EGETIPLSKVTRTKDKDYTVKWF-KKRRAT-TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFL-K--T-KPTKVK 138 (295) T ss_pred CCcccchhhheeeeeeeeEEEee-eecccc-cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHh-c--c-Cceeee Confidence 89887765543322111122333 333333 8899877787764433333444555555555554211 0 0 000000 Q ss_pred hhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEE Q lcl|NC_019402. 148 TAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKM 227 (318) Q Consensus 148 m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~ 227 (318) -.|+ ...-+.+.+.+...|+..+.+.++|++|...- ++-++..... 
T Consensus 139 g~~l-------------------------q~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a---~yl~~A~~~~------ 184 (295) T protein:vir:99 139 GVGL-------------------------QKALSASWAKLATFNEFEGSPLVSFVSPLDVA---NYLGDTKVGA------ 184 (295) T ss_pred hhhH-------------------------HHHHHHhhhhhhhcccccCCceEEEEehHHHH---HHHhcccccc------ Confidence 0011 12445556667777776667778999997533 3333221100 Q ss_pred ecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCC--------c------cceee Q lcl|NC_019402. 228 FDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKD--------G------SYEKW 293 (318) Q Consensus 228 ~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~lakt--------G------d~~k~ 293 (318) ...+.||. ..++.=+|.+ .||..+.+|.++++.-=.+.+.++|+.+- .-+|++. | +.+-. T Consensus 185 --~~a~~fG~--~~L~nfLG~q-~II~S~kv~~G~~~aT~~~Ni~~ay~~~~-~g~l~~~f~~~~D~tglIg~~h~~~~~ 258 (295) T protein:vir:99 185 --DASNVFGM--TLLKNFLGMQ-NVIVMPSVPEGKIYSTAVENLVFASLNVK-GGDLGGLFADFTDETGLIAAARNRQLS 258 (295) T ss_pred --chhhhhhh--hhhhhhhccc-eEEEcccCCCceEEEeeccceEEEEecCC-chhhhhhhhhccCcccceEEEeccccc Confidence 01122343 2333445731 49999999999999999999999999663 2223321 1 11111 Q ss_pred EEEEE----EeEEEecccceeEEEEeccC Q lcl|NC_019402. 294 MIEME----VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 294 ~i~~E----~tLe~~N~~a~g~i~~lt~a 318 (318) .+..| .++.+-.|.--|+|...=.+ T Consensus 259 ~~t~et~~~~~~~lfpE~~dgiv~~tI~~ 287 (295) T protein:vir:99 259 NLTYESVFFGANVLFAEIPEGVVEATIEA 287 (295) T ss_pred eeeehhhhHhHHHhcccccceEEEEEEec Confidence 11111 11222223322332221111 No 135 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=85.77 E-value=0.05 Score=27.70 Aligned_cols=271 Identities=13% Similarity=0.146 Sum_probs=124.2 Q ss_pred CCc-eeeeeeeeecccceeeeEecCCcccceeee--------ec----cccccceEEEeeeeecc-ccCCccccccccce Q lcl|NC_019402. 
1 MAT-LVSYDLNGKKLSFANWISNLSPTDTPFVSM--------TG----KEAINQTLFQWQTDALA-PVADPSDAQKRNAV 66 (318) Q Consensus 1 Ma~-~~t~~~~~~~~dl~d~I~~i~p~~TP~~s~--------i~----~~~~~~~~~~W~td~l~-~~~~~~~~~~~na~ 66 (318) ||. .|...-.-+.|-|.+.+....|.-..|+.. |. +.--+.+.+.|. .|. ++.+ . T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~--~l~G~~~~---------~ 69 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWN--DLTGDSEV---------L 69 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccc--cCCCcccc---------c Confidence 996 455556667777778777777766555210 11 111223344564 221 1111 1 Q ss_pred eecc-ccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccc---hHHHHHHHHHHHHHHHHHHHHhcCccccCCCC Q lcl|NC_019402. 67 IEGS-AAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGK---ELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA 142 (318) Q Consensus 67 ~EG~-da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~---e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~ 142 (318) .||+ +.+..........--+.+ .-|.|++++-+..+. | +| +.+.|+. .-..|..++.+|.--....++. T Consensus 70 ~dg~~~i~~~ki~t~~~~a~i~~-~~k~~~~tD~a~~~~--g-~dp~~~i~~q~a---~~w~~~~q~~lla~l~gvf~~~ 142 (330) T protein:vir:10 70 GNGDKALETGKITAGADIACVLY-RGRGWAANELTGVVA--G-SDPVRAILNRIG---AYWLREDQKALIATLNGIFATG 142 (330) T ss_pred CCCccccchhhcccceeEEEEEe-ecceeeehhhhhhhc--c-hhHHHHHHHHHH---HHhhhhHHHHHHHHHHhhhhhh Confidence 2454 333333232222222222 245688877775442 2 23 3444443 3455666666653211111111 Q ss_pred CccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhccccc Q lcl|NC_019402. 143 TVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNG 222 (318) Q Consensus 143 t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~ 222 (318) ... +.+ .+..+... ...++...|+.+.|.+.++++=|+......+++||.+...+-+. 
....+ T Consensus 143 ~~~--~~~---~~~~~~~~--------~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~---~li~~- 205 (330) T protein:vir:10 143 TAG--EKG---ALEETHVS--------DQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKD---NLIQY- 205 (330) T ss_pred hcc--cch---hhhhhhee--------cccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHh---hhhhh- Confidence 000 011 11111111 11234556999999999999988888888999999876655431 11111 Q ss_pred ceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCce----EEEEehhhcceeecCcccceec-----CCCcc---- Q lcl|NC_019402. 223 QRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENA----VYFFTPSDWTQMVLRAPERTKL-----AKDGS---- 289 (318) Q Consensus 223 ~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~----~~~~D~~~~~~~~Lr~~~~e~l-----aktGd---- 289 (318) .+.. +. +. .|-+-.| +.||.+..||... .|+|-+..+.+..-.|+...++ .+.|. T Consensus 206 --~~~s--~~---~~---~i~~~~G--~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~ 273 (330) T protein:vir:10 206 --IQPT--TA---TI---NIPTYLG--YRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIY 273 (330) T ss_pred --hccc--cc---Cc---ccccccc--eEEEEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCCccccceEEE Confidence 1111 11 11 1333357 5789989998433 6777777776653222221111 22221 Q ss_pred ceeeEEEEEEeEEEeccc-----ceeEEEEeccC Q lcl|NC_019402. 290 YEKWMIEMEVGLRHRNPY-----ASGILEVKAGA 318 (318) Q Consensus 290 ~~k~~i~~E~tLe~~N~~-----a~g~i~~lt~a 318 (318) +.+..++.=+|.....+. .+--.+.|..+ T Consensus 274 ~r~~~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~ 307 (330) T protein:vir:10 274 TRRALVMHPYGVKWTGAEVDAGNITPSNADLAKF 307 (330) T ss_pred EeeEEEeeeeeeeecccccccCcCCcChHHhcCC Confidence 122222222333332221 01111122222 No 136 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=83.18 E-value=0.071 Score=26.89 Aligned_cols=260 Identities=10% Similarity=-0.008 Sum_probs=116.1 Q ss_pred CCceeeeee-eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccccccC Q lcl|NC_019402. 
1 MATLVSYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDGERAS 79 (318) Q Consensus 1 Ma~~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~~~~~ 79 (318) .+++++.+. ...-.++...|...-....|+..++.....++....+..-.....+.. -..-||.+.+...... T Consensus 114 ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~------~~~~E~~~~~~s~~~f 187 (421) T protein:vir:13 114 RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKL------ANLAKDTELVKAMLKT 187 (421) T ss_pred hhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccce------eeccccccccccccce Confidence 344444332 223455556666555667788777766555554444443222211100 0122555544332211 Q ss_pred cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHHHhcCC Q lcl|NC_019402. 80 TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSALVAAKD 159 (318) Q Consensus 80 ~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~i~~~~ 159 (318) . .+.=-..-+..-+.||.-...-......+.+..++.++ +.+-++..+++ ...|+. T Consensus 188 ~-~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~---~~~~~~~~i~~-------------~~~g~~------- 243 (421) T protein:vir:13 188 Q-PMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEF---AVNTENAEIVK-------------QAKAVL------- 243 (421) T ss_pred e-EEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHH---HHHHhhhhHhh-------------hhhhcc------- Confidence 1 11111112222344443322211111122222233222 22222333321 111221 Q ss_pred cccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEecCCceEEEEEE Q lcl|NC_019402. 160 AADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFDGQDTRLNVYV 239 (318) Q Consensus 160 ~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~~~~~~~g~~v 239 (318) ......+.++|.+++.++-.++.....+++|+.....+..+. +.+ .++...+.... . 
T Consensus 244 -------------~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lk-d~~----G~~i~~~~~~~---~-- 300 (421) T protein:vir:13 244 -------------AEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLM-DKQ----GRPLLKELSDG---G-- 300 (421) T ss_pred -------------ccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhh-cCC----CceeecCcCCC---C-- Confidence 011123567888888888777777778899998888877653 332 23322221111 0 Q ss_pred EEEEcCCCcEEEEEecCCCCCc-----eEEEEehhh-cceeecCcccceecCCCcccee----eEEEEEEeEEEecccce Q lcl|NC_019402. 240 SSIVDPLGCQYKLVPNRWMPEN-----AVYFFTPSD-WTQMVLRAPERTKLAKDGSYEK----WMIEMEVGLRHRNPYAS 309 (318) Q Consensus 240 ~~~~tdfG~~v~iv~nr~m~~~-----~~~~~D~~~-~~~~~Lr~~~~e~laktGd~~k----~~i~~E~tLe~~N~~a~ 309 (318) -.+=+| +.++...+||.. .+++.|++. +.+..-.++..+- ....++++ .+.+..+...+.+++|. T Consensus 301 --~~tl~G--~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~f~~~~~~~r~~~r~d~~~~~~~a~ 375 (421) T protein:vir:13 301 --DLVFKG--RPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQ-SKEAGYTKNETIARIIERFDVNSPLDKSS 375 (421) T ss_pred --Cceecc--eeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEe-ecccccccCeeEEEEEeeecceeecchhh Confidence 012357 466666667643 378888876 3333323333222 12223333 33445566677777775 Q ss_pred eEEEEeccC Q lcl|NC_019402. 310 GILEVKAGA 318 (318) Q Consensus 310 g~i~~lt~a 318 (318) ..+...+.+ T Consensus 376 ~~~~~~~~~ 384 (421) T protein:vir:13 376 DAEKIRKFG 384 (421) T ss_pred heeeecccc Confidence 444333322 No 137 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=81.35 E-value=0.087 Score=26.41 Aligned_cols=274 Identities=9% Similarity=0.030 Sum_probs=115.7 Q ss_pred CCceeeeeeeeecccceeeeEecCC--cccceeeeecc--------ccccceEEEeeeeeccccCCcccccccccee--- Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSP--TDTPFVSMTGK--------EAINQTLFQWQTDALAPVADPSDAQKRNAVI--- 67 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p--~~TP~~s~i~~--------~~~~~~~~~W~td~l~~~~~~~~~~~~na~~--- 67 (318) |+..+. 
+..+.-.=+..++..|+| .++|.-.+.+. .......+.+..-+..- .+.. T Consensus 26 ~~~~~~-~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G----------~a~~~~d 94 (329) T protein:vir:79 26 LRGAKN-DASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVG----------HAKIIAD 94 (329) T ss_pred ccccee-ccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecce----------eeeeecC Confidence 221111 111111111122233333 23333222222 11222223333322110 0111 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehhHHHHh---hhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCc Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVL---ANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATV 144 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~---~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~ 144 (318) .+.|.|.......... ..|++-...++.+-+-+ ...|. +.-+.....+...+.+.+...++.|... T Consensus 95 ~~~dip~vd~~~~~~~---~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aA~~~~~~~~n~i~f~G~~~------- 163 (329) T protein:vir:79 95 YTDDLSTVDALMTSEF---GKVFRLGNAFLISIDEIKAGQRTGK-SLSTRKANAAQNAHDQLVNHLVFKGSKP------- 163 (329) T ss_pred cccccceeecccceeE---EEEEEEEEEEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhccEEEeeccc------- Confidence 1112222111111111 23333333333333322 33443 4444455556667777777777777321 Q ss_pred cchhhhHHHHHhcCCcccCccccceeeccCcccc--CHHHHHHHHHHHHhC-CC--CcCEEEEcchHHhhhhhhhhhhcc Q lcl|NC_019402. 145 ARQTAGFSALVAAKDAADPDTGAIVHFETAAAAL--TEAEIFKVTYNLYLS-GS--EANIIMFHPKHAAFFSSLMETSGV 219 (318) Q Consensus 145 ~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~l--Te~~l~~~~~~i~~~-G~--~~~~l~~~~~~k~~is~~~~~~~~ 219 (318) ...-||+ ...+.....++.....+-+..+. -.++|.+++.++|.. |+ .+.+|+++|..-..++.-.-+.+. 
T Consensus 164 -~g~~GLl---N~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~ 239 (329) T protein:vir:79 164 -HKIISVF---EHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTM 239 (329) T ss_pred -ccceeee---cCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCc Confidence 2234444 22222111111000011111222 236888999999975 33 467899999876665432111111 Q ss_pred cccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCC------CceEEE--EehhhcceeecCcccceecCCCccce Q lcl|NC_019402. 220 TNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMP------ENAVYF--FTPSDWTQMVLRAPERTKLAKDGSYE 291 (318) Q Consensus 220 ~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~------~~~~~~--~D~~~~~~~~Lr~~~~e~laktGd~~ 291 (318) ..- +.+.-.|- .++|+.-+++- .+.+++ -|++.+++.+=.|+...++-+.+-+. T Consensus 240 tvl-----------------~~lk~~~~-~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~~~ 301 (329) T protein:vir:79 240 SYL-----------------DYFKQQNG-GITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDLHF 301 (329) T ss_pred cHH-----------------HHHHHhCC-CcEEEEcccccccCCCCceEEEEEecCCceEEEecCcceeeeeceecCceE Confidence 111 11111122 23455544442 133444 46666777654666666665555444 Q ss_pred eeEEEEE-EeEEEecccceeEEEEeccC Q lcl|NC_019402. 292 KWMIEME-VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 292 k~~i~~E-~tLe~~N~~a~g~i~~lt~a 318 (318) +.-.+.- ++++++-|.+...+.|+--- T Consensus 302 ~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 302 KVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EEceeeeEEEEEEECcceeeeeeeeeeC Confidence 3333333 56999999998888877766 No 138 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=81.31 E-value=0.087 Score=26.40 Aligned_cols=284 Identities=14% Similarity=0.026 Sum_probs=126.5 Q ss_pred CCc---eeeeeeeeecccceeeeEecCCcccceee----------eeccccc-cceEEEeeeeeccc-cCCccccccccc Q lcl|NC_019402. 
1 MAT---LVSYDLNGKKLSFANWISNLSPTDTPFVS----------MTGKEAI-NQTLFQWQTDALAP-VADPSDAQKRNA 65 (318) Q Consensus 1 Ma~---~~t~~~~~~~~dl~d~I~~i~p~~TP~~s----------~i~~~~~-~~~~~~W~td~l~~-~~~~~~~~~~na 65 (318) |-. +++... +..+.+++.|-.=+-..+-+.- ++.+..+ .+-.+...+++-.- +.+ ... T Consensus 1 ~~~~~~i~s~~~-~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d------~e~ 73 (318) T protein:vir:10 1 MTAPTGIVSVSD-GPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDD------VAD 73 (318) T ss_pred CCCCCcceeeec-CCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCc------Hhh Confidence 432 333332 3466666664321111111111 1112112 22234444433111 111 223 Q ss_pred eeeccccccccccC-cEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcC-----ccccC Q lcl|NC_019402. 66 VIEGSAAVDGERAS-TTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRN-----GAKVD 139 (318) Q Consensus 66 ~~EG~da~~~~~~~-~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g-----~~~~~ 139 (318) ..||.++|...... ..++-... =.-..+.||+=++. .+..+.+..+..|...-+.|..+...+.- ....+ T Consensus 74 VaEggEiP~~~~~~G~~~ia~~~-K~G~~~~vS~Em~~---~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~ 149 (318) T protein:vir:10 74 VAEFGEIPVSAGARGLPRTAFAV-KKALGVRVSKEMID---ENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLA 149 (318) T ss_pred ccCcccccccCCCCCchhhhhhe-hhccceeccHHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 56888877544333 22322221 22334555544443 23346788888888888888888776531 11111 Q ss_pred CCCCcc---chhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhh Q lcl|NC_019402. 140 GSATVA---RQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMET 216 (318) Q Consensus 140 gs~t~~---r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~ 216 (318) .+.+-- -...|+...++.- .++...+..+.. ..+.. .-|-.+++|++||.....|.+-.. 
T Consensus 150 ~s~~w~~~~~~~~d~~~A~e~v-------------~~a~~~~~~a~~--~~~~~-~~GY~pdtIVlhP~~~~~l~~n~~- 212 (318) T protein:vir:10 150 VPTAWDNGGKVRTDIAIAIEQI-------------STAAPTAYPAGV--GSSDE-YFGFIPDTIVMHYALLPILMDNEN- 212 (318) T ss_pred CCcCCCCcccccccchhhhhhh-------------hhhhhhhhhhhh--hhhhh-ccCccceeeEECHHHHHHHhcchh- Confidence 111100 0000222111110 000111111000 01111 337788999999998777633111 Q ss_pred hccccc-ceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeec-CcccceecCC------Cc Q lcl|NC_019402. 217 SGVTNG-QRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVL-RAPERTKLAK------DG 288 (318) Q Consensus 217 ~~~~~~-~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~L-r~~~~e~lak------tG 288 (318) ....|. +-...+.+.+-+..+ .-..|| ++++.+|++|.+++|++|-..+-.--- +|+..+.+-. .| T Consensus 213 ~~~~y~~~a~~~~~~~~~tg~~----~g~~lG--l~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~ 286 (318) T protein:vir:10 213 FMKVYERNANYVSTAPDWTGNF----PGSVMG--LNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGG 286 (318) T ss_pred hhhhhhccchhhhhcccccccc----cceeec--eEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCC Confidence 000000 000000000001111 113478 699999999999999999766542211 4555444432 22 Q ss_pred cceeeEE--EEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 289 SYEKWMI--EMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 289 d~~k~~i--~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ..+.|.+ ..=-+.=|..|||...|+++=.- T Consensus 287 ~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 287 PTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred cchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 2233332 11234678999999999999777 No 139 >protein:vir:104479 Length: 310 # NCBI annotation: gp15 # Family: family:all:1105 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214651;genbank:gi:61806292;genbank:GeneID:3294534 Probab=77.60 E-value=0.0087 Score=31.88 Aligned_cols=89 Identities=10% Similarity=-0.030 Sum_probs=45.4 Q ss_pred CCc-e----eeeeeeeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccc---ccccee-eccc Q lcl|NC_019402. 
1 MAT-L----VSYDLNGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQ---KRNAVI-EGSA 71 (318) Q Consensus 1 Ma~-~----~t~~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~---~~na~~-EG~d 71 (318) .++ + .+....+..+.+.+|+. +++.|.+.+.|..+.+--|....+......-. ..++.+ ||+| T Consensus 213 ~~ts~~V~d~s~~~~~~~i~Id~E~i--------~i~~isgn~LTV~RG~~~T~aa~H~~g~~V~~in~~d~~lle~gdd 284 (310) T protein:vir:10 213 TATGFAVANASGINQYDNIYIGAELM--------RVTNKVGNNLSVIRGYEKSTPTVHSVGSNVFIVNAADNALLESDDD 284 (310) T ss_pred cceeeeecccccccccceEEECcEEE--------EEEeeccceEEEEecccCCchhhhhcCCcEEEEccCCCccCCcccc Confidence 111 0 00111222223333333 34555555666555554444333333222111 123334 4666 Q ss_pred cccccccCcEEecceEEEEeeeeeehhHHHHh Q lcl|NC_019402. 72 AVDGERASTTVINNVTQILRKVVKVSDTANVL 103 (318) Q Consensus 72 a~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~ 103 (318) ......++||+||.... .+||+++|+ T Consensus 285 -----fg~~e~~s~~~d~~~~~-~~~~~~~a~ 310 (310) T protein:vir:10 285 -----FGFGEIYSEYTDMKKYN-PVSGQDEAI 310 (310) T ss_pred -----ccccccccccccceeec-cccceeecC Confidence 45667899999977666 999999999 No 140 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=77.22 E-value=0.13 Score=25.50 Aligned_cols=223 Identities=15% Similarity=0.099 Sum_probs=111.2 Q ss_pred CCceeeee------e-----eeecccceeeeEecCC--cccceeeeeccccccceEEEeeeeeccccCCcccccccccee Q lcl|NC_019402. 1 MATLVSYD------L-----NGKKLSFANWISNLSP--TDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma~~~t~~------~-----~~~~~dl~d~I~~i~p--~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~ 67 (318) ||++-... + .+...-+.+.+..-+| .+=||.-...+. . +..=+..+|..+.. 
+- T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt---~-~~~~v~~~LP~~~f---------R~ 67 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPT---G-HRTTIRSGLPSATW---------RL 67 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCC---c-ceeeEeeccCCcee---------ee Confidence 77653322 1 1111123333333333 333443322110 0 11112233322111 00 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccc-hHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccc Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGK-ELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVAR 146 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~-e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r 146 (318) =+.. -......+.++.-.|=||.-.++|..-.... .|..+ ..++|+..+++-+.++++.+||+|. ++..|. T Consensus 68 lN~g-~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~--~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGd-----sa~~p~ 139 (328) T protein:vir:95 68 LNYG-VQPSKSTTVQVTDSVGMLETYAEVDKSLADL--NGNTAEFRLSEDRAFIEAMNQQMAQTLFYGD-----SSVNPQ 139 (328) T ss_pred cCCc-cCcccceeEEEEEEEEEEecceeechHHHhh--cCCHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----ccCChh Confidence 1122 2344467889999999999999999855543 45433 4588999999999999999999984 233577 Q ss_pred hhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcC---EEEEcchHHhhhhhhhhhhcccccc Q lcl|NC_019402. 147 QTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEAN---IIMFHPKHAAFFSSLMETSGVTNGQ 223 (318) Q Consensus 147 ~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~---~l~~~~~~k~~is~~~~~~~~~~~~ 223 (318) ...||...+..-.. +. -+++.++||... -|++..+..+....+ T Consensus 140 ~F~GL~~R~~~~s~----------------~~--------a~qiidaGgtg~~~TSi~~v~~g~~~~~gi---------- 185 (328) T protein:vir:95 140 QFMGLSSRYSSLSA----------------GN--------AQNIIDAGGTGTDNTSIWLVVWGENTVHGI---------- 185 (328) T ss_pred hhcchhhhcCcccc----------------cc--------ccceeecccCCCCceEEEEEEEcCCeEEEe---------- Confidence 89999866533210 00 123566665443 233222211111111 Q ss_pred eEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCcc--ceeeEEEEEEeE Q lcl|NC_019402. 
224 RMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGS--YEKWMIEMEVGL 301 (318) Q Consensus 224 r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd--~~k~~i~~E~tL 301 (318) + +++.+.|.+ ..|.|. ..+ +|.+ .|- .-.-++.+.+|| T Consensus 186 ----y-PkG~~~Gl~----~~d~g~-~~~-------------~~~~-----------------g~~y~~y~~~~~w~~Gl 225 (328) T protein:vir:95 186 ----F-PKGKKAGIQ----MEDKGQ-VTL-------------EDAN-----------------GGKYEGYRTHYKWDNGL 225 (328) T ss_pred ----c-ccccccCce----eeecCc-eee-------------ecCC-----------------CCeeeEEEEEEEeeeee Confidence 1 122222221 123331 111 1111 111 123356677889 Q ss_pred EEecccceeEEEEeccC Q lcl|NC_019402. 302 RHRNPYASGILEVKAGA 318 (318) Q Consensus 302 e~~N~~a~g~i~~lt~a 318 (318) -++++.+...|..+..| T Consensus 226 ~i~d~r~vvrI~NId~~ 242 (328) T protein:vir:95 226 ALRDWRYVVRIANIDVS 242 (328) T ss_pred EEcCcccEEEEecCccc Confidence 99999998888887555 No 141 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=73.07 E-value=0.17 Score=24.74 Aligned_cols=248 Identities=13% Similarity=0.145 Sum_probs=116.7 Q ss_pred CCc----eeeeee-eeecccceeee-------------EecCCcccceeeeeccccccceEEEee--eeeccccCCcccc Q lcl|NC_019402. 1 MAT----LVSYDL-NGKKLSFANWI-------------SNLSPTDTPFVSMTGKEAINQTLFQWQ--TDALAPVADPSDA 60 (318) Q Consensus 1 Ma~----~~t~~~-~~~~~dl~d~I-------------~~i~p~~TP~~s~i~~~~~~~~~~~W~--td~l~~~~~~~~~ 60 (318) |+. .++.+. .-..+|+.+-+ .+..|.. -+.+.+ .+.|. +..+++ . 
T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla-------~Gt~ik--tyK~~~~~y~gda-~----- 65 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMN-------VGSALK--QYRFKVEDSEKPN-G----- 65 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhcccccc-------CCceee--eeeeeceeecccc-c----- Confidence 663 333333 45555555421 1222221 122222 34554 233322 1 Q ss_pred ccccceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhc----Ccc Q lcl|NC_019402. 61 QKRNAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLR----NGA 136 (318) Q Consensus 61 ~~~na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~----g~~ 136 (318) ...||...|-... .|...+=++==|.|.-... |++|+...|.++-.-.--.+-+.+|..++-+.|+. ++- T Consensus 66 ----dVaEGe~Iplskv-t~~~~~t~~~~~kK~rK~t-TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~ 139 (303) T protein:vir:10 66 ----DVAEGDVIPLTKV-TREQVDITELQFAKYRKST-SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIE 139 (303) T ss_pred ----cccCCcccchhhh-eeeecceEEEEeecccccc-cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccc Confidence 2348888774432 2222222222245555555 99999888877644444444455555555555542 111 Q ss_pred ccCCCCCccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhh Q lcl|NC_019402. 137 KVDGSATVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMET 216 (318) Q Consensus 137 ~~~gs~t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~ 216 (318) ...+..++.-...||...++.. ..-++..|+.... -++||||...-. +-++ T Consensus 140 t~~~t~~t~~s~~glq~Al~~~-------------------------~~kl~~~~ed~~~-~V~FvNP~Daa~---yl~~ 190 (303) T protein:vir:10 140 NGKRTNKTKLSAENLQGALSKG-------------------------RANLSVLLDDEIT-PIAFVNPNDTAE---YLAN 190 (303) T ss_pred ccccccceeecHHHHHHHHHhh-------------------------hhhcccccccccc-EEEEEchHHHHH---Hhhc Confidence 1111122222233444333221 1112233443333 378999964332 2222 Q ss_pred hcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCC----Ccccee Q lcl|NC_019402. 
217 SGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAK----DGSYEK 292 (318) Q Consensus 217 ~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~lak----tGd~~k 292 (318) .... .....||. ..++.=+| +.||..+.+|.++++.-=.+.+.++|..+.. +|++ +.| +- T Consensus 191 A~i~---------~~~t~fG~--n~L~nfLG--~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g--~l~~~f~~t~D-~t 254 (303) T protein:vir:10 191 GFIN---------STGAQFGV--NLLTPYVG--VKIVEFADVPQGEVWMTVAENLNVAYANPRG--ELSRAFAFATD-AT 254 (303) T ss_pred CCcc---------hhhhhhhh--hhhhhhhc--ceEEEeccCCCceEEEeeccceEEEEecCch--hhhhhhhhccc-cc Confidence 2110 01122332 33444458 4689999999999999999999999997753 4442 233 22 Q ss_pred eEEEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 293 WMIEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 293 ~~i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) +. .|+-|.-..-|+.++-+..+ T Consensus 255 gl----IGv~h~~~~~~~t~eT~~~~ 276 (303) T protein:vir:10 255 GF----VGVLHDIQPQRLTSDTIYAS 276 (303) T ss_pred cc----eEEEeccccceeeehhHhHh Confidence 22 23333333334444444444 No 142 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=70.50 E-value=0.21 Score=24.32 Aligned_cols=264 Identities=14% Similarity=0.097 Sum_probs=117.6 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee------------ee----eccccccceEEEeeeeecc-ccCCccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV------------SM----TGKEAINQTLFQWQTDALA-PVADPSDAQKR 63 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~------------s~----i~~~~~~~~~~~W~td~l~-~~~~~~~~~~~ 63 (318) ||+ |...-.-+-|=|.+.+..-.|.-..|+ .+ -++..+ +.+-|.- |. 
++.+ T Consensus 1 MA~-T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i--~~P~~~~--l~Gd~~~------- 68 (324) T protein:vir:59 1 MAY-TKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTL--NMPYWND--LDGDSQV------- 68 (324) T ss_pred CCc-eeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEE--Eeccccc--CCCcccc------- Confidence 993 222333333444444433333332221 11 122233 3445532 21 1111 Q ss_pred cceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_019402. 64 NAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSAT 143 (318) Q Consensus 64 na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t 143 (318) .-||.+.+...........=+.+ --|.|++++-+..+. | ++-+..=-.+-..-+.|++++.+|.-- T Consensus 69 --v~~~~~i~~~~l~t~~~~a~i~~-~~k~~~~tD~a~~~s--g-~dp~~~i~~q~a~~~~~~~~~~lia~l-------- 134 (324) T protein:vir:59 69 --LNDTDDLVPQKINAGQDKAVLIL-RGNAWSSHDLAATLS--G-SDPMQAIGSRVAAYWAREMQKIVFAEL-------- 134 (324) T ss_pred --cCCCcccchhhcccceeeEEEEe-ecCceeehhhhhhhc--c-chHHHHHHHHHHHHHHHHHHHHHHHHH-------- Confidence 11444433222222222222221 234567766665542 2 243333222334456677777776321 Q ss_pred ccchhhhHHHHHhc-CCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhhhhhhhccccc Q lcl|NC_019402. 144 VARQTAGFSALVAA-KDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNG 222 (318) Q Consensus 144 ~~r~m~Gi~~~i~~-~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~ 222 (318) .|++.-... ++..+ ....+...|+.+.|.+.++++=|+......++|||.+...+-+. 
....+ T Consensus 135 -----~g~~~~~~~~~~~~d-------vsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~---~li~~- 198 (324) T protein:vir:59 135 -----AGVFSNDDMKDNKLD-------ISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQ---DLIEF- 198 (324) T ss_pred -----HHhhhccccccceee-------eeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHh---hhhhh- Confidence 111110000 01111 11233456999999999999988888888999999876665432 11111 Q ss_pred ceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCc---------eEEEEehhhcceeecCcccceec---CCCc-- Q lcl|NC_019402. 223 QRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN---------AVYFFTPSDWTQMVLRAPERTKL---AKDG-- 288 (318) Q Consensus 223 ~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~---------~~~~~D~~~~~~~~Lr~~~~e~l---aktG-- 288 (318) .+..++ +. .|-+-.| +.||.+..||.+ ..|+|=+..+.+...+++-.-+. +..| T Consensus 199 --~~~s~~-----~~---~i~~~~G--~~VivdD~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~ 266 (324) T protein:vir:59 199 --VKDSQS-----GI---RFPTYMN--KRVIVDDSMPVETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQD 266 (324) T ss_pred --cccccc-----Cc---eeeeecc--cEEEEeCCCCccccCCCCceEEEEEEecCeEEEeecCCCcceecccCccccce Confidence 111111 11 2444577 488999988853 37888888877776654322111 1112 Q ss_pred --cceeeEEEEEEeEEEecccc---eeEEEEeccC Q lcl|NC_019402. 289 --SYEKWMIEMEVGLRHRNPYA---SGILEVKAGA 318 (318) Q Consensus 289 --d~~k~~i~~E~tLe~~N~~a---~g~i~~lt~a 318 (318) -.++..+++=+|.......- +---+.|..+ T Consensus 267 ~l~~r~~~~~~p~G~s~~~~~~~~~sPt~~~L~~~ 301 (324) T protein:vir:59 267 ILINRKHFVLHPRGVKFTENAMAGTTPTDEELANG 301 (324) T ss_pred EEEEeeEEEeEeeeEEecccccCCCCCChhhhcCC Confidence 11333333334444432211 1111222222 No 143 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=69.87 E-value=0.22 Score=24.22 Aligned_cols=266 Identities=9% Similarity=-0.020 Sum_probs=122.4 Q ss_pred CCceeee----eeeeecccceeeeEecCCcccceeeeeccc--------cccceEEEeeeeeccccCCcccccccccee- Q lcl|NC_019402. 
1 MATLVSY----DLNGKKLSFANWISNLSPTDTPFVSMTGKE--------AINQTLFQWQTDALAPVADPSDAQKRNAVI- 67 (318) Q Consensus 1 Ma~~~t~----~~~~~~~dl~d~I~~i~p~~TP~~s~i~~~--------~~~~~~~~W~td~l~~~~~~~~~~~~na~~- 67 (318) |-+.-.. .+..+-+-+...|+ +++.=.+.+.. ......+.+..-+.. +.++. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~-----e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~----------G~a~~~ 65 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAY-----ETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGV----------GIAQIV 65 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHH-----hhhhcccccceecccccCCCCceeEEEeeeeecc----------CceeEe Confidence 3322111 11112222222222 23222222221 111112222221111 11111 Q ss_pred --eccccccccccCcEEecceEEEEeeeeeehhHHHHh---hhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCC Q lcl|NC_019402. 68 --EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVL---ANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA 142 (318) Q Consensus 68 --EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~---~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~ 142 (318) ++.|.|..... ..+. ..+|++-...++.+-+=+ ...|. +.-+.....+...+.+.+...++.|.+. T Consensus 66 ~~~~~dip~v~~~-~~~~--~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~ka~aA~~~~~~~~n~~~f~G~~~----- 136 (296) T protein:vir:10 66 ADYTDDLPLVDAL-ATER--QGKVFRFGNAFLISIDEIKVGQATGQ-SLSTRKQSLAFEAHDKLLDKLVWSGSTA----- 136 (296) T ss_pred CCCccccceeecc-ceeE--EEEEEEEEeeeeecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhceEEEeeccc----- Confidence 11222221111 1111 224455444444444433 23333 3333344455577777777777777322 Q ss_pred CccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhC-CC--CcCEEEEcchHHhhhhhhhhhhcc Q lcl|NC_019402. 143 TVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLS-GS--EANIIMFHPKHAAFFSSLMETSGV 219 (318) Q Consensus 143 t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~-G~--~~~~l~~~~~~k~~is~~~~~~~~ 219 (318) ...-||+. .-+.. ...+..+-...+==.++|..++.++|.. ++ .+.+|+++|.+...++.-..+.+. 
T Consensus 137 ---~g~~GLlN---~p~v~----~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~ 206 (296) T protein:vir:10 137 ---HGIPSVFD---YPNIN----NVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSV 206 (296) T ss_pred ---ccceeEee---cCCCc----cccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCc Confidence 12334432 11110 0001111011111267777888888964 33 566888899877766543222211 Q ss_pred cccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCC------ceEEEEe--hhhcceeecCcccceecCCCccce Q lcl|NC_019402. 220 TNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPE------NAVYFFT--PSDWTQMVLRAPERTKLAKDGSYE 291 (318) Q Consensus 220 ~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~------~~~~~~D--~~~~~~~~Lr~~~~e~laktGd~~ 291 (318) .. .+.+...|. .++|+.-+.+.. +.+++++ ++++.+.+=.++...++-+.+-+- T Consensus 207 t~-----------------l~~ik~~~~-~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~ 268 (296) T protein:vir:10 207 SY-----------------GEFFRQNNS-GVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQPKDLHF 268 (296) T ss_pred cH-----------------HHHHHHhcC-CceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeecccccCceE Confidence 11 112222233 245555555532 4456655 778888765666666666666666 Q ss_pred eeEEEEEE-eEEEecccceeEEEEeccC Q lcl|NC_019402. 292 KWMIEMEV-GLRHRNPYASGILEVKAGA 318 (318) Q Consensus 292 k~~i~~E~-tLe~~N~~a~g~i~~lt~a 318 (318) +..++.-. +++++-|.|...+.|+|=| T Consensus 269 ~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 269 KIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEeeEeeEEEEEEECCceeEEEeeeecC Confidence 66556655 5999999999999999999 No 144 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=60.59 E-value=0.14 Score=25.26 Aligned_cols=265 Identities=15% Similarity=0.132 Sum_probs=117.0 Q ss_pred CC----ceeeeeeeeec-ccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCccccccccceeeccccccc Q lcl|NC_019402. 
1 MA----TLVSYDLNGKK-LSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGSAAVDG 75 (318) Q Consensus 1 Ma----~~~t~~~~~~~-~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~da~~~ 75 (318) |+ ...|-++.+.- .+...-...|=-..-|+.+++....++.-.++.+..+-+.+-...- -..-+.=||...+.. T Consensus 127 ~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~-~~~kqa~EGd~L~~g 205 (410) T protein:vir:83 127 YARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQG-VAGGASDEKTELDSQ 205 (410) T ss_pred HHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccc-ccccccccccccccc Confidence 22 23333333321 1122111223335667777777777777777775544332110000 000112255544432 Q ss_pred cccCcEEecceEEEEeeeeeehhHHH-HhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhhHHHH Q lcl|NC_019402. 76 ERASTTVINNVTQILRKVVKVSDTAN-VLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAGFSAL 154 (318) Q Consensus 76 ~~~~~~~~~N~tQIf~~~~~VS~Ta~-a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~Gi~~~ 154 (318) . ++ + +|+- ++++||.+-.|.+|.-.+.--=..|.+--.+...+...+. .+..++ T Consensus 206 K---------l~--------~-~t~tA~ikTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~ate-------a~vra~ 260 (410) T protein:vir:83 206 K---------MV--------I-DRLTVNAKTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETE-------ALVGAA 260 (410) T ss_pred c---------ee--------e-eeccceeehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHH-------HHHHHH Confidence 2 11 1 1222 2466666666666655444433344444444332221111 111122 Q ss_pred HhcCCcccCccccceeeccCccccCHH----HHHHHHHHHHhC--CCCcCEEEEcchHHhhhhhhhhhhcccccceEEEe Q lcl|NC_019402. 155 VAAKDAADPDTGAIVHFETAAAALTEA----EIFKVTYNLYLS--GSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMF 228 (318) Q Consensus 155 i~~~~~~~~~~g~~~~~~~t~~~lTe~----~l~~~~~~i~~~--G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~ 228 (318) +... .+ +.+ ....+|.+ .+.|+...++++ |..-..|.++|.+-..+.+ +.... 
T Consensus 261 L~~t--~t---~~~-----a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~-----------~f~~~ 319 (410) T protein:vir:83 261 LAST--ST---GAV-----GYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGP-----------LFAPV 319 (410) T ss_pred HHHh--hh---hhh-----hhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccc-----------eeecc Confidence 2110 00 000 11122333 445666777776 4444556777776322222 22333 Q ss_pred cCC-ceEEEEEEEE----EEcCCCcEEEEEecCCCCCceEEEEehhhcceeecC--cccceecCCCccceeeEEEEEEeE Q lcl|NC_019402. 229 DGQ-DTRLNVYVSS----IVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLR--APERTKLAKDGSYEKWMIEMEVGL 301 (318) Q Consensus 229 ~~~-~~~~g~~v~~----~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr--~~~~e~laktGd~~k~~i~~E~tL 301 (318) ++. ..+.|+.++. +..-|= .+.|++.+..+++.+.++|+..+++---+ |++-.+=--++-.+.+. +-+.. T Consensus 320 ~~~~~dt~Gfg~~~lg~gi~G~~~-~ipVvm~~~a~AgTA~f~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS--gY~a~ 396 (410) T protein:vir:83 320 NPTNAHSTGFEAGRFGQGVMGSIS-GIPVVMSAALGSGDAYLFSTAAIECFEQRVGTLQVVEPSVFGLQVAYA--GYFST 396 (410) T ss_pred CCCCcccccccccccccchhhhhc-ccceEEecCCCcCeeeEeccceeeeeecCCceeEeeCCchhhhhhhhe--eeeee Confidence 332 2344555533 333332 36688888999999999999998765444 24433333333333333 33444 Q ss_pred EEecccceeEEEEeccC Q lcl|NC_019402. 302 RHRNPYASGILEVKAGA 318 (318) Q Consensus 302 e~~N~~a~g~i~~lt~a 318 (318) -.-+|++ +|- +.+| T Consensus 397 a~~~~~g--liP-v~g~ 410 (410) T protein:vir:83 397 LVVNEDA--IVP-LVGS 410 (410) T ss_pred ccccccc--eee-eccC Confidence 4455543 332 2333 No 145 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=54.28 E-value=0.51 Score=22.20 Aligned_cols=269 Identities=9% Similarity=0.013 Sum_probs=113.5 Q ss_pred CCceeeeeeee-ecccceeeeEecCC--cccceeeeeccc--------cccceEEEeeeeeccccCCcccccccccee-- Q lcl|NC_019402. 
1 MATLVSYDLNG-KKLSFANWISNLSP--TDTPFVSMTGKE--------AINQTLFQWQTDALAPVADPSDAQKRNAVI-- 67 (318) Q Consensus 1 Ma~~~t~~~~~-~~~dl~d~I~~i~p--~~TP~~s~i~~~--------~~~~~~~~W~td~l~~~~~~~~~~~~na~~-- 67 (318) +++-.-.++.. --.=+..++..|+| .+++.-.+.+.. ......+.+..-+.. +.++. T Consensus 19 ~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~----------G~a~~~~ 88 (319) T protein:vir:10 19 IQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKV----------GTAQIIA 88 (319) T ss_pred hhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccc----------cceeeec Confidence 11111111100 00000112223333 233333333321 111112222221110 11111 Q ss_pred -eccccccccccCcEEecceEEEEeeeeeehhHHHHhhhc--CccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCc Q lcl|NC_019402. 68 -EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANY--GRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATV 144 (318) Q Consensus 68 -EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~--G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~ 144 (318) +..|.|....... +. ..+|++-...++.+-+-+..+ ..-+.-......+...+.+.+...++.|-+. T Consensus 89 d~~~dip~v~~~~~-~~--~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~------- 158 (319) T protein:vir:10 89 DYTDDLPLVDALGT-SE--FGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAP------- 158 (319) T ss_pred Cccccccceeccce-ee--EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc------- Confidence 1111221111111 11 124555444444444433222 2234444455566677777777777777322 Q ss_pred cchhhhHHHHHhcCCcccCccccceeec--cCccc--cCHHHHHHHHHHHHhC-CC--CcCEEEEcchHHhhhhhhhhhh Q lcl|NC_019402. 145 ARQTAGFSALVAAKDAADPDTGAIVHFE--TAAAA--LTEAEIFKVTYNLYLS-GS--EANIIMFHPKHAAFFSSLMETS 217 (318) Q Consensus 145 ~r~m~Gi~~~i~~~~~~~~~~g~~~~~~--~t~~~--lTe~~l~~~~~~i~~~-G~--~~~~l~~~~~~k~~is~~~~~~ 217 (318) ...-||+. ..+... ..+... .+..+ --.++|..++.++|.. ++ .+.+|+++|..-..++.-..+. 
T Consensus 159 -~g~~GLlN---~p~~~~----~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~ 230 (319) T protein:vir:10 159 -HKIVSVFN---HPNITK----ITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPET 230 (319) T ss_pred -ccceeEEe---CCCcee----eecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCC Confidence 22344432 111100 000110 11111 1236778888889853 33 5678899998766664332211 Q ss_pred cccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCC------ceEEEE--ehhhcceeecCcccceecCCCcc Q lcl|NC_019402. 218 GVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPE------NAVYFF--TPSDWTQMVLRAPERTKLAKDGS 289 (318) Q Consensus 218 ~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~------~~~~~~--D~~~~~~~~Lr~~~~e~laktGd 289 (318) +. ...+.+...|. .++|+.-+.+.. +.++++ |++++++.+=.++...++-+.+- T Consensus 231 ~~-----------------t~l~~lk~~~~-~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~l 292 (319) T protein:vir:10 231 TM-----------------SYLDYFKSQNS-GIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQPKDL 292 (319) T ss_pred Ce-----------------eHHHHHHHhcC-CceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeeeecCc Confidence 11 11112222232 245555555532 334444 57778877655666555544443 Q ss_pred ceeeEEEEE-EeEEEecccceeEEEEe Q lcl|NC_019402. 290 YEKWMIEME-VGLRHRNPYASGILEVK 315 (318) Q Consensus 290 ~~k~~i~~E-~tLe~~N~~a~g~i~~l 315 (318) +.+.-.+.- .+++++-|.+...+.|+ T Consensus 293 ~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 293 HFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eEEEeeeeeeEEEEEEccceeEeeecC Confidence 333323332 56999999999999999 No 146 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=49.88 E-value=0.63 Score=21.70 Aligned_cols=260 Identities=12% Similarity=0.071 Sum_probs=116.9 Q ss_pred CCceeeeeeeeecccceeeeEecCCcccceee------------eeccccccceEEEeeeeecc-ccCCcccccccccee Q lcl|NC_019402. 
1 MATLVSYDLNGKKLSFANWISNLSPTDTPFVS------------MTGKEAINQTLFQWQTDALA-PVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~s------------~i~~~~~~~~~~~W~td~l~-~~~~~~~~~~~na~~ 67 (318) ||+ |...-.-+-|=|.+.+..-.|.-..|+. ++.+.--+.+.+.|. .|. ++.+. - T Consensus 1 MA~-T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~--~l~Gd~~~~---------~ 68 (351) T protein:vir:15 1 MAE-THLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLN--DLTGDPDNW---------T 68 (351) T ss_pred CCc-eeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccc--cCCCccccc---------C Confidence 993 3333444444555555554444443311 111111233345564 221 11111 1 Q ss_pred eccccccccccCcEEecceEEEE-----eeeeeehhHHHHhhhcCccc---hHHHHHHHHHHHHHHHHHHHHhcCccccC Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQIL-----RKVVKVSDTANVLANYGRGK---ELQYQMEKAGKEIKRDLEVALLRNGAKVD 139 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf-----~~~~~VS~Ta~a~~~~G~~~---e~a~q~~k~~~eikrd~E~a~i~g~~~~~ 139 (318) ||.+.+ ...++-.+|+. -|.|++++-+..+. | ++ +.+.|+. .-..|++++.+|.- T Consensus 69 ~~~~i~------~~kitt~~~~a~i~~~~kg~~~tD~a~~~s--g-~dp~~~i~~q~a---~~w~~~~q~~lla~----- 131 (351) T protein:vir:15 69 DSDDID------VNNLTSGKQQGIKFYQTKAYGYTDLGTMIS--G-APVQETIGNRFA---AFWQRADQKTLLSV----- 131 (351) T ss_pred CCcccc------hheecccceeEEEEeeccceehhhhhHhhc--c-chHHHHHHHHHH---HHHHHHHHHHHHHH----- Confidence 333322 33344444433 34566766665543 3 23 3444444 45667888887742 Q ss_pred CCCCccchhhhHHHHH--hcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC-CcCEEEEcchHHhhhhhhhhh Q lcl|NC_019402. 140 GSATVARQTAGFSALV--AAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS-EANIIMFHPKHAAFFSSLMET 216 (318) Q Consensus 140 gs~t~~r~m~Gi~~~i--~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~-~~~~l~~~~~~k~~is~~~~~ 216 (318) +.|++.-. ..++..+. ....++...|+.+.|.+.++++-+... ....++||+.+...+-+. 
T Consensus 132 --------l~gv~~~~~~~~~~~~d~-----t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~--- 195 (351) T protein:vir:15 132 --------LKGVMGVTKIANSKVYDQ-----TKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQ--- 195 (351) T ss_pred --------HHHHhhchhhcccceecc-----ccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhh--- Confidence 11111100 00111111 112345667999999999999988643 467788999876554432 Q ss_pred hcccccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCc---------eEEEEehhhcceeecC---cccceec Q lcl|NC_019402. 217 SGVTNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPEN---------AVYFFTPSDWTQMVLR---APERTKL 284 (318) Q Consensus 217 ~~~~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~---------~~~~~D~~~~~~~~Lr---~~~~e~l 284 (318) ....+ .+..++ +.. |-+=.| +.||.+..||.+ ..|+|=+..+.+.--. +..++++ T Consensus 196 ~li~~---~~~s~~-----~~~---i~t~~G--~~VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~ 262 (351) T protein:vir:15 196 GLIET---IQPQNG-----ATP---FEAYNG--LRIVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPL 262 (351) T ss_pred hhhhh---cccccc-----Ccc---cceecc--eEEEEcCCCccccCCCCCceeEEEEEecceeeeecCCcCcceeeccc Confidence 11111 011111 111 222245 588888888742 3666666655432212 2233444 Q ss_pred CCCcc----ceeeEEEEEEeEEEeccc-ce----eEEEEeccC Q lcl|NC_019402. 285 AKDGS----YEKWMIEMEVGLRHRNPY-AS----GILEVKAGA 318 (318) Q Consensus 285 aktGd----~~k~~i~~E~tLe~~N~~-a~----g~i~~lt~a 318 (318) +..|- ..+..+++=+|....++. .. 
--.+.|..+ T Consensus 263 ~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~ 305 (351) T protein:vir:15 263 INGGQDVIVQKRVGTIHVAGTSIKASFSPSKASFPTIDELAKS 305 (351) T ss_pred CCCCceEEEEeeeeeeeeeeeeecccccccCcCCcChHHhcCC Confidence 43331 234444444555554432 11 122233333 No 147 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=48.66 E-value=0.66 Score=21.56 Aligned_cols=255 Identities=13% Similarity=0.067 Sum_probs=89.4 Q ss_pred CC--cee----------eeee-eeecccceeeeEecCCcccceeeeeccccccceEEEeeeeeccccCCcccccccccee Q lcl|NC_019402. 1 MA--TLV----------SYDL-NGKKLSFANWISNLSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI 67 (318) Q Consensus 1 Ma--~~~----------t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~ 67 (318) |. .+. .... ......+...+........|..+..--.......--|.... .. T Consensus 198 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~---------------~~ 262 (480) T protein:vir:40 198 MPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLAEDGVDDTFISGTF---------------KA 262 (480) T ss_pred chhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceeeeccccceeeeeee---------------ec Confidence 11 000 0000 00000111111111111111111000000000000111100 00 Q ss_pred eccccccccccCcEEecceEEEEeeeeeehh-HHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccc Q lcl|NC_019402. 68 EGSAAVDGERASTTVINNVTQILRKVVKVSD-TANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVAR 146 (318) Q Consensus 68 EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~-Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r 146 (318) ++.+-. ..... ...-.-++.++-+.++. |++-+.-++ +.-+|=..+-...+++-.|.+||+|. |++ .. 
T Consensus 263 ~~~~~~--~~~~~-~~~~~~~~v~~l~~~~k~t~~lLDDa~--~l~~~i~~~l~~~~~~~ee~a~l~G~----g~g--~~ 331 (480) T protein:vir:40 263 GTDKNK--SQTAT-KRSLRPQMAEAYLQMDKATVRGVNDSG--ALSEYVMSEMVNRVIQKVEYNMILGS----VDG--SN 331 (480) T ss_pred cccccc--ccccc-cchhhHHHHHHHHHhHHHHHHHhhhhH--HHHHHHHHHHHHHHHHHHHHHhhccC----CCC--cc Confidence 111100 00000 00000011122222222 222222221 23444445556678888999999883 222 34 Q ss_pred hhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHH---HHHhCCCCcCEEEEcchHHhhhhhhhhhhcccccc Q lcl|NC_019402. 147 QTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTY---NLYLSGSEANIIMFHPKHAAFFSSLMETSGVTNGQ 223 (318) Q Consensus 147 ~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~---~i~~~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~ 223 (318) ..+|+...- + ..+......+.|+++.. +.|.++ ...+++|+..-.+|-+++ |. +. T Consensus 332 ~~~g~~~~~--~-------------~~~~~~~~~d~id~L~~al~~~y~~~--a~~~vmn~~t~~~I~klK-D~----~G 389 (480) T protein:vir:40 332 GFYGLKTAT--D-------------GWTKQIEYTDLFEGITDAVAECSISD--AITIVMSPQTFAELRKAK-GT----DG 389 (480) T ss_pred ccccceeec--c-------------cccccchhHHHHHHHHHhhhHHhhCC--CCEEEECHHHHHHHHHhh-cC----CC Confidence 455553211 0 11112233445544444 444443 335788998777887775 33 23 Q ss_pred eEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCce---------EEEEehhhcceeecCcccceecCCCccceeeE Q lcl|NC_019402. 224 RMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENA---------VYFFTPSDWTQMVLRAPERTKLAKDGSYEKWM 294 (318) Q Consensus 224 r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~---------~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~ 294 (318) +|...+. ...-...+-||..+ ++-+.+||.|. +++.|.+ ...++.|....+ .+++. T Consensus 390 ~Yi~q~~------~~~~~~~~llG~pv-v~~~~~~~~~~~~~~~~~~~~~~~d~~---~~~~~~~~~~~~-----~~~~~ 454 (480) T protein:vir:40 390 HSRFNEL------ATKEQIAQSFGAVN-LETRVWMPKDEVAVYNHDEYVLIGDLN---VENYNDFDLRYN-----VEQWL 454 (480) T ss_pred CeeccCc------ccccCcceecccce-eeeeccccCCcceeeeCCccEEEEecc---cceecccccccc-----hhhhh Confidence 4433221 11112333455321 33344454443 4555543 223333332222 23444 Q ss_pred EEEEEeEEEecccceeEEEEeccC Q lcl|NC_019402. 
295 IEMEVGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 295 i~~E~tLe~~N~~a~g~i~~lt~a 318 (318) ...-++..+++|-++..+..-..= T Consensus 455 ~e~~v~g~~~~~~~~~~~~~~~~~ 478 (480) T protein:vir:40 455 SETLVGGSIRGKNRSAYLKKKGSL 478 (480) T ss_pred hhhhhceeeEccccEEEEEeccCc Confidence 444567777777555444322111 No 148 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=42.95 E-value=0.86 Score=20.93 Aligned_cols=268 Identities=11% Similarity=0.087 Sum_probs=115.0 Q ss_pred CCc---eeeee------eeeecccceeeeEecCCcccceeee-------ec-cccccceEEEeeeeeccccCCccccccc Q lcl|NC_019402. 1 MAT---LVSYD------LNGKKLSFANWISNLSPTDTPFVSM-------TG-KEAINQTLFQWQTDALAPVADPSDAQKR 63 (318) Q Consensus 1 Ma~---~~t~~------~~~~~~dl~d~I~~i~p~~TP~~s~-------i~-~~~~~~~~~~W~td~l~~~~~~~~~~~~ 63 (318) |++ +|.-. ...+-+=++++|..-=....=|.++ +. +.+++-+ .+..-... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip--~~g~~~~~----------- 67 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVP--RISELGVE----------- 67 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEe--ccCcceee----------- Confidence 653 33300 1111111111111000000001111 11 1122211 11110000 Q ss_pred cceeeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_019402. 64 NAVIEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSAT 143 (318) Q Consensus 64 na~~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t 143 (318) ..-+|...+..........=...|.-...+.|++-.+.... .+..+.-.+.....|++.++..++.-.+...+.. 
T Consensus 68 -d~~~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~---~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~- 142 (341) T protein:vir:94 68 -DKATDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQAS---YDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTA- 142 (341) T ss_pred -eecCCCccccccccCceEEEEEeeeeecceeechHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHhhhccccc- Confidence 01123222222222222222234545566778876655443 3566677777888888999888764322211110 Q ss_pred ccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCC--CcCEEEEcchHHhhhhh---hhhhhc Q lcl|NC_019402. 144 VARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGS--EANIIMFHPKHAAFFSS---LMETSG 218 (318) Q Consensus 144 ~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~--~~~~l~~~~~~k~~is~---~~~~~~ 218 (318) .+. ..........+.+..++.+.|.++.+.+=+++. +.+.++|+|.+...+-. |.... T Consensus 143 ~~~----------------~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~- 205 (341) T protein:vir:94 143 SQN----------------VFSSSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKD- 205 (341) T ss_pred cCc----------------cccCccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhh- Confidence 000 000111222344566888999999898877765 56789999988766632 21110 Q ss_pred ccccceEEEecCCc-eEEEEEEEEEEcCCCcEEEEEecCCCCCceE---------------------------------- Q lcl|NC_019402. 219 VTNGQRMKMFDGQD-TRLNVYVSSIVDPLGCQYKLVPNRWMPENAV---------------------------------- 263 (318) Q Consensus 219 ~~~~~r~~~~~~~~-~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~---------------------------------- 263 (318) +.++. -+-|. |-.+ +| ++|+...++|.+.. T Consensus 206 ---------~~g~~~l~~G~-ig~i---~G--~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 270 (341) T protein:vir:94 206 ---------FINNAPIAQGQ-IGSL---MG--VRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSL 270 (341) T ss_pred ---------ccccchhheee-eeeE---ec--eEEEEeccccccccccccccccceeccccccccccccccccccccccc Confidence 00110 11011 1111 35 56666666664321 Q ss_pred ---EEEehh------hcceeecCcccceecCCCccc----eeeEEEEE--EeEEEecccceeEEEEeccC Q lcl|NC_019402. 
264 ---YFFTPS------DWTQMVLRAPERTKLAKDGSY----EKWMIEME--VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 264 ---~~~D~~------~~~~~~Lr~~~~e~laktGd~----~k~~i~~E--~tLe~~N~~a~g~i~~lt~a 318 (318) |++-.+ .+.+..++....+++...|++ +..+|++= +|.++++|.+..-|.-.... T Consensus 271 ~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 271 PATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDT 340 (341) T ss_pred EEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCC Confidence 111111 122223333344555555533 34555555 67888999886444333333 No 149 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=40.58 E-value=0.96 Score=20.67 Aligned_cols=268 Identities=8% Similarity=-0.037 Sum_probs=110.7 Q ss_pred CCce----ee--eeeeeecccceeeeEe------cCCcccceeeeeccccccceEEEeeeeeccccCCcccccccccee- Q lcl|NC_019402. 1 MATL----VS--YDLNGKKLSFANWISN------LSPTDTPFVSMTGKEAINQTLFQWQTDALAPVADPSDAQKRNAVI- 67 (318) Q Consensus 1 Ma~~----~t--~~~~~~~~dl~d~I~~------i~p~~TP~~s~i~~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~- 67 (318) |.+- -+ -.+..+-+-+...|+. .-+..=|+.+-++-. + ..+.+..-+.. +.++. T Consensus 17 ~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~-~--et~~~~~~e~~----------G~a~~~ 83 (314) T protein:vir:10 17 EQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGH-A--KYFEYPEFDGV----------GIAQII 83 (314) T ss_pred HhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCc-e--eEEEeeeeccc----------cceeee Confidence 1111 00 1122222222222221 111111222211111 1 11222221111 11111 Q ss_pred --eccccccccccCcEEecceEEEEeeeeeehhHHHHh---hhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCC Q lcl|NC_019402. 68 --EGSAAVDGERASTTVINNVTQILRKVVKVSDTANVL---ANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSA 142 (318) Q Consensus 68 --EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~---~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~ 142 (318) .+.|.|.....-... 
..+|++-...+..+-+=+ ...|. +.-+.....+...+.+.+.+.++.|.+. T Consensus 84 ~d~~~dip~vd~~~~~~---~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aA~~~~~~~~n~i~f~G~~~----- 154 (314) T protein:vir:10 84 ADYSDDLPLVDAFMTEK---QGKVFRFGNAFLISTDEIKAGAATGQ-SLSARKQALAFEAHDNLLDKLVWSGSAP----- 154 (314) T ss_pred CCcccccceeeccccee---EEEEEEEEeeEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhceEEEeeccc----- Confidence 111222111111111 123344334444433322 33343 4344444556666666666666766322 Q ss_pred CccchhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhC--C-CCcCEEEEcchHHhhhhhhhhhhcc Q lcl|NC_019402. 143 TVARQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLS--G-SEANIIMFHPKHAAFFSSLMETSGV 219 (318) Q Consensus 143 t~~r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~--G-~~~~~l~~~~~~k~~is~~~~~~~~ 219 (318) ...-||+ ...+ +.+.... ..-.++..+ .++|..++.++|.. | ..|++|+++|.+-..++.-..+.+. T Consensus 155 ---~g~~GLl---N~p~-v~~~~~~--~~WaT~~ei-~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~ 224 (314) T protein:vir:10 155 ---HGIVSVF---DQPN-INNVVAT--PNWSVPQNA-IDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNL 224 (314) T ss_pred ---ccceeEe---ecCC-CccccCC--CCcccHHHH-HHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCc Confidence 2234443 2111 1111000 011122222 67888899999974 3 2567899998866655432111110 Q ss_pred cccceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCC------ceEE--EEehhhcceeecCcccceecCCCccce Q lcl|NC_019402. 220 TNGQRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPE------NAVY--FFTPSDWTQMVLRAPERTKLAKDGSYE 291 (318) Q Consensus 220 ~~~~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~------~~~~--~~D~~~~~~~~Lr~~~~e~laktGd~~ 291 (318) .. .+.+.-+|= .++|+.-+++.. +.++ .=|++.+++.+=.|+..-++-+.+-+. 
T Consensus 225 tv-----------------l~~l~~n~~-~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~ 286 (314) T protein:vir:10 225 SY-----------------GELFTRNNP-GLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHF 286 (314) T ss_pred cH-----------------HHHHHHhCC-CcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecceecCceE Confidence 00 000111111 234444444431 2233 345666777654565555554444444 Q ss_pred eeEEEEE-EeEEEecccceeEEEEeccC Q lcl|NC_019402. 292 KWMIEME-VGLRHRNPYASGILEVKAGA 318 (318) Q Consensus 292 k~~i~~E-~tLe~~N~~a~g~i~~lt~a 318 (318) +.-.+.. .+++++-|.+...+.|+|=| T Consensus 287 ~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 287 RYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEcceeeeEEEEEECcceeEeeeeeecC Confidence 4433444 57999999999999999999 No 150 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=24.46 E-value=2.2 Score=18.73 Aligned_cols=225 Identities=10% Similarity=0.046 Sum_probs=109.1 Q ss_pred CCceeeeeee----eeccc-------ceeeeEecCC--cccceeeeeccccccceEEEe-eeeeccccCCccccccccce Q lcl|NC_019402. 1 MATLVSYDLN----GKKLS-------FANWISNLSP--TDTPFVSMTGKEAINQTLFQW-QTDALAPVADPSDAQKRNAV 66 (318) Q Consensus 1 Ma~~~t~~~~----~~~~d-------l~d~I~~i~p--~~TP~~s~i~~~~~~~~~~~W-~td~l~~~~~~~~~~~~na~ 66 (318) ||++-..... -++.+ +.+.+..-+| .+-||.-.- . .+.|.- ...+|-.+.. ++. T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N---~--~tg~~~~vrt~LP~~~f-------R~l 68 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCN---D--GSKHKTTIRAGIPEPVW-------RRY 68 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhccc---C--CcccceeEEEecCCchh-------hhc Confidence 7765433211 11221 2233333333 344554211 1 111111 1122211100 000 Q ss_pred eeccccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCc-cchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCcc Q lcl|NC_019402. 
67 IEGSAAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGR-GKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVA 145 (318) Q Consensus 67 ~EG~da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~-~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~ 145 (318) =+| -.+....+.++.=-|=||.-.+.|=.-.. ...|. ..-.++|+..+++-+...++..||+|- ++..| T Consensus 69 N~g---~~~s~~tt~qvt~~l~ilgg~~eVDr~La--~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD-----sa~~p 138 (335) T protein:vir:73 69 NQG---VQPTKTQTVPVTDTTGMLYDLGFVDKALA--DRSNNAAAFRVSENMGKLQGFNNKVARYSIYGN-----TDAEP 138 (335) T ss_pred CCc---cccccceEEEEEEEEEEecchhhhhHHHH--hhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCC-----cCCCh Confidence 011 23344677778888889999888885322 22232 224788999999999999999999984 23457 Q ss_pred chhhhHHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcC---EEEEcchHHhhhhhhhhhhccccc Q lcl|NC_019402. 146 RQTAGFSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEAN---IIMFHPKHAAFFSSLMETSGVTNG 222 (318) Q Consensus 146 r~m~Gi~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~---~l~~~~~~k~~is~~~~~~~~~~~ 222 (318) ..+.||...+....-.++. -.+++-++||... -|++..+..+....+ T Consensus 139 ~~FdGL~kR~~~~st~~a~---------------------~a~~iIdaGGtG~~~TSi~~v~wg~~~~~gi--------- 188 (335) T protein:vir:73 139 EAFMGLAPRFNTLSTSKAA---------------------SAENVFSAGGSGSTNTSIWFMSWGENTAHMI--------- 188 (335) T ss_pred hhccchhhhhcCccccccC---------------------cccceeeccccccCceEEEEEEEcCCeeEEE--------- Confidence 8899998877432111111 1123344455332 233333222222111 Q ss_pred ceEEEecCCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCcc--ceeeEEEEEEe Q lcl|NC_019402. 223 QRMKMFDGQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGS--YEKWMIEMEVG 300 (318) Q Consensus 223 ~r~~~~~~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd--~~k~~i~~E~t 300 (318) + +++.+.|..+ .|.|. .. 
++|.+ .|- .-.-++-+.+| T Consensus 189 -----y-PkG~kaGl~~----~d~g~-~~-------------~~d~~-----------------G~~y~~~~~~~~w~~G 227 (335) T protein:vir:73 189 -----Y-PEGMVAGFQH----EDLGD-DL-------------VSDGN-----------------GGQFRAYRDEFKWDIG 227 (335) T ss_pred -----c-ccCcccccee----eeccc-ee-------------eecCC-----------------CCEEeEEEeeeeeeee Confidence 1 2333334322 24442 11 11111 111 12335667788 Q ss_pred EEEecccceeEEEEeccC Q lcl|NC_019402. 301 LRHRNPYASGILEVKAGA 318 (318) Q Consensus 301 Le~~N~~a~g~i~~lt~a 318 (318) |-++++.+..+|..+.-| T Consensus 228 l~i~d~r~vvRI~NIdvs 245 (335) T protein:vir:73 228 LSVRDWRSISRICNIDVT 245 (335) T ss_pred eEEeCcccEEEEeecccc Confidence 899999988888888755 No 151 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=24.00 E-value=2.2 Score=18.67 Aligned_cols=255 Identities=15% Similarity=0.205 Sum_probs=110.4 Q ss_pred CCceeeeee-eeecccceeeeEecCCcccceeeeec---------cccccceEEEeeeeeccccCCccccccccceeecc Q lcl|NC_019402. 1 MATLVSYDL-NGKKLSFANWISNLSPTDTPFVSMTG---------KEAINQTLFQWQTDALAPVADPSDAQKRNAVIEGS 70 (318) Q Consensus 1 Ma~~~t~~~-~~~~~dl~d~I~~i~p~~TP~~s~i~---------~~~~~~~~~~W~td~l~~~~~~~~~~~~na~~EG~ 70 (318) =-..++.+. .-+.+|+.+-+.. -=.-++.++| +.+.+ ++..|..- ++ +. ...||. T Consensus 10 ~nlt~~~dl~~~~siDf~~~f~~---~i~~L~~~LGv~r~~pla~GstIk-t~k~~~y~-gd-a~---------dVaEGe 74 (296) T protein:vir:98 10 ENLIKSTDLKYPITIDVTNKFQE---NISKLLEMLGVTRKISVSEGMTLK-TYAGYDVT-LA-EG---------NVPEGE 74 (296) T ss_pred CCCcchhhhhhhhhhhhHHHHhh---hHHHHHHHhhhcccccccCCCEEe-eccceeee-ec-cc---------cccCCc Confidence 111112221 2333333322210 0000111111 22221 12235432 11 11 134888 Q ss_pred ccccccccCcEEecceEEEEeeeeeehhHHHHhhhcCccchHHHHHHHHHHHHHHHHHHHHhcCccccCCCCCccchhhh Q lcl|NC_019402. 
71 AAVDGERASTTVINNVTQILRKVVKVSDTANVLANYGRGKELQYQMEKAGKEIKRDLEVALLRNGAKVDGSATVARQTAG 150 (318) Q Consensus 71 da~~~~~~~~~~~~N~tQIf~~~~~VS~Ta~a~~~~G~~~e~a~q~~k~~~eikrd~E~a~i~g~~~~~gs~t~~r~m~G 150 (318) ..|.........-.=-.+| .|.-... |.+|++-.|.++-.-.--.+-+.+|.+++-+-|+.-- +. +..++.-.-.| T Consensus 75 ~Iplskvt~~~~~t~t~~i-kK~rK~t-TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~L-kt-aT~t~~~t~~~ 150 (296) T protein:vir:98 75 VIPLSKVERKIHSEKKIEL-KKYRKAT-TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTAL-KT-GTGTQDALGAG 150 (296) T ss_pred ccchhhheeeecceEEEEe-ecccccc-CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHH-hc-ccceeeechhh Confidence 8776554332211112234 3333333 8999987887764444444445555555555554211 00 00110011123 Q ss_pred HHHHHhcCCcccCccccceeeccCccccCHHHHHHHHHHHHh-CCCCcCEEEEcchHHhhhhhhhhhhcccccceEEEec Q lcl|NC_019402. 151 FSALVAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYL-SGSEANIIMFHPKHAAFFSSLMETSGVTNGQRMKMFD 229 (318) Q Consensus 151 i~~~i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~-~G~~~~~l~~~~~~k~~is~~~~~~~~~~~~r~~~~~ 229 (318) |...+.. ...=+...|+ .+....++|+||... .++-++.+. T Consensus 151 lQ~Ala~-------------------------~~~~l~~~feded~~~~V~FVnP~D~---a~ylg~a~i---------- 192 (296) T protein:vir:98 151 LQGALAS-------------------------AWGKLQVLFEDYGSERAIVFANSLDV---AEYIAKAGI---------- 192 (296) T ss_pred HHHHHHH-------------------------HhhhhhhhccccCCCceEEEEehHHH---HHHhcCCcc---------- Confidence 3322211 1111233453 355667889999753 334333321 Q ss_pred CCceEEEEEEEEEEcCCCcEEEEEecCCCCCceEEEEehhhcceeecCcccceecCCCccceeeEEEEEEeEEEecccce Q lcl|NC_019402. 230 GQDTRLNVYVSSIVDPLGCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAPERTKLAKDGSYEKWMIEMEVGLRHRNPYAS 309 (318) Q Consensus 230 ~~~~~~g~~v~~~~tdfG~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~~~e~laktGd~~k~~i~~E~tLe~~N~~a~ 309 (318) ...+.+|. 
..++.=+| ..||..+.+|.++++..=.+.+.++|..+- .-+|++...-.- -=.|=.++-|.-...| T Consensus 193 t~qt~fG~--tyl~nfLG--~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~-~~~l~~~f~~~~-d~tglIGv~h~~~~~~ 266 (296) T protein:vir:98 193 TTQTAFGL--TYLVDFTG--TVIISTNDVTKGEIWATVPENIIFAYINPN-NSELAKEFNLYG-DPTGYIGMNHFQENTT 266 (296) T ss_pred chhheech--hhhhhccc--cEEEEcCcCCCceEEEeeecceEEEeeccc-ccchhhhhcccc-ccccceEEEeccccce Confidence 12333443 22333457 479999999999999999999999999763 124443321000 0001122223222334 Q ss_pred eEEEEeccC Q lcl|NC_019402. 310 GILEVKAGA 318 (318) Q Consensus 310 g~i~~lt~a 318 (318) +.++-+..+ T Consensus 267 ~t~eT~~~~ 275 (296) T protein:vir:98 267 LTIQTLLVS 275 (296) T ss_pred eeehhHhHh Confidence 444444333 No 152 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=20.59 E-value=2.7 Score=18.18 Aligned_cols=290 Identities=11% Similarity=0.080 Sum_probs=120.1 Q ss_pred CCceeeeeeeeecccceeeeEecCCccccee--eeeccccccceEEEeeeee--ccccCCccccccccceeecccccccc Q lcl|NC_019402. 1 MATLVSYDLNGKKLSFANWISNLSPTDTPFV--SMTGKEAINQTLFQWQTDA--LAPVADPSDAQKRNAVIEGSAAVDGE 76 (318) Q Consensus 1 Ma~~~t~~~~~~~~dl~d~I~~i~p~~TP~~--s~i~~~~~~~~~~~W~td~--l~~~~~~~~~~~~na~~EG~da~~~~ 76 (318) ||.+..+.+.. .|...|-.+.+.-.||+ .++.......+.++|..-. +.-++.. .-.++..+... T Consensus 1 M~~i~d~f~~~---~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~--------v~~~~~~~~~~ 69 (348) T protein:vir:96 1 MGLIYDKVTAS---NIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKA--------AAFDTNVTIRD 69 (348) T ss_pred CcchhhccCHH---HHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeee--------ecCCCCcceec Confidence 99876554432 34444444444444554 3444455666667665522 1111110 00111111110 Q ss_pred ccCcEEecceEEE--EeeeeeehhHH-HHh---hhcCc-------cchHHHHHHHHHHHHHHHHHHHHh----cCccccC Q lcl|NC_019402. 
77 RASTTVINNVTQI--LRKVVKVSDTA-NVL---ANYGR-------GKELQYQMEKAGKEIKRDLEVALL----RNGAKVD 139 (318) Q Consensus 77 ~~~~~~~~N~tQI--f~~~~~VS~Ta-~a~---~~~G~-------~~e~a~q~~k~~~eikrd~E~a~i----~g~~~~~ 139 (318) . +..-....++ |.....|+-.. +.. ...+. .+.++..+......+.+.+|+... .|+-... T Consensus 70 r--~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~ 147 (348) T protein:vir:96 70 R--VSAEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFT 147 (348) T ss_pred c--cceeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEee Confidence 0 0000000111 11222333221 111 11111 012223333444567777886653 3443333 Q ss_pred CCCCccchhhhHHHH-HhcCCcccCccccceeeccCccccCHHHHHHHHHHHHhCCCCcCEEEEcchHHhhhhh---hhh Q lcl|NC_019402. 140 GSATVARQTAGFSAL-VAAKDAADPDTGAIVHFETAAAALTEAEIFKVTYNLYLSGSEANIIMFHPKHAAFFSS---LME 215 (318) Q Consensus 140 gs~t~~r~m~Gi~~~-i~~~~~~~~~~g~~~~~~~t~~~lTe~~l~~~~~~i~~~G~~~~~l~~~~~~k~~is~---~~~ 215 (318) +++-. .. + .| ....+.++.. ..-.+.+..++ .+|.++.+.+=++|..++++++++....+|-+ +.+ T Consensus 148 ~~~~~-~~---v-dfg~~~~~~~t~~---~~W~~~~adp~--~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~ 217 (348) T protein:vir:96 148 SDGVN-KD---I-DYGVKADHKKQVS---KSWAEPGATPL--ADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVK 217 (348) T ss_pred cCCee-EE---E-eccCCcccceeec---cccCCCCCCHH--HHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHH Confidence 32210 00 0 01 1111111111 01111222232 67777877766678899999999887776643 222 Q ss_pred hhcccccceEEEecCC-----ceEEEEEEEEEEcCC----CcEEEEEecCCCCCceEEEEehhhcceeecCcc-cc---- Q lcl|NC_019402. 216 TSGVTNGQRMKMFDGQ-----DTRLNVYVSSIVDPL----GCQYKLVPNRWMPENAVYFFTPSDWTQMVLRAP-ER---- 281 (318) Q Consensus 216 ~~~~~~~~r~~~~~~~-----~~~~g~~v~~~~tdf----G~~v~iv~nr~m~~~~~~~~D~~~~~~~~Lr~~-~~---- 281 (318) .......+........ ....|..+-.|..-| |+. .+++|++.++++-...+-..+.=++ +. 
T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~d~~G~~-----~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~ 292 (348) T protein:vir:96 218 AIKPLAGDGSSVTKAELQNYVADNYGVEIVLENGTYRNEKGEV-----SKFFPDGHLTLIPNGPLGNTVFGTTPEESDLF 292 (348) T ss_pred HHhccCCccccccHHHHHHHHhhhcCceEEEEccEEEecCCcE-----eccccCCeEEEEcCCCceeEEeccChhhhhhh Confidence 1110000000000000 012244444443322 532 2578888888876654322221111 00 Q ss_pred ---------ee-----cC---CCccceeeEEEEEEe--EEEecccceeEEEEeccC Q lcl|NC_019402. 282 ---------TK-----LA---KDGSYEKWMIEMEVG--LRHRNPYASGILEVKAGA 318 (318) Q Consensus 282 ---------e~-----la---ktGd~~k~~i~~E~t--Le~~N~~a~g~i~~lt~a 318 (318) +. +. ++.|-....+.+|.. --+.++.+..+++.|++- T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 293 ADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred hcccccccceecCCeeEEEeeecCCCceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 00 01 234544566666644 455667788999999888 Done!