Query lcl|NC_013692.1_cdsid_YP_003358474.1 [gene=PP-LIT1_gp77] [protein=N4 gp56-like protein] [protein_id=YP_003358474.1] [location=complement(60626..61825)] Match_columns 399 No_of_seqs 29 out of 32 Neff 4.0 Searched_HMMs 1612 Date Thu Nov 7 13:57:30 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_77 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_77_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95875 Length: 401 100.0 4E-185 3E-188 1031.5 29.3 390 10-399 1-401 (401) 2 protein:vir:105334 Length: 276 99.7 4.6E-19 2.9E-22 121.1 16.6 269 18-399 1-271 (276) 3 protein:vir:3613 Length: 272 # 99.7 5E-18 3.1E-21 115.5 16.3 270 18-398 1-272 (272) 4 protein:vir:97433 Length: 274 99.6 1.5E-17 9.2E-21 112.8 17.9 268 5-399 1-272 (274) 5 protein:vir:94494 Length: 274 99.6 1.5E-17 9.2E-21 112.8 17.9 268 5-399 1-272 (274) 6 protein:vir:96262 Length: 274 99.6 1.1E-17 6.9E-21 113.5 17.1 270 5-399 1-271 (274) 7 protein:vir:95898 Length: 274 99.6 1.1E-17 6.9E-21 113.5 17.1 270 5-399 1-271 (274) 8 protein:vir:1239 Length: 274 # 99.6 1.8E-17 1.1E-20 112.3 17.1 270 5-399 1-271 (274) 9 protein:vir:93696 Length: 364 99.6 1.2E-16 7.7E-20 107.8 20.1 314 11-399 1-362 (364) 10 protein:vir:96833 Length: 275 99.6 6.7E-17 4.2E-20 109.3 17.0 271 1-399 1-273 (275) 11 protein:vir:93742 Length: 274 99.6 1E-16 6.3E-20 108.3 17.7 268 5-399 1-272 (274) 12 protein:vir:96123 Length: 274 99.6 1.4E-16 8.8E-20 107.5 17.5 267 5-399 1-271 (274) 13 protein:vir:80930 Length: 278 99.6 1.5E-16 9.2E-20 107.4 16.5 276 18-399 1-278 (278) 14 protein:vir:94622 Length: 341 99.5 8.5E-15 5.3E-18 97.7 20.8 325 1-399 1-340 (341) 15 protein:vir:3033 Length: 272 # 99.5 2.3E-15 1.5E-18 100.8 16.8 268 18-399 1-270 (272) 16 protein:vir:9820 Length: 272 # 99.5 2.3E-15 1.5E-18 100.8 16.8 268 18-399 1-270 (272) 17 protein:vir:739 Length: 231 # 99.4 6.2E-15 3.8E-18 98.5 13.3 231 55-398 1-231 (231) 18 protein:vir:95107 Length: 270 99.4 2.4E-14 1.5E-17 95.2 15.1 263 18-399 1-265 (270) 19 protein:vir:80180 Length: 381 99.3 1.7E-12 1.1E-15 85.1 20.1 316 5-398 1-381 (381) 20 protein:vir:10123 Length: 404 99.3 9.7E-12 6E-15 81.0 22.7 337 1-399 3-404 (404) 21 protein:vir:104439 Length: 404 99.3 9.7E-12 6E-15 81.0 22.7 337 1-399 3-404 (404) 22 protein:vir:3298 Length: 404 # 99.3 9.7E-12 6E-15 81.0 22.7 337 1-399 3-404 (404) 23 protein:vir:819 Length: 404 # 99.3 9.7E-12 6E-15 81.0 22.7 337 1-399 3-404 (404) 24 protein:vir:78739 Length: 332 99.2 2.5E-11 1.6E-14 78.7 22.9 322 1-396 1-332 (332) 25 protein:vir:7990 Length: 273 # 99.2 5.3E-12 3.3E-15 82.4 18.9 264 5-359 1-273 (273) 26 protein:vir:105822 Length: 273 99.2 6.6E-12 4.1E-15 81.9 19.1 262 18-359 1-273 (273) 27 protein:vir:102605 Length: 273 99.2 6.6E-12 4.1E-15 81.9 19.1 262 18-359 1-273 (273) 28 protein:vir:105610 Length: 430 99.1 9.8E-11 6.1E-14 75.4 22.7 328 15-399 1-425 (430) 29 protein:vir:2770 Length: 318 # 99.1 8.7E-11 5.4E-14 75.7 19.2 258 5-314 1-318 (318) 30 protein:vir:95763 Length: 297 99.0 2.9E-11 1.8E-14 78.3 15.7 289 5-399 1-297 (297) 31 protein:vir:10450 Length: 344 99.0 4.9E-10 3E-13 71.6 22.4 320 1-398 1-344 (344) 32 protein:vir:3136 Length: 322 # 99.0 6.4E-11 4E-14 76.5 17.1 307 11-399 1-320 (322) 33 protein:vir:108303 Length: 418 99.0 9.5E-11 5.9E-14 75.5 17.8 303 5-399 1-418 (418) 34 protein:vir:80213 Length: 334 99.0 1.3E-09 8.1E-13 69.3 23.6 309 5-399 1-333 (334) 35 protein:vir:41 Length: 299 # N 99.0 1.3E-10 7.9E-14 74.8 16.4 288 11-399 1-299 (299) 36 protein:vir:94576 Length: 347 98.9 1.1E-09 6.5E-13 69.8 21.1 318 1-398 1-347 (347) 37 protein:vir:1541 Length: 347 # 98.9 8.6E-10 5.3E-13 70.3 20.3 317 1-399 1-344 (347) 38 protein:vir:94711 Length: 347 98.9 1.2E-09 7.3E-13 69.5 20.8 317 5-399 1-347 (347) 39 protein:vir:99075 Length: 392 98.9 2.6E-10 1.6E-13 73.2 16.8 313 5-399 1-326 (392) 40 protein:vir:3364 Length: 347 # 98.9 1.9E-09 1.2E-12 68.5 21.0 319 1-399 1-345 (347) 41 protein:vir:2201 Length: 345 # 98.9 8.4E-09 5.2E-12 64.9 23.3 319 1-398 1-345 (345) 42 protein:vir:105905 Length: 304 98.9 6.1E-10 3.8E-13 71.1 16.7 287 1-397 1-304 (304) 43 protein:vir:94142 Length: 304 98.9 6.1E-10 3.8E-13 71.1 16.7 287 1-397 1-304 (304) 44 protein:vir:8885 Length: 347 # 98.8 1.7E-09 1.1E-12 68.6 19.1 317 1-399 1-347 (347) 45 protein:vir:174 Length: 423 # 98.8 7.6E-10 4.7E-13 70.6 16.9 297 18-398 1-423 (423) 46 protein:vir:7771 Length: 330 # 98.8 3.3E-09 2E-12 67.1 17.9 303 1-399 1-324 (330) 47 protein:vir:102655 Length: 322 98.7 1.9E-08 1.2E-11 62.9 21.0 313 11-399 1-322 (322) 48 protein:vir:98339 Length: 415 98.7 1.3E-09 7.9E-13 69.4 14.2 294 1-399 97-405 (415) 49 protein:vir:79987 Length: 415 98.7 1.3E-09 7.9E-13 69.4 14.2 294 1-399 97-405 (415) 50 protein:vir:81100 Length: 415 98.7 1.3E-09 7.9E-13 69.4 14.2 294 1-399 97-405 (415) 51 protein:vir:485 Length: 407 # 98.7 2.4E-09 1.5E-12 67.9 15.5 288 1-399 91-401 (407) 52 protein:vir:3525 Length: 423 # 98.7 4.3E-09 2.7E-12 66.4 16.6 296 24-398 1-423 (423) 53 protein:vir:102119 Length: 404 98.7 4.7E-09 2.9E-12 66.3 16.5 295 1-399 88-401 (404) 54 protein:vir:96223 Length: 324 98.7 2.3E-09 1.4E-12 68.0 14.1 292 1-399 14-316 (324) 55 protein:vir:9410 Length: 415 # 98.7 3.1E-09 2E-12 67.2 14.8 294 1-399 97-405 (415) 56 protein:vir:8187 Length: 311 # 98.6 1.7E-08 1.1E-11 63.2 18.5 294 18-399 1-311 (311) 57 protein:vir:78523 Length: 338 98.6 1.3E-08 8E-12 63.8 17.8 306 5-399 1-336 (338) 58 protein:vir:6324 Length: 335 # 98.6 1.6E-07 9.6E-11 57.9 23.4 313 1-399 1-328 (335) 59 protein:vir:9759 Length: 303 # 98.6 1.5E-08 9.6E-12 63.4 17.8 286 18-399 1-303 (303) 60 protein:vir:99675 Length: 324 98.6 1.4E-08 8.8E-12 63.6 17.5 272 52-399 1-297 (324) 61 protein:vir:9309 Length: 324 # 98.6 5.2E-09 3.2E-12 66.0 14.7 291 1-399 14-316 (324) 62 protein:vir:103323 Length: 364 98.6 2.4E-08 1.5E-11 62.3 18.2 313 1-399 1-340 (364) 63 protein:vir:4339 Length: 395 # 98.6 1.8E-08 1.1E-11 63.1 17.0 284 1-398 102-395 (395) 64 protein:vir:4511 Length: 409 # 98.6 8E-09 5E-12 65.0 15.1 293 1-399 99-407 (409) 65 protein:vir:4700 Length: 415 # 98.6 1.2E-08 7.3E-12 64.0 15.9 296 1-399 109-405 (415) 66 protein:vir:4600 Length: 415 # 98.6 1.2E-08 7.3E-12 64.0 15.9 296 1-399 109-405 (415) 67 protein:vir:2344 Length: 397 # 98.6 2.1E-08 1.3E-11 62.7 16.9 300 1-399 1-307 (397) 68 protein:vir:78935 Length: 335 98.6 2.4E-07 1.5E-10 56.8 22.7 312 1-399 1-329 (335) 69 protein:vir:105522 Length: 423 98.5 2E-08 1.2E-11 62.8 16.3 294 1-398 1-423 (423) 70 protein:vir:78223 Length: 333 98.5 2.4E-08 1.5E-11 62.4 16.6 305 5-398 1-333 (333) 71 protein:vir:100057 Length: 375 98.5 7.3E-08 4.5E-11 59.7 19.1 325 5-399 1-371 (375) 72 protein:vir:4456 Length: 401 # 98.5 6.3E-09 3.9E-12 65.6 12.8 286 1-398 92-401 (401) 73 protein:vir:104256 Length: 458 98.5 3.2E-08 2E-11 61.7 16.1 296 1-398 151-458 (458) 74 protein:vir:1583 Length: 351 # 98.5 1.3E-07 8.3E-11 58.3 19.3 285 18-399 1-308 (351) 75 protein:vir:78830 Length: 324 98.5 1.7E-08 1.1E-11 63.2 14.4 295 1-399 14-316 (324) 76 protein:vir:96392 Length: 324 98.5 1.7E-08 1.1E-11 63.2 14.4 295 1-399 14-316 (324) 77 protein:vir:94771 Length: 298 98.5 6.3E-08 3.9E-11 60.1 17.1 285 18-397 1-298 (298) 78 protein:vir:191 Length: 385 # 98.5 2.8E-08 1.7E-11 62.0 15.1 283 1-399 89-385 (385) 79 protein:vir:1886 Length: 385 # 98.5 2.8E-08 1.7E-11 62.0 15.1 283 1-399 89-385 (385) 80 protein:vir:5974 Length: 324 # 98.4 3.6E-07 2.3E-10 55.9 21.0 278 18-399 1-304 (324) 81 protein:vir:80684 Length: 315 98.4 8.6E-08 5.3E-11 59.3 17.5 295 11-399 1-308 (315) 82 protein:vir:99749 Length: 324 98.4 3.1E-08 1.9E-11 61.8 14.9 297 1-399 1-316 (324) 83 protein:vir:97148 Length: 324 98.4 1.1E-07 7.1E-11 58.6 18.0 295 1-399 1-316 (324) 84 protein:vir:105374 Length: 423 98.4 1.5E-07 9.5E-11 57.9 18.7 297 18-399 1-339 (423) 85 protein:vir:4830 Length: 397 # 98.4 7.5E-08 4.7E-11 59.6 16.8 287 1-399 94-386 (397) 86 protein:vir:9574 Length: 300 # 98.4 1.4E-07 8.6E-11 58.2 18.0 283 1-398 1-300 (300) 87 protein:vir:104085 Length: 320 98.4 6E-08 3.7E-11 60.2 15.9 302 1-399 2-319 (320) 88 protein:vir:99920 Length: 311 98.4 9.2E-08 5.7E-11 59.2 16.8 300 1-397 1-311 (311) 89 protein:vir:100247 Length: 425 98.4 2.8E-08 1.7E-11 62.0 13.7 287 1-399 117-425 (425) 90 protein:vir:1638 Length: 298 # 98.4 1E-07 6.4E-11 58.9 16.6 280 18-397 1-298 (298) 91 protein:vir:4226 Length: 326 # 98.4 9.8E-08 6.1E-11 59.0 16.0 306 1-399 1-324 (326) 92 protein:vir:103955 Length: 324 98.3 8.5E-08 5.3E-11 59.4 15.0 297 1-399 1-316 (324) 93 protein:vir:97053 Length: 390 98.3 7.5E-08 4.6E-11 59.7 14.6 285 1-396 97-390 (390) 94 protein:vir:8102 Length: 543 # 98.3 2.3E-07 1.4E-10 57.0 17.0 298 1-399 233-543 (543) 95 protein:vir:81070 Length: 390 98.3 2E-07 1.2E-10 57.3 16.0 285 1-396 97-390 (390) 96 protein:vir:4997 Length: 397 # 98.3 4.6E-07 2.8E-10 55.3 17.9 284 1-399 98-386 (397) 97 protein:vir:4953 Length: 397 # 98.3 6.5E-07 4E-10 54.5 18.4 285 1-399 98-386 (397) 98 protein:vir:100135 Length: 418 98.3 2.8E-07 1.7E-10 56.5 16.4 285 1-399 117-416 (418) 99 protein:vir:1268 Length: 397 # 98.2 2E-07 1.3E-10 57.3 14.4 282 1-398 103-397 (397) 100 protein:vir:4856 Length: 293 # 98.2 1.3E-06 8E-10 52.9 18.5 277 12-399 1-282 (293) 101 protein:vir:2430 Length: 318 # 98.2 2.9E-07 1.8E-10 56.4 14.7 302 1-399 2-314 (318) 102 protein:vir:3845 Length: 395 # 98.2 5.1E-07 3.2E-10 55.1 15.7 282 1-399 98-384 (395) 103 protein:vir:105038 Length: 428 98.1 2.9E-06 1.8E-09 50.9 18.6 304 1-398 113-428 (428) 104 protein:vir:6242 Length: 390 # 98.1 4.6E-07 2.9E-10 55.3 14.1 286 1-399 97-390 (390) 105 protein:vir:80376 Length: 435 98.1 1.7E-06 1.1E-09 52.1 16.9 301 1-399 105-434 (435) 106 protein:vir:1433 Length: 435 # 98.0 4.8E-06 2.9E-09 49.8 19.0 302 1-399 118-434 (435) 107 protein:vir:10364 Length: 390 98.0 1.3E-06 7.9E-10 52.9 15.8 282 1-396 96-390 (390) 108 protein:vir:81160 Length: 371 98.0 1.2E-06 7.2E-10 53.1 15.0 289 1-398 67-371 (371) 109 protein:vir:81227 Length: 413 98.0 2.6E-06 1.6E-09 51.2 16.6 297 1-399 101-411 (413) 110 protein:vir:7409 Length: 408 # 98.0 5.7E-06 3.5E-09 49.4 18.0 287 1-399 104-394 (408) 111 protein:vir:8420 Length: 477 # 98.0 1.3E-06 8.2E-10 52.8 14.4 314 1-399 141-472 (477) 112 protein:vir:1328 Length: 392 # 97.9 3.1E-06 1.9E-09 50.8 15.9 285 1-399 97-392 (392) 113 protein:vir:1025 Length: 408 # 97.9 5.7E-06 3.5E-09 49.3 17.1 286 1-399 101-394 (408) 114 protein:vir:3991 Length: 404 # 97.9 1.4E-05 8.5E-09 47.2 18.6 287 1-399 101-394 (404) 115 protein:vir:100172 Length: 394 97.8 9E-06 5.6E-09 48.3 17.4 284 1-399 100-385 (394) 116 protein:vir:101607 Length: 379 97.8 3.7E-06 2.3E-09 50.4 15.1 284 1-398 88-379 (379) 117 protein:vir:94673 Length: 419 97.8 5.1E-06 3.2E-09 49.6 15.7 295 1-399 106-418 (419) 118 protein:vir:6212 Length: 434 # 97.8 6.4E-06 3.9E-09 49.1 16.1 295 1-399 131-430 (434) 119 protein:vir:102944 Length: 330 97.8 1.7E-05 1.1E-08 46.7 19.8 280 1-399 1-310 (330) 120 protein:vir:105004 Length: 392 97.7 1.1E-05 6.6E-09 47.9 16.0 287 1-399 84-387 (392) 121 protein:vir:107593 Length: 392 97.7 1.1E-05 6.6E-09 47.9 16.0 287 1-399 84-387 (392) 122 protein:vir:102873 Length: 392 97.7 1.1E-05 6.6E-09 47.9 16.0 287 1-399 84-387 (392) 123 protein:vir:102082 Length: 392 97.7 1.1E-05 6.6E-09 47.9 16.0 287 1-399 84-387 (392) 124 protein:vir:2504 Length: 305 # 97.7 8.5E-06 5.3E-09 48.4 15.0 293 5-399 1-301 (305) 125 protein:vir:3870 Length: 400 # 97.7 3.9E-06 2.4E-09 50.2 12.9 276 1-399 123-400 (400) 126 protein:vir:5739 Length: 366 # 97.7 2.3E-05 1.5E-08 46.0 17.0 299 1-398 43-366 (366) 127 protein:vir:1383 Length: 421 # 97.5 3.4E-05 2.1E-08 45.1 16.4 277 1-399 105-387 (421) 128 protein:vir:93616 Length: 645 97.5 2.4E-05 1.5E-08 45.9 15.4 311 1-399 321-639 (645) 129 protein:vir:79008 Length: 299 97.5 5.2E-05 3.2E-08 44.1 16.9 283 1-398 1-299 (299) 130 protein:vir:9704 Length: 394 # 97.4 1.6E-05 1E-08 46.8 13.4 273 1-399 116-394 (394) 131 protein:vir:962 Length: 397 # 97.4 1.4E-05 8.4E-09 47.3 12.2 276 1-397 120-397 (397) 132 protein:vir:97031 Length: 402 97.4 8.7E-05 5.4E-08 42.9 23.7 307 1-399 1-334 (402) 133 protein:vir:97331 Length: 319 97.3 8.3E-05 5.2E-08 42.9 16.5 275 1-399 1-297 (319) 134 protein:vir:94800 Length: 319 97.3 8.3E-05 5.2E-08 42.9 16.5 275 1-399 1-297 (319) 135 protein:vir:100884 Length: 389 97.2 9.9E-05 6.2E-08 42.5 15.7 282 1-399 99-383 (389) 136 protein:vir:1781 Length: 221 # 97.2 0.00013 8.3E-08 41.8 16.4 205 129-365 1-221 (221) 137 protein:vir:96762 Length: 632 97.1 4.9E-05 3E-08 44.2 13.1 279 1-397 345-632 (632) 138 protein:vir:95376 Length: 425 96.8 0.0003 1.9E-07 39.9 15.5 284 1-399 120-422 (425) 139 protein:vir:7855 Length: 497 # 96.8 7.3E-05 4.5E-08 43.3 11.5 316 1-399 114-494 (497) 140 protein:vir:101650 Length: 497 96.8 7.3E-05 4.5E-08 43.3 11.5 316 1-399 114-494 (497) 141 protein:vir:105645 Length: 400 96.4 0.00068 4.2E-07 38.0 21.8 309 1-399 1-334 (400) 142 protein:vir:107120 Length: 329 96.3 0.00075 4.7E-07 37.7 17.5 279 1-399 1-309 (329) 143 protein:vir:2106 Length: 430 # 96.2 0.00083 5.1E-07 37.5 15.5 298 18-399 1-430 (430) 144 protein:vir:1084 Length: 437 # 95.8 0.0015 9.2E-07 36.1 13.9 282 1-399 144-430 (437) 145 protein:vir:96978 Length: 387 95.6 0.00051 3.2E-07 38.6 10.0 270 1-399 99-382 (387) 146 protein:vir:94424 Length: 387 95.6 0.00051 3.2E-07 38.6 10.0 270 1-399 99-382 (387) 147 protein:vir:2685 Length: 387 # 95.6 0.00051 3.2E-07 38.6 10.0 270 1-399 99-382 (387) 148 protein:vir:100939 Length: 430 95.3 0.0023 1.4E-06 35.0 14.8 297 18-399 1-430 (430) 149 protein:vir:9265 Length: 430 # 95.3 0.0023 1.4E-06 35.0 14.8 297 18-399 1-430 (430) 150 protein:vir:97255 Length: 310 95.0 0.003 1.8E-06 34.5 16.5 283 25-398 1-310 (310) 151 protein:vir:7019 Length: 401 # 94.6 0.0039 2.4E-06 33.8 21.1 309 1-399 1-334 (401) 152 protein:vir:93881 Length: 387 94.3 0.0041 2.5E-06 33.7 11.7 271 1-399 100-382 (387) 153 protein:vir:9361 Length: 402 # 94.0 0.0031 1.9E-06 34.3 10.4 271 1-399 114-397 (402) 154 protein:vir:80446 Length: 367 93.9 0.0059 3.6E-06 32.8 17.9 301 1-399 1-349 (367) 155 protein:vir:78640 Length: 352 93.3 0.008 5E-06 32.1 12.3 268 1-399 64-347 (352) 156 protein:vir:4092 Length: 390 # 93.2 0.0086 5.3E-06 31.9 13.1 284 1-399 68-369 (390) 157 protein:vir:78920 Length: 290 91.1 0.017 1.1E-05 30.2 12.5 280 24-399 1-290 (290) 158 protein:vir:3158 Length: 321 # 91.1 0.018 1.1E-05 30.2 15.1 282 1-358 1-321 (321) 159 protein:vir:103886 Length: 302 90.6 0.02 1.2E-05 29.9 15.9 280 18-397 1-302 (302) 160 protein:vir:4159 Length: 315 # 89.5 0.026 1.6E-05 29.3 17.9 289 1-381 8-315 (315) 161 protein:vir:4197 Length: 314 # 89.2 0.027 1.7E-05 29.1 17.3 274 5-363 1-314 (314) 162 protein:vir:78387 Length: 349 84.8 0.058 3.6E-05 27.4 19.4 301 18-399 1-329 (349) 163 protein:vir:94933 Length: 330 84.7 0.058 3.6E-05 27.4 15.7 297 1-399 9-330 (330) 164 protein:vir:94989 Length: 349 79.9 0.1 6.2E-05 26.1 19.6 299 18-399 1-329 (349) 165 protein:vir:108211 Length: 318 78.1 0.12 7.3E-05 25.7 15.2 296 1-399 1-310 (318) 166 protein:vir:105464 Length: 346 77.7 0.12 7.6E-05 25.6 14.9 293 28-394 1-346 (346) 167 protein:vir:79928 Length: 393 70.1 0.21 0.00013 24.3 13.3 280 1-375 59-393 (393) 168 protein:vir:95131 Length: 325 67.2 0.26 0.00016 23.8 10.9 288 5-399 1-307 (325) 169 protein:vir:101291 Length: 381 59.6 0.39 0.00024 22.8 9.6 286 1-399 60-373 (381) 170 protein:vir:9509 Length: 381 # 59.6 0.39 0.00024 22.8 9.6 286 1-399 60-373 (381) 171 protein:vir:95963 Length: 395 43.9 0.83 0.00051 21.0 11.6 292 1-399 71-380 (395) 172 protein:vir:80128 Length: 466 42.2 0.89 0.00055 20.9 9.4 283 1-364 109-466 (466) 173 protein:vir:95512 Length: 693 31.4 1.5 0.00093 19.6 14.3 291 1-359 383-693 (693) 174 protein:vir:1991 Length: 305 # 31.1 1.5 0.00094 19.6 9.7 214 11-298 1-305 (305) No 1 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=100.00 E-value=4.1e-185 Score=1031.49 Aligned_cols=390 Identities=56% Similarity=0.962 Sum_probs=377.5 Q ss_pred ccccCCCCCCccccc----ccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCc Q lcl|NC_013692. 10 PMKYNDPANGVESSI----GPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDA 85 (399) Q Consensus 10 ~~~~n~~~~~~~~~i----~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p 85 (399) -++||.|++++.+++ ||||+||||.+|+|+||+|++||++|||++|||||+|||||||||.||++++|||+||||| T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 678999977655554 9999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHH Q lcl|NC_013692. 86 SGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVK 165 (399) Q Consensus 86 ~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~ 165 (399) +|+.++|||+||||||||+||+|+|+|||.|||||||||+|+|++++|||||+|+||||+++|||+||+|++|+++||++ T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeeecccccccc----ccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccC Q lcl|NC_013692. 166 GANEITEDLLQIDLLNSAGTVRYPGAATSDA----EVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 166 ~~~~~t~d~l~~~~l~agt~V~YAg~aTsra----~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) +|+++++|+|++++|++|++|+|||.+++.+ +....++||+++||+++++|+.|||||||+||+||+|+|||+|++ T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~ 240 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGA 240 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcccccc Confidence 9999999999999999999999999988877 555559999999999999999999999999999999999999999 Q ss_pred eeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC---cccccccc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP---NDQVPMHE 318 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~---~~~~~~~~ 318 (399) |||+||||||+|||++|+|+|+||+||||||||++++||+||||+++|||||++|+|+||+|+|+.++. +..+++++ T Consensus 241 s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~ 320 (401) T protein:vir:95 241 TRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVS 320 (401) T ss_pred ceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999986544 45578999 Q ss_pred cCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) +++++||||+||||+|||++|+|+++|+++||+||||+||+++||++||||||||+||||||++++||++||+|||+||| T Consensus 321 ~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 321 GQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred CCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred C Q lcl|NC_013692. 399 L 399 (399) Q Consensus 399 ~ 399 (399) | T Consensus 401 ~ 401 (401) T protein:vir:95 401 L 401 (401) T ss_pred C Confidence 9 No 2 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.71 E-value=4.6e-19 Score=121.09 Aligned_cols=269 Identities=13% Similarity=0.093 Sum_probs=187.9 Q ss_pred CC-cccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |+ +++.++.-+.+.+|..-.++...+.+++.+++... .+.-..|++|++.+|..+.++. .+.||-+..-.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~-~~~eg~~i~~~~------ 73 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDAT-VVPEGQKIPVDK------ 73 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccc-cccCCCccCccc------ Confidence 33 25677777788899988888888899999999864 4666779999999998885544 466765443322 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) ++.+..+++++|||...++||+.......+- ++++.+++...-+. ..|.- T Consensus 74 ----------------------------lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp-~~~~~~~~~~~~a~-~~d~~ 123 (276) T protein:vir:10 74 ----------------------------IETNRREAKIHKIGKGTDITDEALLSGYGDP-QGEAVRQHGLAIAN-KVDND 123 (276) T ss_pred ----------------------------cccceeeEEeehccccccccHHHHHhhccch-HHHHHHHHHHHHHH-HHHHH Confidence 2456789999999999999999998875433 34454444443222 23332 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) ....|++++. +++ ...+|++.+..|...|..+... -++++|||+....| T Consensus 124 ~~~~l~~~~~-----------~~~-~~~~t~d~i~~A~~~lgd~~~~-------------------~~~ivv~p~~~~~L 172 (276) T protein:vir:10 124 VLEALRGTKL-----------TVS-ADIGTLAGLEAAIDTFDDEDLE-------------------PMVLFINPKDAGKL 172 (276) T ss_pred HHHHHhcccc-----------ccc-ccccCHHHHHHHHHHhccccCc-------------------ccEEEEcHHHHHHH Confidence 3344443322 111 2347899999999888765432 27899999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) |.. .++.|+..+++|+. .+.+|+||++-|+|+|+++.+ +.|-.+++|+.| T Consensus 173 ~k~----~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~-------------------------p~~t~~l~~~gA 222 (276) T protein:vir:10 173 RSS----ASDNFTRATELGDN-IIVKGAFGEALGAVIVRSKKL-------------------------DEGEAILAKRGA 222 (276) T ss_pred HHh----cccccccccccccc-ceeccccceecceeEEEcCCC-------------------------CcceEEEEeccc Confidence 865 35899999999976 679999999999999999853 134456999988 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++... + +.+.. - .|| |+.-++-.+.-...|++.+.+++..+.+.-++.- T Consensus 223 i~~~~-~---~~~~v--E--------~dR-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 223 VKLIT-K---RDFFL--E--------TDR-DPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred eeeee-c---CCcee--e--------ccc-chhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 88532 2 11111 1 222 4454444444556789999999999999877755 No 3 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.65 E-value=5e-18 Score=115.45 Aligned_cols=270 Identities=14% Similarity=0.099 Sum_probs=182.2 Q ss_pred CC-cccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |+ +.+.++.-+.+..|..-.++...+.+++.+++... .+..+.|+||++.+|..+.++ ..+.||-+-.-++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda-~~~~eg~~i~~~~------ 73 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDA-ADVAEGGEISLDK------ 73 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccc-cccCCCCccChhh------ Confidence 22 24566666666678877777787889999999764 466677999999999877444 4577775443323 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) ++.+..+++++|+|...++||+........ +++++.+++ +.-..+.+ T Consensus 74 ----------------------------lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d-~~~~~~~~~----a~~~a~~~ 120 (272) T protein:vir:36 74 ----------------------------IGTTTKSVTIKKAAKGTEITDEAALSGYGD-PIGESNKQL----GLSLANKV 120 (272) T ss_pred ----------------------------cCCcceeEeeehhhccccccHHHHhhccch-HHHHHHHHH----HHHHHHHH Confidence 244578999999999999999887765432 344444333 33334455 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) ..+++++-.. ...+++ ...+++.|..|...|.....+ .++.+|||.....| T Consensus 121 d~~i~~~l~~--------~~~~~~--~~~~~d~i~~A~~~lgd~~~~-------------------~~~ivv~p~~~~~L 171 (272) T protein:vir:36 121 DDDLLSAAKT--------TSQTVS--TKANVDGVQAALDIFNDEDAQ-------------------AYVLIVNPKDAAKI 171 (272) T ss_pred HHHHHHHhcc--------cccccc--ccccHHHHHHHHHHhhhcCCC-------------------ceEEEEcHHHHHHH Confidence 5566633211 112222 457899999998888765543 27899999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) +. ++.|..+..++..+.+.+|+||++-|+|+|++..+- . +. .+|..+++|+.| T Consensus 172 ~k------~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p----~----~~-------------~~~~~~~~~~gA 224 (272) T protein:vir:36 172 RK------DANAKNIGSEVGANALINGTYADVLGAQIVRSKKLA----E----GS-------------ALMFKIVSNSPA 224 (272) T ss_pred hc------ccccccccccccccceeeeccceecCeeEEEeCCCC----C----Cc-------------eeEEEEEecccc Confidence 84 678999988888889999999999999999999642 1 10 168888999999 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) ++...- +.+..+ ++| |+.-+.=..--...|++.+++++-.+.+...-- T Consensus 225 ~~~~~~----~~~~vE----------~~R-~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 225 LKLVLK----RGVQVE----------TDR-DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeeeec----CCcccc----------ccc-chhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 986422 222221 111 232222222224668999999998888765544 No 4 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.64 E-value=1.5e-17 Score=112.84 Aligned_cols=268 Identities=12% Similarity=0.073 Sum_probs=178.6 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCC- Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQG- 82 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteG- 82 (399) |.| ..+.++..+.+..|..-+++...+.+++++++... .++-..|+||++.+|..+.++.. ++|| T Consensus 1 ma~------------~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~-~~~g~ 67 (274) T protein:vir:97 1 MPQ------------GLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGE 67 (274) T ss_pred CCc------------cceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc-ccCCC Confidence 222 24677777788899999999988999999999774 57777799999999987655443 3433 Q ss_pred -CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHH Q lcl|NC_013692. 83 -IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTT 161 (399) Q Consensus 83 -V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~ 161 (399) |+|. + ++.+..+++++|+|.-.+++|+........ +++++.+ T Consensus 68 ~i~~~--~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-p~~~~~~ 110 (274) T protein:vir:97 68 KIPTD--I----------------------------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred ccccc--c----------------------------------cccceeEEEeeeecceecccHHHHHhccch-HHHHHHH Confidence 3222 1 244678999999999999999988776442 3454444 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccC Q lcl|NC_013692. 162 EMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 162 el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) ++...-+. ..|.-..+.++.+++ .++ +..++++.+..|...|..+... T Consensus 111 ~~a~a~a~-~vd~~~~~~l~~a~~-----------~~~-~~~~~~d~i~dA~~~l~d~~~~------------------- 158 (274) T protein:vir:97 111 QHGLAHAN-KVDNDVLEALMGAKL-----------TVN-ADITKLNGLQSAIDKFNDEDLE------------------- 158 (274) T ss_pred HHHHHHHH-HHHHHHHHHHhccCc-----------ccc-ccccCHHHHHHHHHHhhccCCC------------------- Confidence 33332222 222222333333222 121 2457899999999888765432 Q ss_pred eeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGG 321 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~ 321 (399) -++.+|||.....|+.- ....|++.+++|+ ..+.+|.||++-|||+|+++.+- T Consensus 159 ~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p---------------------- 211 (274) T protein:vir:97 159 PMVLFVNPLDAGKLRGD----ASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLE---------------------- 211 (274) T ss_pred ceEEEeCHHHHHHHHhh----hhhhccccCcccc-cceeccccceecCeeEEEcCCCC---------------------- Confidence 37899999999999852 1137999999998 46899999999999999998531 Q ss_pred cceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe-cCC Q lcl|NC_013692. 322 KYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTV-ARL 399 (399) Q Consensus 322 ~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~-A~~ 399 (399) +|-.+++|+.|++... + ..+.. + .+| ||.-+.-...-...|++.++|++-.+.+.-+ |-+ T Consensus 212 ---~~t~~l~~~gA~~~~~-~---~~~~v--------E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:97 212 ---AGTAILAKKGAVKLIL-K---RDFFL--------E--VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred ---cceEEEEeCcceEeee-c---CCcee--------c--ccc-chhhcccEEEEEEEEEEEEEcCCceEEEecCcccc Confidence 3556799999998532 2 11111 1 222 4444333333445789999999888886543 344 No 5 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.64 E-value=1.5e-17 Score=112.84 Aligned_cols=268 Identities=12% Similarity=0.073 Sum_probs=178.6 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCC- Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQG- 82 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteG- 82 (399) |.| ..+.++..+.+..|..-+++...+.+++++++... .++-..|+||++.+|..+.++.. ++|| T Consensus 1 ma~------------~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~-~~~g~ 67 (274) T protein:vir:94 1 MPQ------------GLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQV-VAEGE 67 (274) T ss_pred CCc------------cceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCcccc-ccCCC Confidence 222 24677777788899999999988999999999774 57777799999999987655443 3433 Q ss_pred -CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHH Q lcl|NC_013692. 83 -IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTT 161 (399) Q Consensus 83 -V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~ 161 (399) |+|. + ++.+..+++++|+|.-.+++|+........ +++++.+ T Consensus 68 ~i~~~--~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-p~~~~~~ 110 (274) T protein:vir:94 68 KIPTD--I----------------------------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred ccccc--c----------------------------------cccceeEEEeeeecceecccHHHHHhccch-HHHHHHH Confidence 3222 1 244678999999999999999988776442 3454444 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccC Q lcl|NC_013692. 162 EMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 162 el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) ++...-+. ..|.-..+.++.+++ .++ +..++++.+..|...|..+... T Consensus 111 ~~a~a~a~-~vd~~~~~~l~~a~~-----------~~~-~~~~~~d~i~dA~~~l~d~~~~------------------- 158 (274) T protein:vir:94 111 QHGLAHAN-KVDNDVLEALMGAKL-----------TVN-ADITKLNGLQSAIDKFNDEDLE------------------- 158 (274) T ss_pred HHHHHHHH-HHHHHHHHHHhccCc-----------ccc-ccccCHHHHHHHHHHhhccCCC------------------- Confidence 33332222 222222333333222 121 2457899999999888765432 Q ss_pred eeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGG 321 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~ 321 (399) -++.+|||.....|+.- ....|++.+++|+ ..+.+|.||++-|||+|+++.+- T Consensus 159 ~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p---------------------- 211 (274) T protein:vir:94 159 PMVLFVNPLDAGKLRGD----ASTNFTRATELGD-DIIVKGAFGEALGAIIVRTNKLE---------------------- 211 (274) T ss_pred ceEEEeCHHHHHHHHhh----hhhhccccCcccc-cceeccccceecCeeEEEcCCCC---------------------- Confidence 37899999999999852 1137999999998 46899999999999999998531 Q ss_pred cceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe-cCC Q lcl|NC_013692. 322 KYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTV-ARL 399 (399) Q Consensus 322 ~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~-A~~ 399 (399) +|-.+++|+.|++... + ..+.. + .+| ||.-+.-...-...|++.++|++-.+.+.-+ |-+ T Consensus 212 ---~~t~~l~~~gA~~~~~-~---~~~~v--------E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (274) T protein:vir:94 212 ---AGTAILAKKGAVKLIL-K---RDFFL--------E--VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred ---cceEEEEeCcceEeee-c---CCcee--------c--ccc-chhhcccEEEEEEEEEEEEEcCCceEEEecCcccc Confidence 3556799999998532 2 11111 1 222 4444333333445789999999888886543 344 No 6 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.64 E-value=1.1e-17 Score=113.53 Aligned_cols=270 Identities=11% Similarity=0.095 Sum_probs=181.6 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGI 83 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV 83 (399) |.| ..+.++..+.+..|..-+++...+.+++.+++..- .+.-..|+||++.+|..+.++. -+.+|- T Consensus 1 m~~------------~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~ 67 (274) T protein:vir:96 1 MAQ------------GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGE 67 (274) T ss_pred CCc------------ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCC Confidence 111 14567777777788888888888899999998543 4665669999999998776554 345543 Q ss_pred CcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHH Q lcl|NC_013692. 84 DASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEM 163 (399) Q Consensus 84 ~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el 163 (399) +-.-++ ++.+..+++|+|+|.-.+++|+......+. +++++.+++ T Consensus 68 ~i~~~~----------------------------------lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d-~~~~~~~~~ 112 (274) T protein:vir:96 68 KIPTDI----------------------------------LETKKREAKIRKIAKGTSISDEALLSGYGD-PQGEQVRQH 112 (274) T ss_pred ccchhh----------------------------------cccceeEEEeeeeecceeehHHHHhhccch-HHHHHHHHH Confidence 322222 244677999999999999999998887653 334444444 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCee Q lcl|NC_013692. 164 VKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNAR 243 (399) Q Consensus 164 ~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~y 243 (399) ... ..+.+..++++.... ++. .++ ...++++.+..|...|..+... -+ T Consensus 113 ~~~----~a~~vd~~i~~~l~~------a~~--~~~-~~~~~~d~i~~A~~~lgd~~~~-------------------~~ 160 (274) T protein:vir:96 113 GLA----HANKVDDDVLEALKS------AKL--TVE-ADITKLTGLQTAIDKFNDEDLE-------------------PM 160 (274) T ss_pred HHH----HHHHHHHHHHHHHhc------ccc--ccc-ccccCHHHHHHHHHHhcccccc-------------------cc Confidence 333 333444444433221 111 111 2346899999999888755432 37 Q ss_pred EEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccc Q lcl|NC_013692. 244 ALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKY 323 (399) Q Consensus 244 v~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~ 323 (399) +++|||+....|+... .-.|+..+++|+ ..+.+|+||++-|||+|++.. + T Consensus 161 ~ivv~p~~~~~L~k~~----~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~-------------------------~ 210 (274) T protein:vir:96 161 VLFISPLDAGKLRGDA----TTNFTRATELGD-DVIVKGAFGEALGAVIVRSNK-------------------------L 210 (274) T ss_pred EEEeCHHHHHHHHhhc----cccccccccccc-cceeccccceecCeEEEEeCC-------------------------C Confidence 8999999999998621 127999999997 678999999999999999873 2 Q ss_pred eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 324 SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 324 DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++|-.+++|+.|++... + ..+.. | .+| ||.-+.=.+--...|++.++|++..+.+..++=- T Consensus 211 ~~~t~~l~~~gA~~~~~-~---~~~~v--------E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 211 EAGTAILAKKGAVKLIT-K---RDFFL--------E--TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCceEEEEeccceeeee-c---CCccc--------c--ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 25566899999998532 1 22221 1 222 4444333334447799999999999999887754 No 7 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.64 E-value=1.1e-17 Score=113.53 Aligned_cols=270 Identities=11% Similarity=0.095 Sum_probs=181.6 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGI 83 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV 83 (399) |.| ..+.++..+.+..|..-+++...+.+++.+++..- .+.-..|+||++.+|..+.++. -+.+|- T Consensus 1 m~~------------~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~ 67 (274) T protein:vir:95 1 MAQ------------GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK-VVAEGE 67 (274) T ss_pred CCc------------ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccc-cccCCC Confidence 111 14567777777788888888888899999998543 4665669999999998776554 345543 Q ss_pred CcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHH Q lcl|NC_013692. 84 DASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEM 163 (399) Q Consensus 84 ~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el 163 (399) +-.-++ ++.+..+++|+|+|.-.+++|+......+. +++++.+++ T Consensus 68 ~i~~~~----------------------------------lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d-~~~~~~~~~ 112 (274) T protein:vir:95 68 KIPTDI----------------------------------LETKKREAKIRKIAKGTSISDEALLSGYGD-PQGEQVRQH 112 (274) T ss_pred ccchhh----------------------------------cccceeEEEeeeeecceeehHHHHhhccch-HHHHHHHHH Confidence 322222 244677999999999999999998887653 334444444 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCee Q lcl|NC_013692. 164 VKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNAR 243 (399) Q Consensus 164 ~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~y 243 (399) ... ..+.+..++++.... ++. .++ ...++++.+..|...|..+... -+ T Consensus 113 ~~~----~a~~vd~~i~~~l~~------a~~--~~~-~~~~~~d~i~~A~~~lgd~~~~-------------------~~ 160 (274) T protein:vir:95 113 GLA----HANKVDDDVLEALKS------AKL--TVE-ADITKLTGLQTAIDKFNDEDLE-------------------PM 160 (274) T ss_pred HHH----HHHHHHHHHHHHHhc------ccc--ccc-ccccCHHHHHHHHHHhcccccc-------------------cc Confidence 333 333444444433221 111 111 2346899999999888755432 37 Q ss_pred EEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccc Q lcl|NC_013692. 244 ALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKY 323 (399) Q Consensus 244 v~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~ 323 (399) +++|||+....|+... .-.|+..+++|+ ..+.+|+||++-|||+|++.. + T Consensus 161 ~ivv~p~~~~~L~k~~----~~~f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~-------------------------~ 210 (274) T protein:vir:95 161 VLFISPLDAGKLRGDA----TTNFTRATELGD-DVIVKGAFGEALGAVIVRSNK-------------------------L 210 (274) T ss_pred EEEeCHHHHHHHHhhc----cccccccccccc-cceeccccceecCeEEEEeCC-------------------------C Confidence 8999999999998621 127999999997 678999999999999999873 2 Q ss_pred eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 324 SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 324 DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++|-.+++|+.|++... + ..+.. | .+| ||.-+.=.+--...|++.++|++..+.+..++=- T Consensus 211 ~~~t~~l~~~gA~~~~~-~---~~~~v--------E--~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 211 EAGTAILAKKGAVKLIT-K---RDFFL--------E--TDR-DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CCceEEEEeccceeeee-c---CCccc--------c--ccc-ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 25566899999998532 1 22221 1 222 4444333334447799999999999999887754 No 8 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.63 E-value=1.8e-17 Score=112.34 Aligned_cols=270 Identities=10% Similarity=0.057 Sum_probs=179.1 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccc-cccCcCCCcEEEEEEccCCcCCCccccCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT-FSMPKHYGKEIVRLHYIPLLDDRNVNDQGI 83 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~-~~mPKn~GktIkfrry~pl~~~~t~lteGV 83 (399) |.| ..+.++..+.+..|..-.++...+.+++.+++.. ..+.-+.|+||++.+|..+.++. -+++|- T Consensus 1 ma~------------~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~ 67 (274) T protein:vir:12 1 MAQ------------GLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ-VVAEGE 67 (274) T ss_pred CCc------------ceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccc-cccCCC Confidence 111 1467777788888898888888888999999987 45666779999999998776544 345543 Q ss_pred CcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHH Q lcl|NC_013692. 84 DASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEM 163 (399) Q Consensus 84 ~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el 163 (399) +-.-++ ++.+..+++|+|+|.-.+++|+........ +++++.+++ T Consensus 68 ~i~~~~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~q~ 112 (274) T protein:vir:12 68 KIPTDI----------------------------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVRQH 112 (274) T ss_pred ccchhh----------------------------------cccceeeEEeeeecceeeecHHHHHhcccc-hHHHHHHHH Confidence 322222 244567899999999999999887776432 335554444 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCee Q lcl|NC_013692. 164 VKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNAR 243 (399) Q Consensus 164 ~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~y 243 (399) ...-+. ..|.-....+++++. +++ ...++++.+..|...|..+... -+ T Consensus 113 ~~~~a~-~vd~~~l~~~~~a~~-----------~~~-~~a~~~d~i~dA~~~lgd~~~~-------------------~~ 160 (274) T protein:vir:12 113 GLAHAN-KVDNDVLEALMGAKL-----------TVN-ADITKLNGLQSAIDKFNDEDLE-------------------PM 160 (274) T ss_pred HHHHHH-HHHHHHHHHHhcccc-----------ccc-ccccCHHHHHHHHHHhcccccc-------------------cc Confidence 433222 222222233333222 111 2457899999999888665431 37 Q ss_pred EEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccc Q lcl|NC_013692. 244 ALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKY 323 (399) Q Consensus 244 v~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~ 323 (399) +++|||.....|+... ...|++.+++|+ ..+.+|+||++-|+|+|++..+ T Consensus 161 ~ivv~p~~~~~L~k~~----~~~fv~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~------------------------- 210 (274) T protein:vir:12 161 VLFINPLDAGKLRGDA----STNFTRATELGD-DIIVKGAFGEALGAIIVRSNKL------------------------- 210 (274) T ss_pred EEEeCHHHHHHHHhhh----hhhccccccccc-cceecccceeecCeeEEEeCCC------------------------- Confidence 8999999999998631 137999999997 5679999999999999999743 Q ss_pred eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 324 SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 324 DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .+|-..++|+.|++... + ..+.. - .+| ||.-+.=..--...|++.++|++..+.+..++-= T Consensus 211 p~~t~~l~~~gA~~~~~-~---~~~~v--E--------~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 211 EAGTAILAKKGAVKLIL-K---RDFFL--E--------VAR-DASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred CcceEEEEeccceeeee-c---CCcee--c--------ccc-chhhcccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 13456799999998532 1 22221 1 121 3333322333346689999999999988766543 No 9 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=99.62 E-value=1.2e-16 Score=107.78 Aligned_cols=314 Identities=13% Similarity=0.124 Sum_probs=192.5 Q ss_pred cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhh-ccc---------ccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 11 MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQ-LAD---------TFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 11 ~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~-~a~---------~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.+ |....-+|+.. ..|.+++...+...-.|.. |-. ..++-|+.|.+|.|.=-.+|.- .|.. T Consensus 1 Ma~-----T~~~~~~p~a~-~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g--~gv~ 72 (364) T protein:vir:93 1 MSQ-----TVIPFGDPKAV-KRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRG--KPTY 72 (364) T ss_pred Cce-----eccCcCCHHHH-HHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeeccc--CCcc Confidence 332 22233367745 5788888888877665554 432 2367899999999876666642 2222 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ++-+-+| |...+++-+=++.+-|-.+=+.........-+=-.|.+ .. T Consensus 73 Gd~~leG--------------------------------nee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~-~a 119 (364) T protein:vir:93 73 GDARVEG--------------------------------KEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRR-IA 119 (364) T ss_pred cCceeec--------------------------------cccceeEEeeEEEEeeccccccccCchhhhhhHHHHHH-HH Confidence 3222222 22334444445555555554443322221111112332 34 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcC--------------------------ceeeeccccccccccCCcceecHHHHHHHHH Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSA--------------------------GTVRYPGAATSDAEVDATTEVTYDSLMRLRL 214 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~ag--------------------------t~V~YAg~aTsra~v~~~~~vt~~~lr~a~~ 214 (399) ++.+..-.....|++..-.|.++ ..++|++.+|++++++..+++|+++|+++.. T Consensus 120 r~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~ 199 (364) T protein:vir:93 120 RDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVE 199 (364) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHH Confidence 44444444555666555444332 3678888899999999999999999999999 Q ss_pred HHHhccCc-----cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCc-----cccccccc Q lcl|NC_013692. 215 DLDNARAP-----TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAG-----GATMHGEV 284 (399) Q Consensus 215 ~Lk~nrAp-----k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~-----~~i~~gEI 284 (399) .++..+++ ++..+--++. .+||+|+||....|||.- -+|+|...+|++.+ -|||.||+ T Consensus 200 ~a~~~~~~~~~~~~~~Pv~~~g~--------~~yV~~l~p~q~~~Lr~~----t~~~w~d~qk~A~~~~g~~nPlF~G~~ 267 (364) T protein:vir:93 200 KAAMMQAENPDVANMVPVSIDGD--------DHYVCVMSEYQATDMRTA----AGGTWIDFQKAAAAAEGRNNPIFKGGL 267 (364) T ss_pred HHHHhCCCCCCCcccceeEecCc--------ceeEEEEcchhhhhhhhc----CCHHHHHHHHHhhhcccccCCceecCe Confidence 99887643 2223222222 489999999999999842 26889999998743 45999999 Q ss_pred eeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCC Q lcl|NC_013692. 285 GQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADR 364 (399) Q Consensus 285 G~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~ 364 (399) |.++||-+++.+++-.+-+.|+ ++++.|-=-|.+|..|.+..=-+.+| ..|..+.+.-.. + T Consensus 268 gm~ngvii~~~~~vi~~~~~~~-------------~~~v~~~ralllGaQA~~~a~g~~~g--~~~~w~Ee~~D~---g- 328 (364) T protein:vir:93 268 GMINNVVLHKHRNVIRFNDYGA-------------GANVEAARALFMGRQAGVIAYGTANG--LRFDWEETVKDY---G- 328 (364) T ss_pred eeEcCeEEeccCCccccccccc-------------CccccchhhheecceeeEEEeecCCC--CCceeeecccCC---C- Confidence 9999999999998766644333 23344555689999996633222222 356556555543 2 Q ss_pred CCc-cchhhHHHHH-HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 365 SDP-YGEMGFMSIK-WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 365 ~DP-lgQrg~~gwK-~~~~~~iLn~~~m~~iet~A~~ 399 (399) +.+ .+....+||| .=|-. .|-=.+.|-++|++ T Consensus 329 n~~~i~~~~i~G~kK~rF~~---~DfGvi~idtaa~~ 362 (364) T protein:vir:93 329 NEPAIAAGFIAGMKKARFNN---KDFGVISIDTAAKK 362 (364) T ss_pred CchhhhhhhHhhhhhcccCC---ccceEEEecccccc Confidence 222 5566666666 22221 24445678999999 No 10 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.60 E-value=6.7e-17 Score=109.25 Aligned_cols=271 Identities=12% Similarity=0.081 Sum_probs=179.5 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~l 79 (399) || +++.+.++.-+.+..|..-.++...+.++|.+++... .+.-..|+||++.+|..+.++. -+ T Consensus 1 ~~---------------~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~ 64 (275) T protein:vir:96 1 MA---------------LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAK-VV 64 (275) T ss_pred CC---------------CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccc-cc Confidence 22 2344666666667788888888888899999999754 3555669999999998775544 45 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) .||-+-.-.+ ++.+..+++++|+|.-.+++|+......+. +++++ T Consensus 65 ~~g~~i~~~~----------------------------------lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~ 109 (275) T protein:vir:96 65 PEGEEIPIDL----------------------------------IETKKRQATIRKIGKGTVLTDEALLSGYGD-PKGEA 109 (275) T ss_pred cCCCCcchhh----------------------------------cccceeeEEeehhcccccccHHHHHhhccc-hHHHH Confidence 5554332222 245677899999999999999998877543 23555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+++...-+. ..|.-..+.|+.+++. ++ ...++++.+..|...|..+.. T Consensus 110 ~~~~a~~~a~-~~d~~ll~~l~~a~~~-----------~~-~~~~~~d~i~dA~~~lgd~~~------------------ 158 (275) T protein:vir:96 110 VRQHGLAIAN-KVDNDVLEALQGATLK-----------VE-ADITKLAGLQTAIDKFNDEDL------------------ 158 (275) T ss_pred HHHHHHHHHH-HHHHHHHHHHhccccc-----------cc-ccccCHHHHHHHHHHhccccC------------------ Confidence 4444433222 3333333444444331 11 234689999999988864432 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .-++.+|||+....||.+. ...|++..++|+. .+.+|+||++-|+|+|++..+. T Consensus 159 -~~~~ivv~p~~~~~L~k~~----~~~f~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p-------------------- 212 (275) T protein:vir:96 159 -EPMVLFVNPLDAGKLRASA----TDNFTRATLLGDN-VIVKGAFGEALGAIIVRSNKIK-------------------- 212 (275) T ss_pred -CccEEEeCHHHHHHHHhcc----ccccccccccccc-ceeccccceecCeeEEEeCCCC-------------------- Confidence 1378999999999998752 3579999999975 6789999999999999998531 Q ss_pred CccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEE-ecC Q lcl|NC_013692. 320 GGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKT-VAR 398 (399) Q Consensus 320 ~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet-~A~ 398 (399) +|-.+++|+.|++... + ..++. - .+| ||.-+.-.+--...|++.+++++-.+.++. .|- T Consensus 213 -----~~t~~i~~~gA~~~~~-~---~~~~v--E--------~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 213 -----EGEAILAKRGAVKLIT-K---RDFFL--E--------TER-HASHKSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred -----cceEEEEeccceeeee-c---CCccc--c--------ccc-chhhcCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 3445788999888532 2 12221 1 222 444443334444678999999988887643 344 Q ss_pred C Q lcl|NC_013692. 399 L 399 (399) Q Consensus 399 ~ 399 (399) | T Consensus 273 ~ 273 (275) T protein:vir:96 273 L 273 (275) T ss_pred c Confidence 4 No 11 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.59 E-value=1e-16 Score=108.26 Aligned_cols=268 Identities=10% Similarity=0.062 Sum_probs=177.0 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccc-cccCcCCCcEEEEEEccCCcCCCccccC-- Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT-FSMPKHYGKEIVRLHYIPLLDDRNVNDQ-- 81 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~-~~mPKn~GktIkfrry~pl~~~~t~lte-- 81 (399) |.| +++.++..+.+..|..-+++...+.+++.+++.. ..++-..|+||++.+|..+.++... .| T Consensus 1 ma~------------~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~-~eg~ 67 (274) T protein:vir:93 1 MPQ------------GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVV-AEGE 67 (274) T ss_pred CCc------------cceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccc-cCCC Confidence 222 2456777777788898899888899999999977 4677778999999999877655433 33 Q ss_pred CCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHH Q lcl|NC_013692. 82 GIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTT 161 (399) Q Consensus 82 GV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~ 161 (399) .|+|.. ++.+..+++++|+|.-.+++|+........ +++++.+ T Consensus 68 ~i~~~~------------------------------------it~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~ 110 (274) T protein:vir:93 68 KIPTDI------------------------------------LETKKREAKIRKIAKGTSITDEALLSGYGD-PQGEQVR 110 (274) T ss_pred cccccc------------------------------------cccceeEEEeeeecccccccHHHHHhhccc-hHHHHHH Confidence 333321 345678999999999999999988876443 4454544 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccC Q lcl|NC_013692. 162 EMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 162 el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) ++... ..+.+..++++.-.. ++ .+++ ...++++.+..|...|..+... T Consensus 111 ~~~~~----~a~~~d~~~~~~~~~------a~--~~~~-~~~~~~d~i~dA~~~l~d~~~~------------------- 158 (274) T protein:vir:93 111 QHGLA----HANKVDNDVLEALMG------AK--LTVN-ADITKLNGLQSAIDKFNDEDLE------------------- 158 (274) T ss_pred HHHHH----HHHHHHHHHHHHHhc------cc--cccc-ccccCHHHHHHHHHHhhhccCC------------------- Confidence 44332 333334444422111 11 1122 2457899999998888664321 Q ss_pred eeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGG 321 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~ 321 (399) -++.+|||+....|+.- ....|++.+++|+. .+.+|.||++-|||+|+++.+- T Consensus 159 ~~~ivv~p~~~~~L~k~----~~~~f~~~s~~g~~-~~~~G~ig~~~G~~Vi~s~~~p---------------------- 211 (274) T protein:vir:93 159 PMVLFINPLDAGKLRGD----ASTNFTRATELGDD-IIVKGAFGEALGAIIVRTNKLE---------------------- 211 (274) T ss_pred ccEEEeCHHHHHHHHhh----hhhccccccccccc-ceeecccceecCeeEEEcCCCC---------------------- Confidence 36899999999999841 11379999999984 6799999999999999998631 Q ss_pred cceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec-CC Q lcl|NC_013692. 322 KYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVA-RL 399 (399) Q Consensus 322 ~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A-~~ 399 (399) +|-.+++|+.|++... + ..+.. -. + -||.-+.=..--..+|++.+++++-.+.+.-++ -+ T Consensus 212 ---~~t~~l~~~gai~~~~-~---~~~~v--E~--------~-Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~ 272 (274) T protein:vir:93 212 ---AGTAILAKKGAVKLIL-K---RDFFL--EV--------A-RDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSL 272 (274) T ss_pred ---cceEEEEeCCeEEEEe-c---CCccc--cc--------c-cchhhcccEEEEEEEEEEEEEcCCceEEEeeCcccc Confidence 2445799999998542 1 12221 11 1 133322222222366899999998888876544 33 No 12 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.58 E-value=1.4e-16 Score=107.46 Aligned_cols=267 Identities=13% Similarity=0.085 Sum_probs=176.1 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGI 83 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV 83 (399) |.| .++.++.-+.+..|..-+++...+.+++.+++... .++-..|++|++.+|....++. -..+|- T Consensus 1 ma~------------~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~-~~~~g~ 67 (274) T protein:vir:96 1 MAQ------------GTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ-VIAEGE 67 (274) T ss_pred CCc------------cccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcc-ccCCCC Confidence 211 13455555666788888888888899999998764 4676679999999997555444 334432 Q ss_pred CcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHH Q lcl|NC_013692. 84 DASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEM 163 (399) Q Consensus 84 ~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el 163 (399) +-.-.+ ++....+++|+|+|...+++|+........ +++++.+++ T Consensus 68 ~i~~~~----------------------------------it~~~~~~~i~~~~~~~~i~D~~~~~~~~d-~~~~~~~~~ 112 (274) T protein:vir:96 68 KIPVDQ----------------------------------IGTSKREAKVRKIGKGTELTDEAVLSGFGD-PQGEAVRQH 112 (274) T ss_pred cCchhh----------------------------------cccceeEEEEEeeeceeeecHHHHHhhcch-HHHHHHHHH Confidence 211111 244567899999999999999987765443 445454444 Q ss_pred HHHHHHHHHHHHHHHHH---hcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 164 VKGANEITEDLLQIDLL---NSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 164 ~~~~~~~t~d~l~~~~l---~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) .. -..+.+..+++ +.++. .++ ...++++.|-.|...|..+.-. T Consensus 113 ~~----~~a~~~d~~i~~~l~~a~~-----------~~~-~~~~~~d~i~dA~~~l~d~~~~------------------ 158 (274) T protein:vir:96 113 GL----AIANKVDNDVLEALKGATL-----------TVE-ADITKLDGLQTAIDKFNDEDLE------------------ 158 (274) T ss_pred HH----HHHHHHHHHHHHHHhcCCC-----------CcC-cccccHHHHHHHHHHhcccCCC------------------ Confidence 33 33334444444 33221 121 2457899999998888765432 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) -++.+|||+....|+.+ +...|++..++|+ ..+.+|.||++-|||+|+++.+- T Consensus 159 -~~~ivv~p~~~~~L~k~----~~~~f~~~~~~g~-~~~~~g~ig~~~G~~Vi~s~~~p--------------------- 211 (274) T protein:vir:96 159 -PMVLFVNPLDAGGLRTS----ASDNFTRPTQLGD-NIIVKGAFGEALGAVIVRSNKLN--------------------- 211 (274) T ss_pred -ceEEEeCHHHHHHHHhc----ccccccccccccc-cceeecccceecCeeEEEcCCCC--------------------- Confidence 37899999999999874 2357999999987 47789999999999999998641 Q ss_pred ccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 321 GKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 321 ~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +|-.+++|+.|++... + +.+..+ .+ -||.-+.=..--...|++.++|++..+.+..++-= T Consensus 212 ----~~t~~l~~~gA~~~~~-~---~~~~vE----------~~-Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 212 ----KGEALLAKKGAVKLIT-K---RDFFLE----------KD-RDASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) T ss_pred ----cceEEEEeCcceeeee-c---CCcccc----------cc-cchhhcccEEEEeeEEEEEEEcCccEEEEEcCccc Confidence 2235689999998532 2 222221 11 13332222222236699999999999999887754 No 13 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.57 E-value=1.5e-16 Score=107.37 Aligned_cols=276 Identities=12% Similarity=0.090 Sum_probs=173.3 Q ss_pred CCc-ccccccceehhhhhHHHHHHhhhHHhhhhcccc-cccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NGV-ESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT-FSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~~-~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~-~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |+. ++.++..+.+-.|..-+++...+.+++.+++.. ..++-..|++|++.+|..+.++. -+++|-+..-++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~-~~~~g~~i~~~~------ 73 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQ-DVAEGAAIDYSA------ 73 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcce-eecCCCcCcccc------ Confidence 442 466777778888999999999999999999854 45666679999999998775443 344443221111 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) ++.+..+++|+|+|...+++|+......++ +++++..++...-+. ..|.. T Consensus 74 ----------------------------lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d-~~~~~~~~~a~~~a~-~~d~~ 123 (278) T protein:vir:80 74 ----------------------------LETESVKHGIKKAGKGVKLTDESVLSGYGD-PVEEAQKQIRMAIAS-KVDND 123 (278) T ss_pred ----------------------------cccceeeEeeehhhccccccHHHHhhcccc-HHHHHHHHHHHHHHH-HHHHH Confidence 245678899999999999999988877554 335444443333222 22233 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) ....++++++. ..+. .+.+.. .-.++.+-.+...|... .++..++++|||.....| T Consensus 124 l~~~l~~a~~~-~~~~----~t~~~~-~~~~~~~~da~~~l~~~------------------~~~~~~~ivv~p~~~~~L 179 (278) T protein:vir:80 124 ILEEALTTTLE-VKGA----INIGLI-DKIENTFTDAPDAIEDE------------------SITTTGVLFLNYKDTAKL 179 (278) T ss_pred HHHHHhccccc-cccc----cccchh-hhHHHHHHHHHHhhccc------------------CCCcccEEEECHHHHHHH Confidence 33444443321 1111 111100 01133333333333333 344456799999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) +.. ....|++..++|+. .+.+|+||++-|||+|+++.+- .+-..++++.| T Consensus 180 ~k~----~~~~~~~~~~~g~~-~~~~G~ig~~~G~~Vi~s~~~p-------------------------~~t~~l~~~gA 229 (278) T protein:vir:80 180 REE----AAGSWTKASQLGDD-LLVKGAFGELLGWEIVRTKKLA-------------------------DGNALAVKAGA 229 (278) T ss_pred Hhh----hhhhcccccccccc-ceeeccceeecceeEEEcCCCC-------------------------cceEEEEeccc Confidence 853 23579999999986 6789999999999999999752 12346889998 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++...- +.++.+ .+ -||.-+.-.+--...|++.++|++..+.|-.+|-= T Consensus 230 i~~~~~----~~~~vE----------~~-Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 230 LKTFLK----RNLLAE----------SG-RDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eeeeec----CCcccc----------cc-cchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 885322 222221 12 23333222333346689999999999988777766 No 14 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.50 E-value=8.5e-15 Score=97.73 Aligned_cols=325 Identities=12% Similarity=0.044 Sum_probs=170.2 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |+ --|++-++.| +++.+ .+..+--|.+.+|+.-++.+++..+...++.--.+|+||+|.+.-. .+..+ T Consensus 1 ~~-~~~~~~~~~~------~t~~v-~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~----~~~~d 68 (341) T protein:vir:94 1 MA-LGNTITGPSI------NTQRG-QQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISE----LGVED 68 (341) T ss_pred Cc-chhhhccccc------cchhH-HHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCc----ceeee Confidence 22 1133333334 12222 2334557899999988899999998766655445699999977422 11111 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i 159 (399) .++.+ ..+...++-++++.+|.|+ ..=..++|+-.....- .+++.+ T Consensus 69 --~~~~~------------------------------~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~-d~~~~~ 115 (341) T protein:vir:94 69 --KATDV------------------------------PVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASY-DLRAPY 115 (341) T ss_pred --ecCCC------------------------------ccccccccCceEEEEEeeeeecceeechHHHHhhcc-chHHHH Confidence 11100 0011123445678888554 4445566644332222 233444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeecccc-c-cccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAA-T-SDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~a-T-sra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) ..+....-++.. |.....++++++...-.+.. + .....+.+..++++.|..+.+.|+++..|. T Consensus 116 ~~~~~~aLA~~~-D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~-------------- 180 (341) T protein:vir:94 116 LEAMGYALAKDM-TGSILGLRAAVQNTASQNVFSSSNGAITGNGQAFSFAVFLAARRLLLEADVPE-------------- 180 (341) T ss_pred HHHHHHHHHHHH-HHHHHHHhhhccccccCccccCccccccCchhhhhHHHHHHHHHHHhhcCCCc-------------- Confidence 444444333322 22233444444321111101 1 111122234578899999999999999873 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccC-------- Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVD-------- 309 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~-------- 309 (399) ..++++|+|+...+|+. ++.|......|+ ..+.+|+||++.+|.+++++++-.-.+.+...+ T Consensus 181 ---~gR~lvv~P~~~~~Ll~------~~~~~~~~~~g~-~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~ 250 (341) T protein:vir:94 181 ---EKIVLLISPGQESALFT------IPQFISKDFINN-APIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAE 250 (341) T ss_pred ---cCCEEEeCHHHHHHHhh------chhhhhhhcccc-chhheeeeeeEeceEEEEeccccccccccccccccceeccc Confidence 34889999999999974 688999865554 568999999999999999998643222111000 Q ss_pred CcccccccccCccc----eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 310 PNDQVPMHESGGKY----SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 310 ~~~~~~~~~~~~~~----DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) .....+-.....++ |..--|++=++|=+.+-+.-.. -+...+.++-. +-..=++--|.-++=-|..||+.+| T Consensus 251 ~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~G~~~l 326 (341) T protein:vir:94 251 ATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMD---WAAAVVSKAPR-VTQSFENREQVWLMVGRQAYGARLY 326 (341) T ss_pred ccccccccccccccccccccEEEEEEecccccceeeecch---hhhcccccccc-ccccchhhhhhhhhhhhhhhccccc Confidence 00011111111222 2222222222332222211000 01111222211 0111222233333335889999999 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) |++..+.|++.+.- T Consensus 327 rp~~~v~~~~~~~~ 340 (341) T protein:vir:94 327 RPLHAVNIHTTGDT 340 (341) T ss_pred CcceeEEEecCcCC Confidence 99999999998776 No 15 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.49 E-value=2.3e-15 Score=100.80 Aligned_cols=268 Identities=13% Similarity=0.115 Sum_probs=171.1 Q ss_pred CC-cccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |+ ++++.+.-+.+..|...+++.....+++.+++... .++...|++|++.+|....++. -..||-+..- T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~-~v~eg~~i~~-------- 71 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE-DVAEGEAIPM-------- 71 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcc-cccCCCcccc-------- Confidence 22 23455555666677877777777788999988764 4566679999999986554433 2334422211 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) + .++...+++++++++.+.++||+..... .+.+++++...+...-+ +.+ T Consensus 72 -----------------~---------~~~~~~~~~~~~~~~~~~~itd~~~~~s-~~d~~~~~~~~~~~~~a----~~~ 120 (272) T protein:vir:30 72 -----------------T---------QLGFKKTTMTIKKAGKGVEITDEAILSG-YGDPVGQAAKQIVEAID----HKV 120 (272) T ss_pred -----------------c---------ccccceEEEEeeeeeeeeeecHHHHhhc-cccHHHHHHHHHHHHHH----HHH Confidence 1 1356788999999999999999987553 34465655554444433 333 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) ...++++... ++ ..++ ...+++.|-.|...|..+..+ -.+.+|||+....| T Consensus 121 d~~i~~~~~~------a~--~~~~--~~~t~d~i~da~~~l~~~~~~-------------------~~~~vv~p~~~~~L 171 (272) T protein:vir:30 121 DADVLDALSK------ST--QTVE--ATATVDGVSKALDIFNDEDDA-------------------ETVIVMNPADASTL 171 (272) T ss_pred HHHHHHHhcc------cc--cccc--cccCHHHHHHHHHHHhccCCC-------------------ccEEEEcHHHHHHH Confidence 4444432111 11 1122 335788888888878765442 25799999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) +.. +.+.|....+++.. .+.+|.||++.|+|+|+++.+. .+-++++++.+ T Consensus 172 ~k~----~~~~~~~~~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p-------------------------~~t~~~~~~~a 221 (272) T protein:vir:30 172 RLD----AAKEWLGATEVGAN-RVVSGVYGEVLGVQIVRSRKCP-------------------------KGTAYMVRKGA 221 (272) T ss_pred HHh----cccccccccccccc-ccccccchhhcCeeEEEcCCCC-------------------------cceEEEEcCCe Confidence 864 24678888898875 5789999999999999999752 12356789988 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++...- +.+.++ . + -|+..++-..--..+|++.+++++.++.+..++-= T Consensus 222 ~~~~~~----~~~~ve--~--------~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 222 LRIMLK----RNTMVE--T--------D-RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred EEEEec----CCceee--e--------c-cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 886431 122221 1 1 12322222222235688889999988888766544 No 16 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.49 E-value=2.3e-15 Score=100.80 Aligned_cols=268 Identities=13% Similarity=0.115 Sum_probs=171.1 Q ss_pred CC-cccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |+ ++++.+.-+.+..|...+++.....+++.+++... .++...|++|++.+|....++. -..||-+..- T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~-~v~eg~~i~~-------- 71 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE-DVAEGEAIPM-------- 71 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcc-cccCCCcccc-------- Confidence 22 23455555666677877777777788999988764 4566679999999986554433 2334422211 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) + .++...+++++++++.+.++||+..... .+.+++++...+...-+ +.+ T Consensus 72 -----------------~---------~~~~~~~~~~~~~~~~~~~itd~~~~~s-~~d~~~~~~~~~~~~~a----~~~ 120 (272) T protein:vir:98 72 -----------------T---------QLGFKKTTMTIKKAGKGVEITDEAILSG-YGDPVGQAAKQIVEAID----HKV 120 (272) T ss_pred -----------------c---------ccccceEEEEeeeeeeeeeecHHHHhhc-cccHHHHHHHHHHHHHH----HHH Confidence 1 1356788999999999999999987553 34465655554444433 333 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) ...++++... ++ ..++ ...+++.|-.|...|..+..+ -.+.+|||+....| T Consensus 121 d~~i~~~~~~------a~--~~~~--~~~t~d~i~da~~~l~~~~~~-------------------~~~~vv~p~~~~~L 171 (272) T protein:vir:98 121 DADVLDALSK------ST--QTVE--ATATVDGVSKALDIFNDEDDA-------------------ETVIVMNPADASTL 171 (272) T ss_pred HHHHHHHhcc------cc--cccc--cccCHHHHHHHHHHHhccCCC-------------------ccEEEEcHHHHHHH Confidence 4444432111 11 1122 335788888888878765442 25799999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) +.. +.+.|....+++.. .+.+|.||++.|+|+|+++.+. .+-++++++.+ T Consensus 172 ~k~----~~~~~~~~~~~~~~-~~~~g~ig~i~G~~Vi~s~~~p-------------------------~~t~~~~~~~a 221 (272) T protein:vir:98 172 RLD----AAKEWLGATEVGAN-RVVSGVYGEVLGVQIVRSRKCP-------------------------KGTAYMVRKGA 221 (272) T ss_pred HHh----cccccccccccccc-ccccccchhhcCeeEEEcCCCC-------------------------cceEEEEcCCe Confidence 864 24678888898875 5789999999999999999752 12356789988 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ++...- +.+.++ . + -|+..++-..--..+|++.+++++.++.+..++-= T Consensus 222 ~~~~~~----~~~~ve--~--------~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 222 LRIMLK----RNTMVE--T--------D-RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred EEEEec----CCceee--e--------c-cccccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 886431 122221 1 1 12322222222235688889999988888766544 No 17 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.40 E-value=6.2e-15 Score=98.49 Aligned_cols=231 Identities=15% Similarity=0.086 Sum_probs=158.0 Q ss_pred ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEee Q lcl|NC_013692. 55 SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLE 134 (399) Q Consensus 55 ~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~ 134 (399) +=--|.|+||.|..| .-+...+.||..-.-.+ ++.++-+++|+ T Consensus 1 ~~~~~~Gdtit~P~~---iGda~~v~eG~~i~~~~----------------------------------l~~t~~~atIk 43 (231) T protein:vir:73 1 ENGINLANLCEYPND---IGDAADVAEGGEISLDK----------------------------------IGTTTKSVTIK 43 (231) T ss_pred CccccCCceEEeccc---ccchhhhcCCCcCChhh----------------------------------ccccceeeeEe Confidence 344578999999888 34557788887554433 35567899999 Q ss_pred eecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH Q lcl|NC_013692. 135 KYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL 214 (399) Q Consensus 135 QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~ 214 (399) |+|.-+++||+..+.....-+ + |..++.+....+.+..|++++... +.++....+|++.|.+|.. T Consensus 44 ~~gk~~~itD~a~l~~~gDp~-~----ea~~Q~~~~iA~kvD~di~~~~~~----------a~l~~~~~~t~d~i~~A~~ 108 (231) T protein:vir:73 44 KAAKGTEITDEAALSGYGDPI-G----ESNKQLGLSLANKVDDDLLKAAKT----------TSQTVSTKANVDGVQAALD 108 (231) T ss_pred eeccceeeeHHHHhhccCchH-H----HHHHHHHHHHHHhhhHHHHHhhcc----------ccccccccccHHHHHHHHH Confidence 999999999999988654222 3 455555555556667777865443 2223346689999999998 Q ss_pred HHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEe Q lcl|NC_013692. 215 DLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIV 294 (399) Q Consensus 215 ~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~ 294 (399) .|..+... -++++|||.....||. ++.|.....++...-+++|+||++-++|+|. T Consensus 109 ~fgde~~~-------------------~~vivv~p~~~~~Lrk------~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~ 163 (231) T protein:vir:73 109 IFNDEDAQ-------------------AYVLIVNPKDAAKIRK------DANAKNIGSEVGANALINGTYADVLGAQIVR 163 (231) T ss_pred Hhcccccc-------------------ceEEEEcchHHHhhhh------ccchhhhhhhhccceeeecccceEcceEEEE Confidence 88665432 3899999998888885 5667777666666788999999999999999 Q ss_pred cCcccccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHH Q lcl|NC_013692. 295 NPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFM 374 (399) Q Consensus 295 ~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~ 374 (399) +..+. .|. + .+..+++.+.|.+...= +.+ ..-.|| ||.-+.=.+ T Consensus 164 S~~~~----~~~----~-------------~~~~~i~~~gAl~~~~k----~~~----------~vEtdR-d~~~k~~~i 207 (231) T protein:vir:73 164 SKKLA----EGS----A-------------LMFKIVSNSPALKLVLK----RGV----------QVETDR-DIVTKTTVI 207 (231) T ss_pred cCCCC----CCc----e-------------eeeeEEeeccceeeeec----ccc----------eeeccc-cccccccEE Confidence 98642 111 1 23334555655553221 111 111343 666666666 Q ss_pred HHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 375 SIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 375 gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) --..+|++.+.|+.-.+.|....- T Consensus 208 ~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 208 TADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EEeEEEEEEEEcCccEEEEEeecC Confidence 666789999999998888766555 No 18 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.38 E-value=2.4e-14 Score=95.23 Aligned_cols=263 Identities=13% Similarity=0.095 Sum_probs=169.6 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhcccc-cccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhc Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT-FSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLY 96 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~-~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~ 96 (399) |+ .+.++.-+.+-.|..-..+...+.++|.++|.. ..|+-..|+||.|-.|. +..+...+.||-+-.-.+ T Consensus 1 Ma-~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~~~~------- 71 (270) T protein:vir:95 1 MT-QTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMDTTQ------- 71 (270) T ss_pred CC-ceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeec-CCCccccccCCCccchhh------- Confidence 54 344444444446666677777788999999987 45666679999999997 444555667665433222 Q ss_pred cccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 97 GSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 97 ~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) ++.+.-+++|+|+|.-.++||+..... .|| + +++. .+.+.-..+.+ T Consensus 72 ---------------------------lt~~~~~a~i~~~gk~~~itD~a~~~~~~dp-~-~~~~----~q~a~~~a~~~ 118 (270) T protein:vir:95 72 ---------------------------MSMTTTKVTVKETGKAVEVTQTAIITNVNGT-L-QEAS----RQLAMSLADKV 118 (270) T ss_pred ---------------------------cccchheeeeehhhCcceecHHHHhhhccch-H-HHHH----HHHHHHHHHHH Confidence 245667999999999999999999877 466 3 3343 33333334455 Q ss_pred HHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHH Q lcl|NC_013692. 176 QIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTI 255 (399) Q Consensus 176 ~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di 255 (399) +.++|++... ++ .+.....|++.+-.|...|.... ..-++.+|||.+...| T Consensus 119 d~~li~~l~~------a~----~~~~~~~t~~~~~dA~~~lgd~~-------------------~~~~~i~vhs~~~~~L 169 (270) T protein:vir:95 119 EIDYIAELNK------SK----QTATVSADATGILDAIEVFNSEN-------------------DEDYVLYVNPKDYNKL 169 (270) T ss_pred HHHHHHHhcc------cc----cccccccCHHHHHHHHHHhcccc-------------------CCCcEEEEcHHHHHHH Confidence 5566644332 11 11123467888888876664321 1236899999999999 Q ss_pred HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEcccc Q lcl|NC_013692. 256 EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA 335 (399) Q Consensus 256 ~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A 335 (399) |.. .|+.-.+|++. .+.+|+||.+-|+|+|+.+.. ...|-..++++.| T Consensus 170 rk~-------~~~~~~~~~~~-~~~~G~ig~~~G~~Viv~s~~------------------------~~~~~~~l~~~gA 217 (270) T protein:vir:95 170 VKS-------LFKVGGNVQDR-AISKGDLVEIVGVSDIVKSKR------------------------VSENTAFLQRYGA 217 (270) T ss_pred Hhh-------hcccccccccc-hhcccccceecceeEEEeCCC------------------------CCceeEEEEeccc Confidence 852 47777888875 478999999999998876531 2234567889888 Q ss_pred ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 336 FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 336 fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .+...- +.+.. | .|| ||.-+.=..--...|++.+.++.-.+.|. .+|- T Consensus 218 i~~~~~----~~~~v--------E--tdR-d~~~~~d~i~~~~~y~v~~~~~skvv~~t-~~~a 265 (270) T protein:vir:95 218 MEIVNK----KKPEA--------Y--TDF-DILKRTHLLSTNYHYSVNLKDETGVVKVT-FKPS 265 (270) T ss_pred eeeeec----CCcee--------e--ecc-chhhcccEEEeeeEEEEEEEccceEEEEE-ecCC Confidence 874322 12111 1 222 44444444444566899999988877764 3444 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.28 E-value=1.7e-12 Score=85.07 Aligned_cols=316 Identities=15% Similarity=0.099 Sum_probs=150.4 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGID 84 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~ 84 (399) |.|+-+.-=+.+..-.++++ ..+.+-.|.+.+++.-++.+++.++...+......|+||+|.+.-.. . ..-.++|-+ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~-~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~-~-a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNV-QVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRA-A-VYDKQPQTP 77 (381) T ss_pred CceecccccccCcccchhhH-HhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcc-e-eeeecCCCc Confidence 66655432222221112222 33334578888888888999999998887887778999999774322 1 111222221 Q ss_pred cchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecce-ehhhhhhhhhhhchhHHHHHHHHH Q lcl|NC_013692. 85 ASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFF-REYTQEQLDFDSDPAMEGHVTTEM 163 (399) Q Consensus 85 p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~-~e~Td~~~~t~~D~~L~~~i~~el 163 (399) ...+ .++-.+++++|.|+=.| ..++|+-....+- .+.+.+..++ T Consensus 78 i~~~----------------------------------~~~~~~~~itID~~~~~~~~Idd~D~~~~~~-D~~~~~~~~~ 122 (381) T protein:vir:80 78 VNLQ----------------------------------ARTDSEFTFTVTKYKESSFMIEDIVNTQASY-TLRQYYTKEA 122 (381) T ss_pred cccc----------------------------------ccCCceEEEEEeeeeecceeechHHHHhhcc-ChHHHHHHHH Confidence 1111 12335677888665433 4444432221111 2223334444 Q ss_pred HHHHHHHHHHHHHHHHHhcC----ceeeeccc-----cccccccC-CcceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_013692. 164 VKGANEITEDLLQIDLLNSA----GTVRYPGA-----ATSDAEVD-ATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRM 233 (399) Q Consensus 164 ~~~~~~~t~d~l~~~~l~ag----t~V~YAg~-----aTsra~v~-~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~ 233 (399) ...-+... |......+.++ +.+.+.+. ++.....+ ....++++.|..|.+.|+++..|. T Consensus 123 ~~aLA~~~-D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~---------- 191 (381) T protein:vir:80 123 GYALARDM-DNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQ---------- 191 (381) T ss_pred HHHHHHHH-HHHHHHHHhhcccccccccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCc---------- Confidence 43333322 22233333222 22222211 11111111 123468899999999999999973 Q ss_pred cCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc------CCcc Q lcl|NC_013692. 234 IDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG------VGKA 307 (399) Q Consensus 234 ~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~------aGa~ 307 (399) ..++++|+|+...+|+. ++.|.... |++.+.+.+|+||++.+|++++++++-.-.. +|+. T Consensus 192 -------egR~lvv~P~~~~~Ll~------~~~~~~ad-~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap 257 (381) T protein:vir:80 192 -------EGRIVMVSPAQYIDLLS------INQFISVD-FSQVKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAP 257 (381) T ss_pred -------CCcEEEeCHHHHHHHhh------chhhhhhh-hccchhhhceeeeEEcceEEEeecccccccccceeeecccc Confidence 24889999999999985 67888865 8888889999999999999999988643111 1111 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCC--CCCCccch-------hhHHHHHH Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATA--DRSDPYGE-------MGFMSIKW 378 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~ta--d~~DPlgQ-------rg~~gwK~ 378 (399) ....+.. .+.. .-|+.+|....+.+ .+-.-+++..-..+--+- -.+++=.. ++++-|+ T Consensus 258 ~~~~~~~----~~~~-------~~g~~s~~a~av~~-~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~- 324 (381) T protein:vir:80 258 TQPTPGV----LGSP-------YLPDQAGTANVVNT-GSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWA- 324 (381) T ss_pred ccccccc----cccc-------cccccccceeeeee-eeeeceeeeeeeccceeeecceeeecCCCceeeeehhhhhhh- Confidence 1111100 0111 11222221111110 000000000000000000 00111111 1222232 Q ss_pred HHHHhhccccceEEE---------------------------------------EEecC Q lcl|NC_013692. 379 YYGFMVFRPEWIALL---------------------------------------KTVAR 398 (399) Q Consensus 379 ~~~~~iLn~~~m~~i---------------------------------------et~A~ 398 (399) ++.+-...|-+.- -+..- T Consensus 325 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 325 --TAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred --hhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 4444444443322 11111 No 20 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=99.25 E-value=9.7e-12 Score=80.96 Aligned_cols=337 Identities=12% Similarity=0.097 Sum_probs=182.2 Q ss_pred CCCc---cccccccccCCCCCCcccccccceehhhhhHHHHHHhhh-HHhhhhccc--------ccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGP---VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQLAD--------TFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~---~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~a~--------~~~mPKn~GktIkfrr 68 (399) -.|+ |--...-|| +....-+|+++ .+|. ++....+. .-.+..++. ..++-|+.|.+|.|.= T Consensus 3 ~~~~~~a~~~~~~~lf-----t~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:10 3 TVTSAQANKLYQVALF-----TAANRNRSMVN-ILTE-QQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CcCCcchhhhHHHHHH-----HHHhcCChhHh-hhhh-hhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 1111 111222234 22233355544 3444 44333332 223333333 3456699999999966 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) -.+|.- .|..++-+-+| |...+++-+=++.+.|-..-+-....... T Consensus 76 ~~~L~g--~gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~msq 121 (404) T protein:vir:10 76 MHKLSK--RPTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGRMSQ 121 (404) T ss_pred eeeccc--CCcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCchhh Confidence 666632 22223222222 22334444445555555554433333222 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------------HhcC--ceeeeccccccccc Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDL-----------------------------LNSA--GTVRYPGAATSDAE 197 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~-----------------------------l~ag--t~V~YAg~aTsra~ 197 (399) .-+=-.|.+.-...|..--+.+....+...+ ++|- -..+|+|.+|+.+. T Consensus 122 QRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~ 201 (404) T protein:vir:10 122 QRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ 201 (404) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhh Confidence 1111123332222222222222222222111 2232 23677888999999 Q ss_pred cCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCC---ceehhhc- Q lcl|NC_013692. 198 VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPA---FIPIEKY- 273 (399) Q Consensus 198 v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~---fi~v~kY- 273 (399) |+..+++|+++|+++.+.++...-|. ..+--+++..+. ..+-||+||||....||+. |++ |....++ T Consensus 202 l~stD~~s~~~Id~~~~~~~~~~~pi-~Pv~~~g~~~~~--~~~~yV~~~~p~q~~~Lr~------dt~~~~w~d~q~~A 272 (404) T protein:vir:10 202 IEAADIFSIGLVDNLSLFIDEMAHPL-QPVRLSGDELHG--EDPYYVLYVTPRQWNDWYT------STSGKDWNQMMVRA 272 (404) T ss_pred hhhcccccHHHHHHHHHHHHHhCCCC-cceEeccccccC--ccceEEEEechHHHHHHhh------CCCcHHHHHHHHHH Confidence 99999999999999999998877664 444444444332 2445999999999999995 654 7788775 Q ss_pred -----CCccccccccceeEcCeEEEecCccc--ccccCCcccCCcccccccc-cCccceeEEEEEEccccceeccc-ccC Q lcl|NC_013692. 274 -----AAGGATMHGEVGQLGRFRVIVNPQMM--HWAGVGKAVDPNDQVPMHE-SGGKYSVFPMLCVASEAFTTVGF-ATD 344 (399) Q Consensus 274 -----g~~~~i~~gEIG~i~~~RfV~~~~~~--~~~~aGa~~~~~~~~~~~~-~~~~~DVYp~lV~G~~Afg~v~l-~~~ 344 (399) |..-|||.||.|.++|+=+.+.+.+. ..++.....+.+.....++ ...+..|==-|.+|..|-+. ++ +.+ T Consensus 273 ~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~-A~g~~~ 351 (404) T protein:vir:10 273 VNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALAN-AYGQKA 351 (404) T ss_pred hhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEE-EeeccC Confidence 36689999999999999888877542 2222222222211100000 01122333347888866331 11 222 Q ss_pred CCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_013692. 345 GKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFR---------PEWIALLKTVARL 399 (399) Q Consensus 345 g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~iet~A~~ 399 (399) +..|....+.- | ||.+=-++.++.+|..-.+ |-=.+.|-++||+ T Consensus 352 --g~~~~w~Ee~~-----D----~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 352 --GGHFNMVEKKT-----D----MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred --CCCceeEeecc-----c----cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 23455444443 2 6777778888888888776 4446779999999 No 21 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=99.25 E-value=9.7e-12 Score=80.96 Aligned_cols=337 Identities=12% Similarity=0.097 Sum_probs=182.2 Q ss_pred CCCc---cccccccccCCCCCCcccccccceehhhhhHHHHHHhhh-HHhhhhccc--------ccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGP---VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQLAD--------TFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~---~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~a~--------~~~mPKn~GktIkfrr 68 (399) -.|+ |--...-|| +....-+|+++ .+|. ++....+. .-.+..++. ..++-|+.|.+|.|.= T Consensus 3 ~~~~~~a~~~~~~~lf-----t~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:10 3 TVTSAQANKLYQVALF-----TAANRNRSMVN-ILTE-QQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CcCCcchhhhHHHHHH-----HHHhcCChhHh-hhhh-hhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 1111 111222234 22233355544 3444 44333332 223333333 3456699999999966 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) -.+|.- .|..++-+-+| |...+++-+=++.+.|-..-+-....... T Consensus 76 ~~~L~g--~gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~msq 121 (404) T protein:vir:10 76 MHKLSK--RPTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGRMSQ 121 (404) T ss_pred eeeccc--CCcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCchhh Confidence 666632 22223222222 22334444445555555554433333222 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------------HhcC--ceeeeccccccccc Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDL-----------------------------LNSA--GTVRYPGAATSDAE 197 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~-----------------------------l~ag--t~V~YAg~aTsra~ 197 (399) .-+=-.|.+.-...|..--+.+....+...+ ++|- -..+|+|.+|+.+. T Consensus 122 QRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~ 201 (404) T protein:vir:10 122 QRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ 201 (404) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhh Confidence 1111123332222222222222222222111 2232 23677888999999 Q ss_pred cCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCC---ceehhhc- Q lcl|NC_013692. 198 VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPA---FIPIEKY- 273 (399) Q Consensus 198 v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~---fi~v~kY- 273 (399) |+..+++|+++|+++.+.++...-|. ..+--+++..+. ..+-||+||||....||+. |++ |....++ T Consensus 202 l~stD~~s~~~Id~~~~~~~~~~~pi-~Pv~~~g~~~~~--~~~~yV~~~~p~q~~~Lr~------dt~~~~w~d~q~~A 272 (404) T protein:vir:10 202 IEAADIFSIGLVDNLSLFIDEMAHPL-QPVRLSGDELHG--EDPYYVLYVTPRQWNDWYT------STSGKDWNQMMVRA 272 (404) T ss_pred hhhcccccHHHHHHHHHHHHHhCCCC-cceEeccccccC--ccceEEEEechHHHHHHhh------CCCcHHHHHHHHHH Confidence 99999999999999999998877664 444444444332 2445999999999999995 654 7788775 Q ss_pred -----CCccccccccceeEcCeEEEecCccc--ccccCCcccCCcccccccc-cCccceeEEEEEEccccceeccc-ccC Q lcl|NC_013692. 274 -----AAGGATMHGEVGQLGRFRVIVNPQMM--HWAGVGKAVDPNDQVPMHE-SGGKYSVFPMLCVASEAFTTVGF-ATD 344 (399) Q Consensus 274 -----g~~~~i~~gEIG~i~~~RfV~~~~~~--~~~~aGa~~~~~~~~~~~~-~~~~~DVYp~lV~G~~Afg~v~l-~~~ 344 (399) |..-|||.||.|.++|+=+.+.+.+. ..++.....+.+.....++ ...+..|==-|.+|..|-+. ++ +.+ T Consensus 273 ~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~-A~g~~~ 351 (404) T protein:vir:10 273 VNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALAN-AYGQKA 351 (404) T ss_pred hhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEE-EeeccC Confidence 36689999999999999888877542 2222222222211100000 01122333347888866331 11 222 Q ss_pred CCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_013692. 345 GKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFR---------PEWIALLKTVARL 399 (399) Q Consensus 345 g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~iet~A~~ 399 (399) +..|....+.- | ||.+=-++.++.+|..-.+ |-=.+.|-++||+ T Consensus 352 --g~~~~w~Ee~~-----D----~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 352 --GGHFNMVEKKT-----D----MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred --CCCceeEeecc-----c----cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 23455444443 2 6777778888888888776 4446779999999 No 22 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=99.25 E-value=9.7e-12 Score=80.96 Aligned_cols=337 Identities=12% Similarity=0.097 Sum_probs=182.2 Q ss_pred CCCc---cccccccccCCCCCCcccccccceehhhhhHHHHHHhhh-HHhhhhccc--------ccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGP---VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQLAD--------TFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~---~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~a~--------~~~mPKn~GktIkfrr 68 (399) -.|+ |--...-|| +....-+|+++ .+|. ++....+. .-.+..++. ..++-|+.|.+|.|.= T Consensus 3 ~~~~~~a~~~~~~~lf-----t~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:32 3 TVTSAQANKLYQVALF-----TAANRNRSMVN-ILTE-QQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CcCCcchhhhHHHHHH-----HHHhcCChhHh-hhhh-hhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 1111 111222234 22233355544 3444 44333332 223333333 3456699999999966 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) -.+|.- .|..++-+-+| |...+++-+=++.+.|-..-+-....... T Consensus 76 ~~~L~g--~gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~msq 121 (404) T protein:vir:32 76 MHKLSK--RPTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGRMSQ 121 (404) T ss_pred eeeccc--CCcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCchhh Confidence 666632 22223222222 22334444445555555554433333222 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------------HhcC--ceeeeccccccccc Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDL-----------------------------LNSA--GTVRYPGAATSDAE 197 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~-----------------------------l~ag--t~V~YAg~aTsra~ 197 (399) .-+=-.|.+.-...|..--+.+....+...+ ++|- -..+|+|.+|+.+. T Consensus 122 QRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~ 201 (404) T protein:vir:32 122 QRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ 201 (404) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhh Confidence 1111123332222222222222222222111 2232 23677888999999 Q ss_pred cCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCC---ceehhhc- Q lcl|NC_013692. 198 VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPA---FIPIEKY- 273 (399) Q Consensus 198 v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~---fi~v~kY- 273 (399) |+..+++|+++|+++.+.++...-|. ..+--+++..+. ..+-||+||||....||+. |++ |....++ T Consensus 202 l~stD~~s~~~Id~~~~~~~~~~~pi-~Pv~~~g~~~~~--~~~~yV~~~~p~q~~~Lr~------dt~~~~w~d~q~~A 272 (404) T protein:vir:32 202 IEAADIFSIGLVDNLSLFIDEMAHPL-QPVRLSGDELHG--EDPYYVLYVTPRQWNDWYT------STSGKDWNQMMVRA 272 (404) T ss_pred hhhcccccHHHHHHHHHHHHHhCCCC-cceEeccccccC--ccceEEEEechHHHHHHhh------CCCcHHHHHHHHHH Confidence 99999999999999999998877664 444444444332 2445999999999999995 654 7788775 Q ss_pred -----CCccccccccceeEcCeEEEecCccc--ccccCCcccCCcccccccc-cCccceeEEEEEEccccceeccc-ccC Q lcl|NC_013692. 274 -----AAGGATMHGEVGQLGRFRVIVNPQMM--HWAGVGKAVDPNDQVPMHE-SGGKYSVFPMLCVASEAFTTVGF-ATD 344 (399) Q Consensus 274 -----g~~~~i~~gEIG~i~~~RfV~~~~~~--~~~~aGa~~~~~~~~~~~~-~~~~~DVYp~lV~G~~Afg~v~l-~~~ 344 (399) |..-|||.||.|.++|+=+.+.+.+. ..++.....+.+.....++ ...+..|==-|.+|..|-+. ++ +.+ T Consensus 273 ~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~-A~g~~~ 351 (404) T protein:vir:32 273 VNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALAN-AYGQKA 351 (404) T ss_pred hhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEE-EeeccC Confidence 36689999999999999888877542 2222222222211100000 01122333347888866331 11 222 Q ss_pred CCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_013692. 345 GKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFR---------PEWIALLKTVARL 399 (399) Q Consensus 345 g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~iet~A~~ 399 (399) +..|....+.- | ||.+=-++.++.+|..-.+ |-=.+.|-++||+ T Consensus 352 --g~~~~w~Ee~~-----D----~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 352 --GGHFNMVEKKT-----D----MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred --CCCceeEeecc-----c----cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 23455444443 2 6777778888888888776 4446779999999 No 23 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=99.25 E-value=9.7e-12 Score=80.96 Aligned_cols=337 Identities=12% Similarity=0.097 Sum_probs=182.2 Q ss_pred CCCc---cccccccccCCCCCCcccccccceehhhhhHHHHHHhhh-HHhhhhccc--------ccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGP---VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQLAD--------TFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~---~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~a~--------~~~mPKn~GktIkfrr 68 (399) -.|+ |--...-|| +....-+|+++ .+|. ++....+. .-.+..++. ..++-|+.|.+|.|.= T Consensus 3 ~~~~~~a~~~~~~~lf-----t~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:81 3 TVTSAQANKLYQVALF-----TAANRNRSMVN-ILTE-QQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CcCCcchhhhHHHHHH-----HHHhcCChhHh-hhhh-hhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 1111 111222234 22233355544 3444 44333332 223333333 3456699999999966 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) -.+|.- .|..++-+-+| |...+++-+=++.+.|-..-+-....... T Consensus 76 ~~~L~g--~gv~Gd~~lEG--------------------------------nee~L~~~s~~i~Idq~r~~V~~~g~msq 121 (404) T protein:vir:81 76 MHKLSK--RPTMGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGRMSQ 121 (404) T ss_pred eeeccc--CCcccCceeec--------------------------------cccceeEEeeEEEEeeecccccccCchhh Confidence 666632 22223222222 22334444445555555554433333222 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHH-----------------------------HhcC--ceeeeccccccccc Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDL-----------------------------LNSA--GTVRYPGAATSDAE 197 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~-----------------------------l~ag--t~V~YAg~aTsra~ 197 (399) .-+=-.|.+.-...|..--+.+....+...+ ++|- -..+|+|.+|+.+. T Consensus 122 QRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~ 201 (404) T protein:vir:81 122 QRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ 201 (404) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhh Confidence 1111123332222222222222222222111 2232 23677888999999 Q ss_pred cCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCC---ceehhhc- Q lcl|NC_013692. 198 VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPA---FIPIEKY- 273 (399) Q Consensus 198 v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~---fi~v~kY- 273 (399) |+..+++|+++|+++.+.++...-|. ..+--+++..+. ..+-||+||||....||+. |++ |....++ T Consensus 202 l~stD~~s~~~Id~~~~~~~~~~~pi-~Pv~~~g~~~~~--~~~~yV~~~~p~q~~~Lr~------dt~~~~w~d~q~~A 272 (404) T protein:vir:81 202 IEAADIFSIGLVDNLSLFIDEMAHPL-QPVRLSGDELHG--EDPYYVLYVTPRQWNDWYT------STSGKDWNQMMVRA 272 (404) T ss_pred hhhcccccHHHHHHHHHHHHHhCCCC-cceEeccccccC--ccceEEEEechHHHHHHhh------CCCcHHHHHHHHHH Confidence 99999999999999999998877664 444444444332 2445999999999999995 654 7788775 Q ss_pred -----CCccccccccceeEcCeEEEecCccc--ccccCCcccCCcccccccc-cCccceeEEEEEEccccceeccc-ccC Q lcl|NC_013692. 274 -----AAGGATMHGEVGQLGRFRVIVNPQMM--HWAGVGKAVDPNDQVPMHE-SGGKYSVFPMLCVASEAFTTVGF-ATD 344 (399) Q Consensus 274 -----g~~~~i~~gEIG~i~~~RfV~~~~~~--~~~~aGa~~~~~~~~~~~~-~~~~~DVYp~lV~G~~Afg~v~l-~~~ 344 (399) |..-|||.||.|.++|+=+.+.+.+. ..++.....+.+.....++ ...+..|==-|.+|..|-+. ++ +.+ T Consensus 273 ~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~-A~g~~~ 351 (404) T protein:vir:81 273 VNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLGAQALAN-AYGQKA 351 (404) T ss_pred hhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeecceeEEE-EeeccC Confidence 36689999999999999888877542 2222222222211100000 01122333347888866331 11 222 Q ss_pred CCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcc---------ccceEEEEEecCC Q lcl|NC_013692. 345 GKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFR---------PEWIALLKTVARL 399 (399) Q Consensus 345 g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn---------~~~m~~iet~A~~ 399 (399) +..|....+.- | ||.+=-++.++.+|..-.+ |-=.+.|-++||+ T Consensus 352 --g~~~~w~Ee~~-----D----~g~~~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 352 --GGHFNMVEKKT-----D----MDNRTEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred --CCCceeEeecc-----c----cCchhhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 23455444443 2 6777778888888888776 4446779999999 No 24 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.21 E-value=2.5e-11 Score=78.70 Aligned_cols=322 Identities=12% Similarity=0.075 Sum_probs=178.2 Q ss_pred CCCcccccccccc-CCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC-CCcc Q lcl|NC_013692. 1 MAGPVDNIKPMKY-NDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD-DRNV 78 (399) Q Consensus 1 ~~~~~~~~~~~~~-n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~-~~t~ 78 (399) |.- .+|.--..+ +.+--+..+++.-++..--|.-++|+.=+..-++..+-..+++ ..|++++|.|--.... +.++ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i--~~G~tv~i~~ig~~~~~~~~~ 77 (332) T protein:vir:78 1 MTT-LSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDL--RGGKSKQFMFTGKLSAGYHTP 77 (332) T ss_pred Ccc-cccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccc--cccceEEEEeccceeEeeecC Confidence 321 222222222 2222222222221344446677777776666677677666766 3699999988754421 1222 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) -+ ++++. . +++-++++.+|-|.=-|..+=|.+++.-.+-.|+.+ T Consensus 78 g~-~l~~~--~---------------------------------~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~ 121 (332) T protein:vir:78 78 GT-PIVGD--A---------------------------------GIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAE 121 (332) T ss_pred CC-CCCCC--C---------------------------------CCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHH Confidence 11 11110 0 123345677787766666666777666555456666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-hcCceeeecc--ccccccccCCcceec----HHHHHHHHHHHHhccCccccceeccc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLL-NSAGTVRYPG--AATSDAEVDATTEVT----YDSLMRLRLDLDNARAPTKIKMITGT 231 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l-~agt~V~YAg--~aTsra~v~~~~~vt----~~~lr~a~~~Lk~nrApk~T~ii~~s 231 (399) ++.+....=++.. |.....+| +|+....=++ .+.+.-.++.+...+ ++.|+.+...|.+++.|. T Consensus 122 ~~~~~g~aLA~~~-D~~i~~~l~~aa~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~-------- 192 (332) T protein:vir:78 122 VSKQIGEALATHY-DERIARVLAKASAEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQ-------- 192 (332) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHhhhcccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCc-------- Confidence 6666655555533 33333344 3332211000 011112233333333 466788888999998873 Q ss_pred cccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCcccccccc-ceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 232 RMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGE-VGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 232 ~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gE-IG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) ..+++++.|+.-..|..- .||.|......+..+.+.+|. ||++.+|++++++++-.=.+.....++ T Consensus 193 ---------~gR~~vv~P~~y~~Ll~~----~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~ 259 (332) T protein:vir:78 193 ---------EGRVAVLSPRQYYSLISS----VDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAA 259 (332) T ss_pred ---------cCCEEEeCHHHHHHHHhh----cCceeeeeeccccccceecceeeeEEeeeEEEecCccccCccccccccc Confidence 348999999999999753 367888887778888889985 999999999999997321111111000 Q ss_pred cccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccce Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWI 390 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m 390 (399) . ..........++-.--|+|.++|-|++-++.- ++-+ +-+--|+--|.-.+=-+..|++.+||++.. T Consensus 260 ~-~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~------~~~~------t~~~~~~~~~~d~i~~~~~~G~~v~rPe~~ 326 (332) T protein:vir:78 260 V-TGENNDYQVDASALAGLIFHREAAGCIQSVAP------TIQT------TSGDFNVQYQGDLIVGKLAMGCGSLRTSVA 326 (332) T ss_pred c-cccccccccccccceEEeecccceeeeeeecc------chhh------hhcccchhhhHhhhhhhhhhcCceecccce Confidence 0 00001111222334568889998887654311 0000 001122333444555567899999999999 Q ss_pred EEEEEe Q lcl|NC_013692. 391 ALLKTV 396 (399) Q Consensus 391 ~~iet~ 396 (399) +.|+++ T Consensus 327 v~l~~a 332 (332) T protein:vir:78 327 GSFQAA 332 (332) T ss_pred EEEeeC Confidence 999999 No 25 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.21 E-value=5.3e-12 Score=82.41 Aligned_cols=264 Identities=13% Similarity=0.121 Sum_probs=151.2 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccccCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVNDQGI 83 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~lteGV 83 (399) |.| . .. .+-.|...+++...+.+++.+++... +.=.+.|+||.|++.-.... ..-..+|- T Consensus 1 MA~-------------~-~~----~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~-~d~~~~~~ 61 (273) T protein:vir:79 1 MAF-------------N-NF----IPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTV-KDYKAAGR 61 (273) T ss_pred Ccc-------------h-hh----hHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccc-cccccCCC Confidence 221 1 11 22367888888888999999987443 11235699999988553321 11111221 Q ss_pred CcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHHHHH Q lcl|NC_013692. 84 DASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHVTTE 162 (399) Q Consensus 84 ~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i~~e 162 (399) . ++...++-++++.+|.|+ +.=..++|+-......+ |.+ +.++ T Consensus 62 ~----------------------------------~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~ 105 (273) T protein:vir:79 62 Q----------------------------------TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRA 105 (273) T ss_pred c----------------------------------cCccccccceEEEEEeeecccceeeccHHHHhhccc-HHH-HHHH Confidence 1 111223456889999774 55567777555555553 433 4444 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCe Q lcl|NC_013692. 163 MVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNA 242 (399) Q Consensus 163 l~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~ 242 (399) +...-+. ..|.-...++.++.+ .++ ..+.++. .-.++.|..+.+.|++++.|. .. T Consensus 106 ~~~ala~-~vD~~i~~~~~~a~~-~~~----~~~~~~~--~~~~~~i~~a~~~ld~~~vP~-----------------~~ 160 (273) T protein:vir:79 106 GATALAT-DTDKFIADMLVDNGT-ALT----GSAPSDA--DDAFDLIASALKELTKANVPN-----------------VG 160 (273) T ss_pred HHHHHHH-HHHHHHHHHHhhccc-ccc----cccccch--hhHHHHHHHHHHHhhhccCCc-----------------cC Confidence 4333222 223333344433322 111 1112221 124677999999999999974 24 Q ss_pred eEEEechhhhHHHHHHhhhcCCCC-ceehhhcCCccccccccceeEcCeEEEecCcccccccCCc------ccCCccccc Q lcl|NC_013692. 243 RALYVGSDLVPTIEAMKDNHGNPA-FIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK------AVDPNDQVP 315 (399) Q Consensus 243 yv~~~h~dl~~di~d~~~~~~~p~-fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa------~~~~~~~~~ 315 (399) ++++|+|+...+|+. ++. |......++...+.+|+||++.+|.++++.++-...+... +.+...... T Consensus 161 R~lvv~p~~~~~Ll~------~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~~ 234 (273) T protein:vir:79 161 RVVVVNAEMAFWLRS------SGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID 234 (273) T ss_pred cEEEECHHHHHHHhh------chhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeehh Confidence 889999999999975 344 6667777777788899999999999999987654432111 111111111 Q ss_pred ccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCC Q lcl|NC_013692. 316 MHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGE 359 (399) Q Consensus 316 ~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~ 359 (399) ..+..-..+=|..+|-|.+.||..-++-.+ -++.+..|. T Consensus 235 ~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~-----vv~~~~~g~ 273 (273) T protein:vir:79 235 TVEALRDQDSFSDRIRALHVYGGKVVRPTG-----VVVFNKTGS 273 (273) T ss_pred hhhcccCcccceeeeeeeeeeeeEEecCce-----EEEEeccCC Confidence 111122234477789999999988887444 233444542 No 26 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.20 E-value=6.6e-12 Score=81.87 Aligned_cols=262 Identities=13% Similarity=0.131 Sum_probs=150.5 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-c-cCcCCCcEEEEEEccCCcCC-CccccCCCCcchhhhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-S-MPKHYGKEIVRLHYIPLLDD-RNVNDQGIDASGATIANGN 94 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~-mPKn~GktIkfrry~pl~~~-~t~lteGV~p~g~~~~ngn 94 (399) |... .. .+-.|...++..-.+.+++.++.... + -.+ .|+||+|++.-..... -++....+++ T Consensus 1 MA~~-~~----~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~~~~~d~~~~~~~~~~--------- 65 (273) T protein:vir:10 1 MAFN-NF----IPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVAPTVKDYKAAGRQTSA--------- 65 (273) T ss_pred Ccch-hh----hHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeecccccccccccCCCccCc--------- Confidence 2111 01 12356777777778888999887542 1 234 5999999886554322 1222222222 Q ss_pred hccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 95 LYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITED 173 (399) Q Consensus 95 ~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d 173 (399) -.++-++++.+|.|+ +.=..++|+-.....++ +.+ +.+++...-+. ..| T Consensus 66 ---------------------------~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~alA~-~vD 115 (273) T protein:vir:10 66 ---------------------------DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGATALAT-DTD 115 (273) T ss_pred ---------------------------cccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHHHHHHH-HHH Confidence 123446788888664 66667888666666664 433 45554443332 233 Q ss_pred HHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhH Q lcl|NC_013692. 174 LLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVP 253 (399) Q Consensus 174 ~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~ 253 (399) .-..+++.++.+ .+. ..++++. .-.++.|..+.+.|++++.|. ..++++|+|+... T Consensus 116 ~~i~~~~~~a~~-~~~----~~~~~~~--~~~~~~i~~a~~~ld~~~vP~-----------------~~R~lvv~p~~~~ 171 (273) T protein:vir:10 116 KFIADMLVDNGT-ALT----GSAPTDA--DDAFDLIAKALKELTKANVPN-----------------VGRVVVVNAEMAF 171 (273) T ss_pred HHHHHHHhcccc-ccc----cccccch--hHHHHHHHHHHHHhhhcCCCc-----------------CCCEEEECHHHHH Confidence 333344443332 111 1122222 124678999999999999983 2488899999999 Q ss_pred HHHHHhhhcCCCC-ceehhhcCCccccccccceeEcCeEEEecCcccccccCC------cccCCcccccccccCccceeE Q lcl|NC_013692. 254 TIEAMKDNHGNPA-FIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVG------KAVDPNDQVPMHESGGKYSVF 326 (399) Q Consensus 254 di~d~~~~~~~p~-fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aG------a~~~~~~~~~~~~~~~~~DVY 326 (399) +|+. ++. |.....+++...+-+|+||++.+|.++++.++-...+.. .+.+...+..--+.....+=| T Consensus 172 ~L~~------~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r~~~~~ 245 (273) T protein:vir:10 172 WLRS------SGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSF 245 (273) T ss_pred HHhc------chhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhcccCCCcc Confidence 9986 355 445677778778889999999999999997764332210 011111110011111223345 Q ss_pred EEEEEccccceecccccCCCCCcceEEEecCCC Q lcl|NC_013692. 327 PMLCVASEAFTTVGFATDGKNVKFKIITKRPGE 359 (399) Q Consensus 327 p~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~ 359 (399) .-.|-|...||.--++.++ -+..+..|. T Consensus 246 ~~~v~~~~~yg~~v~~~~~-----~~~l~~~g~ 273 (273) T protein:vir:10 246 SDRIRALHVYGGKVVRPTG-----VVVFNKTGS 273 (273) T ss_pred eeeeeeeeeeeeeEeccce-----EEEEeccCC Confidence 6678888889977776444 223444442 No 27 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.20 E-value=6.6e-12 Score=81.87 Aligned_cols=262 Identities=13% Similarity=0.131 Sum_probs=150.5 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-c-cCcCCCcEEEEEEccCCcCC-CccccCCCCcchhhhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-S-MPKHYGKEIVRLHYIPLLDD-RNVNDQGIDASGATIANGN 94 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~-mPKn~GktIkfrry~pl~~~-~t~lteGV~p~g~~~~ngn 94 (399) |... .. .+-.|...++..-.+.+++.++.... + -.+ .|+||+|++.-..... -++....+++ T Consensus 1 MA~~-~~----~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~-~Gdtv~ip~~~~~~~~d~~~~~~~~~~--------- 65 (273) T protein:vir:10 1 MAFN-NF----IPELWSDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVAPTVKDYKAAGRQTSA--------- 65 (273) T ss_pred Ccch-hh----hHHHHHHHHHHHHHhhhccchhhccccccccc-cCceEEEeecccccccccccCCCccCc--------- Confidence 2111 01 12356777777778888999887542 1 234 5999999886554322 1222222222 Q ss_pred hccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 95 LYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITED 173 (399) Q Consensus 95 ~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d 173 (399) -.++-++++.+|.|+ +.=..++|+-.....++ +.+ +.+++...-+. ..| T Consensus 66 ---------------------------~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~alA~-~vD 115 (273) T protein:vir:10 66 ---------------------------DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGATALAT-DTD 115 (273) T ss_pred ---------------------------cccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHHHHHHH-HHH Confidence 123446788888664 66667888666666664 433 45554443332 233 Q ss_pred HHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhH Q lcl|NC_013692. 174 LLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVP 253 (399) Q Consensus 174 ~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~ 253 (399) .-..+++.++.+ .+. ..++++. .-.++.|..+.+.|++++.|. ..++++|+|+... T Consensus 116 ~~i~~~~~~a~~-~~~----~~~~~~~--~~~~~~i~~a~~~ld~~~vP~-----------------~~R~lvv~p~~~~ 171 (273) T protein:vir:10 116 KFIADMLVDNGT-ALT----GSAPTDA--DDAFDLIAKALKELTKANVPN-----------------VGRVVVVNAEMAF 171 (273) T ss_pred HHHHHHHhcccc-ccc----cccccch--hHHHHHHHHHHHHhhhcCCCc-----------------CCCEEEECHHHHH Confidence 333344443332 111 1122222 124678999999999999983 2488899999999 Q ss_pred HHHHHhhhcCCCC-ceehhhcCCccccccccceeEcCeEEEecCcccccccCC------cccCCcccccccccCccceeE Q lcl|NC_013692. 254 TIEAMKDNHGNPA-FIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVG------KAVDPNDQVPMHESGGKYSVF 326 (399) Q Consensus 254 di~d~~~~~~~p~-fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aG------a~~~~~~~~~~~~~~~~~DVY 326 (399) +|+. ++. |.....+++...+-+|+||++.+|.++++.++-...+.. .+.+...+..--+.....+=| T Consensus 172 ~L~~------~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r~~~~~ 245 (273) T protein:vir:10 172 WLRS------SGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSF 245 (273) T ss_pred HHhc------chhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhcccCCCcc Confidence 9986 355 445677778778889999999999999997764332210 011111110011111223345 Q ss_pred EEEEEccccceecccccCCCCCcceEEEecCCC Q lcl|NC_013692. 327 PMLCVASEAFTTVGFATDGKNVKFKIITKRPGE 359 (399) Q Consensus 327 p~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~ 359 (399) .-.|-|...||.--++.++ -+..+..|. T Consensus 246 ~~~v~~~~~yg~~v~~~~~-----~~~l~~~g~ 273 (273) T protein:vir:10 246 SDRIRALHVYGGKVVRPTG-----VVVFNKTGS 273 (273) T ss_pred eeeeeeeeeeeeeEeccce-----EEEEeccCC Confidence 6678888889977776444 223444442 No 28 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.13 E-value=9.8e-11 Score=75.45 Aligned_cols=328 Identities=12% Similarity=0.088 Sum_probs=177.1 Q ss_pred CCCCCccccc---ccceehhhhhHHHHHHhhhHHhh-hhccc-------------------------ccccCcCCCcEEE Q lcl|NC_013692. 15 DPANGVESSI---GPQIHTRYWYKRALIDAAKEAYF-GQLAD-------------------------TFSMPKHYGKEIV 65 (399) Q Consensus 15 ~~~~~~~~~i---~p~~~t~y~~~k~L~~A~p~lv~-~~~a~-------------------------~~~mPKn~GktIk 65 (399) -+ +..+.+ +|+ .-..|.+.+...+.+.-+| .+|.- ..+|-|+.|.+|. T Consensus 1 ~~--~a~T~~~~~~p~-a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vt 77 (430) T protein:vir:10 1 MT--ASKTTMRYGDPN-AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVR 77 (430) T ss_pred Cc--ceeeecccCChh-HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEE Confidence 11 122333 555 3457777776666553222 33322 3457899999999 Q ss_pred EEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhh Q lcl|NC_013692. 66 RLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQE 145 (399) Q Consensus 66 frry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~ 145 (399) |-=-.+|.-. |...+-+.+| |-..+++-+=+++|.|...=+-.... T Consensus 78 f~L~~~L~g~--gv~Gd~~lEG--------------------------------nee~L~~~~d~l~IDq~R~~V~~gg~ 123 (430) T protein:vir:10 78 FHFVQPANAF--PIMGSEYAEG--------------------------------KGTGLKIGSDQLRVNQARFPVDLGDV 123 (430) T ss_pred EeEeeccccC--ceecCceeec--------------------------------cccceEEEeeEEEEeeeccccccCCc Confidence 9555555322 2222222222 11223444444455554443322221 Q ss_pred hhhhhhchhHHHHHHHHHHHHHHHHHHHHHH------------------------------HHHHhcCc-e-eeeccccc Q lcl|NC_013692. 146 QLDFDSDPAMEGHVTTEMVKGANEITEDLLQ------------------------------IDLLNSAG-T-VRYPGAAT 193 (399) Q Consensus 146 ~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~------------------------------~~~l~agt-~-V~YAg~aT 193 (399) ....-+=-.|.+ ..++.+..=.....|++. .+-++|-+ | .+|+++.+ T Consensus 124 msqQRt~~dlR~-~ar~~L~~w~~~~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~a 202 (430) T protein:vir:10 124 MSQIRNPYDLRR-LGRPKAKWFMDAYLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADA 202 (430) T ss_pred hhhhhhhhHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccc Confidence 111100001221 111222221111222211 12223323 2 67764443 Q ss_pred cc--------cccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCC Q lcl|NC_013692. 194 SD--------AEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNP 265 (399) Q Consensus 194 sr--------a~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p 265 (399) .+ ..++..+++|+++|+++...++..+-|.+-=.|.|..+-|.++ .||+||||....|||. |+ T Consensus 203 t~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~---~yV~~~~p~q~~~Lr~------dt 273 (430) T protein:vir:10 203 ITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIELPPPPVKFEGDEAAEDSP---IRVLLCSPAQYNSFAK------QE 273 (430) T ss_pred cccccccccccchhhhcccCHHHHHHHHHHHHhhCCCCcceEeecccccCCcc---EEEEEechHHHHHHhh------Cc Confidence 32 2356678999999999999999998777666678877777554 4999999999999995 89 Q ss_pred Cceeh-------hhcCCccccccccceeEcCeEEEecCccccccc-----CCcc---cCCcccccccccCccceeEEEEE Q lcl|NC_013692. 266 AFIPI-------EKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG-----VGKA---VDPNDQVPMHESGGKYSVFPMLC 330 (399) Q Consensus 266 ~fi~v-------~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~-----aGa~---~~~~~~~~~~~~~~~~DVYp~lV 330 (399) .|... ...|+.-|||.||+|.++|+=+.+.+..-.|-. -|++ ...+........++.+.|==-|. T Consensus 274 ~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~Rall 353 (430) T protein:vir:10 274 KFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALL 353 (430) T ss_pred chHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeeecCCCccccccCCcccccccccccccccccccchhhhh Confidence 98753 234567899999999999999998875544431 1111 11111111222245566666788 Q ss_pred Eccccceecccc-cCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccc------------cceEEEEEec Q lcl|NC_013692. 331 VASEAFTTVGFA-TDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRP------------EWIALLKTVA 397 (399) Q Consensus 331 ~G~~Afg~v~l~-~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~------------~~m~~iet~A 397 (399) +|..|-. +.+- ..+....|...-+. -| ||.+=-++.++.+|..-.+- -=.+.|-++| T Consensus 354 lGaQA~~-~A~g~~~~~g~~f~w~Ee~-----~D----~g~~~~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa 423 (430) T protein:vir:10 354 LGGQALA-QAWAASEHSGMPFFWSEKD-----MD----HGDKLELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAV 423 (430) T ss_pred ccchhhe-eeeeccCCCCcceeeeeec-----cc----cCchhhhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhhh Confidence 8888642 2221 11122234322222 13 66666677777777665433 3346788999 Q ss_pred CC Q lcl|NC_013692. 398 RL 399 (399) Q Consensus 398 ~~ 399 (399) ++ T Consensus 424 ~~ 425 (430) T protein:vir:10 424 KI 425 (430) T ss_pred hh Confidence 98 No 29 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=99.05 E-value=8.7e-11 Score=75.73 Aligned_cols=258 Identities=11% Similarity=0.046 Sum_probs=147.0 Q ss_pred cccccc---------cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccc---------cccCcCCCcEEEE Q lcl|NC_013692. 5 VDNIKP---------MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT---------FSMPKHYGKEIVR 66 (399) Q Consensus 5 ~~~~~~---------~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~---------~~mPKn~GktIkf 66 (399) |-|+.. -+| +...--.|.++ .|.+|+-..+.+...+..|.+. .++-|+.|.+|.| T Consensus 1 mt~~~~~~~~~~~~~~~f-----t~~~~~~~~vk--~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf 73 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALF-----TAANRNRSMVN--ILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTF 73 (318) T ss_pred CCccCCCChHHHHHHHHH-----HHHhcCChHHH--HHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEE Confidence 333211 122 11122244444 5888887777765555555442 3578999999999 Q ss_pred EEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhh Q lcl|NC_013692. 67 LHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQ 146 (399) Q Consensus 67 rry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~ 146 (399) .=-.+|.-... ..+-+.+| |...+++-+=++.|.|...-+-..... T Consensus 74 ~L~~~L~g~gv--~Gd~~lEG--------------------------------nee~L~~~~d~l~IDq~r~~V~~gg~m 119 (318) T protein:vir:27 74 SIMHKLSKRPT--MGDERVEG--------------------------------RGEDLSHADFSLKINQGRHLVDAGGRM 119 (318) T ss_pred eEeeccccCcc--ccCceeec--------------------------------cccceEEEeeEEEEeeeccccccccch Confidence 76666633222 22222222 112233344444455544443222222 Q ss_pred hhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------Hhc--Cceeeecccccc Q lcl|NC_013692. 147 LDFDSDPAMEGHVTTEMVKGANEITEDLLQIDL------------------------------LNS--AGTVRYPGAATS 194 (399) Q Consensus 147 ~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~------------------------------l~a--gt~V~YAg~aTs 194 (399) ...-+=-.|.+ ..++.+..-.....|++..-- ++| ...++|+|.+|+ T Consensus 120 sqqRt~~dlR~-~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~ 198 (318) T protein:vir:27 120 SQQRTKFNLAS-SARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATS 198 (318) T ss_pred hhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccc Confidence 11111112222 122222222222233322111 233 235889999999 Q ss_pred ccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCC---Cceehh Q lcl|NC_013692. 195 DAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNP---AFIPIE 271 (399) Q Consensus 195 ra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p---~fi~v~ 271 (399) ...|+..+++|+++|+++...++...-|.+-=.+.| ..+++ -.+.||+||||....|||. |. .|.... T Consensus 199 ~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g-~~~~~--~~~~yV~~~~p~q~~~Lrt------dt~~~~w~d~q 269 (318) T protein:vir:27 199 FEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG-DELHG--EDPYYVLYVTPRQWNDWYT------STSGKDWNQMM 269 (318) T ss_pred hhhhhhcccccHHHHHHHHHHHHHhCCCCcceeecc-ccccC--CcceEEEEechHHHHHHhh------cCCCHHHHHHH Confidence 999999999999999999998988766643223333 33332 2445999999999999984 44 588887 Q ss_pred hc------CCccccccccceeEcCeEEEecCcc-cccccCCcccCCcccc Q lcl|NC_013692. 272 KY------AAGGATMHGEVGQLGRFRVIVNPQM-MHWAGVGKAVDPNDQV 314 (399) Q Consensus 272 kY------g~~~~i~~gEIG~i~~~RfV~~~~~-~~~~~aGa~~~~~~~~ 314 (399) ++ |+.-|||.||+|.++|+=+.+.+.+ -.| -+|..+..+-.. T Consensus 270 ~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf-~~G~~v~~~~~~ 318 (318) T protein:vir:27 270 VRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF-YQGQRFWYQRIT 318 (318) T ss_pred HHHHhcccccCCCceecceeeecCEEEeecCCccEEE-cCCCeeeeeecC Confidence 75 3466799999999999999999975 344 366655433221 No 30 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.03 E-value=2.9e-11 Score=78.33 Aligned_cols=289 Identities=14% Similarity=0.107 Sum_probs=166.3 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGID 84 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~ 84 (399) |+-.-....|...+ ++-+.-+-+ .+.++.++.+++.-.+.+++...+|+.+.+.++...-- . T Consensus 1 m~~~~~~~~~~~~t---~~~~~lvP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~--------------~ 62 (297) T protein:vir:95 1 MTVQTFNPENVLVS---QKKDGTLHK-EFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTD--------------G 62 (297) T ss_pred CCcccccccccccc---CCCcceech-hHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcC--------------C Confidence 33222222233222 222222222 34577788888888999999999988766555432111 1 Q ss_pred cchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHH Q lcl|NC_013692. 85 ASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMV 164 (399) Q Consensus 85 p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~ 164 (399) +..++...| +.+.--..++..++.+.++++.+..+|+++.+ +++..|.+.|..+|. T Consensus 63 ~~a~~v~Eg-----------------------~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~l~~~i~~~la 118 (297) T protein:vir:95 63 ISAYWVNET-----------------------EKIKTDKPEVVPVTLKAHKLGIILVTSREALN-YTWKKFFEDMKPQIV 118 (297) T ss_pred ceeEEeecC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHH Confidence 111222211 11111124678899999999999999998877 445567777777777 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeE Q lcl|NC_013692. 165 KGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARA 244 (399) Q Consensus 165 ~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv 244 (399) +.-+.-.++.+..+.=+....-+... .+ .++......+++++|.++...|..+... ++ + T Consensus 119 ~ai~~~~d~a~l~G~g~~~~~gi~~~-~~-~~~~~~~~~~t~~~i~~~~~~l~~~~~~-----------------~~--~ 177 (297) T protein:vir:95 119 EAFYKKIDEAGLLGHDTPFANSVAKA-AK-DANKVIGGPINYDNILKLQDALYDADVE-----------------PN--A 177 (297) T ss_pred HHHHHHHHHHHhcccCCccccccccc-cc-ccceecccccCHHHHHHHHHHhhhccCC-----------------cC--E Confidence 66666455444332211112212221 11 1222223568999999999888765432 12 3 Q ss_pred EEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccce Q lcl|NC_013692. 245 LYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYS 324 (399) Q Consensus 245 ~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~D 324 (399) .+|||.....|+.++|--+ ..++.+..|.+.++..+.++... . .+. T Consensus 178 ~v~~~~~~~~L~~l~d~~G-------------~~i~~~~~~~l~G~Pv~~~~~~~--------~------------~~~- 223 (297) T protein:vir:95 178 FVSKIQNRSALREARDGNK-------------VSIYDKAANTIDGITTVDLKSAR--------F------------EKG- 223 (297) T ss_pred EEEcHHHHHHHHHhhccCC-------------ceeecCCCCcccceeeEeecCCC--------C------------CCc- Confidence 5789999999999876333 33555566777777776554200 0 111 Q ss_pred eEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCC----cc--chhhHHHHH--HHHHHhhccccceEEEEEe Q lcl|NC_013692. 325 VFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSD----PY--GEMGFMSIK--WYYGFMVFRPEWIALLKTV 396 (399) Q Consensus 325 VYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~D----Pl--gQrg~~gwK--~~~~~~iLn~~~m~~iet~ 396 (399) .+++|.-+...++.. +++.+++. ..+.. ....| ++ -|++.+.+| +++++.+++++-.++|+.+ T Consensus 224 ---~~~~gd~s~~~~~~~---~~~~i~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~a 294 (297) T protein:vir:95 224 ---DLLAGDFDNLIYGVP---YNITYKIS--EEGQI-STITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPA 294 (297) T ss_pred ---eEEEEecccEEEEEe---cCeEEEEe--ecccc-ccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeec Confidence 145777666655554 22233222 22210 11122 22 466777777 7889999999999999999 Q ss_pred cCC Q lcl|NC_013692. 397 ARL 399 (399) Q Consensus 397 A~~ 399 (399) .|| T Consensus 295 t~~ 297 (297) T protein:vir:95 295 ERV 297 (297) T ss_pred CCC Confidence 999 No 31 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.03 E-value=4.9e-10 Score=71.62 Aligned_cols=320 Identities=14% Similarity=0.083 Sum_probs=172.0 Q ss_pred CCCccccccccccCCCCCCccccccc--ceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGP--QIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p--~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) ||- ....+..|..+......-+. .+..--|..++++.=+..-++..+-.+|++= .||+++|-|.-..-. .- T Consensus 1 ma~---~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~--~g~s~~~~~iG~~~~--~~ 73 (344) T protein:vir:10 1 MAN---MTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRTQA--AY 73 (344) T ss_pred Ccc---ccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeec--ccceEEEEeeceeEE--Ee Confidence 331 12333334333222111111 1122345677777766667777888888664 599999976633321 01 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) .+.|-++.+ +.. .++-++++.+|-|.=-|..+=|.+++.-.+-.++.. T Consensus 74 ~~~G~~l~~----------t~~----------------------~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~ 121 (344) T protein:vir:10 74 LAPGENLDD----------IRK----------------------DIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSE 121 (344) T ss_pred eecCCCCCC----------CCC----------------------CcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHH Confidence 111221111 111 123355666776655555444555554444345565 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCce-----eeecc--cc-----cccc-ccCCcc---eecHHHHHHHHHHHHhccCc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGT-----VRYPG--AA-----TSDA-EVDATT---EVTYDSLMRLRLDLDNARAP 222 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~-----V~YAg--~a-----Tsra-~v~~~~---~vt~~~lr~a~~~Lk~nrAp 222 (399) ++.+++..=+......+.+.+.+++.. ..-+| .+ +..+ +.+.+. ..=++.|+.|...|+++..| T Consensus 122 ~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP 201 (344) T protein:vir:10 122 YTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVP 201 (344) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCC Confidence 666666555554444444444433221 11111 00 0001 111110 01156688899999999987 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) . ..++++|.|+.-..|.+ ++.|... .|+....+-+|.||++.+|++++++++-.-. T Consensus 202 ~-----------------~gR~~vv~P~~y~~Ll~------~~~~~~~-~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~ 257 (344) T protein:vir:10 202 S-----------------SDRVFYCDPDSYSAILA------ALMPNAA-NYAALIDPEKGSIRNVMGFEVVEVPHLTAGG 257 (344) T ss_pred c-----------------cCCEEEeChHHHHHHhh------ccccccc-ccccccceeeeEEEEEeceEEEecccccccc Confidence 3 24899999999998875 5667655 4898889999999999999999999974321 Q ss_pred cCCcccCCcccc--cccccCcccee----EEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHH Q lcl|NC_013692. 303 GVGKAVDPNDQV--PMHESGGKYSV----FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSI 376 (399) Q Consensus 303 ~aGa~~~~~~~~--~~~~~~~~~DV----Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gw 376 (399) ..+...+.++.. .....+.++.+ -.-|+|=++|-+++-++ .++.+ ...|+--|.-++== T Consensus 258 ~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~----~~~~e-----------~~r~~~~~~d~i~g 322 (344) T protein:vir:10 258 AGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLR----DLALE-----------RARRANFQADQIIA 322 (344) T ss_pred CCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhc----cceee-----------cccchhHHHHHHHH Confidence 111111111100 00111111111 12345555665555442 11111 11245555566666 Q ss_pred HHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 377 KWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 377 K~~~~~~iLn~~~m~~iet~A~ 398 (399) |+.|++.+||++..+.||-..+ T Consensus 323 ~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 323 KYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HhhcccceecccceEEEEeecC Confidence 8999999999999999999999 No 32 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.02 E-value=6.4e-11 Score=76.46 Aligned_cols=307 Identities=11% Similarity=0.071 Sum_probs=168.5 Q ss_pred cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhh Q lcl|NC_013692. 11 MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATI 90 (399) Q Consensus 11 ~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~ 90 (399) |.-.+++|.+..-|-|| .|+...|.--.+.|+...++....- .+|+||++.+-.....-.-.....|+.. T Consensus 1 ~~~~n~ts~~qafi~~E----iWsa~il~~l~~~Lv~~~~~~~~d~--g~GDtV~InsIg~~tV~dY~~~~~i~~d---- 70 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSE----IWADEIEDILHEKLLDVNIARVVDF--PDGDKLTIPSVGTPVVRSRPEQGDFTFD---- 70 (322) T ss_pred CCCCCCcccceEEeehh----hhHHHHHHHhhhhhhhhhhhccccc--CCCCeEEeccccccccccccCCCCcccc---- Confidence 33333555444445454 6888888888888888888765443 4699999977665543333323333322 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI 170 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~ 170 (399) +|+. +.+++..+=++|=.|. ++|+...-..| | ....++....+... T Consensus 71 --------------------~ltt----------~~~~l~IDq~KYfaf~-VdDD~~Qa~~d--l-~~~~~~~aa~ala~ 116 (322) T protein:vir:31 71 --------------------NLDT----------GEISIILRDEVYAGNA-ISKKLRQDSRW--I-SNVGAMLPAEQARA 116 (322) T ss_pred --------------------cCCC----------ceEEEEEehhhhhccc-cchhHHHhhhh--H-HHHHHHHHHHHHHH Confidence 1222 2234444444465554 56533222222 2 22233333333344 Q ss_pred HHHHHHHHHHhcCc---------eeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccC Q lcl|NC_013692. 171 TEDLLQIDLLNSAG---------TVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 171 t~d~l~~~~l~agt---------~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) ..|.-..++|+.|- +|++.- -.+-.....+-...++.|+++...|++.+.|+ . T Consensus 117 ~~D~fva~lL~~gA~~~~~~~~p~vin~~-~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~-----------------~ 178 (322) T protein:vir:31 117 IMERYQTDLLALGNAQFAGQNDPNVINGV-PHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPM-----------------G 178 (322) T ss_pred HHHHHHHHHHHHHhhhhhccCCcceecCC-ccceeccCCCchhhHHHHHHHHHHhccccCCC-----------------C Confidence 56666777775432 222221 00001111123467889999999999999984 3 Q ss_pred eeEEEechhhhHHHHHHhhh---cCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDN---HGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~---~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) .++++|.|.....|..+..| ..|+.|..+.+.|..+.+- =||++.+|++++|-++- .+.....++.....+ T Consensus 179 gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~~~GF~V~~SN~l~----~~~~~i~aG~d~~~t 252 (322) T protein:vir:31 179 GMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRSVYGIDLFVSNLLA----DANETINAGGDARST 252 (322) T ss_pred CeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHHHhceeeeeecccc----ccccccccCcccccc Confidence 59999999998887664333 6789999999998765433 39999999999999862 111111111111233 Q ss_pred cCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe-c Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTV-A 397 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~-A 397 (399) +++++-.|+.+- + +++.++-+..+-+ -|.-++- -+..+-.+-||. +.||+.++|+|=++.+++- + T Consensus 253 ~ag~~n~f~~~~---~-~~~~~~~~~~~~l-----~~~e~~r-~~~~~~d~~~~~----~~~g~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 253 TAGKCNMFMNVS---D-MGLLPFVVAWKEM-----PTTKSFI-DDYNDDLNTATT----ARWGNGLVRDENLVCVLANAD 318 (322) T ss_pred cceeeccccccc---c-hhhhhhhhHhhhh-----hhhhccc-Cccccccceeee----eeecceeecccceEEEEeccc Confidence 445554444321 1 1222222111111 0111111 112333455555 4689999999999988864 4 Q ss_pred CC Q lcl|NC_013692. 398 RL 399 (399) Q Consensus 398 ~~ 399 (399) |+ T Consensus 319 ~~ 320 (322) T protein:vir:31 319 KV 320 (322) T ss_pred cc Confidence 55 No 33 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=99.01 E-value=9.5e-11 Score=75.53 Aligned_cols=303 Identities=14% Similarity=0.068 Sum_probs=157.6 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccc--cCcCCCcEEEEEEccCCcCCCccccCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFS--MPKHYGKEIVRLHYIPLLDDRNVNDQG 82 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~--mPKn~GktIkfrry~pl~~~~t~lteG 82 (399) |..+.-+. +.| --|.+++|..-++.+|+.++....- -..++|+||++++-.. .+..+ | T Consensus 1 m~~~~N~~-----------ltp----~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~d-g 60 (418) T protein:vir:10 1 MAVQDNNL-----------LTD----DVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSAS-G 60 (418) T ss_pred CCcccccc-----------ccH----HHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecc-c Confidence 11111111 123 3788999999999999888765522 2357899999976332 22221 1 Q ss_pred CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEe-eeecceehhhhhhhhhhhchhHHHHHHH Q lcl|NC_013692. 83 IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKL-EKYGFFREYTQEQLDFDSDPAMEGHVTT 161 (399) Q Consensus 83 V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l-~QyG~~~e~Td~~~~t~~D~~L~~~i~~ 161 (399) .. ++ ..+++-+.++.+| ++-..-.+++|+-.....+ ++.. T Consensus 61 -----~~---------------~~--------------~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~-----d~~~ 101 (418) T protein:vir:10 61 -----RT---------------LV--------------KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIM-----QFSE 101 (418) T ss_pred -----CC---------------cc--------------ccccccceEEEEEecccccceeechHHHhhhhh-----HHHH Confidence 11 11 1123345677888 5556667788776554444 2344 Q ss_pred HHHHHHHHHHHHHHHHHHH---hcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_013692. 162 EMVKGANEITEDLLQIDLL---NSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 162 el~~~~~~~t~d~l~~~~l---~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) +.++.++.-..+.+-.+++ ++..+ + .++. ....-.++++..+.+.|+++++|+ T Consensus 102 ~~l~~A~~aLA~~vD~~ia~l~~~a~~---~-~gt~-----gt~~~~~~~i~~a~~~Ld~~~VP~--------------- 157 (418) T protein:vir:10 102 RYLKSGMVQIANQIDRSLALTLKKAFH---S-SGTP-----GVRPGAFIDFANAGAKQTTYAVPQ--------------- 157 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccc---c-cccC-----CcCcchHHHHHHHHHHHHhcCCCC--------------- Confidence 5555555545445455544 33222 1 1111 112235888999999999999985 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc----------cc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK----------AV 308 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa----------~~ 308 (399) ..-|++++.|+....|.+ ++.|. ..+++..+.+-+|+||++.+|.++++.++-.-. +|. .. T Consensus 158 -~G~R~lVv~P~~~~~L~~------~~~~~-~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~t-ag~~~~t~~v~ga~~ 228 (418) T protein:vir:10 158 -DGMRHAVLDPFTCASLSD------EVTKL-FKESMVEQAYKMGYRGNVAAYEVYESQNLPKHT-VGDHGGTPLVNGTVV 228 (418) T ss_pred -CCceEEEeCHHHHHHHhh------hcccc-ccccccchhhheeeeeeeeceEEEEecCCCccc-ccccccceeeecccc Confidence 112888899999888864 45554 367788888999999999999999998854321 121 11 Q ss_pred CCcccc----cccc-c-Ccccee-----------------------------------------EEEEEEccc------- Q lcl|NC_013692. 309 DPNDQV----PMHE-S-GGKYSV-----------------------------------------FPMLCVASE------- 334 (399) Q Consensus 309 ~~~~~~----~~~~-~-~~~~DV-----------------------------------------Yp~lV~G~~------- 334 (399) .+.... ..+. + -.+.|+ ||-|+-+.. T Consensus 229 ~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~ 308 (418) T protein:vir:10 229 NGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENG 308 (418) T ss_pred cceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecccccccccccccccc Confidence 111010 0010 1 112333 222210000 Q ss_pred ------cceecccc-cCC--------------CCC-----cceEEEec----CCCcCCCC-CCccc-------------- Q lcl|NC_013692. 335 ------AFTTVGFA-TDG--------------KNV-----KFKIITKR----PGEATADR-SDPYG-------------- 369 (399) Q Consensus 335 ------Afg~v~l~-~~g--------------~~~-----k~~~ivk~----pG~~tad~-~DPlg-------------- 369 (399) +|..|.=. ..+ .++ .|.++..+ .|...... ++|+. T Consensus 309 ~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~ 388 (418) T protein:vir:10 309 DPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINE 388 (418) T ss_pred ccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccc Confidence 00000000 000 000 12222222 23221111 13332 Q ss_pred hhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 370 EMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 370 Qrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +--.+.|=.+||+..|++||.+||==.|-- T Consensus 389 ~~~~~r~d~l~g~~~~~p~~~~~~~g~~~~ 418 (418) T protein:vir:10 389 QSEIHRIDAVWGADMIYGELALRLWGAASS 418 (418) T ss_pred cceEEEEEeecCceeecccceEEEEeecCC Confidence 222334445999999999998887544333 No 34 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.00 E-value=1.3e-09 Score=69.29 Aligned_cols=309 Identities=9% Similarity=0.043 Sum_probs=176.2 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGID 84 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~ 84 (399) |+|......-.|.-+...+. -.+..--|.-+.+..-+...+|..+-.+|++ ..|+++.|-|--..... -.+.|-. T Consensus 1 m~~~~~~~~t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i--~~G~s~~~~~iG~~~~~--~~~~g~~ 75 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSL--RGTNQLRVDRVGASTIA--GRKAGEE 75 (334) T ss_pred CCCCcCCCccccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeec--cccceEEEeeecceeee--eecCCCC Confidence 77765554444433222222 1223335678888887777888899999988 55999999765443221 1111222 Q ss_pred cchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHH Q lcl|NC_013692. 85 ASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMV 164 (399) Q Consensus 85 p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~ 164 (399) +.++. ++-.+.+.+|-|.=-+-.+=|.+++.-..-.+...++.|++ T Consensus 76 l~~~~----------------------------------~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G 121 (334) T protein:vir:80 76 LVVQK----------------------------------NVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDG 121 (334) T ss_pred CCCCC----------------------------------cccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHH Confidence 22211 12234445554432222222333333222225555666666 Q ss_pred HHHHHHHHHHHHHHHHhcCceee-------eccccccccccCCc---ceecHHH----HHHHHHHHHhccCccccceecc Q lcl|NC_013692. 165 KGANEITEDLLQIDLLNSAGTVR-------YPGAATSDAEVDAT---TEVTYDS----LMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 165 ~~~~~~t~d~l~~~~l~agt~V~-------YAg~aTsra~v~~~---~~vt~~~----lr~a~~~Lk~nrApk~T~ii~~ 230 (399) ..=+......+.+-+++|+.+.. +..+...-..+++. ...+.+. ++.|.+.|.++.-|. T Consensus 122 ~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~------- 194 (334) T protein:vir:80 122 IALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGD------- 194 (334) T ss_pred HHHHHHHHHHHHHHHHHhhhhcccccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCC------- Confidence 55555444455666666654421 10111222222222 1222333 445666777777762 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhc---CCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKY---AAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kY---g~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) ....-++++|.|..-..|.. ++.|+.+ .| ++...+-.|+|+++.+||+++++++-.-...+ T Consensus 195 -------~~~~~R~~vv~P~~y~~Ll~------~~r~~n~-d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~-- 258 (334) T protein:vir:80 195 -------QLMSEGVTLLDPVIFSFLLE------HDRLMNV-EFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITA-- 258 (334) T ss_pred -------CcCCceEEEeChHHHHHHhc------ccccccc-eeccccccccccceeEEEEeceEEEeecCCCCccccc-- Confidence 11135999999999999986 6889888 44 34456889999999999999999863222111 Q ss_pred cCCcccccccccCccceeE-------EEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHH Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVF-------PMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYY 380 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVY-------p~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~ 380 (399) +..++.+.+| ..+++.+.|-+++-+. .+.. + ---|+--|..++=-|..| T Consensus 259 ---------~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~----~~~~--------e---~~~~~~~~~d~i~~~~a~ 314 (334) T protein:vir:80 259 ---------NALGADFNVTDAEVRRKMITFIPSMALISAQVH----PVSA--------Q---FWEEKKDFGHYLDTFQSY 314 (334) T ss_pred ---------cccccccccccccccceEEEEEeCceEEEEEEe----ecce--------e---eeechhhHHHHHHHHHHc Confidence 1123344444 5677888888776653 1111 1 124555688888889999 Q ss_pred HHhhccccceEEEEEecCC Q lcl|NC_013692. 381 GFMVFRPEWIALLKTVARL 399 (399) Q Consensus 381 ~~~iLn~~~m~~iet~A~~ 399 (399) ++.+||+|..+.+|.--.= T Consensus 315 G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 315 NIGQRRPDAVAVHDITVTN 333 (334) T ss_pred CCceeccceEEEEEEeeec Confidence 9999999999888854322 No 35 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.95 E-value=1.3e-10 Score=74.83 Aligned_cols=288 Identities=17% Similarity=0.168 Sum_probs=171.6 Q ss_pred cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhh Q lcl|NC_013692. 11 MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATI 90 (399) Q Consensus 11 ~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~ 90 (399) +=||.-..+++++-+.-+-+.+ .++.++...+..++.+++.+.+|+.+ +.++.+... |.+.+. T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~~~~~~-------------~~a~~v 63 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINI-SEQIITGVKNGSAAMKLAKAVPMTKP---EEEFTFMSG-------------VGAFWV 63 (299) T ss_pred CCcCCCcccccCCCceecchhH-HHHHHHHHHhcchhhhhceeeecCCC---cEEEEEEcC-------------Cceeee Confidence 5666555444444333333333 56777778888899999988888643 333322111 111121 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI 170 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~ 170 (399) . +|+....-..++..|+.+.++++.+.++|+++.+ ++++.|.+.|..+|.+.-+.- T Consensus 64 ~-----------------------E~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~~a~~~~ 119 (299) T protein:vir:41 64 D-----------------------EAERIQTSKPTFTKAKMRSKKMGVIIPTTKENLN-YSVTNFFSLMQAEIVEAFYKK 119 (299) T ss_pred e-----------------------cCccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHHHHHHHHHHHHHHH Confidence 1 1222222335778999999999999999999887 555667776666666555543 Q ss_pred HHHHHHHHHHhcCce----eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEE Q lcl|NC_013692. 171 TEDLLQIDLLNSAGT----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALY 246 (399) Q Consensus 171 t~d~l~~~~l~agt~----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~ 246 (399) .++. +|++-++ =+.. ..+...+.......++++|.++...|..+..+. -..+ T Consensus 120 ~d~a----~l~G~g~~~~~gil~-~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~-------------------~~~v 175 (299) T protein:vir:41 120 FDQA----VFTGVESPYNWNILK-SATDASNLVEETANKYDDLNEAIGLIEAEDLEP-------------------NGIA 175 (299) T ss_pred HHHH----HhhcccCcccccccc-cccccceeeccccccHHHHHHHHHhhhcccCCc-------------------CEEE Confidence 3333 3322211 0111 111112222345678999999988777655431 2368 Q ss_pred echhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeE Q lcl|NC_013692. 247 VGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVF 326 (399) Q Consensus 247 ~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVY 326 (399) |||.....|+.++|--+.|=|.+.- .++.+.+-++.++.++.|. +|+ ++ T Consensus 176 ~n~~~~~~L~~lkd~~G~~l~~~~~---------~~~~~~l~G~PV~~~~~~~----~~~--------------~~---- 224 (299) T protein:vir:41 176 TIRKQRVKYRSTKDGNGMPIFNTAT---------SNGVDDVLGLPIAYTPKYT----FGD--------------KD---- 224 (299) T ss_pred EcHHHHHHHHHhhccCCceeecCCc---------CCCCceecceeeEEecccC----CCC--------------Cc---- Confidence 9999999999998766666665432 3445678889999998753 111 11 Q ss_pred EEEEEccccceecccccCCCCCcceEEEe---cCCCcCCCCCCc--cchhhHHHHH--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 327 PMLCVASEAFTTVGFATDGKNVKFKIITK---RPGEATADRSDP--YGEMGFMSIK--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 327 p~lV~G~~Afg~v~l~~~g~~~k~~~ivk---~pG~~tad~~DP--lgQrg~~gwK--~~~~~~iLn~~~m~~iet~A~~ 399 (399) +.+++|.-++..+++. +++.+++.-- .-+. -..+.| +-|++.+.+| +++++.+++++-+++|+..|-= T Consensus 225 ~~~~~gdfs~~~i~~~---~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 225 ISELVGDWNQAYYGIL---RGVEYEILTEATLTTVA--DETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred eEEEEEecccEEEEEe---cCcEEEEeecccccccc--cccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 2466777776666665 2233433221 1111 011222 3477778877 5789999999999999876666 No 36 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.94 E-value=1.1e-09 Score=69.80 Aligned_cols=318 Identities=13% Similarity=0.047 Sum_probs=179.2 Q ss_pred CCCccccccccccCCCC--CCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC-CCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPA--NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD-DRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~--~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~-~~t 77 (399) || |..-++.+|--+ .+..+++ -.+..--|..+.++.-+...+|..+-.++.+ ..||+++|.|.-.... ..+ T Consensus 1 ma---~~~~~~~~~t~~g~~~~~~d~-~al~ie~~~geV~~~f~~~s~~~~~~~~rti--~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MA---NMNGGQQMGKDQGKGMSAGDK-LALFLKVFGGEVLTAFTRTSVTMNKHLVRSI--QSGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CC---ccccccccccccccCCcccch-HHHHHHHHhHHHHHHHHHHHhhhhhhhheec--cccceEEeeeccceeEeeee Confidence 33 122344443111 1111221 1233345677888777777888888888877 3599999987666532 122 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) |-++-.++ +| +++..+++.+|-|+--+..+=|.+++.-.+-.+.. T Consensus 75 ~G~~l~~~---------------------------------~~--~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs 119 (347) T protein:vir:94 75 PGENLDDK---------------------------------RK--DMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRS 119 (347) T ss_pred cCcCCCCC---------------------------------cC--CccccceEEEEcchhhhhhhhhhHHHHhcCcchHH Confidence 21111110 01 12334566777776555544444444433322444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce-----eeecccc-cc------ccccCCcceec----HHHHHHHHHHHHhccC Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT-----VRYPGAA-TS------DAEVDATTEVT----YDSLMRLRLDLDNARA 221 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~-----V~YAg~a-Ts------ra~v~~~~~vt----~~~lr~a~~~Lk~nrA 221 (399) +++.++...=+..+...+.+.+.+++-. .-.+|.. .+ .+++....... ++.|+.|...|+++.. T Consensus 120 ~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dV 199 (347) T protein:vir:94 120 EYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYV 199 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCC Confidence 4555555444444444455555443221 1112110 01 11222121112 6679999999999998 Q ss_pred ccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccc Q lcl|NC_013692. 222 PTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHW 301 (399) Q Consensus 222 pk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~ 301 (399) |. ..+++++.|..-.+|.... +.....|..-..+-+|.||++.+|++++++++-.| T Consensus 200 P~-----------------~~R~~vv~P~~y~~LLk~~-------~~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~ 255 (347) T protein:vir:94 200 PS-----------------SDRVFYTTPDNYSAILAAL-------MPNAANYQALIDPSTGSIRNVMGFEVIEVPHLTAG 255 (347) T ss_pred CC-----------------CCCEEEeChHHHHHHHHhh-------cccccccccccccccceeEEeeceEEEEcCccccc Confidence 73 2599999999998887532 22234666666678899999999999999998777 Q ss_pred ccCCcccCCccc------ccccccCcccee----EEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchh Q lcl|NC_013692. 302 AGVGKAVDPNDQ------VPMHESGGKYSV----FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEM 371 (399) Q Consensus 302 ~~aGa~~~~~~~------~~~~~~~~~~DV----Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQr 371 (399) .......+++.. ...+..+++|++ ---||+-.+|-+++-+. .+..++ .-|+--|. T Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~----~~~~e~-----------~~~~~~~~ 320 (347) T protein:vir:94 256 GAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLK----DMALER-----------ARRANFQA 320 (347) T ss_pred cCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhc----ccceee-----------eechhhhh Confidence 643322222111 011222334331 12477777777766653 222222 14667788 Q ss_pred hHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 372 GFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 372 g~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) -+.=-|..|++.+||+|.-+.|+.-+- T Consensus 321 ~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 321 DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhcCcccccceeEEEEecCC Confidence 888889999999999999988877655 No 37 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.93 E-value=8.6e-10 Score=70.29 Aligned_cols=317 Identities=14% Similarity=0.052 Sum_probs=167.5 Q ss_pred CCCccccccccccCCCCCCcccccccc--eehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQ--IHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~--~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) ||--+ -.+.+| +..+.-...+.. +.---|..+.+..=+..-++..+-..+++ ..|++++|.|--.... .- T Consensus 1 ma~~~---~~~~~~-t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~--~~G~sv~i~~ig~~t~--~~ 72 (347) T protein:vir:15 1 MANIQ---GGQQIG-TNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTKA--AY 72 (347) T ss_pred CCccc---cCCccc-cccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccc--cccceeEeeeccceee--ee Confidence 43221 233332 111111111111 11123445555544444455555555543 3599999987765422 11 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) .+.|-...+ ++.+.+.++++.+|-|+=-|..+=|.++..-..-.++.. T Consensus 73 ~~~g~~l~~--------------------------------~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~ 120 (347) T protein:vir:15 73 LKPGENLDD--------------------------------KRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAE 120 (347) T ss_pred eccCCCCCC--------------------------------CCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHH Confidence 122222111 111123356667777765555555555555444335554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCce-----eee---ccccc--cccccCCcceec--------HHHHHHHHHHHHhcc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGT-----VRY---PGAAT--SDAEVDATTEVT--------YDSLMRLRLDLDNAR 220 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~-----V~Y---Ag~aT--sra~v~~~~~vt--------~~~lr~a~~~Lk~nr 220 (399) ++.+....=+....+.| ..+|..+.+ ... .|..+ .-...+.+..-+ ++.|+.|.+.|.++. T Consensus 121 ~~~~~g~aLA~~~D~~i-~~~l~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~ 199 (347) T protein:vir:15 121 YTAQLGESLAMAADGAV-LAELAGLVNLPDASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNY 199 (347) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHhhccccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcC Confidence 55554444444333333 333322211 000 01001 011111111111 667888889999999 Q ss_pred CccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccc Q lcl|NC_013692. 221 APTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMH 300 (399) Q Consensus 221 Apk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~ 300 (399) .|. ..+++++.|+.-.+|.. ++.|+... |+..+.+.+|.||++.+|++++++++-. T Consensus 200 VP~-----------------~gR~~vv~P~~y~~LL~------~~~~~~~d-~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~ 255 (347) T protein:vir:15 200 VPA-----------------ADRTFYTTPDNYSAILA------ALMPNAAN-YQALIDHERGTIRNVMGFEVVEVPHLTA 255 (347) T ss_pred CCc-----------------cCCEEEeCHHHHHHHhc------cccccccc-ccccccccceEEEEEeceEEEecccccc Confidence 973 24899999999999974 67787654 7777889999999999999999999743 Q ss_pred cccCCcc----cCCccccc--c-cccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhH Q lcl|NC_013692. 301 WAGVGKA----VDPNDQVP--M-HESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGF 373 (399) Q Consensus 301 ~~~aGa~----~~~~~~~~--~-~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~ 373 (399) ....+.. .+...... . .+....++....|++-++|.|++-++. ++.+.. -|+--|.-. T Consensus 256 ~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~----~~~e~~-----------~~~~~~~d~ 320 (347) T protein:vir:15 256 GGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKD----LALERA-----------RRANYQADQ 320 (347) T ss_pred cccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeec----eeeeec-----------ccchhhhhh Confidence 3221111 11000000 0 001223455678999999999887642 222111 345556666 Q ss_pred HHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 374 MSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 374 ~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +--|..|++.+||++..+.|+ -|= T Consensus 321 i~~~~~~G~~vlrP~~av~~~--~~~ 344 (347) T protein:vir:15 321 IIAKYAMGHGGLRPEAAGAIV--LPK 344 (347) T ss_pred hehhhhcCCceeccccEEEEe--cCC Confidence 666788899999999977773 222 No 38 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.93 E-value=1.2e-09 Score=69.53 Aligned_cols=317 Identities=15% Similarity=0.102 Sum_probs=161.8 Q ss_pred cccccccccCCCCCCcccccccceehh--hhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTR--YWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQG 82 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~--y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteG 82 (399) |.|.+++..+--+ ++-..-+.....| -|.-+.+..=+..-++..+-..+++ ..|++++|-|.-+... ++ T Consensus 1 m~~~~~~~~~t~~-g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i--~~G~sv~i~~iG~~tv------~~ 71 (347) T protein:vir:94 1 MANVPGQKIGTDQ-GKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTI--QNGKSAQFPVMGRTSG------VY 71 (347) T ss_pred CCCCCcccccccc-ccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccc--cccceEEEecccceee------ee Confidence 8888877775322 1111111111111 1222222221122234444455544 3689999966654422 11 Q ss_pred CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHH Q lcl|NC_013692. 83 IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTE 162 (399) Q Consensus 83 V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~e 162 (399) .+| |..+ .+ ++..++-++++.+|-|+=-+..+-|.+++.-..-.++.+++.+ T Consensus 72 ~t~-G~~l---------------~~------------~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~ 123 (347) T protein:vir:94 72 LAP-GERL---------------SD------------KRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQ 123 (347) T ss_pred ecC-CCCc---------------CC------------CCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHH Confidence 111 1000 00 1223344566777777765555555555443333355555555 Q ss_pred HHHHHHHHHHHHHHH--HHHhcCce---eeeccccccccccC---Ccce----e----cHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 163 MVKGANEITEDLLQI--DLLNSAGT---VRYPGAATSDAEVD---ATTE----V----TYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 163 l~~~~~~~t~d~l~~--~~l~agt~---V~YAg~aTsra~v~---~~~~----v----t~~~lr~a~~~Lk~nrApk~T~ 226 (399) +...=+......+.+ ..+.+.+. ..-+|..+ .+.+. .+.. . -++.|+.|.+.|+++..|. T Consensus 124 ~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~-~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~--- 199 (347) T protein:vir:94 124 LGEALAIAADGAVLAEMAILCNLPAASNENIAGLGT-ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPA--- 199 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccccccccCCCcc-cceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCC--- Confidence 554444433333322 12221111 11111000 00000 0000 1 1466888899999999984 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa 306 (399) ..+++++.|+....|.+ ++.|... .|+.-..+.+|.||++.+|++++++++- ..+.+. T Consensus 200 --------------~~R~~vv~P~~~~~Ll~------~~~~~~~-~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp-~~~~t~ 257 (347) T protein:vir:94 200 --------------GDRYFYTTPDNYSAILA------ALMPNAA-NYAALIDPETGNIRNVMGFVVVEVPHLV-QGGAGE 257 (347) T ss_pred --------------CCcEEEeCHHHHHHHhc------cchhhhh-hccccccccccceEEEeceEEEecCccc-cccccc Confidence 24899999999988864 5666664 5888888899999999999999999864 322222 Q ss_pred ccCCccccc--------ccccCccc----eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHH Q lcl|NC_013692. 307 AVDPNDQVP--------MHESGGKY----SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFM 374 (399) Q Consensus 307 ~~~~~~~~~--------~~~~~~~~----DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~ 374 (399) .....+... ..+..+++ +---.|+|=++|-+++-++ .++.+ .--|+--|.-.+ T Consensus 258 ~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~----~~~~e-----------~~r~~~~~~d~i 322 (347) T protein:vir:94 258 TRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLR----DLALE-----------RDRDVDAQGDLI 322 (347) T ss_pred ccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcc----ccccc-----------chhchhhHHHHh Confidence 111110000 00111222 2224566666666655542 11111 113444455566 Q ss_pred HHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 375 SIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 375 gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ==|..|++.+||++..+.||..+-= T Consensus 323 ~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 323 VGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhhhhcCcccccceeEEEEecCCC Confidence 6679999999999999999876444 No 39 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.92 E-value=2.6e-10 Score=73.16 Aligned_cols=313 Identities=14% Similarity=0.090 Sum_probs=141.3 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc--ccCcC-CCcEEEEEEccCCcCCCccccC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF--SMPKH-YGKEIVRLHYIPLLDDRNVNDQ 81 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~--~mPKn-~GktIkfrry~pl~~~~t~lte 81 (399) |.| .-+.|| -|.+.+|..-++.++|.++.+.. .-.++ .|+||++|++.++.... T Consensus 1 Ma~--------------~~~~p~----~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~----- 57 (392) T protein:vir:99 1 MAN--------------AFSKPT----AVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHT----- 57 (392) T ss_pred Ccc--------------ccccHH----HHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccccccee----- Confidence 221 113344 68899999999999998886433 12244 69999998765543211 Q ss_pred CCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEe-eeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 82 GIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKL-EKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 82 GV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l-~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .++.+ ...++.++...++-+.++.+| ++..+=.+++|+-.....+ ++. T Consensus 58 -~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~-----~~~ 106 (392) T protein:vir:99 58 -RKLRG-------------------------AGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLE-----SFA 106 (392) T ss_pred -eeccc-------------------------cccCCcccccccccceEEEEEeeeeecceeechHHHhhhhh-----hhH Confidence 11110 000111222334446777888 4555666788776555444 234 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) .+.+..++.-..+.+-.++++.-....|. ..+...+++. .-.++.+-.+.+.|+++++|. T Consensus 107 ~~~~~~a~~ala~~vd~~i~~~~~~a~~~-~~~~~~~~~~--~~~~~~i~~a~~~L~~~~vP~----------------- 166 (392) T protein:vir:99 107 TQILPRQVRGVADILEEGVRDMIVGAPYE-AAGAVHEVAP--DEFFKGVNGARRALNELYIPQ----------------- 166 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccccccCh--hhhHHHHHHHHHHHhhcCCCC----------------- Confidence 45555555544444455555322222222 1222233332 235778899999999999973 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCc--cccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAG--GATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~--~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) .|++++.|+....|+. ++.|+..+.+|+. +.+-+|+||++.+|.++++.....-.+.....++........ T Consensus 167 -~R~~vv~p~~~~~l~~------~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~ 239 (392) T protein:vir:99 167 -GRVLVVGTAVTEQILN------DDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAP 239 (392) T ss_pred -CCEEEEcHHHHHHHhc------ccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeeccccccccccc Confidence 4788899999998874 7999999999876 557899999999999999987643221110000000000000 Q ss_pred cCccceeEEEEEEccccceecccc-cCCCCCcceEEEec-CCCcCCCCCCccchhhHHHHHHHHHHhhc-----cccceE Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTTVGFA-TDGKNVKFKIITKR-PGEATADRSDPYGEMGFMSIKWYYGFMVF-----RPEWIA 391 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~v~l~-~~g~~~k~~~ivk~-pG~~tad~~DPlgQrg~~gwK~~~~~~iL-----n~~~m~ 391 (399) .......+...+-|..++..--+. .++........++. -|..+....+--+..-.+..+..-...-+ .+.- . T Consensus 240 v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~-~ 318 (392) T protein:vir:99 240 APPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT-I 318 (392) T ss_pred cccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecccce-e Confidence 000001111122222221100000 00000000000000 00000000000000000000000000000 0000 0 Q ss_pred EEEEecCC Q lcl|NC_013692. 392 LLKTVARL 399 (399) Q Consensus 392 ~iet~A~~ 399 (399) .++..... T Consensus 319 ~~~~~~~~ 326 (392) T protein:vir:99 319 TAAAGEDH 326 (392) T ss_pred Eeeeccce Confidence 00000000 No 40 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.90 E-value=1.9e-09 Score=68.45 Aligned_cols=319 Identities=14% Similarity=0.058 Sum_probs=166.5 Q ss_pred CCCccccccccccC-CCCCC-cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAGPVDNIKPMKYN-DPANG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~~~~~~~~~~~n-~~~~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) ||- ..-++.-| .|-.+ ..++.- .+----|..++|.+=+...++..+-..+++ ..|++++|.|--...... T Consensus 1 ~~~---~~~~~~~~t~~g~~~~~~~~~-al~ie~~~g~V~~~f~~~s~~~~~v~~r~~--~~G~sv~i~~iG~~t~~~-- 72 (347) T protein:vir:33 1 MAN---IQGGQQIGTNQGKGQSAADKL-ALFLKVFGGEVLTAFARTSVTMPRHMLRSI--ASGKSAQFPVIGRTKAAY-- 72 (347) T ss_pred CCC---CccCcccccccccCCcccchH-HHHHHHHHHHHHHHHHHHHhhhhhhccccc--cccceeEeeeccceeeee-- Confidence 331 11112111 11111 111111 122234677777766666677777777755 459999998776653211 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) .+.|-+..+ ++.+.+-.+.+.+|-|+=-|-.+=|.+++.-..-.++.. T Consensus 73 ~~~g~~l~~--------------------------------~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~ 120 (347) T protein:vir:33 73 LKPGENLDD--------------------------------KRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAE 120 (347) T ss_pred ecCCCCCCC--------------------------------CCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHH Confidence 111211111 011123345555555554333222333332222224444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCc---eeeec----ccc--ccccccCCcceec--------HHHHHHHHHHHHhccC Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAG---TVRYP----GAA--TSDAEVDATTEVT--------YDSLMRLRLDLDNARA 221 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt---~V~YA----g~a--Tsra~v~~~~~vt--------~~~lr~a~~~Lk~nrA 221 (399) ++.+....-+....+.|.+.+.+++. ...+. +++ +.....+++...+ ++.|+.+...|.++.. T Consensus 121 ~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~V 200 (347) T protein:vir:33 121 YTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYV 200 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCC Confidence 55555555444444444433322111 00000 111 1111111122221 6778889999999999 Q ss_pred ccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccc Q lcl|NC_013692. 222 PTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHW 301 (399) Q Consensus 222 pk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~ 301 (399) |. ..+++++.|+...+|.. ++.|+.. .|+..+.+.+|.||++.+|++++++++-.. T Consensus 201 P~-----------------~gR~~vv~P~~y~~Ll~------~~~~~~~-d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~ 256 (347) T protein:vir:33 201 PA-----------------ADRTFYTTPDNYSAILA------ALMPNAA-NYQALLDPERGTIRNVMGFEVVEVPHLTAG 256 (347) T ss_pred Cc-----------------cCcEEEeCHHHHHHHhc------ccccccc-ccccccccccceeEEEeceeEEEecccccC Confidence 83 24899999999999974 6777754 578778899999999999999999996443 Q ss_pred ccCCcccCC-cccccc----cccCcc--ceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHH Q lcl|NC_013692. 302 AGVGKAVDP-NDQVPM----HESGGK--YSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFM 374 (399) Q Consensus 302 ~~aGa~~~~-~~~~~~----~~~~~~--~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~ 374 (399) ...+-..++ ++.... ++.... .+--.-|++-++|-|++-++ .++.+. .-|+--|.-++ T Consensus 257 ~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~----~~~~e~-----------~r~~~~~~d~i 321 (347) T protein:vir:33 257 GAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLK----DLALER-----------ARRANYQADQI 321 (347) T ss_pred ccccccccccccccccccCCcccceeccccceeeeeecchhheeeeee----ceeeee-----------ccchhhhhHhh Confidence 221110000 000000 011111 22233578888888877763 222221 13566677777 Q ss_pred HHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 375 SIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 375 gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) --|..|++.+||++..+.||- -++ T Consensus 322 ~~~~~~G~~vlrP~~av~i~~-~~~ 345 (347) T protein:vir:33 322 IAKYAMGHGGLRPEAAGAIVL-PKV 345 (347) T ss_pred hhhhhcCCceecccceEEEec-CCC Confidence 778999999999999877742 112 No 41 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.86 E-value=8.4e-09 Score=64.85 Aligned_cols=319 Identities=15% Similarity=0.084 Sum_probs=169.6 Q ss_pred CCCccccccccccCCCCCCccccccc-ceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGP-QIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p-~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) |+--|.--++.+=+.|..+ +.-++ .+..--|..++++.=+..-++..+=.+|++= .||+++|.|.-.... .-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~--~gks~~~~~iG~~~~--~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV--AAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRTQA--AYL 74 (345) T ss_pred Ccccccchhcccccccccc--cCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeecc--ccceEEEeeecceEE--Eee Confidence 4433332222222222111 11111 2222235666666655555666666666554 589999976544321 111 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +.|-.+.+ ++.+.+..+...+|-|.=-+..+=|.+++.-.+--++.++ T Consensus 75 ~~G~~l~~--------------------------------~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~ 122 (345) T protein:vir:22 75 APGENLDD--------------------------------KRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEY 122 (345) T ss_pred ecCCCCCC--------------------------------CCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHH Confidence 11211111 0111223444555555555554445555544443455656 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcee----eeccc---cc-----c-ccccCCcc---eecHHHHHHHHHHHHhccCcc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTV----RYPGA---AT-----S-DAEVDATT---EVTYDSLMRLRLDLDNARAPT 223 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V----~YAg~---aT-----s-ra~v~~~~---~vt~~~lr~a~~~Lk~nrApk 223 (399) +.|++..=+......+.+.+.+++... .|.++ +. . .+..+..- .--++.|+.|...|+++..|. T Consensus 123 s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~ 202 (345) T protein:vir:22 123 TSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPA 202 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCc Confidence 666666555555555555555433321 11111 00 0 11111000 112677888889999999984 Q ss_pred ccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 224 KIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 224 ~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) ..++++|.|+.-..|.+ ++.|.... |+.....-+|.||++.+||+++++++-. .. T Consensus 203 -----------------~~R~~vv~P~~y~~Ll~------~~~~~~~~-~~~~~~~~~G~V~~i~G~~V~~sn~lp~-~~ 257 (345) T protein:vir:22 203 -----------------ADRVFYCDPDSYSAILA------ALMPNAAN-YAALIDPEKGSIRNVMGFEVVEVPHLTA-GG 257 (345) T ss_pred -----------------cCCEEEeChHHHHHHhc------cccccccc-cccccccccceEEEEeceEEEecccccc-cc Confidence 34899999999999975 57776655 8888888899999999999999997542 22 Q ss_pred CCcccCCc-cccc--ccccCccceeE------EEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHH Q lcl|NC_013692. 304 VGKAVDPN-DQVP--MHESGGKYSVF------PMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFM 374 (399) Q Consensus 304 aGa~~~~~-~~~~--~~~~~~~~DVY------p~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~ 374 (399) .+....++ +... .+..++. .++ ..|+|-++|-+++-++ .++.+ ...|+--|.-++ T Consensus 258 ~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~l~~h~~A~~~v~~~----~~~~e-----------~~r~~~~~~d~I 321 (345) T protein:vir:22 258 AGTAREGTTGQKHVFPANKGEG-NVKVAKDNVIGLFMHRSAVGTVKLR----DLALE-----------RARRANFQADQI 321 (345) T ss_pred cCccccCcccccccccccccce-eeeeccCceEEEEEehhheeeeeee----cceee-----------eeechhHHHHHH Confidence 11111110 0000 1111111 111 3577777777766553 11111 113556677777 Q ss_pred HHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 375 SIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 375 gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) ==|..|++.+||++..+.|+.=-+ T Consensus 322 ~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 322 IAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHhcCCcccccceeEEEEEeeC Confidence 778999999999999888765444 No 42 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.85 E-value=6.1e-10 Score=71.11 Aligned_cols=287 Identities=13% Similarity=0.094 Sum_probs=164.6 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ||+.-. +...-.. ++.-+.-+-+. +.++.++.+.+..++.+++.+.+|+.+ .+++-++..- T Consensus 1 ma~~~~--~~~~~~~-----t~~gg~lip~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~~-------- 61 (304) T protein:vir:10 1 MATPTY--TPGNVIL-----SDFKNGVIPAE-QGTLIMKDIMANSAIMKLAKNEPMTAQ---KKKFTYLAKG-------- 61 (304) T ss_pred Cccccc--ccccccc-----cCCCceecchh-HHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeCC-------- Confidence 887643 2222222 22112222222 357777788888889999988887642 3444333211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) +...+... |+...-...++..|+.++++++.++.+|+++.. +++..|.+.|. T Consensus 62 ----~~a~~v~E-----------------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 113 (304) T protein:vir:10 62 ----VGAYWVSE-----------------------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK-WTAKDFFNEVK 113 (304) T ss_pred ----cceEEeec-----------------------CcccccccceeeEEEEEEEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 11122211 122223345778899999999999999998876 34455767666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCce---------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGT---------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGT 231 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~---------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s 231 (399) .+|.+.-+.- +-..+|++-+. -.+.+..+. ..-......++++|.++...|+.+... T Consensus 114 ~~l~~~ia~~----~d~~~l~G~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~--------- 179 (304) T protein:vir:10 114 PLIAEAFYKA----FDQAVIFGTKSPYNTSTSGKPLVEGAEEK-GNVVTDTNNLYVDLSALMATIEDEELD--------- 179 (304) T ss_pred HHHHHHHHHH----HHhhheeccCCCccccccccccccccccc-ccccccccchHHHHHHHHHHhhhccCC--------- Confidence 6665554443 33344543221 122211111 222223567899999998877765432 Q ss_pred cccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCc Q lcl|NC_013692. 232 RMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN 311 (399) Q Consensus 232 ~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~ 311 (399) ++ ..+|||.+...|+.++|-.+ .+++....|++-++.++.++.|.... T Consensus 180 --------~~--~~v~~~~~~~~L~~lkd~~G-------------~~l~~~~~~~l~G~PV~~~~~~~~~~--------- 227 (304) T protein:vir:10 180 --------PN--GVLTTRSFRSKMRNALDAND-------------RPLFDANGNEIMGLPLSYTGADVYDK--------- 227 (304) T ss_pred --------cC--EEEEcHHHHHHHHHhhccCC-------------cEeecCCCccccceeeEEecccccCC--------- Confidence 12 35789999999998876443 34555556888899998887642100 Q ss_pred ccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCcc------chhhHHHHH--HHHHHh Q lcl|NC_013692. 312 DQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPY------GEMGFMSIK--WYYGFM 383 (399) Q Consensus 312 ~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPl------gQrg~~gwK--~~~~~~ 383 (399) ++. .+++|.-+.-.+++.+ .+.+++.. .++-......|+. -|++.+.|+ +++++. T Consensus 228 ---------~~~----~~~~gd~~~~~~~~~~---~~~i~~~~-e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~ 290 (304) T protein:vir:10 228 ---------KKS----LALMGDWDYARYGILQ---GIEYAISE-DATLTTLQASDASGQPVSLFERDMFALRATMHIAYM 290 (304) T ss_pred ---------CCc----EEEEEehhhEEEEEec---ceEEEEee-cceeeeecccccCccchhhhhcCcEEEEEEEEeccE Confidence 111 2556765555555542 22222211 1111111223443 467777887 589999 Q ss_pred hccccceEEEEEec Q lcl|NC_013692. 384 VFRPEWIALLKTVA 397 (399) Q Consensus 384 iLn~~~m~~iet~A 397 (399) +++++-+++|+.+- T Consensus 291 v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 291 NVKPEAFATLKPTE 304 (304) T ss_pred eecccceEEEEecC Confidence 99999999999998 No 43 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.85 E-value=6.1e-10 Score=71.11 Aligned_cols=287 Identities=13% Similarity=0.094 Sum_probs=164.6 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ||+.-. +...-.. ++.-+.-+-+. +.++.++.+.+..++.+++.+.+|+.+ .+++-++..- T Consensus 1 ma~~~~--~~~~~~~-----t~~gg~lip~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~~-------- 61 (304) T protein:vir:94 1 MATPTY--TPGNVIL-----SDFKNGVIPAE-QGTLIMKDIMANSAIMKLAKNEPMTAQ---KKKFTYLAKG-------- 61 (304) T ss_pred Cccccc--ccccccc-----cCCCceecchh-HHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeCC-------- Confidence 887643 2222222 22112222222 357777788888889999988887642 3444333211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) +...+... |+...-...++..|+.++++++.++.+|+++.. +++..|.+.|. T Consensus 62 ----~~a~~v~E-----------------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 113 (304) T protein:vir:94 62 ----VGAYWVSE-----------------------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK-WTAKDFFNEVK 113 (304) T ss_pred ----cceEEeec-----------------------CcccccccceeeEEEEEEEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 11122211 122223345778899999999999999998876 34455767666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCce---------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGT---------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGT 231 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~---------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s 231 (399) .+|.+.-+.- +-..+|++-+. -.+.+..+. ..-......++++|.++...|+.+... T Consensus 114 ~~l~~~ia~~----~d~~~l~G~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~--------- 179 (304) T protein:vir:94 114 PLIAEAFYKA----FDQAVIFGTKSPYNTSTSGKPLVEGAEEK-GNVVTDTNNLYVDLSALMATIEDEELD--------- 179 (304) T ss_pred HHHHHHHHHH----HHhhheeccCCCccccccccccccccccc-ccccccccchHHHHHHHHHHhhhccCC--------- Confidence 6665554443 33344543221 122211111 222223567899999998877765432 Q ss_pred cccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCc Q lcl|NC_013692. 232 RMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN 311 (399) Q Consensus 232 ~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~ 311 (399) ++ ..+|||.+...|+.++|-.+ .+++....|++-++.++.++.|.... T Consensus 180 --------~~--~~v~~~~~~~~L~~lkd~~G-------------~~l~~~~~~~l~G~PV~~~~~~~~~~--------- 227 (304) T protein:vir:94 180 --------PN--GVLTTRSFRSKMRNALDAND-------------RPLFDANGNEIMGLPLSYTGADVYDK--------- 227 (304) T ss_pred --------cC--EEEEcHHHHHHHHHhhccCC-------------cEeecCCCccccceeeEEecccccCC--------- Confidence 12 35789999999998876443 34555556888899998887642100 Q ss_pred ccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCcc------chhhHHHHH--HHHHHh Q lcl|NC_013692. 312 DQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPY------GEMGFMSIK--WYYGFM 383 (399) Q Consensus 312 ~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPl------gQrg~~gwK--~~~~~~ 383 (399) ++. .+++|.-+.-.+++.+ .+.+++.. .++-......|+. -|++.+.|+ +++++. T Consensus 228 ---------~~~----~~~~gd~~~~~~~~~~---~~~i~~~~-e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~ 290 (304) T protein:vir:94 228 ---------KKS----LALMGDWDYARYGILQ---GIEYAISE-DATLTTLQASDASGQPVSLFERDMFALRATMHIAYM 290 (304) T ss_pred ---------CCc----EEEEEehhhEEEEEec---ceEEEEee-cceeeeecccccCccchhhhhcCcEEEEEEEEeccE Confidence 111 2556765555555542 22222211 1111111223443 467777887 589999 Q ss_pred hccccceEEEEEec Q lcl|NC_013692. 384 VFRPEWIALLKTVA 397 (399) Q Consensus 384 iLn~~~m~~iet~A 397 (399) +++++-+++|+.+- T Consensus 291 v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 291 NVKPEAFATLKPTE 304 (304) T ss_pred eecccceEEEEecC Confidence 99999999999998 No 44 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.85 E-value=1.7e-09 Score=68.63 Aligned_cols=317 Identities=13% Similarity=0.079 Sum_probs=168.3 Q ss_pred CCCccccccc-ccc-CCC-CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCC-C Q lcl|NC_013692. 1 MAGPVDNIKP-MKY-NDP-ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDD-R 76 (399) Q Consensus 1 ~~~~~~~~~~-~~~-n~~-~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~-~ 76 (399) || |.-. +.- ..+ ..+..++ --.+.---|..+.+..=+..-+|..+-.++++ ..||+++|.|.-..... . T Consensus 1 ~a----~~~~~~~~~~~~g~~~~~~d-~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i--~~G~sv~~~~iG~~~~~~~ 73 (347) T protein:vir:88 1 MA----NATGGQQIGANQGKGQSAAD-KLALFLKVFGGEVLTAFVRRSVTMDKHMVRTI--QNGKSASFPVMGRTKGYYL 73 (347) T ss_pred CC----CcccchhhhccCCCCccccc-hHHHHHHHHHHHHHHHHHHHhhhhhccccccc--cCcceEEEeeecceeeeee Confidence 44 3211 110 111 1111122 00222234566666655555566666666654 46999999887765432 1 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) ++-++ +.+ +- .+++-++++.+|-|+=-+..+=|.+++.-..-.++ T Consensus 74 ~~g~~---l~~----------~~----------------------~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r 118 (347) T protein:vir:88 74 APGEN---LDD----------KR----------------------KDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVR 118 (347) T ss_pred ccccC---CCC----------CC----------------------CCCccceEEEEEechhhhhhhhhhHHHHhhcCCch Confidence 22111 110 00 01233567777777654444334444332222255 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCce-----eeecccccc-------ccccCCc---ceecHHHHHHHHHHHHhccC Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGT-----VRYPGAATS-------DAEVDAT---TEVTYDSLMRLRLDLDNARA 221 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~-----V~YAg~aTs-------ra~v~~~---~~vt~~~lr~a~~~Lk~nrA 221 (399) ..++.++...=+....+.+.+.+.+++.. -..+|..+. .+++... ...-++.|+.|.+.|++++. T Consensus 119 ~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~V 198 (347) T protein:vir:88 119 AEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYV 198 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCC Confidence 55555555555554444444444333221 011221111 0111100 11126789999999999999 Q ss_pred ccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccc Q lcl|NC_013692. 222 PTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHW 301 (399) Q Consensus 222 pk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~ 301 (399) |. ..+++++.|+.-.+|.+ ++.|.. ..|.+...+-+|.||++.+|++++++++- . T Consensus 199 P~-----------------~gR~~vv~P~~y~~Ll~------~~~~~~-~~~~~~~~~~~G~vg~i~G~~V~~s~nlp-~ 253 (347) T protein:vir:88 199 PA-----------------GDRRFYCAPEDYSAILS------ALMPNA-ANYAALIDPETGNIRNVMGFEVIEVPHLT-V 253 (347) T ss_pred CC-----------------CCCEEEeCHHHHHHHhc------chhhhh-hhhccccchhcceeeeeccceEEEeeccc-c Confidence 84 24899999999888875 455654 57787778889999999999999999963 2 Q ss_pred ccCCcccCCcc-------cccccccCcccee----EEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccch Q lcl|NC_013692. 302 AGVGKAVDPND-------QVPMHESGGKYSV----FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGE 370 (399) Q Consensus 302 ~~aGa~~~~~~-------~~~~~~~~~~~DV----Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQ 370 (399) ...+...-+++ ....+...+++.. =--|++-..|-|++-++ .++.+. --||--| T Consensus 254 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~----d~~~e~-----------~r~~~~~ 318 (347) T protein:vir:88 254 GGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK----DMALER-----------ARRPEFQ 318 (347) T ss_pred cccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecc----cceeee-----------eechhhH Confidence 22211110000 0000111222221 12356666666666552 112211 1345556 Q ss_pred hhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 371 MGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 371 rg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .-.+=-|..|++.+||++..+.|++-+.- T Consensus 319 ~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 66677789999999999999999987777 No 45 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=98.84 E-value=7.6e-10 Score=70.58 Aligned_cols=297 Identities=10% Similarity=0.069 Sum_probs=156.5 Q ss_pred CCcc-cccccceehhhhhHHHHHHhhhHHhhhhccccccc----CcCCCcEEEEEEccCCcCCCccccC--CCCcchhhh Q lcl|NC_013692. 18 NGVE-SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSM----PKHYGKEIVRLHYIPLLDDRNVNDQ--GIDASGATI 90 (399) Q Consensus 18 ~~~~-~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~m----PKn~GktIkfrry~pl~~~~t~lte--GV~p~g~~~ 90 (399) |..+ .+.-|| -|.+++|..-++.+|+.++....-- -.++|.||++|+--++.....+... |+++ T Consensus 1 MaN~llT~ip~----iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~----- 71 (423) T protein:vir:17 1 MPNNLDSNVSQ----IVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNK----- 71 (423) T ss_pred CccchhhhhHH----HHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCccc----- Confidence 2211 111234 4789999999999999887543221 2468999999875554432222111 1222 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeec-ceehhhhhhhhhhhchhHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYG-FFREYTQEQLDFDSDPAMEGHVTTEMVKGANE 169 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG-~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~ 169 (399) ..|+| ..++.+|-|.= +-.+++|+=...+++. +. +.+..|+. T Consensus 72 -------------------~~l~e------------~~v~l~id~~k~va~~v~d~E~~~~i~~-----~~-~~l~~A~~ 114 (423) T protein:vir:17 72 -------------------NNLIS------------GKATGRVGNYITVAVEYQQLEEAIKLNQ-----LE-EILAPVRQ 114 (423) T ss_pred -------------------Ccccc------------ceeEEEeeceeeeeeeecHHHHhcChhH-----HH-HHHHHHHH Confidence 22333 34555665443 3447887765444442 22 34555555 Q ss_pred HHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEech Q lcl|NC_013692. 170 ITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGS 249 (399) Q Consensus 170 ~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~ 249 (399) -..+.+-.++++-.....+...++..... =.++++..+.+.|+++++|+ ..|++++.| T Consensus 115 aLA~~vd~~ia~~~~~~a~~~~gt~~t~~-----~a~~~i~~a~~~Ld~~~vP~-----------------~~R~~Vv~p 172 (423) T protein:vir:17 115 RIVTDLETELAHFMMNNGALSLGSPNTPI-----TKWSDVAQTASFLKDLGVNE-----------------GENYAVMDP 172 (423) T ss_pred HHHHHHHHHHHHHHhhccccccccCCccc-----ccHHHHHHHHHHHHhccCCc-----------------CCCEEEeCh Confidence 55556666666443222222122221111 14788999999999999985 248899999 Q ss_pred hhhHHHHHHhhhcCCCCceehhhcCCccccccccc-eeEcCeEEEecCcccccc-cC--------------CcccCCccc Q lcl|NC_013692. 250 DLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEV-GQLGRFRVIVNPQMMHWA-GV--------------GKAVDPNDQ 313 (399) Q Consensus 250 dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEI-G~i~~~RfV~~~~~~~~~-~a--------------Ga~~~~~~~ 313 (399) +....|.. ++.+....+-+..+.+-+|+| |++.+|++.++.++-.-. ++ ++++..+.. T Consensus 173 ~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~ 246 (423) T protein:vir:17 173 WSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQ 246 (423) T ss_pred HHHHHHhc------cccceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccc Confidence 99888763 455555556667778888888 999999999998766321 11 110000000 Q ss_pred cc------------------------------------------------c-------cccCccceeEEE---------- Q lcl|NC_013692. 314 VP------------------------------------------------M-------HESGGKYSVFPM---------- 328 (399) Q Consensus 314 ~~------------------------------------------------~-------~~~~~~~DVYp~---------- 328 (399) .. + ..++..+-+||. T Consensus 247 ~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~ 326 (423) T protein:vir:17 247 FTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQY 326 (423) T ss_pred eeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceEEEecCccccccCCccc Confidence 00 0 001111223322 Q ss_pred --------------------------EEEccccce--ecccccCCC---------CCcceEEEecCCCcCCCCCCccchh Q lcl|NC_013692. 329 --------------------------LCVASEAFT--TVGFATDGK---------NVKFKIITKRPGEATADRSDPYGEM 371 (399) Q Consensus 329 --------------------------lV~G~~Afg--~v~l~~~g~---------~~k~~~ivk~pG~~tad~~DPlgQr 371 (399) |++.++||+ +.+|.-.+. +++++++ .= .|.-..- T Consensus 327 ~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~--~~-------~d~~~~~ 397 (423) T protein:vir:17 327 NSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIRVH--KY-------ADGDANV 397 (423) T ss_pred ccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeecccCCcEEEEE--Ee-------cccccce Confidence 233444443 222221110 0011111 00 0111111 Q ss_pred hHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 372 GFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 372 g~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) -.+.|=.+||+..|++||..|+ .+.| T Consensus 398 ~~~r~d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:17 398 QKMRFDLLPAYVCFNPHMGGQF-FGNP 423 (423) T ss_pred eEEEEEeecceeeeccceEEEE-EecC Confidence 2234446799999999998776 4555 No 46 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.76 E-value=3.3e-09 Score=67.10 Aligned_cols=303 Identities=12% Similarity=0.046 Sum_probs=171.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |+|...+-.- ....+..+.+-| +.+ ..+.++..++..++.+++...+|+.+. +++-+.. T Consensus 1 m~~~~~~a~~----~~~t~~~g~~i~---~~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~---~~~p~~~---------- 59 (330) T protein:vir:77 1 MAGSTVPSTQ----VALTGDFSAFLT---PEQ-SQDYFAEIEKTSIVQRIARKVPMGPTG---ISIPHWT---------- 59 (330) T ss_pred Ccccccchhh----ccccCCCcceec---hhH-HHHHHHHHHhccchhhhcceeeccCCc---eEEEEEc---------- Confidence 8887633211 111111222222 234 356788888888999999998887533 3443221 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .+ +..+++..| +...--..++..|+.+.++++.++++|+++.+ +++..+.+.|. T Consensus 60 ~~--~~a~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~ 113 (330) T protein:vir:77 60 GA--VSASWTGEA-----------------------ERKPITKGSFGKQELEPVKITTIFAESAEVVR-LNPLNYLNTMR 113 (330) T ss_pred CC--cceeEecCC-----------------------CccccccceeeEEEEeEEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 11 122333222 11112234678899999999999999998765 44555766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCce-----ee--eccc-----cccccccCCcceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGT-----VR--YPGA-----ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~-----V~--YAg~-----aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii 228 (399) .+|.+.-+. .+-..+|++-++ -+ .+.. .+...+.+......+++|.++...|..+.... T Consensus 114 ~~l~~ai~~----~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~----- 184 (330) T protein:vir:77 114 TKIAEAIAL----KFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW----- 184 (330) T ss_pred HHHHHHHHH----HHHHHhhcccCCCCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCc----- Confidence 666555544 333344532211 00 0000 01111122224455777888777776665531 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) -..+||+.....|+.|+|--+.|-|.+..+-++.. ..+-+.+-++.++.++.|.. +. T Consensus 185 --------------~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~---~~~~~~l~G~PV~~~~~~p~----~~-- 241 (330) T protein:vir:77 185 --------------TGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVG---AIREGRILGRPTYVADNVVN----GT-- 241 (330) T ss_pred --------------cEEEEcHHHHHHHHHHhccCCceeecCcccccccc---ccCCceecceeeEEeccccC----CC-- Confidence 24579999999999999877778887755544442 33557788899999887531 11 Q ss_pred CCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEe------cCCCc-CCCCCCccchhhHHHHH--HH Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITK------RPGEA-TADRSDPYGEMGFMSIK--WY 379 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk------~pG~~-tad~~DPlgQrg~~gwK--~~ 379 (399) .+++ +.+++|.-+...++... .+.+++... .+... .....--+-|++...|| ++ T Consensus 242 ----------~~~~----~~~~~gd~s~~~i~~~~---~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r 304 (330) T protein:vir:77 242 ----------VGNR----VVGVMGDFSQVIWGQIG---GLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAE 304 (330) T ss_pred ----------CCCc----cEEEEEecceEEEEEec---CcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEE Confidence 1122 23567765555565542 222222110 11000 00011123577788888 58 Q ss_pred HHHhhccccceEEEEEecCC Q lcl|NC_013692. 380 YGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 380 ~~~~iLn~~~m~~iet~A~~ 399 (399) +.+.+.+++-+++|+.+++- T Consensus 305 ~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 305 FAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred eccEEecccceEEEEeccCC Confidence 89999999999999988877 No 47 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.72 E-value=1.9e-08 Score=62.86 Aligned_cols=313 Identities=12% Similarity=0.087 Sum_probs=157.6 Q ss_pred cccCCCCCCcccccccceehhhhhHHHHHHhh--hHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchh Q lcl|NC_013692. 11 MKYNDPANGVESSIGPQIHTRYWYKRALIDAA--KEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGA 88 (399) Q Consensus 11 ~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~--p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~ 88 (399) |+.|+.-++. -.++-|+...| .++-..+.+ =.+.-++|-....++-+.++--+|-.+...... ..|=+..++ T Consensus 1 ~~~~~~~~~~-~~Ms~~i~~~f-v~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 74 (322) T protein:vir:10 1 MKLNAIMSML-PLIAGDIDQAF-VQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPD----AVKRKRSRQ 74 (322) T ss_pred Ccccceeeee-eeeechhhhHH-HHHHHHHHHHHHHHhhhhhhcccccccccccccceeeccccccc----ccccccccc Confidence 4444432220 11122222222 222222211 122335555555566666664444333322100 001111111 Q ss_pred hhhhhhhccccccccccc-cccccccccccceeeccceeEEEEEEeeeecceehhhhhhh-hhhhchhHHHHHHHHHHHH Q lcl|NC_013692. 89 TIANGNLYGSSRDVGNIT-AKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQL-DFDSDPAMEGHVTTEMVKG 166 (399) Q Consensus 89 ~~~ngn~~~ss~d~g~it-~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~-~t~~D~~L~~~i~~el~~~ 166 (399) +..++ + ..++.++ +..++.+.+++|..+..+.|.-. ..+.||.- .-++...- T Consensus 75 ~~~d~------------~~dtp~~~~-----------~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~---~~~~~~a~ 128 (322) T protein:vir:10 75 QSADG------------TYPTPVNNK-----------PFAKRRTNVDTYDTGHVVEQEDISQMLLDPNS---ALITSQAY 128 (322) T ss_pred cccCc------------ccCCCcccc-----------ccceEEEeecccccceecchHHHHHhhcCchH---HHHHHHHH Confidence 11110 1 0111122 34667899999987654443321 23444422 12222222 Q ss_pred HHHHHHHHHHHHHHhcCceeeeccc----cccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCe Q lcl|NC_013692. 167 ANEITEDLLQIDLLNSAGTVRYPGA----ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNA 242 (399) Q Consensus 167 ~~~~t~d~l~~~~l~agt~V~YAg~----aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~ 242 (399) +-.+..|.+....+.++-++-=.|. -.+..-..++.-++++.|+.|.+.|+++.+|. ... T Consensus 129 AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~----------------d~~ 192 (322) T protein:vir:10 129 AMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEP----------------EVS 192 (322) T ss_pred HhhhHHHHHHHhhhhccccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCC----------------CCC Confidence 2223344333332212111100000 01222233345699999999999999999973 123 Q ss_pred eEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccc-cccceeEcCeEEEecCcccccccCCcccCCcccccccccCc Q lcl|NC_013692. 243 RALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATM-HGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGG 321 (399) Q Consensus 243 yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~-~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~ 321 (399) +++++.|....+|. .++.|+. ..|.+.+.+. +|.||.+-+|+|+.+.++-.-...+-.-+ ..... T Consensus 193 R~~vv~p~~~~~LL------~d~~~ts-~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~-------~~~~~ 258 (322) T protein:vir:10 193 KVIVIGPTQARKLL------QITEATS-ADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMA-------AEDGP 258 (322) T ss_pred eEEEeCHHHHHHHh------cchhhhh-hhcccchhhhhcCeeeeeeeEEEEEeccCCcccccccccc-------ccCCC Confidence 67899999988886 4789987 5666666775 69999999999999987543221111000 01122 Q ss_pred cceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 322 KYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 322 ~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) ..++.+.++.=+.|.+ | ..++.++.++ ...|+. ..-..+.-++.|||.+++|+..+.|+|---| T Consensus 259 ~~~~~~~~a~~k~Av~---~-a~~~dv~~~i-~~~~~~---------~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 259 QGDEIWCIAMTDMALG---Y-HSCKDIWTKV-AEDPSA---------SFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred CccceeEEEEecCcee---E-EEeeeeeEEe-eccCCc---------chhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 3456665544444444 2 1223333333 445542 1223456678899999999999999997777 No 48 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.71 E-value=1.3e-09 Score=69.36 Aligned_cols=294 Identities=10% Similarity=0.008 Sum_probs=158.1 Q ss_pred CCCcccccccccc------------CCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGPVDNIKPMKY------------NDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~~~~~~~~~~------------n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrr 68 (399) +.....+-....| +..+.+.-+.+-|+ .|....+....+...+.+++.+.+|+-+.++... .+ T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~ 171 (415) T protein:vir:98 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VR 171 (415) T ss_pred hhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-Ee Confidence 0000000000000 00011111223443 4566677777788889999999999988876333 23 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) +..-.... ...||-+. |..+ ..++..|+.+++++|.++.+|+++.+ T Consensus 172 ~~~~~~~~-~v~E~~~~------------------------~~~~---------~~~~~~v~~~~~k~~~~~~iS~ell~ 217 (415) T protein:vir:98 172 QSEVAALE-KVEELEEN------------------------PELA---------VKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred ecCCccce-eecccccc------------------------Cccc---------ccceeeEEeeeeeeEeeehhhHHHHh Confidence 33332211 11222110 1000 12567899999999999999999876 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-eccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVR-YPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~-YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) +++..|.+.|..+|...-+...+..+..+.- +|.... -.+........+.....++++|..+.-.|...... T Consensus 218 -ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----- 290 (415) T protein:vir:98 218 -DAKVNVLQELKLWMARTIAATRNKAIIDVIT-KGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----- 290 (415) T ss_pred -hchHHHHHHHHHHHHHHHHHHHHHHHhhccc-cCccccccccccccccccccccccchhHHHHHHHhhhhhccC----- Confidence 4555676767666666655533333322221 221110 00112222334444668899998888766543321 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) ++ ..+||+.....|+.++|-.+.|-|.|- +..|-.+++-++.++.++.+ +..++|. T Consensus 291 ------------~~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~- 346 (415) T protein:vir:98 291 ------------HN--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDE-VLGQKGN- 346 (415) T ss_pred ------------CC--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEeccc-ccCCCCc- Confidence 11 347899999999999887777767552 23455578899999888864 2111111 Q ss_pred cCCcccccccccCccceeEEEEEEc--cccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVA--SEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G--~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) . .++|| +++|-. .-+ ..+.++. .+ +. ..++++.+. +++.+.++ T Consensus 347 ------------------~-~~~~Gd~~~~~~~-~~~---~~~~v~~--~~------~~---~~~~~~~~~-~r~d~~v~ 391 (415) T protein:vir:98 347 ------------------N-TLIIGNLKDAIVL-FDR---SQYQASW--TD------YM---HFGECLMIA-VRQDCRIL 391 (415) T ss_pred ------------------c-EEEEEehhccEEE-Eee---cceEEEE--ec------cc---cCceEEEEE-EEeccEEe Confidence 1 25777 344432 111 1222221 11 11 223333322 47788888 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-++.++..+.. T Consensus 392 ~~~a~~~~~~~~~~ 405 (415) T protein:vir:98 392 DYKSAIVIEYDDSE 405 (415) T ss_pred ccccEEEEEEeccC Confidence 99999999888887 No 49 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.71 E-value=1.3e-09 Score=69.36 Aligned_cols=294 Identities=10% Similarity=0.008 Sum_probs=158.1 Q ss_pred CCCcccccccccc------------CCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGPVDNIKPMKY------------NDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~~~~~~~~~~------------n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrr 68 (399) +.....+-....| +..+.+.-+.+-|+ .|....+....+...+.+++.+.+|+-+.++... .+ T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~ 171 (415) T protein:vir:79 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VR 171 (415) T ss_pred hhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-Ee Confidence 0000000000000 00011111223443 4566677777788889999999999988876333 23 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) +..-.... ...||-+. |..+ ..++..|+.+++++|.++.+|+++.+ T Consensus 172 ~~~~~~~~-~v~E~~~~------------------------~~~~---------~~~~~~v~~~~~k~~~~~~iS~ell~ 217 (415) T protein:vir:79 172 QSEVAALE-KVEELEEN------------------------PELA---------VKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred ecCCccce-eecccccc------------------------Cccc---------ccceeeEEeeeeeeEeeehhhHHHHh Confidence 33332211 11222110 1000 12567899999999999999999876 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-eccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVR-YPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~-YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) +++..|.+.|..+|...-+...+..+..+.- +|.... -.+........+.....++++|..+.-.|...... T Consensus 218 -ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----- 290 (415) T protein:vir:79 218 -DAKVNVLQELKLWMARTIAATRNKAIIDVIT-KGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----- 290 (415) T ss_pred -hchHHHHHHHHHHHHHHHHHHHHHHHhhccc-cCccccccccccccccccccccccchhHHHHHHHhhhhhccC----- Confidence 4555676767666666655533333322221 221110 00112222334444668899998888766543321 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) ++ ..+||+.....|+.++|-.+.|-|.|- +..|-.+++-++.++.++.+ +..++|. T Consensus 291 ------------~~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~- 346 (415) T protein:vir:79 291 ------------HN--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDE-VLGQKGN- 346 (415) T ss_pred ------------CC--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEeccc-ccCCCCc- Confidence 11 347899999999999887777767552 23455578899999888864 2111111 Q ss_pred cCCcccccccccCccceeEEEEEEc--cccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVA--SEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G--~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) . .++|| +++|-. .-+ ..+.++. .+ +. ..++++.+. +++.+.++ T Consensus 347 ------------------~-~~~~Gd~~~~~~~-~~~---~~~~v~~--~~------~~---~~~~~~~~~-~r~d~~v~ 391 (415) T protein:vir:79 347 ------------------N-TLIIGNLKDAIVL-FDR---SQYQASW--TD------YM---HFGECLMIA-VRQDCRIL 391 (415) T ss_pred ------------------c-EEEEEehhccEEE-Eee---cceEEEE--ec------cc---cCceEEEEE-EEeccEEe Confidence 1 25777 344432 111 1222221 11 11 223333322 47788888 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-++.++..+.. T Consensus 392 ~~~a~~~~~~~~~~ 405 (415) T protein:vir:79 392 DYKSAIVIEYDDSE 405 (415) T ss_pred ccccEEEEEEeccC Confidence 99999999888887 No 50 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.71 E-value=1.3e-09 Score=69.36 Aligned_cols=294 Identities=10% Similarity=0.008 Sum_probs=158.1 Q ss_pred CCCcccccccccc------------CCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGPVDNIKPMKY------------NDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~~~~~~~~~~------------n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrr 68 (399) +.....+-....| +..+.+.-+.+-|+ .|....+....+...+.+++.+.+|+-+.++... .+ T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~ 171 (415) T protein:vir:81 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPV-VR 171 (415) T ss_pred hhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccch----HHHHHHHHHHHhhhhhhhheeeeeccCCceeEEE-Ee Confidence 0000000000000 00011111223443 4566677777788889999999999988876333 23 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) +..-.... ...||-+. |..+ ..++..|+.+++++|.++.+|+++.+ T Consensus 172 ~~~~~~~~-~v~E~~~~------------------------~~~~---------~~~~~~v~~~~~k~~~~~~iS~ell~ 217 (415) T protein:vir:81 172 QSEVAALE-KVEELEEN------------------------PELA---------VKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred ecCCccce-eecccccc------------------------Cccc---------ccceeeEEeeeeeeEeeehhhHHHHh Confidence 33332211 11222110 1000 12567899999999999999999876 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-eccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVR-YPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~-YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) +++..|.+.|..+|...-+...+..+..+.- +|.... -.+........+.....++++|..+.-.|...... T Consensus 218 -ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----- 290 (415) T protein:vir:81 218 -DAKVNVLQELKLWMARTIAATRNKAIIDVIT-KGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----- 290 (415) T ss_pred -hchHHHHHHHHHHHHHHHHHHHHHHHhhccc-cCccccccccccccccccccccccchhHHHHHHHhhhhhccC----- Confidence 4555676767666666655533333322221 221110 00112222334444668899998888766543321 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) ++ ..+||+.....|+.++|-.+.|-|.|- +..|-.+++-++.++.++.+ +..++|. T Consensus 291 ------------~~--~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~- 346 (415) T protein:vir:81 291 ------------HN--VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDE-VLGQKGN- 346 (415) T ss_pred ------------CC--EEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCCceecceeeEEeccc-ccCCCCc- Confidence 11 347899999999999887777767552 23455578899999888864 2111111 Q ss_pred cCCcccccccccCccceeEEEEEEc--cccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVA--SEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G--~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) . .++|| +++|-. .-+ ..+.++. .+ +. ..++++.+. +++.+.++ T Consensus 347 ------------------~-~~~~Gd~~~~~~~-~~~---~~~~v~~--~~------~~---~~~~~~~~~-~r~d~~v~ 391 (415) T protein:vir:81 347 ------------------N-TLIIGNLKDAIVL-FDR---SQYQASW--TD------YM---HFGECLMIA-VRQDCRIL 391 (415) T ss_pred ------------------c-EEEEEehhccEEE-Eee---cceEEEE--ec------cc---cCceEEEEE-EEeccEEe Confidence 1 25777 344432 111 1222221 11 11 223333322 47788888 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-++.++..+.. T Consensus 392 ~~~a~~~~~~~~~~ 405 (415) T protein:vir:81 392 DYKSAIVIEYDDSE 405 (415) T ss_pred ccccEEEEEEeccC Confidence 99999999888887 No 51 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.70 E-value=2.4e-09 Score=67.86 Aligned_cols=288 Identities=20% Similarity=0.202 Sum_probs=160.0 Q ss_pred CCCccccc---cccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNI---KPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~---~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) ..|.-+.. ..-..+..+.+.-+-+-|+ -|..+.+...+...++.+++.+.++... .+++.+. T Consensus 91 ~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~----~~~~~I~~~~~~~~~l~~~~~~~~~~~~---~~~~~~~-------- 155 (407) T protein:vir:48 91 RKGREDGLRELERKALQVGNDEDGGYAIPE----ELDRTILTLLKDEVVMRQEATVITLGGS---DYKKLVN-------- 155 (407) T ss_pred hccchhhhhHHHHHhhhcccCCCCcccccH----hHHHHHHHHHHhhhhhhhhceeeecCCC---ceEEEEe-------- Confidence 11111111 1111222222111223454 3467777777788888889888776543 2333111 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) +.|..+ .+...|.. .|... ..++..|+.++++++.|+.+|+++.+ +++..|.+ T Consensus 156 --~~~~~a--~~v~E~~~-------------~~~~~---------~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 208 (407) T protein:vir:48 156 --LGGTTS--GWVGETDA-------------RPETA---------TSKLGLIEPFMGEIYGNPQATQKMLD-DAFFNVED 208 (407) T ss_pred --cCCcce--eeeccccc-------------ccccc---------cccceeEEeeeeeeEeehhhHHHHHh-cchHHHHH Confidence 112211 22222210 01000 02567789999999999999999887 45555767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce------eeecccc----------ccccccCCcceecHHHHHHHHHHHHhccC Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT------VRYPGAA----------TSDAEVDATTEVTYDSLMRLRLDLDNARA 221 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~a----------Tsra~v~~~~~vt~~~lr~a~~~Lk~nrA 221 (399) .|..+|.+.-+...+.. +|++-++ +-++... ...........+++++|.++...|+..-. T Consensus 209 ~i~~~l~~~i~~~~~~a----~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~ 284 (407) T protein:vir:48 209 WINSELALEFAEQEEIA----FTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHR 284 (407) T ss_pred HHHHHHHHHHHHHHHhh----hhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhh Confidence 67666666655433332 3432111 0011100 00011122356889999998887765432 Q ss_pred ccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccc Q lcl|NC_013692. 222 PTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHW 301 (399) Q Consensus 222 pk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~ 301 (399) . .++| +||+....-|+.|+|--+.|-|.|-.. .|..+++-+..++.++.|- T Consensus 285 ~-----------------~a~~--v~n~~~~~~L~~lkD~~Gr~l~~~~~~--------~g~~~~l~G~PV~~~~~~p-- 335 (407) T protein:vir:48 285 S-----------------GAKF--MMNNSSLFAIRLLKDNDGNYLWRPGIE--------LGQPSSLAGYGIVENEQMP-- 335 (407) T ss_pred c-----------------CCEE--EEcHHHHHHHHHhhccCCceeeccCcC--------CCCCceecceeeEEecCcC-- Confidence 1 1233 689999999999998777777776433 3445667788888888652 Q ss_pred ccCCcccCCcccccccccCccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH- Q lcl|NC_013692. 302 AGVGKAVDPNDQVPMHESGGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW- 378 (399) Q Consensus 302 ~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~- 378 (399) +.++ + . ..++||.= +|-...- .+ +++ ..|||.+++.++|++ T Consensus 336 -~~~~----~---------~-----~~i~~Gd~~~~~~i~~~--~~----~~i-----------~~d~~~~~~~~~~~~~ 379 (407) T protein:vir:48 336 -DIAA----D---------A-----KAIAFGNFKRGYTIVDR--IG----TRI-----------LRDPYTNKPFVGFYTT 379 (407) T ss_pred -CccC----C---------c-----cEEEEEeccccEEEEEe--ec----eEE-----------EeeccccCCcEEEEEE Confidence 1111 0 1 12456752 2321111 11 222 146788889999996 Q ss_pred -HHHHhhccccceEEEEEecCC Q lcl|NC_013692. 379 -YYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 379 -~~~~~iLn~~~m~~iet~A~~ 399 (399) .+.+.+++++-++.++++|.- T Consensus 380 ~r~d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 380 KRTGGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred EEeccEEecccceEEEEeeccC Confidence 589999999999999998888 No 52 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=98.69 E-value=4.3e-09 Score=66.45 Aligned_cols=296 Identities=11% Similarity=0.112 Sum_probs=149.9 Q ss_pred cccceeh---hhhhHHHHHHhhhHHhhhhcccccccC-----cCCCcEEEEEEccCCcCCCc--cccCCCCcchhhhhhh Q lcl|NC_013692. 24 IGPQIHT---RYWYKRALIDAAKEAYFGQLADTFSMP-----KHYGKEIVRLHYIPLLDDRN--VNDQGIDASGATIANG 93 (399) Q Consensus 24 i~p~~~t---~y~~~k~L~~A~p~lv~~~~a~~~~mP-----Kn~GktIkfrry~pl~~~~t--~lteGV~p~g~~~~ng 93 (399) +..++-+ --|.+++|+.-++.+|+.++.. +..+ ++.|.||++|+--++-.... ....+++|. T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~-r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~------- 72 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVD-RQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKN------- 72 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcc-cCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccc------- Confidence 2222211 3578999999999999999854 3332 36799999987765532221 112223332 Q ss_pred hhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 94 NLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITE 172 (399) Q Consensus 94 n~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~ 172 (399) +++...++.+|-|. .+-.+++|+-...+.+. |.+.+...+..-+.++.. T Consensus 73 -----------------------------~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~-~~~~l~~a~~ala~~vd~ 122 (423) T protein:vir:35 73 -----------------------------GLFSAKATGKVGKYITVAVEWTQIEEALKLNQ-LDQILSPIHERMVTDLET 122 (423) T ss_pred -----------------------------ccccceeeEEeccceeccceeCHHHHHhhHHH-HHHHHHHHHHHHHHHHHH Confidence 22333566666543 34567787765554442 333333333333333222 Q ss_pred HHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhh Q lcl|NC_013692. 173 DLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLV 252 (399) Q Consensus 173 d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~ 252 (399) .|...+..++.++ . ++.... .-.++.+..+.+.|+++++|+ ..|++++.|+.. T Consensus 123 -~l~~~l~~~a~~~--v--gt~~t~-----~~~~~~i~~a~~~Ld~~~vP~-----------------~~R~~Vv~p~~~ 175 (423) T protein:vir:35 123 -ELAHFMMNNGALS--L--GSPNTA-----IKKWADVAQTASFIKDIGIKT-----------------GENYAIMDPWSA 175 (423) T ss_pred -HHHHHHhhccccc--c--ccccCC-----cchHHHHHHHHHHHHHhcCCc-----------------CCCEEEeCHHHH Confidence 2222222222221 1 121111 124788999999999999985 238999999998 Q ss_pred HHHHHHhhhcCCCCceehhhcCCccccccccc-eeEcCeEEEecCccccc-ccC--Cccc-CCccccc------------ Q lcl|NC_013692. 253 PTIEAMKDNHGNPAFIPIEKYAAGGATMHGEV-GQLGRFRVIVNPQMMHW-AGV--GKAV-DPNDQVP------------ 315 (399) Q Consensus 253 ~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEI-G~i~~~RfV~~~~~~~~-~~a--Ga~~-~~~~~~~------------ 315 (399) ..|.+ ++.+....+-+..+.+-+|+| |++.+|++.++.++-.- .+. +... +.+..++ T Consensus 176 a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~ 249 (423) T protein:vir:35 176 QRLAD------AQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTV 249 (423) T ss_pred HHHhc------cccceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeecccccccccccccccccee Confidence 88763 233333334445567888876 99999999998876643 111 1000 0000000 Q ss_pred ------------------------------------------------------ccccCccceeEEE------------- Q lcl|NC_013692. 316 ------------------------------------------------------MHESGGKYSVFPM------------- 328 (399) Q Consensus 316 ------------------------------------------------------~~~~~~~~DVYp~------------- 328 (399) ...+...+.+||. T Consensus 250 ~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v 329 (423) T protein:vir:35 250 ALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAV 329 (423) T ss_pred eeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccc Confidence 0011222333432 Q ss_pred -----------------------EEEccccce--ecccccCCCC-------CcceEEEecCCCcCCCCCCccchhhHHHH Q lcl|NC_013692. 329 -----------------------LCVASEAFT--TVGFATDGKN-------VKFKIITKRPGEATADRSDPYGEMGFMSI 376 (399) Q Consensus 329 -----------------------lV~G~~Afg--~v~l~~~g~~-------~k~~~ivk~pG~~tad~~DPlgQrg~~gw 376 (399) |++.++||+ +.+|.-.+.. -.+.+.+..-+.. =..--.+.| T Consensus 330 ~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~~d~-------~~~~~~~r~ 402 (423) T protein:vir:35 330 DAKVKAGDAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHSLDSAVATYEGFSIRVHKYADG-------DANKQMMRF 402 (423) T ss_pred cccccCCceeeeeecCCCceeEEEeecCceeEEEEEccccCCccceeeccccCceEEEEEeecc-------ccCceEEEE Confidence 244444443 2233211100 0011111111100 000111334 Q ss_pred HHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 377 KWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 377 K~~~~~~iLn~~~m~~iet~A~ 398 (399) =.+||+..|++||..|+ .+.| T Consensus 403 d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:35 403 DLLPAYVCFNPHMGGQF-FGNP 423 (423) T ss_pred EeecceeeecccceEEE-EecC Confidence 46799999999998766 4556 No 53 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.68 E-value=4.7e-09 Score=66.26 Aligned_cols=295 Identities=11% Similarity=0.026 Sum_probs=158.8 Q ss_pred CCCcccc----------ccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVDN----------IKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~~----------~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) +...+.. ......+..+.+..+-+-|+ .+..+.+..+.+...+.+++...+||...|+....++.. T Consensus 88 ~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~ 163 (404) T protein:vir:10 88 ADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPE----DIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSK 163 (404) T ss_pred HHHHHHHHHhhhhcchhhHHhhhccccCCCCceeech----hHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecC Confidence 0000000 00111111111111222333 335666666778889999999999999988755444322 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD 150 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~ 150 (399) . + ......||= .. .. .....++..|+.+.++++.++.+|+++.+ + T Consensus 164 ~-~-~~~~v~e~~-----~~---------------~~------------~~~~~~f~~i~~~~~k~~~~~~iS~ell~-d 208 (404) T protein:vir:10 164 Q-K-PMKPLSENQ-----QI---------------PT------------NGDNGKLERFNFKLKDLADFMSIPNDLLK-F 208 (404) T ss_pred C-c-ceeeccccc-----cc---------------cc------------cccccceeeeEeeheeeEeeehhhHHHHh-h Confidence 1 1 111111210 00 00 00113567899999999999999998875 4 Q ss_pred hchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCce-----eeeccccccccccCCcceecHHHHHHHHH-HHHhccCccc Q lcl|NC_013692. 151 SDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGT-----VRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTK 224 (399) Q Consensus 151 ~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~ 224 (399) ++..|.+.|..+|...-+...+.. +|++-+. .+.. ...-.+++.....++++|..+.. .|+....+ T Consensus 209 s~~~l~~~i~~~la~~~~~~~~~~----il~G~g~~~~~~gi~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-- 280 (404) T protein:vir:10 209 ADKSLEDWIINWFVDKVRITRNAE----ILYGAGGDEHATGIMT--ANKFKKITLPKSPALKDFKKCKNVELLNVFKA-- 280 (404) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHH----HhhcCCCCCcccceee--ccccceeeccccccHHHHHHHHHhhhhccccC-- Confidence 556677777777777766644444 4432221 1111 11112333345567888887764 33332221 Q ss_pred cceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccC Q lcl|NC_013692. 225 IKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGV 304 (399) Q Consensus 225 T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~a 304 (399) .-+.+|||....-|+.|+|--+.|-|.|-. -.+..+++-|..+++.+..++=.+ T Consensus 281 -----------------~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~--------~~~~~~~l~G~PV~~~~~~~~~~~- 334 (404) T protein:vir:10 281 -----------------TSSWIVNQDGFNYLDSLEDKTGRPYLQPDP--------KDPTQYRFLGLPVIELPNDLLLST- 334 (404) T ss_pred -----------------CCEEEEcHHHHHHHHHhhccCCceeeccCc--------CCCCCccccceeeEEecccccCCC- Confidence 123589999999999999877887786532 234445677788776554221100 Q ss_pred CcccCCcccccccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHH Q lcl|NC_013692. 305 GKAVDPNDQVPMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYG 381 (399) Q Consensus 305 Ga~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~ 381 (399) .+. . .+++|.-+-+ .+... ++ +++.+..- ....-+++...|+ +++. T Consensus 335 ---------------~~~---~-~~~~gd~s~~~~~~~~-~~----~~i~~~~~-------~~~~~~~~~~~~~~~~r~d 383 (404) T protein:vir:10 335 ---------------ESA---I-PVLLGDTKEAYKYVSD-GA----YELATTNI-------GAGAFETNTTKARIIMRID 383 (404) T ss_pred ---------------CCc---c-EEEEEeccccEEEEEe-cc----eEEEEecc-------ccchhhcCceEEEEEEeec Confidence 011 1 2467854422 23222 22 22222111 1112345666655 6788 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_013692. 382 FMVFRPEWIALLKTVARL 399 (399) Q Consensus 382 ~~iLn~~~m~~iet~A~~ 399 (399) +.+++++-++.++..+.. T Consensus 384 ~~v~~~~a~~~~~~~~aa 401 (404) T protein:vir:10 384 GNVKDSEALLIAEIPVES 401 (404) T ss_pred cEEecccceEEEEeeccc Confidence 999999999998887777 No 54 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.65 E-value=2.3e-09 Score=67.99 Aligned_cols=292 Identities=11% Similarity=0.065 Sum_probs=161.1 Q ss_pred CCCccccccccccCCCCCC---cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANG---VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~---~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) .+.+ .++.+.++..... +.+.+-|+ .+ ..+.++.+.+.-++.+++...+|+. .++++-++... T Consensus 14 f~~~--~~~~~~~~a~~~~~~~~~~~lip~---~~-~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~----- 79 (324) T protein:vir:96 14 FASN--NVKPQVFNPDNVMMHEKKDGTLLN---DF-TTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADK----- 79 (324) T ss_pred HHHh--hhhhhhcccccccccCCCcceech---hH-HHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecC----- Confidence 1111 1233334332211 11222232 33 5777777888888999999988874 34565444321 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) |...+++.| +..-.-..++..++.+.++++.++.+|+++.+ ++++.|.. T Consensus 80 -------~~a~~v~Eg-----------------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~ 128 (324) T protein:vir:96 80 -------PGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFE 128 (324) T ss_pred -------cceeeecCC-----------------------ccccccccceeEEEEEeEEEEEeehhhHHHHh-cchHHHHH Confidence 122232222 11122234778899999999999999998877 45566777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) .|..+|.+.-+.-.+..+..+--++.....=.+ .+...+......+++++|.++...|+.+... T Consensus 129 ~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~--------------- 192 (324) T protein:vir:96 129 EMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQ-SIKKTNKVIKGDFTQDNIIDLEALLEDDELE--------------- 192 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcCCCCCcCccccc-cccccceecccccchHHHHHHHHhhhhccCC--------------- Confidence 777777766666444443332111111000000 1111112223557899999998877665432 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccc Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMH 317 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~ 317 (399) + + ..+|||.....|+.++|--+.|-|. .+--+++-++.++.++.. + T Consensus 193 --~-~-~~i~n~~~~~~L~~lkd~~G~~~~~------------~~~~~~l~G~PV~~~~~~----~-------------- 238 (324) T protein:vir:96 193 --A-N-AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSS----N-------------- 238 (324) T ss_pred --C-C-EEEEcHHHHHHHHHhhCCCCCeeec------------CCCCCcccceeeEeecCC----C-------------- Confidence 1 1 3579999999999987644433331 233456777877765410 0 Q ss_pred ccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc------cchhhHHHHH--HHHHHhhccccc Q lcl|NC_013692. 318 ESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP------YGEMGFMSIK--WYYGFMVFRPEW 389 (399) Q Consensus 318 ~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP------lgQrg~~gwK--~~~~~~iLn~~~ 389 (399) .++. .+++|.-+.-.+++.. .+.+++..-.. . ....|+ |-|++.+.|+ +++++.+++++- T Consensus 239 --~~~~----~~~~gd~s~~~~~~~~---~~~i~~~~~~~-~--~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a 306 (324) T protein:vir:96 239 --LKRG----ELITGDFDKLIYGIPQ---LIEYKIDETAQ-L--STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred --CCcc----eEEEEecceEEEEEec---CcEEEEeeccc-c--cccccccccchhhhhcCcEEEEEEEEeccEEecccc Confidence 0111 2567765555455432 22232221111 0 111222 3466677777 788999999999 Q ss_pred eEEEEEecCC Q lcl|NC_013692. 390 IALLKTVARL 399 (399) Q Consensus 390 m~~iet~A~~ 399 (399) +++|+.+.+. T Consensus 307 ~~~l~~a~~~ 316 (324) T protein:vir:96 307 FAKLVPADKR 316 (324) T ss_pred eEEEeccccc Confidence 9999988877 No 55 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.65 E-value=3.1e-09 Score=67.20 Aligned_cols=294 Identities=9% Similarity=0.005 Sum_probs=157.2 Q ss_pred CCCcccccccccc--------C----CCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGPVDNIKPMKY--------N----DPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~~~~~~~~~~--------n----~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrr 68 (399) ............| + ..+.+.-+.+-|+ .+..+.+..+.+...+.+++.+.+|+-+.++. .+.+ T Consensus 97 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~ 171 (415) T protein:vir:94 97 QNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKY-PVVR 171 (415) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeE-EEEe Confidence 0000000000000 0 0111111222332 34566777778889999999999999776653 2223 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) +..-+.+ ....||-+ .|..+. .+++.|+.++++++.++.+|+++.+ T Consensus 172 ~~~~~~~-~~v~Eg~~------------------------~~~~~~---------~~~~~i~~~~~k~~~~~~is~ell~ 217 (415) T protein:vir:94 172 QSEVAAL-EKVEELEE------------------------NPELAV---------KPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred ecCCccc-eecccccc------------------------cccccc---------ccceeeEeeheeeeeechhhHHHHh Confidence 3332111 11122211 111111 2567899999999999999999766 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee-eeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTV-RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V-~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) +++..|.+.|..+|...-+...++.+..+.- .|... .-.+..............++++|..+...|...... T Consensus 218 -ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----- 290 (415) T protein:vir:94 218 -DAKVNVLQELKLWMARTIAATRNKAIIDVIT-KGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----- 290 (415) T ss_pred -hchHHHHHHHHHHHHHHHHHHHHHHHhhccc-cCccccccccccccccccccccccchHHHHHHHHhhhhhccC----- Confidence 4555676666666666655533333332222 12111 101111222233334568899998888766543321 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) .+ ..+|||.....|+.++|-.+.|=|.|- +..|-.+.+-++.++.++.+. ..++| T Consensus 291 -------------~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~-~~~~~-- 345 (415) T protein:vir:94 291 -------------HN-VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDEV-LGQKG-- 345 (415) T ss_pred -------------CC-EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCceecceeeEEecccc-cCCCC-- Confidence 12 357899999999999887777777542 234556788999998888642 11111 Q ss_pred cCCcccccccccCccceeEEEEEEcc--ccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVAS--EAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~--~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) . . .+++|. ++|-. ..+ .+ +.++ +. .+-..++++.++ .++.+.++ T Consensus 346 --------------~---~-~i~~gd~~~~~~~-~~~-~~--~~v~--~~---------~~~~~~~~~r~~-~r~d~~~~ 391 (415) T protein:vir:94 346 --------------N---N-TLIIGNLKDAIVL-FDR-SQ--YQAS--WT---------DYMHFGECLMIA-VRQDCRIL 391 (415) T ss_pred --------------c---c-EEEEEehhccEEE-Eee-cc--eEEE--Ee---------ccccCceEEEEE-EEeccEEe Confidence 1 1 256773 43332 111 11 2221 11 112334444333 46788889 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-++.++..+.. T Consensus 392 ~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 392 DYKSAIVIEYDDSE 405 (415) T ss_pred ccccEEEEEEeccC Confidence 99999999888777 No 56 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.64 E-value=1.7e-08 Score=63.16 Aligned_cols=294 Identities=9% Similarity=0.047 Sum_probs=168.0 Q ss_pred CCcccc---cccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhh Q lcl|NC_013692. 18 NGVESS---IGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGN 94 (399) Q Consensus 18 ~~~~~~---i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn 94 (399) |.+.++ +-|+ .| ..+.++.+++.-++.+++.+.+|+.+. +++-++..- |...++. T Consensus 1 mat~~~gg~lvP~---~~-~~~ii~~~~~~s~i~~~~~~i~~~~~~---~~~p~~~~~------------~~a~wv~--- 58 (311) T protein:vir:81 1 MVALATGTFQLPK---HL-VPGVWQKAQGQSVLARLSMAEPQEFGE---QQYMTLTAP------------PRGEVVG--- 58 (311) T ss_pred CceecCCceEcch---hH-HHHHHHHHHhcchhhhhcceeecCCCc---eEEEEEeCC------------ceeEEee--- Confidence 333322 3444 23 577888888988999999999887753 444333221 1112222 Q ss_pred hccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch--hHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 95 LYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP--AMEGHVTTEMVKGANEITE 172 (399) Q Consensus 95 ~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~--~L~~~i~~el~~~~~~~t~ 172 (399) +|+.+..-..++..++...++++.+..+|+++.....|. .|.+.|..+|.+.-+. T Consensus 59 --------------------Eg~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~--- 115 (311) T protein:vir:81 59 --------------------EGAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGR--- 115 (311) T ss_pred --------------------cCcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHH--- Confidence 233333344578889999999999999999988655443 3555454444444333 Q ss_pred HHHHHHHHhc---Cceeeecc------ccccccccCCcceec-HHHHHHHHHHHHhccCccccceeccccccCcccccCe Q lcl|NC_013692. 173 DLLQIDLLNS---AGTVRYPG------AATSDAEVDATTEVT-YDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNA 242 (399) Q Consensus 173 d~l~~~~l~a---gt~V~YAg------~aTsra~v~~~~~vt-~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~ 242 (399) .+-..+|++ ++-.-..| .++.-.+.+...... ..++..+...+..++.. + T Consensus 116 -~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~-----------------~-- 175 (311) T protein:vir:81 116 -ALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLS-----------------P-- 175 (311) T ss_pred -HHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHHHHHHHHHhhhcCCC-----------------c-- Confidence 333334433 11111110 111111222222223 34566666555444432 1 Q ss_pred eEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCcc Q lcl|NC_013692. 243 RALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGK 322 (399) Q Consensus 243 yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~ 322 (399) -..++||....-|+.|+|--+.|-|.+.. ..+..|++-++.++.+..+.-=...+..... .......+ T Consensus 176 ~~~vmn~~~~~~l~~lkd~~G~~l~~~~~--------~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~----~~~~~~~~ 243 (311) T protein:vir:81 176 DGVALDNTFSFMLATQRDSQGRKLYPELG--------FGTDVASFAGLNAAVSDTVRGGPEAVTASTG----VYRTTNPN 243 (311) T ss_pred eEEEEcHHHHHHHHhhhccCCCeeecCcc--------ccCCCceecceeEEecccccccccccccccc----hhcccCCc Confidence 23688999999999999888888886542 3456678889888877654322221111111 11222333 Q ss_pred ceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 323 YSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 323 ~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A~~ 399 (399) .+ +++|.-+.-.++... .+.+++ ..-+ ..|....|-|++.+.|+ +++++.+++++-+++|+-+..- T Consensus 244 ~~----~~~gDfs~~~i~~~~---~~~~~~--~~~~--~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 244 VK----AIAGDFSAFRWGVQV---SIPLEL--IEFG--DPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cE----EEEEecccEEEEEec---cceEEE--eccC--CCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 33 567877666666642 223332 2222 13444467889999998 6889999999999999877666 No 57 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.64 E-value=1.3e-08 Score=63.84 Aligned_cols=306 Identities=11% Similarity=0.071 Sum_probs=165.3 Q ss_pred cccccccccCCCCC----C---cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 5 VDNIKPMKYNDPAN----G---VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 5 ~~~~~~~~~n~~~~----~---~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) |.++|-..-+.... + +.+.+-|+ .+..+.++..++.-.+.+++.+.+|+.+..+ |-...-.|-..+.. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~~~a~~v~ 75 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPK----EIVGPIFDKAQESSLVLRLGENIPISYGETI-IPTTVKRPEVGQVG 75 (338) T ss_pred CcchHHhhhhhcccccccceecccccccch----HHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEecCccceeec Confidence 55555544433221 1 11223333 3467788888888899999999998854322 22222222222211 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ..+-... .+|+.+..-..++..|+.+.++++.+..+|+++... +...+.+ T Consensus 76 ~~~~~~~-----------------------------~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d-s~~~~~~ 125 (338) T protein:vir:78 76 VGTSNEQ-----------------------------REGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM-NPSGLYT 125 (338) T ss_pred ccccccc-----------------------------cccccccccccceeEEEEEEEEEEEeehhhHHHHhc-CHHHHHH Confidence 1111111 123333334457888999999999999999998764 3344666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce------------eeeccccccccccCCcceecHHHHHHHHHHHHhccCcccc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT------------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKI 225 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~------------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T 225 (399) .|..+|.+.-+. .+...+|++-+. ...+ ..+............+++|.++...+..|.... T Consensus 126 ~i~~~la~a~~~----~~d~~~l~G~g~~~~~~~~gi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 198 (338) T protein:vir:78 126 KLQADLAYAIGR----GIDLAVFHGKSPLTGSALQGIDTNNVIV-NTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVD-- 198 (338) T ss_pred HHHHHHHHHHHH----HHHHHhhcccCCCccccccccccccccc-cccccccccccchhhHHHHHHHHHHhhhhcccc-- Confidence 665555554444 333345532211 0111 112222222233456777877776665544321 Q ss_pred ceeccccccCcccccCeeEEEechhhhHHHHHH---hhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 226 KMITGTRMIDTRTVGNARALYVGSDLVPTIEAM---KDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 226 ~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~---~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) .=+.+|||.+...|+.+ +|-.+.|-|. ...+.+.-+.+-++.++.++.|---. T Consensus 199 ----------------~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~--------~~~~~~~~~~l~G~PV~~~~~ip~~~ 254 (338) T protein:vir:78 199 ----------------FNGWAADPRYRARLLRSQAYRDANGNVDPT--------RINLAASAGDLLGLPVQFGKAVGGDL 254 (338) T ss_pred ----------------ceEEEEchHHHHHHHHHhhhccCCCceeec--------ccccCCCCceeeeeeEEEccccCccc Confidence 12577899988777554 3333334443 23456677889999999988653222 Q ss_pred cCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCcc------chhhHHHH Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPY------GEMGFMSI 376 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPl------gQrg~~gw 376 (399) +++ .+++. .+++|.-+.-.++... .+.+++. ...+ -.+..||- -|+...+| T Consensus 255 ~~~-------------~~~~~----~~~~gdfs~~~~~~~~---~~~i~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~ 311 (338) T protein:vir:78 255 GAA-------------TDSKV----RVVGGDFSQLKYGFAD---EIRVKMS-DTAT--LTDNTSPTPQTVSMWQTNQIAI 311 (338) T ss_pred ccc-------------CCccc----EEEEEecceEEEEeec---ccEEEEe-eccc--ccccccccccchhhhhcCcEEE Confidence 211 12233 3457766655555442 2222211 1111 12334553 45566777 Q ss_pred H--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 377 K--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 377 K--~~~~~~iLn~~~m~~iet~A~~ 399 (399) | +++++.+++++-+++|+-++.= T Consensus 312 r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 312 LIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred EEEEEeccEeecccceEEEecccCC Confidence 6 6889999999999888776555 No 58 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.63 E-value=1.6e-07 Score=57.91 Aligned_cols=313 Identities=12% Similarity=0.081 Sum_probs=176.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.-| ||.-.+-| .+..+++ .+..--|.-+.+..-+...++..+-.+|.+ ..||++.|-|--..... -++ T Consensus 1 ms~~-~~~tr~~~----~~s~~d~--al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~~~~--~~~ 69 (335) T protein:vir:63 1 MSFL-NDLTRPNY----AGKNADV--DIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNVEAK--GRR 69 (335) T ss_pred CCCc-ccchhhhc----ccccchh--heehhhhhhhHHHHHHhhhhhccccceeee--ccceeEEEeeeeeeeee--ccc Confidence 5544 33323333 2222333 344456678888887778888899999998 55999999765444221 111 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .|-.+.|+... + .+...+|-+.=-.-.+=|.+++.-.+--+.++++ T Consensus 70 pG~~l~~~~~~---------------------~-------------~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s 115 (335) T protein:vir:63 70 AGEELERSRVV---------------------N-------------DKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVA 115 (335) T ss_pred CCcCcCCCCcc---------------------c-------------cceEEEecceeechhhhhhHHHHhcCchhHHHHH Confidence 12222221110 0 1112222221111112223333323323556677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcee---eec----cccccccccCCcceec-HHHH----HHHHHHHHhccCcccccee Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTV---RYP----GAATSDAEVDATTEVT-YDSL----MRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V---~YA----g~aTsra~v~~~~~vt-~~~l----r~a~~~Lk~nrApk~T~ii 228 (399) .|++..=+......+.+-+++|+... .+. .+.+....+++.+..+ .+.| +.|...|.++.-|. T Consensus 116 ~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~----- 190 (335) T protein:vir:63 116 ELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGD----- 190 (335) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCC----- Confidence 78777777766666667777666542 111 1233333343332222 3344 45556677666652 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcC---CccccccccceeEcCeEEEecCcccccccCC Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYA---AGGATMHGEVGQLGRFRVIVNPQMMHWAGVG 305 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg---~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aG 305 (399) ..-.-++++|.|..-..|.+ ++.|+.. .|+ .....-+|+|+++.+||+++++++---...+ T Consensus 191 ---------~~~~dr~~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~ 254 (335) T protein:vir:63 191 ---------AVYSEGLTPMSPRVFSLLLE------HDKLMNV-EYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAA 254 (335) T ss_pred ---------cccCceEEEeChHHHHHHhc------ccccccc-ccccccccccccCceeEEeeceEEEeeccCCCCCccc Confidence 11123999999999999986 5788887 455 3345688999999999999999874333332 Q ss_pred cccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhc Q lcl|NC_013692. 306 KAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVF 385 (399) Q Consensus 306 a~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iL 385 (399) ...+..+ +...+-..-...+++-++|-+++-++. +..+ --.|+--|..++=-|..|++.+| T Consensus 255 ~~lg~a~----n~~~~d~~~~~~~~~~~~Al~t~~~~~----vt~e-----------~~~~~~~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:63 255 HPLGRHF----NVSAEESERQIALFLPSKTLITAQVAP----VQAK-----------LWEDNEKFSWVLDTFQMYNIGAR 315 (335) T ss_pred ccccccC----CccccccceeEEEEEecceEEEEEEee----cccc-----------eeeccchhhHHhHHHHHcCCccc Confidence 2222111 111222223567888888888776641 1111 11233346677788899999999 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) |++.-+.||+ .-+ T Consensus 316 RPe~a~~i~~-tg~ 328 (335) T protein:vir:63 316 RPDTAGAIEL-KGI 328 (335) T ss_pred ccceEEEEEE-cCC Confidence 9999999997 223 No 59 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.62 E-value=1.5e-08 Score=63.40 Aligned_cols=286 Identities=11% Similarity=0.141 Sum_probs=162.1 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhcc Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYG 97 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ 97 (399) |++.++=+--+-+.+ ..+.++.+++.-.+.+++.+.+|+.+ ++++-++.. | |...+... T Consensus 1 m~t~t~gg~liP~~~-~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~----------~--~~a~wv~E----- 59 (303) T protein:vir:97 1 MGTETSKASLFDKHL-VSDLINKVKGHSSLAKLSSQKPIPFN---GSKEFTFTL----------D--SDIDVVAE----- 59 (303) T ss_pred CcccCCCCeEcchhH-HHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEec----------C--cceEEeec----- Confidence 444333332222334 57777778889999999999998853 344433321 1 12233322 Q ss_pred ccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc--hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 98 SSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD--PAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 98 ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D--~~L~~~i~~el~~~~~~~t~d~l 175 (399) |+.+-.-.++++.++...++.+.+..+|+++....+| ..|.+.|..+|.+.-+. .| T Consensus 60 ------------------~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~----~l 117 (303) T protein:vir:97 60 ------------------NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLAR----GI 117 (303) T ss_pred ------------------CccccccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHH----HH Confidence 2222233457788999999999999999998864444 34555555555444443 33 Q ss_pred HHHHHhc-----Ccee------eeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeE Q lcl|NC_013692. 176 QIDLLNS-----AGTV------RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARA 244 (399) Q Consensus 176 ~~~~l~a-----gt~V------~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv 244 (399) ...+|++ |+.. .+.+..+...+.+ ....++++|.++...|..+... +. . T Consensus 118 d~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~-----------------~~--~ 177 (303) T protein:vir:97 118 DLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFT-ESEDADANIEAAVNLIQGAEGV-----------------VT--G 177 (303) T ss_pred HhhhhcccccCCccccccccccccccccccccccc-cccchHHHHHHHHHHHhhcCCC-----------------cc--E Confidence 3445543 2221 1222222222222 2345688888888777654332 11 3 Q ss_pred EEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccce Q lcl|NC_013692. 245 LYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYS 324 (399) Q Consensus 245 ~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~D 324 (399) .++||.....|+.|+|--+.+-|.|-.. ..+..|++-+++++.+..+.-..+.+. . T Consensus 178 ~vmn~~~~~~L~~lkd~~g~~~~~~~~~-------~~~~~~~l~G~Pv~~s~~v~~~~~~~~---------------~-- 233 (303) T protein:vir:97 178 LAMDTEFSTALAKVTNGEMGPKMYPELA-------WGANPDSINGLKSSVNTTVGAGADEAE---------------S-- 233 (303) T ss_pred EEEcHHHHHHHHHhhccCCCeEEecCcc-------CCCCCceecceeeEEecccCCccccCC---------------C-- Confidence 6789999999999887555555543211 233457899999999987532111111 0 Q ss_pred eEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCC-CCCccchhhHHHHH--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 325 VFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATAD-RSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 325 VYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad-~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A~~ 399 (399) -..+++|.-+.+ .++.+ +.+.++++ .-+. +| +..-|-|+..+.++ .++++.+++++-+++|+= |++ T Consensus 234 -~~~~~~Gdf~~~~~~~~~---~~~~~~~~--~~~~--~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~-~~~ 303 (303) T protein:vir:97 234 -KDLVIIGDFESMFKWGYA---KQIPMEII--KYGD--PDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTK-GEV 303 (303) T ss_pred -ccEEEEeeccccEEEEEe---cCcEEEEe--eccC--CCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeC-CCC Confidence 123678875432 45554 22233333 2221 11 11225666667775 578899999999998864 344 No 60 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.62 E-value=1.4e-08 Score=63.60 Aligned_cols=272 Identities=16% Similarity=0.118 Sum_probs=143.9 Q ss_pred cccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEE Q lcl|NC_013692. 52 DTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKG 131 (399) Q Consensus 52 ~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~ 131 (399) .+|+| ..||+++|-|--... ..-.+.|-.+.| ++.+++-.+... T Consensus 1 ~vr~i--~~g~s~~~~~iG~~~--~~~~~~G~~l~~--------------------------------~~~~~~~~e~~i 44 (324) T protein:vir:99 1 MTRTI--TSGKSAQFPVMGRTK--ARYLKQGQSLDD--------------------------------GREDIKHTEKVI 44 (324) T ss_pred Ceeee--ecCceEEEeeeeeeE--eccccCCCCcCC--------------------------------CcCCcCcccEEE Confidence 55554 348888886542221 111222222221 011122334445 Q ss_pred EeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhc---Cc-----eeeeccccccccccCCc-- Q lcl|NC_013692. 132 KLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNS---AG-----TVRYPGAATSDAEVDAT-- 201 (399) Q Consensus 132 ~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~a---gt-----~V~YAg~aTsra~v~~~-- 201 (399) +|-|+=-|..+=|.+++.-..-.|+.+++.|++..=+......+.+.+.+. .+ ++.=.|++.+ ..+.+. T Consensus 45 tID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~-~~~~~~~~ 123 (324) T protein:vir:99 45 TIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASL-VKITGKKE 123 (324) T ss_pred EecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccce-eccccccc Confidence 555544444333344443333225555555555444443333332322211 11 1111111111 001111 Q ss_pred -ceec----HHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCc Q lcl|NC_013692. 202 -TEVT----YDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAG 276 (399) Q Consensus 202 -~~vt----~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~ 276 (399) ...+ ++.|+.|...|+++..|. ..+++++.|+.-..|++ ..+.-...|+.. T Consensus 124 ~~~~~~~~~~dai~~a~~~Lde~~VP~-----------------~gR~~vv~P~~y~~Ll~-------~~~~~~~~~~~~ 179 (324) T protein:vir:99 124 DPAKYGTQVIQALTYARAAFAKKYIPA-----------------GDRTFYTDPDTYSAILA-------ALMPNAANYAAL 179 (324) T ss_pred ccccCHHHHHHHHHHHHHHHhhcCCCC-----------------CCCEEEeChHHHHHHhh-------cccccccccccc Confidence 1111 567888899999999983 24899999999888864 456667788888 Q ss_pred cccccccceeEcCeEEEecCcccccccCCcccCCc---ccccc---cccCcccee----EEEEEEccccceecccccCCC Q lcl|NC_013692. 277 GATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN---DQVPM---HESGGKYSV----FPMLCVASEAFTTVGFATDGK 346 (399) Q Consensus 277 ~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~---~~~~~---~~~~~~~DV----Yp~lV~G~~Afg~v~l~~~g~ 346 (399) +.+-+|.||++.+|++++++++-.-.+.......+ ...+. +....+|++ =.-|+|=+++-+++-++ T Consensus 180 ~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~---- 255 (324) T protein:vir:99 180 IDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLK---- 255 (324) T ss_pred cceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCceeEEEEehhheEEEeee---- Confidence 89999999999999999999976533321111110 00000 111122222 12367777777766553 Q ss_pred CCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 347 NVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 347 ~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .++.+ ..-|+--|..++==|..|++.+||++..+.+|.-+-. T Consensus 256 ~~~~e-----------~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 256 DMALE-----------RARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred cceec-----------ceechhhHHHhhhhhhhhcCcccccceEEEEEEccCc Confidence 11111 1235566777777789999999999999888854433 No 61 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.60 E-value=5.2e-09 Score=66.01 Aligned_cols=291 Identities=11% Similarity=0.082 Sum_probs=162.6 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ++. ++++.+.|+....+++++-+.-+- -.+.++.++.+.+.-++.+++.+.+|+.+ ++++-+...- T Consensus 14 f~~--~~~~~~~~~a~~~~~~~~~~~liP-~~~~~~ii~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~~-------- 79 (324) T protein:vir:93 14 FAS--NNVKPQVFNPDNVMMHEKKDGTLL-NDFTTPILQEVMENSKIMQLGKYEPMEGT---EKKFTFWADK-------- 79 (324) T ss_pred HHH--hhhhhhhcccccccccCCCcceec-hhHHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecC-------- Confidence 232 345556664332221222121222 23467777778888889999998888743 3454333221 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) |..++++.| +..-.-..++..|+.+.++++.+..+|+++.+ ++++.|...|. T Consensus 80 ----~~a~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 131 (324) T protein:vir:93 80 ----PGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMK 131 (324) T ss_pred ----cceeeecCC-----------------------ccccccccceeEEEEEeEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 111222221 22222234678899999999999999998877 45566777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceee----eccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVR----YPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDT 236 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~----YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT 236 (399) .+|.+.-+.-.++.+ |++.+.-. -......... .....+++++|.++...|+.+.... T Consensus 132 ~~l~~aia~~~d~a~----l~G~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~------------- 193 (324) T protein:vir:93 132 PMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELEA------------- 193 (324) T ss_pred HHHHHHHHHHHHHHH----hcCCCCCCcCccccccccccce-eccccccHHHHHHHHHhhhhccCCC------------- Confidence 777666555444443 43222110 1111111111 1235678999999998887765421 Q ss_pred ccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccc Q lcl|NC_013692. 237 RTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPM 316 (399) Q Consensus 237 ~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~ 316 (399) + ..+|||....-|+.++|--+.|-|. .+--+++-++.++.++.. . T Consensus 194 -----~-~~v~n~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PVv~~~~~--------~--------- 238 (324) T protein:vir:93 194 -----N-AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSS--------N--------- 238 (324) T ss_pred -----C-EEEEcHHHHHHHHHhhCCCCCeeec------------CCCCCcccceeeEeecCC--------C--------- Confidence 1 3679999999999988655444432 223456667776655420 0 Q ss_pred cccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc------cchhhHHHHH--HHHHHhhcccc Q lcl|NC_013692. 317 HESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP------YGEMGFMSIK--WYYGFMVFRPE 388 (399) Q Consensus 317 ~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP------lgQrg~~gwK--~~~~~~iLn~~ 388 (399) .++. .+++|.-+...+++. +.+.+++.--... ....|| +-|++.+.++ +++++.+++++ T Consensus 239 ---~~~~----~i~~gdfs~~~~~~~---~~~~i~~~~~~~~---~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~ 305 (324) T protein:vir:93 239 ---LKRG----ELITGDFDKLIYGIP---QLIEYKIDETAQL---STVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) T ss_pred ---CCcc----eEEEEecceEEEEEe---cCcEEEEeecccc---cccccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 0111 245776555445444 2223333221110 011111 3455666666 67899999999 Q ss_pred ceEEEEEecCC Q lcl|NC_013692. 389 WIALLKTVARL 399 (399) Q Consensus 389 ~m~~iet~A~~ 399 (399) -+++|..+.+. T Consensus 306 a~~~l~~a~~~ 316 (324) T protein:vir:93 306 AFAKLVPADKR 316 (324) T ss_pred ceEEEeccccc Confidence 99999877777 No 62 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.60 E-value=2.4e-08 Score=62.34 Aligned_cols=313 Identities=9% Similarity=0.029 Sum_probs=158.4 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc-CCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL-DDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~-~~~t~l 79 (399) |.-| |......++ ..++. -.+..--|.-+.++.=+..-++..+=.++.+ ..||+.+|.|--... ...+|- T Consensus 1 ms~~-n~~t~~~~~-----~~~~~-~al~le~f~geV~taf~~~s~~~~~~~~rti--~~gkS~q~~~iG~~~~~~~~~G 71 (364) T protein:vir:10 1 MSNP-NVLTQPAVS-----ASGEV-DSLLIEKFNNRVHEQYLKGENLLQWFDVQEV--VGTNSVSNKYIGETELQVLSPG 71 (364) T ss_pred CCCc-ccccccccc-----cccch-hhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEeeeeeeeEEeeeccC Confidence 3222 222122221 01111 1122223456666665555556666666765 478898886653322 122221 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchh-HHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPA-MEGH 158 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~-L~~~ 158 (399) + .+.|+.+.+ .+.+-+|-+.=-+-.+-+.+++.-.|-. +.++ T Consensus 72 ~---~ld~~~~~~----------------------------------~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e 114 (364) T protein:vir:10 72 K---SPDASPTEF----------------------------------DKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSK 114 (364) T ss_pred c---ccCCCCccc----------------------------------CcEEEEecceeeechhhhhHHHHhcCccchhHH Confidence 1 122222211 1223333332122222333333222222 3344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC-ceeeec-------ccccccccc--CCc-ceecHHHH----HHHHHHHHhccCcc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSA-GTVRYP-------GAATSDAEV--DAT-TEVTYDSL----MRLRLDLDNARAPT 223 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~ag-t~V~YA-------g~aTsra~v--~~~-~~vt~~~l----r~a~~~Lk~nrApk 223 (399) ++.|++..=+......+.+-++.++ +++.=+ +++.+ .++ +.. ...+...| ..+...|+++..|. T Consensus 115 ~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~-i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~ 193 (364) T protein:vir:10 115 LSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFS-IHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDT 193 (364) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcce-eeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCc Confidence 5555555555544444444443333 221100 11111 011 111 12222333 35666788888863 Q ss_pred ccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehh-hcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 224 KIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIE-KYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 224 ~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~-kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) ..+++++.|..-..|.+ ++.|++.. .+.......+|+|+++.+||+++++++ |+. T Consensus 194 -----------------~~R~~vv~P~~y~~Ll~------~~~lvn~d~~~~~~~~~~~G~v~~v~Gv~Vv~Sn~l-P~~ 249 (364) T protein:vir:10 194 -----------------SELCGLMPWTAFNCLRD------ADRIVDKSYTIAASDNTVDGFVLKSWNTPIVPSNRF-PKL 249 (364) T ss_pred -----------------cccEEEeChHHHHHHhc------CCccccccccccCCCccccceeEEEeceEEEecccc-ccc Confidence 34999999999988875 67788775 444556678999999999999999997 653 Q ss_pred -cCCcccCCccccccc--ccCccce------eEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhH Q lcl|NC_013692. 303 -GVGKAVDPNDQVPMH--ESGGKYS------VFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGF 373 (399) Q Consensus 303 -~aGa~~~~~~~~~~~--~~~~~~D------VYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~ 373 (399) +.+...+....-+.| ..++.|+ -.-.++|=++|-+++-++ .+..++- .|+--|..+ T Consensus 250 ~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~----~~t~e~~-----------~~~~~~~~~ 314 (364) T protein:vir:10 250 SDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTI----SITGDIF-----------YEKKEKTWY 314 (364) T ss_pred cccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEe----cceeeee-----------eccceeeee Confidence 221111111111111 1234444 455788888888877663 1111111 233334455 Q ss_pred HHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 374 MSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 374 ~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .=-|..|++.+||++.-+.|.+++.= T Consensus 315 ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 315 IDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred eeeehcccCcccCccceEEEEecCCC Confidence 55588999999999999999998877 No 63 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.58 E-value=1.8e-08 Score=63.08 Aligned_cols=284 Identities=13% Similarity=0.099 Sum_probs=157.9 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ..+.+....- .....++++.+.+-|. . +..+.+....+...+.+++...+++.+ ++++-++.-- T Consensus 102 ~~~~~~~~~~-~~~~~~~~~~g~~vp~---~-~~~~ii~~~~~~~~l~~l~~~~~~~~~---~~~~~~~~~~-------- 165 (395) T protein:vir:43 102 RGSHRVSMPR-SAITSIDGSGGALVAP---D-RRPGVVAAPQRRLTIRDLVAPGTTESN---SVEYVRETGF-------- 165 (395) T ss_pred hhhhhhhhhh-hhhcccCCCCccccch---h-hHHHHHHHHHhhhhHHhhccceecCCC---ceEEEEEecC-------- Confidence 1222211111 1111111111222222 2 346666667788889999999998754 4555443211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ++.+.+... |+......++++.|+.++++++.++.+|+++.+ +.+ .|...|. T Consensus 166 ---~~~a~~v~E-----------------------~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l~~~v~ 217 (395) T protein:vir:43 166 ---VNNAAPVSE-----------------------GTQKPYSDLTFELENAPVRTIAHLFKASRQILD-DAS-ALQSYID 217 (395) T ss_pred ---CCceeeecC-----------------------CccccccccceeEEEEeeeeEEEeehhhHHHHH-hHH-HHHHHHH Confidence 111112111 111122334778899999999999999999876 454 4766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcC-ce------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSA-GT------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRM 233 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~ag-t~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~ 233 (399) .+|.+.-+...+. .+|++. ++ ...++..+............+++|..+...|+.+..+. T Consensus 218 ~~la~a~~~~~d~----~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---------- 283 (395) T protein:vir:43 218 ARARYGLMLVEEC----QLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPA---------- 283 (395) T ss_pred HHHHHHHHHHHHH----HHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCC---------- Confidence 6666555553333 344321 11 11122122222222234467888888887776554431 Q ss_pred cCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccc Q lcl|NC_013692. 234 IDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQ 313 (399) Q Consensus 234 ~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~ 313 (399) + +.+|||.+...|+.|+|--+.|=|. ....+.-+.+-|+++|+++.|. ++ T Consensus 284 --------~-~~vmn~~~~~~l~~lkd~~G~~i~~---------~~~~~~~~~l~G~pVv~~~~~~----~~-------- 333 (395) T protein:vir:43 284 --------S-GIVLNPIDWALIELNKDAENRYIIG---------SPQNGTTPTLWRLPVVETQAIT----QD-------- 333 (395) T ss_pred --------c-EEEEcHHHHHHHHHhhccCCceecc---------ccccCCCceecceeeEEcCCCC----CC-------- Confidence 2 4579999999999988655544442 1345566788899999988541 11 Q ss_pred ccccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccce Q lcl|NC_013692. 314 VPMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWI 390 (399) Q Consensus 314 ~~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m 390 (399) -+++|.-+.. .+... .+ +.+-+-. ..+.+-+++...|+ +++.+.+++++-+ T Consensus 334 --------------~~~~gd~~~~~~~~~~-~~----~~i~~~~-------~~~~~f~~~~~~~r~~~r~d~~v~~~~a~ 387 (395) T protein:vir:43 334 --------------EFLTGAFSLGAQIFDR-MD----IEVLVST-------ENDKDFENNMVTIRAEERLAFAVYRPEAF 387 (395) T ss_pred --------------cEEEEeccceEEEEEe-cc----eEEEEec-------cccchhhcCcEEEEEEEeeccEEecccce Confidence 0345553321 22221 12 1121111 12345678888888 5889999999999 Q ss_pred EEEEEecC Q lcl|NC_013692. 391 ALLKTVAR 398 (399) Q Consensus 391 ~~iet~A~ 398 (399) +++++.+- T Consensus 388 ~~~~~taa 395 (395) T protein:vir:43 388 VTGSLTAS 395 (395) T ss_pred EEEEeccC Confidence 99998888 No 64 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.58 E-value=8e-09 Score=64.98 Aligned_cols=293 Identities=11% Similarity=0.086 Sum_probs=160.6 Q ss_pred CCCccccccccccCC---CCCCcccc---cccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVDNIKPMKYND---PANGVESS---IGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~~~~~~~~n~---~~~~~~~~---i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) +...+.......+.. ...++.++ +-|+ .+ ..+.+....+...+.+++.+.++. .+..+.+.+...... T Consensus 99 ~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~---~~-~~~ii~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~ 172 (409) T protein:vir:45 99 GASELTSEERKALRELRAQGVAQDEKGGYTVPE---TF-LAKVVEKMKSYGGIASVAQILTTS--DGRTMEWATADGTSE 172 (409) T ss_pred hhhhccHHHHHHHHHHhhccCccCcCCceeccH---hH-HHHHHHHHHhhhhhhhhceeeecC--CCceEEEEeeccCcc Confidence 222221111111111 11111111 2233 23 455666666777788888887764 344455544432211 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhch Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDP 153 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~ 153 (399) .+.+.. +|+...--.+++..++.+.+++ +.++.+|+++.+- +++ T Consensus 173 -----------~~~~v~-----------------------E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d-s~~ 217 (409) T protein:vir:45 173 -----------VGVLLG-----------------------ENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD-SAI 217 (409) T ss_pred -----------cccccc-----------------------ccccccccccccceeeeeeeeeeeeehhhhHHHHhc-cHH Confidence 111111 1111122223556677777665 7899999998864 455 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee-------eeccccccccccCCcceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAGTV-------RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V-------~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) .|.+.|..+|...-+. .+...+|++.++- +.+ ..+.-.+......+++++|..+...|+..... T Consensus 218 ~l~~~i~~~la~a~~~----~~~~a~l~G~G~~~~~~p~Gil~-~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~---- 288 (409) T protein:vir:45 218 DMEAYLARRIAERIGR----GEARYLIQGTGAGTPKQPKGLAA-SVTGTTQTAAANAVKWQEILALKHSIDPAYRR---- 288 (409) T ss_pred HHHHHHHHHHHHHHHH----HHHHHhhccCCCCCccccceeee-ccccccccccccccchHHHHHHHHhhhhhhcc---- Confidence 6766665555444443 3344456433220 111 11111222233568899999988888765432 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa 306 (399) .+.++.+||+.....|+.|+|--+.|=|.|- +..|.-+++-|.+++.+..|. ..++| T Consensus 289 -------------~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~p-~~~~~- 345 (409) T protein:vir:45 289 -------------GPKFRLAFNDNTLKLISEMEDGQGRPLWLPD--------IVGVAPASVLNVPYVIDQEID-DIGAG- 345 (409) T ss_pred -------------CCeEEEEECHHHHHHHHHhhcCCCceeeccC--------cCCCCCceecceeeEEecCcC-CccCC- Confidence 2357889999999999999876666655431 233455678899999887752 21111 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhh Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMV 384 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~i 384 (399) .+ .++||.=+...+..++.. .+ .-..|||.+.+.+++++ ++.+.+ T Consensus 346 ------------------~~-~i~~Gd~~~~~i~~~~~~---~~-----------~~~~d~~~~~~~~~~~~~~r~d~~~ 392 (409) T protein:vir:45 346 ------------------KK-FMFCGDFDRFIIRRVRYM---IL-----------KRLVERYAEYDQTGFLAFHRFDCIL 392 (409) T ss_pred ------------------cc-EEEEeehhhhheeeccce---EE-----------EEeecccccCCcEEEEEEEEeccEe Confidence 11 255676333233332111 11 11257888888887774 789999 Q ss_pred ccccceEEEEEecCC Q lcl|NC_013692. 385 FRPEWIALLKTVARL 399 (399) Q Consensus 385 Ln~~~m~~iet~A~~ 399 (399) .+++-++.++..+.. T Consensus 393 ~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 393 EDTSAIKALVGKGSV 407 (409) T ss_pred echhheEEEEeccCC Confidence 999999999998888 No 65 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.57 E-value=1.2e-08 Score=64.04 Aligned_cols=296 Identities=9% Similarity=-0.002 Sum_probs=157.8 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.-.+...+...-+..++..-+.+-|+ .+....+..+.+...+.+++.+.+|+.+.++.... +..+-..+.. .. T Consensus 109 ~~~~~~~~~~~~~~~~~t~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-v~ 182 (415) T protein:vir:47 109 FTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVV-RQSEVAALEK-VE 182 (415) T ss_pred HHHHHhhhhhhhhccccccCCcccccH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEE-EecCCcceee-cc Confidence 000000000000001111111223343 33566666777889999999999999887753322 2222211111 11 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ||- . .|..+ ..++..|+.+.++++.++.+|+++.+ +++..|...|. T Consensus 183 Eg~-----~-------------------~~~~~---------~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 228 (415) T protein:vir:47 183 ELE-----E-------------------NPELA---------VKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELK 228 (415) T ss_pred ccc-----c-------------------ccccc---------ccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHH Confidence 111 1 11111 13667899999999999999998886 45556767676 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcee-eeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTV-RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V-~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+...++.+..+.-+ |.+. .-...............+++++|..+...+...... T Consensus 229 ~~l~~~i~~~~d~~il~g~g~-g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----------------- 290 (415) T protein:vir:47 229 LWMARTIAATRNKAIIDVITK-GSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----------------- 290 (415) T ss_pred HHHHHHHHHHHHHHHhhcccc-CCccccccccccccceeccccccchHHHHHHHHhhhhhccC----------------- Confidence 666666555433333322211 1110 000111122233344668899999888777654331 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) + + ..+|||.....|+.++|--+.|=|.|- +.++-.+++-++.+++++.+ +..++|. T Consensus 291 ~-~-~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~------------- 346 (415) T protein:vir:47 291 H-N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDE-VLGQKGN------------- 346 (415) T ss_pred C-C-EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCccccceeeEEeccc-cccCCCc------------- Confidence 1 1 357999999999999886666666542 23445578889998888753 2111111 Q ss_pred CccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 320 GGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 320 ~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) . .+++|.=+-+.+.+...+ +.++.. .+-..++++.++ +++.+.+++++-++.++..++. T Consensus 347 ------~-~~~~gd~~~~~~~~~~~~--~~v~~~-----------~~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:47 347 ------N-TLIIGNLKDAIVLFDRSQ--YQASWT-----------DYMHFGECLMIA-VRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ------c-EEEEEehhccEEEEeecc--eEEEee-----------ccccCceEEEEE-EEeccEEeccccEEEEEeeccC Confidence 1 267774332111121112 122111 112234444433 5688999999999999988888 No 66 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.57 E-value=1.2e-08 Score=64.04 Aligned_cols=296 Identities=9% Similarity=-0.002 Sum_probs=157.8 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.-.+...+...-+..++..-+.+-|+ .+....+..+.+...+.+++.+.+|+.+.++.... +..+-..+.. .. T Consensus 109 ~~~~~~~~~~~~~~~~~t~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-v~ 182 (415) T protein:vir:46 109 FTEYLETRNDIQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVV-RQSEVAALEK-VE 182 (415) T ss_pred HHHHHhhhhhhhhccccccCCcccccH----HHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEE-EecCCcceee-cc Confidence 000000000000001111111223343 33566666777889999999999999887753322 2222211111 11 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ||- . .|..+ ..++..|+.+.++++.++.+|+++.+ +++..|...|. T Consensus 183 Eg~-----~-------------------~~~~~---------~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 228 (415) T protein:vir:46 183 ELE-----E-------------------NPELA---------VKPFFQLAYDINTHRGYFRISREAIE-DAKVNVLQELK 228 (415) T ss_pred ccc-----c-------------------ccccc---------ccceeeEEeeeeeeEeeehhhHHHHh-hchHHHHHHHH Confidence 111 1 11111 13667899999999999999998886 45556767676 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcee-eeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTV-RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V-~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+...++.+..+.-+ |.+. .-...............+++++|..+...+...... T Consensus 229 ~~l~~~i~~~~d~~il~g~g~-g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~----------------- 290 (415) T protein:vir:46 229 LWMARTIAATRNKAIIDVITK-GSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE----------------- 290 (415) T ss_pred HHHHHHHHHHHHHHHhhcccc-CCccccccccccccceeccccccchHHHHHHHHhhhhhccC----------------- Confidence 666666555433333322211 1110 000111122233344668899999888777654331 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) + + ..+|||.....|+.++|--+.|=|.|- +.++-.+++-++.+++++.+ +..++|. T Consensus 291 ~-~-~~v~n~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~------------- 346 (415) T protein:vir:46 291 H-N-VAIVSQTMFAKLDKMKDKLGNYLIQPD--------VKEKTQQRLLGAKIEILPDE-VLGQKGN------------- 346 (415) T ss_pred C-C-EEEEcHHHHHHHHHhhccCCCeeeccC--------cCCCCCccccceeeEEeccc-cccCCCc------------- Confidence 1 1 357999999999999886666666542 23445578889998888753 2111111 Q ss_pred CccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 320 GGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 320 ~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) . .+++|.=+-+.+.+...+ +.++.. .+-..++++.++ +++.+.+++++-++.++..++. T Consensus 347 ------~-~~~~gd~~~~~~~~~~~~--~~v~~~-----------~~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:46 347 ------N-TLIIGNLKDAIVLFDRSQ--YQASWT-----------DYMHFGECLMIA-VRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ------c-EEEEEehhccEEEEeecc--eEEEee-----------ccccCceEEEEE-EEeccEEeccccEEEEEeeccC Confidence 1 267774332111121112 122111 112234444433 5688999999999999988888 No 67 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.56 E-value=2.1e-08 Score=62.71 Aligned_cols=300 Identities=12% Similarity=0.082 Sum_probs=167.0 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.--.+|.....= .+.+..+-|-|+ + ..+.++.+++.-.+.+++...+|+.+ ++++.+... T Consensus 1 ~g~~~e~~~~~~~--~t~~~~g~l~~~----~-~~~ii~~l~~~s~i~~l~~~~~~~~~---~~~ip~~~~--------- 61 (397) T protein:vir:23 1 MGFSADHSQIAQT--KDTMFTGYLDPV----Q-AKDYFAEAEKTSIVQRVAQKIPMGAT---GIVIPHWTG--------- 61 (397) T ss_pred CCcCHHHHHHhhc--cCCCCccccchh----H-HHHHHHHHHhccchhhhcceeeccCC---ceEEEEEcC--------- Confidence 4433333333211 111112234444 2 25677788888888899999888743 345444321 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .|...++..| +....-..++..|+.++++++.++.+|+++.+ ++++.|...|. T Consensus 62 ---~~~a~wv~Eg-----------------------~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~ 114 (397) T protein:vir:23 62 ---DVSAQWIGEG-----------------------DMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMR 114 (397) T ss_pred ---CcceEEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 1122232221 22222235778899999999999999999877 34556777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) .+|.+.-+...++.+..+-=++-....+..... .........+++++..+...|..+..+. T Consensus 115 ~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~----------------- 175 (397) T protein:vir:23 115 TKVATAIAMAFDNAALHGTNAPSAFQGYLDQSN--KTQSISPNAYQGLGVSGLTKLVTDGKKW----------------- 175 (397) T ss_pred HHHHHHHHHHHHHHHhhcccCCccccccccccc--ceeeecccchhHHHHHHHHhhhhcccCC----------------- Confidence 777666665444443322111111111221111 1122234456666666666666554321 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) -..+||+.....|+.++|--+.|-|.|...-+.... .-.|.+-++.++.++.|. +|. T Consensus 176 --a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~---~~~~tl~G~Pv~~s~~~~----~g~-------------- 232 (397) T protein:vir:23 176 --THTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTP---FREGRILGRPTILSDHVA----EGD-------------- 232 (397) T ss_pred --CEEEEcHHHHHHHHHhhccCCceeeccccccccccc---ccCceeeeeeEEEeCCCC----CCc-------------- Confidence 235899999999999999888888888665554433 334778899999887642 111 Q ss_pred ccceeEEEEEEccccceecccccCCCCCcceE---EEecCCCcCCCCCCcc--chhhHHHHH--HHHHHhhccccceEEE Q lcl|NC_013692. 321 GKYSVFPMLCVASEAFTTVGFATDGKNVKFKI---ITKRPGEATADRSDPY--GEMGFMSIK--WYYGFMVFRPEWIALL 393 (399) Q Consensus 321 ~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~---ivk~pG~~tad~~DPl--gQrg~~gwK--~~~~~~iLn~~~m~~i 393 (399) ..+++|.-+...++... .+.+++ ....-| .....+|+ -|++.+.|+ +++.+.+++++-++++ T Consensus 233 ------~~~~~gDfs~~~i~~~~---~i~i~~~~e~~~~~~--~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~ 301 (397) T protein:vir:23 233 ------VVGYAGDFSQIIWGQVG---GLSFDVTDQATLNLG--SQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKL 301 (397) T ss_pred ------eEEEEeecceEEEEEEe---ceEEEEeeeeeeeec--cccccceeeeeeccceeEEEEeeeccceecccceEEE Confidence 13455654443344331 112221 111111 11223343 577778887 6889999999999988 Q ss_pred EEecCC Q lcl|NC_013692. 394 KTVARL 399 (399) Q Consensus 394 et~A~~ 399 (399) +..+-- T Consensus 302 ~~~~~~ 307 (397) T protein:vir:23 302 TFDPVL 307 (397) T ss_pred eecccc Confidence 864332 No 68 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.56 E-value=2.4e-07 Score=56.84 Aligned_cols=312 Identities=12% Similarity=0.068 Sum_probs=172.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.-| ||.-.+-| .+..++. .+..--|.-+.|+.-+...++..+-.+|.+ ..||++.|-|--.... .-++ T Consensus 1 ms~~-~~~t~~~~----~~s~~d~--al~le~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~iG~~~~--~~~~ 69 (335) T protein:vir:78 1 MSFL-NDLTRPNY----AGKNADV--DIHLEEHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRLGNVEA--KGRR 69 (335) T ss_pred CCcc-cccccccc----ccccchh--hhhhhhhhhHHHHHHHHhhhhccccceeee--ccceeEEEeeeeeeee--cccc Confidence 5544 33334444 2222333 345556788888888888899999999998 5599999965443321 1112 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .|-.+.|+... -.+...+|-+.=-.-.+=|.+++.-.+=.+.++++ T Consensus 70 pG~~l~~~~~~----------------------------------~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s 115 (335) T protein:vir:78 70 AGEELERSRVV----------------------------------NDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVA 115 (335) T ss_pred cCcccCCCCcc----------------------------------cCCeEEEecceeechhhHhhHHHhhcCchhHHHHH Confidence 22222222111 11222222221112222233333322222556677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCce---eeec----cccccccccCCcce-ecHHH----HHHHHHHHHhccCcccccee Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGT---VRYP----GAATSDAEVDATTE-VTYDS----LMRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~---V~YA----g~aTsra~v~~~~~-vt~~~----lr~a~~~Lk~nrApk~T~ii 228 (399) .|++..=+......+.+-+++|+-. +.+. .+.+....+++.+. -..+. ++.+...|.+..-|. T Consensus 116 ~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~----- 190 (335) T protein:vir:78 116 ELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGD----- 190 (335) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCC----- Confidence 7777777776655566777766532 2222 11222232332211 12333 445555577666653 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcC---CccccccccceeEcCeEEEecCcccccccCC Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYA---AGGATMHGEVGQLGRFRVIVNPQMMHWAGVG 305 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg---~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aG 305 (399) .....++++|.|..-..|.+ ++.|+.. .|+ .....-+|+|+++.+||+++++++-.-...+ T Consensus 191 ---------~~~~~rv~vv~P~~y~~Ll~------~~~l~n~-~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~ 254 (335) T protein:vir:78 191 ---------AVYSEGLTPMSPRVFSLLLE------HDKLMSV-EYQATGATNDYVKSRVAILNGVKVLETPRFATKAISA 254 (335) T ss_pred ---------CCCCccEEEeChHHHHHHhc------ccccccc-cccccccccccccceeEEeeceEEEeeccCCCCCCcc Confidence 11224999999999999986 5788887 444 3345688999999999999999874322222 Q ss_pred cccCCcccccccccCcccee--EEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHh Q lcl|NC_013692. 306 KAVDPNDQVPMHESGGKYSV--FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFM 383 (399) Q Consensus 306 a~~~~~~~~~~~~~~~~~DV--Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~ 383 (399) ..-+..+ ...++|. =..+++=++|-+++-++. +..+ -..|+--|..++=-|..|++. T Consensus 255 ~~lg~a~------n~~~~d~~~~~~~~~~~~Al~t~~~~~----~~~e-----------~~~~~~~~~~~i~~~~a~G~g 313 (335) T protein:vir:78 255 HPLGRHF------NVSAEEAERQIALFLPSKTLITAQVAP----VQAK-----------LWEDHDQFSWVLDTFQMYNIG 313 (335) T ss_pred ccccccC------CcccccccceEEEEEecceEEEEEEEe----cccc-----------eeeccchhhHhhhHHHHcCCc Confidence 2111111 1122222 245666667766665531 1111 112334466777788999999 Q ss_pred hccccceEEEEEecCC Q lcl|NC_013692. 384 VFRPEWIALLKTVARL 399 (399) Q Consensus 384 iLn~~~m~~iet~A~~ 399 (399) +||+|.-+.||+---= T Consensus 314 ~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 314 ARRPDTAGAIELKGIE 329 (335) T ss_pred ccCcceEEEEEecCCC Confidence 9999999999864322 No 69 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=98.54 E-value=2e-08 Score=62.80 Aligned_cols=294 Identities=12% Similarity=0.126 Sum_probs=157.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccC----cCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMP----KHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mP----Kn~GktIkfrry~pl~~~~ 76 (399) || | +=+.+.|| -|.+++|..-++.+|+.++....--. ..+|.||+.|+.-++.... T Consensus 1 MA----N------------sl~~l~p~----iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d 60 (423) T protein:vir:10 1 MA----N------------NLDANVSQ----IVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSER 60 (423) T ss_pred Cc----c------------ccccccHH----HHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeec Confidence 22 1 01223455 67899999999999999886553322 3479999997766554322 Q ss_pred ccc--cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhch Q lcl|NC_013692. 77 NVN--DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDP 153 (399) Q Consensus 77 t~l--teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~ 153 (399) .+. ..+.++ ..|+| ..++++|-|+ .+-++++|+=...+++ T Consensus 61 ~~~~~~t~~~~------------------------~~l~e------------~~v~l~id~~k~~a~~v~d~E~~l~i~- 103 (423) T protein:vir:10 61 TMDGDITGKSK------------------------NSLIS------------AKATGEVGNYITVAVEYRQIEEALKLN- 103 (423) T ss_pred ccCcccCcccc------------------------ccccc------------ceEEEEecceeeeeeeeChHHHhcChh- Confidence 211 111111 12333 2456666543 4566788776554444 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLN----SAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMIT 229 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~----agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~ 229 (399) ++. +.+..|..-..+.+..++.+ ++.++ .++..... =.++++..+.+.|+++++|+ T Consensus 104 ----~~~-~~l~~A~~aLA~~vd~~ia~~~~~~~~~~----vgt~~t~~-----~a~~~~a~a~~~L~~~~vP~------ 163 (423) T protein:vir:10 104 ----QLD-QILVPINERMVTDLETELALFMMKHGALS----LGSPNTPI-----KKWSDVAQTASFLKDLGINS------ 163 (423) T ss_pred ----HHH-HHHHHHHHHHHHHHHHHHHHHhhhccccc----cccccccc-----ccHHHHHHHHHHHhhccCCc------ Confidence 232 34555555555566666642 11111 11111111 13789999999999999985 Q ss_pred cccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccc-eeEcCeEEEecCccccc-ccC-C- Q lcl|NC_013692. 230 GTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEV-GQLGRFRVIVNPQMMHW-AGV-G- 305 (399) Q Consensus 230 ~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEI-G~i~~~RfV~~~~~~~~-~~a-G- 305 (399) ..|++++.|+....|.. +..+-.+.+-+..+.+-+|+| |++.+|++.++...-.- .+. | T Consensus 164 -----------~~R~~Vv~p~~~a~Ll~------~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~g 226 (423) T protein:vir:10 164 -----------GENYAVMDPWAAQRLAD------AQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGG 226 (423) T ss_pred -----------CCCEEEeCHHHHHHHhh------hhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccccc Confidence 23889999999988853 344555556677778888877 99999999998876532 111 1 Q ss_pred ------------cccC---------------Cc------------c-----------------------ccc-----ccc Q lcl|NC_013692. 306 ------------KAVD---------------PN------------D-----------------------QVP-----MHE 318 (399) Q Consensus 306 ------------a~~~---------------~~------------~-----------------------~~~-----~~~ 318 (399) +++. .+ + .+. ... T Consensus 227 a~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:10 227 KLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSS 306 (423) T ss_pred eeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEeccccccc Confidence 1100 00 0 000 011 Q ss_pred cCccceeEEEE------------------------------------EEccccce--ecccccCCC---------CCcce Q lcl|NC_013692. 319 SGGKYSVFPML------------------------------------CVASEAFT--TVGFATDGK---------NVKFK 351 (399) Q Consensus 319 ~~~~~DVYp~l------------------------------------V~G~~Afg--~v~l~~~g~---------~~k~~ 351 (399) ++..+-+||.+ ++.++||+ +.+|.-.+. ++++. T Consensus 307 ~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r 386 (423) T protein:vir:10 307 GDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSAVATYEGFSIR 386 (423) T ss_pred CceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcceEEEEEcccCCCccceeecccccceEE Confidence 22234444422 44444444 233321111 00111 Q ss_pred EEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 352 IITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 352 ~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) ++ .=+ |.-..--.+.|=.+||+..|++||..|+ .+.| T Consensus 387 ~~--~~~-------d~~~~~~~~r~d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:10 387 VH--KYA-------DGDANKQMMRFDLLPAYVCYNPHMGGQF-FGNP 423 (423) T ss_pred EE--Eee-------eccccceEEEEEeecceeeeccceEEEE-EecC Confidence 11 110 0001111233446799999999998776 4556 No 70 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.53 E-value=2.4e-08 Score=62.37 Aligned_cols=305 Identities=10% Similarity=0.057 Sum_probs=162.7 Q ss_pred cccccccccCCCCCCcccc-------cccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESS-------IGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~-------i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) |+..|-...+.......+. +-|+ .+..+.++...+...+.+++.+.+|+-+. .+|-.....|-. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~----~~~~~ii~~l~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a---- 71 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPK----EIVGPIFDKAQESSLVLRMGEQIPISYGE-TIIPTTVKRPEV---- 71 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccch----hHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEeCCcee---- Confidence 5555555544432221111 2232 33577788888888999999998887321 233332222221 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) .+...|..+- ..+++....-+.++..|+.+.++.+.+..+|+++.+. +++.+.+ T Consensus 72 ----------~~v~eg~~~~---------------~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~-s~~~~~~ 125 (333) T protein:vir:78 72 ----------GQVGVGTSNE---------------QREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM-NPSGLYT 125 (333) T ss_pred ----------EeecCccccc---------------ccccccccccccceeEEEEeeEEEEEeehhhHHHHhc-CHHHHHH Confidence 1211111100 0112222233457888999999999999999998763 3445666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCcee---ee--------ccccccccccCCcceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTV---RY--------PGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V---~Y--------Ag~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) .|..+|.+.-+.-. -..+|++-+.. .. ....+...........++++|.++...+..|... T Consensus 126 ~i~~~la~ai~~~~----d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~---- 197 (333) T protein:vir:78 126 KLQGDLAYAIGRGI----DLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDV---- 197 (333) T ss_pred HHHHHHHHHHHHHH----HHHHhcccCCCCCcccccccccccccccccccccccccchhHHHHHHHHHhhcccccc---- Confidence 66666555544433 33344322210 00 1112222222333556788888887766555432 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHH---hhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAM---KDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~---~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) .++ +.++||.....|+.+ +|--+.+-|. .....+..|++-++.+++++.+..-.. T Consensus 198 -------------~~~-~~vmn~~~~~~L~~~~~~~d~~G~~i~~--------~~~~~~~~~~l~G~Pv~~~~~i~~~~~ 255 (333) T protein:vir:78 198 -------------EFN-GWAVDPRFRAHLLRAQAYRDANGNVDPS--------RINLAAQTGDVLGLPAQFGRAVGGDLG 255 (333) T ss_pred -------------Cce-EEEEcchHHHHHHHHhhhcCCCCceeec--------CccccCCCceeeceeeEEccccCCCcc Confidence 111 466799988777654 3323333342 234567779999999999986532111 Q ss_pred CCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc----cchhhHHHHH-- Q lcl|NC_013692. 304 VGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP----YGEMGFMSIK-- 377 (399) Q Consensus 304 aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP----lgQrg~~gwK-- 377 (399) . ...++.+ +++|.-+.-.+++.+ . +++.+..=+ +....|. +-|++.+.++ T Consensus 256 ~-------------~~~~~~~----~~~gD~~~~~~g~~~---~--~~i~~~~~~--~~~~~~~~~~~~~~~~~v~~r~~ 311 (333) T protein:vir:78 256 A-------------AVDSKTR----IIGGDFSQLKFGFAD---E--IRIKMSDTA--TLTDSGSATVSMWQTNQIAILIE 311 (333) T ss_pred c-------------cCCCccE----EEEEecccEEEEEee---c--cEEEEeccc--cccccccceeehhhcCcEEEEEE Confidence 1 1122333 446655544455541 2 222222211 2222222 3456666666 Q ss_pred HHHHHhhccccceEEEE-EecC Q lcl|NC_013692. 378 WYYGFMVFRPEWIALLK-TVAR 398 (399) Q Consensus 378 ~~~~~~iLn~~~m~~ie-t~A~ 398 (399) .++++.+++++-+++|+ ..|| T Consensus 312 ~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 312 VTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEccEEecccceEEEeccCCC Confidence 57889999999998887 5677 No 71 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.53 E-value=7.3e-08 Score=59.72 Aligned_cols=325 Identities=9% Similarity=0.058 Sum_probs=170.2 Q ss_pred cccccccccCCCCCCccccc----cc-ceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSI----GP-QIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i----~p-~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) |.+.|..-.-.+..+|.... ++ .+..--|..+++..=+..-++..+-.++++= .||+++|-|--.... .-. T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~--~Gksv~f~~iG~~t~--~~~ 76 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLK--NGKSLQFIYTGRMTS--SFH 76 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccc--cCceEEEEeeeeeEE--eee Confidence 44444333222211111111 11 2222234566665555555555555555442 489999976533311 112 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +.|....|.. +.+.+-++.+.+|-|.=-|..+=|.+++.-..-.|+.++ T Consensus 77 t~G~~i~~~~-------------------------------~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~ 125 (375) T protein:vir:10 77 TPGTPILGNA-------------------------------DKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEI 125 (375) T ss_pred cCCcCcCCcc-------------------------------ccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHH Confidence 2222222111 111122334455555544444444455444443466666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCc--------eeeecc-------cccc-ccccCCcceecHHHHHHHHHHHHhccCcc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAG--------TVRYPG-------AATS-DAEVDATTEVTYDSLMRLRLDLDNARAPT 223 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt--------~V~YAg-------~aTs-ra~v~~~~~vt~~~lr~a~~~Lk~nrApk 223 (399) +.|+...=+......+.+-+.+|+- ...+.| ++++ ..+++ -..-++.|+.+...|.++..|. T Consensus 126 s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~t--a~~~~~ai~~a~~~Lde~~VP~ 203 (375) T protein:vir:10 126 SKKIGYALAEKYDRLIFRSITRGARSASPVSATNFVEPGGTQIRVGSGTNESDAFT--ASALVNAFYDAAAAMDEKGVSS 203 (375) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCcceeeeccccccccccC--HHHHHHHHHHHHHHHhhcCCCC Confidence 6666555555444444444444321 122221 1111 11122 1133566888888999998873 Q ss_pred ccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 224 KIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 224 ~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) ..+++++.|+.-.-|..-+ ..+.|+.. .|+..+..-+|.+|++.+|+++++.++-...+ T Consensus 204 -----------------~~R~~vv~P~~y~~Ll~~~---d~~~~~n~-d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~ 262 (375) T protein:vir:10 204 -----------------QGRCAVLNPRQYYALIQDI---GSNGLVNR-DVQGSALQSGNGVIEIAGIHIYKSMNIPFLGK 262 (375) T ss_pred -----------------CCCEEEeChHHHHHHHhcC---Cccceeee-cccccceeccceEEEEeceEEEEecccccccc Confidence 2488999999988886421 12456665 47777777899999999999999998655443 Q ss_pred CCc------ccCCcc-----------ccccccc-CccceeEE-------EEEEccccceecccccCCCCCcceEEEecCC Q lcl|NC_013692. 304 VGK------AVDPND-----------QVPMHES-GGKYSVFP-------MLCVASEAFTTVGFATDGKNVKFKIITKRPG 358 (399) Q Consensus 304 aGa------~~~~~~-----------~~~~~~~-~~~~DVYp-------~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG 358 (399) .+- .+.+.. ..+...+ +++|++=. -|+|=++|-|++-+.+ +.. T Consensus 263 ~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~----~~~-------- 330 (375) T protein:vir:10 263 YGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIG----PQV-------- 330 (375) T ss_pred ccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeec----ccc-------- Confidence 221 110000 0001111 12333222 3667777777775521 111 Q ss_pred CcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 359 EATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 359 ~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +-+-+.=++.-|..+.=-|+.+|+.+||+|.-+.|.+.++- T Consensus 331 ~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 331 QVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred ccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 10111237777888888899999999999999999998765 No 72 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.51 E-value=6.3e-09 Score=65.56 Aligned_cols=286 Identities=20% Similarity=0.181 Sum_probs=156.0 Q ss_pred CCCccccc---cccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNI---KPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~---~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) -.|..... .....+..+.+.-+-+-|+ -+..+.++...+..++.+++...++..+.. +..+. T Consensus 92 r~~~~~~~~~~e~~a~~~~~~~~GG~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~---~~~~~-------- 156 (401) T protein:vir:44 92 RKGREDGLRDLERKALQVGTDEDGGYAVPE----ELDRSILSLLKDEVVMRQEATVITVGGSDY---KKLVN-------- 156 (401) T ss_pred hhhhhhhhHHHHHHHhhcCCCCCCceeccH----hHHHHHHHHHHhhhhhhhhceeeecCCCce---EEEEe-------- Confidence 01110100 0000111111101112343 234666666667778889998888754432 22111 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) +.|..+ .+...|.- +...-..++..|+.+.++++.++.+|+++.+ +++..|.+ T Consensus 157 --~~~~~a--~wv~E~~~----------------------~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~ 209 (401) T protein:vir:44 157 --LGGTAS--GWVGETDT----------------------RSQTATSRLGLIEPFMGEIYGNPQATQKMLD-DAFFNVEA 209 (401) T ss_pred --cCCccc--eeeccccc----------------------cCccccccceeeeeehhheeeehhhhHHHHh-cchHHHHH Confidence 111111 12211100 0000012567899999999999999999877 55656777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce----------------eeecccccccccc-CCcceecHHHHHHHHHHHHhcc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT----------------VRYPGAATSDAEV-DATTEVTYDSLMRLRLDLDNAR 220 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~----------------V~YAg~aTsra~v-~~~~~vt~~~lr~a~~~Lk~nr 220 (399) .|..+|.+.-+...+ ..+|++-++ ..++. ++....+ .....+++++|..+.-.|+... T Consensus 210 ~i~~~la~ai~~~~~----~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~-~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~ 284 (401) T protein:vir:44 210 WINSELATEFAEQEE----IAFTTGDGTKKPKGFLAYESTEESDKARAF-GKLQHIVSGEATAVTADAIIKLIYTLRKAH 284 (401) T ss_pred HHHHHHHHHHHHHHH----hhhhccCCCCccceeecccccccccccccc-ccccccccccccccCHHHHHHHHHhcchhh Confidence 776666665554333 233432111 00110 0001111 1235688999999888776543 Q ss_pred CccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccc Q lcl|NC_013692. 221 APTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMH 300 (399) Q Consensus 221 Apk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~ 300 (399) .. .+ +.+||+....-|+.|+|--+.|=|.|-.+. |.-+++-+..++.++.|. T Consensus 285 ~~------------------~a-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~--------g~~~~l~G~PVv~~~~~p- 336 (401) T protein:vir:44 285 RT------------------GA-KFMMNNNSLFAIRLLKDTEGNYLWRPGLEL--------GQPSSLAGYGIAENEQMP- 336 (401) T ss_pred hc------------------CC-EEEEcHHHHHHHHHhhccCCceeecCCcCC--------CCCceecceeeEEecCcC- Confidence 21 11 357999999999999887777777654333 344678888988887642 Q ss_pred cccCCcccCCcccccccccCccceeEEEEEEcc--ccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH Q lcl|NC_013692. 301 WAGVGKAVDPNDQVPMHESGGKYSVFPMLCVAS--EAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW 378 (399) Q Consensus 301 ~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~--~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~ 378 (399) ..++++ ..++||. ++|....- .+ +++ ..|||.+++.++|++ T Consensus 337 ~~~~~~--------------------~~i~~Gd~~~~~~i~~~--~~----~~~-----------~~~~~~~~~~v~~~a 379 (401) T protein:vir:44 337 DIAADA--------------------KAIAFGNFKRGYTIVDR--IG----TRI-----------LRDPYTNKPFVGFYT 379 (401) T ss_pred CccCCc--------------------cEEEEeehhccEEEEEe--cc----eEE-----------eeeccccCCcEEEEE Confidence 222111 1244564 23332111 11 111 156788899999996 Q ss_pred --HHHHhhccccceEEEEEecC Q lcl|NC_013692. 379 --YYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 379 --~~~~~iLn~~~m~~iet~A~ 398 (399) ++++.+++++-++.|+.+|- T Consensus 380 ~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 380 TKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEEeccEEecccceEEEEeecC Confidence 59999999999999999988 No 73 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.49 E-value=3.2e-08 Score=61.71 Aligned_cols=296 Identities=11% Similarity=0.078 Sum_probs=156.0 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ..--+..++... +.++.+..+.-+-+ -+.+..+..+.+...+.+++++.+|+-+. .+..+.... ..-.-.. T Consensus 151 ~~~~~~~~~a~~----~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~--~~a~~v~ 221 (458) T protein:vir:10 151 TEHGQRHLKAVN----QSSSVEVSSESYET-IFSQRIIRDLQKELVVGALFEELPMSSKI--LTMLVEPDA--GKATWVA 221 (458) T ss_pred hhhhhhhhhhhh----hcccCccccceehh-hHhHHHHHHHHhhhhHHhhcceeecCCcc--eEEEEecCC--cceeecc Confidence 000011111111 11111111112222 33677777788888899999998886432 222221110 0001111 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ||-..... ..+. -...++..|+.+.++++.++.+|+++.+ ++++.|.+.|. T Consensus 222 e~~~~~~~-------------------~~~~---------~~~~~~~~i~~~~~k~~~~v~is~ell~-ds~~~~~~~i~ 272 (458) T protein:vir:10 222 ASTYGTDT-------------------TTGE---------EVKGALKEIHFSTYKLAAKSFITDETEE-DAIFSLLPLLR 272 (458) T ss_pred cccccccc-------------------cccc---------cccccceeeEeeeeeEEeeehhhHHHHh-cchHHHHHHHH Confidence 22111000 0000 0113567789999999999999999764 45566777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCce------eeeccc----cccccccCCcceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGT------VRYPGA----ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~----aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) .+|...-+... -..+|++-++ +.+++. .+...+....+.+++++|.++...|+.+... T Consensus 273 ~~l~~~i~~~~----d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-------- 340 (458) T protein:vir:10 273 KRLIEAHAVSI----EEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK-------- 340 (458) T ss_pred HHHHHHHHHHH----HHHhhcCCCCCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC-------- Confidence 77666655533 3345543222 112211 1111222233668999999888777654321 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) +++ -+|||.....|+.|+|--+.|-|.+ ........|..+++-+++++++..|- ++ T Consensus 341 ---------~~~--~v~~~~~~~~l~~lkd~~G~~i~~~----~~~~~~~~~~~~~l~G~pv~~~~~~p----~~----- 396 (458) T protein:vir:10 341 ---------LSK--LVLIVSMDAYYDLLEDEEWQDVAQV----GNDSVKLQGQVGRIYGLPVVVSEYFP----AK----- 396 (458) T ss_pred ---------CCE--EEEcHHHHHHHHhhcccCCceeecc----ccccccccCcCceecceeeEEccccc----cc----- Confidence 122 3789999999998876544444433 33334566777889999999987641 11 Q ss_pred cccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhcccc Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPE 388 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~ 388 (399) ++..+++ +..|+ +.|- +.-. .+ +++. .|||.+.+++++. ..++..++++. T Consensus 397 ---------~~~~~~~-~~~f~-~~~~-~~~~-~~----~~v~-----------~d~~~~~~~~~~~~~~r~~~~v~~~~ 448 (458) T protein:vir:10 397 ---------ANSAEFA-VIVYK-DNFV-MPRQ-RA----VTVE-----------RERQAGKQRDAYYVTQRVNLQRYFAN 448 (458) T ss_pred ---------cCCcceE-EEEec-ccEE-EEEe-ec----eEEE-----------eecccCCCceEEEEEEEecceEeccc Confidence 1122332 23333 2232 1111 11 2221 3778777776665 34567888999 Q ss_pred ceEEEEEecC Q lcl|NC_013692. 389 WIALLKTVAR 398 (399) Q Consensus 389 ~m~~iet~A~ 398 (399) -++.+..+|- T Consensus 449 a~v~~~~aa~ 458 (458) T protein:vir:10 449 GVVSGTYAAS 458 (458) T ss_pred ceEEEeeccC Confidence 9999999888 No 74 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.48 E-value=1.3e-07 Score=58.28 Aligned_cols=285 Identities=11% Similarity=0.014 Sum_probs=143.6 Q ss_pred CCcccccccceehhhhhHHHHHHhhh-HHhh------hhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAK-EAYF------GQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATI 90 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~------~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~ 90 (399) |++ +.++.-+++..| -.++....+ ..-| ...++...+-..-|.+|.+=.|.+|.-+...+++|-+=...+| T Consensus 1 MA~-T~lsd~i~PEvf-~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MAE-THLSDLIVPEVF-GNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CCc-eeeeeeechhHH-HHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 432 333332222232 223322222 2233 2333344444456899999888888555566666644332232 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI 170 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~ 170 (399) +...-.+.+.++|.=++.+|.+.+..-.+- ++++...+...-+.. T Consensus 79 ----------------------------------tt~~~~a~i~~~~kg~~~tD~a~~~sg~dp-~~~i~~q~a~~w~~~ 123 (351) T protein:vir:15 79 ----------------------------------TSGKQQGIKFYQTKAYGYTDLGTMISGAPV-QETIGNRFAAFWQRA 123 (351) T ss_pred ----------------------------------cccceeEEEEeeccceehhhhhHhhccchH-HHHHHHHHHHHHHHH Confidence 233457888999999999998766544322 345655555433332 Q ss_pred HHHHHHHHHHhc-------CceeeeccccccccccCCcceecHHHHHHHHHHHHhc-cCccccceeccccccCcccccCe Q lcl|NC_013692. 171 TEDLLQIDLLNS-------AGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNA-RAPTKIKMITGTRMIDTRTVGNA 242 (399) Q Consensus 171 t~d~l~~~~l~a-------gt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~n-rApk~T~ii~~s~~~gT~~I~~~ 242 (399) .+..|. .+|++ +.+..|.. +.. ......+|.+.|-+|...|-.. ... - T Consensus 124 ~q~~ll-a~l~gv~~~~~~~~~~~~d~--t~~--~~~~~~is~~~l~~A~~~~GD~~~~~-------------------~ 179 (351) T protein:vir:15 124 DQKTLL-SVLKGVMGVTKIANSKVYDQ--TKV--SPSEPMFGAKGFTGAIGLMGDLQDTA-------------------F 179 (351) T ss_pred HHHHHH-HHHHHHhhchhhcccceecc--ccc--cccccccCHHHHHHHHHHhccccccc-------------------e Confidence 222221 12221 11222221 110 1123568999999999877553 222 2 Q ss_pred eEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCcc Q lcl|NC_013692. 243 RALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGK 322 (399) Q Consensus 243 yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~ 322 (399) .+.+|||.....|+++ ++++-.+|.+. +.+||.+.+.|+|+++-+ | +..+++. T Consensus 180 ~~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~t~~G~~VivdD~~-p---------------~~~~~~~ 232 (351) T protein:vir:15 180 GAIAVNSATYSLMKVQ-------GLIETIQPQNG----ATPFEAYNGLRIVLDDDI-E---------------IDLTDKT 232 (351) T ss_pred EEEEEChHHHHHHHhh-------hhhhhcccccc----CcccceecceEEEEcCCC-c---------------cccCCCC Confidence 7888999999999974 46666777765 357999999999999843 1 2223345 Q ss_pred ceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhH-----HHHHHHH---HHhhccccceEEEE Q lcl|NC_013692. 323 YSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGF-----MSIKWYY---GFMVFRPEWIALLK 394 (399) Q Consensus 323 ~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~-----~gwK~~~---~~~iLn~~~m~~ie 394 (399) ..+|-..+||+.|++-. .+ +.+.++ ...|.. -+..|=|-+|-. .|+||-- ....-.+. .+-|+ T Consensus 233 ~~~ytsyl~~~GAi~~~----~~-~~~ve~-~rd~~~--~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt-~~~L~ 303 (351) T protein:vir:15 233 KPVSTSYIFAPGAVRYS----TN-MRSTET-KYDPLI--NGGQDVIVQKRVGTIHVAGTSIKASFSPSKASFPT-IDELA 303 (351) T ss_pred CceeEEEEEecceeeee----cC-CcCcce-eecccC--CCCceEEEEeeeeeeeeeeeeecccccccCcCCcC-hHHhc Confidence 57999999999998831 11 222221 222211 011122222211 1111110 00000000 00011 Q ss_pred EecCC Q lcl|NC_013692. 395 TVARL 399 (399) Q Consensus 395 t~A~~ 399 (399) .++.- T Consensus 304 ~~~NW 308 (351) T protein:vir:15 304 KSSTW 308 (351) T ss_pred CCccc Confidence 11111 No 75 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.48 E-value=1.7e-08 Score=63.16 Aligned_cols=295 Identities=12% Similarity=0.098 Sum_probs=160.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) .++. +.....+|.....++++-+.-+-+.+ ..+.++.+.+.-.+.+++...+|+. .++++.+...- T Consensus 14 ~~~~--~~~~~~~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~-------- 79 (324) T protein:vir:78 14 FASN--NVKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADK-------- 79 (324) T ss_pred HHHH--hhhhhhhccccccccCcCccccchhH-HHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecC-------- Confidence 2221 12233344332222222222222233 5778888888888899988888773 34554433211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) |...+++ +|+....-.+++..++.+.++++.+..+|+++.+ ++++.|.+.|. T Consensus 80 ----~~a~~v~-----------------------Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~ 131 (324) T protein:vir:78 80 ----PGAYWVG-----------------------EGQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMK 131 (324) T ss_pred ----cceeEec-----------------------CCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 1222222 1222233345788899999999999999998777 34456777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccc---cccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGA---ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~---aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) .+|.+.-+...+..+ |++.+.-...+. .+...+.......++++|.++.-.|+.+.... T Consensus 132 ~~la~ai~~~~d~a~----l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~-------------- 193 (324) T protein:vir:78 132 PMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-------------- 193 (324) T ss_pred HHHHHHHHHHHHHHH----hccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCC-------------- Confidence 777666665444443 332221111100 01111111235578999999998887655421 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccc Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMH 317 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~ 317 (399) + +.+|||.....|+.++|--+.|-|. .+.-+++-++.++.++.+. T Consensus 194 ---~--~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PV~~~~~~~------------------ 238 (324) T protein:vir:78 194 ---N--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN------------------ 238 (324) T ss_pred ---C--EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccceeeEeeCCCC------------------ Confidence 1 3579999999999987644444331 2344566777776654100 Q ss_pred ccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCc-CCCCCCcc--chhhHHHHH--HHHHHhhccccceEE Q lcl|NC_013692. 318 ESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEA-TADRSDPY--GEMGFMSIK--WYYGFMVFRPEWIAL 392 (399) Q Consensus 318 ~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~-tad~~DPl--gQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) .++. .+++|.-+...++.. +.+.+++.--..... ......|+ -|+....|+ +++.+.+++++-+++ T Consensus 239 --~~~~----~~~~gd~~~~~~g~~---~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~ 309 (324) T protein:vir:78 239 --LKRG----ELITGDFDKLIYGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) T ss_pred --CCcc----eEEEEecceEEEEEe---cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEE Confidence 0111 245666555445443 222333322111100 00112222 455666666 688999999999999 Q ss_pred EEEecCC Q lcl|NC_013692. 393 LKTVARL 399 (399) Q Consensus 393 iet~A~~ 399 (399) |..+-+. T Consensus 310 l~~a~~~ 316 (324) T protein:vir:78 310 LVPADKR 316 (324) T ss_pred Eeccccc Confidence 9876666 No 76 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.48 E-value=1.7e-08 Score=63.16 Aligned_cols=295 Identities=12% Similarity=0.098 Sum_probs=160.1 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) .++. +.....+|.....++++-+.-+-+.+ ..+.++.+.+.-.+.+++...+|+. .++++.+...- T Consensus 14 ~~~~--~~~~~~~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~-------- 79 (324) T protein:vir:96 14 FASN--NVKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADK-------- 79 (324) T ss_pred HHHH--hhhhhhhccccccccCcCccccchhH-HHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecC-------- Confidence 2221 12233344332222222222222233 5778888888888899988888773 34554433211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) |...+++ +|+....-.+++..++.+.++++.+..+|+++.+ ++++.|.+.|. T Consensus 80 ----~~a~~v~-----------------------Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~ 131 (324) T protein:vir:96 80 ----PGAYWVG-----------------------EGQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMK 131 (324) T ss_pred ----cceeEec-----------------------CCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHH Confidence 1222222 1222233345788899999999999999998777 34456777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccc---cccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGA---ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~---aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) .+|.+.-+...+..+ |++.+.-...+. .+...+.......++++|.++.-.|+.+.... T Consensus 132 ~~la~ai~~~~d~a~----l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~-------------- 193 (324) T protein:vir:96 132 PMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-------------- 193 (324) T ss_pred HHHHHHHHHHHHHHH----hccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCC-------------- Confidence 777666665444443 332221111100 01111111235578999999998887655421 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccc Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMH 317 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~ 317 (399) + +.+|||.....|+.++|--+.|-|. .+.-+++-++.++.++.+. T Consensus 194 ---~--~~vmn~~~~~~L~~l~d~~G~~~~~------------~~~~~~l~G~PV~~~~~~~------------------ 238 (324) T protein:vir:96 194 ---N--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN------------------ 238 (324) T ss_pred ---C--EEEEcHHHHHHHHHhhccCCCeeec------------CCCCCcccceeeEeeCCCC------------------ Confidence 1 3579999999999987644444331 2344566777776654100 Q ss_pred ccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCc-CCCCCCcc--chhhHHHHH--HHHHHhhccccceEE Q lcl|NC_013692. 318 ESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEA-TADRSDPY--GEMGFMSIK--WYYGFMVFRPEWIAL 392 (399) Q Consensus 318 ~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~-tad~~DPl--gQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) .++. .+++|.-+...++.. +.+.+++.--..... ......|+ -|+....|+ +++.+.+++++-+++ T Consensus 239 --~~~~----~~~~gd~~~~~~g~~---~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~ 309 (324) T protein:vir:96 239 --LKRG----ELITGDFDKLIYGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) T ss_pred --CCcc----eEEEEecceEEEEEe---cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEE Confidence 0111 245666555445443 222333322111100 00112222 455666666 688999999999999 Q ss_pred EEEecCC Q lcl|NC_013692. 393 LKTVARL 399 (399) Q Consensus 393 iet~A~~ 399 (399) |..+-+. T Consensus 310 l~~a~~~ 316 (324) T protein:vir:96 310 LVPADKR 316 (324) T ss_pred Eeccccc Confidence 9876666 No 77 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.46 E-value=6.3e-08 Score=60.07 Aligned_cols=285 Identities=11% Similarity=0.069 Sum_probs=163.6 Q ss_pred CCccccc-ccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhc Q lcl|NC_013692. 18 NGVESSI-GPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLY 96 (399) Q Consensus 18 ~~~~~~i-~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~ 96 (399) |++.++. -|+ .| ..+.++..++.-++.+++...+|+.+. +++-|...- |...+...| T Consensus 1 ma~~gG~lip~---~~-~~~ii~~~~~~s~i~~~~~~~~~~~~~---~~~p~~~~~------------~~a~~v~Eg--- 58 (298) T protein:vir:94 1 MVLNKGTLFDP---EL-VTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTMD------------SEIDVVAES--- 58 (298) T ss_pred CeeccccccCh---hH-HHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEecC------------cceEEeeCC--- Confidence 4444443 444 23 466777788888999999999888753 344333211 111222221 Q ss_pred cccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 97 GSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD--PAMEGHVTTEMVKGANEITEDL 174 (399) Q Consensus 97 ~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D--~~L~~~i~~el~~~~~~~t~d~ 174 (399) +....-..++..++.+.++++.+..+|+++.....| ..|.+.|..+|.+.-+.-.+.. T Consensus 59 --------------------~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~ 118 (298) T protein:vir:94 59 --------------------GKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLM 118 (298) T ss_pred --------------------ccccccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHH Confidence 112222347788999999999999999998865444 3455555555555444433333 Q ss_pred HHHHHHhcCc-e---e---eeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEe Q lcl|NC_013692. 175 LQIDLLNSAG-T---V---RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYV 247 (399) Q Consensus 175 l~~~~l~agt-~---V---~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~ 247 (399) +..+.-.+.+ . . ..++..+..........-.+++|.++...|..+..+. -+.+| T Consensus 119 ~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-------------------~~~vm 179 (298) T protein:vir:94 119 AFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV-------------------TGIAI 179 (298) T ss_pred hhcccccCCCcccccccccccccccccccccccccccHHHHHHHHHHhhhhcCCCc-------------------cEEEE Confidence 2222110011 1 0 0111111111122223344678888887777655431 25889 Q ss_pred chhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEE Q lcl|NC_013692. 248 GSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFP 327 (399) Q Consensus 248 h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp 327 (399) ||.....|+.++|--+.|-|.+.. ..+.-|++-++.++.++.+. .+. ++++. T Consensus 180 n~~~~~~l~~lkd~~G~~l~~~~~--------~~~~~~tl~G~PV~~~~~v~----~~~------------~~~~~---- 231 (298) T protein:vir:94 180 NPSFRSALAKQKDLQGNALFPELK--------WGATPDTINGLPVDVNKTVS----DMS------------LTQRD---- 231 (298) T ss_pred cHHHHHHHHHhhccCCCeeecCcc--------cCCCCceecceeeEEecccc----ccc------------CCCcc---- Confidence 999999999998877777775432 34555788899999887532 110 01111 Q ss_pred EEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_013692. 328 MLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVA 397 (399) Q Consensus 328 ~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A 397 (399) .+++|.-+-+ .++.. +. +++-+.+-+.. .++...|-|++.++|+ +++++.+++++-+++|+-+= T Consensus 232 ~~~~Gdfs~~~~~~~~---~~--~~~~~~~~~~~-d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 232 RAIIGDFANGFKWGYA---KE--VPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEeeccceEEEEEe---cC--ceEEEeecCCC-cCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 3677765543 34443 22 23333333211 1222346788888887 57899999999999998777 No 78 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.46 E-value=2.8e-08 Score=62.03 Aligned_cols=283 Identities=12% Similarity=0.099 Sum_probs=158.8 Q ss_pred CCCccccccccccCCCCCC---cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANG---VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~---~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.+-..+.....+++...+ ..+.+-|. . +....+....+...+.+++...++..+ .+++.+.... T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~---~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~----- 156 (385) T protein:vir:19 89 WDGKQGTFGAKTFNKSLGSDADSAGSLIQP---M-QIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREEVF----- 156 (385) T ss_pred HHHhhccchhhHHHhhhccccccCCceecc---h-hhhHHHHHhhhccchhhhcceecccCc---ceEEEEEecC----- Confidence 1111111111122222111 11222232 1 246667777788888899988888643 4555444221 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ++.+.+.. +|+.+.-..+++..++.+++++|.++.+|+++.+ +.+ .|.+ T Consensus 157 ------~~~a~~v~-----------------------E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~ 205 (385) T protein:vir:19 157 ------TNNADVVA-----------------------EKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQS 205 (385) T ss_pred ------Ccceeeec-----------------------cCccccccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHH Confidence 11111211 1222233345778899999999999999999877 454 5767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce-------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT-------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~-------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) .|..+|.+.-+.-.+ ..+|++.++ ..+++ ............++++|..+...|+.+.... T Consensus 206 ~i~~~la~a~~~~~d----~~~l~G~g~~~~~~Gi~~~~~--~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~------- 272 (385) T protein:vir:19 206 YINNRLMYGLALKEE----GQLLNGDGTGDNLEGLNKVAT--AYDTSLNATGDTRADIIAHAIYQVTESEFSA------- 272 (385) T ss_pred HHHHHHHHHHHHHHH----HHHHhccCCCCcccccccccc--cccccccccccchHHHHHHHHHhhccccCCC------- Confidence 666666665444333 345543211 11221 1112223335578899999988886655431 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) =+.+|||....-|+.++|-.+.|=|-+ ...+..+.+-+.++++++.+. ++ T Consensus 273 ------------~~~~~~~~~~~~l~~lkd~~G~~l~~~---------~~~~~~~~l~G~pV~~~~~~p----~~----- 322 (385) T protein:vir:19 273 ------------SGIVLNPRDWHNIALLKDNEGRYIFGG---------PQAFTSNIMWGLPVVPTKAQA----AG----- 322 (385) T ss_pred ------------CEEEEcHHHHHHHHHhhcCCCceeccC---------cccCCCceecceeeEEcCcCC----CC----- Confidence 256889999999999987666555532 235566788899999988641 11 Q ss_pred cccccccccCccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhcc Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFR 386 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn 386 (399) .+++|.- +|. +..+ +.+.+ ..+. ...| +-+++.+.|+ +++++.+++ T Consensus 323 -----------------~~~~gd~~~~~~-~~~~---~~~~v-----~~~~---~~~~-~~~~~~~~~~~~~r~~~~v~~ 372 (385) T protein:vir:19 323 -----------------TFTVGGFDMASQ-VWDR---MDATV-----EVSR---EDRD-NFVKNMLTILCEERLALAHYR 372 (385) T ss_pred -----------------cEEEeecccEEE-EEEe---cceEE-----EEec---cccc-hhhcCcEEEEEEEeeccEEec Confidence 1445542 232 2211 11111 1111 1123 3467888877 478899999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) +.-+++++..+-= T Consensus 373 ~~a~~~~~~~aa~ 385 (385) T protein:vir:19 373 PTAIIKGTFSSGS 385 (385) T ss_pred ccceEEEEeccCC Confidence 9999999876655 No 79 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.46 E-value=2.8e-08 Score=62.03 Aligned_cols=283 Identities=12% Similarity=0.099 Sum_probs=158.8 Q ss_pred CCCccccccccccCCCCCC---cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANG---VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~---~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.+-..+.....+++...+ ..+.+-|. . +....+....+...+.+++...++..+ .+++.+.... T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~---~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~----- 156 (385) T protein:vir:18 89 WDGKQGTFGAKTFNKSLGSDADSAGSLIQP---M-QIPGIIMPGLRRLTIRDLLAQGRTSSN---ALEYVREEVF----- 156 (385) T ss_pred HHHhhccchhhHHHhhhccccccCCceecc---h-hhhHHHHHhhhccchhhhcceecccCc---ceEEEEEecC----- Confidence 1111111111122222111 11222232 1 246667777788888899988888643 4555444221 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ++.+.+.. +|+.+.-..+++..++.+++++|.++.+|+++.+ +.+ .|.+ T Consensus 157 ------~~~a~~v~-----------------------E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~-~l~~ 205 (385) T protein:vir:18 157 ------TNNADVVA-----------------------EKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DAP-MLQS 205 (385) T ss_pred ------Ccceeeec-----------------------cCccccccccceeEEEEeeeeEEEeehhhHHHHh-hHH-HHHH Confidence 11111211 1222233345778899999999999999999877 454 5767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce-------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT-------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~-------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) .|..+|.+.-+.-.+ ..+|++.++ ..+++ ............++++|..+...|+.+.... T Consensus 206 ~i~~~la~a~~~~~d----~~~l~G~g~~~~~~Gi~~~~~--~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~------- 272 (385) T protein:vir:18 206 YINNRLMYGLALKEE----GQLLNGDGTGDNLEGLNKVAT--AYDTSLNATGDTRADIIAHAIYQVTESEFSA------- 272 (385) T ss_pred HHHHHHHHHHHHHHH----HHHHhccCCCCcccccccccc--cccccccccccchHHHHHHHHHhhccccCCC------- Confidence 666666665444333 345543211 11221 1112223335578899999988886655431 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) =+.+|||....-|+.++|-.+.|=|-+ ...+..+.+-+.++++++.+. ++ T Consensus 273 ------------~~~~~~~~~~~~l~~lkd~~G~~l~~~---------~~~~~~~~l~G~pV~~~~~~p----~~----- 322 (385) T protein:vir:18 273 ------------SGIVLNPRDWHNIALLKDNEGRYIFGG---------PQAFTSNIMWGLPVVPTKAQA----AG----- 322 (385) T ss_pred ------------CEEEEcHHHHHHHHHhhcCCCceeccC---------cccCCCceecceeeEEcCcCC----CC----- Confidence 256889999999999987666555532 235566788899999988641 11 Q ss_pred cccccccccCccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhcc Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFR 386 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn 386 (399) .+++|.- +|. +..+ +.+.+ ..+. ...| +-+++.+.|+ +++++.+++ T Consensus 323 -----------------~~~~gd~~~~~~-~~~~---~~~~v-----~~~~---~~~~-~~~~~~~~~~~~~r~~~~v~~ 372 (385) T protein:vir:18 323 -----------------TFTVGGFDMASQ-VWDR---MDATV-----EVSR---EDRD-NFVKNMLTILCEERLALAHYR 372 (385) T ss_pred -----------------cEEEeecccEEE-EEEe---cceEE-----EEec---cccc-hhhcCcEEEEEEEeeccEEec Confidence 1445542 232 2211 11111 1111 1123 3467888877 478899999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) +.-+++++..+-= T Consensus 373 ~~a~~~~~~~aa~ 385 (385) T protein:vir:18 373 PTAIIKGTFSSGS 385 (385) T ss_pred ccceEEEEeccCC Confidence 9999999876655 No 80 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.45 E-value=3.6e-07 Score=55.89 Aligned_cols=278 Identities=14% Similarity=0.070 Sum_probs=138.9 Q ss_pred CCcccccccceehhhhhHHHHHHhhh-HHhhhhc------ccccccCc--CCCcEEEEEEccCCcCCCccccCCCCcchh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQL------ADTFSMPK--HYGKEIVRLHYIPLLDDRNVNDQGIDASGA 88 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~------a~~~~mPK--n~GktIkfrry~pl~~~~t~lteGV~p~g~ 88 (399) |+ ++.++.-+++-.|. .++....+ ..-|-|- ++...+-. ..|.+|.+=.|..|..+....++|-+-+.. T Consensus 1 MA-~T~lsd~i~peVf~-~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~ 78 (324) T protein:vir:59 1 MA-YTKISDVIVPELFN-PYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQ 78 (324) T ss_pred CC-ceeeeceechhHHH-HHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchh Confidence 43 23333333322332 23333222 3344333 33333322 248999988888885444455555433322 Q ss_pred hhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh-hhchhHHHHHHHHHHHHH Q lcl|NC_013692. 89 TIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF-DSDPAMEGHVTTEMVKGA 167 (399) Q Consensus 89 ~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t-~~D~~L~~~i~~el~~~~ 167 (399) +| +...-.+.+.++|.=.+.+|.+.+. .+|| ++++...+...- T Consensus 79 ~l----------------------------------~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp--~~~i~~q~a~~~ 122 (324) T protein:vir:59 79 KI----------------------------------NAGQDKAVLILRGNAWSSHDLAATLSGSDP--MQAIGSRVAAYW 122 (324) T ss_pred hc----------------------------------ccceeeEEEEeecCceeehhhhhhhccchH--HHHHHHHHHHHH Confidence 22 3345577788888888999976554 4454 345555444433 Q ss_pred HHHHHHHHHHHHHhcCceeeecc--cccccccc--CCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCee Q lcl|NC_013692. 168 NEITEDLLQIDLLNSAGTVRYPG--AATSDAEV--DATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNAR 243 (399) Q Consensus 168 ~~~t~d~l~~~~l~agt~V~YAg--~aTsra~v--~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~y 243 (399) .. ..+.++|+.-.- .|+. .++....+ +.+..+|.+.|-+|...|-.+... -. T Consensus 123 ~~----~~~~~lia~l~g-~~~~~~~~~~~~dvsa~~~~~~s~~~l~~A~~~~GD~~~~-------------------~~ 178 (324) T protein:vir:59 123 AR----EMQKIVFAELAG-VFSNDDMKDNKLDISGTADGIYSAETFVDASYKLGDHESL-------------------LT 178 (324) T ss_pred HH----HHHHHHHHHHHH-hhhccccccceeeeeccccceecHHHHHHHHHHhCCcccC-------------------cE Confidence 33 334444321110 0110 11111122 223568999999999877665432 37 Q ss_pred EEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccc Q lcl|NC_013692. 244 ALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKY 323 (399) Q Consensus 244 v~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~ 323 (399) +.+|||....+|+++ +++.-.+|.+. ..+||.+-+.|+|++..|- ...+.+.. T Consensus 179 ~ivmhS~v~~~L~~~-------~li~~~~~s~~----~~~i~~~~G~~VivdD~~p----------------~~~~~~~~ 231 (324) T protein:vir:59 179 AIGMHSATMASAVKQ-------DLIEFVKDSQS----GIRFPTYMNKRVIVDDSMP----------------VETLEDGT 231 (324) T ss_pred EEEEchHHHHHHHHh-------hhhhhcccccc----CceeeeecccEEEEeCCCC----------------ccccCCCC Confidence 999999999999975 34555566665 3589999999999987542 11223455 Q ss_pred eeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccc------hhhH-----HHHHHHHHHh-hccccceE Q lcl|NC_013692. 324 SVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYG------EMGF-----MSIKWYYGFM-VFRPEWIA 391 (399) Q Consensus 324 DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlg------Qrg~-----~gwK~~~~~~-iLn~~~m~ 391 (399) .+|-.+++|+.|++...-+ ..++.+ -..||.+ .|.+ -|+||--.+. -.++-. + T Consensus 232 ~~y~s~l~~~GAi~~~~~~---~~v~vE-----------~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~-~ 296 (324) T protein:vir:59 232 KVFTSYLFGAGALGYAEGQ---PEVPTE-----------TARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTD-E 296 (324) T ss_pred ceEEEEEEecCeEEEeecC---CCccee-----------cccCccccceEEEEeeEEEeEeeeEEecccccCCCCCCh-h Confidence 6999999999998864422 111111 1122321 1100 0111100000 001100 1 Q ss_pred EEEEecCC Q lcl|NC_013692. 392 LLKTVARL 399 (399) Q Consensus 392 ~iet~A~~ 399 (399) -|++++-- T Consensus 297 ~L~~~~NW 304 (324) T protein:vir:59 297 ELANGANW 304 (324) T ss_pred hhcCCccc Confidence 11111110 No 81 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.45 E-value=8.6e-08 Score=59.32 Aligned_cols=295 Identities=10% Similarity=0.006 Sum_probs=159.2 Q ss_pred cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhh Q lcl|NC_013692. 11 MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATI 90 (399) Q Consensus 11 ~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~ 90 (399) |.-+.++. .+-+-|+ .+ ..+.++.+++..++.+++.+.+|+.+ .+++-+... .|...++ T Consensus 1 Ma~~~~~~--gg~~vP~---~~-~~~ii~~l~~~s~i~~l~~~i~~~~~---~~~ip~~~~------------~~~a~wv 59 (315) T protein:vir:80 1 MADDFLSA--GKLELPG---SM-IGAVRDRAIDSGVLAKLSPEQPTIFG---PVKGAVFSG------------VPRAKIV 59 (315) T ss_pred CCCCcCCc--CceEcch---HH-HHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeC------------CcceEEe Confidence 33222222 1222232 23 46677778888899999999988754 244433221 1122333 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hc--hhHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SD--PAMEGHVTTEMVKGA 167 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D--~~L~~~i~~el~~~~ 167 (399) .. |+.+..-..++..++.+.++++.+..+|+++.... .| ..|.+.|..+|...- T Consensus 60 ~E-----------------------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai 116 (315) T protein:vir:80 60 GE-----------------------GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASI 116 (315) T ss_pred eC-----------------------CccccccccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHH Confidence 22 22222333578889999999999999999987432 22 335444444444333 Q ss_pred HHHHHHHHHHHHHhc---Cceeeecccccc----ccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 168 NEITEDLLQIDLLNS---AGTVRYPGAATS----DAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 168 ~~~t~d~l~~~~l~a---gt~V~YAg~aTs----ra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) +. .+-..+|++ ++.-.-.+-.+. ...+. ....++++|.++...|..+.... . T Consensus 117 ~~----~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~~~~~----------------~ 175 (315) T protein:vir:80 117 GR----AVDLIAFHGIDPATGKAASAVHTSLNKTKNIVD-ATDSATADLVKAVGLIAGAGLQV----------------P 175 (315) T ss_pred HH----HHhhheeeccCCCCCccccccccccccccceee-ccccchHHHHHHHHHHhhccCcc----------------c Confidence 33 223344433 111011110111 11111 12234677777775554432211 1 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) -.-+|||.....|+.|++.-+.|.|-.-. . ..+..|.-|++-+..++.++.|..-.+.+. . T Consensus 176 --~~~imn~~~~~~L~~l~~~~g~~~~g~~~---~-~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~-------------~ 236 (315) T protein:vir:80 176 --NGVALDPAFSFALSTEVYPKGSPLAGQPM---Y-PAAGFAGLDNWRGLNVGASSTVSGAPEMSP-------------A 236 (315) T ss_pred --eEEEEcHHHHHHHHHHhhccCCccccccc---c-cccccCCCceecceeeEecCcCCccccccc-------------c Confidence 23568999999999987644444442211 1 123455668899999998887632222111 1 Q ss_pred ccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEec- Q lcl|NC_013692. 321 GKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVA- 397 (399) Q Consensus 321 ~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A- 397 (399) ++. .+++|.-+...+++.. ...+++. .-+. .......|-|++...|| +.+++.+.+++-+++|+.++ T Consensus 237 ~~~----~~~~GDfs~~~~g~~~---~~~i~i~--~~~~-~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 237 SGV----KAIVGDFSRVHWGFQR---NFPIELI--EYGD-PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred ccc----EEEEeecccEEEEEec---CeeEEEe--cccc-ccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 222 2467877766666652 2223322 2221 11223347788888888 67899999999999999655 Q ss_pred CC Q lcl|NC_013692. 398 RL 399 (399) Q Consensus 398 ~~ 399 (399) |- T Consensus 307 ~~ 308 (315) T protein:vir:80 307 PK 308 (315) T ss_pred CC Confidence 55 No 82 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.44 E-value=3.1e-08 Score=61.78 Aligned_cols=297 Identities=11% Similarity=0.097 Sum_probs=163.4 Q ss_pred CCCc-----------cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEc Q lcl|NC_013692. 1 MAGP-----------VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHY 69 (399) Q Consensus 1 ~~~~-----------~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry 69 (399) |--+ -++++++.++......+.+-+.-+-+ .+..+.+..+.+.-.+.+++.+.+++. .++++.++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~-~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~ 76 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLN-DFTTPILQEVMENSKIMRLGKYEPMEG---TEKKFTFW 76 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceech-hHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEE Confidence 1111 01234444443222112221222222 345777777778888899998888773 35666554 Q ss_pred cCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 70 IPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 70 ~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) ..- +..++++.| +..-.-..++..++.+.+++|.+..+|+++.+. T Consensus 77 ~~~------------~~a~~v~Eg-----------------------~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d 121 (324) T protein:vir:99 77 ADK------------PGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEFLNY 121 (324) T ss_pred ecC------------cceeEeccC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHhc Confidence 321 222333222 111122346788999999999999999987774 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccc---cccccccCCcceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGA---ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~---aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) . +..|.+.|..+|.+.-+.-.+ ..+|++.++--..++ +..-.+......+++++|.++...|+.+... T Consensus 122 s-~~~l~~~i~~~l~~ai~~~~d----~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~---- 192 (324) T protein:vir:99 122 T-YSQFFEEMKPMIAEAFYKKFD----EAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELE---- 192 (324) T ss_pred c-hHHHHHHHHHHHHHHHHHHHH----HHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCC---- Confidence 3 445767666666665554333 344543332111100 1111111223568899999999888776432 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa 306 (399) ++ +.+|||.....|+.++|--+.+-|. .+.-+++-++.++.++.+. T Consensus 193 -------------~~--~~v~n~~~~~~L~~l~d~~g~~~~~------------~~~~~~l~G~PVv~~~~~~------- 238 (324) T protein:vir:99 193 -------------AN--AFISKTQNRSLLRKIVDPETKERIY------------DRNSDTLDGLPVVNLKSSN------- 238 (324) T ss_pred -------------CC--EEEEcHHHHHHHHHhhcCCCceeec------------CCCCccccceeEEeecCCC------- Confidence 12 2478999999999988755554442 1223567777777665210 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCC-cCCCCCCcc--chhhHHHHH--HHHH Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGE-ATADRSDPY--GEMGFMSIK--WYYG 381 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~-~tad~~DPl--gQrg~~gwK--~~~~ 381 (399) .++. .+++|.-+.-.+++. ++..+++.--.-.. .......++ -|++.+.|+ ++++ T Consensus 239 -------------~~~~----~~i~gd~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 298 (324) T protein:vir:99 239 -------------LKRG----ELITGDFDKLIYGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) T ss_pred -------------CCcc----eEEEEecccEEEEEe---cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEc Confidence 0111 256676655555554 22333332111100 001112222 467778887 6789 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_013692. 382 FMVFRPEWIALLKTVARL 399 (399) Q Consensus 382 ~~iLn~~~m~~iet~A~~ 399 (399) +.+++++-++.|..+.+. T Consensus 299 ~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 299 LHIADDKAFAKLVPADKK 316 (324) T ss_pred cEEecccceEEEEeccCC Confidence 999999999999887776 No 83 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.44 E-value=1.1e-07 Score=58.64 Aligned_cols=295 Identities=12% Similarity=0.108 Sum_probs=161.7 Q ss_pred CCCcc-----------ccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEc Q lcl|NC_013692. 1 MAGPV-----------DNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHY 69 (399) Q Consensus 1 ~~~~~-----------~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry 69 (399) |--+- ++++++.++.-....+++-+.-+-+ .+..+.++.+++..++.+++...+|+. .++++.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~ip~~ 76 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFW 76 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceech-hHHHHHHHHHHhhcchhhhcceeeccC---CceEEEEE Confidence 11110 1233344433221111222222222 235677777888888999999888873 44666554 Q ss_pred cCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 70 IPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 70 ~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) ..- |..++++.| +...--.+++..++.+.++++.+..+|+++.+. T Consensus 77 ~~~------------~~a~~v~Eg-----------------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d 121 (324) T protein:vir:97 77 ADK------------PGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEFLNY 121 (324) T ss_pred ecC------------cceeEeccC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHhc Confidence 321 122333222 111122347788999999999999999987763 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccc---cccccccCCcceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGA---ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~---aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) +.+.|...|..+|.+.-+...+ ..+|++.+.-.-.+. ...-.+......+++++|.++...|+.+... T Consensus 122 -s~~~l~~~i~~~l~~aia~~~d----~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~---- 192 (324) T protein:vir:97 122 -TYSQFFEEMKPMIAEAFYKKFD----EAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELE---- 192 (324) T ss_pred -chHHHHHHHHHHHHHHHHHHHH----HHhhccCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCC---- Confidence 3345666666666665554333 344433222111100 0011112223668999999999888775432 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa 306 (399) +++ .+|||.....|+.++|--+.+-|. .+--|.+-++.++.++.. T Consensus 193 -------------~~~--~v~n~~~~~~L~~lkd~~g~~~~~------------~~~~~tl~G~PV~~~~~~-------- 237 (324) T protein:vir:97 193 -------------ANA--FISKTQNRSLLRKIVDPETKERIY------------DRNSDTLDGLPVVNLKSS-------- 237 (324) T ss_pred -------------CCE--EEEcHHHHHHHHHhhcCCCceeec------------CCCCccccceeeEeecCC-------- Confidence 122 478999999999998765555442 223466778887766410 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEe---cCCCcCCCCCCc--cchhhHHHHH--HH Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITK---RPGEATADRSDP--YGEMGFMSIK--WY 379 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk---~pG~~tad~~DP--lgQrg~~gwK--~~ 379 (399) . .++. .+++|.-+...++... ++.+++.-- ..+. ....-+ +-|+..+.++ ++ T Consensus 238 --~----------~~~~----~~~~gd~~~~~i~~~~---~~~i~~~~~~~~~~~~--~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:97 238 --N----------LKRG----ELITGDFDKLIYGIPQ---LIEYKIDETAQLSTVK--NEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred --C----------CCcc----eEEEEecccEEEEEec---CcEEEEeecccccccc--cccccchhhhhcCcEEEEEEEE Confidence 0 0111 2567765555555542 223332211 1110 011112 2456666666 67 Q ss_pred HHHhhccccceEEEEEecCC Q lcl|NC_013692. 380 YGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 380 ~~~~iLn~~~m~~iet~A~~ 399 (399) +.+.+.+++-++.|+.+.+. T Consensus 297 ~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 297 VALHIADDKAFAKLVPADKK 316 (324) T ss_pred eccEEecccceEEEEeccCC Confidence 89999999999999988887 No 84 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=98.44 E-value=1.5e-07 Score=57.94 Aligned_cols=297 Identities=11% Similarity=0.113 Sum_probs=145.2 Q ss_pred CCcc-cccccceehhhhhHHHHHHhhhHHhhhhcccccc----cCcCCCcEEEEEEccCCcCCCcccc--CCCCcchhhh Q lcl|NC_013692. 18 NGVE-SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFS----MPKHYGKEIVRLHYIPLLDDRNVND--QGIDASGATI 90 (399) Q Consensus 18 ~~~~-~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~----mPKn~GktIkfrry~pl~~~~t~lt--eGV~p~g~~~ 90 (399) |..+ .+.-|| -|.+++|..-++.+|+.++....- ....+|.||++|+--++....-... .++++ T Consensus 1 MaN~llT~~p~----iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~----- 71 (423) T protein:vir:10 1 MPNNLDSNVSQ----IVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNK----- 71 (423) T ss_pred CccchhhhhHH----HHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCcccccccc----- Confidence 2211 111233 488999999999999988864422 2346899999987665543222111 11222 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeee-cceehhhhhhhhhhhchhHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKY-GFFREYTQEQLDFDSDPAMEGHVTTEMVKGANE 169 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~Qy-G~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~ 169 (399) +.|+| ..++.+|-|. .+-.+++|+-...+++ ++. +.+..|.. T Consensus 72 -------------------~dl~e------------~~v~l~id~~k~va~~v~d~E~~~~i~-----~~~-~~l~~A~~ 114 (423) T protein:vir:10 72 -------------------NNLIS------------GKATGRVGNYITVAVEYQQLEEAIKLN-----QLE-EILAPVRQ 114 (423) T ss_pred -------------------Ccccc------------ceeEEEeeceeeeeeeechHHHhcChh-----hHH-HHHHHHHH Confidence 23344 3445555433 3445677665544333 222 34555655 Q ss_pred HHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEech Q lcl|NC_013692. 170 ITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGS 249 (399) Q Consensus 170 ~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~ 249 (399) -..+.+-.++++-.....+...++..... =.++++..+.+.|+++++|+ ..|++++.| T Consensus 115 aLA~~vd~~ia~~~~~~~~~~~gt~~t~~-----~a~~~i~~a~~~Ld~~~vP~-----------------~~R~~Vv~p 172 (423) T protein:vir:10 115 RIVTDLETELAHFMMNNGALSLGSPNTPI-----TKWSDVAQTASFLKDLGVNE-----------------GENYAVMDP 172 (423) T ss_pred HHHHHHHHHHHHHHhhccccccccCCccc-----chHHHHHHHHHHHHhccCCc-----------------CCCEEEeCh Confidence 56666667776543333332222222211 13788999999999999985 248889999 Q ss_pred hhhHHHHHHhhhcCCCCceehhhcCCccccccccc-eeEcCeEEEecCccccccc-C--Ccc-cCCccccccc--ccC-- Q lcl|NC_013692. 250 DLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEV-GQLGRFRVIVNPQMMHWAG-V--GKA-VDPNDQVPMH--ESG-- 320 (399) Q Consensus 250 dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEI-G~i~~~RfV~~~~~~~~~~-a--Ga~-~~~~~~~~~~--~~~-- 320 (399) +....|.. ++.+....+-+..+.+-+|+| |++.+|++.++.++-.-.. . |.. ..++..++.. +++ T Consensus 173 ~~~a~Ll~------~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~ 246 (423) T protein:vir:10 173 WSAQRLAD------AQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQ 246 (423) T ss_pred HHHHHHhc------cccceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccce Confidence 99888763 344555555566777888887 9999999999988664311 1 100 0011111000 111 Q ss_pred -------ccceeEEEEEEccccceeccccc-----C-----CC-CCcceEEEecCCCcCCCC-CCccchhhHHHHHHHHH Q lcl|NC_013692. 321 -------GKYSVFPMLCVASEAFTTVGFAT-----D-----GK-NVKFKIITKRPGEATADR-SDPYGEMGFMSIKWYYG 381 (399) Q Consensus 321 -------~~~DVYp~lV~G~~Afg~v~l~~-----~-----g~-~~k~~~ivk~pG~~tad~-~DPlgQrg~~gwK~~~~ 381 (399) .....|..|..|. .|.--|++. . +. ...++..|..-.. ++. +| ...|.| . T Consensus 247 ~~~~~~~~~~~~~~~l~~GD-~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~--~~~~g~-------~tv~i~-p 315 (423) T protein:vir:10 247 FTVTLTGATASVTGFLKAGD-QVKFTNTYWLQQQTKQALYNGATPISFTATVTADAN--SDSGGD-------VTVTLS-G 315 (423) T ss_pred eeeeeeeccccccCceeecc-eEEecceeeecccccccccccccCcceEEEEEeeee--eccCCc-------eeeecc-C Confidence 1234466666665 443333221 0 00 0112333332210 000 00 000000 0 Q ss_pred Hhh---ccccceE---EEEEecCC Q lcl|NC_013692. 382 FMV---FRPEWIA---LLKTVARL 399 (399) Q Consensus 382 ~~i---Ln~~~m~---~iet~A~~ 399 (399) +.+ -+..+-. .....+.+ T Consensus 316 ~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:10 316 VPIYDTTNPQYNSVSRQVEAGDAV 339 (423) T ss_pred ccccccCCcccccccccccCCcee Confidence 000 0000000 00001111 No 85 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.43 E-value=7.5e-08 Score=59.63 Aligned_cols=287 Identities=11% Similarity=0.042 Sum_probs=157.2 Q ss_pred CCCccccccccccCCCCCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +..-+.......-+....+++ +.+-|+ .+..+.+....+...+.+++.+.+|+-+.|+...+++...-+. .. T Consensus 94 ~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~ 168 (397) T protein:vir:48 94 FKNLVRGRYQNLLDSKTDASGSDAGLTIPQ----DIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGL-AK 168 (397) T ss_pred HHHHHhhhhhHHHHHhhccCCccccccccH----HHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcc-ee Confidence 000000000000011111111 122333 4466677777788899999999999998887665543321111 11 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) -..|| ..+ |.. -..++..|+.+.++++.++.+|+++.. +++..|.. T Consensus 169 ~v~E~-----~~~-------------------~~~---------~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~ 214 (397) T protein:vir:48 169 LDDEA-----GSI-------------------GTN---------DDPKLYPIRYAIKRYAGISTVTNSLLA-DSAENILA 214 (397) T ss_pred eeccc-----ccc-------------------ccc---------cccceeeEEeeheeeeeehhhHHHHHh-hchHHHHH Confidence 11111 110 100 113677899999999999999998876 34555666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) .|..+|.+.-+. .+...+|++-++ +. ..+...++++|.++...|+..-.+ T Consensus 215 ~v~~~l~~~~~~----~~d~~il~G~g~------~~-----~~~~~~~~d~i~~~~~~l~~~~~~--------------- 264 (397) T protein:vir:48 215 WLSGWIAKKVVV----TRNKAILEAIAT------LP-----TKPTLTKWDDIIDLQAKVDPAIKQ--------------- 264 (397) T ss_pred HHHHHHHHHHHH----HHHHHHhhcccc------cc-----cccccccHHHHHHHHHHhhhhhcC--------------- Confidence 666665555444 233344533211 11 123567899999988888754332 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccc Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMH 317 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~ 317 (399) .-+.+|||.+...|+.|+|-.+.|-|.+ .+..|.-+.+-|+.++.++.. +...++ T Consensus 265 ----~a~~v~n~~~~~~L~~lkd~~G~~i~~~--------~~~~~~~~~l~G~PV~~~~~~--~~~~~~----------- 319 (397) T protein:vir:48 265 ----TSFFLTNTSGFTALKKVKNAFGDYLMER--------DVKSPTGYSIDGFAVKEVADR--WLANAS----------- 319 (397) T ss_pred ----CCEEEECHHHHHHHHHhhcCCCceeecc--------CcCCCCCceeccceeEEeccc--ccCCcC----------- Confidence 1244689999999999988777776643 234456678888887766521 111111 Q ss_pred ccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEE Q lcl|NC_013692. 318 ESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLK 394 (399) Q Consensus 318 ~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ie 394 (399) .+.. .+++|.-+ +-.+..+ +.+.+ .+..- .+-+-+.+.+.|+ +++.+.+++++-++.++ T Consensus 320 -----~~~~-~~~~gd~~~~~~~~~~---~~~~i--~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 381 (397) T protein:vir:48 320 -----SGAM-PLYFGDLKQAVTLFDR---QQMSL--LSTNI-------GGGAFETDTTKIRVIDRFDVVATDTESFVPAS 381 (397) T ss_pred -----CCce-EEEEEeccceEEEEee---cceEE--EEecc-------chhhhhcCceeEEEEeeeccEEecccceEEEE Confidence 0111 24466432 1122222 11222 22111 2335566777776 56788999999998888 Q ss_pred EecCC Q lcl|NC_013692. 395 TVARL 399 (399) Q Consensus 395 t~A~~ 399 (399) ..+.- T Consensus 382 ~~~~~ 386 (397) T protein:vir:48 382 FKAIA 386 (397) T ss_pred ecccc Confidence 77665 No 86 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.42 E-value=1.4e-07 Score=58.18 Aligned_cols=283 Identities=13% Similarity=0.119 Sum_probs=159.4 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ||. .+ ++.+.+-|+ .+ ..+.++.+.+.-++.+++...+||.+.. ++-++.. T Consensus 1 ma~------------~t-~~~G~lip~---~~-~~~ii~~l~~~s~i~~l~~~~~~~~~~~---~~p~~~~--------- 51 (300) T protein:vir:95 1 MSE------------AQ-LSKGNLFNP---EL-VTKVINKVKGHSSIAKLSPQKPIPFNGQ---REFVFDF--------- 51 (300) T ss_pred Ccc------------cc-cCCcceech---hh-HHHHHHHHHhhhhhhhhcceeeccCCce---EEEEEec--------- Confidence 321 11 123444444 23 5778888888888999999999988632 3333221 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc--hhHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD--PAMEGH 158 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D--~~L~~~ 158 (399) + |...++..| +....-.+++..++.+.++++.+..+|+++.....| +.|.+. T Consensus 52 -~--~~a~wv~Eg-----------------------~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~ 105 (300) T protein:vir:95 52 -D--SDIDIVAEN-----------------------GKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTD 105 (300) T ss_pred -C--cceEEeeCC-----------------------cccccccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHH Confidence 1 122233222 122223357788999999999999999998864433 456565 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc-------Ccee----eeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNS-------AGTV----RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~a-------gt~V----~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) |..+|.+.-+. .+-..+|++ ++.. .+++..+ ..+......++++|.++...|...+.. T Consensus 106 i~~~l~~aia~----~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~~~~~----- 174 (300) T protein:vir:95 106 FVEGFSKKLAR----GLDIMSIHGINPRTKQASTIIGDNCFDKKVT--QTVPFKDTNPDESMEDAVGMIDGSERD----- 174 (300) T ss_pred HHHHHHHHHHH----HHHHhhhhcccCCCCCCcccccccccccccc--eeecccccchHHHHHHHHHHhhhcCCC----- Confidence 55555555444 333344433 1111 1121111 222233557788899998877765442 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) ++ +.++||.....|+.|+|-.+.|=|.+. ...+.-+++-+++++.++.+.. ++ T Consensus 175 ------------~~--~~vmn~~~~~~L~~lkd~~G~~i~~~~--------~~~~~~~~l~G~Pv~~s~~v~~----~~- 227 (300) T protein:vir:95 175 ------------IT--GAILDPIFTTALSKMKNAEGGKLYPEL--------AWGGVPDAINGLAVDKNRTVSY----SQ- 227 (300) T ss_pred ------------cc--EEEECHHHHHHHHHhhccCCCeeccCc--------cccCCCceecceeeEEecCCCC----CC- Confidence 12 367899999999999875555544322 1235668899999998886421 11 Q ss_pred cCCcccccccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCC-CCCccchhhHHHHH--HHHHHh Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATAD-RSDPYGEMGFMSIK--WYYGFM 383 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad-~~DPlgQrg~~gwK--~~~~~~ 383 (399) +..+ .++++|.=+-+ .++++ +.+.+++ ..-+. .| +.--|-|+.-++++ +.+++. T Consensus 228 -----------~~~~----~~~~~GDf~~~~~~~~~---~~~~~~v--~~~~~--~d~~~~~~f~~~~v~~r~~~r~d~~ 285 (300) T protein:vir:95 228 -----------TDPK----NTAIVGDFETMFKWGYA---KEVPMEI--IKYGD--PDNSGRDLKGYNQIYIRCEAYIGWG 285 (300) T ss_pred -----------CCCc----cEEEEeeccceEEEEEe---cccEEEE--eeccC--CCCcchhhhhcCcEEEEEEEeecce Confidence 0111 13455652211 23333 1222322 22221 11 11124566667776 367889 Q ss_pred hccccceEEEEEecC Q lcl|NC_013692. 384 VFRPEWIALLKTVAR 398 (399) Q Consensus 384 iLn~~~m~~iet~A~ 398 (399) +++++.+++|+-+|= T Consensus 286 v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 286 IMDAASFARIVKTGG 300 (300) T ss_pred eecccceEEEecCCC Confidence 999999999988888 No 87 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.41 E-value=6e-08 Score=60.18 Aligned_cols=302 Identities=15% Similarity=0.101 Sum_probs=159.6 Q ss_pred CCCc-cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGP-VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~-~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) ++|- ++-.+-..-. + ++++-+.-+-+. +..+.++.+.+...+.+++...+|+.+ ++++-++.. T Consensus 2 ~~~~~~~~~~~~~~~--t--~~~~~~~~ip~~-~~~~ii~~~~~~s~l~~~~~~~~~~~~---~~~~p~~~~-------- 65 (320) T protein:vir:10 2 AAGTAFQVDHAQIAQ--T--GDTMFKGYLEPE-QAKDYFAEAEKTSIVQQFAQKVPMGTT---GQKIPHWIG-------- 65 (320) T ss_pred CCCccCCHHHHHhhc--c--ccccccccccHH-HHHHHHHHHHhccchhhhcceeeccCC---ceEEEEEeC-------- Confidence 3332 2211111111 1 111112222222 357788888888889999999988743 345433321 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) + +...+.+.| +..-....+++.++.+.+++|.+..+|+++.+ ++++.|.+.+ T Consensus 66 --~--~~a~~v~E~-----------------------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i 117 (320) T protein:vir:10 66 --D--VSAQWIGEG-----------------------DMKPITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTM 117 (320) T ss_pred --C--cceEEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHH Confidence 1 111222221 11112234678899999999999999999877 4455677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCcee---eecc-----ccccccccCCcceecHH-HHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTV---RYPG-----AATSDAEVDATTEVTYD-SLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V---~YAg-----~aTsra~v~~~~~vt~~-~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) ..+|.+.-+...+.. +|++-++. .-++ ........+.......+ ++..+.-.|+..... T Consensus 118 ~~~l~~a~a~~~d~a----~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 185 (320) T protein:vir:10 118 RTKVATAFAMAFDSA----ALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK-------- 185 (320) T ss_pred HHHHHHHHHHHHHHH----hhcccCCCCCcccccccccccceecccccccccccHHHHHHHHHhhhhcccCC-------- Confidence 777776666543333 44332210 0010 01111122222222222 344444444433322 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) .-+.+|||.....|+.++|-.+.+=|.+...-+....+. -+.+-++.++.++.+. ++ T Consensus 186 -----------~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~---~~~i~g~pv~~~~~~~----~~----- 242 (320) T protein:vir:10 186 -----------WTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFR---AGRIVSRPTILSDHVA----DG----- 242 (320) T ss_pred -----------CcEEEEcHHHHHHHHHhhccCCceeeccccccCcccccc---CceeeeeeeEecCCCC----CC----- Confidence 125678999999999999877777676655555544432 2456778888776421 11 Q ss_pred cccccccccCccceeEEEEEEccccceecccccCCCCCcceEE---EecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKII---TKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~i---vk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) + ..+++|.-+...+++.+ .+.+++. ...-|.......--+-|++.+.|+ +++++.++ T Consensus 243 -----------~----~~~~~gd~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~ 304 (320) T protein:vir:10 243 -----------T----TVGYMGDFRNVIWGQVG---GLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNN 304 (320) T ss_pred -----------c----eEEEEeecceEEEEEec---CeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEe Confidence 1 12456665554455542 2222221 111110000111123567777877 78999999 Q ss_pred cccceEEEE-EecCC Q lcl|NC_013692. 386 RPEWIALLK-TVARL 399 (399) Q Consensus 386 n~~~m~~ie-t~A~~ 399 (399) +++-.++|+ .+||= T Consensus 305 ~~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 305 DKDAFVKLTNVVTPD 319 (320) T ss_pred cccceEEEEeccCCC Confidence 999999998 56666 No 88 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.41 E-value=9.2e-08 Score=59.16 Aligned_cols=300 Identities=13% Similarity=0.080 Sum_probs=159.7 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ||. . ++ ..+.+-|+ .| .++.++.+++..++.+++...+|+.+. +++-+... T Consensus 1 Mat----~-------tt--~~g~~vP~---~~-~~~ii~~~~~~s~l~~~~~~i~~~~~~---~~~p~~~~--------- 51 (311) T protein:vir:99 1 MAT----F-------GT--GNLKNLPR---NI-ADGMVKDVVQGSTVAVLSARKPQRFGN---EDIITFNG--------- 51 (311) T ss_pred Cce----e-------cC--CCceeccH---HH-HHHHHHHHHhhchhhhhcceeeccCCc---eEEEEEeC--------- Confidence 221 0 01 11122243 34 577888888999999999998888543 23322211 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc--hhHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD--PAMEGH 158 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D--~~L~~~ 158 (399) .|...++.. |+....-..++..++.+.++++.+..+|+++....+| ..|.+. T Consensus 52 ---~~~a~wv~E-----------------------g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~ 105 (311) T protein:vir:99 52 ---RPKAEFVGE-----------------------GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQT 105 (311) T ss_pred ---CceeEEeec-----------------------CcccccccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHH Confidence 122233322 2333334457788999999999999999999865444 446666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh-cCceee----eccccccccccCCccee-cHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLN-SAGTVR----YPGAATSDAEVDATTEV-TYDSLMRLRLDLDNARAPTKIKMITGTR 232 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~-agt~V~----YAg~aTsra~v~~~~~v-t~~~lr~a~~~Lk~nrApk~T~ii~~s~ 232 (399) |..+|.+.-+.-.+..+..+.=. .|+... +.+.++...+....... ..+++..+...+..+++.. T Consensus 106 i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--------- 176 (311) T protein:vir:99 106 LSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDLAIEAAVGLLVANGHPT--------- 176 (311) T ss_pred HHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHHHHHHHHHHHhhhccCC--------- Confidence 66666655555334333322110 022211 11122222223222222 3455666665555555431 Q ss_pred ccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcc Q lcl|NC_013692. 233 MIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPND 312 (399) Q Consensus 233 ~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~ 312 (399) +.+ ..++||.....|+.|+|-.+.|=|.+. ...++.|++.+++++.+..+..-.........- T Consensus 177 -------~~~-~~vmn~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~- 239 (311) T protein:vir:99 177 -------PVN-GLALHPSIAWGLSTARYTDGRKKFPEL--------GLGIGVSSFEGIDASVSDTVNGGDEADPDDEDL- 239 (311) T ss_pred -------Ccc-EEEEcHHHHHHHHhhhccCCCeeecCc--------ccCCCCceecceeeEeecccccccccccccchh- Confidence 111 257899999999999876666666432 234566889999999988654222221111100 Q ss_pred cccccccCccceeEEEEEEccccc-eecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_013692. 313 QVPMHESGGKYSVFPMLCVASEAF-TTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEW 389 (399) Q Consensus 313 ~~~~~~~~~~~DVYp~lV~G~~Af-g~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) . .+. +..+++|.-+= -.++.. +...+++. +-+. .+..=.|-|+....+| +++++.++++++ T Consensus 240 ----~-~~~----~~~~~~Gdf~~~~~~~~~---~~~~~~~~--~~~~--~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~ 303 (311) T protein:vir:99 240 ----D-AAR----AVRGIVGDFANGIHWGVQ---RDIPVELI--KYGD--PDGQGDLKRHNQIALRLEIVYGWYVFTDRF 303 (311) T ss_pred ----h-ccC----cceEEEeeccccEEEEEe---cCceEEEe--ecCC--CCcchhhhhcCcEEEEEEEeecceecChhH Confidence 0 011 12345565221 112222 11122222 1111 1111134667777776 788899999999 Q ss_pred eEEEEEec Q lcl|NC_013692. 390 IALLKTVA 397 (399) Q Consensus 390 m~~iet~A 397 (399) .+..+.+| T Consensus 304 v~~~~~~A 311 (311) T protein:vir:99 304 VVIENAVA 311 (311) T ss_pred eeeecccC Confidence 99888888 No 89 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.39 E-value=2.8e-08 Score=61.98 Aligned_cols=287 Identities=16% Similarity=0.185 Sum_probs=157.6 Q ss_pred CCCccccccc-cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKP-MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~-~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) +.+-+.+-.. ..-+..+.+..+-+-|+ -+..+.++.+++..++.+++.+.+++.+.++-.+. . T Consensus 117 f~~~l~~~e~~~al~~~t~~~gG~lvP~----~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~------~------ 180 (425) T protein:vir:10 117 FKAHVKRGDVQAALNKGEDSEGGYLTPI----EWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFN------M------ 180 (425) T ss_pred HHHHhhhhhhHHHhhcCcCCCCceeccH----hHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEE------c------ Confidence 0000000000 00111121111224453 23566777777888999999999888665433221 0 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeec-cceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRV-GFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~-~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) .+..+ .+...| +.+--. ..++..|+.+.++++.++.+|+++.+ ++++.|.+. T Consensus 181 -~~~~a--~wv~E~-----------------------~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~-ds~~~l~~~ 233 (425) T protein:vir:10 181 -GGTTS--GWVGEA-----------------------SQRPQTNAATFQPLSFASGEIYANPAATQQILD-DAEIDLESW 233 (425) T ss_pred -CCcce--eeeccc-----------------------cccccccccccceeeeeheeeEeehHhHHHHHh-cchhHHHHH Confidence 11110 111111 110001 12567899999999999999999887 445567666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCce------eeeccccccc----------cccCCcceecHHHHHHHHHHHHhccCc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGT------VRYPGAATSD----------AEVDATTEVTYDSLMRLRLDLDNARAP 222 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~aTsr----------a~v~~~~~vt~~~lr~a~~~Lk~nrAp 222 (399) |..+|...-+...+ ..+|++-++ +.+...++.. ..-.....+++++|..+...|+..-.. T Consensus 234 i~~~la~ai~~~~d----~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~ 309 (425) T protein:vir:10 234 LATEVQTEFAKQEG----KAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTG 309 (425) T ss_pred HHHHHHHHHHHHHH----hhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhcc Confidence 66666655554322 344443211 1111111110 001123568888888888777644321 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) . -+-+|||....-|+.|+|--+.|=|.|-. -.|.-+++-|..++.++.| +.. T Consensus 310 ------------------~-a~~vmn~~~~~~L~~lkD~~G~~l~~~~~--------~~g~~~~l~G~PV~~~~~~-p~~ 361 (425) T protein:vir:10 310 ------------------N-ARFAMNRNTQRQVRKLKDGQGNYLWQPSY--------VAGQPATLAGYPVTEVPDM-PDV 361 (425) T ss_pred ------------------C-CEEEEchHHHHHHHHhhcCCCceeeccCc--------cCCCCceecceeeEEecCc-CCc Confidence 1 13479999999999999877777776532 2345567778888888764 322 Q ss_pred cCCcccCCcccccccccCccceeEEEEEEcccc--ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH-- Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSVFPMLCVASEA--FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW-- 378 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~A--fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~-- 378 (399) +++ .. .++||.-+ |-...- .+ +++ ..|||.+.+.++|++ T Consensus 362 ~~~-------------------~~-~i~~Gd~~~~~~i~~~--~~----~~v-----------~~d~~~~~~~~~~~~~~ 404 (425) T protein:vir:10 362 AAN-------------------ST-PILFGDFQQTYLIIDR--IG----VRV-----------LRDPYTAKPYVLFYTTK 404 (425) T ss_pred cCC-------------------cc-EEEEEehhccEEEEEe--cc----eEE-----------EecccccCCcEEEEEEE Confidence 211 11 24557433 221111 11 221 257788888888884 Q ss_pred HHHHhhccccceEEEEEecCC Q lcl|NC_013692. 379 YYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 379 ~~~~~iLn~~~m~~iet~A~~ 399 (399) ++.+.+++++-++.|+++|.= T Consensus 405 r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 405 RVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EeccEeecccceEEEEeeccC Confidence 488999999999999988777 No 90 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.39 E-value=1e-07 Score=58.89 Aligned_cols=280 Identities=11% Similarity=0.090 Sum_probs=158.6 Q ss_pred CCcc-cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhc Q lcl|NC_013692. 18 NGVE-SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLY 96 (399) Q Consensus 18 ~~~~-~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~ 96 (399) |.+. +.+-|. .+ ..+.++.+++.-.+.+++.+.+|+.+. +++-+...- |...++..| T Consensus 1 ma~~gG~lvp~---~~-~~~ii~~~~~~s~i~~l~~~~~~~~~~---~~ip~~~~~------------~~a~~v~E~--- 58 (298) T protein:vir:16 1 MVLNKGTLFDP---TL-VTDLISKVAGKSSIARLSAQKPIPFNG---EKVFTFTMD------------SEIDVVAES--- 58 (298) T ss_pred CcccCcceech---hH-HHHHHHHHHhhhhhhhhcceeeccCCc---eEEEEEecC------------cceEEecCC--- Confidence 3322 334453 23 467777788888999999999887533 333332211 222233222 Q ss_pred cccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 97 GSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD--PAMEGHVTTEMVKGANEITEDL 174 (399) Q Consensus 97 ~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D--~~L~~~i~~el~~~~~~~t~d~ 174 (399) +..-.-.+++..++.+.++++.+..+|+++.....| ..|.+.|..+|.+.-+. . T Consensus 59 --------------------~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~----~ 114 (298) T protein:vir:16 59 --------------------GKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVAR----G 114 (298) T ss_pred --------------------ccccccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHH----H Confidence 111112346788999999999999999999864444 34555554444444333 3 Q ss_pred HHHHHHhc---Cce-----e---eeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCee Q lcl|NC_013692. 175 LQIDLLNS---AGT-----V---RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNAR 243 (399) Q Consensus 175 l~~~~l~a---gt~-----V---~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~y 243 (399) +...+|++ ++. + ..++..+..........-.+++|.++...|..++.+. + T Consensus 115 ~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-----------------~-- 175 (298) T protein:vir:16 115 IDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV-----------------T-- 175 (298) T ss_pred HHHHhhccccCCCCcccccccccccccccccccccccccccHHHHHHHHHHHhhhcCCCc-----------------c-- Confidence 34445543 111 0 1111111222222223334678888887777765531 1 Q ss_pred EEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccc Q lcl|NC_013692. 244 ALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKY 323 (399) Q Consensus 244 v~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~ 323 (399) ..+|||.....|+.|+|--+.|-|.+. ...+.-|++-+..++.++.+....+ .++. T Consensus 176 ~~vmn~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~v~~~~~----------------~~~~ 231 (298) T protein:vir:16 176 GIAINPSFRSALAKQKDLQDNALFPEL--------KWGATPDTINGLPVDVNKTVSDMSL----------------TQRD 231 (298) T ss_pred EEEEcHHHHHHHHHhhccCCCeeecCc--------ccCCCCceecceeeEEecccccccC----------------CCcc Confidence 366799999999999887777777543 2344557888999998875432111 1121 Q ss_pred eeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCC-CCccchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_013692. 324 SVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADR-SDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVA 397 (399) Q Consensus 324 DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~-~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A 397 (399) .+++|.-+-+ .++.. +.+.+++ ..-+. +|. .-=|-|++.++|+ +++.+.+++++-+++|+-+= T Consensus 232 ----~~~~GDfs~~~~~~~~---~~~~~~~--~~~~~--~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 232 ----RAIIGDFANGFKWGYA---KEVPLEV--IQYGD--PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ----EEEEeeccceEEEEEe---cCceEEE--eeccC--CcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 3667875432 34444 2223332 22221 110 0114466677777 47889999999999998776 No 91 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.36 E-value=9.8e-08 Score=59.00 Aligned_cols=306 Identities=13% Similarity=0.073 Sum_probs=155.6 Q ss_pred CC-Ccc-ccccccccCCCCC-CcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MA-GPV-DNIKPMKYNDPAN-GVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~-~~~-~~~~~~~~n~~~~-~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) |+ +|= ...-......-.. +++++-+.-+.+.+ ..+.++.+++...+.+++...+|+.+ ++++-+... T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~-~~~ii~~~~~~s~i~~~~~~~~~~~~---~~~~p~~~~------ 70 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQ-AQDYFAEAEKISIVQQFAQKIPMGTT---GQKIPHWTG------ 70 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhh-HHHHHHHHHhcchhhhhcceeeccCC---ceEEEEEeC------ Confidence 21 110 0000000000011 11111111222223 56677778888888899999998854 344433221 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) + |...+... |+.......++..++.+.+++|.++.+|+++.+ +++..|.. T Consensus 71 ----~--~~a~~v~E-----------------------g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~-~s~~~~~~ 120 (326) T protein:vir:42 71 ----D--VSASWIGE-----------------------GDMKPITKGNMTSQTIAPHKIATIFVASAETVR-ANPANYLG 120 (326) T ss_pred ----C--cceEEecC-----------------------CccccccccceeEEEEeeEEEEEeehhhHHHHh-cCHHHHHH Confidence 1 11222221 222223345778899999999999999998877 44556777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeee------cccc--ccccccCCcceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRY------PGAA--TSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMIT 229 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~Y------Ag~a--Tsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~ 229 (399) .|..+|.+.-+.-.++. +|++-++-.- +... +..........++..+++.+.......... T Consensus 121 ~i~~~l~~a~~~~~d~a----~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 189 (326) T protein:vir:42 121 TMRTKVATAFAMAFDNA----AINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAG------- 189 (326) T ss_pred HHHHHHHHHHHHHHHHH----hhcccCCCccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhc------- Confidence 77777766555533333 3432221000 0000 000000111223444433222111111111 Q ss_pred cccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccC Q lcl|NC_013692. 230 GTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVD 309 (399) Q Consensus 230 ~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~ 309 (399) ...-+-+|||.....|+.|+|-.+.|=|.+...-+....+ ..|.+-++.++.++.+ +++. T Consensus 190 ----------~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~---~~~~l~G~pv~~~~~~----~~~~--- 249 (326) T protein:vir:42 190 ----------KKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPF---RLGRIVARPTILSDHV----ASGT--- 249 (326) T ss_pred ----------cCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccc---cCceeeeeeEEEcCCC----CCCc--- Confidence 0112346899999999999998888888876554444332 3578888999888743 2211 Q ss_pred CcccccccccCccceeEEEEEEccccceecccccCCCCCcceE---EEecCCCcCCCCCCcc--chhhHHHHH--HHHHH Q lcl|NC_013692. 310 PNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKI---ITKRPGEATADRSDPY--GEMGFMSIK--WYYGF 382 (399) Q Consensus 310 ~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~---ivk~pG~~tad~~DPl--gQrg~~gwK--~~~~~ 382 (399) .++++|.=+...++... .+.+++ ....-| +.+.++|+ -|+....|| +++.+ T Consensus 250 -----------------~~~~~Gd~s~~~~~~~~---~~~v~~~~e~~~~~~--~~~~~~~~~~~~~d~~~~r~~~~~d~ 307 (326) T protein:vir:42 250 -----------------VVGYQGDFRQLVWGQVG---GLSFDVTDQATLNLG--TPQAPNFVSLWQHNLVAVRVEAEYAF 307 (326) T ss_pred -----------------eEEEEeecceEEEEEec---ceEEEEeecceeeec--ccccccchhhhhcCcEEEEEEEEecc Confidence 12344543333333321 111211 111111 12233443 466777776 78899 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_013692. 383 MVFRPEWIALLKTVARL 399 (399) Q Consensus 383 ~iLn~~~m~~iet~A~~ 399 (399) .+.+++-+++|+.++-- T Consensus 308 ~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 308 HCNDKDAFVKLTNVDAT 324 (326) T ss_pred EEecccceEEEeecccc Confidence 99999999998877666 No 92 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.34 E-value=8.5e-08 Score=59.35 Aligned_cols=297 Identities=12% Similarity=0.105 Sum_probs=159.3 Q ss_pred CCCcc-----------ccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEc Q lcl|NC_013692. 1 MAGPV-----------DNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHY 69 (399) Q Consensus 1 ~~~~~-----------~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry 69 (399) |--+= +.++++.++......+.+-+.-+-+ -+..+.+....+.-.+.+++...+|+. .++++-+. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~-~~~~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~ 76 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLN-DFTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFW 76 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceech-hHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEE Confidence 11110 1123333332221112211222222 335777777778888899998888773 34555443 Q ss_pred cCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 70 IPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 70 ~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) ..- +...+++.| +..-.-..++..++.+.++++.+..+|+++.+. T Consensus 77 ~~~------------~~a~~v~Eg-----------------------~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d 121 (324) T protein:vir:10 77 ADK------------PGAYWVGEG-----------------------QKIETSKATWVNATMRAFKLGVILPVTKEFLNY 121 (324) T ss_pred eCC------------cceeEeccC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHhc Confidence 321 112232222 111122346788999999999999999987763 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ccc--cccccccCCcceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRY-PGA--ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~Y-Ag~--aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) ++..|...|..+|.+.-+.-.+ ..+|++.+.-.. .+- ++.-.+......+++++|.++.-.|+.+... T Consensus 122 -s~~~l~~~i~~~l~~ai~~~~d----~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~---- 192 (324) T protein:vir:10 122 -TYSQFFEEMKPMIAEAFYKKFD----EAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELE---- 192 (324) T ss_pred -chHHHHHHHHHHHHHHHHHHHH----HHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCC---- Confidence 3445666666666655554333 344433322110 100 1111111223568999999999888776432 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa 306 (399) +++ .++||.....|+.++|--+.|-|.+ +.-+++-++.++.++.+ T Consensus 193 -------------~~~--~v~n~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G~PV~~~~~~-------- 237 (324) T protein:vir:10 193 -------------ANA--FISKTQNRSLLRKIVDPETKERIYD------------RNSDTLDGLPVVNLKSS-------- 237 (324) T ss_pred -------------CCE--EEEcHHHHHHHHHhhccCCceeecC------------CCCccccceeEEeecCC-------- Confidence 122 4789999999999987655554421 23356677777665421 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCC-cCCCCCCc--cchhhHHHHH--HHHH Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGE-ATADRSDP--YGEMGFMSIK--WYYG 381 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~-~tad~~DP--lgQrg~~gwK--~~~~ 381 (399) . .++. .+++|.-+.-.+++. ++..+++..-.-.. .......+ +-|++.+.|+ ++++ T Consensus 238 --~----------~~~~----~~~~gd~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d 298 (324) T protein:vir:10 238 --N----------LKRG----ELITGDFDKLIYGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) T ss_pred --C----------CCcc----eEEEEecccEEEEEe---cCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEc Confidence 0 0111 245665554445443 22333333211100 00011122 2467778887 6788 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_013692. 382 FMVFRPEWIALLKTVARL 399 (399) Q Consensus 382 ~~iLn~~~m~~iet~A~~ 399 (399) +.+++++-+++|.-+.+. T Consensus 299 ~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 299 LHIADDKAFAKLVPADKK 316 (324) T ss_pred cEEecccceEEEEeccCC Confidence 999999999999887777 No 93 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.33 E-value=7.5e-08 Score=59.66 Aligned_cols=285 Identities=13% Similarity=0.080 Sum_probs=151.7 Q ss_pred CCCccccccc-cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKP-MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~-~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) -......... ...+...++++++-+.-+-+ .+.+..+....+...+.+++...+|+.+ ++++.+..... T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~-~~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~------ 166 (390) T protein:vir:97 97 DRSARATMNIKAALNTASTDAAGSAGALTTP-NRLPGFITPPDARLTVRDLIGSGRTDSA---LIEYVQETGFV------ 166 (390) T ss_pred hhhhhhhhHHHHHHHhhhcccccccccccch-hhhHHHHHHHhhhhhhHhhcceeeccCC---ceEEEEEecCC------ Confidence 0000000000 01111111112221111111 2345666667777788888988888743 34443332211 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +...+... |+.+.--..++..++.++++++.+..+|+++.. +++ .|...| T Consensus 167 -----~~a~~v~E-----------------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-ds~-~l~~~i 216 (390) T protein:vir:97 167 -----NNAAIVAE-----------------------GALKPESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QLASYM 216 (390) T ss_pred -----cceeeecC-----------------------CccccccccceeEEEEeeeeEEEeehhhHHHHH-hHH-HHHHHH Confidence 11112111 111122234778899999999999999999876 454 476767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc-Cceee----eccccccccccCCcceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNS-AGTVR----YPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMI 234 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~a-gt~V~----YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~ 234 (399) ..+|.+.-+.-.+ ..+|++ |++.. .................++++|..+.-.|+....+. T Consensus 217 ~~~la~a~~~~~d----~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~----------- 281 (390) T protein:vir:97 217 NNRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPA----------- 281 (390) T ss_pred HHHHHHHHHHHHH----HHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCC----------- Confidence 6666665555333 344533 22211 110111111112234567788888776666554431 Q ss_pred CcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccc Q lcl|NC_013692. 235 DTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQV 314 (399) Q Consensus 235 gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~ 314 (399) =+.+|||.....|+.|+|--+.|=|-+. ..+.-+++-|..+++++.|. +| T Consensus 282 --------~~~v~n~~~~~~L~~lkd~~G~~l~~~~---------~~~~~~~l~G~pV~~~~~~~----~~--------- 331 (390) T protein:vir:97 282 --------SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWGLPVVATQAMA----PG--------- 331 (390) T ss_pred --------CEEEEcHHHHHHHHHhhcCCCceeecCc---------cCCCCceecceeeEEcCCCC----CC--------- Confidence 1457899999999998864444444321 23445678899999988541 11 Q ss_pred cccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_013692. 315 PMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIA 391 (399) Q Consensus 315 ~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .+++|.-+.+ .+..+ .+ +.+.+. +.+++-+++.+.|+ ++++..+++++-++ T Consensus 332 -------------~~~~gd~~~~~~~~~~-~~----~~i~~~--------~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v 385 (390) T protein:vir:97 332 -------------EFLVGAFDLAAQIFDQ-WD----ARVEIG--------YVNDDFQRNMVTVLAEERLALVVYRPEALI 385 (390) T ss_pred -------------cEEEEeccceEEEEEe-cc----eEEEEe--------ecccccccCcEEEEEEEeeccEEeccccEE Confidence 1356653322 12221 12 111111 12346678888888 68999999999999 Q ss_pred EEEEe Q lcl|NC_013692. 392 LLKTV 396 (399) Q Consensus 392 ~iet~ 396 (399) .++.+ T Consensus 386 ~~~~a 390 (390) T protein:vir:97 386 TGSFA 390 (390) T ss_pred EEEeC Confidence 99999 No 94 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.32 E-value=2.3e-07 Score=56.96 Aligned_cols=298 Identities=13% Similarity=0.041 Sum_probs=154.3 Q ss_pred CCCccccccccccC-----CCCCCcccccccceehhhhhHHHHHHhh-hHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVDNIKPMKYN-----DPANGVESSIGPQIHTRYWYKRALIDAA-KEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~~~~~~~~n-----~~~~~~~~~i~p~~~t~y~~~k~L~~A~-p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) ....+....-..+. ..+.+..+.+-|. -| ....+..+. +.-.+.+++.+.++ .|+.. +.+...- T Consensus 233 ~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~---~~-~~~ii~~~~~~~~~l~~~~~~~~~---~g~~~-~~~~~~~-- 302 (543) T protein:vir:81 233 HAAILTEEEKRAINEVRAMGLTKADGGYLVPF---QL-DPTVIITSNGSLNDIRRFARQVVA---TGDVW-HGVSSAA-- 302 (543) T ss_pred HHHHhhhhhhhhhhhhhhcccccccCcccCch---hh-hhHHHHHHHhhhchhhhhcccccC---CcceE-EEEecCC-- Confidence 00000000001111 1111112223342 12 233444444 33456666655333 45432 2121111 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchh Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPA 154 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~ 154 (399) |...+.+.| +.+-.-.+++..|+.+.++++.++.+|+++.+. + +. T Consensus 303 ----------~~a~~v~Eg-----------------------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d-~-~~ 347 (543) T protein:vir:81 303 ----------VQWSWDAEF-----------------------EEVSDDSPEFGQPEIPVKKAQGFVPISIEALQD-E-AN 347 (543) T ss_pred ----------cceeecccC-----------------------ccccccccccceeeeeeeeeEeeehhhHHHHhc-c-HH Confidence 111222221 111112246778999999999999999998874 3 56 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 155 MEGHVTTEMVKGANEITEDLLQIDLLNSAG-T------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 155 L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) |...|..+|...-+...+. -+|++-+ . +.+++.............++++++..+...|+.+.... T Consensus 348 ~~~~i~~~l~~~~~~~~d~----ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~---- 419 (543) T protein:vir:81 348 VTETVALLFAEGKDELEAV----TLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQ---- 419 (543) T ss_pred HHHHHHHHHHHHHHHHHHH----HHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCC---- Confidence 7777776666665553333 3444322 1 11111111111112235688999999988887654421 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) -+.+|||.....|+.|+|--+.|=|-+ +..|.-+++-|..+++++.|-.......+ T Consensus 420 ---------------~~~v~n~~~~~~l~~lkd~~G~~l~~~---------~~~g~~~~l~G~pv~~~~~~~~~~~~~~~ 475 (543) T protein:vir:81 420 ---------------GAWLANNLIYNKIRQFDTQGGAGLWTT---------IGNGEPSQLLGRPVGEAEAMDANWNTSAS 475 (543) T ss_pred ---------------cEEEEcHHHHHHHHHhhcCCCceeccC---------cCCCCCccccceeeEEecccccccccccc Confidence 245799999999999987666665644 23445568889999999875443332221 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRP 387 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~ 387 (399) . +.+ .++||.=+.-.++... . +.+.+.+-+. .+.....|++++..| +++++.++++ T Consensus 476 ~---------------~~~-~i~~gd~~~~~i~~~~---~--~~i~~~~~~~--~~~~~~~~~~~~~~~-~r~d~~v~~~ 531 (543) T protein:vir:81 476 A---------------DNF-VLLYGNFQNYVIADRI---G--MTVEFIPHLF--GTNRRPNGSRGWFAY-YRMGADVVNP 531 (543) T ss_pred C---------------Ccc-eEEEeeccceeEEeec---c--cEEEEecccc--ccchhhcCceEEEEE-EeeccEeecc Confidence 1 122 3556776655555542 2 3333333221 233334455555442 4688899999 Q ss_pred cceEEEEEecCC Q lcl|NC_013692. 388 EWIALLKTVARL 399 (399) Q Consensus 388 ~~m~~iet~A~~ 399 (399) .-++.++..+.- T Consensus 532 ~A~~~l~~~~~a 543 (543) T protein:vir:81 532 NAFRLLNVETAS 543 (543) T ss_pred cceEEEEecccC Confidence 998888766555 No 95 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.29 E-value=2e-07 Score=57.32 Aligned_cols=285 Identities=12% Similarity=0.061 Sum_probs=151.2 Q ss_pred CCCccccccccccCCC-CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDP-ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~-~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) ..............+. ..+++++-+.-+-+.| .+..++...+...+.+++...+++.+. +++.+...-. T Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~------ 166 (390) T protein:vir:81 97 DRSARATMNIKAALNTASTDAAGSAGALTTPNR-LPGFITPPDARLTVRDLIGSGRTDSAL---IEYVQETGFV------ 166 (390) T ss_pred hhhhhhhhHHHHHHHhhccccccCCcceechhh-hHHHHHHHhhhhhhhhhcceeeccCCc---eEEEEEecCC------ Confidence 0111111111111011 1111122111111123 356666677778888999988887543 4443332111 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +...+... |+.+.-..+++..++.++++++.++.+|+++.+. +. .|...| T Consensus 167 -----~~a~~v~E-----------------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d-~~-~~~~~i 216 (390) T protein:vir:81 167 -----NNAAIVAE-----------------------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSD-AP-QLASYM 216 (390) T ss_pred -----cceeeecC-----------------------CcccccccceeeEEEEeeeEEEEeehhhHHHHHh-HH-HHHHHH Confidence 11111111 1112223357788999999999999999998774 43 577777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc-Cceeee----ccccccccccCCcceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNS-AGTVRY----PGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMI 234 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~a-gt~V~Y----Ag~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~ 234 (399) ..+|.+..+.-.+ ..+|++ |++... ................++++|..+.-.|+....+. T Consensus 217 ~~~l~~~~~~~~d----~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 281 (390) T protein:vir:81 217 NNRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNP----------- 281 (390) T ss_pred HHHHHHHHHHHHH----HHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCC----------- Confidence 6666666555333 344532 222111 11111112222335567888888877666554421 Q ss_pred CcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccc Q lcl|NC_013692. 235 DTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQV 314 (399) Q Consensus 235 gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~ 314 (399) + ..+|||.....|+.++|--+.|=|.+ ...+..+.+-++++++++.|. +| T Consensus 282 -------~-~~v~~~~~~~~l~~lkd~~G~~l~~~---------~~~~~~~~l~G~pv~~~~~~p----~~--------- 331 (390) T protein:vir:81 282 -------S-GIVINPIDWAAIELAKDANNQYLIGN---------ARGTLTPTLWGLPVVATQAMA----PG--------- 331 (390) T ss_pred -------C-EEEEcHHHHHHHHHhhcCCCceeecC---------cccccCceecceeeEEcCCCC----CC--------- Confidence 1 35789999999999987555544432 123445678899999888542 11 Q ss_pred cccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceE Q lcl|NC_013692. 315 PMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIA 391 (399) Q Consensus 315 ~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~ 391 (399) .+++|.-+.+ .+... +++.+ -+. +.+.+-+++.+.|+ .++.+.+++++-++ T Consensus 332 -------------~~~~gd~~~~~~~~~~---~~~~v--~~~--------~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v 385 (390) T protein:vir:81 332 -------------EFLVGAFDLAAQIFDQ---WDARV--EIG--------YVGEDFQRNMITVLAEERLALVVYRPEALI 385 (390) T ss_pred -------------cEEEEehhceEEEEEe---cceEE--EEe--------cccchhhcCcEEEEEEEeeccEEecccceE Confidence 1355654332 11111 12112 111 12235566666665 67889999999999 Q ss_pred EEEEe Q lcl|NC_013692. 392 LLKTV 396 (399) Q Consensus 392 ~iet~ 396 (399) ++..+ T Consensus 386 ~~t~a 390 (390) T protein:vir:81 386 SGSFA 390 (390) T ss_pred EEEeC Confidence 99999 No 96 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.28 E-value=4.6e-07 Score=55.34 Aligned_cols=284 Identities=12% Similarity=0.063 Sum_probs=153.9 Q ss_pred CC-CccccccccccCCCCCCccc-ccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MA-GPVDNIKPMKYNDPANGVES-SIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~-~~~~~~~~~~~n~~~~~~~~-~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) +. +..+..+.+.- .+ ++.+ -+-|+ .+..+.+....+...+.+++.+.+|+-+.|+....++ .-.. T Consensus 98 l~~~~~~~~~~~~~--~t-~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~----- 164 (397) T protein:vir:49 98 VRGRYQNLLDSKTD--GS-GSDAGLTIPQ----DIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW-ADIT----- 164 (397) T ss_pred hhcchhhHHHhhhc--cC-CccCcceecH----HHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee-ccCC----- Confidence 11 11111111111 11 1111 11233 3346666667788889999999999998887543332 2111 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) +.+.+...| +..|.. -..++..|+.+.++++.++.+|+++.. +++..|... T Consensus 165 ------~~a~~v~E~-------------~~~~~~---------~~~~~~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~ 215 (397) T protein:vir:49 165 ------GLAKLDDEG-------------GQIGQN---------DDPKLSLIRYAIKRYAGISTVTNSLLA-DSAENILAW 215 (397) T ss_pred ------cceeeeccc-------------cccccc---------cccceeeeEeeeeeeEeehhhHHHHHh-hhhHHHHHH Confidence 111122111 000110 012567899999999999999998775 334446565 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) |..+|...-+. .+-..+|++-++ ++ .....+++++|.++...|+.+-.+. T Consensus 216 i~~~l~~~~~~----~~d~ail~G~g~------~~-----~~~~~~~~d~i~~~~~~l~~~~~~~--------------- 265 (397) T protein:vir:49 216 LSGWIAKKVVV----TRNKAILEAIGT------LP-----NKPTLAKWDDIIDLQAKVDPAIKQT--------------- 265 (397) T ss_pred HHHHHHHHHHH----HHHHHHHhcccc------cc-----ccccccCHHHHHHHHHhhhhhhcCC--------------- Confidence 65555555444 223345533221 11 1235678999999888877544321 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) -+.+|||.....|+.|+|--+.|=|.|- +..|--+++-++.++.++.. +...+ T Consensus 266 ----a~~v~n~~~~~~l~~lkd~~g~~l~~~~--------~~~g~~~~l~G~pV~~~~~~--~~~~~------------- 318 (397) T protein:vir:49 266 ----SLFLTNTSGFTALKKVKNAMGDYLMERD--------VKSPTGYSIDGFVVKEISDR--FLPNG------------- 318 (397) T ss_pred ----CEEEEcHHHHHHHHHhhccCCceeeccc--------ccCCCCceecceeeEEeccc--ccccc------------- Confidence 2567999999999999876666655441 22334456778776655421 11111 Q ss_pred cCccceeEEEEEEccccc-eecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEE Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAF-TTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKT 395 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Af-g~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet 395 (399) +.++.+ ++||.-+- -.+..+ +++.+ .+.+- .+-+-+++...++ +++.+.+++++-++.++. T Consensus 319 ~~~~~~----~~~gd~~~~~~~~~~---~~~~i--~~~~~-------~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~ 382 (397) T protein:vir:49 319 TGGAMP----LYFGDLKQAVTLFDR---QHLSL--LSTNI-------GGGAFETDTTKVRVIDRFDVVSTDTEAFVPASF 382 (397) T ss_pred cCCcee----EEEeeccceEEEEee---cccEE--EEecc-------ccchhhcCeeeEEEEEeeccEEecccceEEEEe Confidence 111222 35675332 122222 22222 21111 1224456666655 678899999999999998 Q ss_pred ecCC Q lcl|NC_013692. 396 VARL 399 (399) Q Consensus 396 ~A~~ 399 (399) .+.. T Consensus 383 ~~~~ 386 (397) T protein:vir:49 383 KAIA 386 (397) T ss_pred cccc Confidence 8877 No 97 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.27 E-value=6.5e-07 Score=54.51 Aligned_cols=285 Identities=13% Similarity=0.102 Sum_probs=155.5 Q ss_pred CCCccccccccccCCCCCCcccc-cccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESS-IGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~-i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) +-+...+... .-+..+ ++.++ +-|+ .+..+.+....+...+.+++...+|+.+.|+....+. ....... T Consensus 98 l~~~~~~~~~-~~~~~t-~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~a--- 167 (397) T protein:vir:49 98 VRGRYQNLLD-SKTDAS-GSDAGLTIPQ----DIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKW-TDITGLA--- 167 (397) T ss_pred HhcchhHHHH-Hhhccc-cccCcccccH----hHHHHHHHHHHhhhhHHhhhceeecccCccceEEEee-ccCCcce--- Confidence 1111100000 000011 11122 2343 3356666667788899999999999999987543322 2111111 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +..+.|..+ | ..-..++..|+.++++++.++.+|+++.+ ++++.|...| T Consensus 168 --~~v~E~~~~-------------------~---------~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i 216 (397) T protein:vir:49 168 --NIDDEAGKI-------------------A---------DVDDPKLSLIKYTIKRYAGISTVTNSLLA-DSAENILAWL 216 (397) T ss_pred --eeecCcccc-------------------c---------cccccceeeEEeeeeeEEeeehhHHHHHh-hhHHHHHHHH Confidence 111111111 0 00123677899999999999999999875 4555676666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) ..+|...-+. .+...+|++-++ ++ ......++++|..+...|+.+-.+. T Consensus 217 ~~~l~~~~~~----~~d~ai~~G~g~------~~-----~~~~~~~~d~i~~~~~~l~~~~~~~---------------- 265 (397) T protein:vir:49 217 SGWIAKKVVV----TRNKAILEAIAA------LP-----TKPTLTKWDDIIDLEAKVDPAIKQT---------------- 265 (397) T ss_pred HHHHHHHHHH----HHHHHHHhhccc------cc-----cccccccHHHHHHHHHhhhhhhcCC---------------- Confidence 6665555444 233345543111 11 1224568899999988887654321 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) + +.+|||.....|+.|+|--+.|=|.| .+..+.-+++-|+.++.++. .|...++. T Consensus 266 --a-~~vmn~~~~~~l~~lkd~~G~~l~~~--------~~~~~~~~~l~G~PV~~~~~--~~~~~~~~------------ 320 (397) T protein:vir:49 266 --S-FFLTNTSGFTALKKVKNALGDYLMER--------DVKSPTGYSIDGFAVKEVAD--RWLANGTG------------ 320 (397) T ss_pred --C-EEEEcHHHHHHHHHhhcCCCceeecc--------CcCCCCCceecceeeEEecc--cccccccC------------ Confidence 1 45789999999999988766666644 23455667888888876653 22222111 Q ss_pred CccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEe Q lcl|NC_013692. 320 GGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTV 396 (399) Q Consensus 320 ~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~ 396 (399) +-. .+++|.=+ |-.+..+ +.+.+ .+.+- .+-+-+.+...++ ..+.+.+++++-++.++.. T Consensus 321 ----~~~-~i~~gd~~~~~~~~~~---~~~~i--~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 321 ----GAM-PLYFGDLKQAVTLFDR---QHMSL--LSTNI-------GGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred ----Cce-eEEEeeccceEEEEee---cceEE--EEecc-------ccchhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 111 24567433 2222222 11122 11111 1223445555554 5778899999999999877 Q ss_pred cCC Q lcl|NC_013692. 397 ARL 399 (399) Q Consensus 397 A~~ 399 (399) +.- T Consensus 384 ~~~ 386 (397) T protein:vir:49 384 AIA 386 (397) T ss_pred ccc Confidence 655 No 98 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.26 E-value=2.8e-07 Score=56.53 Aligned_cols=285 Identities=13% Similarity=0.068 Sum_probs=156.6 Q ss_pred CCCcccccc-ccccC-----CCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVDNIK-PMKYN-----DPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~~~~-~~~~n-----~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) ..+.+.... ....+ ..+.+..+.+-|+ .+....+....+...+.+++...+|+.+.. .+-+... T Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~---~~~~~~~--- 186 (418) T protein:vir:10 117 RKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVA----DRQAGIIAPPQRKMTIRDLLMPGQTSSSSI---EYTVETG--- 186 (418) T ss_pred hhhhhhhhHHHHHHHhhhhccCCCCCCccccch----hHHHHHHHHHhhhhhHHhhcceeeccCCce---eEEEEec--- Confidence 111111000 00000 0011111223332 334556666677888888998888875543 3322111 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchh Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPA 154 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~ 154 (399) .++...+.. +|+.+..-.+++..|+.+.++++.++.+|+++.+. ++ . T Consensus 187 --------~~~~a~~v~-----------------------E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d-s~-~ 233 (418) T protein:vir:10 187 --------FTNNAAAVA-----------------------EGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDD-AP-A 233 (418) T ss_pred --------CCCceeeec-----------------------cCccccccccceeeEEEeeeeEEEeehhhHHHHHh-HH-H Confidence 111111211 12222223357788999999999999999998874 44 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcC-ce------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 155 MEGHVTTEMVKGANEITEDLLQIDLLNSA-GT------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 155 L~~~i~~el~~~~~~~t~d~l~~~~l~ag-t~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) |...|..+|...-+.-.+ ..+|++- +. +..++.. ....+.....++++|..+...+.....+. T Consensus 234 l~~~i~~~l~~a~~~~~d----~a~l~G~g~~~~p~Gi~~~~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---- 303 (418) T protein:vir:10 234 LQSYIDGRARYGLQLTEE----GQILKGDGTGANILGILPQASAF--MPSITLANATPIDKIRLALLQAVLAEFPA---- 303 (418) T ss_pred HHHHHHHHHHHHHHHHHH----HHHhccCCCCccccccccccccc--cccccccccccHHHHHHHHHhhccccCCC---- Confidence 777776666665555333 3444332 11 1122211 22333345567888888876665443321 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) =..+|||.....|+.++|-.+.|=|-+ ...+..|.+-++.+|+++.|. +| T Consensus 304 ---------------~~~v~n~~~~~~L~~lkd~~G~~i~~~---------~~~~~~~~l~G~pV~~~~~~p----~~-- 353 (418) T protein:vir:10 304 ---------------TGIVLNPIDWASIELTKDSQGRYIVGN---------PVNGTTPRLWNLPVVETQAMT----AN-- 353 (418) T ss_pred ---------------CEEEEcHHHHHHHHHhhcCCCceeccc---------cccCCCceecceeeEEcCCCC----CC-- Confidence 136689999999999987555554521 245566888999999988652 11 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) .+++|.-+....-. ..+. +.+.+..- .+.+-+++.+.|+ +++.+.++ T Consensus 354 --------------------~~~~gd~s~~~~~~--~~~~--~~i~~~~~-------~~~~f~~~~~~~r~~~~~d~~~~ 402 (418) T protein:vir:10 354 --------------------EFLVGAFSMAAQIF--DRME--IEVLLSTE-------NVDDFEKNMVSIRAEERLALAVY 402 (418) T ss_pred --------------------cEEEeeccceEEEE--Eecc--eEEEEecc-------cchhhhcCceEEEEEEeeccEEe Confidence 14566533221111 1112 22222111 2335677888887 57889999 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-+++++..++. T Consensus 403 ~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 403 RPESFVTGALVEQA 416 (418) T ss_pred cccceEEEEeccCC Confidence 99999999999999 No 99 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.20 E-value=2e-07 Score=57.27 Aligned_cols=282 Identities=11% Similarity=0.051 Sum_probs=152.9 Q ss_pred CCCccccccccccC--------CCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCC Q lcl|NC_013692. 1 MAGPVDNIKPMKYN--------DPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPL 72 (399) Q Consensus 1 ~~~~~~~~~~~~~n--------~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl 72 (399) ..+...+......+ ..+.+.-+.+-|+ .+....+....+..++.+++.+.+|+...|+.... +-..- T Consensus 103 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~ 177 (397) T protein:vir:12 103 RGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPE----DIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLE-KNADM 177 (397) T ss_pred hccCCcHHHHHHHhhhhhhhccccccccCcccCch----hHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEE-EecCC Confidence 11211111111110 0111111122343 33455555566778889999999999888753322 11111 Q ss_pred cCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhc Q lcl|NC_013692. 73 LDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSD 152 (399) Q Consensus 73 ~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D 152 (399) + .-....||-+ + |..+ ..++..|+.+.++++.++.+|+++.. +++ T Consensus 178 ~-~a~~v~Eg~~-----~-------------------~~~~---------~~~~~~v~~~~~k~~~~~~is~e~l~-ds~ 222 (397) T protein:vir:12 178 V-PFSPVEELGN-----L-------------------PEID---------QPRFTKVSYSIIDYGGIMTLSNSMLN-DSD 222 (397) T ss_pred c-ceeeeccccc-----c-------------------cccc---------cccceeEEeeheeeEeeehhhHHHHh-hch Confidence 1 1112222211 0 1000 12567899999999999999999886 444 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccc Q lcl|NC_013692. 153 PAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGT 231 (399) Q Consensus 153 ~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s 231 (399) ..|.+.|..+|...-+. .+-..+|++-++ + .....+++++|..+.. .|+..-.+ T Consensus 223 ~~l~~~i~~~l~~~~~~----~~d~~il~G~g~------~------~~~g~~~~~~i~~~~~~~l~~~~~~--------- 277 (397) T protein:vir:12 223 QAIMTYVAKWFAKKSVV----TRNNLILAAIAS------L------KKVDIDGLDGIKKALNVTLDPMVAP--------- 277 (397) T ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHhcccc------c------cccccccHHHHHHHHhhccchhhhC--------- Confidence 55766666665555544 223345543222 1 1224577888877653 44433221 Q ss_pred cccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCc Q lcl|NC_013692. 232 RMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN 311 (399) Q Consensus 232 ~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~ 311 (399) .-+.+|||.....|+.++|--+.|-|.|- +..|--+++-|..+++++..++-.++|. T Consensus 278 ----------~a~~~~n~~~~~~L~~lkd~~G~~l~~~~--------~~~g~~~~l~G~pv~~~~~~~~~~~~~~----- 334 (397) T protein:vir:12 278 ----------GSIVLTNQDGYDWLDTLKDGTGRYLLQPD--------PTNPTKKLLDGRPVVPFTNRVLKTQKGK----- 334 (397) T ss_pred ----------CCEEEEcHHHHHHHHHhhccCCceeeccc--------ccCCCCccccceeeEEecccccccCCCc----- Confidence 12357999999999999876666655442 2344556778899888775333222111 Q ss_pred ccccccccCccceeEEEEEEcccc--ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccc Q lcl|NC_013692. 312 DQVPMHESGGKYSVFPMLCVASEA--FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRP 387 (399) Q Consensus 312 ~~~~~~~~~~~~DVYp~lV~G~~A--fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~ 387 (399) .+ ++||.-+ |- +..+ +.+.+++ . +..+.+-+++...|+ +++.+.++++ T Consensus 335 --------------~~-~~~gd~~~~~~-~~~~---~~~~i~~--~-------~~~~~~f~~~~~~~r~~~r~d~~~~~~ 386 (397) T protein:vir:12 335 --------------AP-LIIGNLKEAIV-LFDR---EQQSIAS--T-------DTGAGAFETNSTKVRGIEREDVRKWDE 386 (397) T ss_pred --------------cE-EEEEehhceEE-EEee---cceEEEE--e-------ccccchhhcCceEEEEEEeeccEEecc Confidence 11 4577532 22 2211 1112211 0 112334566777777 4589999999 Q ss_pred cceEEEEEecC Q lcl|NC_013692. 388 EWIALLKTVAR 398 (399) Q Consensus 388 ~~m~~iet~A~ 398 (399) +-++.++..|+ T Consensus 387 ~a~~~~~~t~~ 397 (397) T protein:vir:12 387 DAVVFGQITVE 397 (397) T ss_pred cceEEEEEeeC Confidence 99999999999 No 100 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.19 E-value=1.3e-06 Score=52.89 Aligned_cols=277 Identities=11% Similarity=0.051 Sum_probs=148.0 Q ss_pred ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhh Q lcl|NC_013692. 12 KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIA 91 (399) Q Consensus 12 ~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ 91 (399) .-+.-..+++++ +.-+-+-.+..+.++..++...+.+++.+.+|+.+.|+....++-.- ++.+.+.. T Consensus 1 ~l~~~~~~t~~~-gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~------------~~~a~~v~ 67 (293) T protein:vir:48 1 MLDSKTDHSGSD-AGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDI------------TGLANIDD 67 (293) T ss_pred CceeecccccCc-CceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCC------------Ccceeeec Confidence 111111111111 11111224456677677788899999999999999986544432111 11111221 Q ss_pred hhhhcccccccccccccccccccccccee-eccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH Q lcl|NC_013692. 92 NGNLYGSSRDVGNITAKMPTLTEIGGRVN-RVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI 170 (399) Q Consensus 92 ngn~~~ss~d~g~it~k~~~lte~g~r~~-~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~ 170 (399) .| +.+- .-..++..|+.+.++++.+.++|+++.+ +++..|.+.|..++.+.-+. T Consensus 68 Eg-----------------------~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~~- 122 (293) T protein:vir:48 68 EA-----------------------GKIADIDDPKLSLIKYTIKRYAGISTVTNSLLA-DSAENILAWLSGWIAKKVVV- 122 (293) T ss_pred CC-----------------------cccccccccceeEEEEeeeEEEEeehhhHHHHh-hhhHHHHHHHHHHHHHHHHH- Confidence 11 0000 1113678899999999999999999876 34445656555555444333 Q ss_pred HHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechh Q lcl|NC_013692. 171 TEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSD 250 (399) Q Consensus 171 t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~d 250 (399) .+| ..++++ .. ..+ .....+++++|.++...|+..-.+ .+ +-+||+. T Consensus 123 ~~~---~~i~~g-~~-----~~~-----~~~~~~~~d~i~~~~~~l~~~~~~------------------~a-~~vmn~~ 169 (293) T protein:vir:48 123 TRN---KAILGV-VD-----KLP-----TKPTLTKWDDIIDLEAKVDPAIKQ------------------TS-FFLTNTS 169 (293) T ss_pred HHH---hHHhhc-cc-----ccc-----ccccccCHHHHHHHHHhhhhhhcC------------------CC-EEEEcHH Confidence 333 344422 11 111 123567899999988877643221 11 3468999 Q ss_pred hhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEE Q lcl|NC_013692. 251 LVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLC 330 (399) Q Consensus 251 l~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV 330 (399) ....|+.|+|--+.|=|.|- +.++--+++-+..+++++. ..+.+. ..+.++ ++ T Consensus 170 ~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G~Pv~~~~~-~~~~~~-----------------~~~~~~-~~ 222 (293) T protein:vir:48 170 GFTALKKVKNALGDYLMERD--------VKSPTGYSIAGFAVKEISD-RWLPNA-----------------SSGVMP-LY 222 (293) T ss_pred HHHHHHHhhccCCceEeecC--------cCCCCCceecceeeEEecc-cccCCc-----------------cCCceE-EE Confidence 99999999876555555431 2334456777877765542 111111 111333 34 Q ss_pred Ecc--ccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 331 VAS--EAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 331 ~G~--~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A~~ 399 (399) +|. ++|-... + + . +.+.+..- .+-+-+++..+++ ..+.+.+.+++-++.++..+.. T Consensus 223 ~gd~~~~~~~~~-~-~--~--~~i~~~~~-------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 282 (293) T protein:vir:48 223 FGDLKQAVTLFD-R-Q--Q--MSLLSTNI-------GGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 282 (293) T ss_pred EEeccceEEEEE-e-c--c--eEEEEecc-------cchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccc Confidence 563 3332211 1 1 1 12222111 1123344444444 5678899999999999876655 No 101 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.17 E-value=2.9e-07 Score=56.41 Aligned_cols=302 Identities=14% Similarity=0.071 Sum_probs=156.9 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) -+| .-+++..-..... ++++-+.-+-+. +..+.++...+.-++.+++.+.+|+.+ ++++-+... T Consensus 2 ~~~--~~~~~e~~~~~~~-~~~~~~~~ip~~-~~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~ip~~~~--------- 65 (318) T protein:vir:24 2 AAG--TAFAVDHAQIAQT-GDTMFKGYLEPE-QAKDYFAEAEKTSIVQQFAQKVPMGTT---GQKIPHWVG--------- 65 (318) T ss_pred CCC--CCCCHHHHHhhcc-cCcccceeechh-HHHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEeC--------- Confidence 111 2222222221111 122222222222 346677777888899999999998743 344433221 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .|..+++..| +.+.....+++.|+.+.++++.+..+|+++.+. ++..+.+.|. T Consensus 66 ---~~~a~~v~Eg-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d-s~~~~~~~i~ 118 (318) T protein:vir:24 66 ---DVSAQWIGEG-----------------------DMKPITKGNMTSQTIAPHKIATIFVASAETVRA-NPANYLGTMR 118 (318) T ss_pred ---CcceEEecCC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHhhc-ChHHHHHHHH Confidence 1222333222 222223457788999999999999999988763 3344666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccc-----cccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccC Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGA-----ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMID 235 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~-----aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~g 235 (399) .+|.+.-+. .+...+|++-++-.-.+. +.+.+..........+++..+...++..... T Consensus 119 ~~l~~~~~~----~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------- 181 (318) T protein:vir:24 119 TKVATAFAM----AFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKK------------- 181 (318) T ss_pred HHHHHHHHH----HHHHhhhcccCCCCCcccccccccccccccccccchHHHHHHHHHHhhccccCC------------- Confidence 665555544 333344543222110100 1111122222334444455555444332221 Q ss_pred cccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccc Q lcl|NC_013692. 236 TRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVP 315 (399) Q Consensus 236 T~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~ 315 (399) .+ +.+|||.....|+.++|-.+.|=|.+...-+...+ .+.+.+-++.++.++.+ ..|. T Consensus 182 -----~~-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~---~~~~~i~g~pv~~~~~~----~~~~--------- 239 (318) T protein:vir:24 182 -----WT-HTLLDDITEPILNGAKDQNGRPLFIESTYGEAASP---FRSGRIVARPTILSDHV----VEGT--------- 239 (318) T ss_pred -----CC-EEEEcHHHHHHHHHhhccCCceeecCccccCcccc---ccCceEEEEeeEEeCCC----CCCc--------- Confidence 12 35899999999999998777777766444444433 23355666666666532 1111 Q ss_pred ccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCC--CcCCCCCCcc--chhhHHHHH--HHHHHhhccccc Q lcl|NC_013692. 316 MHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPG--EATADRSDPY--GEMGFMSIK--WYYGFMVFRPEW 389 (399) Q Consensus 316 ~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG--~~tad~~DPl--gQrg~~gwK--~~~~~~iLn~~~ 389 (399) .++++|+=+...++..+ .+.+++. ...+ .++.+.++|. -|++...|| +++.+.+++++- T Consensus 240 -----------~~~~~gdfs~~~~~~~~---~l~i~~~-~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 304 (318) T protein:vir:24 240 -----------TVGFMGDFSQLIWGQIG---GLSFDVT-DQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEA 304 (318) T ss_pred -----------cEEEEeecceEEEEEec---CeEEEEe-eccceeccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 13456665555454432 2222211 1111 0011223332 466667766 788999999999 Q ss_pred eEEEEEecCC Q lcl|NC_013692. 390 IALLKTVARL 399 (399) Q Consensus 390 m~~iet~A~~ 399 (399) ++.|..++-= T Consensus 305 ~~~i~~~~a~ 314 (318) T protein:vir:24 305 FVALTNVVSG 314 (318) T ss_pred eEEEEeeccC Confidence 9998876555 No 102 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.16 E-value=5.1e-07 Score=55.08 Aligned_cols=282 Identities=11% Similarity=0.014 Sum_probs=154.6 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.+..+.... ...+.+.-+.+-|+ .|..+.+....+...+.+++...+|+.+.|+...+.+...- T Consensus 98 ~~~~~~~~~~---~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-------- 162 (395) T protein:vir:38 98 VKDFKNLVTS---GTTGTGNAGLTIPE----DIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADIT-------- 162 (395) T ss_pred HHHHHHHHhh---ccCccCCCceecch----hHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCC-------- Confidence 1111100000 00111111112233 34566777777888999999999999999975554333221 Q ss_pred CCCCcchhhhhhhhhcccccccccccccccccccccccee-eccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVN-RVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~-~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +.+.+..+| +.+- ....++..|+.+.++++.++.+|+++.. ++++.|.+.| T Consensus 163 ----~~a~~v~E~-----------------------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i 214 (395) T protein:vir:38 163 ----PLKDLDDES-----------------------ALIGDNDDPELTVVKYLIHRYAGITTVTNTLLK-DTVDNIIQWL 214 (395) T ss_pred ----ccccccccc-----------------------cccccccccceeeEEeeeeeeEeehhhHHHHHh-hhHHHHHHHH Confidence 111121111 1111 1124678899999999999999998876 5666787777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) ..+|.+.-+.. +-..+|++-+. . . ......++++|-.+.- .|+..-. T Consensus 215 ~~~la~~~~~~----~~~~il~g~g~--------~-~--~~~~~~~~~~i~~~~~~~l~~~~~----------------- 262 (395) T protein:vir:38 215 VNWAAKKDVVT----RNAKILEVMGK--------A-P--KKPTISQFDNIKDLENNTLDPAIE----------------- 262 (395) T ss_pred HHHHHHHHHHH----HHHHHhhcccc--------c-c--cccccccHHHHHHHHHHhhhhhhc----------------- Confidence 77766655542 33345543221 1 0 1123456777766542 3322111 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) +.-+-+|||....-|+.++|--+.|-|.+- +..|.-+++-++.+++++.+. ...++ T Consensus 263 --~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~--~~~~~------------ 318 (395) T protein:vir:38 263 --STSSFITNQSGYNILSKVKDADGRYLMQPD--------VTSPDKYLIDGKPVIRIADKW--LPDVS------------ 318 (395) T ss_pred --CCCEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceeccceeEEecccc--cCcCC------------ Confidence 112457999999999999876666656442 234555678888888887421 11000 Q ss_pred cCccceeEEEEEEccccc-eecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEE Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAF-TTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIALLKT 395 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Af-g~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~iet 395 (399) +. . .++||.-+- -.+..+ .+ +.+ -+. +-.+.+-+++..+|++ ++.+.+++++-++.++. T Consensus 319 --~~---~-~i~~gd~~~~~~i~~~-~~--~~i--~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 380 (395) T protein:vir:38 319 --GS---H-PLYFGDLKQGITLFDR-QQ--MQI--DTT-------NVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASF 380 (395) T ss_pred --Cc---c-eEEEEeccccEEEEEe-cc--eEE--EEe-------ccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 01 1 246775332 223322 11 111 111 1122345677777774 48999999999999998 Q ss_pred ecCC Q lcl|NC_013692. 396 VARL 399 (399) Q Consensus 396 ~A~~ 399 (399) .+.. T Consensus 381 ~~~~ 384 (395) T protein:vir:38 381 KTVA 384 (395) T ss_pred eccc Confidence 7666 No 103 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.09 E-value=2.9e-06 Score=50.92 Aligned_cols=304 Identities=12% Similarity=0.070 Sum_probs=147.5 Q ss_pred CC-CccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MA-GPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~-~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) ++ ..+.......-.....++.+.+-|+ .+ ..+.+....+..++.+++ .+.+|...|+ +++-|.. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg~liP~---~~-~~~ii~~l~~~~~l~~~~-~~~~~~~~g~-~~~p~~~--------- 177 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGGVLIPQ---NI-HSEVIELLRDRTIVRKLG-ARSIPLPNGN-MSLPRLA--------- 177 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCccccch---hH-HHHHHHHHhhhchhhhhc-ceeeecCCcc-eEEEEEe--------- Confidence 00 0000000000011111111223353 22 355666667777888883 2346665665 3443332 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) .| |...+... |+....-.+++..|+.+.++++.++.+|+++.+ ++++.|.+.| T Consensus 178 -~~--~~a~~v~E-----------------------g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~-ds~~~l~~~i 230 (428) T protein:vir:10 178 -GG--ATASYTGE-----------------------NQDAKVSEARFDDVKLTAKTMIAMVPISNALIG-RAGFNVEQLV 230 (428) T ss_pred -CC--cceeeecc-----------------------CccccccccceeeEEeeeEEEEEeehhhHHHHh-hhhHHHHHHH Confidence 11 11122221 222223345778899999999999999999866 5577787777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCc-eeeecc-----cccccc-ccCCcceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAG-TVRYPG-----AATSDA-EVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTR 232 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt-~V~YAg-----~aTsra-~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~ 232 (399) ..+|...-+...+. -+|++.+ +..--| ..+... ........+.+.++.....|....... T Consensus 231 ~~~l~~ai~~~~d~----~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 297 (428) T protein:vir:10 231 LQDILTAISVREDK----AFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDG--------- 297 (428) T ss_pred HHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhcc--------- Confidence 77777766654443 3444322 110000 001000 011123455555555444332211100 Q ss_pred ccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcc Q lcl|NC_013692. 233 MIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPND 312 (399) Q Consensus 233 ~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~ 312 (399) ..... .-+-++|+....-|+.|+|--+.|-|.+.. -|++-+.++++++.|..-.+.+ T Consensus 298 ---~~~~~-~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~------------~g~l~G~pv~~~~~~p~~~~~~------- 354 (428) T protein:vir:10 298 ---NSNMI-SSGWGMSNRTYMKLFGLRDGNGNKVYPEMA------------QGMLKGYPIQRTSAIPANLGEG------- 354 (428) T ss_pred ---ccccc-cCEEEEcHHHHHHHHHhhccCCceeccCCC------------CCeeeceeeEEeccccccccCC------- Confidence 00011 123467999999999998877777774431 2578899999888543221111 Q ss_pred cccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCC-Cc-CCCCCCccchhhHHHHH--HHHHHhhcccc Q lcl|NC_013692. 313 QVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPG-EA-TADRSDPYGEMGFMSIK--WYYGFMVFRPE 388 (399) Q Consensus 313 ~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG-~~-tad~~DPlgQrg~~gwK--~~~~~~iLn~~ 388 (399) +.+ +.++||.=++-.++..+ . +++.+.+=+ .. ....--.+-|+..+.|+ +.+.+.+.+++ T Consensus 355 -------~~~----~~i~~gd~s~~~i~~~~---~--i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~ 418 (428) T protein:vir:10 355 -------GKE----SEIYFADFNDVVIGEDG---N--MKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPE 418 (428) T ss_pred -------Ccc----ceEEEEecceEEEEEec---c--eEEEeecccccccccccccchhhcchhheeeeeeeCceeeccc Confidence 111 12456665555555442 1 111111110 00 00011135567777777 45566777777 Q ss_pred ceEEEEEecC Q lcl|NC_013692. 389 WIALLKTVAR 398 (399) Q Consensus 389 ~m~~iet~A~ 398 (399) -.+++.-+.= T Consensus 419 a~~~~t~~~~ 428 (428) T protein:vir:10 419 GLVLGTGVLF 428 (428) T ss_pred eEEEEeccCC Confidence 7766655555 No 104 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.08 E-value=4.6e-07 Score=55.30 Aligned_cols=286 Identities=13% Similarity=0.072 Sum_probs=152.1 Q ss_pred CCCcccccc-ccccCCCCCCcccccccceehhhhhHHHHHHhhh-HHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAGPVDNIK-PMKYNDPANGVESSIGPQIHTRYWYKRALIDAAK-EAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~~~~~~~-~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p-~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) +.+...... .......+++..+.+-|+ +.+ .+.+++..+ ..++.+++.+.++. .+..+++.+..-- T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g~~~~~--~~~--~~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~p~~~~~------ 164 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNPNVLSR--TLY--GQLIAQAVERSAIMRGGATTFTTS--DANPLDFTVITGR------ 164 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCCccccc--cch--HHHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCC------ Confidence 111110000 000011111111222222 223 556666554 45677788887653 3344444332211 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGH 158 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~ 158 (399) +...+.+ +|+.+..-..++..++.+.++++.+..+|+++.+ +++..|.+. T Consensus 165 ------~~a~wv~-----------------------E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~ 214 (390) T protein:vir:62 165 ------SSASIVG-----------------------ETAEIPESYPATAQRSMGGFKYGFASVVSYEFAT-DQVLDLVGF 214 (390) T ss_pred ------cceeeec-----------------------ccccccccccceeeeEeeeeeEEeehHHHHHHHh-hhhHHHHHH Confidence 1112222 2222333345778899999999999999999987 455567666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCce----eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGT----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMI 234 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~ 234 (399) |..+|...-+...++ .+|++-+. +-+++.++........+.+++++|..+...|+..-.. T Consensus 215 i~~~l~~~i~~~~d~----~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~------------ 278 (390) T protein:vir:62 215 LVSDAGPAIGDAMGR----HFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRA------------ 278 (390) T ss_pred HHHHHHHHHHHHHHh----hhhccCCccccccccccccccceecccccccchHHHHHHHHhhhhhhhc------------ Confidence 666665555543333 34433221 1111111111122233568899988887777543221 Q ss_pred CcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccc Q lcl|NC_013692. 235 DTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQV 314 (399) Q Consensus 235 gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~ 314 (399) .+ +-+||+.+..-|+.|+|--+.|=|.|-. -.|.-+.+-+..+++++.+- + T Consensus 279 -----~a--~~vmn~~~~~~L~~lkd~~g~~l~~~~~--------~~g~~~~l~G~Pv~~~~~~p------~-------- 329 (390) T protein:vir:62 279 -----NA--KYVVNDLRAAQMRKLKDANGQYLWQSGL--------TVGAPSLFNGKVVETDDGMP------A-------- 329 (390) T ss_pred -----CC--EEEEchHHHHHHHHhhccCCCeeecCCc--------CCCccceecccceEEecCCC------C-------- Confidence 11 3478999999999998755555564432 23444567777777766421 0 Q ss_pred cccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEE Q lcl|NC_013692. 315 PMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIAL 392 (399) Q Consensus 315 ~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ 392 (399) -+ ++||.=++..++...+- .++ ...|++-+++.+.+++ .+.+.+++++-++. T Consensus 330 -----------~~-i~~gd~s~~~i~~~~~~-------~v~-------~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~ 383 (390) T protein:vir:62 330 -----------DK-ILFADLSKYRVRFAGSL-------RVD-------RSVDAKFSTDQIVYRFLQRADGLLVDARGAKV 383 (390) T ss_pred -----------cc-EEEeeccceeEEeecce-------EEE-------eeccccccCCcEEEEEEEEeCcEeechhheEE Confidence 01 45776555555554211 111 1245666666666664 48999999999988 Q ss_pred EEEecCC Q lcl|NC_013692. 393 LKTVARL 399 (399) Q Consensus 393 iet~A~~ 399 (399) |++.+-- T Consensus 384 l~~~~~a 390 (390) T protein:vir:62 384 LTVTPGA 390 (390) T ss_pred EEeecCC Confidence 8876666 No 105 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.06 E-value=1.7e-06 Score=52.15 Aligned_cols=301 Identities=14% Similarity=0.112 Sum_probs=147.6 Q ss_pred CC---------------CccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEE Q lcl|NC_013692. 1 MA---------------GPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIV 65 (399) Q Consensus 1 ~~---------------~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIk 65 (399) ++ ...........+..+.+..+-+-|+ .+.++.++...+..++.+++ .+.+|-..|. ++ T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~----~~~~~ii~~l~~~~~i~~~~-~~~v~~~~~~-~~ 178 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPE----NLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-IT 178 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccch----hHHHHHHHHHhhhchhhhcc-ceeeecCCCc-eE Confidence 00 0000111111222221111223444 23456666666777777772 3345555553 44 Q ss_pred EEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhh Q lcl|NC_013692. 66 RLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQE 145 (399) Q Consensus 66 frry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~ 145 (399) +-+...- |...+...| +....-..++..|+.+.++++.++.+|++ T Consensus 179 ~p~~~~~------------~~a~~v~E~-----------------------~~~~~~~~~f~~i~~~~~k~~~~~~is~e 223 (435) T protein:vir:80 179 IPRLKGG------------AIVGYIGAD-----------------------TDIPTTQQQFDDLKLTAKKMAALVPIAND 223 (435) T ss_pred EEEEeCC------------cceeeeccC-----------------------ccccccccceeeEEEeeEEEEEeehhhHH Confidence 4333211 111222211 11112234778899999999999999998 Q ss_pred hhhh-hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e------eeeccccccccccCCc--ceecHHHHHHHHHH Q lcl|NC_013692. 146 QLDF-DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T------VRYPGAATSDAEVDAT--TEVTYDSLMRLRLD 215 (399) Q Consensus 146 ~~~t-~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~------V~YAg~aTsra~v~~~--~~vt~~~lr~a~~~ 215 (399) +.+. ..+|.|.+.|..+|.+.-+.-.+. -+|++-+ . ..++..+.. ...+.+ ......++.+++.. T Consensus 224 ll~ds~~~~~l~~~i~~~l~~a~~~~~d~----a~l~G~G~~~~p~Gi~~~~~~~~~-~~~~~~~~~~~~~~d~~~~~~~ 298 (435) T protein:vir:80 224 LIKYAGVNPNVDQIVVGDLTAAIGAREDK----AFIRDDGTANTPKGLRFWALPGNV-ITASDGSTLQKIETDLGKAILA 298 (435) T ss_pred HHHhhcccHHHHHHHHHHHHHHHHHHHHH----HhhccCCCCCcccceeecccccce-eecccccchhhHHHHHHHHHHH Confidence 8654 446778777777776666654433 3443322 1 111111111 011111 11224466777766 Q ss_pred HHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEec Q lcl|NC_013692. 216 LDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVN 295 (399) Q Consensus 216 Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~ 295 (399) |+.+.... -.++| +|||.....|+.++|--+.|-|-+. + =|++-++.++++ T Consensus 299 ~~~~~~~~---------------~~~~~--vmn~~~~~~L~~lkd~~G~~l~~~~---~---------~~~l~G~pv~~~ 349 (435) T protein:vir:80 299 LENADANL---------------TQPGW--IMAPRTFRFLEGLRDGNGNKVYPEL---A---------NGMLKGYPVGKT 349 (435) T ss_pred hhcccccc---------------ccCEE--EEcHHHHHHHHhhhccCCceeccCC---C---------CCeEeeeeeEEe Confidence 76654321 12233 6899999999999887677666321 1 157889999998 Q ss_pred CcccccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCC-CcC-CCCCCccchhhH Q lcl|NC_013692. 296 PQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPG-EAT-ADRSDPYGEMGF 373 (399) Q Consensus 296 ~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG-~~t-ad~~DPlgQrg~ 373 (399) +.+-.-.+.+ +... .+++|.=++-.++.. +. +++-+..-+ ... ...--.+-|+.. T Consensus 350 ~~~p~~~~~~--------------~~~~----~i~~gd~s~~~i~~~---~~--~~i~~~~~~~~~~~~~~~~~~f~~n~ 406 (435) T protein:vir:80 350 TQVPINLGEA--------------GKES----EIYFTDFGDVFIGEE---ET--LEIDYSKEATYKDADGHMVSAFQRDQ 406 (435) T ss_pred ccccccccCC--------------CCcc----eEEEEEcccEEEEee---cc--eEEEEeccccccccccchhhhhhcCc Confidence 8753211111 1111 245676555444433 22 222222211 100 011123556667 Q ss_pred HHHH--HHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 374 MSIK--WYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 374 ~gwK--~~~~~~iLn~~~m~~iet~A~~ 399 (399) +.|+ +++.+.+.+++-++.|.-++== T Consensus 407 ~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 407 TLIRVIAKNDFGPRHVESIAVLSGVAWG 434 (435) T ss_pred ceeeeeeeeCcEeecccceEEEeccCCC Confidence 7777 5567777777766655422111 No 106 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.05 E-value=4.8e-06 Score=49.77 Aligned_cols=302 Identities=14% Similarity=0.109 Sum_probs=149.8 Q ss_pred CCC--ccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAG--PVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~--~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) .+. .........-+..+.+..+-+-|+ -+..+.+....+..++.++. .+.+|-..|+ +++-+...- T Consensus 118 ~~~~~~~~~~~~~~~~~~t~~~gg~~vP~----~~~~~ii~~l~~~~~i~~~~-~~~~~~~~~~-~~~p~~~~~------ 185 (435) T protein:vir:14 118 LAIERGFGEEVAMSLNTLSPGAGGVLVPE----NLSSEVIELLRPKSVVRKLG-ARTLPLSNGN-ITIPRLKGG------ 185 (435) T ss_pred HHHhhhhhhhhhhhcccCCcCCCccccch----hHHHHHHHHHhhhchhhhhc-ceeeecCCCc-eEEEEEeCC------ Confidence 000 000001111111221112224454 23456666666777777762 3455655553 444333211 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh-hhchhHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF-DSDPAMEG 157 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t-~~D~~L~~ 157 (399) +...+... |+....-..++..|+.+.++++.++.+|+++.+. ..|+.|.+ T Consensus 186 ------~~a~~v~E-----------------------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~ 236 (435) T protein:vir:14 186 ------AIVGYIGA-----------------------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQ 236 (435) T ss_pred ------cceeeecc-----------------------CccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHH Confidence 11112211 1111222346788999999999999999987664 45788888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce-e-----eeccccccccccCCc--ceecHHHHHHHHHHHHhccCccccceec Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT-V-----RYPGAATSDAEVDAT--TEVTYDSLMRLRLDLDNARAPTKIKMIT 229 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~-V-----~YAg~aTsra~v~~~--~~vt~~~lr~a~~~Lk~nrApk~T~ii~ 229 (399) .|..+|.+.-+...+.. +|++-++ . ......+.....+.. .....+++.+++..|+.+.+-. T Consensus 237 ~i~~~l~~ai~~~~d~a----~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------ 306 (435) T protein:vir:14 237 IVVGDLTAAIGAREDKA----FIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANL------ 306 (435) T ss_pred HHHHHHHHHHHHHHHHH----hhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccc------ Confidence 77777776666644433 3433221 1 101001111111111 1122456677776666654421 Q ss_pred cccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccC Q lcl|NC_013692. 230 GTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVD 309 (399) Q Consensus 230 ~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~ 309 (399) -. -+.+|||.....|+.++|--+.|-|.+. --|++-|+++++++.+-.-.+.++ T Consensus 307 ---------~~--~~~v~n~~~~~~L~~lkd~~G~~l~~~~------------~~g~l~G~Pv~~~~~~p~~~~~~~--- 360 (435) T protein:vir:14 307 ---------TQ--PGWIMAPRTFRFLEGLRDGNGNKVYPEL------------ANGMLKGYPVGKTTQVPINLGETG--- 360 (435) T ss_pred ---------cC--CEEEEcHHHHHHHHHhhccCCceeccCC------------CCCeeecceeEeeccccccccCCC--- Confidence 01 2347899999999999886666666321 125788999999886421111111 Q ss_pred CcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCc--CCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 310 PNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEA--TADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 310 ~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~--tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ... .+++|.=+.-.++..+ . +++.+.+=+.. ....--.|-|++.+.++ +++.+.+. T Consensus 361 -----------~~~----~i~~gd~s~~~i~~~~---~--~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~ 420 (435) T protein:vir:14 361 -----------KES----EIYFTDFGDVFIGEEE---T--LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPR 420 (435) T ss_pred -----------ccc----eEEEeecccEEEEEec---c--cEEEEeccccccccccchhhhhhcChhheeeeeeeCceee Confidence 111 2556765444444331 1 22222221100 00111245677778877 66788888 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) +++-++.|.-++== T Consensus 421 ~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 421 HVESIAVLAGVAWG 434 (435) T ss_pred cccceEEEecCCCC Confidence 88877666532211 No 107 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.04 E-value=1.3e-06 Score=52.90 Aligned_cols=282 Identities=12% Similarity=0.084 Sum_probs=150.3 Q ss_pred CCC-ccccccccccCCC----CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCC Q lcl|NC_013692. 1 MAG-PVDNIKPMKYNDP----ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDD 75 (399) Q Consensus 1 ~~~-~~~~~~~~~~n~~----~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~ 75 (399) ..+ -............ ..+..+.+.|. -+ ....+....+...+.+++.+.+++.+ .+++.+..... T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~---~~-~~~ii~~~~~~~~l~~~~~~~~~~~~---~~~~~~~~~~~-- 166 (390) T protein:vir:10 96 NDRSARATMNIKAALNTASTDAAGSAGALTTP---NR-LPGFITQPDARLTVRDLIGSGRTDSA---LIEYVQETGFV-- 166 (390) T ss_pred hhhhhhhhhHHHHHHHhhhcccccccccccch---hH-HHHHHHHHHhhchhhhhcceeeccCC---ceEEEEEecCC-- Confidence 000 0000000000000 01111222332 12 35566666777778888998888754 34443332211 Q ss_pred CccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhH Q lcl|NC_013692. 76 RNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAM 155 (399) Q Consensus 76 ~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L 155 (399) +...+... |+...--.+++..|+.++++++.++.+|+++.+ +++ .| T Consensus 167 ---------~~a~~v~E-----------------------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~-~l 212 (390) T protein:vir:10 167 ---------NNAAIVAE-----------------------GALKPESSLKFAKKTDTTHVIAHTMKATRQILS-DAP-QL 212 (390) T ss_pred ---------cceeeecC-----------------------CccccccccceeEEEEeeEEEEEeehhhHHHHH-hHH-HH Confidence 11112111 222222345788899999999999999999876 444 57 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc-Cceeeecc----ccccccccCCcceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 156 EGHVTTEMVKGANEITEDLLQIDLLNS-AGTVRYPG----AATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 156 ~~~i~~el~~~~~~~t~d~l~~~~l~a-gt~V~YAg----~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) ...|..+|.+..+...+ ..+|++ |++....| ..............+++++..+...|+....+. T Consensus 213 ~~~i~~~l~~~~~~~~~----~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~------- 281 (390) T protein:vir:10 213 ASYMNNRLIRGLKVKED----AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPA------- 281 (390) T ss_pred HHHHHHHHHHHHHHHHH----HHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCC------- Confidence 77777777766555333 344433 22221111 011111111223356778888877776655431 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCC Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDP 310 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~ 310 (399) -+.+|||.....|+.|+|--+.|=|.+. ..+.-+++-++++++++.|- +| T Consensus 282 ------------~~~v~n~~~~~~L~~lkd~~g~~l~~~~---------~~~~~~~l~G~pv~~~~~~p----~~----- 331 (390) T protein:vir:10 282 ------------SGIVINPIDWAAIELAKDANNQYLIGNA---------RGTLTPTLWGLPVVATQAMA----PG----- 331 (390) T ss_pred ------------CEEEEcHHHHHHHHHhhcCCCceeecCC---------cCcCCceecceeeEEcCCCC----CC----- Confidence 2357999999999999875555555332 23345678899999988642 11 Q ss_pred cccccccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccc Q lcl|NC_013692. 311 NDQVPMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRP 387 (399) Q Consensus 311 ~~~~~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~ 387 (399) .+++|.-+.+ .+... +++.++ +.. .+.+-+++.+.++ .++.+.++++ T Consensus 332 -----------------~~~~gdf~~~~~~~~~---~~~~i~--~~~--------~~~~~~~~~~~~r~~~r~d~~v~~~ 381 (390) T protein:vir:10 332 -----------------EFLVGAFDLAAQIFDQ---WDARVE--IGY--------VNDDFQRNMVTVLAEERLALVVYRP 381 (390) T ss_pred -----------------cEEEEeccceEEEEEe---cceEEE--Eee--------cccccccCcEEEEEEEeeccEEecc Confidence 1356654432 22221 122221 111 1123456666766 5889999999 Q ss_pred cceEEEEEe Q lcl|NC_013692. 388 EWIALLKTV 396 (399) Q Consensus 388 ~~m~~iet~ 396 (399) +-++.+..| T Consensus 382 ~a~~~~~~a 390 (390) T protein:vir:10 382 EALISGSFA 390 (390) T ss_pred ccEEEEEeC Confidence 999999999 No 108 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.00 E-value=1.2e-06 Score=53.12 Aligned_cols=289 Identities=15% Similarity=0.051 Sum_probs=152.5 Q ss_pred CCCc-----cccccc-------cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEE Q lcl|NC_013692. 1 MAGP-----VDNIKP-------MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLH 68 (399) Q Consensus 1 ~~~~-----~~~~~~-------~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrr 68 (399) -.+. ++.+.- ...+..+....+-+-|+ .+..+.+....+..++.+++.+.+||.+.|+-... + T Consensus 67 ~~~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~----~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~-~ 141 (371) T protein:vir:81 67 PTVQVKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQ----DIQTRINELRESKDALQNLITVEPVTTLSGSRVFK-K 141 (371) T ss_pred cchhhHHHHHHHHHHHHHHHHHHhhccCCCccCceeecH----hHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEE-e Confidence 0000 000000 00111111111222332 23566777777888999999999999777653222 2 Q ss_pred ccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 69 YIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 69 y~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) ...-+ ...+.+.|. ..|.++ .+++..|+.+.++++.++.+|+++.+ T Consensus 142 ~~~~~------------~a~~v~Eg~-------------~~~~~~---------~~~f~~i~~~~~k~~~~~~iS~ell~ 187 (371) T protein:vir:81 142 RSQQT------------GFVEVAEGA-------------AIGEKA---------TPQFTLLQYQVKKYAGFFRVTNELLN 187 (371) T ss_pred ecCCc------------ceeeecccc-------------cccccc---------ccceeeEEeeeeEEEEeehhhHHHHh Confidence 11111 111222110 011111 24678899999999999999999876 Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccce Q lcl|NC_013692. 149 FDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKM 227 (399) Q Consensus 149 t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~i 227 (399) ++++.|...|..+|.+.-+. .+ ...++++-++ ++ .....++++|..+.. .|+..-.. T Consensus 188 -ds~~~l~~~i~~~l~~a~~~-~~---~~~i~~g~g~----~~--------~~~~~~~~~i~~~~~~~l~~~~~~----- 245 (371) T protein:vir:81 188 -DSTEAIVNTLVRWIGDESRV-TR---NGLIINVLNT----KA--------KTAIADLDGLKQIINVQLDPVFRS----- 245 (371) T ss_pred -hhhHHHHHHHHHHHHHHHHH-HH---HHHHHhhccc----cc--------ccccccHHHHHHHHHhhcchhhhc----- Confidence 45556777776666655544 22 2334432221 11 123467777776653 34332211 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) .+ +.+|||.....|+.++|--+.|-|.|- +-.|-.|++-+..+++++.| ++...+.. T Consensus 246 -------------~a-~~vmn~~~~~~L~~lkd~~g~~l~~~~--------~~~~~~~~l~G~pV~~~~~~-~~~~~~~~ 302 (371) T protein:vir:81 246 -------------TS-SVIVNQDAFNWLDTLKDQNGQYLLQPS--------ISSPTGRQLLGLPVVIVSNK-VLANRVDG 302 (371) T ss_pred -------------CC-EEEEcHHHHHHHHHhhccCCCeeeecc--------cCCCCCceecceeEEEeccc-ccCccccc Confidence 11 357999999999999886666666542 23455678889999998874 43322211 Q ss_pred cCCcccccccccCccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhh Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMV 384 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~i 384 (399) ..+ . .-..+++|.=+-+ .+..+ .+ +.+.+..- .+.+-+++.+.|+ .++.+.+ T Consensus 303 ~~~------------~-~~~~i~~Gd~~~~~~~~~~-~~----~~i~~~~~-------~~~~f~~~~v~~~~~~r~d~~~ 357 (371) T protein:vir:81 303 GTG------------A-QFAPIIVGDLKEAVVMFDR-QR----TEIMSSNV-------AMDAFETDATLWRAIERMDVKM 357 (371) T ss_pred ccc------------C-CcceEEEEehhceEEEEee-cc----eEEEEecc-------ccchhhcCceEEEEEEeeccEE Confidence 110 1 1223667742211 12111 12 22222111 1223456667777 4568899 Q ss_pred ccccceEEEEEecC Q lcl|NC_013692. 385 FRPEWIALLKTVAR 398 (399) Q Consensus 385 Ln~~~m~~iet~A~ 398 (399) .++.-++.++..+- T Consensus 358 ~~~~a~~~~~~~~A 371 (371) T protein:vir:81 358 RDDEAFVFGEVQLA 371 (371) T ss_pred ecccceEEEEEecC Confidence 99999999987777 No 109 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.99 E-value=2.6e-06 Score=51.24 Aligned_cols=297 Identities=17% Similarity=0.139 Sum_probs=156.1 Q ss_pred CCCccccccccccCCC--CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDP--ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNV 78 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~--~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~ 78 (399) +.+...........+. ..+++++-+.-+- -.+..+.+....+...+.+++.+.+|+.+. +++.+..+.....+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp-~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~~~- 175 (413) T protein:vir:81 101 SVGEYVAPRVKAASDPASTATLTDEFQGGYG-TTWNRNIIYRRREKLVVADLMDNLTMTNTT---IKYLMEKANRVVEG- 175 (413) T ss_pred hhhhhhhhHHHhhhhhhhhcccccccccccc-hhhHHHHHHHHhhhhhHHhhcceeeccCCc---eeEEEecccccccc- Confidence 1111110000001111 1111122222222 234677888888888899999999887543 44444333221110 Q ss_pred ccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccc-eeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 79 NDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGF-KRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 79 lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~-t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ...++.. |+.....+. ++..|+.+.++++.++.+|+++.+ +++ .|.. T Consensus 176 -------~a~~v~E-----------------------g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~-~l~~ 223 (413) T protein:vir:81 176 -------GFKTVAE-----------------------GGKKPYMRFADFDIVTESLSKIAGLTKITDEMIE-DYD-FLVS 223 (413) T ss_pred -------ccceecC-----------------------cccccccCcccceeeEeeeeeEEEeehhhHHHHH-HHH-HHHH Confidence 0111111 111111222 467899999999999999999876 554 4777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC-ce------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceecc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSA-GT------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITG 230 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~ag-t~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~ 230 (399) .|..+|.+.-+.-.+ ..+|++- +. +-.++..+. .+. ...-.++++..+...+..+... T Consensus 224 ~i~~~la~~~~~~~d----~~~l~G~G~~~~~~Gi~~~~~~~~~--~~~-~~~~~~~~i~~~~~~~~~~~~~-------- 288 (413) T protein:vir:81 224 YINARLLEELAIEEE----RQLLLGDGTGNNLTGLLKRDGIQTL--AVS-NKDELADSIYKAMTNISLATPF-------- 288 (413) T ss_pred HHHHHHHHHHHHHHH----HHHhccCCCCCcccccccccccccc--ccc-ccchhHHHHHHHHHHhhhhccC-------- Confidence 777777666555333 2345431 11 111111111 111 1122356666666444433321 Q ss_pred ccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhh--cCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 231 TRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEK--YAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 231 s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~k--Yg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) .++ ..+|||.....|+.|+|--+.|=|.+... +++. ...-.+.+-+.+++.++.+. +| T Consensus 289 ---------~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~---~~~~~~~l~G~pv~~s~~~~----~~--- 348 (413) T protein:vir:81 289 ---------QAD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSG---GIMLDPAPWGLRTVQSQVVP----VG--- 348 (413) T ss_pred ---------CCc-EEEEcHHHHHHHHHhhccCCceecccccccccccc---ccccCceecceeeEEcCCCC----cc--- Confidence 112 25789999999999988666666655432 2322 12233567789999887542 10 Q ss_pred CCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhcc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFR 386 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn 386 (399) -++||.-+.+-..+...+ +.+-+- +..+++-+++.++|+ +++.+.+.+ T Consensus 349 -------------------~~~~gd~~~~~~~~~~~~----~~v~~~-------~~~~~~~~~~~~~~r~~~r~d~~~~~ 398 (413) T protein:vir:81 349 -------------------KPVVGAFRSAASVLRKGG----VRIDST-------NTNVDDFENNLITVRAEERVGLMVTF 398 (413) T ss_pred -------------------cEEEEecccEEEEEEecc----eEEEEe-------ccccchhhcCcEEEEEEEeeccEEec Confidence 145676543322222122 222111 113356677778887 578999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) ++-+++++.++++ T Consensus 399 ~~a~~~l~~~~~~ 411 (413) T protein:vir:81 399 PEAIVQLDVAEVV 411 (413) T ss_pred ccceEEEEecCCC Confidence 9999999998888 No 110 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.96 E-value=5.7e-06 Score=49.35 Aligned_cols=287 Identities=13% Similarity=0.086 Sum_probs=149.7 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) -.+.++.......+.......+-+-|+ .+....+....+...+.+++...+|+.+.|+-...+. .. T Consensus 104 ~~~~~~~~~~~a~~~~~~~~gg~~vP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~--------- 169 (408) T protein:vir:74 104 PMAFLNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKW-TD--------- 169 (408) T ss_pred chhhhhhhhhhhhcccccCCCceeech----hHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEee-cC--------- Confidence 011111221111222221111222343 4456666667788889999999999988776443322 11 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ..+.+.+..+| +..|.++ ..++..|+.++++++.++.+|+++.+ +++..|..+|. T Consensus 170 --~~~~~~~v~E~-------------~~~~~~~---------~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 224 (408) T protein:vir:74 170 --VTPLKAMDEED-------------GKIPDLD---------NPRLTIIKYLIKRYAGIITATNTLLK-DTAENILAWLS 224 (408) T ss_pred --Ccccccccccc-------------ccccccc---------ccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHHHHH Confidence 11122222111 0111111 13678899999999999999999886 45556777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+...+ ..+|++-++ .+ ..+..+++++|..+. ..|+.+-.. T Consensus 225 ~~l~~~~~~~~d----~~il~G~G~------~~-----~~~~~~~~~~i~~~~~~~l~~~~~~----------------- 272 (408) T protein:vir:74 225 SWIAKKVVVTRN----QAIIAAMGT------VP-----KKPTIANFDDVITMINTSVDPAIIA----------------- 272 (408) T ss_pred HHHHHHHHHHHH----HHHhhcccc------cc-----cccccccHHHHHHHHHHhhhhhhcC----------------- Confidence 776665554322 344533221 11 123556788887654 345433221 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .+ +.+|||.+..-|+.|+|--+.|-|.+- +..|--+++-+..++.++. .+.... + T Consensus 273 -~a-~~v~n~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~--~~~~~~-------------~ 327 (408) T protein:vir:74 273 -TS-SLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKQVIVVAD--RWLPNS-------------G 327 (408) T ss_pred -CC-EEEEcHHHHHHHHHhhcCCCceEeccC--------cCCCCCceecceeeEEecC--cccccc-------------c Confidence 11 356899999999999875555554321 2233345778888776652 111110 1 Q ss_pred CccceeEEEEEEccccce-ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEe Q lcl|NC_013692. 320 GGKYSVFPMLCVASEAFT-TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTV 396 (399) Q Consensus 320 ~~~~DVYp~lV~G~~Afg-~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~ 396 (399) +++. .+++|.-+-+ .+..+ +.+.++ +..- .+..-+++...|+ +.+.+.+++++-++.++.. T Consensus 328 ~~~~----~i~~gd~~~~~~~~~~---~~~~i~--~~~~-------~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 328 STVY----PLYYGDMSQAITLFDR---ENMSLL--PTNI-------GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFT 391 (408) T ss_pred CCcc----eEEEEehhccEEEEEe---cceEEE--Eecc-------ccchhhcceeeEEEEEeeCcEEecccceEEEEee Confidence 1222 2467754321 22222 122221 1110 1122345555555 5678899999999888864 Q ss_pred cCC Q lcl|NC_013692. 397 ARL 399 (399) Q Consensus 397 A~~ 399 (399) +-- T Consensus 392 ~~~ 394 (408) T protein:vir:74 392 AIA 394 (408) T ss_pred ccc Confidence 433 No 111 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.95 E-value=1.3e-06 Score=52.82 Aligned_cols=314 Identities=12% Similarity=0.115 Sum_probs=142.4 Q ss_pred CCCcccccccc-ccC--CCCCCccccc-ccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNIKPM-KYN--DPANGVESSI-GPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~~~~-~~n--~~~~~~~~~i-~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~ 76 (399) ......-.+.. ... ..+.+.-+.+ .|+ +.....++..++..++.+++...+||...|. +++-+...-+..- T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~----~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~-~~ip~~~~~~~~a 215 (477) T protein:vir:84 141 DKEIRKIAKVGEEYRDLDRNGGTGGYAVPPL----WMMNRFIELARAGRTYANLCPTEPLPGGTSS-INIPKILTGTSTA 215 (477) T ss_pred hhhHHHHHHhhhhhccccccCCCcceeeccc----hhHHHHHHHhhhcchHHHhhceeeecCCcce-eEEEEEecCccee Confidence 00000000000 000 0111111122 233 2234455556677788888999999987765 3332221111100 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) ....||- .+ +. +...--.+++..|+.+.++++.++.+|+++.+. +.+.|. T Consensus 216 ~~~~Eg~-----~~---------------~~---------~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~d-s~~~l~ 265 (477) T protein:vir:84 216 IQAADNA-----AL---------------TA---------PSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQ-AAVSVD 265 (477) T ss_pred eeeccCc-----cc---------------cc---------ccccccccceeeEEEeeeeEEeeeHHHHHHHhc-cchhHH Confidence 1111211 11 00 001112346788999999999999999988764 445576 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCc-e------eeeccccccccccCCcceecHHHHHHHHHHH-HhccCcccccee Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAG-T------VRYPGAATSDAEVDATTEVTYDSLMRLRLDL-DNARAPTKIKMI 228 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt-~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~L-k~nrApk~T~ii 228 (399) ..|..+|...-+.-.+. .+|++-+ . .-+++... .+.+ ....+...+......| +.+.. T Consensus 266 ~~i~~~l~~~~~~~~d~----~~l~G~Gt~~~p~Gi~~~~~~~~--~~~~-~~~~t~~~~~~~~~~i~~~~~~------- 331 (477) T protein:vir:84 266 EFVFRDLAADYANKLNV----QVISGTGSNNQVVGVRATAGITQ--VTAT-SAGSALEKHQIIYQKIADAIQR------- 331 (477) T ss_pred HHHHHHHHHHHHHHHHH----HHhccCCCCCccceeeecccccc--cccc-ccccchhhHHHHHHHHHHHHhh------- Confidence 77777776666553333 3443321 1 11221100 0110 0112222222222111 11111 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccc-----cccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGA-----TMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~-----i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) +.+-..-...+.++||.....|+.|+|--+.|=|.|-....+... +.++=.|.+-++++|+++.|-. + T Consensus 332 -----~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~--~ 404 (477) T protein:vir:84 332 -----VHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPT--T 404 (477) T ss_pred -----ccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccc--c Confidence 000011112356889999999999999888887876644333333 4445557899999999986521 2 Q ss_pred CCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHh Q lcl|NC_013692. 304 VGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFM 383 (399) Q Consensus 304 aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~ 383 (399) .|+.. | ...++||.-+.-.+.- ++ +.+. ..+- ...+ .++..|.-+ .++.+. T Consensus 405 ~~~~~---------------d-~~~i~~gd~~~~~i~~--~~----~~~~-~~~~----~~~~-~~~~~~~v~-~~~~~~ 455 (477) T protein:vir:84 405 LGTGT---------------D-QDVIHVLRASDLALFE--SS----VRMR-ALQE----TRAE-NLSVLLQVY-GYLAFT 455 (477) T ss_pred ccccC---------------C-cceEEEEEeceEEEEe--ec----eeEE-eccc----cccc-cceeeeeeh-hhhhhh Confidence 22211 1 1135566655544432 11 1111 1110 0000 223222111 245555 Q ss_pred hcc-ccceEEEEEecCC Q lcl|NC_013692. 384 VFR-PEWIALLKTVARL 399 (399) Q Consensus 384 iLn-~~~m~~iet~A~~ 399 (399) ..+ ++-.++|=-.|.- T Consensus 456 ~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 456 AARFPQSVVEIGGTALT 472 (477) T ss_pred hhccccceEEeeccccc Confidence 554 6666665443333 No 112 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=97.91 E-value=3.1e-06 Score=50.75 Aligned_cols=285 Identities=13% Similarity=0.091 Sum_probs=149.7 Q ss_pred CCCccccc--cccccCCCCCCcccccccceehhhhhHHHHHHhhhH-HhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNI--KPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKE-AYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~--~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~-lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.+..... ....-+.++.+..+-+.|+ .+ .+.+....+. .++.+++.+.++ +.+..+.+.+...-+ T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~----~~-~~~i~~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~---- 165 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRT----LY-GQLIAQAVERSAIMRGGASTFTT--SDANPMDFTVITGRA---- 165 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCcccccc----ch-HHHHHHHHhhhhhhhhcceeeec--CCCceeEEEEEcCCc---- Confidence 11111111 0111111122111122222 22 4455555543 466777766543 345556654443211 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ...++. +|+.+..-..++..|+.+.++++.++.+|+++.+ +++..|.. T Consensus 166 --------~a~~v~-----------------------E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~ 213 (392) T protein:vir:13 166 --------TAGIVG-----------------------ETAEIPESYPATTQRSMGGFKYGFASVVSYEFAT-DQVLDLVG 213 (392) T ss_pred --------ceeeec-----------------------ccccccccccceeeEEeeeeeEEeeehhHHHHHh-cchHHHHH Confidence 111211 1222233345778899999999999999999887 45656766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGT 231 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s 231 (399) .|..+|...-+...+. .+|++-++ +.+++.++...+-.....+++++|.++...|+..-.. T Consensus 214 ~i~~~l~~~i~~~~d~----~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~--------- 280 (392) T protein:vir:13 214 FLVSDAGPAIGDAMGR----HFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRK--------- 280 (392) T ss_pred HHHHHHHHHHHHHHHH----HHhcccCCccccccccccccccccccccccccccHHHHHHHHHhhhhhhhc--------- Confidence 6766666655543333 34443221 1122111111111223568899888887777543221 Q ss_pred cccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCc Q lcl|NC_013692. 232 RMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN 311 (399) Q Consensus 232 ~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~ 311 (399) +++ .+||+.....|+.|+|--+.|=|.|-- -.|.-+.+-++.+++++.+- + T Consensus 281 --------~a~--~v~n~~~~~~l~~lkd~~G~~l~~~~~--------~~g~~~~l~G~Pv~~~~~~~----~------- 331 (392) T protein:vir:13 281 --------NAK--FVVNDLRAAQMRKLKDANGQYLWQSAL--------TVGAPDTFNGKVVETDDGMP----A------- 331 (392) T ss_pred --------CCE--EEEcHHHHHHHHHhhccCCceeecCCc--------CCCCCceecceeeEEcCCCC----C------- Confidence 122 377999999999998866666665432 23444567788888877531 0 Q ss_pred ccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccc Q lcl|NC_013692. 312 DQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEW 389 (399) Q Consensus 312 ~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~ 389 (399) ..++||.=+.-.++..+ .+. ++ ...|++-+++...++ .++.+.+.+++- T Consensus 332 ---------------~~i~~Gdf~~~~i~~~~---~~~----i~-------~~~~~~~~~~~~~~r~~~r~d~~~~~~~A 382 (392) T protein:vir:13 332 ---------------DKVLFADLSKYRVRFAG---SLR----VD-------RSVDAKFSTDQIVYRFLQRADGLLVDARG 382 (392) T ss_pred ---------------CcEEEeeccceeEEeec---ceE----EE-------eeccccccCCcEEEEEEEEeccEEecccc Confidence 01456764333344331 111 11 124567666666655 567899999998 Q ss_pred eEEEEEecCC Q lcl|NC_013692. 390 IALLKTVARL 399 (399) Q Consensus 390 m~~iet~A~~ 399 (399) ++++++.+-- T Consensus 383 ~~~~~~~~aa 392 (392) T protein:vir:13 383 AKVLTVTPAA 392 (392) T ss_pred eEEEEeeccC Confidence 8865543333 No 113 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.90 E-value=5.7e-06 Score=49.33 Aligned_cols=286 Identities=13% Similarity=0.068 Sum_probs=152.9 Q ss_pred CCC---ccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAG---PVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~---~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.+ .++.......+..+.+.-+-+-|+ -+....+..+.+...+.+++...+|+.+.|+....++- +.+ T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-----~~~ 171 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT-----DVT 171 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCceeccH----hHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc-----ccc Confidence 111 122222222222222111222343 23466677777888999999999999999875433211 111 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceee-ccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNR-VGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~-~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) + .+.+...| +.... -..++..|+.+.++++.+..+|+++.. +++..|. T Consensus 172 ~-------~a~~v~E~-----------------------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~ 220 (408) T protein:vir:10 172 P-------LTVMDAED-----------------------GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK-DTAENIL 220 (408) T ss_pred c-------ceeeecCc-----------------------cccccccCcceeeEEeeeeeEEeeehhHHHHHh-hchHHHH Confidence 1 11121111 00000 113677899999999999999999876 3455566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCccccceeccccccC Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMITGTRMID 235 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii~~s~~~g 235 (399) ..|..+|...-+. .+ -..+|++-+. ++. .....++++|..+. ..|+..-.+ T Consensus 221 ~~i~~~l~~~~~~-~~---~~~il~g~g~------~~~-----~~~~~~~~~l~~~~~~~~~~~~~~------------- 272 (408) T protein:vir:10 221 AWLSSWIAKKVVV-TR---NQAIIEVMKA------APK-----KPTIAKFDDVITMINTAVDPAIIA------------- 272 (408) T ss_pred HHHHHHHHHHHHH-HH---HHHHhhcccc------ccc-----ccccccHHHHHHHHHHhhhhhhcc------------- Confidence 6665555444443 33 3345533221 111 22446788887764 334332221 Q ss_pred cccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccc Q lcl|NC_013692. 236 TRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVP 315 (399) Q Consensus 236 T~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~ 315 (399) .+ +-+||+.+..-|+.++|--+.|-|.+- +-.|..+++-|+.++.++.. +..+.+ T Consensus 273 -----~a-~~v~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~PV~~~~~~--~~~~~~--------- 327 (408) T protein:vir:10 273 -----TS-SLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKQVIVVADR--WLPNTG--------- 327 (408) T ss_pred -----CC-EEEEcHHHHHHHHHhhccCCceEeccC--------cCCCCCceecceeeEEeccc--ccCccC--------- Confidence 11 457999999999999887777766541 22355668888888776631 211111 Q ss_pred ccccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_013692. 316 MHESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIAL 392 (399) Q Consensus 316 ~~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) .+ .+ .+++|.=+ |-.+..+ .+ +.+ -+.. . ....-+++...++ +++.+.+++++-++. T Consensus 328 ----~~---~~-~i~~gd~~~~~~~~~~-~~--~~v--~~~~------~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~ 387 (408) T protein:vir:10 328 ----ST---VY-PLYYGDMSQAITLFDR-EN--MSL--LPTN------I-GAGAFETDTTKIRVIDRFDVKATDSEALVA 387 (408) T ss_pred ----CC---ce-EEEEEehhccEEEEEe-cc--eEE--EEcc------c-ccchhhcCceEEEEEEeeccEEeccccEEE Confidence 11 12 25677533 2222222 11 111 1111 0 0112356777777 458999999999999 Q ss_pred EEEecCC Q lcl|NC_013692. 393 LKTVARL 399 (399) Q Consensus 393 iet~A~~ 399 (399) ++..+.- T Consensus 388 ~~~~~~~ 394 (408) T protein:vir:10 388 GSFSAIA 394 (408) T ss_pred EEeeccc Confidence 8866643 No 114 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.86 E-value=1.4e-05 Score=47.24 Aligned_cols=287 Identities=13% Similarity=0.078 Sum_probs=151.9 Q ss_pred CCC---ccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAG---PVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~---~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.+ -++.......+..+.+..+-+-|+ .+..+.+....+...+.+++.+.+|+...|+...++... . + T Consensus 101 ~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~----~ 171 (404) T protein:vir:39 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTD-V----T 171 (404) T ss_pred HhcchhhhhhhhhhhhhcccccCCceeccH----HHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecC-C----c Confidence 111 111111111111111111112233 335666666778889999999999999988776554321 1 1 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) +...+...| ...|.+ -..++..|+.++++++.++.+|+++.+ +++..|.. T Consensus 172 -------~~a~~v~Eg-------------~~~~~~---------~~~~f~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~ 221 (404) T protein:vir:39 172 -------PLTVMDAED-------------GKIPDL---------DNPRLTIIKYLIKRYAGIITATNTLLK-DTAENILA 221 (404) T ss_pred -------cceeeecCc-------------cccccc---------cccceeeEEeeeeeEEeeehhHHHHHh-hchHHHHH Confidence 111111111 000100 113678899999999999999999886 45566777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDT 236 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT 236 (399) .|..+|.+.-+.-.+. .+|++-++ ++ ..+...++++|..+.. .++..-.+ T Consensus 222 ~i~~~l~~~~~~~~d~----~il~g~g~------~~-----~~~~~~~~~~i~~~~~~~~~~~~~~-------------- 272 (404) T protein:vir:39 222 WLSSWIAKKVVVTRNQ----AIIAAMGT------VP-----KKPTIAKFDDVITMINTSVDPAIIA-------------- 272 (404) T ss_pred HHHHHHHHHHHHHHHH----HHHhcccc------cc-----cccccccHHHHHHHHHHhhhhhhcc-------------- Confidence 7777776666653333 34432111 11 1234567777766653 23322111 Q ss_pred ccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccc Q lcl|NC_013692. 237 RTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPM 316 (399) Q Consensus 237 ~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~ 316 (399) . =+.+|||.....|+.|+|-.+.|-|.+- +..+..+++-|+.++.+..+ +....+ T Consensus 273 ----~-a~~v~n~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~--~~~~~~---------- 327 (404) T protein:vir:39 273 ----T-SSLLTNQSGLNKLALVKTAEGKYLLEPD--------PTKPNSYLIKGKKVIVVADR--WLPNSG---------- 327 (404) T ss_pred ----C-CEEEEcHHHHHHHHHhhccCCceeeccC--------cCCCCcceecceeEEEeccc--ccCccC---------- Confidence 1 1468999999999999876666655432 23445567888777766531 111111 Q ss_pred cccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEE Q lcl|NC_013692. 317 HESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALL 393 (399) Q Consensus 317 ~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~i 393 (399) ... + .+++|.-+ +-.+..+ .+ +.+ -+.. -.+.+-+.+...++ +.+++.+++++-++.+ T Consensus 328 ---~~~---~-~~~~gd~~~~~~~~~~-~~--~~i--~~~~-------~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~ 388 (404) T protein:vir:39 328 ---STV---Y-PLYYGDMSQAITLFDR-EN--MSL--LPTN-------IGAGAFETDTTKIRVIDRFDVKTTDSEALVAG 388 (404) T ss_pred ---CCc---c-EEEEEeccccEEEEee-cc--eEE--EEec-------cchhhhhhceeeEEEEeeeccEEecccceEEE Confidence 111 1 24566433 2222222 11 111 1111 12235566666666 6789999999998888 Q ss_pred EEecCC Q lcl|NC_013692. 394 KTVARL 399 (399) Q Consensus 394 et~A~~ 399 (399) +..+.= T Consensus 389 ~~~~~a 394 (404) T protein:vir:39 389 SFTAIA 394 (404) T ss_pred Eeeccc Confidence 854443 No 115 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.85 E-value=9e-06 Score=48.26 Aligned_cols=284 Identities=13% Similarity=0.061 Sum_probs=146.5 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.+...... ..-+..+.+..+-+-|+ .|....+....+...+.+++.+.+||.+.++-..... .. T Consensus 100 l~~~~~~~~-~~~~~~t~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-------~~--- 164 (394) T protein:vir:10 100 IHSHGKVID-NAAGHVTSTEAGVLIPE----EIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR-------AT--- 164 (394) T ss_pred Hhccchhhh-hhhcccccccCceeccH----HHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec-------CC--- Confidence 111111100 01111121111222343 3456666667778889999999999887654332211 00 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) +...++ .|.|.....-.+++..|+.++++++.+..+|+++.+. +++.|...|. T Consensus 165 ----~~~~~~----------------------~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~d-s~~~l~~~i~ 217 (394) T protein:vir:10 165 ----DRFSSV----------------------AELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIAD-SAVDLTSLVG 217 (394) T ss_pred ----Cccccc----------------------cccccccccccccceeEEeeeeeeEeeehhHHHHHhh-hhHHHHHHHH Confidence 111122 2222222222357788999999999999999998774 5556766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+. .+| ..+|++-++ ++... .....++++|..+.. .|+... T Consensus 218 ~~la~~~~~-~~~---~~il~g~g~------~~~~~---~~~~~~~d~l~~~~~~~~~~~~------------------- 265 (394) T protein:vir:10 218 QSINEKSVN-TYN---AMIAPVLQS------FTAKA---TTTDTLVDSLKHILNVDLDPAY------------------- 265 (394) T ss_pred HHHHHHHHH-HHH---HHHhhcccc------ccccc---ccccccHHHHHHHHHhhhhhhc------------------- Confidence 666544443 333 234432221 12111 123466777766543 222211 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .+ +.+|||.+..-|+.|+|--+.|-|.|--.- ..-.+.-+++-+++++.++.... +.+ T Consensus 266 ~a--~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~----~~~~~~~~~L~G~PV~~~~~~~~--~~~-------------- 323 (394) T protein:vir:10 266 SR--ALVVTQSLFNTLDTLKDKNGRYLLHDASDS----ITDGTAKGTVLGVPVYVVGDALL--GSA-------------- 323 (394) T ss_pred cC--EEEecHHHHHHHHHhhccCCCeeeeccccc----cccCCcccccccceeEEeccccc--CCC-------------- Confidence 11 357999999999999887677767653221 11123446788999888764211 110 Q ss_pred CccceeEEEEEEccccc-eecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 320 GGKYSVFPMLCVASEAF-TTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 320 ~~~~DVYp~lV~G~~Af-g~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) .++ ..+++|.-+- -.+..+ .+ +.+ -+.+ +....+++..+ +.+.+.+.++.=++.|+..+. T Consensus 324 ~~~----~~i~~gd~s~~~~~~~~-~~--~~v--~~~~---------~~~~~~~~~~~-~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 324 AGD----QKAFVGDLKRGVLFADR-QQ--VTL--AWED---------SKIYGRYLGAA-FRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred CCc----eEEEEeeccccEEEEee-cc--eEE--EEec---------ccccceeEEEE-EEeccEEeccccEEEEEeecc Confidence 011 2456775332 222221 11 122 1111 11122333322 467888888998888887777 Q ss_pred C Q lcl|NC_013692. 399 L 399 (399) Q Consensus 399 ~ 399 (399) . T Consensus 385 ~ 385 (394) T protein:vir:10 385 A 385 (394) T ss_pred c Confidence 7 No 116 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.84 E-value=3.7e-06 Score=50.35 Aligned_cols=284 Identities=8% Similarity=0.017 Sum_probs=152.7 Q ss_pred CCCcccccccc----ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNIKPM----KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~~~~----~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~ 76 (399) ...-...+... .-.....+++++.+.-+-+. |....+........+.+++.+.++.-+ ++++-+..-... T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~-~~~~ii~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~-- 161 (379) T protein:vir:10 88 NFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKD-YNFDVVLNPSQMLNVSDIVGAVSISGG---TYTFVRENGAGE-- 161 (379) T ss_pred HHHhHHHHHhhhhhhhhhhcccccCCCCccccchh-hhhHHHHhHHhhhhHHhhceeeeccCC---ceEEEEeecCCC-- Confidence 00000000000 00011112233333222333 355566666667788888887777533 344422111000 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) ++..+. .+|+......+++..|+.++++|+.+..+|+++.+ +++ .|. T Consensus 162 --------~~~~~v-----------------------~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~-D~~-~l~ 208 (379) T protein:vir:10 162 --------GAIGAQ-----------------------VEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMAN-NLP-FLT 208 (379) T ss_pred --------cccccc-----------------------cCCccccccccceeeeEeeeeeEEeeehhhHHHHh-hHH-HHH Confidence 111111 22333334456788999999999999999999876 554 577 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCc Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDT 236 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT 236 (399) ..|..+|...-+. .+|.-..+.+.+.+. .....+ ....+.++|.++.-.|..+..+ T Consensus 209 ~~i~~~la~~~~~-~~~~~~~~g~~~~~~-------~~~~~~--~~~~~~d~i~~~~~~~~~~~~~-------------- 264 (379) T protein:vir:10 209 SFIPNALRRDYAK-AENAAFNAVLAANAT-------ASTEII--TNKNKVEMLINEIAKQENLDFP-------------- 264 (379) T ss_pred HHHHHHHHHHHHH-HHHHHHhcccccccc-------cccccc--cCcccHHHHHHHHHhhhhccCC-------------- Confidence 7777776655443 444433333332221 111112 2345677888877656544332 Q ss_pred ccccCeeEEEechhhhHHHHHHhhhcCCCCceehh--hcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccc Q lcl|NC_013692. 237 RTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIE--KYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQV 314 (399) Q Consensus 237 ~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~--kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~ 314 (399) .-+.+|||.+...|+.++|-.+.|=|.|-. +.+.+ -++-++++|+++.|. +| T Consensus 265 -----~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~--------~~l~G~pvv~s~~~~----ag--------- 318 (379) T protein:vir:10 265 -----VTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGV--------LRINGIPLFRATWLA----AN--------- 318 (379) T ss_pred -----CCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCc--------ceecceeeEecCCCC----CC--------- Confidence 124668999999999998866666554321 12222 256679999887542 11 Q ss_pred cccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEE Q lcl|NC_013692. 315 PMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIAL 392 (399) Q Consensus 315 ~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~ 392 (399) + ++||.-+...+.+..+ +.+++. . +-.| +-+++.+.|++ -+++.+++++-++. T Consensus 319 ---------~----~~~gdf~~~~~~~~~~---~~i~~~--~------~~~~-~f~~~~~~~r~~~R~~~~v~~p~a~v~ 373 (379) T protein:vir:10 319 ---------K----YYVGDWTRVTKVTTEG---LSLEFS--E------VEGT-NFVKNNITARIEAQVALAVEQPAALIF 373 (379) T ss_pred ---------c----eEEeecccEEEEEEec---eEEEEe--e------cccc-cccCCcEEEEEEEEeccEEecCccEEE Confidence 1 4577766665555421 122211 1 1122 34577777773 68899999999999 Q ss_pred EEEecC Q lcl|NC_013692. 393 LKTVAR 398 (399) Q Consensus 393 iet~A~ 398 (399) ++..|= T Consensus 374 ~~~~~~ 379 (379) T protein:vir:10 374 GDFTAV 379 (379) T ss_pred EEecCC Confidence 998887 No 117 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.82 E-value=5.1e-06 Score=49.60 Aligned_cols=295 Identities=14% Similarity=0.119 Sum_probs=146.4 Q ss_pred CCCccccccccc---cCCCCCCc---ccccccceehhhhhHHHHHHh-hhHHhhhhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPVDNIKPMK---YNDPANGV---ESSIGPQIHTRYWYKRALIDA-AKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~~~~~~~~---~n~~~~~~---~~~i~p~~~t~y~~~k~L~~A-~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) ..+.+....... -+....++ ...+.|+ ++ .+.++.. ....++.+++...++..+ ++++-+..... T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~---~~--~~~i~~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~ 177 (419) T protein:vir:94 106 FQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQ---LV--PGIVPTTPDLPLLVADLLDQQNADYN---VLEYIRDTSGT 177 (419) T ss_pred hhHHHHHHHHHHhhccccccccccCCcccccch---hh--hHHHHHHHhhhhhhhhcceeeeccCC---ceeeeeecccc Confidence 000000000000 00011111 1122333 22 3333333 334566777777776533 34443322111 Q ss_pred CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 74 DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 74 ~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) .+...+ .+..++.. +|+....-.+++..|+.++++++.+..+|+++.+ +.+ T Consensus 178 ---~~~~~~-~~~a~~v~-----------------------Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~- 228 (419) T protein:vir:94 178 ---AGAGST-WNKAAVVP-----------------------EGTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DNS- 228 (419) T ss_pred ---cccccc-Ccccceec-----------------------CCccccccccceeeEEeeeeeEEEeehhhHHHHH-hHH- Confidence 010000 11112222 2233333445788999999999999999999887 443 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccc---------cccccccCCcceecHHHHHHHHHHHHhccCccc Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGA---------ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTK 224 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~---------aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~ 224 (399) .|...|..+|.+.-+. .+-..+|++-++-.--|- ++............+++|.++.-.|.....+. T Consensus 229 ~l~~~i~~~la~a~~~----~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~- 303 (419) T protein:vir:94 229 QLMGYIQGRLTYGLRF----LRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP- 303 (419) T ss_pred HHHHHHHHHHHHHHHH----HHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCC- Confidence 4655555555444444 333445543332111110 00111111224456788888877666544421 Q ss_pred cceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccC Q lcl|NC_013692. 225 IKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGV 304 (399) Q Consensus 225 T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~a 304 (399) + +.+|||.....|+.++|--+.+-|.. ...-.+-.+++-|+.+++++.+. + T Consensus 304 -----------------~-~~v~n~~~~~~l~~~k~~~~~~~~~~-------~~~~~~~~~~l~G~pV~~~~~~~----~ 354 (419) T protein:vir:94 304 -----------------D-GVVVHPQDWESIELDQAPGSGVFRVI-------ANVQGEATPRIWGLNVVSTVAIA----Q 354 (419) T ss_pred -----------------C-EEEEcHHHHHHHHHHhhcCCCceeec-------CCcccCCCccccceeeEEcCCCC----C Confidence 2 57899999999998875333222211 11233445678899999988642 1 Q ss_pred CcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHH Q lcl|NC_013692. 305 GKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGF 382 (399) Q Consensus 305 Ga~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~ 382 (399) + .++||.-+..-..+...+ +.+ ... +..+.+-+++...|+ +++.+ T Consensus 355 ~----------------------~~~~gd~~~~~~~~~~~~----~~v---~~~----~~~~~~~~~~~~~~r~~~r~d~ 401 (419) T protein:vir:94 355 G----------------------TALVGGFRQGATLWSRQG----ITV---LMT----DSHADFFTANTLVILAEFRANL 401 (419) T ss_pred c----------------------cEEEeeccceEEEEEecc----eEE---EEe----ccccchhhcCcEEEEEEEeecc Confidence 0 145665444322222222 111 111 112234455666666 67889 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_013692. 383 MVFRPEWIALLKTVARL 399 (399) Q Consensus 383 ~iLn~~~m~~iet~A~~ 399 (399) .+.+++-+++++++|-. T Consensus 402 ~v~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 402 AVYQPKAFVRVTFAAAT 418 (419) T ss_pred EEeccccEEEEEeccCC Confidence 99999999999999888 No 118 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.82 E-value=6.4e-06 Score=49.07 Aligned_cols=295 Identities=14% Similarity=0.152 Sum_probs=149.7 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.|-+...-....+.++. .-+-+-|+ .+..+.+....+...+.+++.+.++. | .+++-++..-+.+ T Consensus 131 l~~~~~~~e~~a~~~~t~-~GG~lvP~----~~~~~Ii~~l~~~~~i~~~~~~~~~~---~-~~~~p~~~~~~~a----- 196 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTG-NGSVTIPD----FLSKEIITYAQEENFLRRLGTGVKTK---E-NIKYPVLVKKAEA----- 196 (434) T ss_pred hccccchhhhhhhccccc-ccceecch----hhHHHHHHhhhhhhhhhhhcceeccC---C-ceEEEEEecCCcc----- Confidence 222222111111111111 11223343 23566666677778888999876543 3 2444333211111 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .+.+. .++|+....-.+++..|+.+.++++.+..+|+++.+ +++..|.+.|. T Consensus 197 -------~~~~~--------------------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 248 (434) T protein:vir:62 197 -------QGHKN--------------------ERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLA-RTGLPIEQIVM 248 (434) T ss_pred -------cceec--------------------ccccccccccccceeeEEeeheeeEeehhhHHHHHh-cchHHHHHHHH Confidence 11100 111222222335778899999999999999999877 34445767777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeecccc--ccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAA--TSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~a--Tsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) .+|...-+...+ .-+|++-++-...++. .+..+.+.....++++|.++...|+..-. T Consensus 249 ~~la~~~~~~~d----~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~----------------- 307 (434) T protein:vir:62 249 DELKKAYVRKET----QYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKNTPVKEVR----------------- 307 (434) T ss_pred HHHHHHHHHHHH----HHHhccCCCCccccceeecccccccccccchhhHHHHHHhhcchhhh----------------- Confidence 777665555333 3345433221111111 11223333456788888888776654322 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) +.+ +.+||+....-|+.|+|--+.|=|.|... +-.|--..+-+.++++++.|. .+.+ T Consensus 308 -~~a-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~------~~~g~~~tl~G~pV~~~~~~~----~~~~----------- 364 (434) T protein:vir:62 308 -KKA-RWVLNTAALTKIETMKTDDGFPLLRPFNQ------AEGGIGYTLLGFPVEEEDAID----IPDS----------- 364 (434) T ss_pred -cCC-EEEEcHHHHHHHHHhhccCCCEeeccCCC------ccCCCCceecceeeEEecCcc----CccC----------- Confidence 223 23789999999999988666666665321 112233467889999887652 1110 Q ss_pred cCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHH--HHh-hccccceEEEEE Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYY--GFM-VFRPEWIALLKT 395 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~--~~~-iLn~~~m~~iet 395 (399) .+. +.+.||.-+...|.-..+. ..+ + ...++|-.++.++++++. .++ |++++=.+.++. T Consensus 365 ----~~~-~~i~~Gdfs~~~i~~~~g~--~~i----~-------~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~ 426 (434) T protein:vir:62 365 ----PDT-PVFYFGDFSKFYIQDVIGS--LEV----Q-------KLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKY 426 (434) T ss_pred ----CCc-eEEEEeeccceEEEEeece--eEE----E-------eehhhhcccCceEEEEEeeecceeecCcccceEEEE Confidence 012 4566787666554433221 111 1 113445555555544332 334 344655555544 Q ss_pred ecCC Q lcl|NC_013692. 396 VARL 399 (399) Q Consensus 396 ~A~~ 399 (399) .-+. T Consensus 427 ~~~~ 430 (434) T protein:vir:62 427 VLKA 430 (434) T ss_pred Eecc Confidence 4222 No 119 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=280 Identities=11% Similarity=0.037 Sum_probs=140.8 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhH-Hhh------hhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKE-AYF------GQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~-lv~------~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) || | +++.++.-+++..| -.++....++ .-| ...++...+=..-|.+|.+=.|.+|. T Consensus 1 Ma----~------------~~T~l~d~i~pevf-~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~ 63 (330) T protein:vir:10 1 MA----N------------ELTKILDTITPQQY-NAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLT 63 (330) T ss_pred CC----C------------CceEeeeeechhHH-HHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCC Confidence 22 1 12444433333333 2333333333 223 22233333223458999998899896 Q ss_pred CCCccccCC---CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh Q lcl|NC_013692. 74 DDRNVNDQG---IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD 150 (399) Q Consensus 74 ~~~t~lteG---V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~ 150 (399) .+...+.+| |+|. ++ +...-.+.+.++|.-.++||.+.... T Consensus 64 G~~~~~~dg~~~i~~~--ki----------------------------------~t~~~~a~i~~~~k~~~~tD~a~~~~ 107 (330) T protein:vir:10 64 GDSEVLGNGDKALETG--KI----------------------------------TAGADIACVLYRGRGWAANELTGVVA 107 (330) T ss_pred CcccccCCCccccchh--hc----------------------------------ccceeEEEEEeecceeeehhhhhhhc Confidence 565556555 4332 22 33345788999999999999987765 Q ss_pred hchhHHHHHHHHHHHHHHHHHHHHHHH---HHHh---cCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccc Q lcl|NC_013692. 151 SDPAMEGHVTTEMVKGANEITEDLLQI---DLLN---SAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTK 224 (399) Q Consensus 151 ~D~~L~~~i~~el~~~~~~~t~d~l~~---~~l~---agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~ 224 (399) -.+-+ +++...+..--....++.|.. ++.+ ++.+..+....++.. ......++++.|-+|...|..+... T Consensus 108 g~dp~-~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~-~~~~a~~s~~~l~~A~~~~GD~~~~-- 183 (330) T protein:vir:10 108 GSDPV-RAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQ-SKASTGIDAGMVLDAKQLLGDSADQ-- 183 (330) T ss_pred chhHH-HHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecc-cccccccCHHHHHHHHHHhcccccc-- Confidence 44333 556666554433333333221 2221 122211111111111 1123458888888888777665443 Q ss_pred cceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccC Q lcl|NC_013692. 225 IKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGV 304 (399) Q Consensus 225 T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~a 304 (399) -.+.+|||....+||+ .++++-.+|... .+.||.+-+.|+|++.-+- T Consensus 184 -----------------~~~ivmhS~v~~~L~~-------~~li~~~~~s~~----~~~i~~~~G~~VivdD~~p----- 230 (330) T protein:vir:10 184 -----------------VTAIAMHSAVYTKLQK-------DNLIQYIQPTTA----TINIPTYLGYRVIIDDGIA----- 230 (330) T ss_pred -----------------ceEEEEcHHHHHHHHH-------hhhhhhhccccc----CcccccccceEEEEeCCCC----- Confidence 3799999999999997 356777777766 3579999999999998542 Q ss_pred CcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccc------hhh-----H Q lcl|NC_013692. 305 GKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYG------EMG-----F 373 (399) Q Consensus 305 Ga~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlg------Qrg-----~ 373 (399) +. ..+|-.++||..|++...=+ .+..+.. .+ | .||.+ .|- - T Consensus 231 ---~~-------------~~~yt~yl~~~GAi~~~~~~-~~~~v~~-----Et-----d-Rd~~~g~~~l~~r~~~~~hp 282 (330) T protein:vir:10 231 ---PT-------------GDIYTSYLFRTGSIGLNTGN-PSGLTTF-----ET-----S-REAAKGNDMIYTRRALVMHP 282 (330) T ss_pred ---CC-------------CCceeEEEEecCceeeeccc-CCccccc-----cc-----c-CCccccceEEEEeeEEEeee Confidence 11 12788889999998743200 0011111 11 1 22321 110 0 Q ss_pred HHHHHHHHH---hhccccceEEEEEecCC Q lcl|NC_013692. 374 MSIKWYYGF---MVFRPEWIALLKTVARL 399 (399) Q Consensus 374 ~gwK~~~~~---~iLn~~~m~~iet~A~~ 399 (399) -|+||--.+ .-..+-+ +-|++++-- T Consensus 283 ~G~s~~~~~~~~~~~sPt~-~~L~~~~NW 310 (330) T protein:vir:10 283 YGVKWTGAEVDAGNITPSN-ADLAKFKNW 310 (330) T ss_pred eeeeecccccccCcCCcCh-HHhcCCcCc Confidence 111111000 0000000 001111110 No 120 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=97.72 E-value=1.1e-05 Score=47.86 Aligned_cols=287 Identities=13% Similarity=0.067 Sum_probs=145.1 Q ss_pred CCCcccccccc----------ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVDNIKPM----------KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~~~~~~----------~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) |.+-..+-... ..+..+.+.-+-+-|+ .| ..+.+....+.-++.+++.+.+|+-+.|+...++. . T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~---~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ---DI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-S 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch---hH-HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-c Confidence 11111000000 0011111001112333 23 44555556677788899999999988886543321 1 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeec-cceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRV-GFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~-~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) .- .. ..+...| +..... ..++..|+.+.++++.++.+|+++.+ T Consensus 159 ~~---~~---------a~~v~E~-----------------------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~- 202 (392) T protein:vir:10 159 DM---IP---------FAEITEM-----------------------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ- 202 (392) T ss_pred CC---cc---------ceeeccc-----------------------ccccccccccceeEEeeeeeEEEeehhhHHHHh- Confidence 11 11 1121111 011111 13667899999999999999999876 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCcccccee Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMI 228 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii 228 (399) ++++.|...|..+|...-+. .+|. .++++-++ + ......++++|-.+. ..|+..-.+ T Consensus 203 ds~~~l~~~i~~~l~~~i~~-~~d~---~~~~g~g~------~------~~~~~~~~d~i~~~~~~~l~~~~~~------ 260 (392) T protein:vir:10 203 DSDQNILKYVTKWLGKKSKV-TRNV---LILGVIEK------L------TKQAIKSLDDIKDVLNVKLDPAISP------ 260 (392) T ss_pred hhHHHHHHHHHHHHHHHHHH-HHHH---HHhhcccc------c------cccCccCHHHHHHHHHHhhhhhhcc------ Confidence 46667777666665544333 3333 33432221 1 112446778877665 344443321 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) . =+.+|||.....|+.++|--+.|=|.|- +-.|.-+++-+.|.|.+.....-...+. T Consensus 261 ------------~-a~~vm~~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~tllG~~~v~~~~~~~~~~~~~-- 317 (392) T protein:vir:10 261 ------------N-AILLTNQDGFNYLDKLKDKDGKYILQSD--------PTQKNKKLFAGTNPVVVVSNRFLKSKGT-- 317 (392) T ss_pred ------------C-CEEEEcHHHHHHHHHhhccCCCeEeecC--------ccCCccccccCcccEEEecccccCCCcc-- Confidence 1 1247899999999999876666666441 1234556777888776432111111111 Q ss_pred CCcccccccccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ..+.++ +++|.=+ |-.+..+ ..+.+ -+. +-.+.+-+++.+.++ +++++.++ T Consensus 318 -------------~~~~~~-~~~gdfs~~~~i~~~---~~~~~--~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 318 -------------TAKKAP-LIIGDLKEAIVLFKR---EDMEL--AST-------DVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred -------------cCCceE-EEEEehhceEEEEee---cceEE--EEe-------ccccchhhcCceEEEEEEeeccEEe Confidence 111222 4566422 1222222 11112 111 112344456666666 55788999 Q ss_pred cccceEEEEE--ecCC Q lcl|NC_013692. 386 RPEWIALLKT--VARL 399 (399) Q Consensus 386 n~~~m~~iet--~A~~ 399 (399) +++-++.++. +||. T Consensus 372 ~~~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPV 387 (392) T ss_pred cccceEEEEecccccc Confidence 9999988654 4444 No 121 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=97.72 E-value=1.1e-05 Score=47.86 Aligned_cols=287 Identities=13% Similarity=0.067 Sum_probs=145.1 Q ss_pred CCCcccccccc----------ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVDNIKPM----------KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~~~~~~----------~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) |.+-..+-... ..+..+.+.-+-+-|+ .| ..+.+....+.-++.+++.+.+|+-+.|+...++. . T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~---~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ---DI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-S 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch---hH-HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-c Confidence 11111000000 0011111001112333 23 44555556677788899999999988886543321 1 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeec-cceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRV-GFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~-~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) .- .. ..+...| +..... ..++..|+.+.++++.++.+|+++.+ T Consensus 159 ~~---~~---------a~~v~E~-----------------------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~- 202 (392) T protein:vir:10 159 DM---IP---------FAEITEM-----------------------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ- 202 (392) T ss_pred CC---cc---------ceeeccc-----------------------ccccccccccceeEEeeeeeEEEeehhhHHHHh- Confidence 11 11 1121111 011111 13667899999999999999999876 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCcccccee Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMI 228 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii 228 (399) ++++.|...|..+|...-+. .+|. .++++-++ + ......++++|-.+. ..|+..-.+ T Consensus 203 ds~~~l~~~i~~~l~~~i~~-~~d~---~~~~g~g~------~------~~~~~~~~d~i~~~~~~~l~~~~~~------ 260 (392) T protein:vir:10 203 DSDQNILKYVTKWLGKKSKV-TRNV---LILGVIEK------L------TKQAIKSLDDIKDVLNVKLDPAISP------ 260 (392) T ss_pred hhHHHHHHHHHHHHHHHHHH-HHHH---HHhhcccc------c------cccCccCHHHHHHHHHHhhhhhhcc------ Confidence 46667777666665544333 3333 33432221 1 112446778877665 344443321 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) . =+.+|||.....|+.++|--+.|=|.|- +-.|.-+++-+.|.|.+.....-...+. T Consensus 261 ------------~-a~~vm~~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~tllG~~~v~~~~~~~~~~~~~-- 317 (392) T protein:vir:10 261 ------------N-AILLTNQDGFNYLDKLKDKDGKYILQSD--------PTQKNKKLFAGTNPVVVVSNRFLKSKGT-- 317 (392) T ss_pred ------------C-CEEEEcHHHHHHHHHhhccCCCeEeecC--------ccCCccccccCcccEEEecccccCCCcc-- Confidence 1 1247899999999999876666666441 1234556777888776432111111111 Q ss_pred CCcccccccccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ..+.++ +++|.=+ |-.+..+ ..+.+ -+. +-.+.+-+++.+.++ +++++.++ T Consensus 318 -------------~~~~~~-~~~gdfs~~~~i~~~---~~~~~--~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 318 -------------TAKKAP-LIIGDLKEAIVLFKR---EDMEL--AST-------DVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred -------------cCCceE-EEEEehhceEEEEee---cceEE--EEe-------ccccchhhcCceEEEEEEeeccEEe Confidence 111222 4566422 1222222 11112 111 112344456666666 55788999 Q ss_pred cccceEEEEE--ecCC Q lcl|NC_013692. 386 RPEWIALLKT--VARL 399 (399) Q Consensus 386 n~~~m~~iet--~A~~ 399 (399) +++-++.++. +||. T Consensus 372 ~~~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPV 387 (392) T ss_pred cccceEEEEecccccc Confidence 9999988654 4444 No 122 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=97.72 E-value=1.1e-05 Score=47.86 Aligned_cols=287 Identities=13% Similarity=0.067 Sum_probs=145.1 Q ss_pred CCCcccccccc----------ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVDNIKPM----------KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~~~~~~----------~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) |.+-..+-... ..+..+.+.-+-+-|+ .| ..+.+....+.-++.+++.+.+|+-+.|+...++. . T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~---~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ---DI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-S 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch---hH-HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-c Confidence 11111000000 0011111001112333 23 44555556677788899999999988886543321 1 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeec-cceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRV-GFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~-~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) .- .. ..+...| +..... ..++..|+.+.++++.++.+|+++.+ T Consensus 159 ~~---~~---------a~~v~E~-----------------------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~- 202 (392) T protein:vir:10 159 DM---IP---------FAEITEM-----------------------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ- 202 (392) T ss_pred CC---cc---------ceeeccc-----------------------ccccccccccceeEEeeeeeEEEeehhhHHHHh- Confidence 11 11 1121111 011111 13667899999999999999999876 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCcccccee Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMI 228 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii 228 (399) ++++.|...|..+|...-+. .+|. .++++-++ + ......++++|-.+. ..|+..-.+ T Consensus 203 ds~~~l~~~i~~~l~~~i~~-~~d~---~~~~g~g~------~------~~~~~~~~d~i~~~~~~~l~~~~~~------ 260 (392) T protein:vir:10 203 DSDQNILKYVTKWLGKKSKV-TRNV---LILGVIEK------L------TKQAIKSLDDIKDVLNVKLDPAISP------ 260 (392) T ss_pred hhHHHHHHHHHHHHHHHHHH-HHHH---HHhhcccc------c------cccCccCHHHHHHHHHHhhhhhhcc------ Confidence 46667777666665544333 3333 33432221 1 112446778877665 344443321 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) . =+.+|||.....|+.++|--+.|=|.|- +-.|.-+++-+.|.|.+.....-...+. T Consensus 261 ------------~-a~~vm~~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~tllG~~~v~~~~~~~~~~~~~-- 317 (392) T protein:vir:10 261 ------------N-AILLTNQDGFNYLDKLKDKDGKYILQSD--------PTQKNKKLFAGTNPVVVVSNRFLKSKGT-- 317 (392) T ss_pred ------------C-CEEEEcHHHHHHHHHhhccCCCeEeecC--------ccCCccccccCcccEEEecccccCCCcc-- Confidence 1 1247899999999999876666666441 1234556777888776432111111111 Q ss_pred CCcccccccccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ..+.++ +++|.=+ |-.+..+ ..+.+ -+. +-.+.+-+++.+.++ +++++.++ T Consensus 318 -------------~~~~~~-~~~gdfs~~~~i~~~---~~~~~--~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 318 -------------TAKKAP-LIIGDLKEAIVLFKR---EDMEL--AST-------DVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred -------------cCCceE-EEEEehhceEEEEee---cceEE--EEe-------ccccchhhcCceEEEEEEeeccEEe Confidence 111222 4566422 1222222 11112 111 112344456666666 55788999 Q ss_pred cccceEEEEE--ecCC Q lcl|NC_013692. 386 RPEWIALLKT--VARL 399 (399) Q Consensus 386 n~~~m~~iet--~A~~ 399 (399) +++-++.++. +||. T Consensus 372 ~~~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPV 387 (392) T ss_pred cccceEEEEecccccc Confidence 9999988654 4444 No 123 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=97.72 E-value=1.1e-05 Score=47.86 Aligned_cols=287 Identities=13% Similarity=0.067 Sum_probs=145.1 Q ss_pred CCCcccccccc----------ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVDNIKPM----------KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~~~~~~----------~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) |.+-..+-... ..+..+.+.-+-+-|+ .| ..+.+....+.-++.+++.+.+|+-+.|+...++. . T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~---~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~-~ 158 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ---DI-QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKN-S 158 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecch---hH-HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee-c Confidence 11111000000 0011111001112333 23 44555556677788899999999988886543321 1 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeec-cceeEEEEEEeeeecceehhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRV-GFKRVEIKGKLEKYGFFREYTQEQLDF 149 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~-~~t~tdi~~~l~QyG~~~e~Td~~~~t 149 (399) .- .. ..+...| +..... ..++..|+.+.++++.++.+|+++.+ T Consensus 159 ~~---~~---------a~~v~E~-----------------------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~- 202 (392) T protein:vir:10 159 DM---IP---------FAEITEM-----------------------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ- 202 (392) T ss_pred CC---cc---------ceeeccc-----------------------ccccccccccceeEEeeeeeEEEeehhhHHHHh- Confidence 11 11 1121111 011111 13667899999999999999999876 Q ss_pred hhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH-HHHHhccCcccccee Q lcl|NC_013692. 150 DSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR-LDLDNARAPTKIKMI 228 (399) Q Consensus 150 ~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~-~~Lk~nrApk~T~ii 228 (399) ++++.|...|..+|...-+. .+|. .++++-++ + ......++++|-.+. ..|+..-.+ T Consensus 203 ds~~~l~~~i~~~l~~~i~~-~~d~---~~~~g~g~------~------~~~~~~~~d~i~~~~~~~l~~~~~~------ 260 (392) T protein:vir:10 203 DSDQNILKYVTKWLGKKSKV-TRNV---LILGVIEK------L------TKQAIKSLDDIKDVLNVKLDPAISP------ 260 (392) T ss_pred hhHHHHHHHHHHHHHHHHHH-HHHH---HHhhcccc------c------cccCccCHHHHHHHHHHhhhhhhcc------ Confidence 46667777666665544333 3333 33432221 1 112446778877665 344443321 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) . =+.+|||.....|+.++|--+.|=|.|- +-.|.-+++-+.|.|.+.....-...+. T Consensus 261 ------------~-a~~vm~~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~tllG~~~v~~~~~~~~~~~~~-- 317 (392) T protein:vir:10 261 ------------N-AILLTNQDGFNYLDKLKDKDGKYILQSD--------PTQKNKKLFAGTNPVVVVSNRFLKSKGT-- 317 (392) T ss_pred ------------C-CEEEEcHHHHHHHHHhhccCCCeEeecC--------ccCCccccccCcccEEEecccccCCCcc-- Confidence 1 1247899999999999876666666441 1234556777888776432111111111 Q ss_pred CCcccccccccCccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVF 385 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iL 385 (399) ..+.++ +++|.=+ |-.+..+ ..+.+ -+. +-.+.+-+++.+.++ +++++.++ T Consensus 318 -------------~~~~~~-~~~gdfs~~~~i~~~---~~~~~--~~~-------~~~~~~f~~~~~~~r~~~r~d~~v~ 371 (392) T protein:vir:10 318 -------------TAKKAP-LIIGDLKEAIVLFKR---EDMEL--AST-------DVGGKAFTRNTLDLRAIQRDDVQMW 371 (392) T ss_pred -------------cCCceE-EEEEehhceEEEEee---cceEE--EEe-------ccccchhhcCceEEEEEEeeccEEe Confidence 111222 4566422 1222222 11112 111 112344456666666 55788999 Q ss_pred cccceEEEEE--ecCC Q lcl|NC_013692. 386 RPEWIALLKT--VARL 399 (399) Q Consensus 386 n~~~m~~iet--~A~~ 399 (399) +++-++.++. +||. T Consensus 372 ~~~a~~~l~~~~~a~~ 387 (392) T protein:vir:10 372 DNEAAVYGEIDLSAPV 387 (392) T ss_pred cccceEEEEecccccc Confidence 9999988654 4444 No 124 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.69 E-value=8.5e-06 Score=48.38 Aligned_cols=293 Identities=13% Similarity=0.123 Sum_probs=143.3 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCC Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGID 84 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~ 84 (399) |++ +++++.+.-+-+.| ..+.++.+++.-.+.+++...+|+.+ ++++-+...-+.+ .-..||- T Consensus 1 ma~-----------~t~~~gg~liP~~~-~~~Ii~~~~~~s~l~~l~~~~~~~~~---~~~~p~~~~~~~a-~wv~E~~- 63 (305) T protein:vir:25 1 MAD-----------ISRAEVASLIQEAY-SDTLLAAAKQGSTVLSAFQNVNMGTK---TTHLPVLATLPEA-DWVGESA- 63 (305) T ss_pred CCC-----------ccCCccceecCHHH-HHHHHHHHHhhchhhhhcceeeccCC---cEEEEEEeCCcce-EEeeccc- Confidence 111 12222232233234 57778888888889999999888643 3444332211111 1112221 Q ss_pred cchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHH Q lcl|NC_013692. 85 ASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMV 164 (399) Q Consensus 85 p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~ 164 (399) ...++ .+ | . -..++..|+.+.++++.+..+|+++.+ +++..+...|..+|. T Consensus 64 ----~~~~~----------~~----~-~---------s~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i~~~l~ 114 (305) T protein:vir:25 64 ----TDPKG----------VK----P-T---------SKVTWANRTLVAEEIAVIIPVHENVID-DATVAVLTEVAELGG 114 (305) T ss_pred ----ccccc----------cc----c-c---------cccceeeEEeeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHH Confidence 11000 00 0 0 124677899999999999999999886 455557666666666 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeec---cc-cccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 165 KGANEITEDLLQIDLLNSAGTVRYP---GA-ATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 165 ~~~~~~t~d~l~~~~l~agt~V~YA---g~-aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) +.-+...++.+..+.=+..+...+. .. .......+.....+.+++..+...+....... .-.. T Consensus 115 ~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~ 181 (305) T protein:vir:25 115 QAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASA-------------GWAP 181 (305) T ss_pred HHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhhhHHHHHHHHHHHhhhhc-------------cccc Confidence 6555544444332211111111111 00 01111122223344444444444333332210 0001 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) . ..++||.....|+.++|--+.|=| .. +.+-+..++.++.+ ++. . T Consensus 182 ~--~~v~~~~~~~~l~~lkd~~G~~i~-------------~~--~~l~G~Pv~~~~~~-~~~-----------------~ 226 (305) T protein:vir:25 182 D--TLLSSLALRYEVANIRDANGNPVF-------------RD--DSFAGFRTFFNRNG-AWD-----------------A 226 (305) T ss_pred c--eeEecHHHHHHHHHhhccCCceee-------------cC--CcccccceEEcCcc-CCC-----------------C Confidence 1 257899999999988764444433 22 46788888877642 111 1 Q ss_pred ccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEEe-- Q lcl|NC_013692. 321 GKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIALLKTV-- 396 (399) Q Consensus 321 ~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~iet~-- 396 (399) ++. .+++|.=+.-.++... .+.+++.-. -.....+..--+-|+....||+ .++..++|++-.+.+.-+ T Consensus 227 ~~~----~~~~gd~s~~~i~~~~---~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 227 DAA----IEVIADSSRVKIGVRQ---DITVKFLDQ-ATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred Ccc----EEEEEecceEEEEEec---CeEEEEeee-eeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 122 2456765544555542 223332211 0000011112245777788873 478889998877776552 Q ss_pred cCC Q lcl|NC_013692. 397 ARL 399 (399) Q Consensus 397 A~~ 399 (399) |.+ T Consensus 299 ~~~ 301 (305) T protein:vir:25 299 AVV 301 (305) T ss_pred ccc Confidence 333 No 125 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.67 E-value=3.9e-06 Score=50.22 Aligned_cols=276 Identities=14% Similarity=0.068 Sum_probs=143.0 Q ss_pred CCCccccccccccCCC-CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDP-ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~-~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) ....... ....+.. +.+..+-+-|+ .|....+....+.-.+.+++.+.+++.+.++-... ...+ T Consensus 123 ~~~~~~~--~~~~~~~~~~~~gg~~vP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~-- 187 (400) T protein:vir:38 123 RAVPTDA--SDAVNAGVKAADAASTIPE----TISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTV-------ANAT-- 187 (400) T ss_pred hhhhHHH--HHHHhhcccccCCcccccH----HHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEE-------ecCC-- Confidence 1111110 0011111 11111223443 23455555566777889999999998776532221 1111 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +...+.+. .|.....-..++..|+.+.++++.++.+|+++.+ ++++.|...| T Consensus 188 -----~~~~~~~E----------------------~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~-ds~~~~~~~i 239 (400) T protein:vir:38 188 -----TKMVTVAE----------------------LEKNPAMAKPEFKPVNWSVETYRQALPVSQESID-DSAIDLVGLI 239 (400) T ss_pred -----Cccccccc----------------------cccccccccccceeeEeehhheeeehhhHHHHHh-hhHHHHHHHH Confidence 11112211 1111122224677899999999999999998776 4555676666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) ..+|...-+. .++ ..++++-+. ++ .....++++|..+.. .++... T Consensus 240 ~~~l~~~~~~-~~~---~~i~~~~~~------~~------~~~~~~~~~~~~~~~~~~~~~~------------------ 285 (400) T protein:vir:38 240 AQNGQQIKVN-TTN---GAVATLLKG------FT------AKTISSVDDLKHINNVDLDPAY------------------ 285 (400) T ss_pred HHHHHHHHHH-HHH---Hhhhhcccc------cc------ccccccHHHHHHHHHhhhhhhh------------------ Confidence 6665544333 222 233322211 11 123456777766543 121110 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) . -+.++||.....|+.|+|--+.|=|.| .+. .+--+.+-|..++.++.+ ++..+|. T Consensus 286 -~--a~~v~~~~~~~~l~~lkd~~G~~i~~~--~~~------~~~~~~l~G~pv~~~~~~-~~~~~g~------------ 341 (400) T protein:vir:38 286 -S--RVIIASQSFYNFLDTVKDGNGRYLLQD--SIL------TPSGKSVLGMPIAVVSDD-TLGAAGE------------ 341 (400) T ss_pred -C--cEEEEcHHHHHHHHHhhccCCCeeeec--CcC------CCCccccccceeEEeccc-ccCCCCc------------ Confidence 1 245789999999999988666766654 122 233457889999888864 3322211 Q ss_pred cCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecC Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~ 398 (399) + .++||.-+-+.+.+...+ +.++. . .+.+.+.++.++ +.+.+.+++++-++.|+..+. T Consensus 342 ------~--~~~~gd~s~~~~~~~~~~--~~~~~--~---------~~~~~~~~~~~~-~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 342 ------A--HAFLGDIKRAILFANRAD--FMVRW--V---------DDQIYGQFLQAG-MRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred ------e--EEEEEeccccEEEEeecc--eEEEE--e---------cccccceeEEEE-EEeccEEecccceEEEEeecC Confidence 1 356776442222222122 12211 1 122333344333 688999999999888887555 Q ss_pred C Q lcl|NC_013692. 399 L 399 (399) Q Consensus 399 ~ 399 (399) - T Consensus 400 a 400 (400) T protein:vir:38 400 A 400 (400) T ss_pred C Confidence 5 No 126 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.66 E-value=2.3e-05 Score=45.97 Aligned_cols=299 Identities=12% Similarity=0.075 Sum_probs=139.9 Q ss_pred CCCccc--cccccccCCC--------CCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEcc Q lcl|NC_013692. 1 MAGPVD--NIKPMKYNDP--------ANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYI 70 (399) Q Consensus 1 ~~~~~~--~~~~~~~n~~--------~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~ 70 (399) -...+. .+....|+.. +.++-+.+-|+ .+ ..+.++..++..++.+++ .+.+|-..|+ +++-+.. T Consensus 43 ~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~---~~-~~~ii~~l~~~s~l~~lg-~~~v~~~~g~-~~~p~~t 116 (366) T protein:vir:57 43 KGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQ---NM-QNEVIELLRDRTVVRILG-ARSIPLPNGN-LSMPRLS 116 (366) T ss_pred ccchhHHHHHHHHhhcchhhhhhccccccCCccccch---hH-HHHHHHHHhhhcchhhhc-eeeeecCCCc-eEEEEEe Confidence 000000 0000111111 11111222343 23 355566666777777773 3345555564 5544443 Q ss_pred CCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh Q lcl|NC_013692. 71 PLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD 150 (399) Q Consensus 71 pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~ 150 (399) .- |...+...| +.+..-.+++..|+.+.++++.+..+|+++.+ + T Consensus 117 ~~------------~~a~wv~E~-----------------------~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~-d 160 (366) T protein:vir:57 117 GG------------ATAGYVGEG-----------------------KDVVATGATFDDVKLSAKTMIALVPVSNQLIG-R 160 (366) T ss_pred CC------------cceeeeccC-----------------------ccccccccceeEEEEeeEEEEEeehhhHHHHh-h Confidence 11 111222211 11122234778899999999999999999876 4 Q ss_pred hchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e------eeeccccccccccCCcceecHHHHHHHHHHHHhccCcc Q lcl|NC_013692. 151 SDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPT 223 (399) Q Consensus 151 ~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk 223 (399) +++.+.+.|..+|.+.-+...+ .-+|++-+ . +.+++. ++......++..+.+.++.....|....... T Consensus 161 s~~~~~~~i~~~l~~a~~~~~d----~a~l~G~G~~~~p~Gi~~~~~~-~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~ 235 (366) T protein:vir:57 161 AGFNVEQLLLGDILSAIATRED----KAFLRDDGTGDTPKGMKAVATA-ANRLVAWTGTAINLTTIDEYLDSLILKHMDS 235 (366) T ss_pred hhHHHHHHHHHHHHHHHHHHHH----HHhhccCCCCccccceeecccc-ccceeeccccccchhhHHHHHHHHHHhhhcc Confidence 5556777676666666555333 33443221 1 223321 1111111233455555544433332222110 Q ss_pred ccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 224 KIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 224 ~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) . .-... -.-+||+.....|+.|+|--+.|-|.+. .-|.+.++++++++.|-.-.+ T Consensus 236 ------~------~~~~~-a~~vmn~~~~~~L~~lkd~~G~~l~~~~------------~~g~l~G~Pvv~s~~ip~~~~ 290 (366) T protein:vir:57 236 ------N------SNMIR-CGWGLSNRTYMTLFGLRDGNGNKVYPEM------------SQGILKGYPIQRTSAIPANLG 290 (366) T ss_pred ------c------ccccc-CEEEecHHHHHHHHhhhccCCceeccCC------------CCCeecceeeEEccccccccc Confidence 0 00111 2236999999999999887777777432 126788999999886532111 Q ss_pred CCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc------cchhhHHHHH Q lcl|NC_013692. 304 VGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP------YGEMGFMSIK 377 (399) Q Consensus 304 aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP------lgQrg~~gwK 377 (399) .+ +++. .++||.=+.-.++..+ .+.+++. ..+.. .|+ +-|+....+| T Consensus 291 ~~--------------~~~~----~i~~gdfs~~~i~~~~---~i~i~~~-~ea~~-----~~~~g~~~~~f~~~~~~iR 343 (366) T protein:vir:57 291 DD--------------GNES----EIYFCDFNDVVIGEDG---MMKVDFS-TEATY-----KDADGQLVSAFARNQSLIR 343 (366) T ss_pred cC--------------CCcc----EEEEEecceEEEEEec---ceEEEEe-ecccc-----ccccccchhhhhcCceeEE Confidence 11 1111 1345655444444432 1122111 11111 122 2344445555 Q ss_pred --HHHHHhhccccceEEEEEecC Q lcl|NC_013692. 378 --WYYGFMVFRPEWIALLKTVAR 398 (399) Q Consensus 378 --~~~~~~iLn~~~m~~iet~A~ 398 (399) +.+.+.+.+++-.+.+.-+-= T Consensus 344 ~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 344 VVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred eeeeeCcEeeccccEEEEecccC Confidence 345566666666555543333 No 127 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.54 E-value=3.4e-05 Score=45.11 Aligned_cols=277 Identities=12% Similarity=0.028 Sum_probs=143.3 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ...++.... -+-.+.+..+-+-|+ .+..+.+....+...+.+++...+|+.+.++-..+ .-.+...+ ..+. T Consensus 105 ~~~~~~~~~---ra~~t~~~gg~liP~----~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~ 175 (421) T protein:vir:13 105 RGIQLSEEE---RDIMSSTNNGAVIPQ----EFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVR-AGASVDKL-ANLA 175 (421) T ss_pred hccchhHHH---hhccccCCcceecch----hhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEe-ecCCccce-eecc Confidence 111111100 011111111223343 33455555666777888999999988776532221 11111100 0011 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) || +.+..-.+++..|+.++++++.++.+|+++.+ +++..|...|. T Consensus 176 E~----------------------------------~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~-ds~~~l~~~i~ 220 (421) T protein:vir:13 176 KD----------------------------------TELVKAMLKTQPMAYDIDDYGLLAPIDNSLLE-DSEINFLEFVN 220 (421) T ss_pred cc----------------------------------ccccccccceeEEEeeeeeeEeehhhhHHHHh-hhHHHHHHHHH Confidence 11 11111224677899999999999999998875 45566777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) .+|.+.... .++.--.+.+++.. +.....++++|.++...|+..-.+ T Consensus 221 ~~la~~~~~-~~~~~i~~~~~g~~--------------~~~~~~~~d~i~~~~~~l~~~~~~------------------ 267 (421) T protein:vir:13 221 EEFAEFAVN-TENAEIVKQAKAVL--------------AEETINDYAGLVKTINSLVPNARK------------------ 267 (421) T ss_pred HHHHHHHHH-HhhhhHhhhhhhcc--------------ccccccchHHHHHHHHHhhhhhcC------------------ Confidence 777654443 33332223332211 122345788999998888654332 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) .=+.++||.....|+.|+|-.+.|=|.++. .+.-+++-|..+++++.|. .+.+. T Consensus 268 -~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~---------~~~~~tl~G~pV~~~~~~~--~~~~~-------------- 321 (421) T protein:vir:13 268 -RAIIVTNSDGRAYLDGLMDKQGRPLLKELS---------DGGDLVFKGRPVIELEESI--FDVGD-------------- 321 (421) T ss_pred -CCEEEEcHHHHHHHHHhhcCCCceeecCcC---------CCCCceecceeeEEecccc--ccCCC-------------- Confidence 124578999999999999877777776642 3445788899999888643 11111 Q ss_pred ccceeEEEEEEcccc-ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHHhhccccceEEEEEec Q lcl|NC_013692. 321 GKYSVFPMLCVASEA-FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGFMVFRPEWIALLKTVA 397 (399) Q Consensus 321 ~~~DVYp~lV~G~~A-fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~~iLn~~~m~~iet~A 397 (399) .+ .+++|.-+ +-.+..+. .+.+ -+. .+++-+++...++ +.+.+.+.+++....+.+.. T Consensus 322 ----~~-~~~~gd~~~~~~~~~~~---~~~v--~~~---------~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 322 ----ET-KFIVSDFKTLIKFMDRK---QYLI--DQS---------KEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred ----ce-EEEEEeccccEEEEEec---ceEE--Eee---------cccccccCeeEEEEEeeecceeecchhhheeeecc Confidence 12 34566533 22333331 1122 111 2234445554444 33444555554433333321 Q ss_pred C---C Q lcl|NC_013692. 398 R---L 399 (399) Q Consensus 398 ~---~ 399 (399) + + T Consensus 383 ~~a~v 387 (421) T protein:vir:13 383 FGVIV 387 (421) T ss_pred cceee Confidence 1 1 No 128 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.52 E-value=2.4e-05 Score=45.90 Aligned_cols=311 Identities=12% Similarity=0.049 Sum_probs=146.4 Q ss_pred CCCccccccccccCCCCCCcc-ccccc-ceehhhhhHHHHHHhhhHHhhhhcccccccCcCCC-cEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVE-SSIGP-QIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYG-KEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~-~~i~p-~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~G-ktIkfrry~pl~~~~t 77 (399) ..++..+.....-....+++. +.-++ ..-+.| ..+.++...|..++.+++.....+-+.- -.+++- T Consensus 321 ~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~-~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip---------- 389 (645) T protein:vir:93 321 PDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEY-AQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVH---------- 389 (645) T ss_pred ccchhhhhhhhhhhhccccccccccCCccCchhh-HHHHHHhhhhhhhHHhhccccccccccccCceeee---------- Confidence 011111110010001111111 11111 111223 4556666678889999986543222210 112211 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) ..+.| |...+.. +|+.+..-.+++..|+.+.++.+.++.+|+++.+ ++.+.+.+ T Consensus 390 ~~t~~--~~a~wv~-----------------------Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~-ds~~~~~~ 443 (645) T protein:vir:93 390 AQVSG--GAAGWVG-----------------------EGKTKPLTKFDFESITFSHAKVSAIAVLTEELIR-FSSPAADA 443 (645) T ss_pred eeecC--cceEEec-----------------------cCccccccccceeEEEEeeEEEEEeehhHHHHHh-hchHHHHH Confidence 11122 2223332 2333344456888999999999999999999876 34556777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTR 237 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~ 237 (399) .|..+|...-+...+..+..+--.+++++.-+|- +...........+..++..+...|..++... T Consensus 444 ~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi-~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~-------------- 508 (645) T protein:vir:93 444 LVRNALAEAVVARLDTDFVDPKKAAVADVSPASI-THDVKGTASSGNPDADAEAAFGQFVAANLQP-------------- 508 (645) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCcccCCccccce-eccccccccccchHHHHHHHHHHHHhcCCCc-------------- Confidence 7777777666653333322211122223222211 1111111112234567777776666555431 Q ss_pred cccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccc Q lcl|NC_013692. 238 TVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMH 317 (399) Q Consensus 238 ~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~ 317 (399) -.+ +-+|||.+...|+.|+|--+.+-|..+. ..=|.+-+.+++.+..+- + ....+ T Consensus 509 -~~a--~~vmn~~~~~~L~~lkd~~G~~~~~~~~----------~~~~tL~G~PV~~s~~vp---~---~~~~g------ 563 (645) T protein:vir:93 509 -TGA--VWLMSSTNALALSMRKNALGQKEYPDMT----------LLGGSFQGLPVIVSQYVG---D---QLVLV------ 563 (645) T ss_pred -ccc--EEEEcHHHHHHHHhccccCCceeecCCC----------CCCceeeceeeEEeccCC---c---ceeEe------ Confidence 112 3457999999999998766666663221 111578899999887641 0 00000 Q ss_pred ccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCC---ccchhhHHHHH--HHHHHhhccccceEE Q lcl|NC_013692. 318 ESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSD---PYGEMGFMSIK--WYYGFMVFRPEWIAL 392 (399) Q Consensus 318 ~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~D---PlgQrg~~gwK--~~~~~~iLn~~~m~~ 392 (399) ..-++ ++|.+ +.+-+.-+. ...+++..-+-|..+.+... -|-|+..+++| +.+.+.+.+++-.++ T Consensus 564 ---d~s~~----~ig~~--~~v~i~~s~-~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~ 633 (645) T protein:vir:93 564 ---NAPDI----YLADD--GGVAVDMSR-EASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAV 633 (645) T ss_pred ---ccccE----EEEEe--cceEEEeec-ceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEE Confidence 00011 22322 111111110 01222222221211111111 14678888888 667888888887776 Q ss_pred EEEecCC Q lcl|NC_013692. 393 LKTVARL 399 (399) Q Consensus 393 iet~A~~ 399 (399) |. +++. T Consensus 634 lt-~~~~ 639 (645) T protein:vir:93 634 IT-GVNY 639 (645) T ss_pred Ee-cccC Confidence 65 4444 No 129 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.49 E-value=5.2e-05 Score=44.07 Aligned_cols=283 Identities=14% Similarity=0.082 Sum_probs=139.9 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc---ccCcCCCcEEEEEEccCCc-CCC Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF---SMPKHYGKEIVRLHYIPLL-DDR 76 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~---~mPKn~GktIkfrry~pl~-~~~ 76 (399) || ++| | -+ -|+..+++...+.++.+.+-... .+--+.|++||+.+-.--. .|- T Consensus 1 MA----~~n---~---------------a~-~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY 57 (299) T protein:vir:79 1 MA----ALN---Y---------------AK-EYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDS 57 (299) T ss_pred Cc----cch---h---------------HH-HHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccccccccc Confidence 21 111 1 12 35777888888888888754332 2333668999996654211 122 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch--- Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP--- 153 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~--- 153 (399) +..+.|-++. + +..+.++++.+=.+| |... +++.|.|. T Consensus 58 ~R~~~g~~~g-----------------~-----------------~~~~~~t~~ldqdr~--~~f~---vD~~Dvdet~~ 98 (299) T protein:vir:79 58 NRDTIAVAQR-----------------N-----------------YDNAWEPKVLTNQRK--WSTL---VHPADINQTNY 98 (299) T ss_pred ccCCCccccc-----------------c-----------------cCcceeEEEeecccc--ceec---cchhhHHHHhh Confidence 2222222211 1 112334444443332 2222 22222221 Q ss_pred hH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_013692. 154 AM-EGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTR 232 (399) Q Consensus 154 ~L-~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~ 232 (399) .+ ...+..+....+..--.|..+...|.++.+. .|..++..+++.... ++.|+.+...|++++.|. T Consensus 99 ~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~--~g~~~~~~~~T~~n~--y~~i~~~~~~lde~~vP~--------- 165 (299) T protein:vir:79 99 VASIGNITKVYNEEQKFPEMDAYCISKIYADWTA--LGNTADTTVLTTTNV--LEVFDKLMEKMTEARVPE--------- 165 (299) T ss_pred hhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhh--cCCcccccccCHHHH--HHHHHHHHHHHHhcCCCC--------- Confidence 11 0111111111111111233444444333221 122222233333322 577999999999999873 Q ss_pred ccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc-----CCcc Q lcl|NC_013692. 233 MIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG-----VGKA 307 (399) Q Consensus 233 ~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~-----aGa~ 307 (399) ..+|+||.|+.-..|+. ++.|......++.....+|-||++++|.++++|.- .+.. .|.. T Consensus 166 --------~~rvl~vtp~~~~~L~~------~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~-r~~t~~~~~~G~~ 230 (299) T protein:vir:79 166 --------NGRILYVTPVVNTLIKN------AKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSN-LMKTAYDFTTGWK 230 (299) T ss_pred --------CCeEEEeCHHHHHHHhh------chhhhcccccccccceeeeeeeeecceEEEEechh-hcCccceeccCcc Confidence 35999999999998874 68999888888887889999999999999998862 2222 2222 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRP 387 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~ 387 (399) .+.+ .--..++|+=+ -+.+... +--+.. +-.||. -.++|=|-| .+.||-.-+|.. T Consensus 231 ~~~~-----------ak~in~ii~~~--~a~~~~~---K~~~~~--~~~P~~--~~~~~~~~~-----~r~y~d~~v~~n 285 (299) T protein:vir:79 231 VGAG-----------AKQIFMSLVHP--SAIITPV---SYQFSK--LDEPTA--VTEGKYFYF-----EESFEDVFILNK 285 (299) T ss_pred ccCc-----------ccccceEEEcC--CeeeeeE---eeeeEE--eecCCC--CCccceeee-----eeeeeeeeeecc Confidence 1111 11244555533 2333332 111222 235773 344443322 255655555542 Q ss_pred ---cceEEEEEecC Q lcl|NC_013692. 388 ---EWIALLKTVAR 398 (399) Q Consensus 388 ---~~m~~iet~A~ 398 (399) .--+-+|.|-- T Consensus 286 k~~~i~~~~~~a~~ 299 (299) T protein:vir:79 286 KADAIQFVVEGAGA 299 (299) T ss_pred ccCeEEEEeeecCC Confidence 33455555444 No 130 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.43 E-value=1.6e-05 Score=46.84 Aligned_cols=273 Identities=13% Similarity=0.063 Sum_probs=138.0 Q ss_pred CCCccc-cccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVD-NIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~-~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) .++-.. ..+... +-.+....+-+-|+ -|....+....+...+.+++.+.+++.+.++-..+.. .. T Consensus 116 ~~~~~~~~~~~~~-~~~t~~~gg~liP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-------~~-- 181 (394) T protein:vir:97 116 MPINETTPVEPQK-DGIKKENAKPVSSE----EILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQR-------AT-- 181 (394) T ss_pred HHHHhhhhhhhhc-cccccccccccChH----HHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEec-------CC-- Confidence 000000 000000 00011111223444 2355555556677889999999999888765333211 00 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) +...++..|- ..| ..-..++..|+.+.++++.++.+|+++.+. +++.|...| T Consensus 182 -----~~~~~v~E~~-------------~~~---------~~~~~~~~~v~l~~~k~~~~i~is~ell~d-s~~~~~~~i 233 (394) T protein:vir:97 182 -----TKMVTVAELE-------------KNP---------ALAKPDFKDVAWNIDTYRGAIPLSQESIDD-ADVDLVGIV 233 (394) T ss_pred -----Cccceecccc-------------ccc---------ccccccceeEEeehhheeeehhhHHHHHhh-hhHHHHHHH Confidence 1111222110 001 011236778999999999999999998773 344465656 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) ..+|...-+. ++|. .+|++.+. ++ .....++++|..+...+..... T Consensus 234 ~~~la~~~~~-~~~~---~i~~g~~~------~~------~~~~~~~~~~~~~~~~~~~~~~------------------ 279 (394) T protein:vir:97 234 SESISQIKVN-TTND---AIAKVLKS------FT------TKTVKNLDEIKALLNGGFDPAY------------------ 279 (394) T ss_pred HHHHHHHHHH-HHHH---HHhhcccc------cc------ccccccHHHHHHHHHhhhhhhh------------------ Confidence 5555443333 3332 33433221 11 1234577777766643322211 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .++ .+|||.+...|+.|+|--+.|=|.|- +-.|--+.+-|+.+++++.. +.+.+ T Consensus 280 ~a~--~v~n~~~~~~l~~lkd~~G~~i~~~~--------~~~~~~~~l~G~pv~~~~~~--~~~~~-------------- 333 (394) T protein:vir:97 280 NVS--LIVSQSFYQTLDTLKDGNGRYLLQDD--------ITAVSGKVLLGKPVFVLSDE--VLGAN-------------- 333 (394) T ss_pred CCE--EEEcHHHHHHHHHhhccCCCeeeecC--------cCCCCCceeccceeEEeccc--ccCCc-------------- Confidence 123 46899999999999886666666541 22344467888888876521 11110 Q ss_pred CccceeEEEEEEcc--ccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEE-- Q lcl|NC_013692. 320 GGKYSVFPMLCVAS--EAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKT-- 395 (399) Q Consensus 320 ~~~~DVYp~lV~G~--~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet-- 395 (399) .++||. ..|. +... +.+.++ + ..+.+.+.++.++ +.+.+.+++++-++.|+. T Consensus 334 --------~~~~gd~~~~~~-~~~~---~~~~~~--~---------~~~~~~~~~~~~~-~r~d~~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 334 --------KAFIGDFKRGVL-FADR---KDLGLR--W---------ADNEIYGQYLQAV-LRFGVSKVDDKAGYYVTFTP 389 (394) T ss_pred --------cEEEeeccccEE-EEEe---cceEEE--E---------ecccccceeEEEE-EEEccEEecccceEEEEecc Confidence 135665 2222 2211 111111 1 1222333344333 678888999998888875 Q ss_pred -ecCC Q lcl|NC_013692. 396 -VARL 399 (399) Q Consensus 396 -~A~~ 399 (399) ++|+ T Consensus 390 ~~~p~ 394 (394) T protein:vir:97 390 EPLPL 394 (394) T ss_pred cccCC Confidence 3445 No 131 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.36 E-value=1.4e-05 Score=47.28 Aligned_cols=276 Identities=15% Similarity=0.085 Sum_probs=137.7 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +..-+........+..+....+.+-|+ .+..+.. +......+.+++.+.+++.+.++.-.. ...+ T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~vp~----~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~--- 184 (397) T protein:vir:96 120 INAFVKSKGAEKRDGFTSVEGGALIPQ----ELLQPQL-EPKDIVDLSKYVRSVPVNSASGKFPVI-------SKSG--- 184 (397) T ss_pred HHHHHHhhhhhhhhcccccccccchhH----HHHHHHH-HhhhhhhHHHhhhhccccccceeEEEE-------eccC--- Confidence 000001011111111111111111121 2233332 333444556777777777665542211 1110 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ++. . ...|.+.+...-..++..|+.++++++.++.+|+++.+. ++..|...|. T Consensus 185 ---~~~-~----------------------~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~d-s~~~l~~~i~ 237 (397) T protein:vir:96 185 ---SKM-A----------------------TVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDD-ASYDVTGLIA 237 (397) T ss_pred ---Ccc-c----------------------cccccccccccccccccceeecHhHhhcchhhHHHHHhh-hHHHHHHHHH Confidence 000 1 112222222223346788999999999999999998774 4445767676 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) .+|...-+. .++ ..++ +|+. .+ +....+++++|..+.-..... +. . T Consensus 238 ~~l~~~~~~-~~~---~~i~-~g~g---~~--------~~~~~~~~d~~~~~~~~~~~~-~~-----------------~ 283 (397) T protein:vir:96 238 DEIQDQSLN-TKN---ADIA-AVLK---TA--------TAKSVVGVDGLKDLINKEIKK-VY-----------------D 283 (397) T ss_pred HHHHHHHHH-HHH---HHHh-hccc---cc--------ccccccchHHHHHHHHHhhhh-hc-----------------C Confidence 666554443 332 2333 2222 11 122457788887776432111 10 1 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) + +.++||....-|+.|+|-.+.|-|.|- +-.+..+.+-|..+++++.+.+-.++| T Consensus 284 a--~~v~n~~~~~~l~~lkd~~G~~~~~~~--------~~~~~~~~l~G~pv~~~~~~~~~~~~~--------------- 338 (397) T protein:vir:96 284 V--KLFISASMYSELDKLKDKNGRYLLQDS--------ITAASGKQLLGKEVVVLDDDVIGKSVG--------------- 338 (397) T ss_pred c--EEEEcHHHHHHHHHhhccCCCeEeccC--------ccCCCcccccccceEEecccccCCCCC--------------- Confidence 1 468999999999999987777777541 234455688899998877543311110 Q ss_pred ccceeEEEEEEccccc-eecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEE-Eec Q lcl|NC_013692. 321 GKYSVFPMLCVASEAF-TTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLK-TVA 397 (399) Q Consensus 321 ~~~DVYp~lV~G~~Af-g~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~ie-t~A 397 (399) -+ .++||.=+- -.+... +.+.+. +. .+.+.++++.++ +.+.+.+.+++-++.++ ++| T Consensus 339 ----~~-~~~~gd~~~~~~~~~~---~~~~~~--~~---------~~~~~~~~~~~~-~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 339 ----NV-VGFIGDAKAFASFFDR---KQVSVS--WV---------DNNIYGQLLAGI-IRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ----ce-EEEEeehhcceEeEee---cceEEE--Ee---------cccccceeEEEE-EEEccEEecccceEEEEeecC Confidence 11 245675331 122222 111221 11 122234444433 57888899999999998 455 No 132 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.35 E-value=8.7e-05 Score=42.85 Aligned_cols=307 Identities=12% Similarity=0.067 Sum_probs=149.2 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc-CCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL-DDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~-~~~t~l 79 (399) |.-| |......++ ..++. -.+..--|.-+.++.=+..-++..+=.++.+ ..||+.+|.|---.. ...+| T Consensus 1 Ms~~-n~~t~~~~~-----~s~~~-~al~le~f~geV~taF~~~si~~~~~~vrti--~~GkS~qf~~iG~~~a~y~~~- 70 (402) T protein:vir:97 1 MSTP-NTLTNVAVS-----ASGEV-DSLLIEKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGETELQVLAP- 70 (402) T ss_pred CCCc-ccccccccc-----cccch-hhhhhhhhhhhHHHHHHHHHhhcCcceeeee--cccceEEEEEEeeeEEeeecc- Confidence 3222 211122221 01111 1112123455566555545555566666665 378888886652221 12222 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeec--ceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYG--FFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG--~~~e~Td~~~~t~~D~~L~~ 157 (399) |-.+.|+.+.+ ....|++.=-.|= +++.|-|-..+.+. +.. T Consensus 71 --G~~ldg~~~~~--------------------------------~k~~ItID~lL~a~~~V~diDeaq~~yD~---vRs 113 (402) T protein:vir:97 71 --GQSPNATPTQA--------------------------------DKNQLVIDTTVIARNTVAHIHDVQGDIDS---LKP 113 (402) T ss_pred --ccccCCCCccc--------------------------------ccEEEEeCceeechhhhhhHHHHHhcccc---hhH Confidence 22222222211 1112222211121 12222222222221 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce------eeec---ccccccccc-CC-cceecHHHHH----HHHHHHHhccCc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT------VRYP---GAATSDAEV-DA-TTEVTYDSLM----RLRLDLDNARAP 222 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~------V~YA---g~aTsra~v-~~-~~~vt~~~lr----~a~~~Lk~nrAp 222 (399) +++.|++..=+..+ |+..+.++.++.. ..+. +.+++-..+ +. +...+...|. .+...|++...| T Consensus 114 e~s~e~G~ALA~~~-Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP 192 (402) T protein:vir:97 114 KLAMNQAKQLKRLE-DQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVD 192 (402) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhhccccccccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCC Confidence 44555555544433 3333333322221 0111 111111111 00 0123444344 455667777776 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhc--CCccccccccceeEcCeEEEecCcccc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKY--AAGGATMHGEVGQLGRFRVIVNPQMMH 300 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kY--g~~~~i~~gEIG~i~~~RfV~~~~~~~ 300 (399) . ..+++++.|..-.-|.+ ++.|++. .| +....+..|+|+++.+||+++++++-- T Consensus 193 ~-----------------~dRv~vv~P~~y~~Ll~------~~rl~n~-d~~~~~~g~~~~G~v~~v~Gv~Vv~SnnlP~ 248 (402) T protein:vir:97 193 I-----------------SDVAIMMPWKFFNALRD------ADRIVDK-TYTISQSGATINGFVLSSYNCPVIPSNRFPT 248 (402) T ss_pred c-----------------cccEEEeChHHHHHHhh------cccccch-hhccccCCccccceeEEEeceEEEecCcccc Confidence 3 23899999999999986 6788888 44 556668999999999999999998643 Q ss_pred cccCCcccCCcccccccccCcccee------EEEEEEccccceecccccCCCCCcceEEEecCCCcCCC-CCCccchhhH Q lcl|NC_013692. 301 WAGVGKAVDPNDQVPMHESGGKYSV------FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATAD-RSDPYGEMGF 373 (399) Q Consensus 301 ~~~aGa~~~~~~~~~~~~~~~~~DV------Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad-~~DPlgQrg~ 373 (399) ......+ ..+ .....++.||| --.++|=++|-+++-+.. + |.| --|+=-|..+ T Consensus 249 ~a~~it~---~~l-s~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~----v------------T~~~~~d~r~~~~~ 308 (402) T protein:vir:97 249 FAQDQAH---HLL-SNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE----V------------TGDIFYEKKEKTYY 308 (402) T ss_pred ccccccc---ccc-ccCCCCccCCcCcccceeEEEEEecceEEEEEeec----c------------ccchhhchhHHHHH Confidence 2221111 101 01111334442 134455556666554421 1 112 1366678889 Q ss_pred HHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 374 MSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 374 ~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +=-|+.|+...+|++.-..+++.--- T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (402) T protein:vir:97 309 IDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) T ss_pred HHHHHHhCCcccCccceEEEEEeccc Confidence 99999999999999999988765411 No 133 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.35 E-value=8.3e-05 Score=42.94 Aligned_cols=275 Identities=15% Similarity=0.168 Sum_probs=133.1 Q ss_pred CC------CccccccccccCCCCCCcccccccceehhhh---hHHHHHHhhhHHhhhhcc--cccccCcCCCcEEEEEEc Q lcl|NC_013692. 1 MA------GPVDNIKPMKYNDPANGVESSIGPQIHTRYW---YKRALIDAAKEAYFGQLA--DTFSMPKHYGKEIVRLHY 69 (399) Q Consensus 1 ~~------~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~---~~k~L~~A~p~lv~~~~a--~~~~mPKn~GktIkfrry 69 (399) |- --|-..|.|.|.+. ++.| +++.. .-.+|........+.... ... .=-+.|++|++.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------~~~~--nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i 71 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANK------SVEP--GQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhcc------CCCc--chHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeee Confidence 32 22333555666332 2222 22222 134566665655555432 322 22357899999765 Q ss_pred cCCc-CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 70 IPLL-DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 70 ~pl~-~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) .--. .|-+.+ .|+++. + ++.+..+-+|.|-=.| .|. +++ T Consensus 72 ~~~gl~DY~R~-~g~~~g-----------------~-------------------vt~~~~t~tidqdR~~-~F~--VD~ 111 (319) T protein:vir:97 72 DTTELKDYKRN-ATNEFD-----------------H-------------------PKIEETTYFLDQEKYW-GRF--VDA 111 (319) T ss_pred cccccccccCC-CCcccC-----------------C-------------------cccceeEEEeeccccc-ccc--cch Confidence 5321 111111 123221 1 1223344455442112 222 222 Q ss_pred hhhch---hHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCc Q lcl|NC_013692. 149 FDSDP---AMEGHVTTEMVKGANEI---TEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAP 222 (399) Q Consensus 149 t~~D~---~L~~~i~~el~~~~~~~---t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrAp 222 (399) .|.|. .| .+...+.+.+.+. ..|..+...|.++.. .....+++. .=.++.|+.+...|++++.| T Consensus 112 ~D~~Etn~~l--~a~~i~~~~~~~~v~PEiDay~~skla~~a~------~~~~~~~t~--~n~y~~i~~a~~~Lde~~VP 181 (319) T protein:vir:97 112 LDRKDTEGNI--DINYVVARQGAEVVAPYLDNLRFATLARNKA------KHLTVGTGS--DAQYDAVLDVSVELDEIKAP 181 (319) T ss_pred hhHhhhhchh--hHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc------cccccccCH--HHHHHHHHHHHHHHHhcCCC Confidence 22221 12 1122222222221 233344444533211 111122222 22478899999999998874 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) ..+|+||.|+.-..|+. ++.|+.....+. +.+.+|-||++++|.++++|.-.- T Consensus 182 ------------------~~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG~~Vi~vps~~~-- 234 (319) T protein:vir:97 182 ------------------ENRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDGFVIVKVPTKLL-- 234 (319) T ss_pred ------------------CCcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecCeEEEEeccccc-- Confidence 36999999999998874 688998877765 567899999999999999874110 Q ss_pred cCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHH--HHHH Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSI--KWYY 380 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gw--K~~~ 380 (399) . -+.++++-+.|... ..|-+ +.++.-..|+. +| -.| ..|| T Consensus 235 ------------------k---~in~i~~h~~A~~~-~~k~~----~~~~~~p~~~~-----------~a-~~v~gr~y~ 276 (319) T protein:vir:97 235 ------------------Q---GLQAIAVVGEVLAS-PIQAD----LAKTNSNIPGM-----------FG-TLAEQLLYT 276 (319) T ss_pred ------------------c---cceEEEEcCCeeee-eeeee----eeeccCCCccc-----------cc-eeeeeeeee Confidence 0 12345554544431 11111 11111112321 11 122 3789 Q ss_pred HHhhccccceEEE--EEecCC Q lcl|NC_013692. 381 GFMVFRPEWIALL--KTVARL 399 (399) Q Consensus 381 ~~~iLn~~~m~~i--et~A~~ 399 (399) ++.++++.-..+. ...+|- T Consensus 277 d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:97 277 GAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred eeEEeccccceEEEeecCCcc Confidence 9999988754443 333333 No 134 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.35 E-value=8.3e-05 Score=42.94 Aligned_cols=275 Identities=15% Similarity=0.168 Sum_probs=133.1 Q ss_pred CC------CccccccccccCCCCCCcccccccceehhhh---hHHHHHHhhhHHhhhhcc--cccccCcCCCcEEEEEEc Q lcl|NC_013692. 1 MA------GPVDNIKPMKYNDPANGVESSIGPQIHTRYW---YKRALIDAAKEAYFGQLA--DTFSMPKHYGKEIVRLHY 69 (399) Q Consensus 1 ~~------~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~---~~k~L~~A~p~lv~~~~a--~~~~mPKn~GktIkfrry 69 (399) |- --|-..|.|.|.+. ++.| +++.. .-.+|........+.... ... .=-+.|++|++.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------~~~~--nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~-~e~~gg~tVkIp~i 71 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANK------SVEP--GQTLLKNKHVGILERVTAVNAYSTPALISND-AIFMEGRSFTVMKG 71 (319) T ss_pred CCcccccccceeEeehhhhhcc------CCCc--chHHHHHHHHHHHHHHHHHhhhhhhcccCcc-eEeccCcEEEEeee Confidence 32 22333555666332 2222 22222 134566665655555432 322 22357899999765 Q ss_pred cCCc-CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhh Q lcl|NC_013692. 70 IPLL-DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLD 148 (399) Q Consensus 70 ~pl~-~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~ 148 (399) .--. .|-+.+ .|+++. + ++.+..+-+|.|-=.| .|. +++ T Consensus 72 ~~~gl~DY~R~-~g~~~g-----------------~-------------------vt~~~~t~tidqdR~~-~F~--VD~ 111 (319) T protein:vir:94 72 DTTELKDYKRN-ATNEFD-----------------H-------------------PKIEETTYFLDQEKYW-GRF--VDA 111 (319) T ss_pred cccccccccCC-CCcccC-----------------C-------------------cccceeEEEeeccccc-ccc--cch Confidence 5321 111111 123221 1 1223344455442112 222 222 Q ss_pred hhhch---hHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCc Q lcl|NC_013692. 149 FDSDP---AMEGHVTTEMVKGANEI---TEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAP 222 (399) Q Consensus 149 t~~D~---~L~~~i~~el~~~~~~~---t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrAp 222 (399) .|.|. .| .+...+.+.+.+. ..|..+...|.++.. .....+++. .=.++.|+.+...|++++.| T Consensus 112 ~D~~Etn~~l--~a~~i~~~~~~~~v~PEiDay~~skla~~a~------~~~~~~~t~--~n~y~~i~~a~~~Lde~~VP 181 (319) T protein:vir:94 112 LDRKDTEGNI--DINYVVARQGAEVVAPYLDNLRFATLARNKA------KHLTVGTGS--DAQYDAVLDVSVELDEIKAP 181 (319) T ss_pred hhHhhhhchh--hHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc------cccccccCH--HHHHHHHHHHHHHHHhcCCC Confidence 22221 12 1122222222221 233344444533211 111122222 22478899999999998874 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) ..+|+||.|+.-..|+. ++.|+.....+. +.+.+|-||++++|.++++|.-.- T Consensus 182 ------------------~~Rvl~Vtp~~~~~L~~------~~~f~~~~~~~~-~~~~~g~Vg~idG~~Vi~vps~~~-- 234 (319) T protein:vir:94 182 ------------------ENRVLFVSPTFYKGIKK------FVIALPQGDTRQ-QVLGKGVQGELDGFVIVKVPTKLL-- 234 (319) T ss_pred ------------------CCcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeceeecCeEEEEeccccc-- Confidence 36999999999998874 688998877765 567899999999999999874110 Q ss_pred cCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHH--HHHH Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSI--KWYY 380 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gw--K~~~ 380 (399) . -+.++++-+.|... ..|-+ +.++.-..|+. +| -.| ..|| T Consensus 235 ------------------k---~in~i~~h~~A~~~-~~k~~----~~~~~~p~~~~-----------~a-~~v~gr~y~ 276 (319) T protein:vir:94 235 ------------------Q---GLQAIAVVGEVLAS-PIQAD----LAKTNSNIPGM-----------FG-TLAEQLLYT 276 (319) T ss_pred ------------------c---cceEEEEcCCeeee-eeeee----eeeccCCCccc-----------cc-eeeeeeeee Confidence 0 12345554544431 11111 11111112321 11 122 3789 Q ss_pred HHhhccccceEEE--EEecCC Q lcl|NC_013692. 381 GFMVFRPEWIALL--KTVARL 399 (399) Q Consensus 381 ~~~iLn~~~m~~i--et~A~~ 399 (399) ++.++++.-..+. ...+|- T Consensus 277 d~~V~~~k~~~Iy~~~~~~~~ 297 (319) T protein:vir:94 277 GAFVPEHLQKYIFTIGGTEVA 297 (319) T ss_pred eeEEeccccceEEEeecCCcc Confidence 9999988754443 333333 No 135 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.23 E-value=9.9e-05 Score=42.53 Aligned_cols=282 Identities=15% Similarity=0.093 Sum_probs=139.0 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) +.+...--+.+ +..+.+..+-+-|+ .|....+....+...+.+++.+.+|+.+.++-... +... ++ T Consensus 99 lr~~~~~~~~~--~~~t~~~gg~~vP~----~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~-----~~-- 164 (389) T protein:vir:10 99 IHSHGKVIDAT--SKVTSTEAGVLIPE----EIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPIL-KRAT-----DR-- 164 (389) T ss_pred hhcchhhhhhh--cccccCCcceeehH----HHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEE-ecCC-----Cc-- Confidence 11111000011 01111111122333 34566666677788888999999998776542222 1110 00 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ..+. +|.|.+...-..++..|+.++++++.++.+|+++.+ +++..|...|. T Consensus 165 ------~~~~----------------------~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~-ds~~~l~~~i~ 215 (389) T protein:vir:10 165 ------FSSV----------------------AELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIA-DSAVDLTALVG 215 (389) T ss_pred ------cccc----------------------cccccccccccccceeeeeeheeeEeeehhhHHHHh-hhhHHHHHHHH Confidence 0111 122222222234678899999999999999999876 45656766665 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+ ..++....+.+..++. ..+ ....++++|..+.. .|+. + + T Consensus 216 ~~la~~~~-~~~~~~i~~g~~~~~~----------~~~--~~~~~~d~l~~~~~~~~~~--~------------~----- 263 (389) T protein:vir:10 216 QSIKEKSV-NTYNAMIAPVLQSFTA----------KKT--TTDTLVDSLKHILNVDLDP--A------------Y----- 263 (389) T ss_pred HHHHHHHH-HHHHHHHhhhhccccc----------ccc--cccccHHHHHHHHHhhhhh--h------------h----- Confidence 55544333 3444332232222211 111 23456777766543 2221 1 0 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .+ +.+||+....-|+.|+|-.+.|=|.|-- .+. .-.+..+++-|+.+++++...+=.++ T Consensus 264 ~a--~~~~n~~~~~~L~~lkd~~G~~i~~~~~--~~~--~~~~~~~~l~G~pV~~~~~~~~~~~~--------------- 322 (389) T protein:vir:10 264 SR--ALVVTQSLFNTLDTLKDKNGRYLLHDAS--DSI--TDGTAKGTILGVPVYVVGDTLLGSLA--------------- 322 (389) T ss_pred Cc--EEEecHHHHHHHHHhhccCCCeeeecCc--ccc--cccccccccccceeEEecccccCCCC--------------- Confidence 11 3579999999999999877777775432 111 11234567888888776642210000 Q ss_pred CccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_013692. 320 GGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVA 397 (399) Q Consensus 320 ~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A 397 (399) ++ ..++||.= +|- +..+ +.+.+. +.. .+.+. .++.+. +.+.+.+++++=++.++..+ T Consensus 323 -~~----~~~~~gd~~~~~~-~~~~---~~~~i~--~~~--------~~~~~-~~~~~~-~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 323 -GD----QKAFVGDLKRGVL-FTDR---QQVTLA--WED--------SKIYG-KYLGAA-FRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred -Cc----eEEEEeeccccEE-EEee---cceEEE--eec--------ccccc-ceEEEE-EEeccEEecccceEEEEeec Confidence 11 13577742 232 2222 111111 111 12222 222211 46677777777777776544 Q ss_pred CC Q lcl|NC_013692. 398 RL 399 (399) Q Consensus 398 ~~ 399 (399) .- T Consensus 382 ~~ 383 (389) T protein:vir:10 382 VP 383 (389) T ss_pred cC Confidence 33 No 136 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.20 E-value=0.00013 Score=41.83 Aligned_cols=205 Identities=13% Similarity=0.117 Sum_probs=97.9 Q ss_pred EEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCce-eeec-cccccccccCCcceec- Q lcl|NC_013692. 129 IKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGT-VRYP-GAATSDAEVDATTEVT- 205 (399) Q Consensus 129 i~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~-V~YA-g~aTsra~v~~~~~vt- 205 (399) |..-|-- -.+=|.++..-..-.|+.+++.|++..=+.....-+.+-+.+++.. --+. +.+.....+..+...+ T Consensus 1 iD~lL~a----~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~ 76 (221) T protein:vir:17 1 MDDLLVA----SQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNA 76 (221) T ss_pred CCcchhH----HHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCH Confidence 1110000 0011122222222224444444444443333322222222222211 1011 0111111222222222 Q ss_pred ---HHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccc Q lcl|NC_013692. 206 ---YDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHG 282 (399) Q Consensus 206 ---~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~g 282 (399) ++.|+.|...|+++..|. ..+++++.|+.-+.|-.- .|+-+......+..+.+.+| T Consensus 77 ~~l~dai~~a~~~LdekdVP~-----------------~gR~~vv~P~~y~~LL~~----~d~~~~n~d~~~s~g~~~~g 135 (221) T protein:vir:17 77 QAIVDGFFEAAAVLDERSAPM-----------------DGRVAVLSPRQYYSLISS----VDTNILNREIGNTQGDMNTG 135 (221) T ss_pred HHHHHHHHHHHHHHhhcCCCC-----------------CCCEEEeCcHHHHHHHHh----cCcceeeeeccccccccccc Confidence 466777888899999883 459999999887777641 25667777666777777888 Q ss_pred -cceeEcCeEEEecCcccccccCCcccCCcccccc--cccCccc----eeEEEEEEccccceecccccCCCCCcceEE-- Q lcl|NC_013692. 283 -EVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPM--HESGGKY----SVFPMLCVASEAFTTVGFATDGKNVKFKII-- 353 (399) Q Consensus 283 -EIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~--~~~~~~~----DVYp~lV~G~~Afg~v~l~~~g~~~k~~~i-- 353 (399) |||++.+||+++++++-.-. |.....++.... ....++| +==--||+=.+|-|+|-|=+--..++..|- T Consensus 136 ~~i~~v~G~~V~~SnnlP~~~--gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~~~~~~ 213 (221) T protein:vir:17 136 KGLYVNAGIRIYKSNVLASLY--GTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPLVISMF 213 (221) T ss_pred ceeeeecCcEEEEeccCCccc--ccccccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCCceeeeee Confidence 79999999999999864322 211111111100 0111122 222368899999999999533222222111 Q ss_pred -EecCCCcCCCCC Q lcl|NC_013692. 354 -TKRPGEATADRS 365 (399) Q Consensus 354 -vk~pG~~tad~~ 365 (399) +.+| |+- T Consensus 214 ~~~~~-----~~~ 221 (221) T protein:vir:17 214 SIRRP-----DRR 221 (221) T ss_pred eccCC-----CCC Confidence 2233 322 No 137 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.13 E-value=4.9e-05 Score=44.21 Aligned_cols=279 Identities=12% Similarity=0.083 Sum_probs=143.9 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ..-|.+..........+.++-+.+-|. -+..++.++...+..++.+++ .+.+|-..|+ +++=+. + T Consensus 345 ~~~~~~~l~~ra~~~~t~~~gg~lvp~---~~~~~~iie~lr~~s~i~~l~-~~~~~~~~g~-~~ip~~----------~ 409 (632) T protein:vir:96 345 FYMPHEVLVQRQLEKKTAGKGGELVAT---ELLSEEFIDILRNKAIIGQMG-ARMLPGLVGD-VDIPKK----------T 409 (632) T ss_pred hhhhHHHHHHhhhhccccccccccccc---ccchHHHHHHHhhcchhhhhc-ceEeecCCcc-eEEEEE----------e Confidence 000111110111111121111223332 122344455556777788873 4456766664 333221 1 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) .| |...+.. +|+.+..-.+++..++.+.++++.++.+|+++.+. +++.|.+.|. T Consensus 410 ~~--~~a~wv~-----------------------E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d-s~~~~~~~i~ 463 (632) T protein:vir:96 410 SG--ANFYWIG-----------------------EDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ-SSIHVENLIR 463 (632) T ss_pred CC--ceeEeec-----------------------CCccccccccceeeEEeeeeEEEEehhhHHHHHhc-cchHHHHHHH Confidence 11 1222222 22222333467889999999999999999998764 4666877777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCc-e------eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAG-T------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRM 233 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt-~------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~ 233 (399) .+|...-+...+. .+|++-+ . +.+++. ...+. ....+++++|..+...|...++.. T Consensus 464 ~~l~~a~~~~~d~----a~l~G~G~~~~p~Gi~~~~~~--~~~~~-~~~~~~~~~i~~~~~~i~~~~~~~---------- 526 (632) T protein:vir:96 464 EDLIEGIGVALDL----AMLTGTGLANDPVGLLNMTGV--PALTY-PAGGVDWASVVDMETKISTFNADA---------- 526 (632) T ss_pred HHHHHHHHHHHHH----HhhcccCCCCccceeeecccc--cceec-ccccCCHHHHHHHHHHHhhccccc---------- Confidence 7776666553333 3443322 1 112211 10111 124468888888877776655421 Q ss_pred cCcccccCeeEEEechhhhHHHHHHh--hhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCc Q lcl|NC_013692. 234 IDTRTVGNARALYVGSDLVPTIEAMK--DNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPN 311 (399) Q Consensus 234 ~gT~~I~~~yv~~~h~dl~~di~d~~--~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~ 311 (399) -. -+-+|||.....++.+. |- +..+|+. -|.+.+.+++.+..+. ++ T Consensus 527 -----~~--~~~~~~~~~~~~l~~~~l~d~-------------~G~~i~~--~~~l~G~pv~~s~~ip----~~------ 574 (632) T protein:vir:96 527 -----GR--LAYLTSVTQRGAAKKAQVFDN-------------TGERIWQ--NNEVNGYRAEASNQIP----AD------ 574 (632) T ss_pred -----Cc--cEEEEchhHHHHHHHHhccCC-------------CCceeec--CCeecccceEeccccc----cC------ Confidence 01 23357888877777532 32 2333443 2678888888876542 00 Q ss_pred ccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceE Q lcl|NC_013692. 312 DQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIA 391 (399) Q Consensus 312 ~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~ 391 (399) .+++|.-+.--++..+ + +++.+.+=- .---|+..+..| +.+.+.+.+++.++ T Consensus 575 ----------------~~~~gd~s~~~i~~~~-~----~~i~~~~~~------~~~~~~v~~~~~-~~~d~~v~~~~af~ 626 (632) T protein:vir:96 575 ----------------TWIFGDWSQIVIAMWG-V----LDLKVDPYT------KAASDGLVLRVF-QDVDAGVRRKEAFC 626 (632) T ss_pred ----------------cEEEeecceEEEEEec-c----eEEEEcccc------ccccCceEEEEE-eecCceeechhhhh Confidence 1567776655555542 1 333332210 001233333333 45788999999999 Q ss_pred EEEEec Q lcl|NC_013692. 392 LLKTVA 397 (399) Q Consensus 392 ~iet~A 397 (399) .++.+| T Consensus 627 ~~k~~A 632 (632) T protein:vir:96 627 IAKKGA 632 (632) T ss_pred heeecC Confidence 999999 No 138 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=96.84 E-value=0.0003 Score=39.87 Aligned_cols=284 Identities=13% Similarity=0.045 Sum_probs=143.8 Q ss_pred CCC-cccccccccc----CCCCCC-cccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAG-PVDNIKPMKY----NDPANG-VESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~-~~~~~~~~~~----n~~~~~-~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) .++ .........+ .....+ ..+-+-|+ .+ ..+.+....+...+.+++.+.+++ |++ ++-+.. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~---~~-~~~Ii~~l~~~~~i~~~~~~~~~~---g~~-~ip~~~---- 187 (425) T protein:vir:95 120 KTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPE---VV-VNRIMDIMGDYTTLYPLVDKIRVK---GTT-RILVDT---- 187 (425) T ss_pred hhhhhhhhhHHHHHHHHHHhhcccccCceeccH---HH-HHHHHHHHHhhhhHHHhhceeecC---cee-EEEEec---- Confidence 000 0000000000 010111 11113343 23 455556666777777888777774 432 332211 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeecc-ceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVG-FKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~-~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) -.|.+.+.+.| +.+.-.. .++..|+-+.++++.++.+|+++.+- ++. T Consensus 188 --------~~~~a~~v~E~-----------------------~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~d-s~~ 235 (425) T protein:vir:95 188 --------DTSPATWIEQS-----------------------GALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQD-SII 235 (425) T ss_pred --------CCccccccccc-----------------------cccccccccccceeeeeheeeeeeehhhHHHHhc-cHH Confidence 11222333222 1111111 25667899999999999999997663 333 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc---ee---eeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAG---TV---RYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt---~V---~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) .|...|..+|...-+. .+-..+|++-+ +. ++++.....+........++++|.++...+..... T Consensus 236 ~l~~~i~~~l~~~i~~----~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 305 (425) T protein:vir:95 236 NLDDYVTKKIARAIAK----ALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDD------ 305 (425) T ss_pred HHHHHHHHHHHHHHHH----HHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhcc------ Confidence 4666666666555544 33335554322 10 23321111122233467788888888765544332 Q ss_pred eccccccCcccccCeeEEEechhh-hHHH---HHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDL-VPTI---EAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl-~~di---~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) ....+ +.++|+.. ...| +-++|--+.+=|. .-.++.+.+-+.++|.++.|. T Consensus 306 ----------~~~~~-~~v~~~~~~~~~l~~l~~~kd~~g~~i~~----------~~~~~~~~l~G~pvv~~~~~~---- 360 (425) T protein:vir:95 306 ----------SVGEI-VAVMKRSTYYNRLVEFSIQVDSNGNVVGK----------LPNLRTPDLLGLRVVFNNFLD---- 360 (425) T ss_pred ----------ccCce-EEEEeChHHHHHHHHHHhhcCCCCceeec----------cCCCCCccccceeeEEcCcCC---- Confidence 12333 34666653 3334 3333322222221 113456677888888887541 Q ss_pred CCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHH Q lcl|NC_013692. 304 VGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYG 381 (399) Q Consensus 304 aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~ 381 (399) ++ .++||.-++-.++.. +. +.+-+ +.|.+-.++..+++ .++. T Consensus 361 ~~----------------------~i~~Gd~~~~~~~~~---~~--~~i~~---------~~~~~f~~~~~~~~~~~r~d 404 (425) T protein:vir:95 361 DD----------------------TVLFGEFEQYTLVER---EN--ITIDS---------STHVKFTEDQTAFRGKGRFD 404 (425) T ss_pred Cc----------------------cEEEEecccEEEEee---cc--eEEEe---------ecccccccCceEEEEEEeeC Confidence 00 156776555555543 11 11111 23445566667766 4688 Q ss_pred HhhccccceEEEEEecCC Q lcl|NC_013692. 382 FMVFRPEWIALLKTVARL 399 (399) Q Consensus 382 ~~iLn~~~m~~iet~A~~ 399 (399) +.+.+++=.+.++...|+ T Consensus 405 ~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 405 GKPVKPEAFVLVTITDPV 422 (425) T ss_pred cEeecccceEEEEecCcC Confidence 999999999999999999 No 139 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=96.80 E-value=7.3e-05 Score=43.26 Aligned_cols=316 Identities=15% Similarity=0.123 Sum_probs=147.5 Q ss_pred CCCcc---------------------cccccc-ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCc Q lcl|NC_013692. 1 MAGPV---------------------DNIKPM-KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPK 58 (399) Q Consensus 1 ~~~~~---------------------~~~~~~-~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPK 58 (399) +..+. +....+ ..-.-..+++++-+.-+-+-+ ..+.+....+...+.+++.+.+++. T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~-~~~ii~~~~~~~~i~~l~~~~~~~~ 192 (497) T protein:vir:78 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRPVTS 192 (497) T ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhh-hHHHHHHHHhhhhHHhhccccccCC Confidence 00000 000000 000001122222222222223 4566666778888899999888865 Q ss_pred CCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecc Q lcl|NC_013692. 59 HYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGF 138 (399) Q Consensus 59 n~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~ 138 (399) + .+++-+-. +.++...++. +|+....-.+++..|+.+.++++. T Consensus 193 ~---~~~~~~~~-----------~~~~~a~wv~-----------------------E~~~~~~s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:78 193 P---NLSYLTES-----------AAHNNAAAVA-----------------------EAGTYPFSSEEFARVYEQVGKVAN 235 (497) T ss_pred C---ceEEEEEc-----------CCCCcceeec-----------------------cCcccccccccceeeEeeeeeeEe Confidence 4 34442211 1112222322 223333344678899999999999 Q ss_pred eehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee------eccc-------cccccc-------- Q lcl|NC_013692. 139 FREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVR------YPGA-------ATSDAE-------- 197 (399) Q Consensus 139 ~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~------YAg~-------aTsra~-------- 197 (399) ++.+|+++.+ +. +.|...|..+|...-+. .+ -..+|++.++-. .++. .+.... T Consensus 236 ~~~iS~ell~-d~-~~l~~~i~~~l~~~i~~-~~---d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (497) T protein:vir:78 236 ALTITDEGLR-DA-PELFNFVQGRLLEGIQR-KE---EVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVK 309 (497) T ss_pred ecHhHHHHHH-hH-HHHHHHHHHHHHHHHHH-HH---HHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhh Confidence 9999999887 44 45766666666554443 22 334554433210 0100 000000 Q ss_pred ----cCCcceecHHHHHHHHHHHHhccCccccceeccccccCc----------ccc------cCeeEEEechhhhHHHHH Q lcl|NC_013692. 198 ----VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDT----------RTV------GNARALYVGSDLVPTIEA 257 (399) Q Consensus 198 ----v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT----------~~I------~~~yv~~~h~dl~~di~d 257 (399) .+....+..+.+..+......+-.+..-..+.++...+. ..+ +++ +.++||....-|+. T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~ 388 (497) T protein:vir:78 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRL 388 (497) T ss_pred hhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHH Confidence 000011111112211111111111000000000000000 000 111 46799999999999 Q ss_pred HhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEccccce Q lcl|NC_013692. 258 MKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFT 337 (399) Q Consensus 258 ~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg 337 (399) ++|-.+.+-|.+.-.-.... ..+.-+++-++|+++++.|. +| + ++||.=+.+ T Consensus 389 lkd~~G~~i~~~~~~~~~~~--~~~~~~~l~G~pV~~t~~~~----~~----------------~------~~~Gd~~~~ 440 (497) T protein:vir:78 389 TKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIP----LG----------------T------ILVGHFAPS 440 (497) T ss_pred hhcCCCceeccCcccccccc--cccCCceeeceeeEecCCCC----CC----------------c------eEEeecccc Confidence 98877777776543322222 23344578899999998652 11 0 133432211 Q ss_pred ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEEecCC Q lcl|NC_013692. 338 TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 338 ~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~iet~A~~ 399 (399) .+.+- +... +++.+-+- ..++-+++.+++++ .+++.+++++-+++++..+.. T Consensus 441 ~~~i~-~r~~--~~v~~~~~-------~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 441 VIQTA-RREG--VTMQMTNS-------NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred eEEEE-Eecc--cEEEeecc-------cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 11110 1111 22222111 12345677777764 478899999999999999988 No 140 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=96.80 E-value=7.3e-05 Score=43.26 Aligned_cols=316 Identities=15% Similarity=0.123 Sum_probs=147.5 Q ss_pred CCCcc---------------------cccccc-ccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCc Q lcl|NC_013692. 1 MAGPV---------------------DNIKPM-KYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPK 58 (399) Q Consensus 1 ~~~~~---------------------~~~~~~-~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPK 58 (399) +..+. +....+ ..-.-..+++++-+.-+-+-+ ..+.+....+...+.+++.+.+++. T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~-~~~ii~~~~~~~~i~~l~~~~~~~~ 192 (497) T protein:vir:10 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTF-LPGIVEQLFYELSLADLISSRPVTS 192 (497) T ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhh-hHHHHHHHHhhhhHHhhccccccCC Confidence 00000 000000 000001122222222222223 4566666778888899999888865 Q ss_pred CCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecc Q lcl|NC_013692. 59 HYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGF 138 (399) Q Consensus 59 n~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~ 138 (399) + .+++-+-. +.++...++. +|+....-.+++..|+.+.++++. T Consensus 193 ~---~~~~~~~~-----------~~~~~a~wv~-----------------------E~~~~~~s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:10 193 P---NLSYLTES-----------AAHNNAAAVA-----------------------EAGTYPFSSEEFARVYEQVGKVAN 235 (497) T ss_pred C---ceEEEEEc-----------CCCCcceeec-----------------------cCcccccccccceeeEeeeeeeEe Confidence 4 34442211 1112222322 223333344678899999999999 Q ss_pred eehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee------eccc-------cccccc-------- Q lcl|NC_013692. 139 FREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGTVR------YPGA-------ATSDAE-------- 197 (399) Q Consensus 139 ~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~------YAg~-------aTsra~-------- 197 (399) ++.+|+++.+ +. +.|...|..+|...-+. .+ -..+|++.++-. .++. .+.... T Consensus 236 ~~~iS~ell~-d~-~~l~~~i~~~l~~~i~~-~~---d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (497) T protein:vir:10 236 ALTITDEGLR-DA-PELFNFVQGRLLEGIQR-KE---EVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVK 309 (497) T ss_pred ecHhHHHHHH-hH-HHHHHHHHHHHHHHHHH-HH---HHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhh Confidence 9999999887 44 45766666666554443 22 334554433210 0100 000000 Q ss_pred ----cCCcceecHHHHHHHHHHHHhccCccccceeccccccCc----------ccc------cCeeEEEechhhhHHHHH Q lcl|NC_013692. 198 ----VDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDT----------RTV------GNARALYVGSDLVPTIEA 257 (399) Q Consensus 198 ----v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT----------~~I------~~~yv~~~h~dl~~di~d 257 (399) .+....+..+.+..+......+-.+..-..+.++...+. ..+ +++ +.++||....-|+. T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~ 388 (497) T protein:vir:10 310 FPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN-AVVMNPRDWELLRL 388 (497) T ss_pred hhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC-eEEEchHHHHHHHH Confidence 000011111112211111111111000000000000000 000 111 46799999999999 Q ss_pred HhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCccceeEEEEEEccccce Q lcl|NC_013692. 258 MKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFT 337 (399) Q Consensus 258 ~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg 337 (399) ++|-.+.+-|.+.-.-.... ..+.-+++-++|+++++.|. +| + ++||.=+.+ T Consensus 389 lkd~~G~~i~~~~~~~~~~~--~~~~~~~l~G~pV~~t~~~~----~~----------------~------~~~Gd~~~~ 440 (497) T protein:vir:10 389 TKDANGQYMGGNFFGNAYGN--PVNGGKNIWGVPVVTTPLIP----LG----------------T------ILVGHFAPS 440 (497) T ss_pred hhcCCCceeccCcccccccc--cccCCceeeceeeEecCCCC----CC----------------c------eEEeecccc Confidence 98877777776543322222 23344578899999998652 11 0 133432211 Q ss_pred ecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHH--HHHHhhccccceEEEEEecCC Q lcl|NC_013692. 338 TVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKW--YYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 338 ~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~--~~~~~iLn~~~m~~iet~A~~ 399 (399) .+.+- +... +++.+-+- ..++-+++.+++++ .+++.+++++-+++++..+.. T Consensus 441 ~~~i~-~r~~--~~v~~~~~-------~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 441 VIQTA-RREG--VTMQMTNS-------NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred eEEEE-Eecc--cEEEeecc-------cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 11110 1111 22222111 12345677777764 478899999999999999988 No 141 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=96.38 E-value=0.00068 Score=37.96 Aligned_cols=309 Identities=13% Similarity=0.105 Sum_probs=157.4 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.-| ||.....++ + .++. -.+..--|.-+.+..=+..-++..+=.+|. -..||+..|-|---. ...-++ T Consensus 1 Ms~~-n~~t~p~~~----g-sg~~-~aL~Le~f~GeV~taF~~~si~~~~~~vRt--I~~gkS~qf~~lG~s--~a~y~~ 69 (400) T protein:vir:10 1 MSTP-NNLTNVAVS----A-SGEV-DSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGET--ELQVLA 69 (400) T ss_pred CCCC-ccccccccc----c-ccch-hhhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeee--EEeeec Confidence 3322 222122221 0 1111 011222334455555444555556666775 467788888654211 111222 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hchhHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SDPAMEGHV 159 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D~~L~~~i 159 (399) .|-.+.|+.+.+. | +...|+.-|--.=+++.|-|...+.+ -+.++ T Consensus 70 pG~~ldg~~~~~d--------------k----------------~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~---- 115 (400) T protein:vir:10 70 PGQSPAATSTQAD--------------K----------------NQLVIDATVIARNTVAHLHDVQGDIDSLKPKL---- 115 (400) T ss_pred CCCCcCCCCcccC--------------c----------------EEEEeCceeeecchhhhHHHHhhccccccHHH---- Confidence 3333333332211 1 22344555555556667766666665 35444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC-cee-e---eccccc--ccccc---CCcceecHHHHHH----HHHHHHhccCcccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSA-GTV-R---YPGAAT--SDAEV---DATTEVTYDSLMR----LRLDLDNARAPTKI 225 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~ag-t~V-~---YAg~aT--sra~v---~~~~~vt~~~lr~----a~~~Lk~nrApk~T 225 (399) +.|++..=+......+.+-++.|+ .+. . ..++.. ....+ +.+..++...|.. |...|.++..| T Consensus 116 s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP--- 192 (400) T protein:vir:10 116 ATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVD--- 192 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCC--- Confidence 556666555544333333333232 210 0 111100 01112 2223345455554 44445555553 Q ss_pred ceeccccccCcccccCe-eEEEechhhhHHHHHHhhhcCCCCceehhhcC--CccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 226 KMITGTRMIDTRTVGNA-RALYVGSDLVPTIEAMKDNHGNPAFIPIEKYA--AGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 226 ~ii~~s~~~gT~~I~~~-yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg--~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) .+ ++.++.|+--.-|++ .+.+++.. |+ .....-.|+|.++.|||+|+++++--.+ T Consensus 193 ---------------~~d~vvl~pp~~Ys~Ll~------~dkLvnrd-f~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a 250 (400) T protein:vir:10 193 ---------------ISDVAILMPWRYFNVLRD------ADRIVDKS-YTISQSGATIQGFVLSSYNCPVIPSNRFPKYS 250 (400) T ss_pred ---------------ccceEEEcCHHHHHHHHh------CCcccchh-ccccCCCccccceEEEEeceEEEeeCcCCccc Confidence 23 777888887777775 34455554 44 3466789999999999999999864322 Q ss_pred cCCcccCCcccccccccCcccee------EEEEEEccccceecccccCCCCCcceEEEecCCCcCCC-CCCccchhhHHH Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSV------FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATAD-RSDPYGEMGFMS 375 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DV------Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad-~~DPlgQrg~~g 375 (399) +..+..- + +....|..||| --.|+|=.+|-+++-+. .+ +.+ --||=-|..++= T Consensus 251 ~~~~~~~---l-S~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~----~l------------t~~~~~d~r~~~~~id 310 (400) T protein:vir:10 251 QGQKHHL---L-SNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSI----DV------------IGDIFYEKKEKTYYID 310 (400) T ss_pred Ccccccc---c-ccCCCCccCCccccccceeEEEEehhheEEEEee----cc------------ccccccchhhHHHHHH Confidence 2111100 0 01112333442 23556666666654442 11 111 147778899999 Q ss_pred HHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 376 IKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 376 wK~~~~~~iLn~~~m~~iet~A~~ 399 (399) -|+.|+...+|++.-..++|+-.- T Consensus 311 ~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 311 TFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred HHHHhCCcccchhheEEEEecCCc Confidence 999999999999999999997554 No 142 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=96.31 E-value=0.00075 Score=37.70 Aligned_cols=279 Identities=14% Similarity=0.134 Sum_probs=127.4 Q ss_pred CCC-----------ccc------cccccccCCCCCCcccccccceehhhhhHH---HHHHhhhHHhhhhc--ccccccCc Q lcl|NC_013692. 1 MAG-----------PVD------NIKPMKYNDPANGVESSIGPQIHTRYWYKR---ALIDAAKEAYFGQL--ADTFSMPK 58 (399) Q Consensus 1 ~~~-----------~~~------~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k---~L~~A~p~lv~~~~--a~~~~mPK 58 (399) |-| .+. ..|.|.|.+- ++ +-|++=..+| +|........+... +. ...=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~--~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N-~~~e~ 71 (329) T protein:vir:10 1 MDGIFITGVKTMNKEIKNATGKLKLNLQHFANK------SV--EPGDTLLKNKHVGILEKVTAANSYSAPAVIS-NDAIF 71 (329) T ss_pred CCceEEechhhhhhhhhcccceeEEehhhhcCC------cc--CCchhHHHHHHHHHHHHHHHhhceeeeeecc-cceee Confidence 222 111 1344555322 12 2233332333 23222221112211 12 22235 Q ss_pred CCCcEEEEEEccCCc-CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeec Q lcl|NC_013692. 59 HYGKEIVRLHYIPLL-DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYG 137 (399) Q Consensus 59 n~GktIkfrry~pl~-~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG 137 (399) +.|++|++.+-.-.. .|-+.. .|.++. ++ +.+..+-+|.| . T Consensus 72 ~~g~tVkIp~i~~~gl~DY~R~-~g~~~g-----------------~v-------------------t~~~~t~tidq-d 113 (329) T protein:vir:10 72 MQGRSFTVIKGDVTELKDYKRN-ATNEFD-----------------HP-------------------QIQETTYFLDQ-E 113 (329) T ss_pred ccCcEEEEeeecccccccccCC-CCcccc-----------------cc-------------------ccceeEEEeec-c Confidence 689999997654211 111111 122221 11 22334445544 2 Q ss_pred ceehhhhhhhhhhhch---hH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHH Q lcl|NC_013692. 138 FFREYTQEQLDFDSDP---AM-EGHVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLR 213 (399) Q Consensus 138 ~~~e~Td~~~~t~~D~---~L-~~~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~ 213 (399) -+..|. +++.|.|. .| ...+.+|....+..-..|..+...|.++ |+ ..+...++.. =.++.|+.+. T Consensus 114 R~~~F~--VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~-----a~-~~~~~~~t~~--nay~~i~~a~ 183 (329) T protein:vir:10 114 KYWGRF--VDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARN-----KA-KHLTVGSGAD--AQYDAVLDVS 183 (329) T ss_pred cceeee--cchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhh-----cc-cccccccCHH--HHHHHHHHHH Confidence 223332 22222221 12 1112222222222222344444445322 11 1111222222 2477899999 Q ss_pred HHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEE Q lcl|NC_013692. 214 LDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVI 293 (399) Q Consensus 214 ~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV 293 (399) ..|+++..| ..+++||.|+.-..|+. ++.|+......+ +...+|.||++++|.++ T Consensus 184 ~~Lde~~vp------------------~~Rvl~VtP~~~~~Lk~------~~~f~~~~~~~~-~~~~~g~Vg~idG~~Ii 238 (329) T protein:vir:10 184 VELDEIGAG------------------ASRILFVTPKFYKGIKK------FVIELPQGDNRQ-QVLGKGVQGELDGFTIV 238 (329) T ss_pred HHHHhcCCC------------------CCcEEEeCHHHHHHHHh------hhhhhccccccc-cceeeeeeeeecCeEEE Confidence 999987664 36999999999998875 688888766655 46789999999999999 Q ss_pred ecCcccccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhH Q lcl|NC_013692. 294 VNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGF 373 (399) Q Consensus 294 ~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~ 373 (399) ++|.-.- + =+.+|++-+.|...+ .|-+ +.++.-..|+. .+| . T Consensus 239 ~vps~~~-----------------k------~in~ii~~~~A~~~~-~K~~----~~~~~~p~~~~----~a~------~ 280 (329) T protein:vir:10 239 KVPSKML-----------------Q------GVEAMAVIGEVMASP-IQAN----EAKLNSNVPGM----FGT------L 280 (329) T ss_pred EecCCcc-----------------c------ceeEEEEcCCceeee-eeee----eeeeeCCCCcc----chh------e Confidence 9974110 0 124566666665521 1111 12111112221 111 1 Q ss_pred HHHHHHHHHhhccccceEEE--EEec-CC Q lcl|NC_013692. 374 MSIKWYYGFMVFRPEWIALL--KTVA-RL 399 (399) Q Consensus 374 ~gwK~~~~~~iLn~~~m~~i--et~A-~~ 399 (399) +--..||++.++++.-..+. ...| +. T Consensus 281 v~gr~yyd~~V~~~k~~~I~~~~~~a~~~ 309 (329) T protein:vir:10 281 AEQMLYTGAFVPEHLQKYIFTIGGKEVET 309 (329) T ss_pred eeeeeeeeeEEEccccCEEEEecccCccc Confidence 11137888999887744432 2222 22 No 143 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=96.25 E-value=0.00083 Score=37.48 Aligned_cols=298 Identities=13% Similarity=0.103 Sum_probs=134.2 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhcccc-cccCc---CCCcEEEEEEccCCcCCCccccCCCCcchhhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT-FSMPK---HYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANG 93 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~-~~mPK---n~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ng 93 (399) |.+ ++... .++=+ |+.|+.-+-.+||.+.+.+ ++.-. +.|.||..+ .|+. ... ..|-+- T Consensus 1 Ma~--~~~~~-lti~~-~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip--~p~~--~~~-~~G~~~-------- 63 (430) T protein:vir:21 1 MAL--NEGQI-VTLAV-DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMP--VEQE--SPT-QEGWDL-------- 63 (430) T ss_pred Ccc--ccchh-hHHHH-HHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEee--cccc--ccc-cccccc-------- Confidence 211 12222 22222 8888888888888886553 22222 788998543 3322 111 113221 Q ss_pred hhccccccccccccccccccccccceeeccceeEEEEEEee-eecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 94 NLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLE-KYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITE 172 (399) Q Consensus 94 n~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~-QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~ 172 (399) +++++-++| .-+.++|. |.+..++|+++-.. |. +.+++.+..+..... T Consensus 64 ------------t~~~~~~~e------------~~v~~~~~~~~~V~~~~~~kEl~---~~----~~~er~l~pAm~~LA 112 (430) T protein:vir:21 64 ------------TDKATGLLE------------LNVAVNMGEPDNDFFQLRADDLR---DE----TAYRRRIQSAARKLA 112 (430) T ss_pred ------------cCCCcccee------------eeEeEEEeeeccceEEeehhHhc---Ch----hhHHHHHHHHHHHHH Confidence 222333333 23455553 46777888855322 21 123445555555555 Q ss_pred HHHHHHHHh----cCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEec Q lcl|NC_013692. 173 DLLQIDLLN----SAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVG 248 (399) Q Consensus 173 d~l~~~~l~----agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h 248 (399) ..|-.++++ .+.+|.=...++..+ +. -..+++-.+-+.|..|.+|+ +.-+-+++- T Consensus 113 ~~Vd~dl~~~~~~~~~~v~~~~~~t~~~----~~-~~~~~~A~a~~~L~~~~vP~----------------~~~R~~~~~ 171 (430) T protein:vir:21 113 NNVELKVANMAAEMGSLVITSPDAIGTN----TA-DAWNFVADAEEIMFSRELNR----------------DMGTSYFFN 171 (430) T ss_pred HHHHHHHHHHhhhhhhccccccCCCCCC----CC-cchhhHHHHHHHHHHhcCCC----------------CCCcEEEeC Confidence 555555552 233332110111111 11 13455666777899999985 223677788 Q ss_pred hhhhHHHHHHhhhcCCCCceehhhcC--Ccccccccccee-EcCeEEE-ecCccccccc-CCccc--CCcc--------- Q lcl|NC_013692. 249 SDLVPTIEAMKDNHGNPAFIPIEKYA--AGGATMHGEVGQ-LGRFRVI-VNPQMMHWAG-VGKAV--DPND--------- 312 (399) Q Consensus 249 ~dl~~di~d~~~~~~~p~fi~v~kYg--~~~~i~~gEIG~-i~~~RfV-~~~~~~~~~~-aGa~~--~~~~--------- 312 (399) |+....+-+ .+...+.-+ ..+.+.+|+||+ +.+|+.+ .+...-+-.. .|+.. +.++ T Consensus 172 p~~~~~l~~--------~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv 243 (430) T protein:vir:21 172 PQDYKKAGY--------DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQL 243 (430) T ss_pred hHHHHHHhh--------hhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCceecccccccccccee Confidence 887776632 122221111 345567899997 9999865 4444433211 11111 1000 Q ss_pred ----------------------------------------------------cccccccCccceeEEEEE---------- Q lcl|NC_013692. 313 ----------------------------------------------------QVPMHESGGKYSVFPMLC---------- 330 (399) Q Consensus 313 ----------------------------------------------------~~~~~~~~~~~DVYp~lV---------- 330 (399) .+.....++.+.|||-|| T Consensus 244 ~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~ 323 (430) T protein:vir:21 244 DNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPE 323 (430) T ss_pred ccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeeccccccccccccc Confidence 000001123455666553 Q ss_pred -------------------Ec-----------cccce--eccccc-CC-CCC-----------cceEEEecCCCcCCCCC Q lcl|NC_013692. 331 -------------------VA-----------SEAFT--TVGFAT-DG-KNV-----------KFKIITKRPGEATADRS 365 (399) Q Consensus 331 -------------------~G-----------~~Afg--~v~l~~-~g-~~~-----------k~~~ivk~pG~~tad~~ 365 (399) +| ++||+ +.+|.- .| ... .+.+++-.-+ T Consensus 324 ~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~y------- 396 (430) T protein:vir:21 324 QRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQG------- 396 (430) T ss_pred ccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEcc------- Confidence 22 12222 122210 00 000 0111111111 Q ss_pred CccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 366 DPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 366 DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) |.-.....+.|=.+||+..|++||..++=....- T Consensus 397 d~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 397 DISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred ccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 1111122334457899999999997544322222 No 144 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=95.77 E-value=0.0015 Score=36.09 Aligned_cols=282 Identities=15% Similarity=0.080 Sum_probs=130.9 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) ...-+..-........+.+..+-+-|+ .+ .. .+........+.+++.+.+++.+.++-... ...++. T Consensus 144 ~~~~~~~~e~~~~~~~~~~~~g~lvp~---~~-~~-~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~~~- 210 (437) T protein:vir:10 144 FADYLKTGEVRDVTGIALKDGKVIIPE---TI-LT-PEKEVHQFPRLGSLVRTESVTTTTGKLPIF-------NNSTDL- 210 (437) T ss_pred hHHHHHhhhhhhhhhcccccccccchH---HH-HH-HHHHhhhhhhhhhcceeEeeccCceeeEEe-------eccccc- Confidence 000000000001111111112223333 22 22 234433444667788888887766542222 111111 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVT 160 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~ 160 (399) ..+. +|.+.....-..++..|+-+.++++.+..+|+++.+ +++..|...|. T Consensus 211 ------~~~~----------------------~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~-ds~~~~~~~i~ 261 (437) T protein:vir:10 211 ------LTAH----------------------TEYGQTTKNATPVITPILWDLKTYTGGYVFSQELIS-DSSYDWQAELQ 261 (437) T ss_pred ------cccc----------------------cccccccccccccceeeeeehhheeeehhhhHHHHh-hhHHHHHHHHH Confidence 1111 111111111123667889999999999999999866 44445666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeccccccccccCCcceecHHHHHHHHH-HHHhccCccccceeccccccCcccc Q lcl|NC_013692. 161 TEMVKGANEITEDLLQIDLLNSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRL-DLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 161 ~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~-~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) .+|...-+. .+| ..+|++-++ ++... ....+.++|..+.. .|+..-.+ T Consensus 262 ~~l~~~~~~-~~~---~~i~~g~g~------~~~~~----~~~~~~~~~~~~~~~~l~~~~~~----------------- 310 (437) T protein:vir:10 262 SRLIELRDN-TDD---SLIITALTD------GIKKT----TSTYLLGDLKKVLNVTLKPQDSA----------------- 310 (437) T ss_pred HHHHHHHHH-HHH---HHHhhhhcc------ccccc----ccccchhhHHHHHHhhhhhhhhc----------------- Confidence 565544333 333 344543222 11111 12234455555432 34332211 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) .+ +.+||+....-|+.|+|-.+.|=|.|- ++ .|.-+++-|..++.++.|.. ..++ T Consensus 311 -~~-~~~~~~~~~~~l~~lkd~~g~~~~~~~--~~------~~~~~~l~G~pv~~~~~~~~--~~~~------------- 365 (437) T protein:vir:10 311 -AA-SIVMSQSAYNLFDMATDAMGRPLLQPN--VT------AATGYTLLGKTVVIVDDKLF--PSAS------------- 365 (437) T ss_pred -CC-EEEEcHHHHHHHHHhhccCCCeeeccC--cc------CCCCcccccceeEEeccccc--CCcC------------- Confidence 12 358999999999999887777777552 22 23446788888888776421 1110 Q ss_pred CccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEe- Q lcl|NC_013692. 320 GGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTV- 396 (399) Q Consensus 320 ~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~- 396 (399) .++ + .++||.= +|..+.- .+ +.+-+ -+..|.+.+...+- +.+.+.+++++-++.|..- T Consensus 366 ~~~---~-~~~~gd~~~~~~~~~r--~~----~~~~~-------~~~~~~~~~~~~~~--~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 366 AGD---V-NIVVAPLKKAVINFKL--TE----ITGQF-------QDTYDIWYKQLGIF--LRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred CCc---e-EEEEeeccccEEEEee--ec----eEEEE-------ecccccccceeeEE--EEEccEEecccceEEEEeec Confidence 011 2 2456653 3322111 11 11111 12244454433222 3468888888888876521 Q ss_pred -cCC Q lcl|NC_013692. 397 -ARL 399 (399) Q Consensus 397 -A~~ 399 (399) |.. T Consensus 427 ~~~~ 430 (437) T protein:vir:10 427 KAVT 430 (437) T ss_pred cccc Confidence 111 No 145 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=95.55 E-value=0.00051 Score=38.63 Aligned_cols=270 Identities=14% Similarity=0.131 Sum_probs=130.6 Q ss_pred CCCcccc---ccccccCC-CCCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPVDN---IKPMKYND-PANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~~~---~~~~~~n~-~~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) +.+.... .+.....+ -..++. +-+-|+ -+..+.+......-.+.+++.+.++.- .++ .+...+.. T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~ 170 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCC Confidence 1110000 00000000 001111 112343 224566666666666777888776642 222 11111111 Q ss_pred CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 74 DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 74 ~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) ++.++.. |...+--.+++..|+.+.++++.|+.+|+++.+ ++++ T Consensus 171 ------------~a~~v~E-----------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~~ 214 (387) T protein:vir:96 171 ------------DDDFITD-----------------------VETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSDV 214 (387) T ss_pred ------------ccccccc-----------------------cccccccccccceeeechheeeeechhhHHHHh-hhHH Confidence 1112221 222222335677899999999999999999766 5666 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) .|.+.|..+|.+.-+. .++.. ++..|+ + +.+..+. ..++ ...++++|..+.-.|+....+ T Consensus 215 ~l~~~i~~~la~~~~~-~e~~~---~~~~g~g~g~~~g~~~~~~~---~~~~--~~~~~d~i~~~~~~l~~~y~~----- 280 (387) T protein:vir:96 215 DLVNWVENALQSGLAA-KERKD---ALAVSPKSGLEHMSFYNGSV---KEVE--GADMYDAIINALADLHEDYRD----- 280 (387) T ss_pred HHHHHHHHHHHHHHHH-HHHHh---HhhcCCCccccceeeecccc---cccc--ccchHHHHHHHHhccChhhhc----- Confidence 6777777777665443 33221 222221 1 1222111 1222 224578888887777654322 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) .+ ..+||+.....++.+.+--+- .++.|.-+.+-|..++.+.- T Consensus 281 -------------na-~~imn~~t~~~~~~~~~~~~~-------------~~~~~~~~~llG~PV~~~~~---------- 323 (387) T protein:vir:96 281 -------------NA-TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDA---------- 323 (387) T ss_pred -------------CC-EEEEechHHHHHHHHHhcCCC-------------cccccCCccccccceEEecC---------- Confidence 11 236787766666665432222 23333334455555554331 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc-cchhhHHHHHHHHHHhhcc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP-YGEMGFMSIKWYYGFMVFR 386 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iLn 386 (399) . +-++||.=++.-+.+.+.. + .+- .|. -|+++|..+. ++.+.+.+ T Consensus 324 --------------~----~~~~~GDf~~~~~~~~~~~----~----~~~-------~~~~~~~~~~~~~~-r~Dg~v~~ 369 (387) T protein:vir:96 324 --------------A----VKPIVGDFNYFGINYDGTT----Y----DTD-------KDVKKGEYLFVLTA-WYDQQRTL 369 (387) T ss_pred --------------C----Cceeeechhhhhhhhhhhh----h----eec-------ccccCCceEEEEEE-EeCcEeec Confidence 0 1246777555444443211 1 111 111 2455555544 78999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) ++-+..++.-|.- T Consensus 370 ~~A~~~l~~ka~~ 382 (387) T protein:vir:96 370 DSAFRIAKAKENT 382 (387) T ss_pred hhheEEEEeecCC Confidence 9999999986655 No 146 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=95.55 E-value=0.00051 Score=38.63 Aligned_cols=270 Identities=14% Similarity=0.131 Sum_probs=130.6 Q ss_pred CCCcccc---ccccccCC-CCCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPVDN---IKPMKYND-PANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~~~---~~~~~~n~-~~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) +.+.... .+.....+ -..++. +-+-|+ -+..+.+......-.+.+++.+.++.- .++ .+...+.. T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~ 170 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCC Confidence 1110000 00000000 001111 112343 224566666666666777888776642 222 11111111 Q ss_pred CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 74 DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 74 ~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) ++.++.. |...+--.+++..|+.+.++++.|+.+|+++.+ ++++ T Consensus 171 ------------~a~~v~E-----------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~~ 214 (387) T protein:vir:94 171 ------------DDDFITD-----------------------VETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSDV 214 (387) T ss_pred ------------ccccccc-----------------------cccccccccccceeeechheeeeechhhHHHHh-hhHH Confidence 1112221 222222335677899999999999999999766 5666 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) .|.+.|..+|.+.-+. .++.. ++..|+ + +.+..+. ..++ ...++++|..+.-.|+....+ T Consensus 215 ~l~~~i~~~la~~~~~-~e~~~---~~~~g~g~g~~~g~~~~~~~---~~~~--~~~~~d~i~~~~~~l~~~y~~----- 280 (387) T protein:vir:94 215 DLVNWVENALQSGLAA-KERKD---ALAVSPKSGLEHMSFYNGSV---KEVE--GADMYDAIINALADLHEDYRD----- 280 (387) T ss_pred HHHHHHHHHHHHHHHH-HHHHh---HhhcCCCccccceeeecccc---cccc--ccchHHHHHHHHhccChhhhc----- Confidence 6777777777665443 33221 222221 1 1222111 1222 224578888887777654322 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) .+ ..+||+.....++.+.+--+- .++.|.-+.+-|..++.+.- T Consensus 281 -------------na-~~imn~~t~~~~~~~~~~~~~-------------~~~~~~~~~llG~PV~~~~~---------- 323 (387) T protein:vir:94 281 -------------NA-TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDA---------- 323 (387) T ss_pred -------------CC-EEEEechHHHHHHHHHhcCCC-------------cccccCCccccccceEEecC---------- Confidence 11 236787766666665432222 23333334455555554331 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc-cchhhHHHHHHHHHHhhcc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP-YGEMGFMSIKWYYGFMVFR 386 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iLn 386 (399) . +-++||.=++.-+.+.+.. + .+- .|. -|+++|..+. ++.+.+.+ T Consensus 324 --------------~----~~~~~GDf~~~~~~~~~~~----~----~~~-------~~~~~~~~~~~~~~-r~Dg~v~~ 369 (387) T protein:vir:94 324 --------------A----VKPIVGDFNYFGINYDGTT----Y----DTD-------KDVKKGEYLFVLTA-WYDQQRTL 369 (387) T ss_pred --------------C----Cceeeechhhhhhhhhhhh----h----eec-------ccccCCceEEEEEE-EeCcEeec Confidence 0 1246777555444443211 1 111 111 2455555544 78999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) ++-+..++.-|.- T Consensus 370 ~~A~~~l~~ka~~ 382 (387) T protein:vir:94 370 DSAFRIAKAKENT 382 (387) T ss_pred hhheEEEEeecCC Confidence 9999999986655 No 147 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=95.55 E-value=0.00051 Score=38.63 Aligned_cols=270 Identities=14% Similarity=0.131 Sum_probs=130.6 Q ss_pred CCCcccc---ccccccCC-CCCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPVDN---IKPMKYND-PANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~~~---~~~~~~n~-~~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) +.+.... .+.....+ -..++. +-+-|+ -+..+.+......-.+.+++.+.++.- .++ .+...+.. T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~----~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~ 170 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeech----hHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCC Confidence 1110000 00000000 001111 112343 224566666666666777888776642 222 11111111 Q ss_pred CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 74 DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 74 ~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) ++.++.. |...+--.+++..|+.+.++++.|+.+|+++.+ ++++ T Consensus 171 ------------~a~~v~E-----------------------g~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~-ds~~ 214 (387) T protein:vir:26 171 ------------DDDFITD-----------------------VETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSDV 214 (387) T ss_pred ------------ccccccc-----------------------cccccccccccceeeechheeeeechhhHHHHh-hhHH Confidence 1112221 222222335677899999999999999999766 5666 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCccccce Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKM 227 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~i 227 (399) .|.+.|..+|.+.-+. .++.. ++..|+ + +.+..+. ..++ ...++++|..+.-.|+....+ T Consensus 215 ~l~~~i~~~la~~~~~-~e~~~---~~~~g~g~g~~~g~~~~~~~---~~~~--~~~~~d~i~~~~~~l~~~y~~----- 280 (387) T protein:vir:26 215 DLVNWVENALQSGLAA-KERKD---ALAVSPKSGLEHMSFYNGSV---KEVE--GADMYDAIINALADLHEDYRD----- 280 (387) T ss_pred HHHHHHHHHHHHHHHH-HHHHh---HhhcCCCccccceeeecccc---cccc--ccchHHHHHHHHhccChhhhc----- Confidence 6777777777665443 33221 222221 1 1222111 1222 224578888887777654322 Q ss_pred eccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcc Q lcl|NC_013692. 228 ITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKA 307 (399) Q Consensus 228 i~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~ 307 (399) .+ ..+||+.....++.+.+--+- .++.|.-+.+-|..++.+.- T Consensus 281 -------------na-~~imn~~t~~~~~~~~~~~~~-------------~~~~~~~~~llG~PV~~~~~---------- 323 (387) T protein:vir:26 281 -------------NA-TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDA---------- 323 (387) T ss_pred -------------CC-EEEEechHHHHHHHHHhcCCC-------------cccccCCccccccceEEecC---------- Confidence 11 236787766666665432222 23333334455555554331 Q ss_pred cCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc-cchhhHHHHHHHHHHhhcc Q lcl|NC_013692. 308 VDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP-YGEMGFMSIKWYYGFMVFR 386 (399) Q Consensus 308 ~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iLn 386 (399) . +-++||.=++.-+.+.+.. + .+- .|. -|+++|..+. ++.+.+.+ T Consensus 324 --------------~----~~~~~GDf~~~~~~~~~~~----~----~~~-------~~~~~~~~~~~~~~-r~Dg~v~~ 369 (387) T protein:vir:26 324 --------------A----VKPIVGDFNYFGINYDGTT----Y----DTD-------KDVKKGEYLFVLTA-WYDQQRTL 369 (387) T ss_pred --------------C----Cceeeechhhhhhhhhhhh----h----eec-------ccccCCceEEEEEE-EeCcEeec Confidence 0 1246777555444443211 1 111 111 2455555544 78999999 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) ++-+..++.-|.- T Consensus 370 ~~A~~~l~~ka~~ 382 (387) T protein:vir:26 370 DSAFRIAKAKENT 382 (387) T ss_pred hhheEEEEeecCC Confidence 9999999986655 No 148 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=95.31 E-value=0.0023 Score=35.02 Aligned_cols=297 Identities=14% Similarity=0.165 Sum_probs=131.8 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-----ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-----SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIAN 92 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-----~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~n 92 (399) |. .+++..++-+. |+.|+--+-.+||.+.+.+. ++ .+.|.||+. |++.+-.. ..|-+..++ T Consensus 1 MA--n~l~~~~~ii~--~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~----p~~~~~~~-~~G~~~t~~---- 66 (430) T protein:vir:10 1 MA--LNEGQIVTLAV--DEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWM----PVEQESPT-QEGWDLTDK---- 66 (430) T ss_pred Cc--cchhhHHHHHH--HHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEe----cccccccc-ccCcccCCC---- Confidence 21 12333223233 77777777778888764432 22 367888743 33322222 225554432 Q ss_pred hhhccccccccccccccccccccccceeeccceeEEEEEEe-eeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 93 GNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKL-EKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEIT 171 (399) Q Consensus 93 gn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l-~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t 171 (399) ++-++|. -+.++| +|.+..++|++.-+. +++ . +++-+..+.... T Consensus 67 ----------------~~~i~e~------------~v~~~v~~~k~V~~~~~~kel~-~~~--~----~~~~i~~Am~~L 111 (430) T protein:vir:10 67 ----------------ATGLLEL------------NVAVNMGEPDNDFFQLRADDLR-DET--A----YRHRIQSAARKL 111 (430) T ss_pred ----------------CCccccc------------eEEEEEeeeccceEEechhHhc-Chh--H----HHHHhHHHHHHH Confidence 2223331 234444 457888899875532 111 1 122234444444 Q ss_pred HHHHHHHHHhc----CceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEe Q lcl|NC_013692. 172 EDLLQIDLLNS----AGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYV 247 (399) Q Consensus 172 ~d~l~~~~l~a----gt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~ 247 (399) ...|-.++++- +.+|.=. .++.-+. ..-..++.-.+-+.|..|.+|+ +--+-+++ T Consensus 112 A~~Vd~dl~~~~~~~~~~v~~~----~~~t~~~-~~~~~~~~A~a~~~L~~~~vP~----------------~~~R~~vl 170 (430) T protein:vir:10 112 ANNVELKVANMAAEMGSLVITS----PDAIGTN-TADAWNFVADAEELMFSRELNR----------------DMGTSYFF 170 (430) T ss_pred HHHHHHHHHHHhhhcccccccc----cccCCCc-CCcchhhHHHHHHHHHHhcCCC----------------CCCcEEEe Confidence 44555555522 2222210 1111111 1113456667778899999985 11266778 Q ss_pred chhhhHHHHHHhhhcCCCCceehhhcC--Ccccccccccee-EcCeEEE-ecCccccccc-CCccc--CCccc------- Q lcl|NC_013692. 248 GSDLVPTIEAMKDNHGNPAFIPIEKYA--AGGATMHGEVGQ-LGRFRVI-VNPQMMHWAG-VGKAV--DPNDQ------- 313 (399) Q Consensus 248 h~dl~~di~d~~~~~~~p~fi~v~kYg--~~~~i~~gEIG~-i~~~RfV-~~~~~~~~~~-aGa~~--~~~~~------- 313 (399) -|+....+-+ .+...+.-+ ..+.+-+||||+ +.+|..+ .+...-+-.+ .|+.. +.++. T Consensus 171 dp~~~~~l~~--------~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~ 242 (430) T protein:vir:10 171 NPQDYKKAGY--------DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQ 242 (430) T ss_pred ChHHHHHHHh--------hhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccce Confidence 7887777643 122232222 345577999997 9999865 4444333211 11111 10000 Q ss_pred ----------------ccccc--------------------------------------cCccceeEEEEE--------- Q lcl|NC_013692. 314 ----------------VPMHE--------------------------------------SGGKYSVFPMLC--------- 330 (399) Q Consensus 314 ----------------~~~~~--------------------------------------~~~~~DVYp~lV--------- 330 (399) .+++. .++.+.|||-++ T Consensus 243 v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~ 322 (430) T protein:vir:10 243 LDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSP 322 (430) T ss_pred ecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccc Confidence 00011 123344555543 Q ss_pred --------------------Ec-----------cccce--eccccc-CC-C-----------CCcceEEEecCCCcCCCC Q lcl|NC_013692. 331 --------------------VA-----------SEAFT--TVGFAT-DG-K-----------NVKFKIITKRPGEATADR 364 (399) Q Consensus 331 --------------------~G-----------~~Afg--~v~l~~-~g-~-----------~~k~~~ivk~pG~~tad~ 364 (399) +| ++||+ +.+|.- .| . .+.+.+++-.-+ T Consensus 323 ~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~y------ 396 (430) T protein:vir:10 323 EQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQG------ 396 (430) T ss_pred cccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEec------ Confidence 22 12222 112210 00 0 000111111111 Q ss_pred CCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 365 SDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 365 ~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) |.-.....+.|=.+||+..|++||..++=....- T Consensus 397 -d~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 397 -DISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred -ccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 1111112223447899999999996544322222 No 149 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=95.31 E-value=0.0023 Score=35.02 Aligned_cols=297 Identities=14% Similarity=0.165 Sum_probs=131.8 Q ss_pred CCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-----ccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-----SMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIAN 92 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-----~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~n 92 (399) |. .+++..++-+. |+.|+--+-.+||.+.+.+. ++ .+.|.||+. |++.+-.. ..|-+..++ T Consensus 1 MA--n~l~~~~~ii~--~eal~~l~n~~v~a~~~~~~r~~d~~~-~r~Gdti~~----p~~~~~~~-~~G~~~t~~---- 66 (430) T protein:vir:92 1 MA--LNEGQIVTLAV--DEIIETISAITPMAQKAKKYTPPAASM-QRSSNTIWM----PVEQESPT-QEGWDLTDK---- 66 (430) T ss_pred Cc--cchhhHHHHHH--HHHHHHHhhhhhhhhhhcccCCchhhh-hcccceEEe----cccccccc-ccCcccCCC---- Confidence 21 12333223233 77777777778888764432 22 367888743 33322222 225554432 Q ss_pred hhhccccccccccccccccccccccceeeccceeEEEEEEe-eeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 93 GNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKL-EKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEIT 171 (399) Q Consensus 93 gn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l-~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t 171 (399) ++-++|. -+.++| +|.+..++|++.-+. +++ . +++-+..+.... T Consensus 67 ----------------~~~i~e~------------~v~~~v~~~k~V~~~~~~kel~-~~~--~----~~~~i~~Am~~L 111 (430) T protein:vir:92 67 ----------------ATGLLEL------------NVAVNMGEPDNDFFQLRADDLR-DET--A----YRHRIQSAARKL 111 (430) T ss_pred ----------------CCccccc------------eEEEEEeeeccceEEechhHhc-Chh--H----HHHHhHHHHHHH Confidence 2223331 234444 457888899875532 111 1 122234444444 Q ss_pred HHHHHHHHHhc----CceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEe Q lcl|NC_013692. 172 EDLLQIDLLNS----AGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYV 247 (399) Q Consensus 172 ~d~l~~~~l~a----gt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~ 247 (399) ...|-.++++- +.+|.=. .++.-+. ..-..++.-.+-+.|..|.+|+ +--+-+++ T Consensus 112 A~~Vd~dl~~~~~~~~~~v~~~----~~~t~~~-~~~~~~~~A~a~~~L~~~~vP~----------------~~~R~~vl 170 (430) T protein:vir:92 112 ANNVELKVANMAAEMGSLVITS----PDAIGTN-TADAWNFVADAEELMFSRELNR----------------DMGTSYFF 170 (430) T ss_pred HHHHHHHHHHHhhhcccccccc----cccCCCc-CCcchhhHHHHHHHHHHhcCCC----------------CCCcEEEe Confidence 44555555522 2222210 1111111 1113456667778899999985 11266778 Q ss_pred chhhhHHHHHHhhhcCCCCceehhhcC--Ccccccccccee-EcCeEEE-ecCccccccc-CCccc--CCccc------- Q lcl|NC_013692. 248 GSDLVPTIEAMKDNHGNPAFIPIEKYA--AGGATMHGEVGQ-LGRFRVI-VNPQMMHWAG-VGKAV--DPNDQ------- 313 (399) Q Consensus 248 h~dl~~di~d~~~~~~~p~fi~v~kYg--~~~~i~~gEIG~-i~~~RfV-~~~~~~~~~~-aGa~~--~~~~~------- 313 (399) -|+....+-+ .+...+.-+ ..+.+-+||||+ +.+|..+ .+...-+-.+ .|+.. +.++. T Consensus 171 dp~~~~~l~~--------~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~ 242 (430) T protein:vir:92 171 NPQDYKKAGY--------DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQ 242 (430) T ss_pred ChHHHHHHHh--------hhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccce Confidence 7887777643 122232222 345577999997 9999865 4444333211 11111 10000 Q ss_pred ----------------ccccc--------------------------------------cCccceeEEEEE--------- Q lcl|NC_013692. 314 ----------------VPMHE--------------------------------------SGGKYSVFPMLC--------- 330 (399) Q Consensus 314 ----------------~~~~~--------------------------------------~~~~~DVYp~lV--------- 330 (399) .+++. .++.+.|||-++ T Consensus 243 v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~ 322 (430) T protein:vir:92 243 LDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSP 322 (430) T ss_pred ecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccc Confidence 00011 123344555543 Q ss_pred --------------------Ec-----------cccce--eccccc-CC-C-----------CCcceEEEecCCCcCCCC Q lcl|NC_013692. 331 --------------------VA-----------SEAFT--TVGFAT-DG-K-----------NVKFKIITKRPGEATADR 364 (399) Q Consensus 331 --------------------~G-----------~~Afg--~v~l~~-~g-~-----------~~k~~~ivk~pG~~tad~ 364 (399) +| ++||+ +.+|.- .| . .+.+.+++-.-+ T Consensus 323 ~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~y------ 396 (430) T protein:vir:92 323 EQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQG------ 396 (430) T ss_pred cccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEec------ Confidence 22 12222 112210 00 0 000111111111 Q ss_pred CCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 365 SDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 365 ~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) |.-.....+.|=.+||+..|++||..++=....- T Consensus 397 -d~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 397 -DISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred -ccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 1111112223447899999999996544322222 No 150 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=95.02 E-value=0.003 Score=34.46 Aligned_cols=283 Identities=14% Similarity=0.116 Sum_probs=142.7 Q ss_pred ccceehhhhhHHHHHHhhhHHhhhhcccccc----cCc--CCCcEEEEEEccCCcCCCc------cccCCCCcchhhhhh Q lcl|NC_013692. 25 GPQIHTRYWYKRALIDAAKEAYFGQLADTFS----MPK--HYGKEIVRLHYIPLLDDRN------VNDQGIDASGATIAN 92 (399) Q Consensus 25 ~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~----mPK--n~GktIkfrry~pl~~~~t------~lteGV~p~g~~~~n 92 (399) =|.++..-| +|+=.+.....|...|++.-+ ||= =.|.+..+.|=.-+++..- .+.+|+.|. T Consensus 1 mpaltLaea-~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~------ 73 (310) T protein:vir:97 1 MASVTLAES-AKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKA------ 73 (310) T ss_pred CcccchHHH-hhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccc------ Confidence 011111111 122222223334444443222 221 2355555555444433221 122333332 Q ss_pred hhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hchhHHHHHHHHH-HHHHHHH Q lcl|NC_013692. 93 GNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SDPAMEGHVTTEM-VKGANEI 170 (399) Q Consensus 93 gn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D~~L~~~i~~el-~~~~~~~ 170 (399) -.|+..++.+|.=.|-..++--.+.|+. .|+.-.-....++ .+.-.+. T Consensus 74 ------------------------------~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~ 123 (310) T protein:vir:97 74 ------------------------------AATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRK 123 (310) T ss_pred ------------------------------ccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHH Confidence 1245678888888888889888888886 6653311111122 1222333 Q ss_pred HHHHHHHHHHhcCc--eeeecc-----cccccccc-CCcceecHHHHHHHHHHHH-hccCccccceeccccccCcccccC Q lcl|NC_013692. 171 TEDLLQIDLLNSAG--TVRYPG-----AATSDAEV-DATTEVTYDSLMRLRLDLD-NARAPTKIKMITGTRMIDTRTVGN 241 (399) Q Consensus 171 t~d~l~~~~l~agt--~V~YAg-----~aTsra~v-~~~~~vt~~~lr~a~~~Lk-~nrApk~T~ii~~s~~~gT~~I~~ 241 (399) . ++.++|+-. | -|-| ..+++-.. +.+-.+|.++|+.+.-..- ..+.| T Consensus 124 ~----e~~lINGD~a~n-~F~GL~~~~~~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p------------------- 179 (310) T protein:vir:97 124 Y----QDQLINGNGAGN-EFAGLIQLCASGQKATTGATGSAISFAILDELMDLVVDKDGQV------------------- 179 (310) T ss_pred H----HHHhhccccCCC-cccchhhcCCccceeecCCCCCCCCHHHHHHHHHHHhcCCCCC------------------- Confidence 3 444444211 1 1100 01122111 1234588999999885432 33343 Q ss_pred eeEEEechhhhHHHHHHhhhcCCCCceeh--hhcCCccccccccceeEcCeEEEecCcccccccCCcccCCccccccccc Q lcl|NC_013692. 242 ARALYVGSDLVPTIEAMKDNHGNPAFIPI--EKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHES 319 (399) Q Consensus 242 ~yv~~~h~dl~~di~d~~~~~~~p~fi~v--~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~ 319 (399) =+.++||-+.+-|+++..-.+--+--|+ ..+|.+ +=++.++-|+....+..=...+ ++ T Consensus 180 -~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~-------v~~~~GiPi~~~d~ip~~~~~~------------~~ 239 (310) T protein:vir:97 180 -DYLTMHARTLRSYKALLRALGGASINEVVELPSGAE-------VPAYSGTPIFRNDYIPTNQTKG------------GT 239 (310) T ss_pred -CEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCE-------EeeeCCeEEEEeCccCCCcccc------------cc Confidence 2578999888888876432222233222 112222 2256788888865322111111 12 Q ss_pred CccceeEEEEEEcccc--ceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEec Q lcl|NC_013692. 320 GGKYSVFPMLCVASEA--FTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVA 397 (399) Q Consensus 320 ~~~~DVYp~lV~G~~A--fg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A 397 (399) ++.-.+|. +-+|.++ .|.+||...+. .-+-|.-+|+. .|.---+..+ +||+++.+|++.=.++||-+- T Consensus 240 ~gtTsIya-~r~Ge~~~~~Gv~Gl~~~~~---~glsVr~~G~~----~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 240 TGCTTIFA-GTLDDGSRTHGIAGLTATQA---AGIQVVDVGES----EDSDEHIWRV--KWYCGLALFSEKGLACADGIT 309 (310) T ss_pred CCceeEEE-EeeCccccccceeccccCCc---cceeEEeCCcc----cCCcceeEEE--EEeeeEEEecccceeeecccc Confidence 33445775 5689886 78999986552 23566677742 3333444444 479999999999999999888 Q ss_pred C Q lcl|NC_013692. 398 R 398 (399) Q Consensus 398 ~ 398 (399) - T Consensus 310 ~ 310 (310) T protein:vir:97 310 N 310 (310) T ss_pred C Confidence 8 No 151 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=309 Identities=12% Similarity=0.082 Sum_probs=148.0 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCcccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVND 80 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lt 80 (399) |.-| ||.....++. .++. -.+..--|.-+.+..=+..-++..+=.+|. -..||+..|-|---.. ..-++ T Consensus 1 Ms~~-n~~t~~~~~~-----sg~~-~al~Le~f~GeV~taF~~~si~~~~~~vRt--i~~gkS~qf~~~G~s~--~~~~~ 69 (401) T protein:vir:70 1 MSTP-NNLTNVAVSA-----SGEV-DSLLIEKFNGKVNEQYLKGENIMSYFDVQT--VTGTNTVSNKYLGETE--LQVLA 69 (401) T ss_pred CCCC-cccccccccc-----ccch-hHhHHhHhcchHHHHHHHHhhhcccceeee--ecccceEEEEEeeeeE--eeeec Confidence 3333 2222222211 1111 011222234455555444455556666675 4677888886542211 11122 Q ss_pred CCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hchhHHHHH Q lcl|NC_013692. 81 QGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SDPAMEGHV 159 (399) Q Consensus 81 eGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D~~L~~~i 159 (399) .|-.+.|+.+.+. | ....|+.-|--.=+++.|-|...+.+ -+.+ + T Consensus 70 pG~~ld~~~~~~d--------------K----------------~~ItID~lL~a~~~V~dlDe~q~~yD~vRse----~ 115 (401) T protein:vir:70 70 PGQSPAATSTQAD--------------K----------------NQLVIDATVIARNTVAHLHDVQGDIDSLKPK----L 115 (401) T ss_pred CCCCcCCCCcccc--------------c----------------EEEEeCceeehhhhhhhHHHHHhcccccchH----H Confidence 2222333222211 1 11223333333334445554444444 3433 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC-cee-eec------cccccccccCCc---ceecHHHHH----HHHHHHHhccCccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSA-GTV-RYP------GAATSDAEVDAT---TEVTYDSLM----RLRLDLDNARAPTK 224 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~ag-t~V-~YA------g~aTsra~v~~~---~~vt~~~lr----~a~~~Lk~nrApk~ 224 (399) +.+++..=+......+.+-+..|| .+. ... +++++ .++++. ..++...|. .+...|+++..| T Consensus 116 s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~-i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP-- 192 (401) T protein:vir:70 116 ATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFS-INVEVAEGEALVNPQYVMAAVEFALEQQLEQEVD-- 192 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCceE-EeccccccccccCHHHHHHHHHHHHHHHHhcCCC-- Confidence 555555555533333333332222 221 000 11111 122211 223433344 455556666664 Q ss_pred cceeccccccCcccccCe-eEEEechhhhHHHHHHhhhcCCCCceehh-hcCCccccccccceeEcCeEEEecCcccccc Q lcl|NC_013692. 225 IKMITGTRMIDTRTVGNA-RALYVGSDLVPTIEAMKDNHGNPAFIPIE-KYAAGGATMHGEVGQLGRFRVIVNPQMMHWA 302 (399) Q Consensus 225 T~ii~~s~~~gT~~I~~~-yv~~~h~dl~~di~d~~~~~~~p~fi~v~-kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~ 302 (399) .+ |+.++.|+--.-|++ .|...+.. .|......-+|+|+++.|||+|+++++--.. T Consensus 193 ----------------~~r~vvl~pp~~Ys~Ll~------~d~L~nrd~~~s~~g~~~~G~v~~vaGv~Vv~SnnlP~~a 250 (401) T protein:vir:70 193 ----------------ISDVAILMPWRYFNVLRD------ADRIVDKTYTISQSGATIQGFTLSSYNCPVIPSNRFPKYS 250 (401) T ss_pred ----------------ccceEEEcCHHHHHHHHh------cCcccchhhccccCCccccceEEEEeceEEEeeccccccc Confidence 23 778888887777765 34555532 2345567899999999999999999854322 Q ss_pred cCCcccCCcccccccccCcccee------EEEEEEccccceecccccCCCCCcceEEEecCCCcCCC-CCCccchhhHHH Q lcl|NC_013692. 303 GVGKAVDPNDQVPMHESGGKYSV------FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATAD-RSDPYGEMGFMS 375 (399) Q Consensus 303 ~aGa~~~~~~~~~~~~~~~~~DV------Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad-~~DPlgQrg~~g 375 (399) +. .+...+ .....+..||| --.|+|=.+|-+++-+. .+ +.+ --|+=-|..++= T Consensus 251 ~~---it~~~l-s~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~----~l------------t~~~~~d~r~~~~~id 310 (401) T protein:vir:70 251 QG---QTHHLL-SNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSI----DV------------TGDIFYEKKEKTYYID 310 (401) T ss_pred cc---cccccc-cccCCCccCCCCccccceeEEEEehhheEEEEee----cc------------ccchhhhhhhhHHHHH Confidence 11 111111 01112344442 13455555655554442 11 111 136667888888 Q ss_pred HHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 376 IKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 376 wK~~~~~~iLn~~~m~~iet~A~~ 399 (399) -|..|+...+|++.-+.++|.-.- T Consensus 311 ~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 311 TFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred HHHHhCCcccchhheEEEeecCcc Confidence 999999999999999998776442 No 152 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=94.34 E-value=0.0041 Score=33.70 Aligned_cols=271 Identities=15% Similarity=0.167 Sum_probs=125.2 Q ss_pred CCCccc--cccccccC-CCCCCccc---ccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVD--NIKPMKYN-DPANGVES---SIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~--~~~~~~~n-~~~~~~~~---~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) ...... ..+..... .-..++++ -+-|+ -+ ..+.+......-.+.+++.+.++. +.++ .+...+... T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~---~~-~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~-p~~~~~~~~ 171 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK---TL-SKEIVSEPFAKNQLREKARLTNIK---GLEI-PRVSYTLDD 171 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCceeech---hH-HHHHHHHHHhhchhhhheeeeecC---CceE-EEEeecCCc Confidence 000000 00000000 00011111 12333 22 344555555555566777776654 2222 111222111 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchh Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPA 154 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~ 154 (399) +.++.. |....-..+++..|+.+.++|+.++.+|+++.+ +++.. T Consensus 172 ------------a~~v~E-----------------------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~-Ds~~~ 215 (387) T protein:vir:93 172 ------------DDFITD-----------------------VETAKELKLKGDTVKFTTNKFKVFAAISDTVIH-GSDVD 215 (387) T ss_pred ------------cccccC-----------------------cccccccccccceeeeeheeeeeechhhHHHHh-hhHHH Confidence 112221 112222234677899999999999999999765 55656 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_013692. 155 MEGHVTTEMVKGANEITEDLLQIDLLNSAG-T-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 155 L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii 228 (399) |...|..++.+.-+. .++. .++..|+ + +.+.... ..++ ...+++.|..+...|+..... T Consensus 216 l~~~i~~~la~~~~~-~e~~---~~~~~g~g~g~p~g~l~~~~~---~~v~--~~~~~d~i~~~~~~l~~~~~~------ 280 (387) T protein:vir:93 216 LVNWVENALQSGLAA-KERK---DALAVSPKSGLDHMSFYNGSV---KEVE--GADMYDAIINALADLHEDYRD------ 280 (387) T ss_pred HHHHHHHHHHHHHHH-HHHH---hHhhcCCCccccceeeecccc---cccc--ccchHHHHHHHHhccChhhhc------ Confidence 777676666554433 3322 1222222 1 1222111 1222 223577787777666654332 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) .+ +.+||+.....++.+.+--+ ..++.|+=.++-|..++.+.- T Consensus 281 ------------~a-~~~mn~~t~~~~~~~~~d~~-------------~~~~~~~~~~llG~PV~~~~~----------- 323 (387) T protein:vir:93 281 ------------NA-TIYMRYADYVKIISVLSNGT-------------TNFFDTPAEKVFGKPVVFTDA----------- 323 (387) T ss_pred ------------CC-EEEEechHHHHHHHHHhcCC-------------CcccccCCccccccceEEecC----------- Confidence 12 23677765555554432111 123333334555555555331 Q ss_pred CCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcccc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPE 388 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~ 388 (399) . +-++||.=++.-+.+.+. .+.+--+ +. -|+.+|..+ .++.+.+.+++ T Consensus 324 ----------------~-~~~~~GDf~~~~~~~~~~--------~~~~~~~--~~----~~~~~~~~~-~r~d~~v~~~e 371 (387) T protein:vir:93 324 ----------------A-VKPIVGDFNYFGINYDGT--------TYDTDKD--VK----KGEYLFVLT-AWYDQQRTLDS 371 (387) T ss_pred ----------------C-Cceeeeehhhhheehhhh--------eeeeccc--cc----CCceeEEEE-eeeCceeechh Confidence 0 113567655554443211 1111100 11 245555544 48899999999 Q ss_pred ceEEEEEecCC Q lcl|NC_013692. 389 WIALLKTVARL 399 (399) Q Consensus 389 ~m~~iet~A~~ 399 (399) =+..++..+.- T Consensus 372 A~~~l~~k~~~ 382 (387) T protein:vir:93 372 AFRIAKAKENT 382 (387) T ss_pred heEEEEeecCC Confidence 99988885555 No 153 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=93.96 E-value=0.0031 Score=34.31 Aligned_cols=271 Identities=15% Similarity=0.158 Sum_probs=124.3 Q ss_pred CCCcc-c--cccccccCCC-CCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCc Q lcl|NC_013692. 1 MAGPV-D--NIKPMKYNDP-ANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLL 73 (399) Q Consensus 1 ~~~~~-~--~~~~~~~n~~-~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~ 73 (399) +.+-. . ........+. ..+++ +-+-|+ -+..+.+......-.+.+++.+.++. |.++- +...+.. T Consensus 114 ~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~----~~~~~Ii~~~~~~~~l~~~~~v~~~~---~~~~p-~~~~~~~ 185 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPK----TLSKEIVSEPFAKNQLREKARLTNIK---GLEIP-RVSYTLD 185 (402) T ss_pred HhhhhHHHHHHhHHHHHhhhccCCCcCCccccch----hHHHHHHHhHHhhhhhhhhceeeecC---Cceee-eeeccCC Confidence 00000 0 0000000000 01111 112233 12455555555555667777776653 22221 1111111 Q ss_pred CCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 74 DDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 74 ~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) + +.++.. |+..+--.+++..|+.+.++++.|+.+|+++.+ +++. T Consensus 186 ~------------a~~v~E-----------------------g~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~-Ds~~ 229 (402) T protein:vir:93 186 D------------DDFITD-----------------------VETAKELKAKGDTVKFTTNKFKVFAAISDTVIH-GSDV 229 (402) T ss_pred c------------cccccc-----------------------cccccccccccceeeecceeeeeechhhHHHHh-hhHH Confidence 1 112211 222222334677899999999999999999666 5555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCce-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAGT-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii 228 (399) .|.+.|..+|...-+. .++... ++++.++ +.++...+ .++ ..-.+++|..+.-.|+..... T Consensus 230 ~l~~~i~~~la~~~~~-~e~~~~--~~~g~g~g~p~g~~~~~~~~---~~~--~~~~~d~l~~~~~~l~~~y~~------ 295 (402) T protein:vir:93 230 DLVNWVENALQSGLAA-KERKDA--LAVSPKSGLEHMSFYNGSVK---EVE--GADMYDAIINALADLHEDYRD------ 295 (402) T ss_pred HHHHHHHHHHHHHHHH-HHHHhH--hhcCCCccccceeeeccccc---ccc--ccchHHHHHHHHhccChhhhc------ Confidence 6777776666665443 332211 2222111 22322111 122 123467788777766543321 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) .+ +.+||+.....++.+.+ |.+ ..++.|.-+.+-+..++.+.- T Consensus 296 ------------na-~~imn~~t~~~~~~~~~---d~~----------~~~~~~~~~~llG~PV~~t~~----------- 338 (402) T protein:vir:93 296 ------------NA-TIYMRYADYVKIISVLS---NGT----------TNFFDTPAEKVFGKPVVFTDA----------- 338 (402) T ss_pred ------------CC-EEEEechHHHHHHHHHh---cCC----------CcccccCCccccccceEEecC----------- Confidence 11 23677776666665543 211 223333333344444443320 Q ss_pred CCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCc-cchhhHHHHHHHHHHhhccc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDP-YGEMGFMSIKWYYGFMVFRP 387 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DP-lgQrg~~gwK~~~~~~iLn~ 387 (399) . +.++||.=++.-+.+.... + .+ ..|+ .|+++|.++. ++.+.+.++ T Consensus 339 ----------------~-~~i~~GDf~~~~~~~~~~~----~-----~~------~~~~~~~~~~~~~~~-r~Dg~v~~~ 385 (402) T protein:vir:93 339 ----------------A-VKPIVGDFNYFGINYDGTT----Y-----DT------DKDVKKGEYLFVLTA-WYDQQRTLD 385 (402) T ss_pred ----------------C-Cceeeechhhhhhhhhhhh----h-----hh------hhcccCCceEEEEEE-EeCcEEech Confidence 1 1246676554433332111 1 11 0122 2566666554 678888999 Q ss_pred cceEEEEEecCC Q lcl|NC_013692. 388 EWIALLKTVARL 399 (399) Q Consensus 388 ~~m~~iet~A~~ 399 (399) +=+..++.-+.- T Consensus 386 ~A~~~l~ik~~~ 397 (402) T protein:vir:93 386 SAFRIAKAKENT 397 (402) T ss_pred hheEEEEeecCC Confidence 988888875555 No 154 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=93.95 E-value=0.0059 Score=32.82 Aligned_cols=301 Identities=12% Similarity=0.034 Sum_probs=135.9 Q ss_pred CCCccccccccccCCCCCCcccc-cccceehhhh-----hHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESS-IGPQIHTRYW-----YKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~-i~p~~~t~y~-----~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) |+. ++.... -++ |-|++=+-|= +|..|+..- ++..-++...+=...|.+|..=.|.+|.. T Consensus 1 M~~----~~~~T~-------l~Dii~pEvF~~Yv~~~~~e~~~l~qSG---iv~~d~~l~~~~~~gG~~v~iPf~~~L~g 66 (367) T protein:vir:80 1 MPD----FNNQVR-------LVDAVIPEVYTSYTAIDRPELTAFFLSG---AVASNDFLSQFLSAPGRLINIPFWRDLDS 66 (367) T ss_pred Ccc----hhhhhh-------hhhccchhhhhHHHhhhhhhhhhhhhcc---eeecCHHHHHHhhcCCCEEEeeeeccCCC Confidence 332 111110 011 1233211221 122222210 01111111111134678888877777743 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchh Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPA 154 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~ 154 (399) +.-- |++..|...+|+. .+|...-.+.+..+|.=...+|.+.+..-++ T Consensus 67 ~~~n-----------------~~~d~~~~~~t~~--------------kittg~~~a~v~~r~kaw~~~Dla~~lsG~d- 114 (367) T protein:vir:80 67 LEPN-----------------YGSDNPNVEAPID--------------GLGSGEMKTTKTWLNKAYGAMDLTAELAGSN- 114 (367) T ss_pred Cccc-----------------cCCCCCccccccc--------------ccccchheeeeehhcccchhhhHHHHhhCch- Confidence 2211 1222222112211 0122344677888888888888887765543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHhcC--cee----ee----c------ccccc---ccccCCcceecHHHHHHH Q lcl|NC_013692. 155 MEGHVTTEMVKGANEITEDLLQI---DLLNSA--GTV----RY----P------GAATS---DAEVDATTEVTYDSLMRL 212 (399) Q Consensus 155 L~~~i~~el~~~~~~~t~d~l~~---~~l~ag--t~V----~Y----A------g~aTs---ra~v~~~~~vt~~~lr~a 212 (399) -|++|..++..--....+..|.. ++++.. .+. .| | +.-+- ..+-+....+|.+.+-.| T Consensus 115 pm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A 194 (367) T protein:vir:80 115 PMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDA 194 (367) T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHH Confidence 34677666655444444433222 333211 110 00 0 00000 011112366899999999 Q ss_pred HHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEE Q lcl|NC_013692. 213 RLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRV 292 (399) Q Consensus 213 ~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~Rf 292 (399) ...|-.+...- =+++|||.....|+.+ ..+.-.+|.+. +.+|+.+-+.|+ T Consensus 195 ~~~lGD~~~~l-------------------~~i~mHS~V~~~L~~~-------~li~~i~~sd~----~~~i~ty~G~~V 244 (367) T protein:vir:80 195 AFTMGDHVGSI-------------------AAIAVHSMVYKRMTNN-------DEIEFIPDSKG----QLTIPTYMGKVV 244 (367) T ss_pred HHHhccccccc-------------------cEEEEchHHHHHHHhc-------cccccccCCCC----ccccceecceeE Confidence 87777765532 4789999999999975 46777778876 358999999999 Q ss_pred EecCcccccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchh- Q lcl|NC_013692. 293 IVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEM- 371 (399) Q Consensus 293 V~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQr- 371 (399) |+...|-.. +. +...+|-..+||..||+--. +....+.-+...|-.+--+.-|=|=.| T Consensus 245 IvDD~~Pv~-~~----------------~a~~~yttYlfg~GAi~~~~----~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr 303 (367) T protein:vir:80 245 IVDDGMPVF-GT----------------GADKTYLSILFGGAAFGYAD----GAPQVPVAVGRRELRGNGSGLEYILERK 303 (367) T ss_pred EEeCCCccc-cc----------------CCCceEEEEEEecceeeecc----cCCccceecccchhhhcCCceEEEEeee Confidence 998864332 11 12238999999999998322 211111111111100000111111111 Q ss_pred ----hHHHHHHHHHHhhc---------------cccceEEEEEecCC Q lcl|NC_013692. 372 ----GFMSIKWYYGFMVF---------------RPEWIALLKTVARL 399 (399) Q Consensus 372 ----g~~gwK~~~~~~iL---------------n~~~m~~iet~A~~ 399 (399) +-.|.||--++.+- .+-+ +-|+.++-- T Consensus 304 ~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~-~eLa~~~NW 349 (367) T protein:vir:80 304 EWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITL-ANLANPDNW 349 (367) T ss_pred eEEeecceeeecccccccccccccccccccccCCCCh-HHhcCCccc Confidence 11222332221110 0000 000011111 No 155 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=93.31 E-value=0.008 Score=32.08 Aligned_cols=268 Identities=15% Similarity=0.140 Sum_probs=125.6 Q ss_pred CCC--------ccccc-cccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccC Q lcl|NC_013692. 1 MAG--------PVDNI-KPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIP 71 (399) Q Consensus 1 ~~~--------~~~~~-~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~p 71 (399) +.+ ....+ +.+....++. -+-+-|+ -+ ..+.+........+.+++.+.++. |.++ .+.... T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~~~--gG~lIP~---~~-~~~Ii~~l~~~s~l~~~~~v~~~~---~~~~-p~~~~~ 133 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGNDSG--GDKLLPK---TL-SKEIVSEPFAKNQLREKARLTNIK---GLEI-PRVSYT 133 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCCCC--CceeccH---hH-HHHHHHHHHhhcchhhheeeEecC---CceE-EEEecC Confidence 000 00000 1111111111 1223343 12 344444444445566677665542 2222 111111 Q ss_pred CcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhh Q lcl|NC_013692. 72 LLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDS 151 (399) Q Consensus 72 l~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~ 151 (399) . +++.+.. +|.......+++..|+.+.++|+.++.+|+++.+ ++ T Consensus 134 ~------------~~a~~v~-----------------------E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~-Ds 177 (352) T protein:vir:78 134 L------------DDDDFIT-----------------------DVETAKELKLKGDTVKFTTNKFKVFAAISDTVIH-GS 177 (352) T ss_pred C------------Ccccccc-----------------------cccccccccccceeeeecceeEEeechhhHHHHh-hh Confidence 1 1122222 2222333346778899999999999999999665 45 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-e-----eeeccccccccccCCcceecHHHHHHHHHHHHhccCcccc Q lcl|NC_013692. 152 DPAMEGHVTTEMVKGANEITEDLLQIDLLNSAG-T-----VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKI 225 (399) Q Consensus 152 D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt-~-----V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T 225 (399) ++.|.+.|..+|.+.-+. .++.+ ++..|+ + ..++.+ -.++++. -++++|..+.-.|+..-.. T Consensus 178 ~~~l~~~i~~~la~~~~~-~e~~~---~~~~g~g~~~~~g~l~~~~---~~~~t~~--~~~d~i~~~~~~l~~~~~~--- 245 (352) T protein:vir:78 178 DVDLVNWVENALQSGLAA-KERKD---ALAVSPKSGLEHMSFYNGS---VKEVEGA--NMYDAIINALADLHEDYRD--- 245 (352) T ss_pred hHHHHHHHHHHHHHHHHH-HHHHh---hhhcCCCCcccccceeccc---ccccccc--chHHHHHHHHhccChhhhc--- Confidence 566777777676665543 33221 221221 1 122211 1223322 2477888777766544321 Q ss_pred ceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCC Q lcl|NC_013692. 226 KMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVG 305 (399) Q Consensus 226 ~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aG 305 (399) .+ +.+|++.....++.+++--+. +++.+.-..+-|..++.+.- T Consensus 246 ---------------~a-~~~mn~~t~~~l~~~~~~~~~-------------~~~~~~~~~llG~PV~~~~~-------- 288 (352) T protein:vir:78 246 ---------------NA-TIYMRYADYVKIISVLSNGTT-------------NFFDTPAEKVFGKPVVFTDA-------- 288 (352) T ss_pred ---------------CC-EEEEehHHHHHHHHHHhccCC-------------cccccCCccccccceEEecC-------- Confidence 12 246788777777776542222 23333333444444444321 Q ss_pred cccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCcc-chhhHHHHHHHHHHhh Q lcl|NC_013692. 306 KAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPY-GEMGFMSIKWYYGFMV 384 (399) Q Consensus 306 a~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPl-gQrg~~gwK~~~~~~i 384 (399) . +-++||.=++.-+...+. ++.+- .|++ |+.+|.+. .++.+.+ T Consensus 289 ----------------~----~~~~~Gdf~~~~~~~~~~--------~~~~~-------~~~~~g~~~f~~~-~r~Dg~~ 332 (352) T protein:vir:78 289 ----------------A----VKPIVGDFNYFGINYDGT--------TYDTD-------KDVKKGEYLFVLT-AWYDQQR 332 (352) T ss_pred ----------------C----CceeEeehhhhhhhhhhh--------eeeee-------ccccCCeeEEEEE-eeeCcee Confidence 1 113567755554443211 11111 1111 23333332 4778888 Q ss_pred ccccceEEEEEecCC Q lcl|NC_013692. 385 FRPEWIALLKTVARL 399 (399) Q Consensus 385 Ln~~~m~~iet~A~~ 399 (399) .+++=+..+++.|.= T Consensus 333 ~~~eA~~~l~~~a~~ 347 (352) T protein:vir:78 333 TLDSAFRIAKAKEST 347 (352) T ss_pred echhheEEEEeeccc Confidence 889888888877766 No 156 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=93.16 E-value=0.0086 Score=31.92 Aligned_cols=284 Identities=13% Similarity=0.016 Sum_probs=123.7 Q ss_pred CCCccccccccccCCC-CCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDP-ANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~-~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~ 76 (399) +...+.+..-..++.- ..+++ +-+-|+ -+ ..+.+....+.-.+.+++.+.++..+..+ |.. ... T Consensus 68 ~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~---~~-~~~I~~~~~~~s~i~~~~~~~~~~~~~~~-i~~--~~~----- 135 (390) T protein:vir:40 68 GANALTSDESKYYNEVIAGNGFAGVTALLPP---TV-FERVFEDLTVEHPLLSKINFVNTTATTEW-IIS--VGD----- 135 (390) T ss_pred CchhccHHHHHHHHHHHhccCcccCcccccH---HH-HHHHHHHHHhhhhhhhhceeeecCCceeE-EEE--EcC----- Confidence 1111111111111110 00111 222343 22 35555566666667788888877543322 211 110 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) .|...+..+| +..|+. -..++..|+-+.++++.+..+|+++.+ +++..|. T Consensus 136 -------~~~a~~~~E~-------------~~~~~~---------~~~~f~~i~l~~~k~~~~i~iS~ell~-ds~~~l~ 185 (390) T protein:vir:40 136 -------VATAWWGPLC-------------AEIKEV---------LDNGFDKIQTGMYKLSAYIPVCNAMLD-LGPSWLD 185 (390) T ss_pred -------Ccceeeeccc-------------cccCcc---------ccccceeeEeeeeeEEEeehhhHHHHh-cchHHHH Confidence 1111221110 111111 123678899999999999999999888 4454566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCce------eeecccccccc-ccCCcceecHHHHHHHHHHHH----hccCcccc Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGT------VRYPGAATSDA-EVDATTEVTYDSLMRLRLDLD----NARAPTKI 225 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~aTsra-~v~~~~~vt~~~lr~a~~~Lk----~nrApk~T 225 (399) +.|..+|...-+. .+-..+|++-++ +...+.++... .......++..++-.+...|+ .+-. T Consensus 186 ~~i~~~la~~i~~----~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~---- 257 (390) T protein:vir:40 186 QYVRTILGEAMAL----GLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGK---- 257 (390) T ss_pred HHHHHHHHHHHHH----HHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchh---- Confidence 6565555554444 233344443111 01111111100 011123344444333333332 2211 Q ss_pred ceeccccccCcccccCeeEEEechhhhHH-HHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccC Q lcl|NC_013692. 226 KMITGTRMIDTRTVGNARALYVGSDLVPT-IEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGV 304 (399) Q Consensus 226 ~ii~~s~~~gT~~I~~~yv~~~h~dl~~d-i~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~a 304 (399) +..+.+ +.+||+....+ |+.+. .++|+.=.++ ... ..-++++|+++.|. + T Consensus 258 -----------~~~~~a-~~i~n~~t~~~~l~~~~-~~~d~~G~~v---------~~~---~~~g~pvv~~~~~p----~ 308 (390) T protein:vir:40 258 -----------KSVSDA-ILVINPADYWSKIYAAT-SYMTPQGVWV---------TGI---LPVPLEIVQSVAVP----V 308 (390) T ss_pred -----------hhhcCc-eEEEcchhHHHHHHHHh-hccCCCCccc---------ccc---CCCceeEEEcCCCC----C Confidence 111222 34688765332 33221 1122221111 111 12367888877642 0 Q ss_pred CcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH--HHHHH Q lcl|NC_013692. 305 GKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK--WYYGF 382 (399) Q Consensus 305 Ga~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK--~~~~~ 382 (399) | + ++||.-+...++.+ +.+.+ -+. .+.+-.++..+++ ..+.+ T Consensus 309 ~------------------~----i~~Gd~s~~~i~~~---~~~~v--~~~---------~~~~f~~~~~~~r~~~r~dg 352 (390) T protein:vir:40 309 G------------------K----AVAGRAKDYFMGIG---SEQVI--RTS---------TEYRLLDDETLYYAKQYANG 352 (390) T ss_pred C------------------c----EEEEeeceEEEEee---cceEE--Eec---------chhhhhcCcEEEEEEEEeCC Confidence 0 1 56776555555443 22222 111 1123333444444 56778 Q ss_pred hhccccceEEEEEecCC Q lcl|NC_013692. 383 MVFRPEWIALLKTVARL 399 (399) Q Consensus 383 ~iLn~~~m~~iet~A~~ 399 (399) .+.+++=++.++..+.- T Consensus 353 ~v~~~~A~~~l~~~~~~ 369 (390) T protein:vir:40 353 RPKDNSSFLVFDITGLE 369 (390) T ss_pred EEecccceEEEEeeccC Confidence 88888888888876664 No 157 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=91.12 E-value=0.017 Score=30.23 Aligned_cols=280 Identities=13% Similarity=0.150 Sum_probs=120.0 Q ss_pred cccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhcccccccc Q lcl|NC_013692. 24 IGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVG 103 (399) Q Consensus 24 i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g 103 (399) ++=..-+ -|...+++.....++.+.+... ..==+.|++||+-+-.- .|+-.= .-..|+-+| T Consensus 1 Main~a~-~~~~~Ld~~~~~~~~t~~l~~~-~~~~~ggktVkI~~i~~---------~gl~DY--~R~~g~~~g------ 61 (290) T protein:vir:78 1 MAINYVD-KYGKELDQKLVFGTYTNELETP-NLLWLDAKTFKIQTITT---------TGLKAH--TRNKGYNEG------ 61 (290) T ss_pred CchhHHH-HHHHHHHHHHHhhheeeecccc-ceeeccCCEEEEeeecc---------Cccccc--ccCCCcccC------ Confidence 1111011 2566677777788888887533 33335679999966431 121110 000111111 Q ss_pred ccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH---HHHHHHHHHH Q lcl|NC_013692. 104 NITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI---TEDLLQIDLL 180 (399) Q Consensus 104 ~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~---t~d~l~~~~l 180 (399) ++ ..+.+..+-+-..| .+|+=...|-+|=... -.+...+.+.+.+. -.|..+..-| T Consensus 62 ~v-----------------~~~~et~tl~qdR~---~~F~vD~~DvDEt~~~-~~~~nv~~ef~~~~v~PEiDayr~skl 120 (290) T protein:vir:78 62 SA-----------------SNTNKSYTIDFDRD---VEFFVDVMDVDETGQA-LSAANVTKEFNSRHAGPEMDAYRFSKL 120 (290) T ss_pred cc-----------------ccceeeEEeecccc---ceeeccccchhHHhhh-hhHHHHHHHHHHHHhhhhhhHHHHHHH Confidence 11 12334444332222 1111111122211111 11222333333332 2344444444 Q ss_pred hcCceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhh Q lcl|NC_013692. 181 NSAGTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKD 260 (399) Q Consensus 181 ~agt~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~ 260 (399) .++.. +.+..-..+++... -++.|+.+...|++ .| +..+++||.|+.-..|+. T Consensus 121 a~~a~---~~~~~~~~t~t~~n--~~~~i~~~~~~lde--vp-----------------~~~rvl~vtp~~~~lL~~--- 173 (290) T protein:vir:78 121 ATAAK---TNSNSVAEEITKDN--VFTKLKAAIRKVKK--YG-----------------TQNLVMYVSPDVMAALEL--- 173 (290) T ss_pred Hhhhh---ccCcccccccCHHH--HHHHHHHHHHHHHh--cC-----------------CCCeEEEECHHHHHHHhh--- Confidence 22211 00111111222221 23446666666653 33 235999999999998874 Q ss_pred hcCCCCceehhhcCC-ccccccccceeEcCeEEEecCcccc------cccCCcccCCcccccccccCccceeEEEEEEcc Q lcl|NC_013692. 261 NHGNPAFIPIEKYAA-GGATMHGEVGQLGRFRVIVNPQMMH------WAGVGKAVDPNDQVPMHESGGKYSVFPMLCVAS 333 (399) Q Consensus 261 ~~~~p~fi~v~kYg~-~~~i~~gEIG~i~~~RfV~~~~~~~------~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~ 333 (399) +++|..-..=++ .....++-||++++|.+|++|--.. |.+ |... ++...-..+||+=+ T Consensus 174 ---~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~-G~~~-----------~~~ak~in~ii~~~ 238 (290) T protein:vir:78 174 ---SDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTD-GYKP-----------AAGAKKLNFLLVNK 238 (290) T ss_pred ---ChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcc-cccc-----------cCCccceeEEEEcC Confidence 678866333332 2335699999999999999983111 111 2111 11222334555554 Q ss_pred ccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 334 EAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 334 ~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet~A~~ 399 (399) .| .+... |--+. -.-.||.- .++| ....-.+.||-+-+|...--...-.++ | T Consensus 239 ~a--~i~~~---K~~~~--~~~~P~~~--~~~d----~~~~~~r~y~d~~v~~nk~~~i~~~~~-~ 290 (290) T protein:vir:78 239 GS--VVGGA---KHASI--YLHAPGSV--GQGD----GWLYQYRVYHDIFVLDQQKDGVIASTE-V 290 (290) T ss_pred Cc--eeeee---eeeEE--EeeCCCCC--cCcc----eeeeeeeeeeeeeeeccccCeeEEEee-C Confidence 33 22222 11122 23357732 2222 111222345666666544333322222 2 No 158 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=91.06 E-value=0.018 Score=30.19 Aligned_cols=282 Identities=14% Similarity=0.037 Sum_probs=131.6 Q ss_pred CCC-cccccc---c--cccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAG-PVDNIK---P--MKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~-~~~~~~---~--~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~ 74 (399) |+. -|+++- . ..++.........|.|++. .+++......-.+.+.+.+.++-...|+..++ -+.+-. T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~-----~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~-~~~~~~- 73 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLW-----DEFWTDMIEETPLLDAIRTETVGAKKTRIPTL-NIGERH- 73 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHH-----HHHHHHHHHhhhhhhhceeeeccCcceeeeee-ccCCcc- Confidence 432 223320 1 1111111111234556543 34454444555677889988887777765443 111111 Q ss_pred CCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh-hhch Q lcl|NC_013692. 75 DRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF-DSDP 153 (399) Q Consensus 75 ~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t-~~D~ 153 (399) ..+ .+|| .++-++-.++...++-+++|+..+.++|+++.+- ++-| T Consensus 74 ~~~-~~e~---------------------------------~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~ 119 (321) T protein:vir:31 74 RRP-QDEG---------------------------------EWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGE 119 (321) T ss_pred ccc-cccc---------------------------------ccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcch Confidence 010 0122 2233444456677899999999999999998864 3345 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCce------------eeeccccccccccCCcceecHHHHHHHHHHHHhccC Q lcl|NC_013692. 154 AMEGHVTTEMVKGANEITEDLLQIDLLNSAGT------------VRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARA 221 (399) Q Consensus 154 ~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~------------V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrA 221 (399) .+.+.|...+.+ +....++.+ .+++-+. +..+.+..... ..++..++.+.|.++...|+.... T Consensus 120 d~e~~i~~~ia~-~~a~~~~~~---~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~-~~~~~~~~~d~l~~l~~~l~~~yr 194 (321) T protein:vir:31 120 ALADRILNLMTD-AWSADVEDL---AANGDEDAEDSFENQNDGFITVAEGDVETI-DAADDILDNDLVIRTIAGLDSKYR 194 (321) T ss_pred hHHHHHHHHHHH-HHHHHHHhh---eeeccccCCCcccccchhhhhhhccccccc-cccccccCHHHHHHHHHhccHhHh Confidence 676655443333 333233222 2222111 00110011111 112356888999888887764321 Q ss_pred ccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccc Q lcl|NC_013692. 222 PTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHW 301 (399) Q Consensus 222 pk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~ 301 (399) . .+..+.+|++++..+++....--+.|-|.+. +..+...++.|+.++.+|.|-. T Consensus 195 ~-----------------~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~--------l~~~~~~tl~G~pvv~~~~mP~- 248 (321) T protein:vir:31 195 A-----------------RMNPALIVSEDQLLSYHYTLTDRDTPLGDNV--------IMGEADVNPFSFPIIGSGLWPD- 248 (321) T ss_pred c-----------------CCCeEEEechHHHHHHHHHHhcCCCccccch--------hhccccccccceeEEEcCCCCC- Confidence 1 1136899999999888874332333333332 3344555788999999887631 Q ss_pred ccCCcccCCccccc--ccccCcccee-------------EEEEEEcccc-----ceecccccCCCCCcceEEEecCC Q lcl|NC_013692. 302 AGVGKAVDPNDQVP--MHESGGKYSV-------------FPMLCVASEA-----FTTVGFATDGKNVKFKIITKRPG 358 (399) Q Consensus 302 ~~aGa~~~~~~~~~--~~~~~~~~DV-------------Yp~lV~G~~A-----fg~v~l~~~g~~~k~~~ivk~pG 358 (399) ++..-+..... +...+.+..+ |-.-....+. |+.+.+-. |-..+++++..-+. T Consensus 249 ---~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~-~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 249 ---DKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAE-GLGDPLEHLEEETS 321 (321) T ss_pred ---CcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEe-cCCcchhcccCCCC Confidence 11111110000 0001111111 1111222222 33333332 22345666655542 No 159 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=90.63 E-value=0.02 Score=29.91 Aligned_cols=280 Identities=15% Similarity=0.096 Sum_probs=135.2 Q ss_pred CCcccccccceehhhhhHHHHHHhhh--HHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhh Q lcl|NC_013692. 18 NGVESSIGPQIHTRYWYKRALIDAAK--EAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNL 95 (399) Q Consensus 18 ~~~~~~i~p~~~t~y~~~k~L~~A~p--~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~ 95 (399) |-.|+..-..+.+.+ +|.|.++-. .-.+.+||.. .|. -.|.-..-+. T Consensus 1 m~it~~~l~~l~~~~--~~~~~~~y~~a~~~~~~~a~~--~~s-df~~~~~~~l-------------------------- 49 (302) T protein:vir:10 1 MLINKQSLNAAFVAI--KTIFNNAFAAAPTTWQKIAME--VPS-NTSSNDYKWL-------------------------- 49 (302) T ss_pred CcccHHHHHHHHHHH--HHHHHHHHHhhhhhhhceeee--cCC-Ccceeeceec-------------------------- Confidence 322332222223333 444444321 1245778753 443 3444333222 Q ss_pred ccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 96 YGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLL 175 (399) Q Consensus 96 ~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l 175 (399) +.+|.|.|-+|......++-..-+..++.||.-+.+|.+...-| |=.+...+..+|+. ++..++|.+ T Consensus 50 -----------g~~p~l~e~~Ge~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nD-dlg~~~~~~~~~G~-aaa~~~~~l 116 (302) T protein:vir:10 50 -----------STFPKMRRWIGAKVVKNLKAYKYVVENEDFEATVEVDRNDIEDD-QIGIYSPQAKMAGY-SAAQLPDEL 116 (302) T ss_pred -----------CCCCCccccccceeeccccccceeEEeecccceecccHHhhccc-ccchhHHHHHHHHH-HHHhhHHHH Confidence 33455666544445556677778899999999999997654322 22344455555554 455588889 Q ss_pred HHHHHhcCce-eeeccc----------cccccccCCc------ceecHHHHHHHHHHHHhccCccccceeccccccCccc Q lcl|NC_013692. 176 QIDLLNSAGT-VRYPGA----------ATSDAEVDAT------TEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRT 238 (399) Q Consensus 176 ~~~~l~agt~-V~YAg~----------aTsra~v~~~------~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~ 238 (399) .+++|++|.+ +.|-|. ..++.++..+ ..++.+.|..+...+..+... .| +.+ . T Consensus 117 v~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~------~G-~~L---~ 186 (302) T protein:vir:10 117 VYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDE------EG-RSL---N 186 (302) T ss_pred HHHHHhccCCCcccCCcceecccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhh------cc-ccc---c Confidence 9999988643 334322 1222222211 245555555554444444332 12 223 3 Q ss_pred ccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccc Q lcl|NC_013692. 239 VGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHE 318 (399) Q Consensus 239 I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~ 318 (399) |.+.+ .+|+|+++..-+.+. .++ .++ -+..-+ +. +.++.|+.|.+ ++|.+ =.+. . T Consensus 187 i~P~~-LiVp~~le~~A~~ll---~~~-~~~---~g~~Np-~~------g~~~~vv~p~L----~s~~a---WyL~--a- 241 (302) T protein:vir:10 187 VSPNV-LLVGPALEDVAKMLL---TNP-KLA---DNTPNP-YV------GTAELVVDGRI----ESDTA---WFLL--D- 241 (302) T ss_pred cCCCE-EEecchhHHHHHHHh---hcc-ccC---CCCcce-ec------cceEEEEeecc----CCCCc---eEEE--e- Confidence 55556 689999999998863 121 110 122222 11 34688888765 22211 1111 0 Q ss_pred cCccceeEEEEEEcccccee---cccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhccccceEEEEE Q lcl|NC_013692. 319 SGGKYSVFPMLCVASEAFTT---VGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFRPEWIALLKT 395 (399) Q Consensus 319 ~~~~~DVYp~lV~G~~Afg~---v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn~~~m~~iet 395 (399) ....+++. ..=|.+.--. .++..+| +++++.+.= | .|-=+-.|+.-|.++|++.= + T Consensus 242 ~~~~i~~~--~l~g~~~P~~~~~~~~~~dg--v~~k~~~d~-G------vd~R~~~G~~~wq~a~~s~g----------~ 300 (302) T protein:vir:10 242 TTKPVKPF--IFQPRKQPEFVSQVNLDSDD--VFNLRKLKF-G------AEARAAAGYGFWQLAYGSTG----------T 300 (302) T ss_pred cCCccceE--EEcCccccEEEeccCCCCCc--eEEEEEEEE-e------eeeeeecchhhhhhhhccCc----------c Confidence 11123332 2223333221 1222222 222222221 2 24455667777787777653 2 Q ss_pred ec Q lcl|NC_013692. 396 VA 397 (399) Q Consensus 396 ~A 397 (399) +| T Consensus 301 ~~ 302 (302) T protein:vir:10 301 GA 302 (302) T ss_pred CC Confidence 22 No 160 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=89.50 E-value=0.026 Score=29.28 Aligned_cols=289 Identities=10% Similarity=-0.065 Sum_probs=125.2 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhccccc-ccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTF-SMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~-~mPKn~GktIkfrry~pl~~~~t~l 79 (399) +-|-++++. ...+.+.. .-+-+-|+ .++ +.+......-.+.+.+.+. +|=.+ +.++ .-+. ...+. T Consensus 8 ~~~~~~~~~-k~~t~~d~-~Gg~l~P~----~~~-~~i~~~~e~s~~l~~~~vi~~~~~~---~~~i---~~~g-~~~~~ 73 (315) T protein:vir:41 8 RGGKPFEIV-PKIDVPDL-GRGVLSVD----RFG-EFVKAVRDSAVIIPEARIDNALKSY---EKDI---SRLS-LVLDV 73 (315) T ss_pred hcCChhhhh-hhcCCcCC-CCceechH----HHH-HHHHHHHhhhhhhhhceeeeccccc---cccc---cccc-cCccc Confidence 222223322 11221111 11223444 333 4555555566777888763 33111 1111 0010 00011 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhh-hchhHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFD-SDPAMEGH 158 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~-~D~~L~~~ 158 (399) ..|.+..| .+.+..+-..++..++-..+++..+.++|+++.+-. ..+.+.+. T Consensus 74 ~~g~~~~~---------------------------~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~ 126 (315) T protein:vir:41 74 GPGRDETG---------------------------QKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQK 126 (315) T ss_pred cccccccc---------------------------CcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHH Confidence 11222111 111222223456678889999999999999987643 34667666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC------------ceeeeccccccccccCCc-ceecHHHHHHHHHHHHhc-cCccc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSA------------GTVRYPGAATSDAEVDAT-TEVTYDSLMRLRLDLDNA-RAPTK 224 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~ag------------t~V~YAg~aTsra~v~~~-~~vt~~~lr~a~~~Lk~n-rApk~ 224 (399) |..++.+.-+. .+ ..-.+++- +-+.+|+.......++.. ..++.+.|..++..|+.. |.- T Consensus 127 l~~~~a~~~a~-~~---~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~-- 200 (315) T protein:vir:41 127 IVTLLGEGISY-VL---EKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNN-- 200 (315) T ss_pred HHHHHHHHHHH-HH---HHHhhccCCcCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhc-- Confidence 65555554333 22 22344442 223344332222222222 346788899998888752 210 Q ss_pred cceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccC Q lcl|NC_013692. 225 IKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGV 304 (399) Q Consensus 225 T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~a 304 (399) ...-+-+|+.+....||.+++--+.+-|-|...=|.+. .+-+..++..|.|-..... T Consensus 201 ---------------~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~--------tl~G~PV~~~~~m~~~~~~ 257 (315) T protein:vir:41 201 ---------------LPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSI--------LYDGRPVQYVPALEALNDG 257 (315) T ss_pred ---------------CCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCc--------eecccceEecccccccCCC Confidence 01256789999999999999877888888776555444 4556777777776433221 Q ss_pred CcccCCcccccc---cccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHH Q lcl|NC_013692. 305 GKAVDPNDQVPM---HESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYG 381 (399) Q Consensus 305 Ga~~~~~~~~~~---~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~ 381 (399) -+..-.+....+ ....-+.++++-.-.+...|- ..++.|-. |.-.=++..|..-. T Consensus 258 ~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~---------------~~~r~d~~-------~~~~~~~a~~~~~v 315 (315) T protein:vir:41 258 KSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYV---------------ASLRTDNH-------YEDEEGAVSATITV 315 (315) T ss_pred CccEEEecccceEEEeccccEEEeeecCCCCceEEE---------------EEEEecee-------EEeccceeEeeeeC Confidence 111111100000 000011111111000000000 00111100 00000000000000 No 161 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=89.24 E-value=0.027 Score=29.14 Aligned_cols=274 Identities=10% Similarity=0.054 Sum_probs=125.0 Q ss_pred cccccccc-----cCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 5 VDNIKPMK-----YNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 5 ~~~~~~~~-----~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) |+-.+-+. .+.+.. ..+-+.|+ -+ .|++...++.-.+.+.|.+.+.-++....|-.--+ .. .+ T Consensus 1 ~~~~~~~~~~~k~it~~d~-~gG~L~P~----~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~-----g~-~~ 68 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDL-GKGILAVQ----RF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISL-----GV-EL 68 (314) T ss_pred CchhhhHHHhhcccccccC-CCceeChH----HH-HHHHHHHHhccchhhheeeecccCccceeeccccc-----Cc-cc Confidence 33222111 111211 12234454 33 46666666777888888866433332223321000 00 00 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhh-chhHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDS-DPAMEGH 158 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~-D~~L~~~ 158 (399) ..|.+-. +.+...++...++..++-..+|+..++.+|+++.+-.. -++|.+. T Consensus 69 ~~~~~~~---------------------------~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~ 121 (314) T protein:vir:41 69 EPGRNTS---------------------------GTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQT 121 (314) T ss_pred ccccccc---------------------------cCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHH Confidence 0011000 00111122334666778888999999999988876433 3568777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCcee--------------eecccc-ccccccCCcceecHHHHHHHHHHHHhccCcc Q lcl|NC_013692. 159 VTTEMVKGANEITEDLLQIDLLNSAGTV--------------RYPGAA-TSDAEVDATTEVTYDSLMRLRLDLDNARAPT 223 (399) Q Consensus 159 i~~el~~~~~~~t~d~l~~~~l~agt~V--------------~YAg~a-Tsra~v~~~~~vt~~~lr~a~~~Lk~nrApk 223 (399) |..++.+.-+. .++.+ .+++-++. .-|+++ ++.+ ..+...+.+.|.+++..|+.-..+ T Consensus 122 i~~~~Ae~~g~-~~~~~---~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~--~~~~~~~~~~~~~l~~sl~~~yr~- 194 (314) T protein:vir:41 122 ITSLLASGVTY-DLECF---FLHADSSLTTGRELYRINDGWMKLAGNQYTDAE--PEDENWPLNLFDGMMDELDTRYLQ- 194 (314) T ss_pred HHHHHHHHHHH-HHHHH---hhccccCCcCcccchhcchhhhhhcccceeecC--ccccccHHHHHHHHHHhcCchhhc- Confidence 76665554333 33332 22222210 001111 1100 112457788899999888663211 Q ss_pred ccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc Q lcl|NC_013692. 224 KIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG 303 (399) Q Consensus 224 ~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~ 303 (399) -+.-.+.+||++....+|.+.+--+.+.|-+... .+.--.+-++.++..|.|. + T Consensus 195 ---------------~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~--------~~~~~~l~G~PV~~~~~~~---~ 248 (314) T protein:vir:41 195 ---------------LKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALI--------GATGLQYDGIPIQYVPALD---A 248 (314) T ss_pred ---------------CCCceEEEecHHHHHHHHHHHhccCCcccchhhh--------CCCCceecceeeEeccccc---c Confidence 0012778899999999999876555566655533 3344457788888888763 2 Q ss_pred CCcccCCcccccccccCccceeEEEEEEccc-------------------cceecccccCCCCCcceEEEecCCCcCCC Q lcl|NC_013692. 304 VGKAVDPNDQVPMHESGGKYSVFPMLCVASE-------------------AFTTVGFATDGKNVKFKIITKRPGEATAD 363 (399) Q Consensus 304 aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~-------------------Afg~v~l~~~g~~~k~~~ivk~pG~~tad 363 (399) .|+...+-.. +--.++ || +++.+ .+..+.+.-+.. -...++++.- ++ T Consensus 249 ~~~~~~~i~f----gd~~nl-v~---~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~a--a~~~~~~~~~---~~ 314 (314) T protein:vir:41 249 LGDDKARALL----TVPTNL-VY---GFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENA--AVAAVIDMSS---GG 314 (314) T ss_pred cCCCCceEEE----echhhe-EE---EeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCc--EEEEEeeccC---CC Confidence 2222221111 000111 11 11111 111222221111 1222333331 22 No 162 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=84.76 E-value=0.058 Score=27.37 Aligned_cols=301 Identities=11% Similarity=0.059 Sum_probs=134.4 Q ss_pred CCccc--c-cccc--eehhhhhHHHHHHhhhHH--hhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhh Q lcl|NC_013692. 18 NGVES--S-IGPQ--IHTRYWYKRALIDAAKEA--YFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATI 90 (399) Q Consensus 18 ~~~~~--~-i~p~--~~t~y~~~k~L~~A~p~l--v~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~ 90 (399) |++|. + |.|+ +-+-|...+..+..+=.+ ++..-++...+=..-|+.+..=.|.+|..+.-++..+=++++... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 33221 1 3454 334465555544422100 111111222222345888888778888643333322211111000 Q ss_pred hhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh-hhchhHHHHHHHHHHHHHHH Q lcl|NC_013692. 91 ANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF-DSDPAMEGHVTTEMVKGANE 169 (399) Q Consensus 91 ~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t-~~D~~L~~~i~~el~~~~~~ 169 (399) . +.| +--.-.+.+..+|.=...+|.+.+. .+|| |++|..++..--.. T Consensus 81 ~-----------~ki-------------------tt~~~~a~~~~r~kaw~~~Dla~~lsG~dp--m~~Ia~~va~yW~r 128 (349) T protein:vir:78 81 P-----------RAI-------------------QTGEMMARVAYLNEGFGQADLTVELTSQNP--LQSVASRLDNFWQR 128 (349) T ss_pred c-----------ccc-------------------cccceeeeeeeeccccchhHHHHHhhCchH--HHHHHHHHHHHHhh Confidence 0 011 2223466677778778888866554 5555 46787766665555 Q ss_pred HHHHHHHH---HHHhcC---c-eeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCe Q lcl|NC_013692. 170 ITEDLLQI---DLLNSA---G-TVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNA 242 (399) Q Consensus 170 ~t~d~l~~---~~l~ag---t-~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~ 242 (399) ..+..|.. ++++.+ . +..+.++-+ ..++.+...|.+.+-.+...|-..-+ |-. =..= T Consensus 129 ~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t--~d~s~~a~~~~~~~~dA~~~lgda~~--------Gd~------~~~l 192 (349) T protein:vir:78 129 QAQRRLIATALGLYNDNVSATDAYHEQNDMV--VDVSATLGFDAGAFIDATQTMGDALM--------GNG------GEVL 192 (349) T ss_pred HHHHHHHHHHHHhhcccccccchhhhcccce--eeeccccCCChhhhhhhHHHHHHHhc--------ccc------ccce Confidence 44444222 333211 1 111111111 11223334566666666554433210 000 0112 Q ss_pred eEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccCcc Q lcl|NC_013692. 243 RALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGK 322 (399) Q Consensus 243 yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~ 322 (399) =+++|||.....|++.. .+.-.+...+ +-.|+.+-+-|+|+..-|-.. + .+. T Consensus 193 t~i~mHS~v~~~L~~~~-------li~~i~~s~~----~~~i~ty~G~~VivDD~~Pv~-~----------------~g~ 244 (349) T protein:vir:78 193 GAIAMHSFVYAQARKAQ-------LIDFIRDAEN----NTMFATYQGYRVIVDDSMTVV-G----------------QGA 244 (349) T ss_pred eEEEEchHHHHHHHhhh-------hhhhccCccc----CcccceecCeEEEEeCCCccc-c----------------CCC Confidence 47899999999999752 3333333333 236899999999999875322 1 123 Q ss_pred ceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchh-----hHHHHHHHHHHhhc--------cccc Q lcl|NC_013692. 323 YSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEM-----GFMSIKWYYGFMVF--------RPEW 389 (399) Q Consensus 323 ~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQr-----g~~gwK~~~~~~iL--------n~~~ 389 (399) ..||-+.+||..|++--. +....+.-+...|-.+.-+.-|=|=.| +-.|.||--++..- .+-| T Consensus 245 ~~~yttylfg~GAi~~~~----~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~ 320 (349) T protein:vir:78 245 QRKFISIIFGQGAIGYGE----GNPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNGTETIARSASW 320 (349) T ss_pred CceEEEEEeecceEEEcc----CCCccceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccccccCCccccccCCCCh Confidence 458999999999987422 211112222233311111222322221 22233333222110 0000 Q ss_pred eEEEEEecCC Q lcl|NC_013692. 390 IALLKTVARL 399 (399) Q Consensus 390 m~~iet~A~~ 399 (399) +-|+.++-- T Consensus 321 -aeLa~~~NW 329 (349) T protein:vir:78 321 -QDLANATNW 329 (349) T ss_pred -HHhcCCcCc Confidence 011111111 No 163 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=84.74 E-value=0.058 Score=27.36 Aligned_cols=297 Identities=12% Similarity=0.059 Sum_probs=134.4 Q ss_pred CCCccccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccc----cccC--cCCCcEEEEEEccCCcC Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADT----FSMP--KHYGKEIVRLHYIPLLD 74 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~----~~mP--Kn~GktIkfrry~pl~~ 74 (399) .-|-...+. +.| |+..-++---+|.. |+=-+.....+...|.+. ..|| +-.|....+.|=.-++. T Consensus 9 ~~~~~~~~~-~~~--p~l~m~alTLaea~------~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~ 79 (330) T protein:vir:94 9 LRGRWRTLT-HQF--PELKMPTVTLAESA------KLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGD 79 (330) T ss_pred cccceeehh-ccc--cccchhhhhhhHHh------hcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCc Confidence 111111111 001 11111100001100 000111111222222221 1122 12333333333222322 Q ss_pred CC-ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhch Q lcl|NC_013692. 75 DR-NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDP 153 (399) Q Consensus 75 ~~-t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~ 153 (399) .. -.+-+|++|+. ..|+++++.++.-.|.+.++...+.|+..++ T Consensus 80 a~~r~~n~~~~~~~-----------------------------------~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~ 124 (330) T protein:vir:94 80 VQFLAVGGTITAKN-----------------------------------PATFTKVTSELTTLIGDAEVNGLIQATRSDF 124 (330) T ss_pred ceeeeccccccccC-----------------------------------cceeeeeeechhhhhhhHHHHHHHHHhcCCH Confidence 11 11233333321 1267889999999999999999999998864 Q ss_pred hH-HHHHHHHHHHHHHHHHHHHHHHHHHhc--------Cceeeeccccccccc-cCCcceecHHHHHHHHHHHHhcc-Cc Q lcl|NC_013692. 154 AM-EGHVTTEMVKGANEITEDLLQIDLLNS--------AGTVRYPGAATSDAE-VDATTEVTYDSLMRLRLDLDNAR-AP 222 (399) Q Consensus 154 ~L-~~~i~~el~~~~~~~t~d~l~~~~l~a--------gt~V~YAg~aTsra~-v~~~~~vt~~~lr~a~~~Lk~nr-Ap 222 (399) .= ...-.+...++-.+.. .+.++++ |.-..++ +.++.. .+.+..+|.|+|+.+.-....-+ .| T Consensus 125 ~d~~~~q~~~~ieal~~~~----e~~linGDs~~~~F~GL~~~~~--~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~ 198 (330) T protein:vir:94 125 MDQTSVQVASKAKSIGRQY----QASMITGDGTGNSFQGMMGLVA--ASQTISAGANGGTLTFELLDQLLDLVKDKDGQV 198 (330) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHhhccCCCCccccchhhcCC--cccEEecCCCCCCCCHHHHHHHHHHhcCCCCCC Confidence 21 1111222333333333 4445542 2211222 122211 12235588999998875332222 12 Q ss_pred cccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccce----eEcCeEEEecCcc Q lcl|NC_013692. 223 TKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVG----QLGRFRVIVNPQM 298 (399) Q Consensus 223 k~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG----~i~~~RfV~~~~~ 298 (399) -+.+++.-.-.-|+++.+ ..-.|+-..+..+ ..| ++.++-|+...- T Consensus 199 --------------------~~~l~n~a~~r~I~a~~R--------~~~~~~v~~~~~~-~~G~~v~~~~GvPi~~~d~- 248 (330) T protein:vir:94 199 --------------------DYLMSSFAMRRKYFSLLR--------ALGGAAIGEVMTL-PSGRQIPTYRGVPWFVNDF- 248 (330) T ss_pred --------------------cEEEechhHHHHHHHHHH--------hccCCCCCCcccc-cCCCEEeeeCCeEEEeccc- Confidence 244555555666666432 1223333222111 233 356777776542 Q ss_pred cccccCCcccCCcccccccccCccceeEEEEEEccc--cceecccccCCCCCcceEEEecCCCcCCCCCCccc-hhhHHH Q lcl|NC_013692. 299 MHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASE--AFTTVGFATDGKNVKFKIITKRPGEATADRSDPYG-EMGFMS 375 (399) Q Consensus 299 ~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~--Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlg-Qrg~~g 375 (399) ...+... .+.++.-.||.+- +|.+ --|.+||+..|. .-+-|.-+|+. |.-+ -++.+ T Consensus 249 ---ip~~~~~--------~~~~~ttsIyav~-~G~~~~~qgV~Gl~~~g~---~glsVr~~G~~-----~~k~v~~~~v- 307 (330) T protein:vir:94 249 ---IPSNMTQ--------GTATNATAIFAGT-FDDGSNKYGIAGLTARGS---AGLRVQNVGAK-----ENADETITRV- 307 (330) T ss_pred ---ccCCCCc--------ccCCCceeEEEEe-ecccccccceEeecCCCC---CcceeeeCCCc-----cccceeeEEE- Confidence 2221110 1123344577654 5644 358999986652 23566777741 1111 22333 Q ss_pred HHHHHHHhhccccceEEEEEecCC Q lcl|NC_013692. 376 IKWYYGFMVFRPEWIALLKTVARL 399 (399) Q Consensus 376 wK~~~~~~iLn~~~m~~iet~A~~ 399 (399) +||+++.+|++.-.++||-+..= T Consensus 308 -~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 308 -KMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred -EEeeeeEEechhheeeeccccCC Confidence 57999999999999999887777 No 164 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=79.91 E-value=0.1 Score=26.07 Aligned_cols=299 Identities=11% Similarity=0.045 Sum_probs=134.4 Q ss_pred CCccc--c-cccc--eehhhhhHHHHHHhhhHHhhhhccc---ccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhh Q lcl|NC_013692. 18 NGVES--S-IGPQ--IHTRYWYKRALIDAAKEAYFGQLAD---TFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGAT 89 (399) Q Consensus 18 ~~~~~--~-i~p~--~~t~y~~~k~L~~A~p~lv~~~~a~---~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~ 89 (399) |++|. + |.|+ +-+-|...+.++..+ .+--+.... ...+=..-|+.+..=.|.+|.-+..++..|-++.+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~-l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~-- 77 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTA-FFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQD-- 77 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhh-hhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccc-- Confidence 33221 1 3454 334465555544322 111111221 11222356888888777777544344444433321 Q ss_pred hhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhh-hhchhHHHHHHHHHHHHHH Q lcl|NC_013692. 90 IANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDF-DSDPAMEGHVTTEMVKGAN 168 (399) Q Consensus 90 ~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t-~~D~~L~~~i~~el~~~~~ 168 (399) .+|+. .++--.-.+.+..+|.=...+|.+.+. .+|| |++|..++..--. T Consensus 78 --------------~~t~~--------------kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dp--m~~Ia~~va~yW~ 127 (349) T protein:vir:94 78 --------------IATPR--------------AIQTGEMMARVAYLNEGFGQADLTVELTSQNP--LQSVASRLDNFWQ 127 (349) T ss_pred --------------ccccc--------------cccccceeeeeeeeccccchhHHHHHhhCchH--HHHHHHHHHHHHh Confidence 22211 012223456666777777778777665 4454 4677776666555 Q ss_pred HHHHHHHHHHHHhcC-------c-eeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCccccc Q lcl|NC_013692. 169 EITEDLLQIDLLNSA-------G-TVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVG 240 (399) Q Consensus 169 ~~t~d~l~~~~l~ag-------t-~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~ 240 (399) ...+..| ..+|++- + +..+.++.+ ..++.....+...+-.|...|-..-+ |- .=. T Consensus 128 r~~q~~L-ia~L~Gvf~~~~~~~~~~~~~~~~~--~d~~~~a~~~~~~~~~A~~~~Gdaa~-------------Gd-~~~ 190 (349) T protein:vir:94 128 RQAQRRL-IATALGLYNDNVSATDAYHEQNDMV--VDVSATSGFDAGAFIDATQTMGDALM-------------GN-GGE 190 (349) T ss_pred hHHHHHH-HHHHHhhhcccccccccccccCcee--EEecccCCCChhhHHHHHHHHHHHhc-------------cc-ccc Confidence 5444442 2333221 1 111111111 12223334566666655543332210 00 001 Q ss_pred CeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccccccccC Q lcl|NC_013692. 241 NARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESG 320 (399) Q Consensus 241 ~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~ 320 (399) .==+++|||....+|++.. .=+|+ +...+. -.|+.+-+-|+|+..-|-... . T Consensus 191 ~lt~i~mHS~v~~~L~~~~----li~~i---~~s~~~----~~i~ty~G~~VivDD~~Pv~~-----------------~ 242 (349) T protein:vir:94 191 VLGAIAMHSFVYAQARKAQ----LIDFI---RDAENN----TMFATYQGYRVIVDDSMTVVG-----------------Q 242 (349) T ss_pred ceeEEEEchHHHHHHHhcc----hhhhc---cCcccC----cccceecCcEEEEeCCCcccc-----------------C Confidence 1247899999999999853 22333 333332 258999999999998654321 1 Q ss_pred ccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccch-----hhHHHHHHHHHHhhc--------cc Q lcl|NC_013692. 321 GKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGE-----MGFMSIKWYYGFMVF--------RP 387 (399) Q Consensus 321 ~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQ-----rg~~gwK~~~~~~iL--------n~ 387 (399) +...+|-+.+||..|++--. +....+.-+...|-.+.-+.-|=|=. .+-.|.||--++..= .+ T Consensus 243 g~~~~yttylfg~GAi~~~~----~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sP 318 (349) T protein:vir:94 243 DTSRKFISIIFGQGAIGYGE----GNPEMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSA 318 (349) T ss_pred CCCceEEEEEeecceEEeec----CCCCcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCC Confidence 23448999999999988332 22111222333331111122232222 122233333222110 00 Q ss_pred cceEEEEEecCC Q lcl|NC_013692. 388 EWIALLKTVARL 399 (399) Q Consensus 388 ~~m~~iet~A~~ 399 (399) -| +-|+.++-- T Consensus 319 t~-aeLa~~~NW 329 (349) T protein:vir:94 319 SW-QDLANAANW 329 (349) T ss_pred Ch-HHhcCCcCc Confidence 00 011111111 No 165 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=78.08 E-value=0.12 Score=25.67 Aligned_cols=296 Identities=15% Similarity=0.115 Sum_probs=132.4 Q ss_pred CCCccccccccccCCCCCCccccccccee-hhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVESSIGPQIH-TRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVN 79 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~~~i~p~~~-t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~l 79 (399) |..|-+ =...|..+.. +++.=|+ +-+=.+++.+.++|.++-..|=+... -+.+..++|++-.|.-.+-.+ T Consensus 1 ~~~~~~--i~s~~~~~~i----tv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~--a~~~~~v~f~~~~p~~~~~d~- 71 (318) T protein:vir:10 1 MTAPTG--IVSVSDGPAI----TVRELVGNPLWIPTALKKMMVNQFISESLFRNGG--ANPNGVVAYNEGNPSFLEDDV- 71 (318) T ss_pred CCCCCc--ceeeecCCce----ehHHhhCCchhHHHHHHHHHhccchhhhhhhccc--ccccceeEEEecccccccCcH- Confidence 544411 1122322222 1222122 11223444444555544443322211 456679999998887544333 Q ss_pred cCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 80 DQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 80 teGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) |.| +.|..+ |+-... .| ...-+..+|||-=..+|||..+---= +.+ T Consensus 72 -e~V-aEggEi-------------------P~~~~~------~G---~~~ia~~~K~G~~~~vS~Em~~~n~~----~~v 117 (318) T protein:vir:10 72 -ADV-AEFGEI-------------------PVSAGA------RG---LPRTAFAVKKALGVRVSKEMIDENRV----GAV 117 (318) T ss_pred -hhc-cCcccc-------------------cccCCC------CC---chhhhhhehhccceeccHHHHhhcCh----hHH Confidence 555 444332 110000 01 12334556999999999998875322 333 Q ss_pred HHHHHHHHH--HHHHHHHHHHHHhcC-ceeeeccccccccccCCcceecHHHHHHHHHHHHhccCcccccee--ccc--c Q lcl|NC_013692. 160 TTEMVKGAN--EITEDLLQIDLLNSA-GTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMI--TGT--R 232 (399) Q Consensus 160 ~~el~~~~~--~~t~d~l~~~~l~ag-t~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii--~~s--~ 232 (399) .+.+..-.+ .+..|....+.|.++ +.-.=+ .++-+ +.+.+.. ++-.|....+.-++-. +. .++ . T Consensus 118 ~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~-s~~w~----~~~~~~~-d~~~A~e~v~~a~~~~---~~a~~~~~~~ 188 (318) T protein:vir:10 118 NDQMLQLRNTFIRANDRSAKALLQSPIVPTLAV-PTAWD----NGGKVRT-DIAIAIEQISTAAPTA---YPAGVGSSDE 188 (318) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccccC-CcCCC----Ccccccc-cchhhhhhhhhhhhhh---hhhhhhhhhh Confidence 343333333 345667777888544 332211 11111 1111111 3333333333333300 00 000 1 Q ss_pred ccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCcccc-----ccccc-eeEcCeEEEecCcccccccCCc Q lcl|NC_013692. 233 MIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGAT-----MHGEV-GQLGRFRVIVNPQMMHWAGVGK 306 (399) Q Consensus 233 ~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i-----~~gEI-G~i~~~RfV~~~~~~~~~~aGa 306 (399) .+|=. .=..++||.+..-|++ ++.|.++-. +...++ +.|.+ |++=++|+|.+|..-. T Consensus 189 ~~GY~----pdtIVlhP~~~~~l~~------n~~~~~~y~-~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~------ 251 (318) T protein:vir:10 189 YFGFI----PDTIVMHYALLPILMD------NENFMKVYE-RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI------ 251 (318) T ss_pred ccCcc----ceeeEECHHHHHHHhc------chhhhhhhh-ccchhhhhcccccccccceeeceEEeecCccCC------ Confidence 11100 0267999999999974 788877621 011112 24554 6678899999995210 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHHHHHHHhhcc Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIKWYYGFMVFR 386 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK~~~~~~iLn 386 (399) ++ .+||=+...| +-++...+.++. ..+ +.+||. +--..+|. ....+ T Consensus 252 --------------~~-----alvlq~g~vG---~~~d~~pl~~t~--~~~-----egg~~~-g~~~~s~~----~~~~~ 297 (318) T protein:vir:10 252 --------------DR-----VLIMERGTVG---FYSDTRPLQFTA--LYP-----EGNGPN-GGPTESYR----ADASH 297 (318) T ss_pred --------------Ce-----eEEEecCCcc---eeeccccceeee--ccc-----CCCCCC-CCcchhhh----eehhe Confidence 11 3666664444 444444433322 222 346775 44555554 12222 Q ss_pred ccceEEEEEecCC Q lcl|NC_013692. 387 PEWIALLKTVARL 399 (399) Q Consensus 387 ~~~m~~iet~A~~ 399 (399) -.-+++.|=-|-+ T Consensus 298 ~~~~~V~~PkA~~ 310 (318) T protein:vir:10 298 KRALAVDQPKAAL 310 (318) T ss_pred eeeeeeeCcceeE Confidence 2222222211111 No 166 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=77.72 E-value=0.12 Score=25.60 Aligned_cols=293 Identities=17% Similarity=0.181 Sum_probs=128.4 Q ss_pred eehhh---hhHHHHHHhhhHHhhhhcc-cccccCc---CCCcEEEEEEccC---CcCCCccccCCCCcchhhhhhhhhcc Q lcl|NC_013692. 28 IHTRY---WYKRALIDAAKEAYFGQLA-DTFSMPK---HYGKEIVRLHYIP---LLDDRNVNDQGIDASGATIANGNLYG 97 (399) Q Consensus 28 ~~t~y---~~~k~L~~A~p~lv~~~~a-~~~~mPK---n~GktIkfrry~p---l~~~~t~lteGV~p~g~~~~ngn~~~ 97 (399) |.--| |...+.+...+.++...+. ..-...+ +.|++||+-+-.. |.+ -+. .-|..+.| T Consensus 1 Mainya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~D-Y~R-~~g~~~~g---------- 68 (346) T protein:vir:10 1 MTINYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKD-RQR-RTITTPVA---------- 68 (346) T ss_pred CcchhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeeccccc-ccc-cCCccccc---------- Confidence 33222 2333333344443332211 1111121 5689999976531 222 211 11222211 Q ss_pred ccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHH---HHHH Q lcl|NC_013692. 98 SSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEI---TEDL 174 (399) Q Consensus 98 ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~---t~d~ 174 (399) +| ..+++..+-+-..| |...=|.. |-+|=+.+. .+..-+.+.+.+. -.|. T Consensus 69 ------~v-----------------~~~~et~tl~qDR~--~~F~vD~m-DvDETn~~~-~~anv~~ef~r~~vvPEiDa 121 (346) T protein:vir:10 69 ------NY-----------------SNDWDSYELKNERY--WSTLVDPS-DIDETNMVV-SLANITKQFNLDSKMPEKDR 121 (346) T ss_pred ------cc-----------------ccceeEEEeecccc--ceeccccc-chHHHHHHh-HHHHHHHHHHHHhhcchhhH Confidence 11 12334444332222 21111211 111111110 1222222222222 2344 Q ss_pred HHHHHHhcC-ceeeeccccccccccCCcceecHHHHHHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhH Q lcl|NC_013692. 175 LQIDLLNSA-GTVRYPGAATSDAEVDATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVP 253 (399) Q Consensus 175 l~~~~l~ag-t~V~YAg~aTsra~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~ 253 (399) .+..-|.++ +.+ +++.....+++.... ++.|+.+...|++++.|. ..+|+||.|+.-. T Consensus 122 yrfskLa~~a~~~--~~~~~~~~a~T~~ni--~~~i~~~~~~lde~~vp~-----------------~~rvl~vTp~~~~ 180 (346) T protein:vir:10 122 YMFSHLYSGKEAA--HDGGITTNTLDEKNI--LPAFDNMMLDFDEARIPS-----------------TNRILYVTPKTNA 180 (346) T ss_pred HHHHHHHHhhhhh--ccccccccccCHHHH--HHHHHHHHHHHHHccCCC-----------------CCeEEEECHHHHH Confidence 444444221 110 001111112222222 355888888899988863 3499999999998 Q ss_pred HHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCccccccc-----CCcccCCcccccccccCccceeEEE Q lcl|NC_013692. 254 TIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAG-----VGKAVDPNDQVPMHESGGKYSVFPM 328 (399) Q Consensus 254 di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~-----aGa~~~~~~~~~~~~~~~~~DVYp~ 328 (399) -|+. +++|.....-++... .++-||++++|-+|++|.- .+.. .|....+++- .=+-.-|.|. T Consensus 181 lLk~------s~~f~k~~~v~~~~~-i~~~V~siDGv~Ii~VPs~-r~~t~~~f~~G~~~~t~ak-----~INfiiv~~~ 247 (346) T protein:vir:10 181 ILKR------AEAMNRALTLKDPNN-IQRTVYSLDDVTIRVVPSD-LMQTAYDFSDGSKIIDTAK-----QIEMFLIYNG 247 (346) T ss_pred HHhh------chhheeccccccccc-cceeeeeecCeEEEEcchh-hcccchhhccCccccCCcc-----ceeEEEECCc Confidence 8874 688887666666665 5999999999999998852 2210 1222111100 0000111111 Q ss_pred EEEccccceeccc-----------------------ccCCCCCcceEEEec-C---CCcCCCCCCccchhhHHHHHHH-- Q lcl|NC_013692. 329 LCVASEAFTTVGF-----------------------ATDGKNVKFKIITKR-P---GEATADRSDPYGEMGFMSIKWY-- 379 (399) Q Consensus 329 lV~G~~Afg~v~l-----------------------~~~g~~~k~~~ivk~-p---G~~tad~~DPlgQrg~~gwK~~-- 379 (399) .++.-.-+..+.+ .-..+. .+.+.++. | +..+....+|=.+--.--+|+| T Consensus 248 A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~-~Iyv~~~~a~~~~~~~~~~~~kpt~~~~~~~~~~~~~ 326 (346) T protein:vir:10 248 VQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTK-GIQFVVSDKPKKDQEQSGQDAKPTAESTLEEIKAYLD 326 (346) T ss_pred eeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccc-eEEEeeecccccCccCcccccCcccccchHHHHHHhc Confidence 1111111111111 101111 11112221 1 2222233566666677789998 Q ss_pred -----HHHhhccccceEEEE Q lcl|NC_013692. 380 -----YGFMVFRPEWIALLK 394 (399) Q Consensus 380 -----~~~~iLn~~~m~~ie 394 (399) |+.+-+.++.+++++ T Consensus 327 ~~~~~~~~~~~~~~~~~~~~ 346 (346) T protein:vir:10 327 KNHIDYTGKTKKDELLALVK 346 (346) T ss_pred ccccccccccchhhHHhhcC Confidence 456778888888888 No 167 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=70.13 E-value=0.21 Score=24.26 Aligned_cols=280 Identities=15% Similarity=0.203 Sum_probs=130.2 Q ss_pred CCCccc--cccccccCCCCCCccccc-ccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVD--NIKPMKYNDPANGVESSI-GPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~--~~~~~~~n~~~~~~~~~i-~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) |+|-+- .++..-|-.+++ +.| -|+. -+--+++-|+|.....++-|...| |.|+-+.|-++.-+ -+-+ T Consensus 59 m~G~~p~~eV~~~e~mtt~~---a~IliP~v----is~v~~Eaaepl~~~~kl~qk~~L--~~Grsm~F~~~g~~-Ra~~ 128 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPS---AQILIPRV----IVGTMREAAEPLYIGTKMLQKIRL--KSGQSMIFPSIGIM-RAYD 128 (393) T ss_pred hcCCCchhheehhhhhcCCC---cceechhh----hhhhhhhcccchhHHHHHHHHHhh--hcCcceeccchhee-eecc Confidence 887543 333322322221 222 2331 123477778888777777665444 77888887555422 1111 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) . +.|+.+-+-||=- +|..-|+.+..+||--++|||+.. +|+.| . T Consensus 129 I------gEGgE~~~~sld~--------------------------~T~dsv~~~~gK~G~~Ia~SqEmI---sDSg~-D 172 (393) T protein:vir:79 129 V------AEGQEIPEDSIDW--------------------------QTHESPEIRVGKSGIRLRFTDEMI---SDSQW-D 172 (393) T ss_pred c------cccccccccchhh--------------------------hcCCceeEEechhhhhhhhHHHHh---hcchH-H Confidence 1 2333433222211 234468899999999999999876 46666 3 Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHHhcCceeeeccccc-cccccCC-------cceecHHHHHHHHHHHHhccCccccc Q lcl|NC_013692. 158 HVTTEMVKG---ANEITEDLLQIDLLNSAGTVRYPGAAT-SDAEVDA-------TTEVTYDSLMRLRLDLDNARAPTKIK 226 (399) Q Consensus 158 ~i~~el~~~---~~~~t~d~l~~~~l~agt~V~YAg~aT-sra~v~~-------~~~vt~~~lr~a~~~Lk~nrApk~T~ 226 (399) -|.. +++. +..+..|....+.+++-..+.|-+=-| ..+..++ ..++|.++|-+...+...++=- T Consensus 173 vin~-~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt---- 247 (393) T protein:vir:79 173 LMSM-MIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYT---- 247 (393) T ss_pred HHHH-HHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCC---- Confidence 3333 3332 233456677778887766666663211 1222222 2568888888777655555431 Q ss_pred eeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCc--eehh---hcCCccccccccc--------eeE-cCeEE Q lcl|NC_013692. 227 MITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAF--IPIE---KYAAGGATMHGEV--------GQL-GRFRV 292 (399) Q Consensus 227 ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~f--i~v~---kYg~~~~i~~gEI--------G~i-~~~Rf 292 (399) .=+.++||=|-.-+-. ++.- ..+. +|+.+ .++-|- |++ -|+-+ T Consensus 248 ---------------~svi~MHPLAWnv~AK------na~me~~~~na~gN~~~~--~~~ts~algp~~i~~~~~~nlnv 304 (393) T protein:vir:79 248 ---------------PSDLMMHPLAWTVFAK------NELMGSLQANPYGNYPAK--GAPSSMALGPDSIQGRLPFNFNV 304 (393) T ss_pred ---------------cceEEEcCchhhhhhh------hhhhcceeeccccccCcc--ccchhhhhchhhhccccccceeE Confidence 1367888865443321 1111 1111 22222 222221 111 15777 Q ss_pred EecCcccccccCCcccCCcccccccccCccceeEEE-------EEEcc--------------------ccceecccccCC Q lcl|NC_013692. 293 IVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSVFPM-------LCVAS--------------------EAFTTVGFATDG 345 (399) Q Consensus 293 V~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~-------lV~G~--------------------~Afg~v~l~~~g 345 (399) |.+|. +++.+.+.++|+|.+ |.|-. |-||.==|. +| T Consensus 305 ~~sPf----------------vp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn-~g 367 (393) T protein:vir:79 305 NLSPF----------------IPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILN-EG 367 (393) T ss_pred EEecc----------------cccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeee-CC Confidence 77773 344555677788764 22211 222210111 11 Q ss_pred CCCcceEEEecCCCcCCCCCCccchhhHHH Q lcl|NC_013692. 346 KNVKFKIITKRPGEATADRSDPYGEMGFMS 375 (399) Q Consensus 346 ~~~k~~~ivk~pG~~tad~~DPlgQrg~~g 375 (399) +.. -..|+--. +.+-.||+--.-.-. T Consensus 368 kai---avakNI~~-~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 368 KAI---AVAKNISM-DKSYAEPMLIKNVGN 393 (393) T ss_pred ceE---EEEeccee-ecccccchhhhccCC Confidence 110 00111000 112233432111000 No 168 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=67.17 E-value=0.26 Score=23.83 Aligned_cols=288 Identities=12% Similarity=0.006 Sum_probs=115.5 Q ss_pred cccccccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCC----Ccccc Q lcl|NC_013692. 5 VDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDD----RNVND 80 (399) Q Consensus 5 ~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~----~t~lt 80 (399) |.=....-||.-..+.+..--+|.-..| . ..-.-| ++ ..+-|- .|+.++.=.|.+|..+ .+... T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q~~~~f-n-~as~ga---i~------l~~~~~-~Gd~~~~pf~~~l~g~~~~~~~~~~ 68 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQQVDLF-N-TATGGA---IM------LQSAAH-QGDFSDVAFFAKVTGGLVRRRNAYG 68 (325) T ss_pred CchhhhhhhhhhhhhhhhhhhhhhHhhh-h-hcccce---eE------eccccc-cCceeeccccccccccccccccCCC Confidence 3333333343333222222222211111 0 000000 00 001110 2666555557666432 12222 Q ss_pred CC-CCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHH Q lcl|NC_013692. 81 QG-IDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHV 159 (399) Q Consensus 81 eG-V~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i 159 (399) ++ |+| .+| | +.+++.+.+..=-.|.+.+-....-.+||. +++ T Consensus 69 ~~~vt~--~ki----------------------t-----------t~~~~av~~~r~~g~~~~d~~~~~~g~~~~--~~~ 111 (325) T protein:vir:95 69 SGTVAE--KVL----------------------K-----------HLVDTSVKVAAGTPPVRLDPGQFRWIQQNP--EVA 111 (325) T ss_pred Cceecc--cee----------------------c-----------cccceeeEEecccCcccccHHHHhhcCCCH--HHH Confidence 22 222 111 1 235666666554455555544444455653 355 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeccccc---ccc-ccC-CcceecHHHHHHHHHHHHhccCccccceecccccc Q lcl|NC_013692. 160 TTEMVKGANEITEDLLQIDLLNSAGTVRYPGAAT---SDA-EVD-ATTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMI 234 (399) Q Consensus 160 ~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~aT---sra-~v~-~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~ 234 (399) ...+...-+.....-++..+|.+... ..++++. .-+ +.+ .+..+|...|-+|...|-.+... T Consensus 112 ~~~Ig~~~a~~~~~~~l~~~~~~l~~-a~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~klGD~~~~------------ 178 (325) T protein:vir:95 112 GAAMGQQLAVDTMADMLNVGLGSVYS-ALSQVSDVVYDATANTDAADKLPTWNNLNNGQAKFGDQSSQ------------ 178 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-hhcccccceeeeecccCcccccccHHHHHHHHHHhcccccc------------ Confidence 55555444443333333333322211 1111111 000 111 12347888888888888665543 Q ss_pred CcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCcccCCcccc Q lcl|NC_013692. 235 DTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQV 314 (399) Q Consensus 235 gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~ 314 (399) ==+.+|||....+|++.. ...+..+-.|+... .|+.+-+-|+|++.-| |-.+.| T Consensus 179 -------l~~~~MHS~v~~~L~~~~----L~~~~~~~~~~g~~-----~i~t~~G~~VIVdD~~-p~~~~g--------- 232 (325) T protein:vir:95 179 -------IAAWIMHSTPMHKLYGSN----LTNGERLFTYGTVN-----VVRDPFGKLLVMTDSP-NLFAAG--------- 232 (325) T ss_pred -------eeEEEEchHHHHHHHHhh----ccccccccccCCcc-----cccccCCcEEEEeCCC-CCCCcc--------- Confidence 246889999999999742 23333333333332 3566778899998742 111111 Q ss_pred cccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccc--hh-------hHHHHHHHHHHhhc Q lcl|NC_013692. 315 PMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYG--EM-------GFMSIKWYYGFMVF 385 (399) Q Consensus 315 ~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlg--Qr-------g~~gwK~~~~~~iL 385 (399) ...+|-+++||+.|++.-. +....+ .+.+- ++.+=++ +| +-.|+||--+..-. T Consensus 233 -------~~~~ytty~lg~GAi~~~~----~~~~~~-----~~~~~--~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~ 294 (325) T protein:vir:95 233 -------TPNVYHILGLVPGGVLIGQ----NNDFDA-----NEETK--NGDENIIRTYQAEWSYNIGVKGFAWDKANGGK 294 (325) T ss_pred -------CceeEEEEEEecCeEEecC----CCCccc-----ccccc--CcccceeeeeeeeeeEEeecceeeeecccccC Confidence 1138999999999976221 111111 11110 0010011 00 11222221111111 Q ss_pred cccceEEEEEecCC Q lcl|NC_013692. 386 RPEWIALLKTVARL 399 (399) Q Consensus 386 n~~~m~~iet~A~~ 399 (399) ++-. +-|++++-- T Consensus 295 sPt~-aeL~~~~NW 307 (325) T protein:vir:95 295 SPTD-AALFTSTNW 307 (325) T ss_pred CcCh-HhhcCCcCc Confidence 1100 111111111 No 169 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=59.61 E-value=0.39 Score=22.83 Aligned_cols=286 Identities=10% Similarity=-0.011 Sum_probs=125.5 Q ss_pred CCCccccc-cccccCCCCCCccc---ccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNI-KPMKYNDPANGVES---SIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~-~~~~~n~~~~~~~~---~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~ 76 (399) +.+..-+- --..||.-..++++ -+-|+ .+..+.++.....-.+.+++.+.+++ |. .++-+... T Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~i~~~~~----- 126 (381) T protein:vir:10 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLPE----ETIDRIFEDLTTNHPLLADLGIKNAG---LR-LKFLKSET----- 126 (381) T ss_pred cCcccccHHHHHHHHHHhcccCCCCceecCH----HHHHHHHHHHHhhccceeheeeEecC---cc-eEEEEecC----- Confidence 11111111 11112211111112 22333 23466666666666777788888775 32 22211111 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) . +...|... .+ .+..-...++..|+.+.++++.+..+|+++.+- +...|. T Consensus 127 ~-------~~a~w~~e-------------~~---------~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D-s~~~ie 176 (381) T protein:vir:10 127 S-------GVAVWGKI-------------YG---------EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF-GPAWIE 176 (381) T ss_pred C-------cceeeecc-------------cc---------cccccccccceeeeecceeEEeechhhHHHhhc-CHHHHH Confidence 1 11122111 11 111112246788999999999999999998753 333455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCcee-------------eecccc----ccccccCCc-ceecHHHHHHHHHHHHh Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGTV-------------RYPGAA----TSDAEVDAT-TEVTYDSLMRLRLDLDN 218 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~V-------------~YAg~a----Tsra~v~~~-~~vt~~~lr~a~~~Lk~ 218 (399) ..|..+|.+.-+. .+-..+|++-++- ..+++. ++..+++.. .....+.|..+.+.|.. T Consensus 177 ~~i~~~la~~~a~----~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~ 252 (381) T protein:vir:10 177 RFVRVQIEEAFAV----ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHST 252 (381) T ss_pred HHHHHHHHHHHHH----HhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhcc Confidence 5555555443333 2233344332221 111111 111122211 12334556666666654 Q ss_pred ccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcc Q lcl|NC_013692. 219 ARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQM 298 (399) Q Consensus 219 nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~ 298 (399) +-..+ .+......+.+||+....+|+.+.++.. ++...+ . ..--+.++|+++.| T Consensus 253 ~~~~~------------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v-~---~l~~g~~vv~s~~~ 306 (381) T protein:vir:10 253 NEKGK------------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV-T---ALPFNLNVIESTVQ 306 (381) T ss_pred ccccc------------cccccCceEEEEccccHHhhccccccCC----------CCCcee-e---cCCCCceEEecCCC Confidence 43211 0122334677999999889987643221 111100 0 00024456666543 Q ss_pred cccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH- Q lcl|NC_013692. 299 MHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK- 377 (399) Q Consensus 299 ~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK- 377 (399) . + + + ++||.-++-.+..+. .+.++ .- ...+-..+..+++ T Consensus 307 p----~----------------~--~----iifgDfs~Y~i~~r~---~~~i~----~~-------~~~~~~~d~~~f~a 346 (381) T protein:vir:10 307 E----A----------------G--K----VLTYVKGLYDGYLAG---GINVQ----KF-------KETLALDDMDLYTA 346 (381) T ss_pred C----c----------------C--c----EEEEecccEEEEEec---ccEEE----ee-------chhHhhcCCeEEEE Confidence 1 0 0 1 577776665555542 22221 10 1112222222333 Q ss_pred -HHHHHhhccccceEEEEEec----CC Q lcl|NC_013692. 378 -WYYGFMVFRPEWIALLKTVA----RL 399 (399) Q Consensus 378 -~~~~~~iLn~~~m~~iet~A----~~ 399 (399) ..+.+++++++=++.++... |+ T Consensus 347 ~~r~dg~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 347 KQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred EEEEcCEEecCceEEEEEEEecCCCcC Confidence 45566677777776655322 22 No 170 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=59.61 E-value=0.39 Score=22.83 Aligned_cols=286 Identities=10% Similarity=-0.011 Sum_probs=125.5 Q ss_pred CCCccccc-cccccCCCCCCccc---ccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCC Q lcl|NC_013692. 1 MAGPVDNI-KPMKYNDPANGVES---SIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDR 76 (399) Q Consensus 1 ~~~~~~~~-~~~~~n~~~~~~~~---~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~ 76 (399) +.+..-+- --..||.-..++++ -+-|+ .+..+.++.....-.+.+++.+.+++ |. .++-+... T Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~----~~~~~I~~~l~~~s~i~~~~~v~~~~---~~-~~i~~~~~----- 126 (381) T protein:vir:95 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLPE----ETIDRIFEDLTTNHPLLADLGIKNAG---LR-LKFLKSET----- 126 (381) T ss_pred cCcccccHHHHHHHHHHhcccCCCCceecCH----HHHHHHHHHHHhhccceeheeeEecC---cc-eEEEEecC----- Confidence 11111111 11112211111112 22333 23466666666666777788888775 32 22211111 Q ss_pred ccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHH Q lcl|NC_013692. 77 NVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAME 156 (399) Q Consensus 77 t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~ 156 (399) . +...|... .+ .+..-...++..|+.+.++++.+..+|+++.+- +...|. T Consensus 127 ~-------~~a~w~~e-------------~~---------~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D-s~~~ie 176 (381) T protein:vir:95 127 S-------GVAVWGKI-------------YG---------EIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF-GPAWIE 176 (381) T ss_pred C-------cceeeecc-------------cc---------cccccccccceeeeecceeEEeechhhHHHhhc-CHHHHH Confidence 1 11122111 11 111112246788999999999999999998753 333455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCcee-------------eecccc----ccccccCCc-ceecHHHHHHHHHHHHh Q lcl|NC_013692. 157 GHVTTEMVKGANEITEDLLQIDLLNSAGTV-------------RYPGAA----TSDAEVDAT-TEVTYDSLMRLRLDLDN 218 (399) Q Consensus 157 ~~i~~el~~~~~~~t~d~l~~~~l~agt~V-------------~YAg~a----Tsra~v~~~-~~vt~~~lr~a~~~Lk~ 218 (399) ..|..+|.+.-+. .+-..+|++-++- ..+++. ++..+++.. .....+.|..+.+.|.. T Consensus 177 ~~i~~~la~~~a~----~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~ 252 (381) T protein:vir:95 177 RFVRVQIEEAFAV----ALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHST 252 (381) T ss_pred HHHHHHHHHHHHH----HhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhcc Confidence 5555555443333 2233344332221 111111 111122211 12334556666666654 Q ss_pred ccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcc Q lcl|NC_013692. 219 ARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQM 298 (399) Q Consensus 219 nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~ 298 (399) +-..+ .+......+.+||+....+|+.+.++.. ++...+ . ..--+.++|+++.| T Consensus 253 ~~~~~------------~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v-~---~l~~g~~vv~s~~~ 306 (381) T protein:vir:95 253 NEKGK------------SVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYV-T---ALPFNLNVIESTVQ 306 (381) T ss_pred ccccc------------cccccCceEEEEccccHHhhccccccCC----------CCCcee-e---cCCCCceEEecCCC Confidence 43211 0122334677999999889987643221 111100 0 00024456666543 Q ss_pred cccccCCcccCCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCccchhhHHHHH- Q lcl|NC_013692. 299 MHWAGVGKAVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPYGEMGFMSIK- 377 (399) Q Consensus 299 ~~~~~aGa~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPlgQrg~~gwK- 377 (399) . + + + ++||.-++-.+..+. .+.++ .- ...+-..+..+++ T Consensus 307 p----~----------------~--~----iifgDfs~Y~i~~r~---~~~i~----~~-------~~~~~~~d~~~f~a 346 (381) T protein:vir:95 307 E----A----------------G--K----VLTYVKGLYDGYLAG---GINVQ----KF-------KETLALDDMDLYTA 346 (381) T ss_pred C----c----------------C--c----EEEEecccEEEEEec---ccEEE----ee-------chhHhhcCCeEEEE Confidence 1 0 0 1 577776665555542 22221 10 1112222222333 Q ss_pred -HHHHHhhccccceEEEEEec----CC Q lcl|NC_013692. 378 -WYYGFMVFRPEWIALLKTVA----RL 399 (399) Q Consensus 378 -~~~~~~iLn~~~m~~iet~A----~~ 399 (399) ..+.+++++++=++.++... |+ T Consensus 347 ~~r~dg~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:95 347 KQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred EEEEcCEEecCceEEEEEEEecCCCcC Confidence 45566677777776655322 22 No 171 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=43.88 E-value=0.83 Score=21.03 Aligned_cols=292 Identities=12% Similarity=0.030 Sum_probs=127.3 Q ss_pred CCCccccccccccCCCCCCcc---cccccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAGPVDNIKPMKYNDPANGVE---SSIGPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~~~~~~~~~~~n~~~~~~~---~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) +.-++.+..-..||.=..++. +-+-|+ .+..+.++...+.-.+.+++.+.++. |+ +++-+...- T Consensus 71 ~~~~l~~ee~~~~~~~~~~t~~~gG~liP~----~~~~~Ii~~l~~~s~i~~~~~v~~~~---~~-~~i~~~~~~----- 137 (395) T protein:vir:95 71 SQDPLTSEERKFFNDINYDVGYTDEKILPE----TVVERVFDDLQKDHPLLSKINFQNAG---IK-TRVIKADPA----- 137 (395) T ss_pred CccccchHHHHHHHHHhhccCCCCceeccH----HHHHHHHHHHHhhhhhhhhceeEecC---Cc-eEEEEecCC----- Confidence 111222222222222111112 223343 22466666677777888888888774 33 222111111 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) +...+.. ..+. +..-...++..|+.+.++++.+..+|+++.+ ++...|.. T Consensus 138 -------~~a~w~~-------------e~~~---------~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~-ds~~~ie~ 187 (395) T protein:vir:95 138 -------GQAVWGK-------------VFGE---------IKGQLDAAFREENFTQYKLTCFVVLPDDLST-FGPAWIER 187 (395) T ss_pred -------cceEEee-------------cccc---------cCccccccceeeeeceeeEEEeecccHHHHh-cchhHHHH Confidence 1111110 0011 1111234678899999999999999999875 34444555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCce--------eeeccccccc-cccCCcceecHHHHHHHHHHHHhccCcccccee Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGT--------VRYPGAATSD-AEVDATTEVTYDSLMRLRLDLDNARAPTKIKMI 228 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~--------V~YAg~aTsr-a~v~~~~~vt~~~lr~a~~~Lk~nrApk~T~ii 228 (399) .|..+|.+.-+. .+-..+|++-++ +.+.+..+.. ........++.+++..+...|......- .... T Consensus 188 ~i~~~la~~ia~----~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~-~~~~ 262 (395) T protein:vir:95 188 FVRTQIQEAISV----ALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNL-SVDE 262 (395) T ss_pred HHHHHHHHHHHH----HHhhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhh-cccc Confidence 555554444333 333455543222 2222111110 0001123355555555544443322210 0000 Q ss_pred ccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcccccccCCccc Q lcl|NC_013692. 229 TGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQMMHWAGVGKAV 308 (399) Q Consensus 229 ~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~~~~~~aGa~~ 308 (399) .+.. -..... -..++|+....|++ +.+-|.|. -|.+.+++. -++++|++..|- + T Consensus 263 ~~~~---~~~~~~-~~~~mn~~t~~~~~------g~~~~~~~--~G~~~~~lg------~g~~v~~~~~~p----~---- 316 (395) T protein:vir:95 263 KGKE---LKIDGK-VALVVNPRDSWDVQ------ARYTYLTA--NGGFVTVLP------YNVTIITSEFVP----E---- 316 (395) T ss_pred ccch---hhhcCc-eEEEEcchhhhhcC------CcceeccC--CCcceeccC------CcceEEEcCCCC----C---- Confidence 0000 001111 23467876555554 34556552 222222211 255666655431 0 Q ss_pred CCcccccccccCccceeEEEEEEccccceecccccCCCCCcceEEEecCCCcCCCCCCcc---chhhHHHHHHHHHHhhc Q lcl|NC_013692. 309 DPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADRSDPY---GEMGFMSIKWYYGFMVF 385 (399) Q Consensus 309 ~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~~DPl---gQrg~~gwK~~~~~~iL 385 (399) + + ++||.=++-.++.+ +.+.++ +. .+.+ +|.+|.++ +++.+++. T Consensus 317 ------------~--~----i~fgdfs~y~i~~r---~~~~i~--~~---------~~~~~~~d~~~f~~~-~r~dg~~~ 363 (395) T protein:vir:95 317 ------------G--K----LVAFVTDRYNAVRG---GGLTVK--KF---------DQTLALEDAVLFTAK-TFAYGQPD 363 (395) T ss_pred ------------C--c----EEEEecccEEEEEe---cceEEE--ec---------cchhhhCCcEEEEEE-EEECCEEe Confidence 0 1 46776554445443 221221 11 1223 34444433 35678888 Q ss_pred cccceEEEEEe---cCC Q lcl|NC_013692. 386 RPEWIALLKTV---ARL 399 (399) Q Consensus 386 n~~~m~~iet~---A~~ 399 (399) +++=+..+++. ++. T Consensus 364 ~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 364 DNKASAVYDLKVASAPR 380 (395) T ss_pred ccccEEEEEeeccCCCC Confidence 88877777664 222 No 172 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=42.25 E-value=0.89 Score=20.85 Aligned_cols=283 Identities=14% Similarity=0.115 Sum_probs=102.8 Q ss_pred CCC-------cc---------------------cccc-ccccCCCCCCcccccccceehhhhhHHHHHHhhhHHhhhhcc Q lcl|NC_013692. 1 MAG-------PV---------------------DNIK-PMKYNDPANGVESSIGPQIHTRYWYKRALIDAAKEAYFGQLA 51 (399) Q Consensus 1 ~~~-------~~---------------------~~~~-~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~A~p~lv~~~~a 51 (399) ..+ .. .... ...-..+.++ .+-+-|+ .+ ....+........+.+.+ T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~~~vP~---~~-~~~i~~~l~~~~~l~~~~ 183 (466) T protein:vir:80 109 FVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSG-AELTIPD---VM-LELLRDNMHRYSKLISKV 183 (466) T ss_pred HhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhcc-ccccccH---HH-HHHHHHhhhhhhhhhhhe Confidence 000 00 0000 0000000000 1112333 22 222333333333444555 Q ss_pred cccccCcCCCcEEEEEEccCCcCCCccccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEE Q lcl|NC_013692. 52 DTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKG 131 (399) Q Consensus 52 ~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~ 131 (399) .+.++. |. +++..... .|.+.| +++|+.......++..|+. T Consensus 184 ~v~~~~---g~-~~~~~~~~------------~~~a~w-----------------------v~E~~~~~~~~~~f~~i~~ 224 (466) T protein:vir:80 184 RLRPLK---GT-ARQNIAGA------------IPEGVW-----------------------TEAVANLNELSLSFSQIEV 224 (466) T ss_pred eeeecC---ce-eEeeeecC------------Ccceee-----------------------cccccccccccccccceee Confidence 555443 21 12111110 111111 3344445555668888999 Q ss_pred EeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCce------eeeccccccccccC-----C Q lcl|NC_013692. 132 KLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGANEITEDLLQIDLLNSAGT------VRYPGAATSDAEVD-----A 200 (399) Q Consensus 132 ~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~~~~t~d~l~~~~l~agt~------V~YAg~aTsra~v~-----~ 200 (399) ++++|+.|+.+|+++.+ +++..|.+.|..+|...-+. .+ -..+|++-++ +-+.+..+. +..+ . T Consensus 225 ~~~k~~~~~~iS~ell~-ds~~~l~~~i~~~la~~~~~-~~---~~ail~G~G~~~P~Gil~~~~~~~~-~~~~~~~~~~ 298 (466) T protein:vir:80 225 DGYKVGGFIPIPNSTLE-DSDLNLADEILDAIGQAIGF-AL---DKAILYGTGTKMPVGIVTRLAQTTQ-PPNWGTKAPA 298 (466) T ss_pred cceeeeeehhhhHHHHh-cchHHHHHHHHHHHHHHHHH-HH---hhheeeccCCCCcceeeeccccccc-cccccccccc Confidence 99999999999999886 55556766665555544443 22 2334432211 011100000 0000 0 Q ss_pred cceecHHHH--------------HHHHHHHHhccCccccceeccccccCcccccCeeEEEechhhhHHHHHHhhhcCCCC Q lcl|NC_013692. 201 TTEVTYDSL--------------MRLRLDLDNARAPTKIKMITGTRMIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPA 266 (399) Q Consensus 201 ~~~vt~~~l--------------r~a~~~Lk~nrApk~T~ii~~s~~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~ 266 (399) -..++...+ .+++..+....++ ...+.++..++++..+.++.+.... ++. T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~w~~~~~~~~~l~~~~~~~-~~~ 362 (466) T protein:vir:80 299 WTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARAN---------------YSNGMKFWAMSSNTHAVLMSKAITF-NSA 362 (466) T ss_pred ccccchhhhhhhhhhccchhhHHHHHHHHHHhhhcc---------------ccCCceeEEecchhHHHhhcccccc-cCC Confidence 011222211 1111111111121 2233456678888888887664211 111 Q ss_pred ceehhhcCCcccccc----------------c-----cceeEcCeEEEecCcccccccCCcccCCcccccccccCcccee Q lcl|NC_013692. 267 FIPIEKYAAGGATMH----------------G-----EVGQLGRFRVIVNPQMMHWAGVGKAVDPNDQVPMHESGGKYSV 325 (399) Q Consensus 267 fi~v~kYg~~~~i~~----------------g-----EIG~i~~~RfV~~~~~~~~~~aGa~~~~~~~~~~~~~~~~~DV 325 (399) ...+..=++..+++- | =|+.-.++++..+++.. |.. +....-.-..+|. T Consensus 363 g~~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~-f~~---------d~~~~r~~~r~dg 432 (466) T protein:vir:80 363 GALVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVR-FIE---------DQTVFKGTARYDG 432 (466) T ss_pred ccccccCCCcccccccceeecCccCccceeeeccccEEEEeecceEEEechhhh-hhc---------CcEEEEEEEEEcc Confidence 111111111111110 0 12223344444443311 100 0000111112222 Q ss_pred EEEEEEccccceecccccCCCCCcceEEEecCCCcCCCC Q lcl|NC_013692. 326 FPMLCVASEAFTTVGFATDGKNVKFKIITKRPGEATADR 364 (399) Q Consensus 326 Yp~lV~G~~Afg~v~l~~~g~~~k~~~ivk~pG~~tad~ 364 (399) =| +=.+||-.+.+.+-+.. -....+..+|. +++- T Consensus 433 ~~---~~~~afv~~~~~~~~~~-~~~~~~~~~~~-~~~~ 466 (466) T protein:vir:80 433 KP---VFGEGFVAVNIANANPT-TSITFAPDEAN-VPEV 466 (466) T ss_pred EE---eccCceEEEEecCCCcc-cceeeecCcCc-CCCC Confidence 11 22466665555432211 11223333443 4555 No 173 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=31.41 E-value=1.5 Score=19.62 Aligned_cols=291 Identities=20% Similarity=0.208 Sum_probs=133.3 Q ss_pred CCC--ccccccccccCCCCCCcccccccceehhhhhHHHHHH-hhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCc Q lcl|NC_013692. 1 MAG--PVDNIKPMKYNDPANGVESSIGPQIHTRYWYKRALID-AAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRN 77 (399) Q Consensus 1 ~~~--~~~~~~~~~~n~~~~~~~~~i~p~~~t~y~~~k~L~~-A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t 77 (399) ..| ++.-+ ...|- .+||+... +-.--..|.+|.. .+-...|.+|.-...++.=+- .++.|...+++=. T Consensus 383 ~~~~~~~~~~-~~a~~----htTSDFp~-IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~--~~~~~lg~~~~L~- 453 (693) T protein:vir:95 383 VASLNAPQMV-GLAFT----HTSSDFGL-ILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKP--ARRVGLGEFSSLR- 453 (693) T ss_pred cCCCCHHHHH-HHHHh----cCcchhHH-HHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccc--cceeecCCCCChh- Confidence 111 11111 11221 12233322 1111223444441 122345667766666665442 2333344433222 Q ss_pred cccCCCCcchhhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHH Q lcl|NC_013692. 78 VNDQGIDASGATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEG 157 (399) Q Consensus 78 ~lteGV~p~g~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~ 157 (399) .+.|| ..+ |.-++.|. --+..|..||--+.+|.+..-- -|=.++. T Consensus 454 ~V~E~-----gEy-----------------k~~t~~e~------------~e~~~l~tyG~~~~iTRqaiIN-DDLga~~ 498 (693) T protein:vir:95 454 QVREG-----AEY-----------------KYVTLGER------------GEQIILATYGELFSITRQAIIN-DDLQMLS 498 (693) T ss_pred hcCCC-----Cce-----------------eeeecCCc------------cceeehhhcCCeeeecHHhhhc-cchHHHH Confidence 22232 222 11223332 2356799999999999765432 3334456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeecccc---cccccc-C-CcceecHHHHHHHHHHHHhccCccccceecccc Q lcl|NC_013692. 158 HVTTEMVKGANEITEDLLQIDLLNSAGTVRYPGAA---TSDAEV-D-ATTEVTYDSLMRLRLDLDNARAPTKIKMITGTR 232 (399) Q Consensus 158 ~i~~el~~~~~~~t~d~l~~~~l~agt~V~YAg~a---Tsra~v-~-~~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~ 232 (399) +|++.+++.+. .+++-+.+.+|.+.. +.+-|.+ +...++ + .+..+|.+.|..+...+..++... ..-.| + T Consensus 499 ~ip~~~g~aA~-~~~~~~vy~~L~~Np-~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~--~~~~g-~ 573 (693) T protein:vir:95 499 DIPFKLGQAAK-ATIGDLVYAVLTGNP-AMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQV--EKGKG-R 573 (693) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHhcCc-cccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcch--hccCC-c Confidence 67777655554 477788999996432 2333221 133443 2 346789999998888887777641 10011 1 Q ss_pred ccCcccccCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCe-EEEecCccc-----ccccCCc Q lcl|NC_013692. 233 MIDTRTVGNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRF-RVIVNPQMM-----HWAGVGK 306 (399) Q Consensus 233 ~~gT~~I~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~-RfV~~~~~~-----~~~~aGa 306 (399) .--|.+.|+ +|+|+++-+.+.+. ....+|..+ +-.|.+--+.++ .+|+.|.+. .|-=+.. T Consensus 574 ---~L~i~P~~l-lvP~~le~~a~~l~----~s~~~~~a~------~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~ 639 (693) T protein:vir:95 574 ---TLNIRPGFV-LTPVALEDKANQII----NSESVPGAD------VNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAK 639 (693) T ss_pred ---eeecccceE-EecchHHHHHHHHh----ccccccccc------cccccccchhccccccccceecCCCCCceEEecC Confidence 123566665 56999999999874 233444221 122223333332 456667662 4532111 Q ss_pred ccCCcccccccccCccceeEEEEEEccccceeccccc------CCCCCcceEEEecCCC Q lcl|NC_013692. 307 AVDPNDQVPMHESGGKYSVFPMLCVASEAFTTVGFAT------DGKNVKFKIITKRPGE 359 (399) Q Consensus 307 ~~~~~~~~~~~~~~~~~DVYp~lV~G~~Afg~v~l~~------~g~~~k~~~ivk~pG~ 359 (399) ....+--+.+-.|... |.| -=++.|.+-+++- +-+-+-+.=++|+||- T Consensus 640 ~~~dtie~~yL~G~~~----P~i-e~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 640 KGSDTIEVAYLDGVDT----PYL-EQQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred CCCCeEEEEEecCCCC----CeE-eecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 1000101111111111 111 1123455443331 2223345557889983 No 174 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=31.12 E-value=1.5 Score=19.58 Aligned_cols=214 Identities=18% Similarity=0.231 Sum_probs=110.6 Q ss_pred cccCCCCCCccccc---ccceehhhhhHHHHHHhhhHHhhhhcccccccCcCCCcEEEEEEccCCcCCCccccCCCCcch Q lcl|NC_013692. 11 MKYNDPANGVESSI---GPQIHTRYWYKRALIDAAKEAYFGQLADTFSMPKHYGKEIVRLHYIPLLDDRNVNDQGIDASG 87 (399) Q Consensus 11 ~~~n~~~~~~~~~i---~p~~~t~y~~~k~L~~A~p~lv~~~~a~~~~mPKn~GktIkfrry~pl~~~~t~lteGV~p~g 87 (399) |.-| +..+ .-..++.| .+.|..|.| + +.++|... | .+..+++=| T Consensus 1 M~i~------~~~l~~l~~~~~~~f--~~~~~~a~~-~-~~~iA~~v--p----------------St~~~~tY~----- 47 (305) T protein:vir:19 1 MIVT------PASIKALMTSWRKDF--QGGLEDAPS-Q-YNKIAMVV--N----------------SSTRSNTYG----- 47 (305) T ss_pred CccC------HHHHHHHHHHHHHHH--HHHHhhcCc-c-cceEEeEe--c----------------CCCCccccc----- Confidence 1111 1111 11112222 222222211 1 14444322 1 122221111 Q ss_pred hhhhhhhhccccccccccccccccccccccceeeccceeEEEEEEeeeecceehhhhhhhhhhhchhHHHHHHHHHHHHH Q lcl|NC_013692. 88 ATIANGNLYGSSRDVGNITAKMPTLTEIGGRVNRVGFKRVEIKGKLEKYGFFREYTQEQLDFDSDPAMEGHVTTEMVKGA 167 (399) Q Consensus 88 ~~~~ngn~~~ss~d~g~it~k~~~lte~g~r~~~~~~t~tdi~~~l~QyG~~~e~Td~~~~t~~D~~L~~~i~~el~~~~ 167 (399) =.+.+|.|-|-.|..+...++-..-++.-+.|..=+.+......-| +=.+...+-.+|+..+ T Consensus 48 -----------------wLg~fP~lrewiGer~i~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD-~lG~y~p~~~~~G~~a 109 (305) T protein:vir:19 48 -----------------WLGKFPTLKEWVGKRTIQQMEAHGYSIANKTFEGTVGISRDDFEDD-NLGIYAPIFQEMGRSA 109 (305) T ss_pred -----------------ccccCCccchhhcceeeeeccccceeEeeccccceeccchhhcccc-ccCchHHHHHHHHHHH Confidence 2478899999778888899999999999999988888774433211 1122333333555444 Q ss_pred HHHHHHHHHHHHHhcCcee-eec------------------ccccccccc------------------------------ Q lcl|NC_013692. 168 NEITEDLLQIDLLNSAGTV-RYP------------------GAATSDAEV------------------------------ 198 (399) Q Consensus 168 ~~~t~d~l~~~~l~agt~V-~YA------------------g~aTsra~v------------------------------ 198 (399) ...+|.|..++|++|-+- -|- |.+++-+++ T Consensus 110 -a~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~ 188 (305) T protein:vir:19 110 -AVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRK 188 (305) T ss_pred -hhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccc Confidence 458888899999887532 222 111121211 Q ss_pred ----------------------------CC-----------cceecHHHHHHHHHHHHhccCccccceeccccccCcccc Q lcl|NC_013692. 199 ----------------------------DA-----------TTEVTYDSLMRLRLDLDNARAPTKIKMITGTRMIDTRTV 239 (399) Q Consensus 199 ----------------------------~~-----------~~~vt~~~lr~a~~~Lk~nrApk~T~ii~~s~~~gT~~I 239 (399) +. ...||.+.|..+...+..++... | +.+ -| T Consensus 189 ~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~aar~aM~~qk~d~------G-~pL---~I 258 (305) T protein:vir:19 189 PELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRSFEGDG------G-KKL---GL 258 (305) T ss_pred cceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHHHHHHHHHHHhhcCCC------C-cee---ee Confidence 00 13477777777777777766632 2 223 35 Q ss_pred cCeeEEEechhhhHHHHHHhhhcCCCCceehhhcCCccccccccceeEcCeEEEecCcc Q lcl|NC_013692. 240 GNARALYVGSDLVPTIEAMKDNHGNPAFIPIEKYAAGGATMHGEVGQLGRFRVIVNPQM 298 (399) Q Consensus 240 ~~~yv~~~h~dl~~di~d~~~~~~~p~fi~v~kYg~~~~i~~gEIG~i~~~RfV~~~~~ 298 (399) .+.| .+|+|.++-.-+.+.. ...++- |.. .+-| .+ .+.+-+|++|.+ T Consensus 259 ~P~~-LvVPp~LE~~A~qll~----s~~i~~---g~~-~~~N-p~--~g~~eliV~P~L 305 (305) T protein:vir:19 259 KPTH-IVVPVGLEKAAEQLLN----RELFAD---GNT-TVSN-EM--KGKLQLVVADYL 305 (305) T ss_pred ecCe-EEeCchhHHHHHHHHh----hcccCC---ccc-cccc-ee--cceEEEEecccC Confidence 5667 5899999999998742 222221 110 0111 11 234678888877 Done!