Query lcl|Aclame:protein:vir:94771|NCBI_annot:major head protein|genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Match_columns 298 No_of_seqs 109 out of 1022 Neff 9.3 Searched_HMMs 1612 Date Sat Nov 30 21:32:31 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_44 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_44_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:1638 Length: 298 # 100.0 1.4E-76 8.4E-80 436.5 31.6 298 1-298 1-298 (298) 2 protein:vir:94771 Length: 298 100.0 3E-76 1.9E-79 434.6 32.0 298 1-298 1-298 (298) 3 protein:vir:9574 Length: 300 # 100.0 4.7E-72 2.9E-75 411.6 30.8 296 1-298 1-299 (300) 4 protein:vir:9759 Length: 303 # 100.0 7.5E-70 4.6E-73 399.6 31.4 297 1-298 1-302 (303) 5 protein:vir:8187 Length: 311 # 100.0 7.6E-68 4.7E-71 388.6 30.9 295 1-298 1-309 (311) 6 protein:vir:99920 Length: 311 100.0 1.7E-64 1.1E-67 370.2 30.2 294 1-298 1-310 (311) 7 protein:vir:80684 Length: 315 100.0 1.3E-64 8.4E-68 370.7 29.1 291 1-298 1-305 (315) 8 protein:vir:7771 Length: 330 # 100.0 1.8E-62 1.1E-65 359.1 30.6 294 1-298 13-322 (330) 9 protein:vir:78523 Length: 338 100.0 9.3E-62 5.8E-65 355.2 29.9 292 1-298 20-334 (338) 10 protein:vir:94142 Length: 304 100.0 1.9E-61 1.2E-64 353.4 30.3 284 1-298 13-304 (304) 11 protein:vir:105905 Length: 304 100.0 1.9E-61 1.2E-64 353.4 30.3 284 1-298 13-304 (304) 12 protein:vir:78223 Length: 333 100.0 2.3E-61 1.4E-64 353.0 30.0 292 1-298 20-331 (333) 13 protein:vir:5739 Length: 366 # 100.0 5.1E-61 3.2E-64 351.1 29.3 286 1-298 69-365 (366) 14 protein:vir:104085 Length: 320 100.0 3.4E-60 2.1E-63 346.6 30.4 288 1-298 18-316 (320) 15 protein:vir:41 Length: 299 # N 100.0 3.7E-60 2.3E-63 346.4 29.9 282 1-298 10-297 (299) 16 protein:vir:100247 Length: 425 100.0 1.4E-60 8.7E-64 348.7 27.2 279 1-298 134-423 (425) 17 protein:vir:105038 Length: 428 100.0 2.8E-60 1.7E-63 347.1 28.3 286 1-298 130-427 (428) 18 protein:vir:78830 Length: 324 100.0 7.6E-60 4.7E-63 344.7 29.8 278 1-298 31-314 (324) 19 protein:vir:96392 Length: 324 100.0 7.6E-60 4.7E-63 344.7 29.8 278 1-298 31-314 (324) 20 protein:vir:97148 Length: 324 100.0 1.1E-59 6.6E-63 343.9 29.8 278 1-298 31-314 (324) 21 protein:vir:9309 Length: 324 # 100.0 1.2E-59 7.7E-63 343.5 29.7 278 1-298 31-314 (324) 22 protein:vir:485 Length: 407 # 100.0 7.4E-60 4.6E-63 344.8 28.0 279 1-298 110-399 (407) 23 protein:vir:96223 Length: 324 100.0 1.7E-59 1E-62 342.8 29.6 278 1-298 31-314 (324) 24 protein:vir:4456 Length: 401 # 100.0 7.7E-60 4.8E-63 344.7 26.9 279 1-298 111-400 (401) 25 protein:vir:2344 Length: 397 # 100.0 1.8E-59 1.1E-62 342.6 28.8 281 1-298 14-305 (397) 26 protein:vir:103955 Length: 324 100.0 4.3E-59 2.7E-62 340.6 29.8 278 1-298 31-314 (324) 27 protein:vir:80376 Length: 435 100.0 2.8E-59 1.7E-62 341.6 28.7 288 1-298 136-432 (435) 28 protein:vir:99749 Length: 324 100.0 4.8E-59 3E-62 340.3 29.8 278 1-298 31-314 (324) 29 protein:vir:1433 Length: 435 # 100.0 3.5E-59 2.2E-62 341.0 28.3 288 1-298 136-432 (435) 30 protein:vir:4226 Length: 326 # 100.0 1.7E-58 1E-61 337.3 30.0 288 1-298 23-322 (326) 31 protein:vir:2430 Length: 318 # 100.0 1.9E-58 1.2E-61 337.1 30.2 284 1-298 18-312 (318) 32 protein:vir:4339 Length: 395 # 100.0 1.9E-58 1.2E-61 337.0 29.5 277 1-298 117-394 (395) 33 protein:vir:2504 Length: 305 # 100.0 1.3E-58 8.1E-62 337.9 28.2 285 1-298 1-297 (305) 34 protein:vir:93616 Length: 645 100.0 3.7E-58 2.3E-61 335.5 28.0 277 1-298 338-638 (645) 35 protein:vir:95763 Length: 297 100.0 1.1E-57 6.7E-61 332.9 29.9 276 1-298 13-295 (297) 36 protein:vir:8102 Length: 543 # 100.0 7.5E-58 4.7E-61 333.8 27.8 283 1-298 254-541 (543) 37 protein:vir:97053 Length: 390 100.0 1.2E-57 7.2E-61 332.7 28.6 273 1-297 117-390 (390) 38 protein:vir:1886 Length: 385 # 100.0 2.3E-57 1.4E-60 331.1 29.1 275 1-298 108-383 (385) 39 protein:vir:191 Length: 385 # 100.0 2.3E-57 1.4E-60 331.1 29.1 275 1-298 108-383 (385) 40 protein:vir:100135 Length: 418 100.0 2.6E-57 1.6E-60 330.8 29.4 275 1-298 139-414 (418) 41 protein:vir:81070 Length: 390 100.0 4.2E-57 2.6E-60 329.7 29.1 273 1-297 117-390 (390) 42 protein:vir:6242 Length: 390 # 100.0 2.1E-57 1.3E-60 331.3 26.9 273 1-298 114-388 (390) 43 protein:vir:10364 Length: 390 100.0 7.4E-57 4.6E-60 328.3 29.6 273 1-297 117-390 (390) 44 protein:vir:1328 Length: 392 # 100.0 6.9E-57 4.3E-60 328.5 27.8 275 1-298 114-390 (392) 45 protein:vir:7855 Length: 497 # 100.0 8.9E-57 5.5E-60 327.9 27.6 277 1-298 155-492 (497) 46 protein:vir:101650 Length: 497 100.0 8.9E-57 5.5E-60 327.9 27.6 277 1-298 155-492 (497) 47 protein:vir:102119 Length: 404 100.0 2.8E-56 1.7E-59 325.1 28.3 280 1-298 114-399 (404) 48 protein:vir:81160 Length: 371 100.0 2.8E-56 1.8E-59 325.1 28.1 266 1-298 95-370 (371) 49 protein:vir:4856 Length: 293 # 100.0 5.9E-56 3.7E-59 323.4 28.3 267 1-298 9-280 (293) 50 protein:vir:4997 Length: 397 # 100.0 8.5E-56 5.2E-59 322.5 28.1 266 1-298 113-384 (397) 51 protein:vir:4953 Length: 397 # 100.0 8.3E-56 5.2E-59 322.6 27.8 266 1-298 113-384 (397) 52 protein:vir:1268 Length: 397 # 100.0 7.5E-56 4.7E-59 322.8 27.5 266 1-298 127-396 (397) 53 protein:vir:104256 Length: 458 100.0 1.1E-55 6.6E-59 322.0 28.3 278 1-298 166-457 (458) 54 protein:vir:3845 Length: 395 # 100.0 1.1E-55 6.6E-59 322.0 27.4 267 1-298 111-382 (395) 55 protein:vir:6212 Length: 434 # 100.0 1.1E-55 6.9E-59 321.8 26.6 277 1-298 146-432 (434) 56 protein:vir:81227 Length: 413 100.0 3E-55 1.9E-58 319.5 28.8 275 1-298 122-409 (413) 57 protein:vir:4830 Length: 397 # 100.0 2E-55 1.2E-58 320.5 27.4 267 1-298 113-384 (397) 58 protein:vir:1025 Length: 408 # 100.0 4.1E-55 2.5E-58 318.8 28.1 266 1-298 120-392 (408) 59 protein:vir:107593 Length: 392 100.0 4.7E-55 2.9E-58 318.5 27.1 266 1-298 110-383 (392) 60 protein:vir:105004 Length: 392 100.0 4.7E-55 2.9E-58 318.5 27.1 266 1-298 110-383 (392) 61 protein:vir:102082 Length: 392 100.0 4.7E-55 2.9E-58 318.5 27.1 266 1-298 110-383 (392) 62 protein:vir:102873 Length: 392 100.0 4.7E-55 2.9E-58 318.5 27.1 266 1-298 110-383 (392) 63 protein:vir:4700 Length: 415 # 100.0 7.8E-55 4.9E-58 317.2 28.3 276 1-298 125-403 (415) 64 protein:vir:4600 Length: 415 # 100.0 7.8E-55 4.9E-58 317.2 28.3 276 1-298 125-403 (415) 65 protein:vir:95376 Length: 425 100.0 6.7E-55 4.2E-58 317.6 26.5 272 1-298 142-420 (425) 66 protein:vir:98339 Length: 415 100.0 1.7E-54 1.1E-57 315.4 28.4 276 1-298 124-403 (415) 67 protein:vir:81100 Length: 415 100.0 1.7E-54 1.1E-57 315.4 28.4 276 1-298 124-403 (415) 68 protein:vir:79987 Length: 415 100.0 1.7E-54 1.1E-57 315.4 28.4 276 1-298 124-403 (415) 69 protein:vir:9410 Length: 415 # 100.0 2E-54 1.2E-57 315.0 28.0 276 1-298 125-403 (415) 70 protein:vir:3991 Length: 404 # 100.0 2.2E-54 1.4E-57 314.7 28.2 267 1-298 120-392 (404) 71 protein:vir:4511 Length: 409 # 100.0 1.1E-54 7.1E-58 316.3 26.5 280 1-298 121-405 (409) 72 protein:vir:7409 Length: 408 # 100.0 2.9E-54 1.8E-57 314.1 27.4 266 1-298 120-392 (408) 73 protein:vir:1383 Length: 421 # 100.0 2.3E-54 1.4E-57 314.6 25.8 263 1-298 118-382 (421) 74 protein:vir:101607 Length: 379 100.0 9.5E-54 5.9E-57 311.3 27.8 264 1-298 111-378 (379) 75 protein:vir:94673 Length: 419 100.0 8.8E-53 5.5E-56 306.0 28.5 277 1-298 128-416 (419) 76 protein:vir:96762 Length: 632 100.0 8.6E-53 5.3E-56 306.0 25.3 266 1-298 361-632 (632) 77 protein:vir:3870 Length: 400 # 100.0 1.9E-52 1.2E-55 304.1 25.8 259 1-298 138-398 (400) 78 protein:vir:100172 Length: 394 100.0 1.5E-51 9.6E-55 299.1 27.2 263 1-298 115-383 (394) 79 protein:vir:9704 Length: 394 # 100.0 1E-51 6.2E-55 300.2 26.0 256 1-298 132-389 (394) 80 protein:vir:100884 Length: 389 100.0 1.7E-51 1.1E-54 298.9 26.4 263 1-298 113-381 (389) 81 protein:vir:1084 Length: 437 # 100.0 2E-51 1.3E-54 298.5 24.4 263 1-298 160-426 (437) 82 protein:vir:98635 Length: 377 100.0 1.7E-51 1E-54 299.0 22.8 274 1-298 83-376 (377) 83 protein:vir:4092 Length: 390 # 100.0 3E-50 1.9E-53 292.1 26.4 267 1-298 88-367 (390) 84 protein:vir:8420 Length: 477 # 100.0 1.1E-50 6.6E-54 294.6 23.7 281 1-298 160-470 (477) 85 protein:vir:962 Length: 397 # 100.0 4E-50 2.5E-53 291.4 22.9 259 1-298 136-396 (397) 86 protein:vir:78640 Length: 352 100.0 1.4E-49 8.7E-53 288.4 21.9 258 1-298 87-345 (352) 87 protein:vir:95963 Length: 395 100.0 8.8E-48 5.5E-51 278.6 25.8 268 1-298 90-375 (395) 88 protein:vir:101291 Length: 381 100.0 8.3E-48 5.2E-51 278.7 24.4 265 1-298 80-367 (381) 89 protein:vir:9509 Length: 381 # 100.0 8.3E-48 5.2E-51 278.7 24.4 265 1-298 80-367 (381) 90 protein:vir:100632 Length: 381 100.0 2.8E-47 1.7E-50 275.8 23.7 265 1-298 80-367 (381) 91 protein:vir:9361 Length: 402 # 100.0 6.1E-48 3.8E-51 279.4 20.1 258 1-298 137-395 (402) 92 protein:vir:93881 Length: 387 100.0 1.5E-47 9.3E-51 277.3 21.8 258 1-298 122-380 (387) 93 protein:vir:2685 Length: 387 # 100.0 8.4E-48 5.2E-51 278.7 20.1 258 1-298 122-380 (387) 94 protein:vir:96978 Length: 387 100.0 8.4E-48 5.2E-51 278.7 20.1 258 1-298 122-380 (387) 95 protein:vir:94424 Length: 387 100.0 8.4E-48 5.2E-51 278.7 20.1 258 1-298 122-380 (387) 96 protein:vir:80128 Length: 466 100.0 4.6E-46 2.8E-49 269.2 23.3 270 1-298 152-447 (466) 97 protein:vir:9643 Length: 377 # 100.0 1.9E-45 1.2E-48 265.7 25.5 266 1-298 83-376 (377) 98 protein:vir:78350 Length: 383 100.0 1.9E-45 1.2E-48 265.8 21.3 273 1-298 87-374 (383) 99 protein:vir:4197 Length: 314 # 100.0 7.3E-41 4.5E-44 240.6 24.7 283 1-298 17-312 (314) 100 protein:vir:4159 Length: 315 # 100.0 7E-40 4.3E-43 235.3 23.5 282 1-296 21-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 1.2E-36 7.6E-40 217.5 24.3 284 1-298 22-310 (321) 102 protein:vir:97397 Length: 517 100.0 8.4E-36 5.2E-39 212.9 21.3 270 1-298 243-515 (517) 103 protein:vir:3033 Length: 272 # 100.0 5.4E-33 3.3E-36 197.5 24.8 257 1-298 1-268 (272) 104 protein:vir:9820 Length: 272 # 100.0 5.4E-33 3.3E-36 197.5 24.8 257 1-298 1-268 (272) 105 protein:vir:4074 Length: 480 # 100.0 1.9E-33 1.2E-36 200.0 13.0 256 1-298 214-476 (480) 106 protein:vir:94933 Length: 330 99.9 3.6E-26 2.2E-29 160.1 21.2 284 1-298 25-328 (330) 107 protein:vir:3613 Length: 272 # 99.9 1.9E-25 1.2E-28 156.1 20.9 261 1-298 1-271 (272) 108 protein:vir:93742 Length: 274 99.9 1.1E-24 6.6E-28 152.0 22.5 256 1-298 1-269 (274) 109 protein:vir:96123 Length: 274 99.9 1.2E-23 7.3E-27 146.3 22.6 256 1-298 1-269 (274) 110 protein:vir:105334 Length: 276 99.9 8.1E-24 5.1E-27 147.2 21.2 258 1-298 1-269 (276) 111 protein:vir:96833 Length: 275 99.9 2.2E-23 1.3E-26 144.8 20.9 258 1-298 3-270 (275) 112 protein:vir:80930 Length: 278 99.9 5.3E-23 3.3E-26 142.7 21.6 264 1-298 1-276 (278) 113 protein:vir:94494 Length: 274 99.9 1.9E-22 1.2E-25 139.6 22.8 256 1-298 1-269 (274) 114 protein:vir:97433 Length: 274 99.9 1.9E-22 1.2E-25 139.6 22.8 256 1-298 1-269 (274) 115 protein:vir:95898 Length: 274 99.8 5.1E-22 3.2E-25 137.3 22.3 256 1-298 1-269 (274) 116 protein:vir:96262 Length: 274 99.8 5.1E-22 3.2E-25 137.3 22.3 256 1-298 1-269 (274) 117 protein:vir:1239 Length: 274 # 99.8 1E-21 6.3E-25 135.7 21.9 255 1-298 1-269 (274) 118 protein:vir:95107 Length: 270 99.8 1.7E-21 1E-24 134.5 20.3 258 1-298 1-264 (270) 119 protein:vir:97255 Length: 310 99.8 1.8E-20 1.1E-23 128.8 23.1 284 1-298 1-309 (310) 120 protein:vir:79928 Length: 393 99.8 3.7E-20 2.3E-23 127.1 16.8 282 1-298 74-377 (393) 121 protein:vir:739 Length: 231 # 99.7 9.6E-19 6E-22 119.4 17.5 228 31-298 1-230 (231) 122 protein:vir:102605 Length: 273 99.6 2.3E-16 1.4E-19 106.4 19.9 262 1-298 1-272 (273) 123 protein:vir:105822 Length: 273 99.6 2.3E-16 1.4E-19 106.4 19.9 262 1-298 1-272 (273) 124 protein:vir:7990 Length: 273 # 99.6 3.8E-16 2.4E-19 105.1 19.5 262 1-298 1-272 (273) 125 protein:vir:108211 Length: 318 99.6 2.8E-16 1.8E-19 105.8 18.0 277 1-298 1-316 (318) 126 protein:vir:99424 Length: 360 99.6 1.1E-15 6.7E-19 102.7 19.8 283 1-298 26-356 (360) 127 protein:vir:94576 Length: 347 99.5 2.9E-15 1.8E-18 100.3 17.0 283 1-297 1-347 (347) 128 protein:vir:5974 Length: 324 # 99.5 1.8E-14 1.1E-17 95.9 20.7 269 1-298 1-288 (324) 129 protein:vir:94622 Length: 341 99.5 5.6E-15 3.5E-18 98.7 17.4 287 1-298 1-338 (341) 130 protein:vir:80180 Length: 381 99.5 1.7E-14 1E-17 96.1 19.2 279 1-298 15-304 (381) 131 protein:vir:78739 Length: 332 99.4 1.8E-14 1.1E-17 95.9 16.2 284 1-297 17-332 (332) 132 protein:vir:10450 Length: 344 99.4 1.3E-14 8.2E-18 96.7 14.3 284 1-298 1-343 (344) 133 protein:vir:80213 Length: 334 99.4 2E-13 1.2E-16 90.3 20.3 284 1-298 1-333 (334) 134 protein:vir:3364 Length: 347 # 99.4 3E-14 1.9E-17 94.7 15.8 286 1-298 1-346 (347) 135 protein:vir:102944 Length: 330 99.3 3.9E-13 2.4E-16 88.6 20.0 274 1-298 1-292 (330) 136 protein:vir:103323 Length: 364 99.3 7E-13 4.3E-16 87.2 21.3 283 1-298 1-338 (364) 137 protein:vir:1583 Length: 351 # 99.3 1.2E-13 7.3E-17 91.5 16.8 273 1-298 1-298 (351) 138 protein:vir:2201 Length: 345 # 99.3 1.5E-13 9.2E-17 90.9 17.1 283 1-298 1-344 (345) 139 protein:vir:94711 Length: 347 99.3 5.2E-14 3.2E-17 93.4 14.4 284 1-298 1-345 (347) 140 protein:vir:6324 Length: 335 # 99.3 1E-12 6.3E-16 86.3 21.3 282 1-298 1-327 (335) 141 protein:vir:8324 Length: 410 # 99.3 4.6E-14 2.8E-17 93.7 13.5 257 1-297 132-410 (410) 142 protein:vir:1541 Length: 347 # 99.3 3.1E-13 1.9E-16 89.2 17.9 284 1-298 1-346 (347) 143 protein:vir:78935 Length: 335 99.3 1.7E-12 1.1E-15 85.1 20.8 282 1-298 1-327 (335) 144 protein:vir:100057 Length: 375 99.3 1.2E-12 7.7E-16 85.8 19.6 287 1-298 1-370 (375) 145 protein:vir:97031 Length: 402 99.3 1.7E-12 1E-15 85.2 18.7 287 1-298 1-335 (402) 146 protein:vir:8885 Length: 347 # 99.2 7.1E-13 4.4E-16 87.2 15.5 284 1-298 1-345 (347) 147 protein:vir:95318 Length: 328 99.2 5.2E-12 3.2E-15 82.5 18.3 227 1-235 1-328 (328) 148 protein:vir:103285 Length: 296 99.2 6.4E-12 4E-15 81.9 18.4 275 1-298 5-294 (296) 149 protein:vir:93858 Length: 400 99.1 1.1E-12 6.8E-16 86.2 12.5 274 1-297 122-400 (400) 150 protein:vir:99675 Length: 324 99.1 3.2E-12 2E-15 83.6 14.5 253 30-298 1-298 (324) 151 protein:vir:105645 Length: 400 99.1 1.7E-11 1.1E-14 79.6 17.7 290 1-298 1-332 (400) 152 protein:vir:7019 Length: 401 # 99.1 1.9E-11 1.2E-14 79.4 16.9 289 1-298 1-332 (401) 153 protein:vir:107687 Length: 319 99.0 1.3E-10 7.8E-14 74.9 19.2 275 1-297 27-319 (319) 154 protein:vir:102655 Length: 322 99.0 8.9E-11 5.5E-14 75.7 18.3 279 1-298 13-320 (322) 155 protein:vir:80068 Length: 301 99.0 2E-10 1.3E-13 73.7 19.9 275 1-297 1-301 (301) 156 protein:vir:95131 Length: 325 99.0 1.4E-10 8.6E-14 74.6 18.6 277 1-298 1-295 (325) 157 protein:vir:104342 Length: 314 99.0 1.1E-10 6.6E-14 75.2 16.9 274 1-298 23-312 (314) 158 protein:vir:3136 Length: 322 # 99.0 3.5E-11 2.2E-14 77.9 14.0 279 1-298 1-318 (322) 159 protein:vir:103759 Length: 330 99.0 1E-10 6.2E-14 75.4 15.7 227 1-235 1-330 (330) 160 protein:vir:98525 Length: 331 98.9 5.3E-10 3.3E-13 71.4 18.9 227 1-235 1-331 (331) 161 protein:vir:107826 Length: 331 98.9 5.3E-10 3.3E-13 71.4 18.9 227 1-235 1-331 (331) 162 protein:vir:107388 Length: 331 98.9 5.3E-10 3.3E-13 71.4 18.9 227 1-235 1-331 (331) 163 protein:vir:9927 Length: 295 # 98.8 2.6E-10 1.6E-13 73.2 13.7 257 1-298 1-287 (295) 164 protein:vir:79642 Length: 329 98.8 2.7E-09 1.7E-12 67.6 18.6 276 1-298 29-327 (329) 165 protein:vir:8843 Length: 317 # 98.7 1.1E-08 7E-12 64.2 20.3 280 1-298 1-314 (317) 166 protein:vir:7324 Length: 335 # 98.7 1.9E-09 1.2E-12 68.4 16.0 228 1-236 1-335 (335) 167 protein:vir:9875 Length: 296 # 98.6 1.3E-09 8E-13 69.3 11.9 259 1-298 1-294 (296) 168 protein:vir:108303 Length: 418 98.5 1.8E-07 1.1E-10 57.6 19.7 261 1-298 1-281 (418) 169 protein:vir:80446 Length: 367 98.4 1.3E-07 8.3E-11 58.3 18.1 276 1-298 1-339 (367) 170 protein:vir:99075 Length: 392 98.4 1.3E-07 8.3E-11 58.3 18.0 272 1-298 1-312 (392) 171 protein:vir:107732 Length: 379 98.4 4.5E-08 2.8E-11 60.9 14.9 288 1-297 56-379 (379) 172 protein:vir:94070 Length: 339 98.4 5E-08 3.1E-11 60.6 15.1 274 1-297 35-339 (339) 173 protein:vir:94989 Length: 349 98.3 1.3E-06 7.9E-10 52.9 21.8 273 1-298 1-319 (349) 174 protein:vir:78387 Length: 349 98.3 8.6E-07 5.4E-10 53.8 20.3 273 1-298 1-319 (349) 175 protein:vir:106647 Length: 303 98.3 1.1E-08 6.9E-12 64.2 9.6 257 1-298 1-296 (303) 176 protein:vir:94800 Length: 319 98.3 1.7E-06 1E-09 52.3 20.6 262 1-298 19-293 (319) 177 protein:vir:97331 Length: 319 98.3 1.7E-06 1E-09 52.3 20.6 262 1-298 19-293 (319) 178 protein:vir:101557 Length: 336 98.2 7.3E-08 4.5E-11 59.7 12.5 274 1-297 42-336 (336) 179 protein:vir:3643 Length: 336 # 98.2 1E-07 6.2E-11 59.0 12.1 274 1-297 42-336 (336) 180 protein:vir:78558 Length: 336 98.2 1.3E-07 8.3E-11 58.3 12.6 275 1-297 42-336 (336) 181 protein:vir:96792 Length: 315 98.1 1.6E-06 1E-09 52.3 17.6 261 1-298 1-280 (315) 182 protein:vir:107120 Length: 329 98.1 4.5E-06 2.8E-09 49.9 20.7 263 1-298 30-304 (329) 183 protein:vir:79548 Length: 652 98.1 2.8E-06 1.7E-09 51.1 18.7 270 1-296 361-652 (652) 184 protein:vir:95512 Length: 693 98.0 5E-06 3.1E-09 49.7 18.6 273 1-297 394-693 (693) 185 protein:vir:5255 Length: 304 # 97.9 1.7E-06 1E-09 52.3 14.7 277 3-296 1-304 (304) 186 protein:vir:106734 Length: 336 97.9 4.6E-07 2.9E-10 55.3 11.5 275 1-297 42-336 (336) 187 protein:vir:3525 Length: 423 # 97.9 1.1E-05 6.5E-09 47.9 18.5 264 1-298 1-309 (423) 188 protein:vir:95451 Length: 313 97.8 5.2E-06 3.2E-09 49.6 15.4 283 1-298 3-311 (313) 189 protein:vir:105374 Length: 423 97.7 3.1E-05 1.9E-08 45.3 19.9 275 1-298 1-302 (423) 190 protein:vir:99576 Length: 388 97.7 3.5E-06 2.2E-09 50.5 12.6 279 1-297 74-388 (388) 191 protein:vir:174 Length: 423 # 97.7 3.2E-05 2E-08 45.2 18.6 269 1-298 1-302 (423) 192 protein:vir:96079 Length: 382 97.6 8.6E-06 5.3E-09 48.4 13.8 278 1-297 70-382 (382) 193 protein:vir:1829 Length: 355 # 97.5 4.4E-05 2.7E-08 44.5 16.4 281 1-298 25-341 (355) 194 protein:vir:105522 Length: 423 97.4 8.3E-05 5.1E-08 43.0 18.9 268 1-298 1-302 (423) 195 protein:vir:104011 Length: 337 97.4 8.5E-05 5.3E-08 42.9 16.9 281 1-298 16-333 (337) 196 protein:vir:79171 Length: 337 97.4 8.6E-05 5.3E-08 42.9 16.9 281 1-298 16-333 (337) 197 protein:vir:1153 Length: 338 # 97.3 9.4E-05 5.9E-08 42.7 16.7 281 1-298 16-335 (338) 198 protein:vir:98856 Length: 343 97.3 8.9E-05 5.5E-08 42.8 16.4 279 1-298 27-332 (343) 199 protein:vir:3746 Length: 336 # 97.3 9.5E-05 5.9E-08 42.6 17.2 276 1-298 24-329 (336) 200 protein:vir:98566 Length: 355 97.3 8E-05 5E-08 43.0 16.1 281 1-298 16-341 (355) 201 protein:vir:3783 Length: 336 # 97.3 0.00011 7.1E-08 42.2 16.9 277 1-298 24-329 (336) 202 protein:vir:79008 Length: 299 97.2 0.00014 8.6E-08 41.7 20.5 269 1-298 1-298 (299) 203 protein:vir:100331 Length: 342 97.1 0.00013 8.4E-08 41.8 15.1 281 1-298 16-337 (342) 204 protein:vir:1781 Length: 221 # 97.1 5.1E-05 3.2E-08 44.1 12.7 197 78-291 1-221 (221) 205 protein:vir:78777 Length: 358 97.0 0.00023 1.5E-07 40.5 15.6 279 1-298 20-337 (358) 206 protein:vir:270 Length: 341 # 96.9 0.0002 1.2E-07 40.9 14.9 276 1-298 20-331 (341) 207 protein:vir:95875 Length: 401 96.8 0.00032 2E-07 39.8 16.5 285 1-298 12-399 (401) 208 protein:vir:2016 Length: 357 # 96.8 0.00025 1.6E-07 40.3 14.5 281 1-298 16-341 (357) 209 protein:vir:5694 Length: 357 # 96.8 0.00026 1.6E-07 40.3 14.4 281 1-298 16-341 (357) 210 protein:vir:6061 Length: 357 # 96.8 0.00026 1.6E-07 40.2 14.5 281 1-298 16-341 (357) 211 protein:vir:78186 Length: 337 96.7 0.00038 2.3E-07 39.3 15.0 281 1-298 16-333 (337) 212 protein:vir:79157 Length: 339 96.7 0.00039 2.4E-07 39.3 15.0 281 1-298 16-334 (339) 213 protein:vir:103886 Length: 302 96.5 0.00053 3.3E-07 38.6 18.2 271 1-298 1-301 (302) 214 protein:vir:78920 Length: 290 95.1 0.0028 1.7E-06 34.6 20.6 268 1-298 1-289 (290) 215 protein:vir:102823 Length: 470 92.5 0.011 6.9E-06 31.3 12.7 289 1-298 19-340 (470) 216 protein:vir:95603 Length: 463 91.6 0.015 9.4E-06 30.6 15.0 275 1-298 26-338 (463) 217 protein:vir:99311 Length: 463 91.6 0.015 9.4E-06 30.6 15.0 275 1-298 26-338 (463) 218 protein:vir:99888 Length: 309 91.3 0.016 1E-05 30.4 13.0 270 3-298 1-307 (309) 219 protein:vir:5942 Length: 523 # 91.1 0.017 1.1E-05 30.2 12.1 275 1-298 217-520 (523) 220 protein:vir:96666 Length: 462 91.0 0.018 1.1E-05 30.2 17.4 287 1-298 36-368 (462) 221 protein:vir:79712 Length: 285 89.7 0.025 1.5E-05 29.4 19.3 264 1-298 1-284 (285) 222 protein:vir:861 Length: 318 # 88.4 0.0096 5.9E-06 31.7 7.0 270 1-297 41-318 (318) 223 protein:vir:93966 Length: 400 88.0 0.007 4.4E-06 32.4 6.0 268 1-297 123-400 (400) 224 protein:vir:79078 Length: 307 87.6 0.037 2.3E-05 28.4 14.2 269 1-298 1-306 (307) 225 protein:vir:1663 Length: 393 # 86.5 0.012 7.4E-06 31.1 6.3 268 1-297 116-393 (393) 226 protein:vir:103370 Length: 418 86.3 0.047 2.9E-05 27.9 15.3 275 1-298 71-405 (418) 227 protein:vir:96442 Length: 418 86.1 0.048 3E-05 27.8 17.2 281 1-298 71-405 (418) 228 protein:vir:107882 Length: 307 86.0 0.049 3E-05 27.8 16.7 269 1-298 1-306 (307) 229 protein:vir:93696 Length: 364 85.3 0.054 3.3E-05 27.5 17.1 287 1-296 1-364 (364) 230 protein:vir:105464 Length: 346 82.6 0.076 4.7E-05 26.7 19.7 267 1-298 1-299 (346) 231 protein:vir:96490 Length: 348 81.8 0.082 5.1E-05 26.5 19.9 296 1-298 1-346 (348) 232 protein:vir:80835 Length: 464 81.4 0.086 5.4E-05 26.4 15.3 284 1-298 22-364 (464) 233 protein:vir:103463 Length: 521 78.8 0.11 6.9E-05 25.8 15.1 280 1-298 79-500 (521) 234 protein:vir:100603 Length: 529 78.5 0.11 7.1E-05 25.8 12.2 278 1-298 178-509 (529) 235 protein:vir:106286 Length: 534 76.7 0.13 8.2E-05 25.4 14.6 281 1-298 87-513 (534) 236 protein:vir:2736 Length: 348 # 75.9 0.14 8.8E-05 25.2 20.0 295 1-298 1-346 (348) 237 protein:vir:5670 Length: 514 # 72.0 0.19 0.00012 24.6 14.1 272 1-298 165-499 (514) 238 protein:vir:348 Length: 321 # 71.5 0.19 0.00012 24.5 18.4 288 1-297 1-321 (321) 239 protein:vir:104549 Length: 462 70.7 0.21 0.00013 24.3 13.2 265 1-298 147-460 (462) 240 protein:vir:100851 Length: 514 69.7 0.22 0.00014 24.2 13.6 280 1-298 43-399 (514) 241 protein:vir:80986 Length: 528 66.6 0.26 0.00016 23.8 15.0 280 1-298 78-508 (528) 242 protein:vir:107947 Length: 519 60.8 0.37 0.00023 23.0 14.5 280 1-298 77-498 (519) 243 protein:vir:102335 Length: 312 59.0 0.4 0.00025 22.8 20.2 271 1-298 1-309 (312) 244 protein:vir:4902 Length: 348 # 58.7 0.41 0.00025 22.7 18.1 296 1-298 1-346 (348) 245 protein:vir:98143 Length: 524 55.5 0.48 0.0003 22.3 12.2 276 1-298 174-505 (524) 246 protein:vir:98480 Length: 348 55.2 0.48 0.0003 22.3 20.1 291 1-298 1-348 (348) 247 protein:vir:6901 Length: 522 # 54.3 0.51 0.00031 22.2 12.6 267 1-298 194-501 (522) 248 protein:vir:78148 Length: 123 53.2 0.37 0.00023 22.9 5.8 107 178-298 1-122 (123) 249 protein:vir:2106 Length: 430 # 53.0 0.54 0.00034 22.0 18.1 265 1-298 1-302 (430) 250 protein:vir:103181 Length: 457 51.3 0.59 0.00036 21.9 14.0 270 1-298 141-445 (457) 251 protein:vir:104915 Length: 470 51.2 0.59 0.00036 21.9 18.2 281 1-298 69-458 (470) 252 protein:vir:78090 Length: 302 50.5 0.61 0.00038 21.8 19.7 270 1-298 1-300 (302) 253 protein:vir:80491 Length: 467 43.1 0.86 0.00053 21.0 15.2 282 1-298 35-337 (467) 254 protein:vir:63741 Length: 468 42.9 0.86 0.00054 20.9 15.1 282 1-298 36-338 (468) 255 protein:vir:7214 Length: 521 # 41.4 0.93 0.00058 20.8 12.3 278 1-298 166-500 (521) 256 protein:vir:1991 Length: 305 # 36.3 1.2 0.00073 20.2 11.4 221 1-298 1-237 (305) 257 protein:vir:101039 Length: 529 33.3 1.4 0.00085 19.8 13.3 272 1-298 197-510 (529) 258 protein:vir:6601 Length: 528 # 33.1 1.4 0.00086 19.8 16.4 280 1-298 78-508 (528) 259 protein:vir:106590 Length: 349 27.5 1.8 0.0011 19.1 18.5 293 1-297 1-349 (349) 260 protein:vir:101811 Length: 529 26.5 1.9 0.0012 19.0 16.0 268 1-298 197-510 (529) 261 protein:vir:9265 Length: 430 # 25.6 2 0.0013 18.9 17.4 265 1-298 1-347 (430) 262 protein:vir:100939 Length: 430 25.6 2 0.0013 18.9 17.4 265 1-298 1-347 (430) 263 protein:vir:99523 Length: 311 23.7 2.3 0.0014 18.6 20.6 280 1-297 1-311 (311) 264 protein:vir:94870 Length: 318 22.6 2.4 0.0015 18.5 8.9 264 1-297 35-318 (318) No 1 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=1.4e-76 Score=436.52 Aligned_cols=298 Identities=99% Similarity=1.423 Sum_probs=284.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ||++||++||++++++||+.++++++++++++++|++++.+++|+.++.++++|++|++++|+++++|+++++++||+++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a~ 80 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEEE Confidence 99999999999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|||++++++..+++++|++++++++++++|.++++|+++++|....+.+.......++............+++| T Consensus 81 ~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:16 81 GARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHHH Confidence 99999999999989999999999999999999999999999999999999888888777776666666777777788999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++.++++|+|||+++..|+++||++|||+|++..+.+.+++|+|+||++++++|...+.+++.+++|||++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~ 240 (298) T protein:vir:16 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999999888889999999999999999998888888999999999 Q ss_pred eEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++.++.++++++++.++.+.+++++++|++|++++|+++|+|++++||+||++||+|| T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 241 GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 9889999999999999998898899999999999999999999999999999999999 No 2 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=3e-76 Score=434.61 Aligned_cols=298 Identities=100% Similarity=1.437 Sum_probs=284.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ||++||++||++++++|++.++++|+++++++++|++++.+++|++++.++++|++|++++|+++++|+++++.+||+++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|||+++.++..+++++|++++++++++++|.++|+|+++++|....+.+..+....++............++++ T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHHH Confidence 99999999999889999999999999999999999999999998888888888888777777777667777777889999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++.++++|+|||+++.+|+++||++|+|+|++..+++.+++|+|+||++++.+|.+.+.+.+.+++|||++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~ 240 (298) T protein:vir:94 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999998888888999999999 Q ss_pred eEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++.|+.++++++++.++.+.+++.+++|++|++++|+++|+|++++||+||++||+|| T Consensus 241 ~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 241 GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9889999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4.7e-72 Score=411.61 Aligned_cols=296 Identities=58% Similarity=1.041 Sum_probs=273.0 Q ss_pred Ce---eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeE Q lcl|Aclame:pro 1 MV---LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIK 77 (298) Q Consensus 1 ma---t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k 77 (298) || +++|++||++++.+||+.++++|+++++++++|++++.+++|+.+++++++|++|++++|+++++|+++++++|| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 80 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLK 80 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEE Confidence 55 566899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+||+++++++..+++++|.+++++++++++|.++|+|+++++|....+.+.....+..+.. ........+ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 158 (300) T protein:vir:95 81 VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQT--VPFKDTNPD 158 (300) T ss_pred EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCccccccccccccccee--ecccccchH Confidence 999999999999988889999999999999999999999999999988888887777766555544332 223345668 Q ss_pred HHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEee Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd 237 (298) ++|.+++..+...++++++|+|||+++.+|++|||++|||||++...++.+++|+|+||++++.+|...+..++.+++|| T Consensus 159 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GD 238 (300) T protein:vir:95 159 ESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGD 238 (300) T ss_pred HHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEee Confidence 89999999999999999999999999999999999999999998888888999999999999999988888888899999 Q ss_pred ccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 238 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |++++.++.|++++++++++.+.++..+++|++|++++|+++|+|+++++|+||++|+++= T Consensus 239 f~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 299 (300) T protein:vir:95 239 FETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTG 299 (300) T ss_pred ccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCC Confidence 9998889999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=7.5e-70 Score=399.55 Aligned_cols=297 Identities=62% Similarity=1.001 Sum_probs=267.6 Q ss_pred Cee--ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVL--NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat--~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) |+| +||++||++++++||+.+++.|+++++++++|++++.+++|+.++++++.|++|++++|+++++|+++++++||+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl 80 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKV 80 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEE Confidence 765 668999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+||+++++++.++++++|.+++++++++++|.++|+|+++++|....+.+........+.... .......++ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 159 (303) T protein:vir:97 81 EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVK-FTESEDADA 159 (303) T ss_pred EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccc-cccccchHH Confidence 9999999999998888999999999999999999999999999988878877777666555444433333 234455789 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccc-cccCcceecceeeEecCccccccc--cccceEEE Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK-WGATPDTINGLPVDVNKTVSDMSL--TQRDRAII 235 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~-~~~~~~~l~G~PV~~s~~~~~~~~--~~~~~~~~ 235 (298) +|.+++.++..+++.+++|+|||+++.+|+++||++|+|+|.+.. .++.+++|+|+||+++++||.... .+...+++ T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~ 239 (303) T protein:vir:97 160 NIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVII 239 (303) T ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEEEE Confidence 999999999999999999999999999999999999999997654 455677999999999999987543 34457899 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||+..+.++.|+++++++.++.+.+++++++|++|+++||+++|+|+++++|+||++||+|. T Consensus 240 Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~ 302 (303) T protein:vir:97 240 GDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGE 302 (303) T ss_pred eeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCC Confidence 999998899999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=7.6e-68 Score=388.55 Aligned_cols=295 Identities=34% Similarity=0.522 Sum_probs=254.0 Q ss_pred Cee--ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVL--NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat--~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) ||+ +||++||++++++||+.++++|+++++++++|++++.+++|+++++++++|++|++++|+++++|+++++.+||+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 554 467999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|||+++.++..+++++|.+++++++++++|.++++|++++++........... ................+. T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~-~~~~~~~~~~~~~~~~~~ 159 (311) T protein:vir:81 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKIL-DTTNIVELTTGTSATPDL 159 (311) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccc-ccceeeeecccccchHHH Confidence 9999999999998889999999999999999999999999999986555543322222111 111112222233334567 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccc------------ Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS------------ 226 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~------------ 226 (298) ++.+++.++...+.++++|+|||+++.+|++|||++|+|+|++...++.+++|+|+||++++.||.+. T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~ 239 (311) T protein:vir:81 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRT 239 (311) T ss_pred HHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhcc Confidence 78888888888888999999999999999999999999999998888899999999999999998643 Q ss_pred ccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 227 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.....+++|||+++ .++.+++++++++++.+.++. +++|++|+++||+++|+|++++||+||++|++|+ T Consensus 240 ~~~~~~~~~gDfs~~-~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~ 309 (311) T protein:vir:81 240 TNPNVKAIAGDFSAF-RWGVQVSIPLELIEFGDPDGL-GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) T ss_pred cCCccEEEEEecccE-EEEEeccceEEEeccCCCCcc-hhhhhcCcEEEEEEEEeccEeecccceEEEEeec Confidence 223456799999985 589999999999998876654 6899999999999999999999999999999999 No 6 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1.7e-64 Score=370.17 Aligned_cols=294 Identities=31% Similarity=0.534 Sum_probs=252.9 Q ss_pred Ce---eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeE Q lcl|Aclame:pro 1 MV---LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIK 77 (298) Q Consensus 1 ma---t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k 77 (298) || +++|++||++++++|++.++++|+++++++++|++++.++||++++.+++.|++|++++|+++++|+++++++|| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 80 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKK 80 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 55 577899999999999999999999999999999999889999999999999999999999999999999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc-ccccchh Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA-PRGIADP 156 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 156 (298) +++++++|+||++++.|+..+|+++|.+++++++++++|.++|+|++++++ .++.+..+.....+..... ....... T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g--~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 81 AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTG--TVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccC--ccccccccccccccceeeccccccchh Confidence 999999999999988889999999999999999999999999999764433 3444444444444333333 3333445 Q ss_pred HHHHHHHhhhhhhcC--CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccc------- Q lcl|Aclame:pro 157 NGAIENAVELLTGVD--ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL------- 227 (298) Q Consensus 157 ~~~i~~~~~~l~~~~--~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~------- 227 (298) ++++.+++..+...+ +.+++|+|||+++..|+++||++|||+|++...++.+++|+|+||++++.+|.... T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~ 238 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDED 238 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccch Confidence 677888888777664 45667999999999999999999999999998888899999999999999875433 Q ss_pred ---cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 ---TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+.+.+++|||+.++.|+.+++++++++++.+.+. .+++|++|+++||+++|+||++.|| +|++++.++ T Consensus 239 ~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~ 310 (311) T protein:vir:99 239 LDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDG-QGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAV 310 (311) T ss_pred hhccCcceEEEeeccccEEEEEecCceEEEeecCCCCc-chhhhhcCcEEEEEEEeecceecCh-hHeeeeccc Confidence 24456789999999999999999999999876554 5789999999999999999999986 688888888 No 7 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.3e-64 Score=370.74 Aligned_cols=291 Identities=29% Similarity=0.466 Sum_probs=244.5 Q ss_pred Ce----eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MV----LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 ma----t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) || ++||++||++++++||+.+++.|+++++++++|++++.++||++++++.++|++|++++++++++|+++++.+| T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 77 56789999999999999999999999999999999989999999999999999999999999999999999999 Q ss_pred EEEEEEeecHHHhhcccccH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEK-INILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~-~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) |++++++||+||++++.++. ..|.++|.+++++++++++|.++|+|++++++.. +.+.... ............. T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~ 155 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKA--ASAVHTS---LNKTKNIVDATDS 155 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcc--ccccccc---cccccceeecccc Confidence 99999999999998876543 4488999999999999999999999986555432 2222221 2222222333444 Q ss_pred hHHHHHHHhhhhhhcCC-cccEEEEcHHHHHHHHHhhccCCc-----eeecccccccCcceecceeeEecCcccccccc- Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDA-DVTGIAINPSFRSALAKQKDLQGN-----ALFPELKWGATPDTINGLPVDVNKTVSDMSLT- 228 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~-~~~~~vm~~~~~~~L~~lkd~~G~-----~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~- 228 (298) .++++.+++.++...+. .+++|+|||+++..|+++||.+|+ |+|++. ..+.+++|+|+||+++++||.+... T Consensus 156 ~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~-~~g~~~tl~G~PV~~~~~~~~~~~~~ 234 (315) T protein:vir:80 156 ATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAA-GFAGLDNWRGLNVGASSTVSGAPEMS 234 (315) T ss_pred chHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCccccccccccc-ccCCCceecceeeEecCcCCcccccc Confidence 57888899888866544 567899999999999999876654 556443 3456789999999999999876443 Q ss_pred --ccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 --QRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 --~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..+.+++|||+++ .++.+++++++++++.+.+.+.+++|++|+++||+++|+|++++||+||++|++++ T Consensus 235 ~~~~~~~~~GDfs~~-~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 235 PASGVKAIVGDFSRV-HWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) T ss_pred cccccEEEEeecccE-EEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc Confidence 3456899999985 48999999999999999888899999999999999999999999999999999888 No 8 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.8e-62 Score=359.12 Aligned_cols=294 Identities=20% Similarity=0.264 Sum_probs=249.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...++|.++|++++++|++.+++.++++++++++|++++.+++|+.++.+++.|++|++++++++++|+++++.+||+++ T Consensus 13 ~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 92 (330) T protein:vir:77 13 LTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSFGKQELEPVKITT 92 (330) T ss_pred ccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccccccceeeEEEEeEEEEEE Confidence 45566889999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc-ccccccccccccchhHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS-KVTQKVEAPRGIADPNGA 159 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 159 (298) ++++|+|||++ +..+++++|.++|++++++++|.++|+|+|.+....+.......... ..+..........+.+++ T Consensus 93 ~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (330) T protein:vir:77 93 IFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLA 169 (330) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccccccccchhHHH Confidence 99999999964 45789999999999999999999999997643322221111111111 112233344455667899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccccc-----CcceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA-----TPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~-----~~~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) +.+++..+...+..+++|+|||+++..|+++||++|||+|++..+.+ ..++|+|+||+++++||++...++..++ T Consensus 170 l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~ 249 (330) T protein:vir:77 170 VNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGV 249 (330) T ss_pred HHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEE Confidence 99999999999999999999999999999999999999998765444 3468999999999999988777788899 Q ss_pred EeeccceEEEEeecceEEEEeecccc----------cccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDP----------DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~----------~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +|||+.++ ++.+++++++++++... ....+++|++|+++||+++|+|++++||+||++|+.++ T Consensus 250 ~gd~s~~~-i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~ 322 (330) T protein:vir:77 250 MGDFSQVI-WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQV 322 (330) T ss_pred EEecceEE-EEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 99999865 89999999999886542 23456889999999999999999999999999999999 No 9 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=9.3e-62 Score=355.18 Aligned_cols=292 Identities=25% Similarity=0.391 Sum_probs=243.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcc--------eEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE--------IDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~--------a~~v~E~~~~~~~~~~~~~v~ 72 (298) ..++++.|||++++++||+.+++.|+++++|+++|++++.+++|+.+..+. +.|++|++++++++++|++++ T Consensus 20 ~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~ 99 (338) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRS 99 (338) T ss_pred eecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccccccccccccccccceeEEE Confidence 555667799999999999999999999999999999999999999876544 556679999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc--ccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS--KVTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~--~~~~~~~~~ 150 (298) ++++|+++++++|+||+++ +..+++++|.+++++++++++|.++|+|++++.+ .++.+...... ..+...... T Consensus 100 l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~--~~~~gi~~~~~~~~~~~~~~~~ 174 (338) T protein:vir:78 100 VAPIKLATIVTVSEEFARM---NPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG--SALQGIDTNNVIVNTTNVDYLQ 174 (338) T ss_pred EEEEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccccccccccccccccccccc Confidence 9999999999999999974 4478999999999999999999999999764333 33333332212 222233333 Q ss_pred cccchhHHHHHHHhhhhhhc-CCcccEEEEcHHHHHHH---HHhhccCCceeecccccccCcceecceeeEecCcccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGV-DADVTGIAINPSFRSAL---AKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~-~~~~~~~vm~~~~~~~L---~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~ 226 (298) ......++++.+++..+... ....++|+|||+++..| +++||++|+|+|++....+.+++|+|+||+++++||... T Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~ 254 (338) T protein:vir:78 175 TGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDL 254 (338) T ss_pred ccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEccccCccc Confidence 44456688899988887654 45677899999998776 457899999999998888889999999999999999754 Q ss_pred c---cccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 227 L---TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 227 ~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) + ..+..+++|||+.+ .++.+++++++++++.. .+...+++|++|++++|+++|+||+++||+||++|++| T Consensus 255 ~~~~~~~~~~~~gdfs~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 255 GAATDSKVRVVGGDFSQL-KYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDD 333 (338) T ss_pred cccCCcccEEEEEecceE-EEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecc Confidence 3 33456899999875 58999999999999864 34567899999999999999999999999999999999 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) | T Consensus 334 ~ 334 (338) T protein:vir:78 334 E 334 (338) T ss_pred c Confidence 9 No 10 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.9e-61 Score=353.45 Aligned_cols=284 Identities=15% Similarity=0.216 Sum_probs=243.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...+||++||++++.+|++.++++++++++++++|++++.++||++++.+.+.|++|++++|+++++|++++++++|+++ T Consensus 13 ~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~ 92 (304) T protein:vir:94 13 LSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKIGV 92 (304) T ss_pred ccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEEEE Confidence 44455899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|+++ ++..+++++|.++|++++++++|.++++|+|.+........ .....+.............+++| T Consensus 93 ~~~iS~ell~---ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~i 166 (304) T protein:vir:94 93 IIPLSKEFLK---WTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGK---PLVEGAEEKGNVVTDTNNLYVDL 166 (304) T ss_pred eehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccc---cccccccccccccccccchHHHH Confidence 9999999997 44589999999999999999999999999754332211111 11122223333344555679999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++..+++|+|||+++..|+++||++|+|+|.+ .+++|+|+||+++++||... ++..+++|||++ T Consensus 167 ~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-----~~~~l~G~PV~~~~~~~~~~--~~~~~~~gd~~~ 239 (304) T protein:vir:94 167 SALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA-----NGNEIMGLPLSYTGADVYDK--KKSLALMGDWDY 239 (304) T ss_pred HHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC-----CCccccceeeEEecccccCC--CCcEEEEEehhh Confidence 99999999999999999999999999999999999999965 34689999999999998643 345689999998 Q ss_pred eEEEEeecceEEEEeecc--------cccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYG--------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~--------~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) + .++.+++++++++++. +.++..+++|++|+++||+++|+|+++++|+||++||.|+ T Consensus 240 ~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 240 A-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred E-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 6 4899999999888764 3556678899999999999999999999999999999999 No 11 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.9e-61 Score=353.45 Aligned_cols=284 Identities=15% Similarity=0.216 Sum_probs=243.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...+||++||++++.+|++.++++++++++++++|++++.++||++++.+.+.|++|++++|+++++|++++++++|+++ T Consensus 13 ~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~ 92 (304) T protein:vir:10 13 LSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEYAQAEMEAKKIGV 92 (304) T ss_pred ccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccceeeEEEEEEEEEEE Confidence 44455899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|+++ ++..+++++|.++|++++++++|.++++|+|.+........ .....+.............+++| T Consensus 93 ~~~iS~ell~---ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~i 166 (304) T protein:vir:10 93 IIPLSKEFLK---WTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGK---PLVEGAEEKGNVVTDTNNLYVDL 166 (304) T ss_pred eehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccc---cccccccccccccccccchHHHH Confidence 9999999997 44589999999999999999999999999754332211111 11122223333344555679999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++..+++|+|||+++..|+++||++|+|+|.+ .+++|+|+||+++++||... ++..+++|||++ T Consensus 167 ~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~-----~~~~l~G~PV~~~~~~~~~~--~~~~~~~gd~~~ 239 (304) T protein:vir:10 167 SALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDA-----NGNEIMGLPLSYTGADVYDK--KKSLALMGDWDY 239 (304) T ss_pred HHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecC-----CCccccceeeEEecccccCC--CCcEEEEEehhh Confidence 99999999999999999999999999999999999999965 34689999999999998643 345689999998 Q ss_pred eEEEEeecceEEEEeecc--------cccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYG--------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~--------~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) + .++.+++++++++++. +.++..+++|++|+++||+++|+|+++++|+||++||.|+ T Consensus 240 ~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 240 A-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred E-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 6 4899999999888764 3556678899999999999999999999999999999999 No 12 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.3e-61 Score=353.00 Aligned_cols=292 Identities=23% Similarity=0.365 Sum_probs=245.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecc--------ccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAES--------GKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~--------~~~~~~~~~~~~v~ 72 (298) |+..++.+||+++.++|++.+++.++++++++++|++++.+++|+.++.+.+.|++|+ +.+++++++|++++ T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~ 99 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRS 99 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccccccccccccccccceeEEE Confidence 6777778999999999999999999999999999999999999999999999998776 46788999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccc--cccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK--VTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~--~~~~~~~~ 150 (298) +++||+++++++|+|++++ +..+++++|+++|++++++++|.++|+|+|... ..++.++...... .+...... T Consensus 100 l~~~kl~~~~~is~ell~~---s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~--~~~~~g~~~~~~~~~~~~~~~~~ 174 (333) T protein:vir:78 100 VSPIKLATIVTVSEEFARM---NPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLT--GSALQGIDTDNVIANTTNVDYLQ 174 (333) T ss_pred EeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCC--Ccccccccccccccccccccccc Confidence 9999999999999999974 447899999999999999999999999976433 3334443322222 22333334 Q ss_pred cccchhHHHHHHHhhhhhhcC-CcccEEEEcHHHHHHHHH---hhccCCceeecccccccCcceecceeeEecCcccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVD-ADVTGIAINPSFRSALAK---QKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~-~~~~~~vm~~~~~~~L~~---lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~ 226 (298) ......+++|.+++..+..++ +.+++|+|||.++..|++ ++|++|+|+|++....+.+++|+|+||+++++||.+. T Consensus 175 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~ 254 (333) T protein:vir:78 175 ETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDL 254 (333) T ss_pred cccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCc Confidence 455566889999998887764 456789999999987765 6799999999998888889999999999999999764 Q ss_pred cc---ccceEEEeeccceEEEEeecceEEEEeeccc---ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 227 LT---QRDRAIIGDFANGFKWGYAKEVPLEVIQYGD---PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 227 ~~---~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .. .+..+++|||+++ .++.++++++++++|.+ .+...+++|++|+++||+++|+|+++++|+||++|++++ T Consensus 255 ~~~~~~~~~~~~gD~~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 331 (333) T protein:vir:78 255 GAAVDSKTRIIGGDFSQL-KFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDE 331 (333) T ss_pred cccCCCccEEEEEecccE-EEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccC Confidence 32 3457899999985 58999999999999864 344567899999999999999999999999999999999 No 13 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=5.1e-61 Score=351.11 Aligned_cols=286 Identities=13% Similarity=0.123 Sum_probs=233.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -+.+||+|||+++.++||+.+++.++++++ ++++|+.++++++|+++++++++|++|++.+|+++++|+++++++||++ T Consensus 69 ~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~ 148 (366) T protein:vir:57 69 AAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMI 148 (366) T ss_pred cccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEE Confidence 455778999999999999999999999998 7889998889999999999999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccc---ccchh Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPR---GIADP 156 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 156 (298) +++++|+|||+ ++.++++++|+++|++++++++|.++|+|+| +...+.|+.+............. ..... T Consensus 149 ~~~~iS~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G----~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~ 221 (366) T protein:vir:57 149 ALVPVSNQLIG---RAGFNVEQLLLGDILSAIATREDKAFLRDDG----TGDTPKGMKAVATAANRLVAWTGTAINLTTI 221 (366) T ss_pred EeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHhhccCC----CCccccceeeccccccceeeccccccchhhH Confidence 99999999996 4447899999999999999999999999954 33345555443332222221111 11111 Q ss_pred --HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc--ccce Q lcl|Aclame:pro 157 --NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT--QRDR 232 (298) Q Consensus 157 --~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~--~~~~ 232 (298) +.++..+.......+...+.|+|||.++..|+++||++|+|+|++. ..++|+|+||+++++||.+.+. +... T Consensus 222 ~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~----~~g~l~G~Pvv~s~~ip~~~~~~~~~~~ 297 (366) T protein:vir:57 222 DEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM----SQGILKGYPIQRTSAIPANLGDDGNESE 297 (366) T ss_pred HHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCC----CCCeecceeeEEccccccccccCCCccE Confidence 1122233333344556678899999999999999999999999653 3468999999999999986543 3456 Q ss_pred EEEeeccceEEEEeecceEEEEeeccc---ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGD---PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++||||+.++ ++.+++++++++++.. .++..+++|++|++++|+++|+||+++||+||++|++++ T Consensus 298 i~~gdfs~~~-i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~ 365 (366) T protein:vir:57 298 IYFCDFNDVV-IGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVI 365 (366) T ss_pred EEEEecceEE-EEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEeccc Confidence 8999999865 8999999999998753 456678899999999999999999999999999999999 No 14 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=3.4e-60 Score=346.59 Aligned_cols=288 Identities=19% Similarity=0.220 Sum_probs=240.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +.+++|.+||++++++||+.+++.++++++++++|++++++++|+.++.+++.|++|++++|+++++|+++++++||+++ T Consensus 18 ~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~ 97 (320) T protein:vir:10 18 GDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPHKIAT 97 (320) T ss_pred ccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEE Confidence 55566789999999999999999999999999999998899999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|+|+ ++..+++++|.+++++++++++|+++|+|++. +....+.+......................+++ T Consensus 98 ~~~is~ell~---ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (320) T protein:vir:10 98 IFVASAETVR---ANPANYLGTMRTKVATAFAMAFDSAALNGTDS--PFPTYLAQTTKSVSLADPGGATASDLTAYDAVA 172 (320) T ss_pred eehhhHHHHh---cChHHHHHHHHHHHHHHHHHHHHHHhhcccCC--CCCcccccccccccceecccccccccccHHHHH Confidence 9999999997 44488999999999999999999999999753 333333333222222222222222222334567 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccC-----cceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGAT-----PDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) .+++..+...+..+++|+|||+++.+|+++||++|+|+|++....+. .++++|+||++++++|.+ +..+++ T Consensus 173 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~----~~~~~~ 248 (320) T protein:vir:10 173 VNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADG----TTVGYM 248 (320) T ss_pred HHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCC----ceEEEE Confidence 88888888899999999999999999999999999999987655443 357999999999999763 345789 Q ss_pred eeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++++ ++.+++++++++++.. .+...+++|++|+++||+++|+|+++.||+||++|++++ T Consensus 249 gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ 316 (320) T protein:vir:10 249 GDFRNVI-WGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVV 316 (320) T ss_pred eecceEE-EEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 9999875 8999999999988754 334567899999999999999999999999999999999 No 15 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=3.7e-60 Score=346.40 Aligned_cols=282 Identities=17% Similarity=0.204 Sum_probs=242.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) .+.++|.+||++++.+|++.+++.++++++++++|++++..++|+.+ .+.+.|++|++++++++++|+++++.++|+++ T Consensus 10 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~~~ 88 (299) T protein:vir:41 10 MQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFTKAKMRSKKMGV 88 (299) T ss_pred ccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCccccccccceeEEEEeeEEEEE Confidence 34456799999999999999999999999999999999889999876 47899999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|+++ ++..++.++|.+++++++++++|.++|+|++.+ . +.++... ..............+++| T Consensus 89 ~~~is~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~--~---~~gil~~---~~~~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 89 IIPTTKENLN---YSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESP--Y---NWNILKS---ATDASNLVEETANKYDDL 157 (299) T ss_pred eehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCc--c---ccccccc---ccccceeeccccccHHHH Confidence 9999999997 444789999999999999999999999996432 2 2222222 222222333445568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..+++.+++|+|||+++.+|+++||++|+|+|.+....+ .++|+|+||++++.||.+ .+...++||||+. T Consensus 158 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-~~~l~G~PV~~~~~~~~~--~~~~~~~~gdfs~ 234 (299) T protein:vir:41 158 NEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNG-VDDVLGLPIAYTPKYTFG--DKDISELVGDWNQ 234 (299) T ss_pred HHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCC-CceecceeeEEecccCCC--CCceEEEEEeccc Confidence 9999999999999999999999999999999999999998877654 468999999999999854 3556789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) + .++.+++++++++++.+ .+...+++|++|+++||+++|+|+++++|+||++|+.++ T Consensus 235 ~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 235 A-YYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA 297 (299) T ss_pred E-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 6 48999999999988653 456677899999999999999999999999999999999 No 16 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=1.4e-60 Score=348.73 Aligned_cols=279 Identities=16% Similarity=0.156 Sum_probs=240.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccc-cceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGG-VTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~-~~~~~v~l~~~k~~ 79 (298) -.++||+|||++++++|++.+++.++++++|+++|++++..++|+.++.+.+.|++|++.+|+++ ++|+++++.+||++ T Consensus 134 t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~ 213 (425) T protein:vir:10 134 EDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIY 213 (425) T ss_pred cCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccccccccccccceeeeeheeeE Confidence 66678899999999999999999999999999999999999999999999999999999999876 79999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc----------ccccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT----------QKVEA 149 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~----------~~~~~ 149 (298) +++++|+|+++ |+.+++.++|.+++++++++++|.++++|+|. + .+.|+.+.....+ ..... T Consensus 214 ~~i~iS~ell~---ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~--~---~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 214 ANPAATQQILD---DAEIDLESWLATEVQTEFAKQEGKAFLAGDGT--N---KPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eehHhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHhhhhcccCC--C---Ccceeeeccccccccccccccccccccc Confidence 99999999996 45588999999999999999999999999642 2 2333322221111 11222 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) .......+++|.+++..+...+..+++|+|||+++..|+++||++|||+|.+....+.+++|+|+||+++++||... .+ T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~-~~ 364 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVA-AN 364 (425) T ss_pred cccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCcc-CC Confidence 33445568899999999999999999999999999999999999999999988888888999999999999998654 34 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++||||+.+|.+..|.++++..++| |.+|++.||++.|+|+++++|+||++|+.+. T Consensus 365 ~~~i~~Gd~~~~~~i~~~~~~~v~~d~~----------~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 365 STPILFGDFQQTYLIIDRIGVRVLRDPY----------TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAA 423 (425) T ss_pred ccEEEEEehhccEEEEEecceEEEeccc----------ccCCcEEEEEEEEeccEeecccceEEEEeec Confidence 5678999999988788888888765543 6789999999999999999999999999999 No 17 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.8e-60 Score=347.07 Aligned_cols=286 Identities=13% Similarity=0.133 Sum_probs=231.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -+.+||++||+++.++||+.+++.++++++ ++++|+.++.+++|++++++.++|++|++.+|+++++|++|++.++|++ T Consensus 130 ~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~ 209 (428) T protein:vir:10 130 AAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMI 209 (428) T ss_pred cccCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEE Confidence 334678999999999999999999999998 6788998889999999999999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|||++ +.+++.++|.++|++++++++|.++|+|+| +...+.|+.+...................+. T Consensus 210 ~~v~is~ell~d---s~~~l~~~i~~~l~~ai~~~~d~~~l~G~G----~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 282 (428) T protein:vir:10 210 AMVPISNALIGR---AGFNVEQLVLQDILTAISVREDKAFMRDDG----TGDTPIGMKARATQWNRLLPWAADAAVNLDT 282 (428) T ss_pred EeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHhccCC----CCccccccccccccccccccccccccccHHH Confidence 999999999974 447899999999999999999999999954 3334445443322221111111111212222 Q ss_pred H---HH---HhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccc--cccc Q lcl|Aclame:pro 160 I---EN---AVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL--TQRD 231 (298) Q Consensus 160 i---~~---~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~--~~~~ 231 (298) + .+ +.......+...++|+|||.++..|+++||++|+|+|++. ..++|+|+||+++++||.+.+ .+.. T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~----~~g~l~G~pv~~~~~~p~~~~~~~~~~ 358 (428) T protein:vir:10 283 IDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM----AQGMLKGYPIQRTSAIPANLGEGGKES 358 (428) T ss_pred HHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCC----CCCeeeceeeEEeccccccccCCCccc Confidence 2 22 2223344455677899999999999999999999999753 345899999999999987543 3455 Q ss_pred eEEEeeccceEEEEeecceEEEEeeccc---ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 232 RAIIGDFANGFKWGYAKEVPLEVIQYGD---PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 232 ~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++||||+.++ ++.+++++++++++.. .+...+++|++|+++||+++|+||++.+|+||+++++++ T Consensus 359 ~i~~gd~s~~~-i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~ 427 (428) T protein:vir:10 359 EIYFADFNDVV-IGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVL 427 (428) T ss_pred eEEEEecceEE-EEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccC Confidence 78999999754 8899999999998753 345567899999999999999999999999999999999 No 18 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=7.6e-60 Score=344.71 Aligned_cols=278 Identities=19% Similarity=0.193 Sum_probs=239.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...++|.+||+++.++||+.+++.|+++++++++|++++.+++|++++.++++|++|++.+|+++++|+++++++||+++ T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) T protein:vir:78 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEE Confidence 45566799999999999999999999999999999998889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|++++ +..++.++|.+++++++++++|.++|+|++. ...+.++...... ......+...+++| T Consensus 111 ~~~is~ell~d---s~~~l~~~i~~~la~ai~~~~d~a~l~G~g~----~~~~~gi~~~~~~----~~~~~~~~~t~~~i 179 (324) T protein:vir:78 111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCC----CCcCccccccccc----cceeccccccHHHH Confidence 99999999974 4478999999999999999999999999642 2222333222111 12223345568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++..+++|+|||+++..|+++||++|+|++.+ +.+++|+|+||++++.++. ....+++|||++ T Consensus 180 ~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PV~~~~~~~~----~~~~~~~gd~~~ 251 (324) T protein:vir:78 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCcccceeeEeeCCCCC----CcceEEEEecce Confidence 99999999999999999999999999999999999999863 4567899999998876543 345789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++ ++.+++++++++++.. .+++.+++|++|+++||+++|+|+++.+|+||++|++|+ T Consensus 252 ~~-~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:78 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccc Confidence 64 8999999999998753 456678899999999999999999999999999999999 No 19 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=7.6e-60 Score=344.71 Aligned_cols=278 Identities=19% Similarity=0.193 Sum_probs=239.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...++|.+||+++.++||+.+++.|+++++++++|++++.+++|++++.++++|++|++.+|+++++|+++++++||+++ T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) T protein:vir:96 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEE Confidence 45566799999999999999999999999999999998889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|++++ +..++.++|.+++++++++++|.++|+|++. ...+.++...... ......+...+++| T Consensus 111 ~~~is~ell~d---s~~~l~~~i~~~la~ai~~~~d~a~l~G~g~----~~~~~gi~~~~~~----~~~~~~~~~t~~~i 179 (324) T protein:vir:96 111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCC----CCcCccccccccc----cceeccccccHHHH Confidence 99999999974 4478999999999999999999999999642 2222333222111 12223345568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++..+++|+|||+++..|+++||++|+|++.+ +.+++|+|+||++++.++. ....+++|||++ T Consensus 180 ~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PV~~~~~~~~----~~~~~~~gd~~~ 251 (324) T protein:vir:96 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCcccceeeEeeCCCCC----CcceEEEEecce Confidence 99999999999999999999999999999999999999863 4567899999998876543 345789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++ ++.+++++++++++.. .+++.+++|++|+++||+++|+|+++.+|+||++|++|+ T Consensus 252 ~~-~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:96 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccc Confidence 64 8999999999998753 456678899999999999999999999999999999999 No 20 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.1e-59 Score=343.90 Aligned_cols=278 Identities=19% Similarity=0.195 Sum_probs=240.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +++++|++||++++++|++.+++.++++++++++|++++.+++|+.++.+.+.|++|++.+|+++++|+++++++||+++ T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~ 110 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEE Confidence 56678899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|++++ +.+++.++|.+++++++++++|+++|+|++.+ ..+.++...... ......+...+++| T Consensus 111 ~~~is~ell~d---s~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~----~~~~gi~~~~~~----~~~~~~~~~~~~~i 179 (324) T protein:vir:97 111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN----PFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhccCCCC----ccCccccccccc----cceeccccCCHHHH Confidence 99999999974 44789999999999999999999999996422 222332222111 11223344568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..+++.+++|+|||+++..|+++||++|+|+|.+ +.+++|+|+||++++.++. ....+++|||++ T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~----~~~~tl~G~PV~~~~~~~~----~~~~~~~gd~~~ 251 (324) T protein:vir:97 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCccccceeeEeecCCCC----CcceEEEEeccc Confidence 99999999999999999999999999999999999999864 4467899999999876543 345689999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++ ++.+++++++++++.. .+++.+++|++|+++||+++|+|+++.+|+||++|++++ T Consensus 252 ~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) T protein:vir:97 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 65 8899999999998764 345678899999999999999999999999999999999 No 21 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1.2e-59 Score=343.51 Aligned_cols=278 Identities=18% Similarity=0.189 Sum_probs=238.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +..+++++||++++++|++.+++.|+++++++++|++++.++||++++.+.+.|++|++.+|+++++|+++++.++|+++ T Consensus 31 ~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 110 (324) T protein:vir:93 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEE Confidence 34456789999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+||++++ .+++.++|.+++++++++++|.++|+|++. ...+.+....... ......+...+++| T Consensus 111 ~~~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g~----~~~~~~~~~~~~~----~~~~~~~~~~~~~i 179 (324) T protein:vir:93 111 ILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----CCcCccccccccc----cceeccccccHHHH Confidence 999999999744 478999999999999999999999999542 2222222221111 11223345568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..+++.+++|+|||+++..|+++||++|+|+|.+ +.+++|+|+||++++..+. ....+++|||+. T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~----~~~~~l~G~PVv~~~~~~~----~~~~i~~gdfs~ 251 (324) T protein:vir:93 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCcccceeeEeecCCCC----CcceEEEEecce Confidence 99999999999999999999999999999999999999864 4567899999998776543 355789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) + .++.+++++++++++.. .++..+++|++|+++||+++|+|+++.+|+||++|++|+ T Consensus 252 ~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~ 314 (324) T protein:vir:93 252 L-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred E-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccc Confidence 5 58999999999998763 455678899999999999999999999999999999999 No 22 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=7.4e-60 Score=344.77 Aligned_cols=279 Identities=13% Similarity=0.071 Sum_probs=238.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccc-cceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGG-VTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~-~~~~~v~l~~~k~~ 79 (298) -.++||++||++++++|++.+++.++++++++++|++++.+.+|+.++++.+.|++|++.+|+++ ++|+++++.+||++ T Consensus 110 t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~ 189 (407) T protein:vir:48 110 NDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIY 189 (407) T ss_pred cCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeeeeeeE Confidence 44577899999999999999999999999999999999899999999999999999999999864 79999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc----------cccccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV----------TQKVEA 149 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~----------~~~~~~ 149 (298) +++++|+|+|+ |+.+++.++|.++|++++++++|.++++|+|. +. +.|+....... ...... T Consensus 190 ~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~--~~---p~Gil~~~~~~~~~~~~~~~~~~~~~~ 261 (407) T protein:vir:48 190 GNPQATQKMLD---DAFFNVEDWINSELALEFAEQEEIAFTSGDGS--KK---PKGFLAYESTDEDDKTRAFGKLQHIAS 261 (407) T ss_pred eehhhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCC--Cc---cceeeeccccccccccccccccccccc Confidence 99999999996 45589999999999999999999999999643 32 33332211111 011122 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) .......+++|.+++..+...+...++|+||+.++..|+++||++|||||.+....+.+++|+|+||++++.||... .+ T Consensus 262 ~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~-~~ 340 (407) T protein:vir:48 262 GAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIA-AD 340 (407) T ss_pred ccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCcc-CC Confidence 33334458999999999999999999999999999999999999999999988888888999999999999999743 34 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++||||+.+|.+..|.++++..++ +|++|++.||++.|+|+++++|+||++|+.+. T Consensus 341 ~~~i~~Gd~~~~~~i~~~~~~~i~~d~----------~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~a 399 (407) T protein:vir:48 341 AKAIAFGNFKRGYTIVDRIGTRILRDP----------YTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGA 399 (407) T ss_pred ccEEEEEeccccEEEEEeeceEEEeec----------cccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 557888999988878888898887655 36789999999999999999999999999988 No 23 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.7e-59 Score=342.85 Aligned_cols=278 Identities=18% Similarity=0.187 Sum_probs=238.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +..++|.+||++++++|++.++++++++++++++|++++.++||++++.+++.|++|++.+|+++++|+++++.++|+++ T Consensus 31 ~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~ 110 (324) T protein:vir:96 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEE Confidence 34567789999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) +++||+||+++ +..++.++|.+++++++++++|.++|+|++. ...+.+...... ...........+++| T Consensus 111 ~~~is~ell~d---s~~~l~~~i~~~l~~aia~~~d~~~l~G~g~----~~~~~~~~~~~~----~~~~~~~~~~~~~~i 179 (324) T protein:vir:96 111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQSIK----KTNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhhcCCC----CCcCcccccccc----ccceecccccchHHH Confidence 99999999974 4478999999999999999999999999542 222222222111 112223344568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..+++.+++|+|||+++..|+++||++|+|+|.+ +.+++|+|+||++++..+. ....+++|||+. T Consensus 180 ~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~----~~~~~l~G~PV~~~~~~~~----~~~~~~~gd~s~ 251 (324) T protein:vir:96 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCcccceeeEeecCCCC----CcceEEEEecce Confidence 99999999999999999999999999999999999999853 4567899999998776543 345789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) + .++.+++++++++++.. .+...+++|++|+++||+++|+|+++.+|+||++|++|+ T Consensus 252 ~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~ 314 (324) T protein:vir:96 252 L-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred E-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccc Confidence 5 58999999999998754 455678899999999999999999999999999999999 No 24 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=7.7e-60 Score=344.66 Aligned_cols=279 Identities=14% Similarity=0.083 Sum_probs=238.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k~~ 79 (298) ...+||++||+++.++|++.+++.++++++++++|++++.+.+|+..+++.+.|++|++.+|++ .++|++|++.+||++ T Consensus 111 ~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~ 190 (401) T protein:vir:44 111 TDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIY 190 (401) T ss_pred CCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeehhhee Confidence 3346789999999999999999999999999999999988999999999999999999999865 589999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc----------cccccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV----------TQKVEA 149 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~----------~~~~~~ 149 (298) +++++|+|+|+ |+.+++.++|.++|++++++++|.++|+|+|. +. +.|+.+..... ...+.. T Consensus 191 ~~~~iS~ell~---ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~--~~---p~Gil~~~~~~~~~~~~~~~~~~~~~t 262 (401) T protein:vir:44 191 GNPQATQKMLD---DAFFNVEAWINSELATEFAEQEEIAFTTGDGT--KK---PKGFLAYESTEESDKARAFGKLQHIVS 262 (401) T ss_pred eehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCC--Cc---cceeecccccccccccccccccccccc Confidence 99999999996 45589999999999999999999999999643 32 33322211111 111222 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) .......+++|.+++..+...+..+++|+||++++..|+++||++|||||.+..+.+.+++|+|+||++++.||.. +.+ T Consensus 263 ~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~-~~~ 341 (401) T protein:vir:44 263 GEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDI-AAD 341 (401) T ss_pred ccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCc-cCC Confidence 3344456899999999999999999999999999999999999999999998888888899999999999999864 344 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++||||+.+|.+..|.++++..++ +|++|++.||++.|+|+++++|+||++|+.++ T Consensus 342 ~~~i~~Gd~~~~~~i~~~~~~~~~~~~----------~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~a 400 (401) T protein:vir:44 342 AKAIAFGNFKRGYTIVDRIGTRILRDP----------YTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAA 400 (401) T ss_pred ccEEEEeehhccEEEEEecceEEeeec----------cccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 557888999988888889998887654 46789999999999999999999999999999 No 25 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.8e-59 Score=342.64 Aligned_cols=281 Identities=20% Similarity=0.205 Sum_probs=239.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ..+++|.++|++++++||+.+++.++|+++++++|++++.++||+++..+.+.|++|++++++++++|+++++.+||+++ T Consensus 14 ~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~ 93 (397) T protein:vir:23 14 KDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIAT 93 (397) T ss_pred cCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccccccccceeEEEEeeEEEEE Confidence 45556788999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) +++||+||+++ +..+++++|++++++++++++|+++|+|++.+. +..+.... ... .........++++ T Consensus 94 ~v~iS~ell~d---s~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~----~~~~~~~~---~~~--~~~~~~~~~~~~~ 161 (397) T protein:vir:23 94 IFVASAETVRA---NPANYLGTMRTKVATAIAMAFDNAALHGTNAPS----AFQGYLDQ---SNK--TQSISPNAYQGLG 161 (397) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhhcccCCc----cccccccc---ccc--eeeecccchhHHH Confidence 99999999974 448899999999999999999999999965332 22222221 111 1112233456778 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCc-----ceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATP-----DTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~-----~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) .+++.++..++..+++|+|||+++..|+++||++|||+|.+....+.+ ++|+|+||+++++||.+ ...+++ T Consensus 162 ~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g----~~~~~~ 237 (397) T protein:vir:23 162 VSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG----DVVGYA 237 (397) T ss_pred HHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCC----ceEEEE Confidence 888889999999999999999999999999999999999877655433 58999999999999853 456789 Q ss_pred eeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++++ ++.+++++++++++.. .....+++|++|+++||+++|+|+++++|+||++++..+ T Consensus 238 gDfs~~~-i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~ 305 (397) T protein:vir:23 238 GDFSQII-WGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDP 305 (397) T ss_pred eecceEE-EEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecc Confidence 9999865 8999999999988754 344577899999999999999999999999999999987 No 26 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=4.3e-59 Score=340.57 Aligned_cols=278 Identities=19% Similarity=0.193 Sum_probs=238.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +..++|.+||++++++|++.+++.++++++++++|++++.++||+.++.+++.|++|++++|+++++|+++++.+||+++ T Consensus 31 ~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) T protein:vir:10 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEE Confidence 44455789999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|++++ +.+++.++|.+++++++++++|.++|+|++.+ ..+.++...... ......+...+++| T Consensus 111 ~~~iS~ell~d---s~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~----~~~~~i~~~~~~----~~~~~~~~~t~~~i 179 (324) T protein:vir:10 111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN----PFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC----ccCccccccccc----cceeccccCCHHHH Confidence 99999999974 44789999999999999999999999996432 222222221111 11223345568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++..++..+++|+|||+++..|+++||++|+|+|.+ +.+++|+|+||++++.++. .+..+++|||++ T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~PV~~~~~~~~----~~~~~~~gd~~~ 251 (324) T protein:vir:10 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecC----CCCccccceeEEeecCCCC----CcceEEEEeccc Confidence 99999999999999999999999999999999999999864 4567899999999876643 345789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++ ++.+++++++++++.. .++..+++|++|+++||+++|+|+++.+|+||++|++|+ T Consensus 252 ~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:10 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 64 8999999999998754 455667899999999999999999999999999999999 No 27 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=2.8e-59 Score=341.59 Aligned_cols=288 Identities=14% Similarity=0.156 Sum_probs=236.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -..+||++||+++.++||+.+++.++++++ ++++|+.++.+++|+.++.+.+.|++|++.+|+++++|++|++.+||++ T Consensus 136 ~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~ 215 (435) T protein:vir:80 136 SPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMA 215 (435) T ss_pred CCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEE Confidence 334578899999999999999999999998 7789999889999999999999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccc-ccccccccchhHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-KVEAPRGIADPNG 158 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 158 (298) +++++|+|+|+++.. ..++.++|.+++++++++++|.++|+|+| ....+.|+.+....... ...........+. T Consensus 216 ~~~~is~ell~ds~~-~~~l~~~i~~~l~~a~~~~~d~a~l~G~G----~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 290 (435) T protein:vir:80 216 ALVPIANDLIKYAGV-NPNVDQIVVGDLTAAIGAREDKAFIRDDG----TANTPKGLRFWALPGNVITASDGSTLQKIET 290 (435) T ss_pred EeehhhHHHHHhhcc-cHHHHHHHHHHHHHHHHHHHHHHhhccCC----CCCcccceeecccccceeecccccchhhHHH Confidence 999999999975432 24689999999999999999999999953 33344454333222211 1122223334456 Q ss_pred HHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc--ccceEE Q lcl|Aclame:pro 159 AIENAVELLTGV--DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT--QRDRAI 234 (298) Q Consensus 159 ~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~--~~~~~~ 234 (298) ++.+++..+... +..+++|+|||.++..|+++||++|+|+|++. ..++|+|+||++++.||...+. +...++ T Consensus 291 d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~----~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:80 291 DLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL----ANGMLKGYPVGKTTQVPINLGEAGKESEIY 366 (435) T ss_pred HHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCC----CCCeEeeeeeEEeccccccccCCCCcceEE Confidence 777777776654 44577899999999999999999999999643 3458999999999999975433 345789 Q ss_pred EeeccceEEEEeecceEEEEeeccc---ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGD---PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++ ++.+++++++++++.. .+...+++|++|+++||++.|+||++.+|+||++|++++ T Consensus 367 ~gd~s~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:80 367 FTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVA 432 (435) T ss_pred EEEcccEE-EEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccC Confidence 99999865 8899999999998763 345567889999999999999999999999999999999 No 28 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=4.8e-59 Score=340.31 Aligned_cols=278 Identities=19% Similarity=0.199 Sum_probs=239.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) +..++|.+||++++++|++.+++.++++++++++|++++.++||+.++.+++.|++|++.+|+++++|++++++++|+++ T Consensus 31 ~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) T protein:vir:99 31 MHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEE Confidence 45556789999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+||++++ .+++.++|.+++++++++++|.++|+|++.+ ..+.++...... ......+...+++| T Consensus 111 ~~~iS~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~----~~~~~~~~~~~~----~~~~~~~~~~~~~i 179 (324) T protein:vir:99 111 ILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN----PFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) T ss_pred eehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC----ccCccccccccc----cceeccccCCHHHH Confidence 999999999744 4789999999999999999999999986432 222222221111 11223345568999 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~ 240 (298) .+++.++...++.+++|+|||+++..|+++||++|+|+|.+ +.+++|+|+||++++.++. ....+++|||+. T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~PVv~~~~~~~----~~~~~i~gd~~~ 251 (324) T protein:vir:99 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDTLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCccccceeEEeecCCCC----CcceEEEEeccc Confidence 99999999999999999999999999999999999999864 4567899999999887654 345789999998 Q ss_pred eEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 241 GFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 241 ~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++ ++.+++++++++++.. .+...+++|++|+++||+++|+|+++.||+||++|++++ T Consensus 252 ~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~ 314 (324) T protein:vir:99 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 64 8999999999998754 445667899999999999999999999999999999999 No 29 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=3.5e-59 Score=341.04 Aligned_cols=288 Identities=14% Similarity=0.157 Sum_probs=237.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) =..+||++||+++.++||+.+++.++++++ ++.+|+.++.+++|+.++.+++.|++|++.+|+++++|++|++.++|++ T Consensus 136 t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~ 215 (435) T protein:vir:14 136 SPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMA 215 (435) T ss_pred CcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEE Confidence 223467899999999999999999999997 7788998889999999999999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc-cccccccccchhHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT-QKVEAPRGIADPNG 158 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 158 (298) +++++|+|||+++.. ..++.++|.+++++++++++|.++++|+| ....+.|+.+...... .............. T Consensus 216 ~~~~iS~ell~ds~~-~~~l~~~i~~~l~~ai~~~~d~a~l~G~G----~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 290 (435) T protein:vir:14 216 ALVPIANDLIKYAGV-NPNVDQIVVGDLTAAIGAREDKAFIRDDG----TANTPKGLRFWALPSNVITASDASTLQKIET 290 (435) T ss_pred EeehhhHHHHHhhcc-CHHHHHHHHHHHHHHHHHHHHHHhhccCC----CCccccceeecccccceeccccccchhhHHH Confidence 999999999975432 24699999999999999999999999953 3344555543322211 11122233344456 Q ss_pred HHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc--ccceEE Q lcl|Aclame:pro 159 AIENAVELLTGV--DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT--QRDRAI 234 (298) Q Consensus 159 ~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~--~~~~~~ 234 (298) ++.+++..+... ++.+.+|+|||.++..|+++||++|+|+|++. ..++|+|+||++++.||.+.+. ....++ T Consensus 291 ~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~----~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:14 291 DLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL----ANGMLKGYPVGKTTQVPINLGETGKESEIY 366 (435) T ss_pred HHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceeccCC----CCCeeecceeEeeccccccccCCCccceEE Confidence 777887777665 45577899999999999999999999999643 3468999999999999976443 344789 Q ss_pred EeeccceEEEEeecceEEEEeeccc---ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGD---PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++ ++.|+++++++++|.. .+...+++|++|+++||+++|+||++.+|+||++|++++ T Consensus 367 ~gd~s~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:14 367 FTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA 432 (435) T ss_pred EeecccEE-EEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC Confidence 99999865 8999999999999764 334567889999999999999999999999999999999 No 30 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.7e-58 Score=337.34 Aligned_cols=288 Identities=20% Similarity=0.195 Sum_probs=234.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ...++|.++|++++++||+.+++.++++++++++|++++.+++|+.++.+.+.|++|++.+|+++++|+++++.+||+++ T Consensus 23 ~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 102 (326) T protein:vir:42 23 GDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIAT 102 (326) T ss_pred cccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEE Confidence 33345778999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccccc-chhHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI-ADPNGA 159 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 159 (298) ++++|+|+++++ ..++.++|.+++++++++++|.++|+|++ ++.+.++.................... ...... T Consensus 103 ~v~iS~ell~~s---~~~~~~~i~~~l~~a~~~~~d~a~l~G~g--s~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (326) T protein:vir:42 103 IFVASAETVRAN---PANYLGTMRTKVATAFAMAFDNAAINGTD--SPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAV 177 (326) T ss_pred eehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHhhcccC--CCccccccccccccceeecccccccccchhHHHH Confidence 999999999754 47899999999999999999999999965 333332222111111111111111111 112223 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCc-----ceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATP-----DTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~-----~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) +.++...+...+...++|+|||+++..|++|||++|+|||++..+.+.+ ++++|+||++++++|.+ +..++ T Consensus 178 ~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~----~~~~~ 253 (326) T protein:vir:42 178 AVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASG----TVVGY 253 (326) T ss_pred HHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCC----ceEEE Confidence 5566667777788889999999999999999999999999877655443 47999999999999863 45678 Q ss_pred EeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +|||+.++ ++.+++++++++++.. .+...+++|++|+++||+++|+|+++.||+||++|++++ T Consensus 254 ~Gd~s~~~-~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~ 322 (326) T protein:vir:42 254 QGDFRQLV-WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVD 322 (326) T ss_pred EeecceEE-EEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeecc Confidence 99999875 8899999999988654 234567899999999999999999999999999999999 No 31 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.9e-58 Score=337.06 Aligned_cols=284 Identities=19% Similarity=0.227 Sum_probs=240.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) -.+++|.+||++++++||+.+++.++++++++++|++++.++||++++.+++.|++|++++++++++|+++++++||+++ T Consensus 18 ~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 97 (318) T protein:vir:24 18 GDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPHKIAT 97 (318) T ss_pred cCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEE Confidence 44667899999999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) ++++|+|+|++ +..++.++|.+++++++++++|.++|+|++.+. + .++........ ............+++ T Consensus 98 ~~~iS~e~l~d---s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~--~---~~~~~~~~~~~-~~~~~~~~~~~~~~~ 168 (318) T protein:vir:24 98 IFVASAETVRA---NPANYLGTMRTKVATAFAMAFDGAAMHGTDSPF--P---TYIGQTTKAIS-IADTTGATTVYDQVA 168 (318) T ss_pred eehhhHHHhhc---ChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCC--C---ccccccccccc-ccccccccchHHHHH Confidence 99999999974 447899999999999999999999999975322 1 22222222111 111222333445667 Q ss_pred HHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCc-----ceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATP-----DTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 161 ~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~-----~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) .+++..+...+..+++|+|||+++..|+++||++|+|+|.+...++.+ ++++|+||++++.+|.+ +..+++ T Consensus 169 ~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~----~~~~~~ 244 (318) T protein:vir:24 169 VNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEG----TTVGFM 244 (318) T ss_pred HHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCC----ccEEEE Confidence 888888889999999999999999999999999999999887665544 47899999999998753 456789 Q ss_pred eeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||+.+ .++.+++++++++++.. .++..+++|++|++++|+++|+|+++.+|+||++|++++ T Consensus 245 gdfs~~-~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~ 312 (318) T protein:vir:24 245 GDFSQL-IWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVV 312 (318) T ss_pred eecceE-EEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeec Confidence 999986 48999999999988754 334567899999999999999999999999999999999 No 32 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.9e-58 Score=337.00 Aligned_cols=277 Identities=15% Similarity=0.095 Sum_probs=242.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) ...++|.++|++++.+||+.+++.++|+++++++|++++.+++|+.++ .+.+.|++|++.+|+++++|+++++++||++ T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~ 196 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIA 196 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEE Confidence 667788999999999999999999999999999999988899999876 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|||++ + .++.++|.++|++++++++|.++|+|+| +...+.|+.......+............+++ T Consensus 197 ~~~~is~ell~d---~-~~l~~~v~~~la~a~~~~~d~~~l~G~g----~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~ 268 (395) T protein:vir:43 197 HLFKASRQILDD---A-SALQSYIDARARYGLMLVEECQLLYGNG----TGANLHGIIPQAQAYAPPSGVVVTAEQRIDR 268 (395) T ss_pred EeehhhHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHHHhccC----CCCccccccccccccccccccccccchhHHH Confidence 999999999863 2 3689999999999999999999999953 3445556555544444444445555667899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) +.+++..+...+..+++|+|||.++..|+++||++|+|+|++ ...+.+++|+|+||++++.||.+ .+++|||+ T Consensus 269 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~-~~~~~~~~l~G~pVv~~~~~~~~------~~~~gd~~ 341 (395) T protein:vir:43 269 IRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGS-PQNGTTPTLWRLPVVETQAITQD------EFLTGAFS 341 (395) T ss_pred HHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccc-cccCCCceecceeeEEcCCCCCC------cEEEEecc Confidence 999999999999999999999999999999999999999976 44566789999999999999854 57999999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++.+..+++++++++++.. ++|++|++.||+++|+|+++++|+||++++.++ T Consensus 342 ~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 342 LGAQIFDRMDIEVLVSTEND------KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred ceEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 98888889999999887542 369999999999999999999999999999999 No 33 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.3e-58 Score=337.93 Aligned_cols=285 Identities=14% Similarity=0.172 Sum_probs=230.9 Q ss_pred Ceec----cccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccc-----ccccccceeeE Q lcl|Aclame:pro 1 MVLN----KGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGK-----KTHGGVTLAPQ 71 (298) Q Consensus 1 mat~----gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~-----~~~~~~~~~~v 71 (298) ||.. +|+|||++++++|++.+++.++++++++++|++++.+++|+.+..+.+.|++|++. ++.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 6544 48999999999999999999999999999999988899999999999999999986 55678999999 Q ss_pred EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc-ccccccccccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAV-IGTNHFDSKVTQKVEAP 150 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~-~~~~~~~~~~~~~~~~~ 150 (298) ++++||++++++||+||+++ +..+++++|++++++++++++|.++|+|+|.+.+..... .+............... T Consensus 81 ~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDD---ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) T ss_pred EeeeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccc Confidence 99999999999999999974 447899999999999999999999999976544321111 00110111111111111 Q ss_pred cccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) ....+.++.+..+...+...++.+++|+|||+++..|+++||++|||+|++ ++|+|+||++++.+|... .+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~-------~~l~G~Pv~~~~~~~~~~--~~ 228 (305) T protein:vir:25 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRNGAWDA--DA 228 (305) T ss_pred hhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC-------CcccccceEEcCccCCCC--Cc Confidence 122234455666666666667778889999999999999999999999964 479999999999987543 34 Q ss_pred ceEEEeeccceEEEEeecceEEEEeeccc--ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGD--PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..+++|||+++ .++.+++++++++++.. .+...+++|++|++++|+++|+||.+.||+||++++++. T Consensus 229 ~~~~~gd~s~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~ 297 (305) T protein:vir:25 229 AIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) T ss_pred cEEEEEecceE-EEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccc Confidence 57899999985 58999999999998764 455677899999999999999999999999999999976 No 34 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=3.7e-58 Score=335.45 Aligned_cols=277 Identities=16% Similarity=0.169 Sum_probs=220.4 Q ss_pred Ceec----cccccchhHHHHHHHHHHhhchhhhhcceeecC----CCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLN----KGTLFDPELVTDLISKVAGKSSIARLSAQKPIP----FNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~----gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~----~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) |.++ ||.++|+++..+||+.+++.+++++++.....+ .+++++|++++++.++|++|++.+|+++++|++++ T Consensus 338 ~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~ 417 (645) T protein:vir:93 338 TTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESIT 417 (645) T ss_pred ccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEE Confidence 3333 778899999999999999999999998764332 24689999999999999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +++||+++++++|+|||+++ .+++.++|.+++++++++++|.++|+|++.+. ....+.+..+ .. ..... T Consensus 418 l~~~kla~~~~iS~ell~ds---~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~-~~~~p~gi~~---~~----~~~~~ 486 (645) T protein:vir:93 418 FSHAKVSAIAVLTEELIRFS---SPAADALVRNALAEAVVARLDTDFVDPKKAAV-ADVSPASITH---DV----KGTAS 486 (645) T ss_pred EeeEEEEEeehhHHHHHhhc---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc-CCccccceec---cc----ccccc Confidence 99999999999999999744 47899999999999999999999999864321 1122222211 11 11122 Q ss_pred cchhHHHHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) ....+.++.+++.++..++.. .++|+|||.++..|+++||++|+|+|++.. ...++|+|+||+++++||++ T Consensus 487 ~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~--~~~~tL~G~PV~~s~~vp~~----- 559 (645) T protein:vir:93 487 SGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMT--LLGGSFQGLPVIVSQYVGDQ----- 559 (645) T ss_pred ccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCC--CCCceeeceeeEEeccCCcc----- Confidence 223456788888888777654 457999999999999999999999996542 34469999999999999853 Q ss_pred ceEEEeeccceEEEEeecceEEEEeeccc--------------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEee Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGD--------------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE 296 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~ 296 (298) +++|||+..+ ++.++++.+.++++++ .....+++|++|+++||+++|+||+++||+||++|++ T Consensus 560 --~~~gd~s~~~-ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~ 636 (645) T protein:vir:93 560 --LVLVNAPDIY-LADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITG 636 (645) T ss_pred --eeEeccccEE-EEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEec Confidence 4677887643 6677777666655432 1224578999999999999999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 297 AN 298 (298) Q Consensus 297 a~ 298 (298) ++ T Consensus 637 ~~ 638 (645) T protein:vir:93 637 VN 638 (645) T ss_pred cc Confidence 99 No 35 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.1e-57 Score=332.91 Aligned_cols=276 Identities=17% Similarity=0.195 Sum_probs=234.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) ..+++|.|||++++++|++.+++.++++++++++|++++ ...+|+.++.+.+.|++|++.+++++++|++++++++|++ T Consensus 13 ~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 92 (297) T protein:vir:95 13 VSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVTLKAHKLG 92 (297) T ss_pred ccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccceeEEEEeeEEEE Confidence 455678999999999999999999999999999999765 4688888888999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|++++ +..++.++|.+++++++++++|.++|+|++.+. +.++... .... .........+++ T Consensus 93 ~~~~is~ell~d---s~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~-----~~gi~~~---~~~~-~~~~~~~~t~~~ 160 (297) T protein:vir:95 93 IILVTSREALNY---TWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPF-----ANSVAKA---AKDA-NKVIGGPINYDN 160 (297) T ss_pred EeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc-----ccccccc---cccc-ceecccccCHHH Confidence 999999999974 447899999999999999999999999965322 1222211 1111 122233445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++.++..++..+++|+|||+++..|++|||++|+|+|.+ .+++|+|+||++++..+. ....+++|||+ T Consensus 161 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~-----~~~~l~G~Pv~~~~~~~~----~~~~~~~gd~s 231 (297) T protein:vir:95 161 ILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDK-----AANTIDGITTVDLKSARF----EKGDLLAGDFD 231 (297) T ss_pred HHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecC-----CCCcccceeeEeecCCCC----CCceEEEEecc Confidence 999999999999999999999999999999999999999964 356899999998765543 34578999999 Q ss_pred ceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++ ++.+++++++++++.. .++..+++|++|++++|+++|+|+++.+|+||++||.|| T Consensus 232 ~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at 295 (297) T protein:vir:95 232 NLI-YGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAE 295 (297) T ss_pred cEE-EEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecC Confidence 865 8999999999998764 345667899999999999999999999999999999999 No 36 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=7.5e-58 Score=333.76 Aligned_cols=283 Identities=13% Similarity=0.052 Sum_probs=240.7 Q ss_pred CeeccccccchhHHHHHH-HHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLI-SKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii-~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||+|||+++..++| +.++..+++++++++.++ ++.+.+|+.++.+.+.|++|++.+++++++|+++++.++|++ T Consensus 254 t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 332 (543) T protein:vir:81 254 TKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQ 332 (543) T ss_pred ccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEEEEecCCcceeecccCccccccccccceeeeeeeeeE Confidence 356789999999999877 556788999999998766 467899999999999999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|+++ |+ +++.++|.++|++++++++|.++|+|+| +...+.|+....................+++ T Consensus 333 ~~~~is~ell~---d~-~~~~~~i~~~l~~~~~~~~d~ail~G~G----t~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 404 (543) T protein:vir:81 333 GFVPISIEALQ---DE-ANVTETVALLFAEGKDELEAVTLTTGTG----QGNQPTGIVTALAGTAAEIAPVTAETFALAD 404 (543) T ss_pred eeehhhHHHHh---cc-HHHHHHHHHHHHHHHHHHHHHHHhccCC----CCcccccchhhcccccccccccccccccHHH Confidence 99999999996 33 5899999999999999999999999954 3345666554444444444455556667899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccc----cccceEEE Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL----TQRDRAII 235 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~----~~~~~~~~ 235 (298) +.+++..+...+...++|+|||.++..|+++||++|+|+|.+.. .+.+++|+|+||+++++||.+.. .+...++| T Consensus 405 ~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~-~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~ 483 (543) T protein:vir:81 405 VYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIG-NGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLY 483 (543) T ss_pred HHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcC-CCCCccccceeeEEeccccccccccccCCcceEEE Confidence 99999999999988889999999999999999999999998754 45678999999999999987542 34456899 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++ +.++.+++++++++++...+ +.|.+|++.||++.|+|+++.+|+||++|+.++ T Consensus 484 gd~~~-~~i~~~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~ 541 (543) T protein:vir:81 484 GNFQN-YVIADRIGMTVEFIPHLFGT----NRRPNGSRGWFAYYRMGADVVNPNAFRLLNVET 541 (543) T ss_pred eeccc-eeEEeecccEEEEecccccc----chhhcCceEEEEEEeeccEeecccceEEEEecc Confidence 99986 45889999999998876433 247899999999999999999999999999999 No 37 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.2e-57 Score=332.71 Aligned_cols=273 Identities=16% Similarity=0.128 Sum_probs=237.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -+.++|+++|+++..+||+.+++.++|+++++++|++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.+||++ T Consensus 117 ~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 196 (390) T protein:vir:97 117 AAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIA 196 (390) T ss_pred cccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEE Confidence 566778999999999999999999999999999999998899999876 4789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|+++++ .++.++|.+++++++++++|.++|+|++ +...+.|+.+..+. ............+++ T Consensus 197 ~~~~is~ell~ds----~~l~~~i~~~la~a~~~~~d~a~l~G~g----~~~~p~Gi~~~~~~--~~~~~~~~~~~~~d~ 266 (390) T protein:vir:97 197 HTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTG----ANDGLLGLIPQATT--YAAPTTIAGATRVDQ 266 (390) T ss_pred EeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCccccceeecccc--ccccccccccchHHH Confidence 9999999999643 3699999999999999999999999854 33345555433222 222233345566889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) +.+++..+...+..+++|+|||++|..|+++||++|+|||++.. .+.+++|+|+||++++.||.+ ++++|||+ T Consensus 267 ~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~-~~~~~~l~G~pV~~~~~~~~~------~~~~gd~~ 339 (390) T protein:vir:97 267 LRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNAR-GTLTPTLWGLPVVATQAMAPG------EFLVGAFD 339 (390) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-CCCCceecceeeEEcCCCCCC------cEEEEecc Confidence 99999999999999999999999999999999999999998754 456779999999999999853 58999999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.+..+++++++++++. ..|++|+++||+++|+|+++++|+||++++-| T Consensus 340 ~~~~~~~~~~~~i~~~~~~-------~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 340 LAAQIFDQWDARVEIGYVN-------DDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred ceEEEEEecceEEEEeecc-------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 8888899999999887543 25899999999999999999999999999999 No 38 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=2.3e-57 Score=331.11 Aligned_cols=275 Identities=20% Similarity=0.152 Sum_probs=238.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -++++|.++|++++.+||+.+++.++|+++++++|++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.+||++ T Consensus 108 ~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~ 187 (385) T protein:vir:18 108 DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIA 187 (385) T ss_pred ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEE Confidence 455568899999999999999999999999999999988899999876 6789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|++++ + .++.++|.+++++++++++|.++|+|++ ....+.|+....... ...........+++ T Consensus 188 ~~~~is~ell~d---~-~~l~~~i~~~la~a~~~~~d~~~l~G~g----~~~~~~Gi~~~~~~~--~~~~~~~~~~~~d~ 257 (385) T protein:vir:18 188 HWVQASRQVMDD---A-PMLQSYINNRLMYGLALKEEGQLLNGDG----TGDNLEGLNKVATAY--DTSLNATGDTRADI 257 (385) T ss_pred EeehhhHHHHhh---H-HHHHHHHHHHHHHHHHHHHHHHHHhccC----CCCcccccccccccc--cccccccccchHHH Confidence 999999999863 3 4699999999999999999999999954 334455554433322 22233345567899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+...+..+++|+|||+++..|+++||++|+|+|++. ..+.+++|+|+||++++.||.+ .++||||+ T Consensus 258 i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~l~G~pV~~~~~~p~~------~~~~gd~~ 330 (385) T protein:vir:18 258 IAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-QAFTSNIMWGLPVVPTKAQAAG------TFTVGGFD 330 (385) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-ccCCCceecceeeEEcCcCCCC------cEEEeecc Confidence 9999999999999999999999999999999999999999764 4667889999999999999854 58999999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++.+..+++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.++ T Consensus 331 ~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 331 MASQVWDRMDATVEVSREDR------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred cEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 98888999999998876532 369999999999999999999999999999999 No 39 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=2.3e-57 Score=331.11 Aligned_cols=275 Identities=20% Similarity=0.152 Sum_probs=238.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -++++|.++|++++.+||+.+++.++|+++++++|++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.+||++ T Consensus 108 ~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~ 187 (385) T protein:vir:19 108 DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIA 187 (385) T ss_pred ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEE Confidence 455568899999999999999999999999999999988899999876 6789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|++++ + .++.++|.+++++++++++|.++|+|++ ....+.|+....... ...........+++ T Consensus 188 ~~~~is~ell~d---~-~~l~~~i~~~la~a~~~~~d~~~l~G~g----~~~~~~Gi~~~~~~~--~~~~~~~~~~~~d~ 257 (385) T protein:vir:19 188 HWVQASRQVMDD---A-PMLQSYINNRLMYGLALKEEGQLLNGDG----TGDNLEGLNKVATAY--DTSLNATGDTRADI 257 (385) T ss_pred EeehhhHHHHhh---H-HHHHHHHHHHHHHHHHHHHHHHHHhccC----CCCcccccccccccc--cccccccccchHHH Confidence 999999999863 3 4699999999999999999999999954 334455554433322 22233345567899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+...+..+++|+|||+++..|+++||++|+|+|++. ..+.+++|+|+||++++.||.+ .++||||+ T Consensus 258 i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~l~G~pV~~~~~~p~~------~~~~gd~~ 330 (385) T protein:vir:19 258 IAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGP-QAFTSNIMWGLPVVPTKAQAAG------TFTVGGFD 330 (385) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCc-ccCCCceecceeeEEcCcCCCC------cEEEeecc Confidence 9999999999999999999999999999999999999999764 4667889999999999999854 58999999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++.+..+++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.++ T Consensus 331 ~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 331 MASQVWDRMDATVEVSREDR------DNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred cEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 98888999999998876532 369999999999999999999999999999999 No 40 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=2.6e-57 Score=330.83 Aligned_cols=275 Identities=16% Similarity=0.105 Sum_probs=237.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) =..++|++||++++.+|++.+++.++|+++++++|++++.+.+|+.+. ++.+.|++|++++++++++|++|++.+||++ T Consensus 139 ~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 218 (418) T protein:vir:10 139 GVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIA 218 (418) T ss_pred CCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEE Confidence 134567899999999999999999999999999999988899999876 6889999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+||++++ .++.++|.+++++++++++|.++|+|+| +...+.|+.+....... .........+++ T Consensus 219 ~~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g----~~~~p~Gi~~~~~~~~~--~~~~~~~~~~~~ 288 (418) T protein:vir:10 219 HLFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDG----TGANILGILPQASAFMP--SITLANATPIDK 288 (418) T ss_pred EeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCC----CCccccccccccccccc--cccccccccHHH Confidence 9999999999633 3699999999999999999999999954 33345555544333222 233334456789 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+...+..+++|+|||.++..|+++||++|+|||++ ...+.+++|+|+||++++.||.+ .+++|||+ T Consensus 289 i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~-~~~~~~~~l~G~pV~~~~~~p~~------~~~~gd~s 361 (418) T protein:vir:10 289 IRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGN-PVNGTTPRLWNLPVVETQAMTAN------EFLVGAFS 361 (418) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccc-cccCCCceecceeeEEcCCCCCC------cEEEeecc Confidence 999999999999999999999999999999999999999965 45567889999999999999854 57899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++.++.+++++++++++.. .+|++|++.||+++|+|+++++|+||++++.++ T Consensus 362 ~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~ 414 (418) T protein:vir:10 362 MAAQIFDRMEIEVLLSTENV------DDFEKNMVSIRAEERLALAVYRPESFVTGALVE 414 (418) T ss_pred ceEEEEEecceEEEEecccc------hhhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 88778889999999887643 369999999999999999999999999999998 No 41 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=4.2e-57 Score=329.68 Aligned_cols=273 Identities=15% Similarity=0.132 Sum_probs=235.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -..++|+++|+++..+||+.+++.++|+++++++|++++.+.+|+.++ .+.+.|++|++.+|+++++|+++++.+||++ T Consensus 117 ~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 196 (390) T protein:vir:81 117 AAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIA 196 (390) T ss_pred cccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEE Confidence 345678999999999999999999999999999999999999999876 4689999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|++++ + .++.++|.++|++++++++|.++++|++ +...+.|+...... ............+++ T Consensus 197 ~~~~is~ell~d---~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g----~~~~~~Gi~~~~~~--~~~~~~~~~~~~~~~ 266 (390) T protein:vir:81 197 HTMKATRQILSD---A-PQLASYMNNRLIRGLKVKEDAEILRGTG----ANDGLLGLIPQATT--YAAPTTIAGATRVDQ 266 (390) T ss_pred EeehhhHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHHHhcCC----CCCcccceeecccc--cccccccccchhHHH Confidence 999999999863 3 3699999999999999999999999954 33345555433222 222233344556889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) +.+++..+...++.+++|+|||+++..|+++||++|+|+|.+.. .+.+++|+|+||++++.||.+ ++++|||+ T Consensus 267 ~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~-~~~~~~l~G~pv~~~~~~p~~------~~~~gd~~ 339 (390) T protein:vir:81 267 LRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNAR-GTLTPTLWGLPVVATQAMAPG------EFLVGAFD 339 (390) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcc-cccCceecceeeEEcCCCCCC------cEEEEehh Confidence 99999999999999999999999999999999999999998754 455679999999999999854 58999999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.+..+++++++++++.. +|++|+++||+++|+|+++++|+||++++-| T Consensus 340 ~~~~~~~~~~~~v~~~~~~~-------~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 340 LAAQIFDQWDARVEIGYVGE-------DFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ceEEEEEecceEEEEecccc-------hhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 98878889999998876432 6999999999999999999999999999999 No 42 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.1e-57 Score=331.31 Aligned_cols=273 Identities=14% Similarity=0.090 Sum_probs=229.5 Q ss_pred CeeccccccchhHHHHHH-HHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLI-SKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii-~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) ..+.+|.++|+++.+++| +.++..+++++++++++++++ .+.+|+.++.+.+.|++|++.+|+++++|+++++++||+ T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~ 193 (390) T protein:vir:62 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKY 193 (390) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeE Confidence 444456677777776665 455677778899999998764 589999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) +++++||+|+|+ |+.+++.++|.+++++++++++|.++++|+|.+. |+.+..................++ T Consensus 194 ~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~-------Gi~~~~~~~~~~~~~~~~~~~~~~ 263 (390) T protein:vir:62 194 GFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPR-------GILTDASPATATFLATDTDSKVSD 263 (390) T ss_pred EeehHHHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccc-------cccccccccccceecccccccchH Confidence 999999999996 4558999999999999999999999999975433 332222222222233334455688 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) +|.+++..+...+...++|+||++++..|++|||++|+|||.+....+.+++|+|+||++++++|.+ .++|||| T Consensus 264 ~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~------~i~~gd~ 337 (390) T protein:vir:62 264 ALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPAD------KILFADL 337 (390) T ss_pred HHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCCCCc------cEEEeec Confidence 9999999998888888899999999999999999999999998888888899999999999999853 5889999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.+ .++.+++++++.+.+. +|++|++.||++.|+|+++++|+||++|+.+. T Consensus 338 s~~-~i~~~~~~~v~~~~~~--------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~ 388 (390) T protein:vir:62 338 SKY-RVRFAGSLRVDRSVDA--------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 388 (390) T ss_pred cce-eEEeecceEEEeeccc--------cccCCcEEEEEEEEeCcEeechhheEEEEeec Confidence 975 5789999999877642 68999999999999999999999999999888 No 43 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=7.4e-57 Score=328.31 Aligned_cols=273 Identities=15% Similarity=0.116 Sum_probs=235.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -+..+|.++||++..+||+.+++.++++++++++|++++.+++|+.++ .+.+.|++|++.+|+++++|+++++.++|++ T Consensus 117 ~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~ 196 (390) T protein:vir:10 117 AAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIA 196 (390) T ss_pred cccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEE Confidence 345567899999999999999999999999999999988899999886 4789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) +++++|+|||++ + .++.++|.++|++++++++|.++|+|+| +...+.|+.+.... ............+++ T Consensus 197 ~~~~is~ell~d---~-~~l~~~i~~~l~~~~~~~~~~~il~G~G----~~~~p~Gi~~~~~~--~~~~~~~~~~~~~~~ 266 (390) T protein:vir:10 197 HTMKATRQILSD---A-PQLASYMNNRLIRGLKVKEDAEILRGTG----ANDGLLGLIPQATT--YAAPTTIAGATRVDQ 266 (390) T ss_pred EeehhhHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCcccccccccccc--ccccccccccchHHH Confidence 999999999863 3 3799999999999999999999999954 33445555443322 222233344556889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) +.+++..+...++.+++|+|||+++..|+++||++|+|+|++... +.+++|+|+||++++.||.+ .+++|||+ T Consensus 267 ~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~-~~~~~l~G~pv~~~~~~p~~------~~~~gdf~ 339 (390) T protein:vir:10 267 LRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARG-TLTPTLWGLPVVATQAMAPG------EFLVGAFD 339 (390) T ss_pred HHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcC-cCCceecceeeEEcCCCCCC------cEEEEecc Confidence 999999999999999999999999999999999999999987654 45669999999999999853 58899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.+..+++++++++.+. .+|++|++.||++.|+|+++++|+||++++-| T Consensus 340 ~~~~~~~~~~~~i~~~~~~-------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 340 LAAQIFDQWDARVEIGYVN-------DDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ceEEEEEecceEEEEeecc-------cccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 8888889999999887643 25899999999999999999999999999999 No 44 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=6.9e-57 Score=328.47 Aligned_cols=275 Identities=12% Similarity=0.111 Sum_probs=234.0 Q ss_pred CeeccccccchhHHHHHHHHHHhh-chhhhhcceeecCCC-ceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGK-SSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~-s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) +.+.+|.++|+++..++|..+.+. ++++++++++++.++ .+.+|+.++.+.+.|++|++.+|+++++|+++++.+||+ T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~ 193 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKY 193 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeE Confidence 556667888999888888766554 567888999988654 589999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) +++++||+|+|+ |+.+++.++|.++|++++++++|.++|+|+| ++ .+.|+.......+............++ T Consensus 194 ~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--t~---~p~Gil~~~~~~~~~~~~~~~~~~~~d 265 (392) T protein:vir:13 194 GFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFLTGTG--TG---QPRGILTDATGANAAFGEADADSKVSD 265 (392) T ss_pred EeeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhcccC--Cc---cccccccccccccccccccccccccHH Confidence 999999999997 4457899999999999999999999999954 22 344444333333333334445556688 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) ++.+++..+...+..+++|+|||.++..|+++||++|+|+|.+..+.+.+++|+|+||++++++|.+ +++|||| T Consensus 266 ~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~------~i~~Gdf 339 (392) T protein:vir:13 266 ALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPAD------KVLFADL 339 (392) T ss_pred HHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCC------cEEEeec Confidence 9999999999888888899999999999999999999999998888888999999999999999853 6899999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.+ .++.+++++++.+.+. +|.+|++.||++.|+|+++.||+||++++... T Consensus 340 ~~~-~i~~~~~~~i~~~~~~--------~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~ 390 (392) T protein:vir:13 340 SKY-RVRFAGSLRVDRSVDA--------KFSTDQIVYRFLQRADGLLVDARGAKVLTVTP 390 (392) T ss_pred cce-eEEeecceEEEeeccc--------cccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 974 5889999998876542 68999999999999999999999999999888 No 45 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=8.9e-57 Score=327.87 Aligned_cols=277 Identities=16% Similarity=0.117 Sum_probs=221.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -..++|++||+++..+||+.+++.++|+++++++|++++.++||+.++ .+.+.||+|++.+|+++++|++|++.+||++ T Consensus 155 ~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a 234 (497) T protein:vir:78 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) T ss_pred cCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeE Confidence 345678999999999999999999999999999999999999999876 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc------------- Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK------------- 146 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~------------- 146 (298) ++++||+|||++ + .++.++|.++|+++|++++|.++|+|+|. +. +.|+.......... T Consensus 235 ~~~~iS~ell~d---~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--~~---p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:78 235 NALTITDEGLRD---A-PELFNFVQGRLLEGIQRKEEVQLLAGGGY--PG---VNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred eecHhHHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cc---ccccccccccccccccccchhhhhhhh Confidence 999999999963 3 35999999999999999999999999642 22 33322211110000 Q ss_pred ---------------------------------------cccccccchhHHHHHHHhhhhhhcC-CcccEEEEcHHHHHH Q lcl|Aclame:pro 147 ---------------------------------------VEAPRGIADPNGAIENAVELLTGVD-ADVTGIAINPSFRSA 186 (298) Q Consensus 147 ---------------------------------------~~~~~~~~~~~~~i~~~~~~l~~~~-~~~~~~vm~~~~~~~ 186 (298) ........+..+.+..++..+...+ ..+++|+|||.+|.. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:78 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 0000111223344555555555544 456789999999999 Q ss_pred HHHhhccCCceeeccccc------ccCcceecceeeEecCccccccccccceEEEeeccce-EEEEeecceEEEEeeccc Q lcl|Aclame:pro 187 LAKQKDLQGNALFPELKW------GATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANG-FKWGYAKEVPLEVIQYGD 259 (298) Q Consensus 187 L~~lkd~~G~~l~~~~~~------~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~~ 259 (298) |+++||++|+|+|.+... .....+|||+||++++.||.+ .++||||+.+ +.+..|++++|+++++.. T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~------~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:78 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC------ceEEeecccceEEEEEecccEEEeecccc Confidence 999999999999976432 223458999999999999853 4789999974 556789999999987643 Q ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 260 ~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++|++|+++||++.|+|+.+++|+||++|+..+ T Consensus 460 ------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:78 460 ------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred ------hhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 259999999999999999999999999999988 No 46 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=8.9e-57 Score=327.87 Aligned_cols=277 Identities=16% Similarity=0.117 Sum_probs=221.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -..++|++||+++..+||+.+++.++|+++++++|++++.++||+.++ .+.+.||+|++.+|+++++|++|++.+||++ T Consensus 155 ~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a 234 (497) T protein:vir:10 155 STGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) T ss_pred cCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeE Confidence 345678999999999999999999999999999999999999999876 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc------------- Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK------------- 146 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~------------- 146 (298) ++++||+|||++ + .++.++|.++|+++|++++|.++|+|+|. +. +.|+.......... T Consensus 235 ~~~~iS~ell~d---~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--~~---p~Gil~~~~~~~~~~~~~~~~~~~~~~ 305 (497) T protein:vir:10 235 NALTITDEGLRD---A-PELFNFVQGRLLEGIQRKEEVQLLAGGGY--PG---VNGLLQRSTGFTASSASSLFGATSATV 305 (497) T ss_pred eecHhHHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cc---ccccccccccccccccccchhhhhhhh Confidence 999999999963 3 35999999999999999999999999642 22 33322211110000 Q ss_pred ---------------------------------------cccccccchhHHHHHHHhhhhhhcC-CcccEEEEcHHHHHH Q lcl|Aclame:pro 147 ---------------------------------------VEAPRGIADPNGAIENAVELLTGVD-ADVTGIAINPSFRSA 186 (298) Q Consensus 147 ---------------------------------------~~~~~~~~~~~~~i~~~~~~l~~~~-~~~~~~vm~~~~~~~ 186 (298) ........+..+.+..++..+...+ ..+++|+|||.+|.. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~ 385 (497) T protein:vir:10 306 SNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWEL 385 (497) T ss_pred hhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHH Confidence 0000111223344555555555544 456789999999999 Q ss_pred HHHhhccCCceeeccccc------ccCcceecceeeEecCccccccccccceEEEeeccce-EEEEeecceEEEEeeccc Q lcl|Aclame:pro 187 LAKQKDLQGNALFPELKW------GATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANG-FKWGYAKEVPLEVIQYGD 259 (298) Q Consensus 187 L~~lkd~~G~~l~~~~~~------~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~~ 259 (298) |+++||++|+|+|.+... .....+|||+||++++.||.+ .++||||+.+ +.+..|++++|+++++.. T Consensus 386 l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~------~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:10 386 LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLG------TILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCC------ceEEeecccceEEEEEecccEEEeecccc Confidence 999999999999976432 223458999999999999853 4789999974 556789999999987643 Q ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 260 ~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++|++|+++||++.|+|+.+++|+||++|+..+ T Consensus 460 ------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:10 460 ------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred ------hhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 259999999999999999999999999999988 No 47 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=2.8e-56 Score=325.15 Aligned_cols=280 Identities=11% Similarity=0.049 Sum_probs=233.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCcceEEeecccccccc--ccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPF--NGEKVFTFTMDSEIDVVAESGKKTHG--GVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~--~~~~ip~~~~~~~a~~v~E~~~~~~~--~~~~~~v~l~~~ 76 (298) ..++||++||++++++|++.+++.++|+++++++|++. +.+.+|+.++.+.+.|++|++.++.+ +++|++++++++ T Consensus 114 ~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~ 193 (404) T protein:vir:10 114 IDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLK 193 (404) T ss_pred cCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeEeehe Confidence 45778999999999999999999999999999999874 56788998889999999999999875 589999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+|+|+ |+.+++.++|.+++++++++++|.++|+|+| +...+.|+....+..+ ........ T Consensus 194 k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~G~g----~~~~~~gi~~~~~~~~----~~~~~~~~ 262 (404) T protein:vir:10 194 DLADFMSIPNDLLK---FADKSLEDWIINWFVDKVRITRNAEILYGAG----GDEHATGIMTANKFKK----ITLPKSPA 262 (404) T ss_pred eeEeeehhhHHHHh---hcHHHHHHHHHHHHHHHHHHHHHHHHhhcCC----CCCcccceeeccccce----eecccccc Confidence 99999999999996 4557899999999999999999999999954 3334444443322221 22233345 Q ss_pred HHHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEe-cCccccccccccceEE Q lcl|Aclame:pro 157 NGAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDV-NKTVSDMSLTQRDRAI 234 (298) Q Consensus 157 ~~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~-s~~~~~~~~~~~~~~~ 234 (298) ++++.+++. .+...+..+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++ ++.++.. +.....++ T Consensus 263 ~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~-~~~~~~~~ 341 (404) T protein:vir:10 263 LKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLS-TESAIPVL 341 (404) T ss_pred HHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCC-CCCccEEE Confidence 778888776 56666666778999999999999999999999999888888889999999985 4455543 34456789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++.+..+++++++++++.. ..|++|++.||+++|+|+++.+|+||++++.++ T Consensus 342 ~gd~s~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 399 (404) T protein:vir:10 342 LGDTKEAYKYVSDGAYELATTNIGA------GAFETNTTKARIIMRIDGNVKDSEALLIAEIPV 399 (404) T ss_pred EEeccccEEEEEecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 9999998888999999999877543 358999999999999999999999999999999 No 48 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=2.8e-56 Score=325.10 Aligned_cols=266 Identities=13% Similarity=0.058 Sum_probs=227.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeCCcceEEeeccccccc-cccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEK--VFTFTMDSEIDVVAESGKKTH-GGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~--ip~~~~~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k 77 (298) -..+||++||+++.++|++.+++.++|+++++++|++++... +|+..+.+.+.|++|++.+++ +.++|++++++++| T Consensus 95 t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k 174 (371) T protein:vir:81 95 SNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKK 174 (371) T ss_pred CCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeE Confidence 445578999999999999999999999999999999876544 555666789999999999986 67999999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+++ |+.+++.++|.+++++++++++|.++++|++.+.. . +...+ T Consensus 175 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~-----~------------------~~~~~ 228 (371) T protein:vir:81 175 YAGFFRVTNELLN---DSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAK-----T------------------AIADL 228 (371) T ss_pred EEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----c------------------ccccH Confidence 9999999999996 44578999999999999999999999999643211 0 01124 Q ss_pred HHHHHHh-hhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccc------ccccc Q lcl|Aclame:pro 158 GAIENAV-ELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDM------SLTQR 230 (298) Q Consensus 158 ~~i~~~~-~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~------~~~~~ 230 (298) +++..++ ..+...+...++|+|||++|..|+++||++|+|+|.+....+.+++|+|+||++++++|.+ .+.+. T Consensus 229 ~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~ 308 (371) T protein:vir:81 229 DGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQF 308 (371) T ss_pred HHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCc Confidence 5666655 3566777788899999999999999999999999998888888999999999999999843 22345 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..++||||+.++.+..|++++++++++.. +.|++|++.||++.|+|+++.+|+||++++-++ T Consensus 309 ~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~ 370 (371) T protein:vir:81 309 APIIVGDLKEAVVMFDRQRTEIMSSNVAM------DAFETDATLWRAIERMDVKMRDDEAFVFGEVQL 370 (371) T ss_pred ceEEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEEec Confidence 57899999998888999999999887643 369999999999999999999999999999999 No 49 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=5.9e-56 Score=323.37 Aligned_cols=267 Identities=14% Similarity=0.039 Sum_probs=228.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCC--ceEEEEEe-CCcceEEeeccccccc-cccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFN--GEKVFTFT-MDSEIDVVAESGKKTH-GGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~--~~~ip~~~-~~~~a~~v~E~~~~~~-~~~~~~~v~l~~~ 76 (298) -.++||++||+++.++|++.++++++++++++++|++++ .+.+|+.. ..+.+.|++|++++++ ++++|+++++++| T Consensus 9 t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~i~l~~~ 88 (293) T protein:vir:48 9 SGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSLIKYTIK 88 (293) T ss_pred ccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccceeEEEEeee Confidence 555678999999999999999999999999999998764 45677765 4678999999999997 5799999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |+++++++|+|+++ |+.+++.++|.+++++++++++|.++++|.+.... ...... T Consensus 89 k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~----------------------~~~~~~ 143 (293) T protein:vir:48 89 RYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT----------------------KPTLTK 143 (293) T ss_pred EEEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc----------------------cccccC Confidence 99999999999997 44588999999999999999999999988532110 112234 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccc-ccccccceEEE Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSD-MSLTQRDRAII 235 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~-~~~~~~~~~~~ 235 (298) +++|.+++.++..++...++|+|||+++..|+++||++|||+|.+..+++.+++|+|+||++++..+. ..+.+...++| T Consensus 144 ~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~ 223 (293) T protein:vir:48 144 WDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYF 223 (293) T ss_pred HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEE Confidence 78999999999999999999999999999999999999999999988888899999999987544332 22334557899 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++++.+..+++++++++++.. ++|++|++.||+++|+|+++.+|+||++++.++ T Consensus 224 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 224 GDLKQAVTLFDRQQMSLLSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 280 (293) T ss_pred EeccceEEEEEecceEEEEecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeec Confidence 999998888999999999887532 469999999999999999999999999998777 No 50 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=8.5e-56 Score=322.51 Aligned_cols=266 Identities=13% Similarity=0.031 Sum_probs=228.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeC-CcceEEeeccccccccc-cceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTM-DSEIDVVAESGKKTHGG-VTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~ip~~~~-~~~a~~v~E~~~~~~~~-~~~~~v~l~~~ 76 (298) -.++||++||+++..+|++.+++.++|+++++++|++++. +.+|+... .+.+.|++|++.+|+++ ++|++|++.++ T Consensus 113 t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~ 192 (397) T protein:vir:49 113 SGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIK 192 (397) T ss_pred CCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeee Confidence 4556789999999999999999999999999999998654 45666544 57899999999999875 79999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |+++++++|+|+++ ++..++.++|.+++++++++++|.++++|+|.++. . ..... T Consensus 193 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~----~------------------~~~~~ 247 (397) T protein:vir:49 193 RYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPN----K------------------PTLAK 247 (397) T ss_pred eeEeehhhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----c------------------ccccC Confidence 99999999999996 45588999999999999999999999999643211 0 11123 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC--ccccccccccceEE Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK--TVSDMSLTQRDRAI 234 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~--~~~~~~~~~~~~~~ 234 (298) +++|.+++..+..++..+++|+|||.++..|++|||++|+|+|.+....+.+++|+|+||++++ .+|.. ..+...++ T Consensus 248 ~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~-~~~~~~~~ 326 (397) T protein:vir:49 248 WDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNG-TGGAMPLY 326 (397) T ss_pred HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccc-cCCceeEE Confidence 7889999999999999999999999999999999999999999888888888999999998754 44433 33456789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++.++.+++++++++++.. ++|++|+++||++.|+|+++++|+||++++.+. T Consensus 327 ~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 327 FGDLKQAVTLFDRQHLSLLSTNIGG------GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKA 384 (397) T ss_pred EeeccceEEEEeecccEEEEecccc------chhhcCeeeEEEEEeeccEEecccceEEEEecc Confidence 9999998889999999999887643 369999999999999999999999999998666 No 51 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=8.3e-56 Score=322.55 Aligned_cols=266 Identities=14% Similarity=0.052 Sum_probs=229.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCC--ceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFN--GEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~--~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~ 76 (298) -.++||++||++++++|++.+++.++|+++|+++|+++. .+.+|+... .+.+.|++|++.+++ +.++|+++++++| T Consensus 113 t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~ 192 (397) T protein:vir:49 113 SGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIK 192 (397) T ss_pred ccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeee Confidence 446678999999999999999999999999999998754 456666554 578999999999997 6799999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+||++ |+.+++.++|.+++++++++++|.++++|+|.+. ... .... T Consensus 193 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~----~~~------------------~~~~ 247 (397) T protein:vir:49 193 RYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILEAIAALP----TKP------------------TLTK 247 (397) T ss_pred eEEeeehhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccc------------------cccc Confidence 99999999999996 4458899999999999999999999999964221 110 1123 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCc--cccccccccceEE Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT--VSDMSLTQRDRAI 234 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~--~~~~~~~~~~~~~ 234 (298) +++|.+++..+..++..+++|+|||+++..|++|||++|+|+|.+..+++.+++|+|+||++++. +|.+ ..+...++ T Consensus 248 ~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~~~i~ 326 (397) T protein:vir:49 248 WDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANG-TGGAMPLY 326 (397) T ss_pred HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccc-cCCceeEE Confidence 78899999999999999999999999999999999999999999888888899999999997543 4433 33455789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++.++.+++++++++++.. ++|++|++.||++.|+|+++.+|+||++++.++ T Consensus 327 ~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 327 FGDLKQAVTLFDRQHMSLLSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred EeeccceEEEEeecceEEEEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 9999998888999999999887643 369999999999999999999999999999877 No 52 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=7.5e-56 Score=322.78 Aligned_cols=266 Identities=13% Similarity=0.038 Sum_probs=228.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPF--NGEKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~--~~~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) -.++||.+||+++.++||+.+++.++++++++++|+++ +.+.+|+.++.+.+.|++|++.+|++ .++|++|++.++| T Consensus 127 ~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k 206 (397) T protein:vir:12 127 NDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIID 206 (397) T ss_pred ccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeehee Confidence 34567899999999999999999999999999999875 45677777888899999999999975 6999999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) +++++++|+|+++ |+.+++.++|.+++++++++++|.++++|++. +.+ .+ ...+ T Consensus 207 ~~~~~~is~e~l~---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~--~~~---~g------------------~~~~ 260 (397) T protein:vir:12 207 YGGIMTLSNSMLN---DSDQAIMTYVAKWFAKKSVVTRNNLILAAIAS--LKK---VD------------------IDGL 260 (397) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccc---cc------------------cccH Confidence 9999999999996 44578999999999999999999999999642 111 11 1125 Q ss_pred HHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEe Q lcl|Aclame:pro 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 158 ~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +++.+++. .+...+..+++|+|||.+|.+|+++||++|+|+|.+....+.+++|+|+||++++++..+.+.+...++|| T Consensus 261 ~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~g 340 (397) T protein:vir:12 261 DGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIG 340 (397) T ss_pred HHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEE Confidence 67777664 77778888889999999999999999999999999888888899999999998776544445556678999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||++++.+..+++++++++++.. ..|++|++.||+++|+|+++.+|+||++++-+- T Consensus 341 d~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~ 396 (397) T protein:vir:12 341 NLKEAIVLFDREQQSIASTDTGA------GAFETNSTKVRGIEREDVRKWDEDAVVFGQITV 396 (397) T ss_pred ehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEEee Confidence 99998888999999999876543 369999999999999999999999999999888 No 53 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=1.1e-55 Score=321.97 Aligned_cols=278 Identities=14% Similarity=0.068 Sum_probs=233.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccccc------ccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHG------GVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~------~~~~~~v~l~ 74 (298) ....||.+||++++++|++.+++.++++++++++|++++...+|+.+..+.+.|++|++.++++ +++|+++++. T Consensus 166 ~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~ 245 (458) T protein:vir:10 166 SVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFS 245 (458) T ss_pred cCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEee Confidence 3345788999999999999999999999999999999988999999999999999999988764 5789999999 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc----ccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV----TQKVEAP 150 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~----~~~~~~~ 150 (298) +||++++++||+|+|+ |+.+++.++|.++|++++++++|.++|+|+| ++ .+.|+.+..... ....... T Consensus 246 ~~k~~~~v~is~ell~---ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G--~~---~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 246 TYKLAAKSFITDETEE---DAIFSLLPLLRKRLIEAHAVSIEEAFMTGDG--SG---KPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred eeeEEeeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcCCC--CC---ccceeeecccccccceeeccccc Confidence 9999999999999996 4447899999999999999999999999964 33 333433322211 1122223 Q ss_pred cccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccc----ccccCcceecceeeEecCcccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPEL----KWGATPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~----~~~~~~~~l~G~PV~~s~~~~~~~ 226 (298) ......+++|.+++..+...+..+++|+|||.+|..|+++||++|+|+|.+. ...+.+++|||+||++++.||... T Consensus 318 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~ 397 (458) T protein:vir:10 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) T ss_pred ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc Confidence 3334468999999999999999999999999999999999999999998653 334556799999999999999753 Q ss_pred ccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 227 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +...++||||+++|.++.+.++++.+++| +.+|++.||++.|+|+.+.+|+||++.+-|. T Consensus 398 --~~~~~~~~~f~~~~~~~~~~~~~v~~d~~----------~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa 457 (458) T protein:vir:10 398 --NSAEFAVIVYKDNFVMPRQRAVTVERERQ----------AGKQRDAYYVTQRVNLQRYFANGVVSGTYAA 457 (458) T ss_pred --CCcceEEEEecccEEEEEeeceEEEeecc----------cCCCceEEEEEEEecceEecccceEEEeecc Confidence 33457889998888899999999887664 4689999999999999999999999999888 No 54 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.1e-55 Score=321.97 Aligned_cols=267 Identities=13% Similarity=0.050 Sum_probs=227.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEeC-CcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEK--VFTFTM-DSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~--ip~~~~-~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) =..+||++||+++.++|++.+++.++++++++++|++++... +|+... .+.+.|++|++.++++ .++|++|++++| T Consensus 111 ~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~ 190 (395) T protein:vir:38 111 GTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIH 190 (395) T ss_pred ccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeee Confidence 234678999999999999999999999999999999865544 444443 5678999999999976 599999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |+++++++|+||++ |+.+++.++|.++|++++++++|.++++|++.+.. .. +... T Consensus 191 k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~----~~------------------~~~~ 245 (395) T protein:vir:38 191 RYAGITTVTNTLLK---DTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK----KP------------------TISQ 245 (395) T ss_pred eeEeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cc------------------cccc Confidence 99999999999997 44578999999999999999999999999653221 10 0112 Q ss_pred HHHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 157 NGAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 157 ~~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) +++|.+++. .+...+..+++|+|||.++..|+++||++|+|+|.+...++.+++|+|+||+++++++.....+...++| T Consensus 246 ~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~ 325 (395) T protein:vir:38 246 FDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYF 325 (395) T ss_pred HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEE Confidence 566777665 5667777788999999999999999999999999988888889999999999998877666666778999 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++++.++.+++++++++++.. .+|++|++.||++.|+|+++.+|+||++++.++ T Consensus 326 gd~~~~~~i~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (395) T protein:vir:38 326 GDLKQGITLFDRQQMQIDTTNVGA------GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKT 382 (395) T ss_pred EeccccEEEEEecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 999998888999999999987643 369999999999999999999999999999887 No 55 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.1e-55 Score=321.85 Aligned_cols=277 Identities=13% Similarity=0.088 Sum_probs=227.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEe---eccccccccccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVV---AESGKKTHGGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v---~E~~~~~~~~~~~~~v~l~~~k 77 (298) -+.+||+|||+++.++|++.++++++|+++++++++++ ++++|+....+.+.|. +|++.+++++++|++|++.+|| T Consensus 146 ~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k 224 (434) T protein:vir:62 146 VTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTE 224 (434) T ss_pred cccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceecccccccccccccceeeEEeehee Confidence 12356899999999999999999999999999998764 5899999888877775 5678899999999999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) +++++++|+|||+ |+.+++.++|.++|++++++++|.++|+|+|. ...+.+.... ............+ T Consensus 225 ~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~----~~~~~g~~~~-----~~~~~~~~~~~~~ 292 (434) T protein:vir:62 225 FDALATVTKKLLA---RTGLPIEQIVMDELKKAYVRKETQYMVNGDEA----NNINDGALAK-----KAVEFKTDEKNLY 292 (434) T ss_pred eEeehhhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhccCCC----Cccccceeec-----ccccccccccchh Confidence 9999999999997 45588999999999999999999999999642 2222222221 1122233445578 Q ss_pred HHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccc--cccCcceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK--WGATPDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~--~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) ++|.+++..+..++..+++|+|||.++..|++|||++|||||++.. .++.+++|+|+||++++.||...++....++| T Consensus 293 d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~ 372 (434) T protein:vir:62 293 DALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYF 372 (434) T ss_pred hHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEE Confidence 9999999999999998899999999999999999999999997643 45667899999999999999776666666889 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEec-ccceEEE----eecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILD-ATKFARV----TEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~-~~a~~~l----~~a~ 298 (298) |||+.++.+...+.++++.+.+ .+|.+|+|.||++.|+|+++++ |.+++++ |.|+ T Consensus 373 Gdfs~~~i~~~~g~~~i~~~~~--------~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 373 GDFSKFYIQDVIGSLEVQKLVE--------LFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPT 432 (434) T ss_pred eeccceEEEEeeceeEEEeehh--------hhcccCceEEEEEeeecceeecCcccceEEEEEeccCC Confidence 9999876444445677776653 2678999999999999999875 9988887 5566 No 56 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=3e-55 Score=319.49 Aligned_cols=275 Identities=17% Similarity=0.097 Sum_probs=230.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC----cceEEeeccccccccc-cceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD----SEIDVVAESGKKTHGG-VTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~----~~a~~v~E~~~~~~~~-~~~~~v~l~~ 75 (298) ..+++|++||+++.++||+.+++.++|+++++++|++++.+++|+.... ..++|++|++.+|+++ ++|+++++.+ T Consensus 122 ~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~ 201 (413) T protein:vir:81 122 LTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESL 201 (413) T ss_pred cccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeee Confidence 5567899999999999999999999999999999999888999987643 4679999999999987 6899999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) ||++++++||+|||+++ ..+.++|.+++++++++++|+++|+|+| +..++.|+....+..+ ........ T Consensus 202 ~k~~~~~~iS~ell~ds----~~l~~~i~~~la~~~~~~~d~~~l~G~G----~~~~~~Gi~~~~~~~~---~~~~~~~~ 270 (413) T protein:vir:81 202 SKIAGLTKITDEMIEDY----DFLVSYINARLLEELAIEEERQLLLGDG----TGNNLTGLLKRDGIQT---LAVSNKDE 270 (413) T ss_pred eeEEEeehhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhccCC----CCCccccccccccccc---ccccccch Confidence 99999999999999643 2499999999999999999999999953 3344555544433322 22233445 Q ss_pred hHHHHHHHhhhhhhc-CCcccEEEEcHHHHHHHHHhhccCCceeeccccccc-------CcceecceeeEecCccccccc Q lcl|Aclame:pro 156 PNGAIENAVELLTGV-DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA-------TPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 156 ~~~~i~~~~~~l~~~-~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~-------~~~~l~G~PV~~s~~~~~~~~ 227 (298) .++++.+++..+... ++.+++|+|||+++.+|++|||++|||||.+...+. ..++|||+||++++.||.+ T Consensus 271 ~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~-- 348 (413) T protein:vir:81 271 LADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVG-- 348 (413) T ss_pred hHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcc-- Confidence 677787877766554 456778999999999999999999999997654322 3458999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++||||+.++.+..|++++++++++.. .+|++|++.||+++|+|+.+.+|+||++|+.++ T Consensus 349 ----~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 409 (413) T protein:vir:81 349 ----KPVVGAFRSAASVLRKGGVRIDSTNTNV------DDFENNLITVRAEERVGLMVTFPEAIVQLDVAE 409 (413) T ss_pred ----cEEEEecccEEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEecccceEEEEecC Confidence 6899999998888889999999988653 269999999999999999999999999999998 No 57 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=2e-55 Score=320.46 Aligned_cols=267 Identities=13% Similarity=0.034 Sum_probs=228.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE---eCCcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTF---TMDSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~---~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) -.++||++||++++++||+.+++.++|+++++++|++++...+|+. +..+.+.|++|++.++++ .++|++|+++++ T Consensus 113 t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~ 192 (397) T protein:vir:48 113 SGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIK 192 (397) T ss_pred CCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeehe Confidence 3455789999999999999999999999999999998776665543 345679999999999986 589999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+|++++ +..++.++|.+++++++++++|.++++|+|.+. .. ..... T Consensus 193 k~~~~~~iS~ell~d---s~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~----~~------------------~~~~~ 247 (397) T protein:vir:48 193 RYAGISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIATLP----TK------------------PTLTK 247 (397) T ss_pred eeeeehhhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cc------------------ccccc Confidence 999999999999974 457899999999999999999999999964221 10 11124 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccc-cccccccceEEE Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVS-DMSLTQRDRAII 235 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~-~~~~~~~~~~~~ 235 (298) +++|.+++.++...+..+++|+|||.++..|+++||++|+|+|.+....+.+++|+|+||++++..+ ...+.....++| T Consensus 248 ~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~ 327 (397) T protein:vir:48 248 WDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYF 327 (397) T ss_pred HHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEE Confidence 7889999999999999999999999999999999999999999988888889999999998765422 223445667899 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||+.++.++.+++++++++++.+ ++|++|++.||+++|+|+++.+|+||++++.++ T Consensus 328 gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:48 328 GDLKQAVTLFDRQQMSLLSTNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKA 384 (397) T ss_pred EeccceEEEEeecceEEEEeccch------hhhhcCceeEEEEeeeccEEecccceEEEEecc Confidence 999998888999999999887643 368999999999999999999999999998777 No 58 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=4.1e-55 Score=318.76 Aligned_cols=266 Identities=13% Similarity=0.019 Sum_probs=223.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEeC-CcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF--TFTM-DSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip--~~~~-~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) -..+||++||++++++||+.+++.++++++++++|++++...+| +..+ .+.+.|++|++.+|++ .++|++|++.+| T Consensus 120 t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~ 199 (408) T protein:vir:10 120 SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIK 199 (408) T ss_pred cccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeee Confidence 33457899999999999999999999999999999987655555 4443 5778999999999975 589999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+||++ |+.+++.++|.++|++++++++|.++++|++.+.. . .+... T Consensus 200 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~----~------------------~~~~~ 254 (408) T protein:vir:10 200 RYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK----K------------------PTIAK 254 (408) T ss_pred eEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----c------------------ccccc Confidence 99999999999997 44588999999999999999999999999643211 0 01123 Q ss_pred HHHHHHHh-hhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCc--cccccccccceE Q lcl|Aclame:pro 157 NGAIENAV-ELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT--VSDMSLTQRDRA 233 (298) Q Consensus 157 ~~~i~~~~-~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~--~~~~~~~~~~~~ 233 (298) ++++.+++ ..+...+..++.|+|||+++..|+++||++|+|+|.+....+.+++|+|+||+++++ +|.. +.+...+ T Consensus 255 ~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~~~i 333 (408) T protein:vir:10 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT-GSTVYPL 333 (408) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCcc-CCCceEE Confidence 66777766 467777777788999999999999999999999998888888889999999998653 4443 3345578 Q ss_pred EEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 234 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +||||+.++.+..+++++++++++.. ..|++|++.||++.|+|+++.+|+||++++.++ T Consensus 334 ~~gd~~~~~~~~~~~~~~v~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~ 392 (408) T protein:vir:10 334 YYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) T ss_pred EEEehhccEEEEEecceEEEEccccc------chhhcCceEEEEEEeeccEEeccccEEEEEeec Confidence 99999998889999999999887643 358999999999999999999999999999777 No 59 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=4.7e-55 Score=318.45 Aligned_cols=266 Identities=12% Similarity=0.042 Sum_probs=223.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) -.++||.+||+++.++|++.+++.++|++++++++++++. +.+|+.++.+.+.|++|++.++++ .++|++|++.+|| T Consensus 110 t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k 189 (392) T protein:vir:10 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) T ss_pred ccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeee Confidence 2345789999999999999999999999999999998654 456777778899999999999976 5899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+|++ +.+++.++|.+++++++++++|.++++|.+.+. . .+...+ T Consensus 190 ~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~---------------------~~~~~~ 243 (392) T protein:vir:10 190 RAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--K---------------------QAIKSL 243 (392) T ss_pred EEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c---------------------cCccCH Confidence 99999999999964 457899999999999999999999999854211 0 011235 Q ss_pred HHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEe-cCc-cccc--cccccce Q lcl|Aclame:pro 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDV-NKT-VSDM--SLTQRDR 232 (298) Q Consensus 158 ~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~-s~~-~~~~--~~~~~~~ 232 (298) +++.+++. .+...+..+++|+|||+++..|+++||++|||+|.+....+.+++|+|+|+++ ++. ++.. ...+... T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 77887764 67777778889999999999999999999999998888888899999986655 332 2322 2334557 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++||||+.++.++.|++++++++++.+ ++|++|++.||+++|+|+++++|+||++++..+ T Consensus 324 ~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 899999998889999999999987643 369999999999999999999999999999888 No 60 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=4.7e-55 Score=318.45 Aligned_cols=266 Identities=12% Similarity=0.042 Sum_probs=223.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) -.++||.+||+++.++|++.+++.++|++++++++++++. +.+|+.++.+.+.|++|++.++++ .++|++|++.+|| T Consensus 110 t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k 189 (392) T protein:vir:10 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) T ss_pred ccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeee Confidence 2345789999999999999999999999999999998654 456777778899999999999976 5899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+|++ +.+++.++|.+++++++++++|.++++|.+.+. . .+...+ T Consensus 190 ~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~---------------------~~~~~~ 243 (392) T protein:vir:10 190 RAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--K---------------------QAIKSL 243 (392) T ss_pred EEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c---------------------cCccCH Confidence 99999999999964 457899999999999999999999999854211 0 011235 Q ss_pred HHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEe-cCc-cccc--cccccce Q lcl|Aclame:pro 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDV-NKT-VSDM--SLTQRDR 232 (298) Q Consensus 158 ~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~-s~~-~~~~--~~~~~~~ 232 (298) +++.+++. .+...+..+++|+|||+++..|+++||++|||+|.+....+.+++|+|+|+++ ++. ++.. ...+... T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 77887764 67777778889999999999999999999999998888888899999986655 332 2322 2334557 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++||||+.++.++.|++++++++++.+ ++|++|++.||+++|+|+++++|+||++++..+ T Consensus 324 ~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 899999998889999999999987643 369999999999999999999999999999888 No 61 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=4.7e-55 Score=318.45 Aligned_cols=266 Identities=12% Similarity=0.042 Sum_probs=223.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) -.++||.+||+++.++|++.+++.++|++++++++++++. +.+|+.++.+.+.|++|++.++++ .++|++|++.+|| T Consensus 110 t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k 189 (392) T protein:vir:10 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) T ss_pred ccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeee Confidence 2345789999999999999999999999999999998654 456777778899999999999976 5899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+|++ +.+++.++|.+++++++++++|.++++|.+.+. . .+...+ T Consensus 190 ~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~---------------------~~~~~~ 243 (392) T protein:vir:10 190 RAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--K---------------------QAIKSL 243 (392) T ss_pred EEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c---------------------cCccCH Confidence 99999999999964 457899999999999999999999999854211 0 011235 Q ss_pred HHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEe-cCc-cccc--cccccce Q lcl|Aclame:pro 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDV-NKT-VSDM--SLTQRDR 232 (298) Q Consensus 158 ~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~-s~~-~~~~--~~~~~~~ 232 (298) +++.+++. .+...+..+++|+|||+++..|+++||++|||+|.+....+.+++|+|+|+++ ++. ++.. ...+... T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 77887764 67777778889999999999999999999999998888888899999986655 332 2322 2334557 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++||||+.++.++.|++++++++++.+ ++|++|++.||+++|+|+++++|+||++++..+ T Consensus 324 ~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 899999998889999999999987643 369999999999999999999999999999888 No 62 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=4.7e-55 Score=318.45 Aligned_cols=266 Identities=12% Similarity=0.042 Sum_probs=223.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~ip~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) -.++||.+||+++.++|++.+++.++|++++++++++++. +.+|+.++.+.+.|++|++.++++ .++|++|++.+|| T Consensus 110 t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k 189 (392) T protein:vir:10 110 TGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKD 189 (392) T ss_pred ccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeee Confidence 2345789999999999999999999999999999998654 456777778899999999999976 5899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+|++ +.+++.++|.+++++++++++|.++++|.+.+. . .+...+ T Consensus 190 ~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--~---------------------~~~~~~ 243 (392) T protein:vir:10 190 RAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--K---------------------QAIKSL 243 (392) T ss_pred EEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c---------------------cCccCH Confidence 99999999999964 457899999999999999999999999854211 0 011235 Q ss_pred HHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEe-cCc-cccc--cccccce Q lcl|Aclame:pro 158 GAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDV-NKT-VSDM--SLTQRDR 232 (298) Q Consensus 158 ~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~-s~~-~~~~--~~~~~~~ 232 (298) +++.+++. .+...+..+++|+|||+++..|+++||++|||+|.+....+.+++|+|+|+++ ++. ++.. ...+... T Consensus 244 d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~ 323 (392) T protein:vir:10 244 DDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAP 323 (392) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceE Confidence 77887764 67777778889999999999999999999999998888888899999986655 332 2322 2334557 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++||||+.++.++.|++++++++++.+ ++|++|++.||+++|+|+++++|+||++++..+ T Consensus 324 ~~~gdfs~~~~i~~~~~~~~~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 324 LIIGDLKEAIVLFKREDMELASTDVGG------KAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEehhceEEEEeecceEEEEecccc------chhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 899999998889999999999987643 369999999999999999999999999999888 No 63 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=7.8e-55 Score=317.21 Aligned_cols=276 Identities=13% Similarity=0.042 Sum_probs=231.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--eCCcceEEeeccccccc-cccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTF--TMDSEIDVVAESGKKTH-GGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~--~~~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k 77 (298) =...||.+||+++.++|++.+++.++|+++++++|++++..++|+. +..+.+.|++|++.+|+ +.++|++|++.+|+ T Consensus 125 ~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:47 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeee Confidence 1234668999999999999999999999999999999887777765 55678899999999997 56899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+++ |+.+++.++|.+++++++++++|.++++|++.+... ..... ...........+...+ T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~--~~~~~-----~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:47 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSG-----FEKEGKKLEVKKAKSL 274 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc--ccccc-----cccccceeccccccch Confidence 9999999999996 444789999999999999999999999996533221 11111 1111222334445568 Q ss_pred HHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEee Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd 237 (298) ++|.+++..+...++.+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++++++|.... +...++||| T Consensus 275 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~gd 353 (415) T protein:vir:47 275 DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIGN 353 (415) T ss_pred HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCC-CccEEEEEe Confidence 9999999999999999999999999999999999999999998888888899999999999999986443 344689999 Q ss_pred ccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 238 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |+.++.+..++++++++++ |.++...+|+++|+|+++.+|+||++++..+ T Consensus 354 ~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:47 354 LKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred hhccEEEEeecceEEEeec-----------cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9998888889999998765 3466778999999999999999999999888 No 64 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=7.8e-55 Score=317.21 Aligned_cols=276 Identities=13% Similarity=0.042 Sum_probs=231.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEE--eCCcceEEeeccccccc-cccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTF--TMDSEIDVVAESGKKTH-GGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~--~~~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k 77 (298) =...||.+||+++.++|++.+++.++|+++++++|++++..++|+. +..+.+.|++|++.+|+ +.++|++|++.+|+ T Consensus 125 ~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:46 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeee Confidence 1234668999999999999999999999999999999887777765 55678899999999997 56899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+++ |+.+++.++|.+++++++++++|.++++|++.+... ..... ...........+...+ T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~--~~~~~-----~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:46 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG--STSSG-----FEKEGKKLEVKKAKSL 274 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc--ccccc-----cccccceeccccccch Confidence 9999999999996 444789999999999999999999999996533221 11111 1111222334445568 Q ss_pred HHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEee Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd 237 (298) ++|.+++..+...++.+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++++++|.... +...++||| T Consensus 275 ~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~gd 353 (415) T protein:vir:46 275 DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIGN 353 (415) T ss_pred HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCC-CccEEEEEe Confidence 9999999999999999999999999999999999999999998888888899999999999999986443 344689999 Q ss_pred ccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 238 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |+.++.+..++++++++++ |.++...+|+++|+|+++.+|+||++++..+ T Consensus 354 ~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:46 354 LKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred hhccEEEEeecceEEEeec-----------cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9998888889999998765 3466778999999999999999999999888 No 65 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=6.7e-55 Score=317.57 Aligned_cols=272 Identities=14% Similarity=0.124 Sum_probs=225.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccc-cceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGG-VTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~-~~~~~v~l~~~k~~ 79 (298) .++.||++||+++.++|++.+++.++++++++++|++ ++.++|+..+.+.+.|++|++++|+++ ++|++|++++|+++ T Consensus 142 ~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~ 220 (425) T protein:vir:95 142 AVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVG 220 (425) T ss_pred ccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ceeEEEEecCCccccccccccccccccccccceeeeeheeee Confidence 5556789999999999999999999999999999986 468999999999999999999999877 68999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|+|+ |+.+++.++|.+++++++++++|.++|+|+|.+++.+ .|+....... ...........+++ T Consensus 221 ~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p---~Gil~~~~~~--~~~~~~~~~~~~~~ 292 (425) T protein:vir:95 221 KVTFVDNYLLQ---DSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQP---LGIIPSLPPE--NQVTVEADNNLLKN 292 (425) T ss_pred eeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc---ceeecccccc--cccccccccchHHH Confidence 99999999996 4447899999999999999999999999976544332 2322211111 11223344556888 Q ss_pred HHHHhhhhhhcCC--cccEEEEcHHHH----HHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceE Q lcl|Aclame:pro 160 IENAVELLTGVDA--DVTGIAINPSFR----SALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRA 233 (298) Q Consensus 160 i~~~~~~l~~~~~--~~~~~vm~~~~~----~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~ 233 (298) +.+++..+...+. ...+|+||+.++ ..|+++||++|||+|... .+..++|+|+||+++++||.+ .+ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~pvv~~~~~~~~------~i 364 (425) T protein:vir:95 293 LVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLRVVFNNFLDDD------TV 364 (425) T ss_pred HHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccC--CCCCccccceeeEEcCcCCCc------cE Confidence 9999888777654 456799999884 357888999999999743 344678999999999999854 58 Q ss_pred EEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 234 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +||||+. +.++.+++++++++++. .|.+|+++||++.|+|+++.+|+||++++-.| T Consensus 365 ~~Gd~~~-~~~~~~~~~~i~~~~~~--------~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~ 420 (425) T protein:vir:95 365 LFGEFEQ-YTLVERENITIDSSTHV--------KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD 420 (425) T ss_pred EEEeccc-EEEEeecceEEEeeccc--------ccccCceEEEEEEeeCcEeecccceEEEEecC Confidence 8999998 46889999999987653 58999999999999999999999999999999 No 66 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.7e-54 Score=315.36 Aligned_cols=276 Identities=14% Similarity=0.054 Sum_probs=230.6 Q ss_pred Cee-ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEeCCcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF--TFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat-~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip--~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) ..+ .||.+||+++.++|++.+++.++|+++++++||+++..++| +.++...+.|++|+++++++ .++|+++++.+| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~ 203 (415) T protein:vir:98 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN 203 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeee Confidence 222 35789999999999999999999999999999987655555 55667789999999999975 689999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) +++++++||+||++ |+.+++.++|.++|++++++++|.++++|++.++...... . ...........+... T Consensus 204 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~----~---~~~~~~~~~~~~~~~ 273 (415) T protein:vir:98 204 THRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS----G---FEKEGKKLEVKKAKS 273 (415) T ss_pred eeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccc----c---ccccccccccccccc Confidence 99999999999996 4457899999999999999999999999975433221111 1 111222233444566 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEe Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +++|.+++.++...++.+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++++++|.+.. +...++|| T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~G 352 (415) T protein:vir:98 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIG 352 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCC-CccEEEEE Confidence 89999999999999999999999999999999999999999999888888889999999999999986543 34568999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||++++.++.+++++++++++ .++...+|+++|+|+++.+|+||++++..+ T Consensus 353 d~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 353 NLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred ehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 999988788899999987653 455678999999999999999999999988 No 67 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.7e-54 Score=315.36 Aligned_cols=276 Identities=14% Similarity=0.054 Sum_probs=230.6 Q ss_pred Cee-ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEeCCcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF--TFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat-~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip--~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) ..+ .||.+||+++.++|++.+++.++|+++++++||+++..++| +.++...+.|++|+++++++ .++|+++++.+| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~ 203 (415) T protein:vir:81 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN 203 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeee Confidence 222 35789999999999999999999999999999987655555 55667789999999999975 689999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) +++++++||+||++ |+.+++.++|.++|++++++++|.++++|++.++...... . ...........+... T Consensus 204 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~----~---~~~~~~~~~~~~~~~ 273 (415) T protein:vir:81 204 THRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS----G---FEKEGKKLEVKKAKS 273 (415) T ss_pred eeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccc----c---ccccccccccccccc Confidence 99999999999996 4457899999999999999999999999975433221111 1 111222233444566 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEe Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +++|.+++.++...++.+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++++++|.+.. +...++|| T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~G 352 (415) T protein:vir:81 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIG 352 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCC-CccEEEEE Confidence 89999999999999999999999999999999999999999999888888889999999999999986543 34568999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||++++.++.+++++++++++ .++...+|+++|+|+++.+|+||++++..+ T Consensus 353 d~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 353 NLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred ehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 999988788899999987653 455678999999999999999999999988 No 68 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.7e-54 Score=315.36 Aligned_cols=276 Identities=14% Similarity=0.054 Sum_probs=230.6 Q ss_pred Cee-ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEeCCcceEEeecccccccc-ccceeeEEEeee Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF--TFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) Q Consensus 1 mat-~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip--~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~ 76 (298) ..+ .||.+||+++.++|++.+++.++|+++++++||+++..++| +.++...+.|++|+++++++ .++|+++++.+| T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~ 203 (415) T protein:vir:79 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN 203 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeee Confidence 222 35789999999999999999999999999999987655555 55667789999999999975 689999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) +++++++||+||++ |+.+++.++|.++|++++++++|.++++|++.++...... . ...........+... T Consensus 204 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~----~---~~~~~~~~~~~~~~~ 273 (415) T protein:vir:79 204 THRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS----G---FEKEGKKLEVKKAKS 273 (415) T ss_pred eeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccc----c---ccccccccccccccc Confidence 99999999999996 4457899999999999999999999999975433221111 1 111222233444566 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEe Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +++|.+++.++...++.+++|+|||++|..|+++||++|+|+|.+...++.+++|+|+||++++++|.+.. +...++|| T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~~G 352 (415) T protein:vir:79 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIG 352 (415) T ss_pred hhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCC-CccEEEEE Confidence 89999999999999999999999999999999999999999999888888889999999999999986543 34568999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||++++.++.+++++++++++ .++...+|+++|+|+++.+|+||++++..+ T Consensus 353 d~~~~~~~~~~~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 353 NLKDAIVLFDRSQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred ehhccEEEEeecceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 999988788899999987653 455678999999999999999999999988 No 69 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=2e-54 Score=314.99 Aligned_cols=276 Identities=14% Similarity=0.053 Sum_probs=231.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEE--EEEeCCcceEEeecccccccc-ccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKV--FTFTMDSEIDVVAESGKKTHG-GVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~i--p~~~~~~~a~~v~E~~~~~~~-~~~~~~v~l~~~k 77 (298) =..+||.+||+++.++|++.+++.++|+++++++||+++..++ |+.++.+.+.|++|++.+|++ .++|++|++.+|| T Consensus 125 ~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k 204 (415) T protein:vir:94 125 KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeehee Confidence 2335779999999999999999999999999999998765555 455677889999999999964 6899999999999 Q ss_pred EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 78 VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 78 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) ++++++||+|+++ |+..++.++|.++|++++++++|.++++|++.++...... .. ..............+ T Consensus 205 ~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~----~~---~~~~~~~~~~~~~~~ 274 (415) T protein:vir:94 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS----GF---EKEGKKLEVKKAKSL 274 (415) T ss_pred eeeechhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccc----cc---cccccccccccccch Confidence 9999999999997 4457899999999999999999999999965433221111 11 111222333344568 Q ss_pred HHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEee Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGD 237 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd 237 (298) ++|.+++..+...++.+++|+|||++|.+|+++||++|+|+|.+...++.+++|+|+||++++.+|.+.. +...++||| T Consensus 275 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~i~~gd 353 (415) T protein:vir:94 275 DDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQK-GNNTLIIGN 353 (415) T ss_pred HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCC-CccEEEEEe Confidence 9999999999999999999999999999999999999999999888888889999999999999986543 345689999 Q ss_pred ccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 238 FANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 238 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |++++.+..++++++++++ |.++.+.+|+++|+|+++.+|+||++++..+ T Consensus 354 ~~~~~~~~~~~~~~v~~~~-----------~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 403 (415) T protein:vir:94 354 LKDAIVLFDRSQYQASWTD-----------YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred hhccEEEEeecceEEEEec-----------cccCceEEEEEEEeccEEeccccEEEEEEec Confidence 9998878889999998765 3567788999999999999999999999888 No 70 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=2.2e-54 Score=314.71 Aligned_cols=267 Identities=13% Similarity=0.010 Sum_probs=223.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE--EEeC-CcceEEeeccccccc-cccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF--TFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip--~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~ 76 (298) -..+||++||+++.++|++.+++.++|+++++++|++++...+| +..+ .+.+.|++|++.+|+ +.++|+++++++| T Consensus 120 t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~ 199 (404) T protein:vir:39 120 SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIK 199 (404) T ss_pred cccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeee Confidence 23556899999999999999999999999999999987655554 4443 578999999999997 5799999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+|++++ +.+++.++|.++|++++++++|.++++|+|.+.. . ..... T Consensus 200 k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~----~------------------~~~~~ 254 (404) T protein:vir:39 200 RYAGIITATNTLLKD---TAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK----K------------------PTIAK 254 (404) T ss_pred eEEeeehhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----c------------------ccccc Confidence 999999999999974 4578999999999999999999999999643211 0 01112 Q ss_pred HHHHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccc-cccccceEE Q lcl|Aclame:pro 157 NGAIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDM-SLTQRDRAI 234 (298) Q Consensus 157 ~~~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~-~~~~~~~~~ 234 (298) ++++.+++. .+...+...++|+|||+++..|+++||++|+|+|.+....+.+++|+|+||+++++.+.. .+.....++ T Consensus 255 ~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~ 334 (404) T protein:vir:39 255 FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLY 334 (404) T ss_pred HHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEE Confidence 566777665 555666777889999999999999999999999998888888899999999997654322 233455789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +|||+.++.+..+++++++++++.. ++|++|++.+|++.|+|+.+.+|+||++++..+ T Consensus 335 ~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 392 (404) T protein:vir:39 335 YGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKTTDSEALVAGSFTA 392 (404) T ss_pred EEeccccEEEEeecceEEEEeccch------hhhhhceeeEEEEeeeccEEecccceEEEEeec Confidence 9999998889999999999988643 369999999999999999999999999999777 No 71 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=1.1e-54 Score=316.31 Aligned_cols=280 Identities=14% Similarity=0.127 Sum_probs=231.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCC-cceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~-~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) -..+||+|||+++.++|++.+++.++++++++++|++++. ..+|+..+. ..+.|++|++.+|+++++|.++++.++|+ T Consensus 121 ~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~ 200 (409) T protein:vir:45 121 QDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKM 200 (409) T ss_pred cCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeee Confidence 2245688999999999999999999999999999997764 455565543 45789999999999999999999999998 Q ss_pred E-EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhH Q lcl|Aclame:pro 79 E-YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 79 ~-~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) + +++++|+||++ |+.+++.++|.++|++++++++|.++|+|+|. +....+.|+...... ...........+ T Consensus 201 ~~~~i~is~ell~---ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~--~~~~~p~Gil~~~~~---~~~~~~~~~~~~ 272 (409) T protein:vir:45 201 TSKIIRVSNELLQ---DSAIDMEAYLARRIAERIGRGEARYLIQGTGA--GTPKQPKGLAASVTG---TTQTAAANAVKW 272 (409) T ss_pred eeeehhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHhhccCCC--CCccccceeeecccc---ccccccccccch Confidence 6 57899999996 44589999999999999999999999999653 333445555433222 222233344557 Q ss_pred HHHHHHhhhhhhcCCcccE--EEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 158 GAIENAVELLTGVDADVTG--IAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~~~~--~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) ++|.+++..+...+..++. |+||+.++..|++|||++|||+|.+....+.+++|+|+||+++++||... .+...++| T Consensus 273 d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~-~~~~~i~~ 351 (409) T protein:vir:45 273 QEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIG-AGKKFMFC 351 (409) T ss_pred HHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCcc-CCccEEEE Confidence 8999999999998877765 47899999999999999999999988888888999999999999998643 34556889 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |||++++ ++.+++++++...+. +|++|++.||++.|+|+++.+|+||++++.+. T Consensus 352 Gd~~~~~-i~~~~~~~~~~~~d~--------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~ 405 (409) T protein:vir:45 352 GDFDRFI-IRRVRYMILKRLVER--------YAEYDQTGFLAFHRFDCILEDTSAIKALVGKG 405 (409) T ss_pred eehhhhh-eeeccceEEEEeecc--------cccCCcEEEEEEEEeccEeechhheEEEEecc Confidence 9999865 678889988876542 57899999999999999999999999999877 No 72 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=2.9e-54 Score=314.10 Aligned_cols=266 Identities=13% Similarity=0.021 Sum_probs=224.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCce--EEEEEeC-CcceEEeeccccccc-cccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE--KVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~--~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~ 76 (298) -..+||++||+++.++||+.+++.++|+++++++|++++.. .+|+..+ .+.+.|++|++.+++ +.++|++|++.++ T Consensus 120 ~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~ 199 (408) T protein:vir:74 120 SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIK 199 (408) T ss_pred ccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeee Confidence 34457899999999999999999999999999999987654 4555544 466789999999997 5699999999999 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) |++++++||+|+++ |+.+++.++|.++|++++++++|.++++|+|.+. .. .+... T Consensus 200 k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~----~~------------------~~~~~ 254 (408) T protein:vir:74 200 RYAGIITATNTLLK---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----KK------------------PTIAN 254 (408) T ss_pred eEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cc------------------ccccc Confidence 99999999999996 4558899999999999999999999999954211 10 01122 Q ss_pred HHHHHHHh-hhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCc--cccccccccceE Q lcl|Aclame:pro 157 NGAIENAV-ELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT--VSDMSLTQRDRA 233 (298) Q Consensus 157 ~~~i~~~~-~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~--~~~~~~~~~~~~ 233 (298) ++++.+++ ..+...+...++|+|||.++..|+++||++|+|+|.+....+.+++|+|+||+++++ +|.. +.+...+ T Consensus 255 ~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~~~i 333 (408) T protein:vir:74 255 FDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNS-GSTVYPL 333 (408) T ss_pred HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccc-cCCcceE Confidence 56777766 477778888889999999999999999999999999888888889999999998654 5543 3445678 Q ss_pred EEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 234 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++|||+.++.++.|++++++++++.. ..|++|++.+|+++|+|+++.+|+||++++.++ T Consensus 334 ~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 392 (408) T protein:vir:74 334 YYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFTA 392 (408) T ss_pred EEEehhccEEEEEecceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEEEEeec Confidence 99999998889999999999987643 358999999999999999999999999999766 No 73 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=2.3e-54 Score=314.61 Aligned_cols=263 Identities=15% Similarity=0.068 Sum_probs=227.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcc--eEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE--IDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~--a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) -.++||+|||+++.++|++.+++.++++++++++|++++..++|+...... +.|++|++.+++++++|++|++.+|++ T Consensus 118 t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~ 197 (421) T protein:vir:13 118 SSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDY 197 (421) T ss_pred ccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeee Confidence 445578999999999999999999999999999999999999998876544 567999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|||+ |+..++.++|.+++++++++++|.++++. +.|+.. ..+...++ T Consensus 198 ~~~v~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~~----------~~g~~~------------~~~~~~~d 252 (421) T protein:vir:13 198 GLLAPIDNSLLE---DSEINFLEFVNEEFAEFAVNTENAEIVKQ----------AKAVLA------------EETINDYA 252 (421) T ss_pred EeehhhhHHHHh---hhHHHHHHHHHHHHHHHHHHHhhhhHhhh----------hhhccc------------cccccchH Confidence 999999999996 44578999999999999999999887743 111111 01112478 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) +|.+++..+..+++.+++|+|||.+|..|++|||++|+|+|.+ ...+.+++|||+||++++++|...+ ....++|||| T Consensus 253 ~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-~~~~~~~tl~G~pV~~~~~~~~~~~-~~~~~~~gd~ 330 (421) T protein:vir:13 253 GLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-LSDGGDLVFKGRPVIELEESIFDVG-DETKFIVSDF 330 (421) T ss_pred HHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-cCCCCCceecceeeEEeccccccCC-CceEEEEEec Confidence 8999999999999999999999999999999999999999976 4556688999999999999986543 4567899999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++++.++.+++++++++++. .|++|++.||++.|+|+++.+++||+.++.++ T Consensus 331 ~~~~~~~~~~~~~v~~~~~~--------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 331 KTLIKFMDRKQYLIDQSKEA--------GYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred cccEEEEEecceEEEeeccc--------ccccCeeEEEEEeeecceeecchhhheeeecc Confidence 99888899999999988763 59999999999999999999999999888877 No 74 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=9.5e-54 Score=311.28 Aligned_cols=264 Identities=11% Similarity=0.006 Sum_probs=223.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC--CcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM--DSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~--~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) -.++++.+||+++..+|++.++..++++++++++++.++.++||+.++ .+.+.|++|++.+|+++++|++|++.+||+ T Consensus 111 ~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~ 190 (379) T protein:vir:10 111 LPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFI 190 (379) T ss_pred cCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeE Confidence 344556789999999999999999999999999999999999999875 356789999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) +++++||+|||++ + .++.++|.++|++++++++|.+++.|.+... . ... ........++ T Consensus 191 ~~~~~iS~ell~D---~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~-~-------~~~---------~~~~~~~~~d 249 (379) T protein:vir:10 191 AGFTRYSKKMANN---L-PFLTSFIPNALRRDYAKAENAAFNAVLAANA-T-------AST---------EIITNKNKVE 249 (379) T ss_pred EeeehhhHHHHhh---H-HHHHHHHHHHHHHHHHHHHHHHHhccccccc-c-------ccc---------ccccCcccHH Confidence 9999999999963 3 3699999999999999999999998854210 0 000 1112233467 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccc--cccCcceecceeeEecCccccccccccceEEEe Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK--WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~--~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +|.+++..+...++.+++|+|||.+|..|+++||++|+|+|++.. ..+.+.+|+|+||++++.||.+ .++|| T Consensus 250 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag------~~~~g 323 (379) T protein:vir:10 250 MLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAAN------KYYVG 323 (379) T ss_pred HHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCCCC------ceEEe Confidence 899999999999999999999999999999999999999997654 3456679999999999999753 58999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||+.++ +..+++++++++.+.. ++|++|++.||+++|+|+.++||+||++++-+- T Consensus 324 df~~~~-~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~ 378 (379) T protein:vir:10 324 DWTRVT-KVTTEGLSLEFSEVEG------TNFVKNNITARIEAQVALAVEQPAALIFGDFTA 378 (379) T ss_pred ecccEE-EEEEeceEEEEeeccc------ccccCCcEEEEEEEEeccEEecCccEEEEEecC Confidence 999865 5678889998876532 369999999999999999999999999988888 No 75 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=8.8e-53 Score=305.97 Aligned_cols=277 Identities=15% Similarity=0.074 Sum_probs=227.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC--------CcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM--------DSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~--------~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ..+.++.++|..+...|+...+..+.+++++++.|+.++.+.+|+.++ .+.+.|++|++.+++++++|++++ T Consensus 128 ~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 207 (419) T protein:vir:94 128 ITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTIT 207 (419) T ss_pred ccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEE Confidence 223333445555555566667777889999999999998889988653 356889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccc---cccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDS---KVTQKVEA 149 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~---~~~~~~~~ 149 (298) +.+||++++++||+|++++ + .++.++|.++|++++++++|.++|+|+| ++. +.|+.+..+ ........ T Consensus 208 ~~~~k~~~~~~is~ell~d---~-~~l~~~i~~~la~a~~~~~d~aii~G~G--~~~---p~Gi~~~~~~~~~~~~~~~~ 278 (419) T protein:vir:94 208 TTLKTVAHWLPITRQAADD---N-SQLMGYIQGRLTYGLRFLRDRQLLNGNG--STE---MQGILTTPGIGTYQQPKPTA 278 (419) T ss_pred eeeeeEEEeehhhHHHHHh---H-HHHHHHHHHHHHHHHHHHHHHHHHhccC--ccc---ccceeccccccccccccccc Confidence 9999999999999999963 2 4699999999999999999999999965 333 344433222 22222233 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCc-eeecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGN-ALFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~-~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) .......+++|.+++..+...+..+++|+|||++|..|+++||++|+ +++.+...++.+++|+|+||++++.||.+ T Consensus 279 ~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~--- 355 (419) T protein:vir:94 279 PATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG--- 355 (419) T ss_pred ccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCc--- Confidence 44455678999999999999999999999999999999999998666 55677777888899999999999999853 Q ss_pred ccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++||||+.++.+..+++++++++++.. ++|++|+++||++.|+|+++.+|+||++++.+. T Consensus 356 ---~~~~gd~~~~~~~~~~~~~~v~~~~~~~------~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~a 416 (419) T protein:vir:94 356 ---TALVGGFRQGATLWSRQGITVLMTDSHA------DFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) T ss_pred ---cEEEeeccceEEEEEecceEEEEecccc------chhhcCcEEEEEEEeeccEEeccccEEEEEecc Confidence 5899999998878889999999887643 369999999999999999999999999998888 No 76 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=8.6e-53 Score=306.03 Aligned_cols=266 Identities=14% Similarity=0.170 Sum_probs=224.0 Q ss_pred CeeccccccchhH-HHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPEL-VTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~-~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) -+.+||+|||+++ ..+||+.+++.++++++ ++.+|+.+++++||+++++++++|++|++.+++++++|+++++++||+ T Consensus 361 t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~ 440 (632) T protein:vir:96 361 TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTI 440 (632) T ss_pred cccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEE Confidence 3457789999986 57899999999999998 578898888999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) +++++||+|||+++ .++++++|.++|++++++++|.++|+|+| ....+.|+.+..+..+ .........++ T Consensus 441 ~~~v~iS~ell~ds---~~~~~~~i~~~l~~a~~~~~d~a~l~G~G----~~~~p~Gi~~~~~~~~---~~~~~~~~~~~ 510 (632) T protein:vir:96 441 AGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTG----LANDPVGLLNMTGVPA---LTYPAGGVDWA 510 (632) T ss_pred EEehhhHHHHHhcc---chHHHHHHHHHHHHHHHHHHHHHhhcccC----CCCccceeeecccccc---eecccccCCHH Confidence 99999999999744 47899999999999999999999999954 3334555543322211 12223334578 Q ss_pred HHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHH--hhccCCceeecccccccCcceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 159 AIENAVELLTGVDAD--VTGIAINPSFRSALAK--QKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 159 ~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~--lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) ++.++..++...+.+ ..+|+|||.++..|++ ++|++|+|||.+ ++|+|+||+++++||.+ .++ T Consensus 511 ~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G~pv~~s~~ip~~------~~~ 577 (632) T protein:vir:96 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD------TWI 577 (632) T ss_pred HHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------CeecccceEeccccccC------cEE Confidence 899999888887753 5689999998877765 779999999953 58999999999999854 488 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+.++ ++.++++++.+++|. .|.+|++.||++.|+|+++++|++|+++|.+= T Consensus 578 ~gd~s~~~-i~~~~~~~i~~~~~~--------~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 578 FGDWSQIV-IAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EeecceEE-EEEecceEEEEcccc--------ccccCceEEEEEeecCceeechhhhhheeecC Confidence 99999864 899999999998875 36789999999999999999999999998777 No 77 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.9e-52 Score=304.15 Aligned_cols=259 Identities=10% Similarity=0.065 Sum_probs=219.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) -..+||++||+++.++|++.+++.++++++++++|++++..++|+... .+.+.|++|++.+++ +.++|++|++.+||+ T Consensus 138 ~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 138 KAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred cccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhhe Confidence 355578999999999999999999999999999999998899999874 577899999999986 689999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|||+ |+.+++.++|.++++++++.++|.++++|++.+.. .+...++ T Consensus 218 ~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~-----------------------~~~~~~~ 271 (400) T protein:vir:38 218 RQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTA-----------------------KTISSVD 271 (400) T ss_pred eeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc-----------------------cccccHH Confidence 999999999997 45578999999999999999999999998642211 0011256 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) ++.+++......+ ..++|+|||.++..|+++||++|+|+|.+...++.+++|+|+||++++++|... .+...++|||| T Consensus 272 ~~~~~~~~~~~~~-~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~-~g~~~~~~gd~ 349 (400) T protein:vir:38 272 DLKHINNVDLDPA-YSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGA-AGEAHAFLGDI 349 (400) T ss_pred HHHHHHHhhhhhh-hCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCC-CCceEEEEEec Confidence 6777766544333 356899999999999999999999999888888888999999999999998643 33557899999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.++.+..|++++++++++. ++...+|+.+|+|+++.+|+||++|+.++ T Consensus 350 s~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 350 KRAILFANRADFMVRWVDDQ-----------IYGQFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred cccEEEEeecceEEEEeccc-----------ccceeEEEEEEeccEEecccceEEEEeec Confidence 99888888999999887642 23357899999999999999999999988 No 78 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.5e-51 Score=299.15 Aligned_cols=263 Identities=16% Similarity=0.182 Sum_probs=215.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) -.++||++||++++++|++.+++.++|+++++++|++++...+|+... .+.+.|++|++++++ +.++|++|++.+||+ T Consensus 115 t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~ 194 (394) T protein:vir:10 115 TSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTY 194 (394) T ss_pred ccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeee Confidence 445578999999999999999999999999999999998899998765 577899999999996 679999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) +++++||+|||+ |+.+++.++|.++|++++++++|.++++|.+. +.+.. ......++ T Consensus 195 ~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~--~~~~~------------------~~~~~~~d 251 (394) T protein:vir:10 195 RGAIPLSEEAIA---DSAVDLTSLVGQSINEKSVNTYNAMIAPVLQS--FTAKA------------------TTTDTLVD 251 (394) T ss_pred EeeehhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccc------------------ccccccHH Confidence 999999999997 44578999999999999999999999998642 11111 11123356 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccc----ccCcceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKW----GATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) +|.+++.......+ .++|+|||++|..|++|||++|||+|.+... .+.+++|+|+||++++......+.+...++ T Consensus 252 ~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~ 330 (394) T protein:vir:10 252 SLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAF 330 (394) T ss_pred HHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEE Confidence 77777664444444 4689999999999999999999999976653 345679999999987654334444556789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||++++.+..++++++.++++. .|.+ .+|+.+|+|+++++|+||+.++..+ T Consensus 331 ~gd~s~~~~~~~~~~~~v~~~~~~--------~~~~---~~~~~~r~d~~~~~~~ai~~~~~~~ 383 (394) T protein:vir:10 331 VGDLKRGVLFADRQQVTLAWEDSK--------IYGR---YLGAAFRFGVKQADSNAGYFVTNTD 383 (394) T ss_pred EeeccccEEEEeecceEEEEeccc--------ccce---eEEEEEEeccEEeccccEEEEEeec Confidence 999999888888999999877643 1333 4799999999999999999998887 No 79 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=1e-51 Score=300.17 Aligned_cols=256 Identities=15% Similarity=0.088 Sum_probs=215.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) -..+||++||+++.++|++.+++.++++++++++|+++++..+|+... ++.+.|++|++.+++ +.++|++|++.+||+ T Consensus 132 t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~ 211 (394) T protein:vir:97 132 KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTY 211 (394) T ss_pred ccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhhe Confidence 334578999999999999999999999999999999998899999764 567899999999997 569999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+||++ |+.+++.++|.+++++++++++|.++++|.+.+.. .+...++ T Consensus 212 ~~~i~is~ell~---ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~-----------------------~~~~~~~ 265 (394) T protein:vir:97 212 RGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT-----------------------KTVKNLD 265 (394) T ss_pred eeehhhHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------------------cccccHH Confidence 999999999997 44578999999999999999999999988532110 0112356 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) ++.+++...... ...+.|+|||.+|..|+++||++|+|+|.+...++.+++|+|+||++++.... +..+++|||| T Consensus 266 ~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~----~~~~~~~gd~ 340 (394) T protein:vir:97 266 EIKALLNGGFDP-AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL----GANKAFIGDF 340 (394) T ss_pred HHHHHHHhhhhh-hhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEeccccc----CCccEEEeec Confidence 777777655443 33567999999999999999999999999888888889999999999665432 3446899999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.++.+..|++++++++++ .++...+|+++|+|+++.+|+||++|+..+ T Consensus 341 ~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 341 KRGVLFADRKDLGLRWADN-----------EIYGQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) T ss_pred cccEEEEEecceEEEEecc-----------cccceeEEEEEEEccEEecccceEEEEecc Confidence 9888788999999987653 233457899999999999999999999877 No 80 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.7e-51 Score=298.92 Aligned_cols=263 Identities=16% Similarity=0.182 Sum_probs=213.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) ..++||++||+++.++|++.++++++++++|+++|++++..++|+... ...+.|++|++.+++ ++++|+++++.+||+ T Consensus 113 t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~ 192 (389) T protein:vir:10 113 TSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATY 192 (389) T ss_pred ccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheee Confidence 667789999999999999999999999999999999998899998865 566789999999885 799999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|+|+ |+.+++.++|.++|++++++++|.++++|.+.+. + ........++ T Consensus 193 ~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--~------------------~~~~~~~~~d 249 (389) T protein:vir:10 193 RGAIPLSEEAIA---DSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT--A------------------KKTTTDTLVD 249 (389) T ss_pred EeeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc--c------------------ccccccccHH Confidence 999999999997 4457899999999999999999999998853211 0 0111223467 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccc----ccCcceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKW----GATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~----~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) ++.+++.......+ .++|+|||++|..|+++||++|||||.+... .+.+++|+|+||++++........+...++ T Consensus 250 ~l~~~~~~~~~~~~-~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 328 (389) T protein:vir:10 250 SLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAF 328 (389) T ss_pred HHHHHHHhhhhhhh-CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEE Confidence 77777654333333 5689999999999999999999999976643 345579999999876554333333455789 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||||+++|.++.+++++++++++. .|. ..+|+.+|+|+++.+|+||++++-++ T Consensus 329 ~gd~~~~~~~~~~~~~~i~~~~~~--------~~~---~~~~~~~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 329 VGDLKRGVLFTDRQQVTLAWEDSK--------IYG---KYLGAAFRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred EeeccccEEEEeecceEEEeeccc--------ccc---ceEEEEEEeccEEecccceEEEEeec Confidence 999999888899999999987753 233 35789999999999999999998776 No 81 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=2e-51 Score=298.50 Aligned_cols=263 Identities=11% Similarity=0.036 Sum_probs=215.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) -..++|++||+++...|. .++..+.+++++++++++++...+|+... .+.+.|++|++.+++ ++++|++|++.+||+ T Consensus 160 ~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~ 238 (437) T protein:vir:10 160 ALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTY 238 (437) T ss_pred ccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhhe Confidence 345678999999877665 46788899999999999988899998754 578999999999996 568999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|+|+ |+.+++.++|.+++++++++++|.++++|++.+. +. ......++ T Consensus 239 ~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~--~~-------------------~~~~~~~~ 294 (437) T protein:vir:10 239 TGGYVFSQELIS---DSSYDWQAELQSRLIELRDNTDDSLIITALTDGI--KK-------------------TTSTYLLG 294 (437) T ss_pred eeehhhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cc-------------------cccccchh Confidence 999999999997 4558899999999999999999999999964211 10 01112245 Q ss_pred HHHHHhh-hhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccc-ccccccceEEEe Q lcl|Aclame:pro 159 AIENAVE-LLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSD-MSLTQRDRAIIG 236 (298) Q Consensus 159 ~i~~~~~-~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~-~~~~~~~~~~~g 236 (298) ++.+++. .+...+..++.|+|||.++..|++|||++|+|+|.+....+.+++|+|+||++++++.. ..+.+...++|| T Consensus 295 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~g 374 (437) T protein:vir:10 295 DLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVA 374 (437) T ss_pred hHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEe Confidence 5666654 66677777888999999999999999999999998888888889999999999876532 233455678999 Q ss_pred eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 237 d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||+.+|.+..|+++++++.+. |..+...+|+..|+|+++++|+||++|++-. T Consensus 375 d~~~~~~~~~r~~~~~~~~~~----------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 375 PLKKAVINFKLTEITGQFQDT----------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred eccccEEEEeeeceEEEEecc----------cccccceeeEEEEEccEEecccceEEEEeec Confidence 999988888999999987654 3344457788899999999999999998543 No 82 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.7e-51 Score=298.98 Aligned_cols=274 Identities=12% Similarity=-0.025 Sum_probs=224.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) -..+||++||+++..+|++.+.+.|+++++|+++++++ ..++|+.++.+.+.|++|+++.+ +++++|+++++.+||++ T Consensus 83 ~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 161 (377) T protein:vir:98 83 GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLT 161 (377) T ss_pred CCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc-ceEEEEecCCcceeEeecccccCcccCccceeEeecceeEE Confidence 56667999999999999999999999999999999865 58999999999999999987765 67999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc---ccccccccccchh Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV---TQKVEAPRGIADP 156 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 156 (298) +++++|+|||+ |+.++++++|.+++++++++++|.+|++|+| ++ .|.|+....... ............. T Consensus 162 a~~~is~elL~---ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G--~~---qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:98 162 AFVVIPKDALK---FGPKWIKQFITEQLKEAIAVALELAIVKGDG--LL---QPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eeecccHHhhh---ccHhHHHHHHHHHHHHHHHHHHhhceEeccC--CC---cceeeeecccccccccccccccccccch Confidence 99999999996 5668999999999999999999999999964 33 344443222111 1111112222233 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccc--------------cccCcceecceee--EecC Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK--------------WGATPDTINGLPV--DVNK 220 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~--------------~~~~~~~l~G~PV--~~s~ 220 (298) .+.+.++...+...+....+|+||+.++..++++||.+|+|+|...+ ..+.+.+++|+|+ +.++ T Consensus 234 ~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~ 313 (377) T protein:vir:98 234 KEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESL 313 (377) T ss_pred hhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecC Confidence 46678888888888888889999999999999999999999993111 2355668999985 5677 Q ss_pred ccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 221 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +||++ .++||||+. |.++.|++++++.+++. +|.+|++.||+.+|+|+++++++||++|+-+= T Consensus 314 ~~p~~------~i~fgdf~~-Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 314 AVETG------KAIAFVANR-YDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAG 376 (377) T ss_pred CCCcc------cEEEEEecc-eeEEeecceEEEeechh--------hhhcCceEEEEEEEEcCEEeccCcEEEEEEec Confidence 88753 478999998 56889999999887753 68999999999999999999999999998877 No 83 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3e-50 Score=292.06 Aligned_cols=267 Identities=10% Similarity=-0.054 Sum_probs=210.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) =.++||++||++++++|++.+++.++++++++++|++++...+|+.++.+.+.|++|++.++ .++++|+++++++||++ T Consensus 88 ~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~ 167 (390) T protein:vir:40 88 GFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLS 167 (390) T ss_pred CcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEE Confidence 24467899999999999999999999999999999999889999999999999999998876 57899999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc-ccccccchhHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV-EAPRGIADPNG 158 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 158 (298) ++++||+||++ |+.++++++|.+++++++++++|.++++|+| ++ .+.|+....+..+... .........++ T Consensus 168 ~~i~iS~ell~---ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G--~~---~P~Gil~~~~~~~~~~~~~~~~~~~t~~ 239 (390) T protein:vir:40 168 AYIPVCNAMLD---LGPSWLDQYVRTILGEAMALGLEAGIVNGSG--KD---QPIGMMRDLNNVTAGEHPVKTATPLTDL 239 (390) T ss_pred EeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhcccC--CC---ccceeeeccccccccccccccccccchh Confidence 99999999997 4558899999999999999999999999964 22 2344433222211111 11111222233 Q ss_pred HHHHHhhhhhh-------cCCcccEEEEcHHHH----HHHHHhhccCCceeecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 159 AIENAVELLTG-------VDADVTGIAINPSFR----SALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 159 ~i~~~~~~l~~-------~~~~~~~~vm~~~~~----~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ++.+++..+.. .....++|+|||.++ ..+++++|.+|+|+|.. .++|+||+++++||.+ T Consensus 240 ~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v~~~--------~~~g~pvv~~~~~p~~-- 309 (390) T protein:vir:40 240 TPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWVTGI--------LPVPLEIVQSVAVPVG-- 309 (390) T ss_pred hHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCcccccc--------CCCceeEEEcCCCCCC-- Confidence 33333333322 234466799999884 35568999999999743 2479999999999854 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++||||+.+ .++.|++++++++++. +|.+|++.||+..|+|+++.+++||++|+-+. T Consensus 310 ----~i~~Gd~s~~-~i~~~~~~~v~~~~~~--------~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~ 367 (390) T protein:vir:40 310 ----KAVAGRAKDY-FMGIGSEQVIRTSTEY--------RLLDDETLYYAKQYANGRPKDNSSFLVFDITG 367 (390) T ss_pred ----cEEEEeeceE-EEEeecceEEEecchh--------hhhcCcEEEEEEEEeCCEEecccceEEEEeec Confidence 4889999985 5789999999887753 58999999999999999999999999997555 No 84 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=1.1e-50 Score=294.56 Aligned_cols=281 Identities=11% Similarity=0.050 Sum_probs=216.6 Q ss_pred CeeccccccchhH-HHHHHHHHHhhchhhhhcceeecCC--CceEEEEEeCCc-ceEEeeccc-----cccccccceeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPEL-VTDLISKVAGKSSIARLSAQKPIPF--NGEKVFTFTMDS-EIDVVAESG-----KKTHGGVTLAPQ 71 (298) Q Consensus 1 mat~gg~lip~~~-~~~ii~~~~~~s~i~~~~~~~~~~~--~~~~ip~~~~~~-~a~~v~E~~-----~~~~~~~~~~~v 71 (298) -...||++||+++ .++||+.+++.++|+++++++|+++ +++.||+.++++ .+.|++|++ .+|+++++|+++ T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 2455689999985 5789999999999999999998865 468999977654 577999986 457889999999 Q ss_pred EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc----c Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK----V 147 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~----~ 147 (298) ++++||++++++||+|||+ |+.+++.++|.++|++++++++|.++|+|+| +...+.|+.+..+..... . T Consensus 240 ~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~G----t~~~p~Gi~~~~~~~~~~~~~~~ 312 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLD---QAAVSVDEFVFRDLAADYANKLNVQVISGTG----SNNQVVGVRATAGITQVTATSAG 312 (477) T ss_pred EEeeeeEEeeeHHHHHHHh---ccchhHHHHHHHHHHHHHHHHHHHHHhccCC----CCCccceeeeccccccccccccc Confidence 9999999999999999997 4447899999999999999999999999953 334455554433221111 1 Q ss_pred ccccccchhHHHHHHHhhhhhhcCCc-ccEEEEcHHHHHHHHHhhccCCceeeccc-------------ccccCcceecc Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGVDAD-VTGIAINPSFRSALAKQKDLQGNALFPEL-------------KWGATPDTING 213 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~~~~-~~~~vm~~~~~~~L~~lkd~~G~~l~~~~-------------~~~~~~~~l~G 213 (298) .........+++|.+++..+...+.. .++|+|||+++..|+++||++|||||.+. ...+.+++|+| T Consensus 313 ~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G 392 (477) T protein:vir:84 313 SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG 392 (477) T ss_pred cchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc Confidence 11112234567778888777777664 45799999999999999999999999754 23344679999 Q ss_pred eeeEecCcccccccc--ccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEe-cccc Q lcl|Aclame:pro 214 LPVDVNKTVSDMSLT--QRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL-DATK 290 (298) Q Consensus 214 ~PV~~s~~~~~~~~~--~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~-~~~a 290 (298) +||++++.||.+.+. ....++||||+.++ ++. .+++++++++. ++.++.+.||...++++..+ +|+| T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~-i~~-~~~~~~~~~~~--------~~~~~~~~~~v~~~~~~~~~r~~~a 462 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRASDLA-LFE-SSVRMRALQET--------RAENLSVLLQVYGYLAFTAARFPQS 462 (477) T ss_pred cceEecCcccccccccCCcceEEEEEeceEE-EEe-eceeEEecccc--------ccccceeeeeehhhhhhhhhccccc Confidence 999999999976443 34578999998764 444 57888877764 24467778888777776555 5999 Q ss_pred eEEEeecC Q lcl|Aclame:pro 291 FARVTEAN 298 (298) Q Consensus 291 ~~~l~~a~ 298 (298) |++++++- T Consensus 463 fv~~t~~~ 470 (477) T protein:vir:84 463 VVEIGGTA 470 (477) T ss_pred eEEeeccc Confidence 99999887 No 85 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=4e-50 Score=291.41 Aligned_cols=259 Identities=13% Similarity=0.046 Sum_probs=216.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccc-cccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTH-GGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~-~~~~~~~v~l~~~k~ 78 (298) -..++|+++|+++.+.|++ ++...+++++++++|+++++..+|+... ...+.|++|++..++ +.++|++|++++|++ T Consensus 136 ~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~ 214 (397) T protein:vir:96 136 TSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATR 214 (397) T ss_pred cccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHh Confidence 3556789999999999998 5677889999999999988889998764 567889999999996 689999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ++++++|+|+++ |+..++.++|.++++++++++++.++++|++.+.. .+...++ T Consensus 215 ~~~~~~s~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~-----------------------~~~~~~d 268 (397) T protein:vir:96 215 RGYIPISQEMID---DASYDVTGLIADEIQDQSLNTKNADIAAVLKTATA-----------------------KSVVGVD 268 (397) T ss_pred hcchhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------------------ccccchH Confidence 999999999997 44578999999999999999999999998643211 0112367 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeec Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~ 238 (298) +|.+++......++ +++|+|||++|..|++|||++|+|+|.+...++.+++|+|+||++++....+.+.+..+++|||| T Consensus 269 ~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~ 347 (397) T protein:vir:96 269 GLKDLINKEIKKVY-DVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDA 347 (397) T ss_pred HHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeeh Confidence 77777766544443 57899999999999999999999999888888888999999999877665555666778999999 Q ss_pred cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 239 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.++.++.++++++.++++. .| ...+|+++|+|+++++|+||++++-.+ T Consensus 348 ~~~~~~~~~~~~~~~~~~~~--------~~---~~~~~~~~r~d~~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 348 KAFASFFDRKQVSVSWVDNN--------IY---GQLLAGIIRYDVKATDKKAGFYVTFTI 396 (397) T ss_pred hcceEeEeecceEEEEeccc--------cc---ceeEEEEEEEccEEecccceEEEEeec Confidence 99887899999999876542 23 356899999999999999999998777 No 86 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=1.4e-49 Score=288.40 Aligned_cols=258 Identities=13% Similarity=0.064 Sum_probs=206.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||+|||++++++||+.++++++|+++++++++++ ..+|+.+. .+++.|++|++.+++++++|++|++.+||++ T Consensus 87 ~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 164 (352) T protein:vir:78 87 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFK 164 (352) T ss_pred CCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC--ceEEEEecCCCcccccccccccccccccceeeeecceeEE Confidence 34567899999999999999999999999999988764 57888765 4789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++|+++++++++..+| +.+.++|.+.+.... ... ........+++ T Consensus 165 ~~i~is~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~-~~g~g~~~~~g~l~~-----~~~----~~~t~~~~~d~ 231 (352) T protein:vir:78 165 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDAL-AVSPKSGLEHMSFYN-----GSV----KEVEGANMYDA 231 (352) T ss_pred eechhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHhhh-hcCCCCcccccceec-----ccc----ccccccchHHH Confidence 99999999996 555899999999999999988655444 333334333221111 111 11123345899 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+..++..+++|+||+.++..|++++|.+|+|+|. +.+.+|+|+||++++.++ .++||||+ T Consensus 232 i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------~~~~Gdf~ 298 (352) T protein:vir:78 232 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 298 (352) T ss_pred HHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc-----cCCccccccceEEecCCC--------ceeEeehh Confidence 99999999999999999999999999999999999999984 345789999999998764 36889999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.++...+ ..++++.|+++.|+|+++++|+||++++.++ T Consensus 299 ~~~~--~~~~~~~~~~~~----------~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a 345 (352) T protein:vir:78 299 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 345 (352) T ss_pred hhhh--hhhhheeeeecc----------ccCCeeEEEEEeeeCceeechhheEEEEeec Confidence 7643 345555443322 2368899999999999999999999999988 No 87 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=8.8e-48 Score=278.56 Aligned_cols=268 Identities=8% Similarity=-0.008 Sum_probs=206.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccc-cccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKK-THGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~-~~~~~~~~~v~l~~~k~~ 79 (298) ...+||+|||+++.++|++.+++.|+++++++++|+++ ...+|+.++.+.+.|++|.++. ++++++|+++++.+|+++ T Consensus 90 t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 168 (395) T protein:vir:95 90 VGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLT 168 (395) T ss_pred cCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceEEeecccccCccccccceeeeeceeeEE Confidence 67778999999999999999999999999999999975 5799999999999999987665 578999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc-cccccccchhHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK-VEAPRGIADPNG 158 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 158 (298) ++++||+|||+ |+.++++++|.+++++++++++|.+|++|+|.+. ..|.|+.+.....+.. ..........++ T Consensus 169 ~~~~iS~ell~---ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~---~qP~Gil~~~~~~~~~~~~~~~~~~~t~~ 242 (395) T protein:vir:95 169 CFVVLPDDLST---FGPAWIERFVRTQIQEAISVALESAIINGGGAAK---TQPVGLMKDVNTNSGAVTDKASSGTLTFA 242 (395) T ss_pred EeecccHHHHh---cchhHHHHHHHHHHHHHHHHHHhhheeeccCCCC---cCceeeeecccccccccccccccchhhhh Confidence 99999999996 5568999999999999999999999999965332 1234443322211111 111111111222 Q ss_pred H-------HHHHhhhhh-------hcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecc--eeeEecCcc Q lcl|Aclame:pro 159 A-------IENAVELLT-------GVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTING--LPVDVNKTV 222 (298) Q Consensus 159 ~-------i~~~~~~l~-------~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G--~PV~~s~~~ 222 (298) + +.+++..+. ..+.....|+|||.++. |.+|+|+|.+ ..+.+.+++| +||+.++.| T Consensus 243 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~g~~~~~~--~~G~~~~~lg~g~~v~~~~~~ 314 (395) T protein:vir:95 243 DADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQARYTYLT--ANGGFVTVLPYNVTIITSEFV 314 (395) T ss_pred hhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcCCcceecc--CCCcceeccCCcceEEEcCCC Confidence 2 333332221 12233457999998865 5579999987 3567778864 568889999 Q ss_pred ccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 223 SDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 223 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |.+ .++||||+. |.++.|++++++++++. +|.+|++.||+.+|+|+++++++||++|+-.. T Consensus 315 p~~------~i~fgdfs~-y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~ 375 (395) T protein:vir:95 315 PEG------KLVAFVTDR-YNAVRGGGLTVKKFDQT--------LALEDAVLFTAKTFAYGQPDDNKASAVYDLKV 375 (395) T ss_pred CCC------cEEEEeccc-EEEEEecceEEEeccch--------hhhCCcEEEEEEEEECCEEeccccEEEEEeec Confidence 853 488999998 56899999999887753 58999999999999999999999999987654 No 88 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=8.3e-48 Score=278.69 Aligned_cols=265 Identities=11% Similarity=-0.025 Sum_probs=209.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) ..++||+|||+++.++|++.+++.|+++++|+++++++ ..++|+.++.+.+.|++|++.++ +++++|+++++.+||++ T Consensus 80 ~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 158 (381) T protein:vir:10 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) T ss_pred cCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEEEecCCcceeeecccccccccccccceeeeecceeEE Confidence 55688999999999999999999999999999999875 57999999999999999988776 56899999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccc-------c-----cccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK-------V-----TQKV 147 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-------~-----~~~~ 147 (298) +++++|+|||+ |+.++++++|.+++++++++++|.+|++|+| ++. |.|+...... . .... T Consensus 159 ~~~~is~elL~---Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G--~~q---P~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:10 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQ---PIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eechhhHHHhh---cCHHHHHHHHHHHHHHHHHHHhhheeEeccC--CCC---ceeeeeccCcccccccccccccccccc Confidence 99999999996 5568999999999999999999999999964 233 3333211110 0 0000 Q ss_pred ccccccchhHHHHHHHhhhhhhc-------CCcccEEEEcHHHHHHHHHhh---ccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGV-------DADVTGIAINPSFRSALAKQK---DLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~-------~~~~~~~vm~~~~~~~L~~lk---d~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) .........++.+.+++..+... +.....|+|||.++..|++++ +++|+|+|.. -+|.+|+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---------~~g~~vv 301 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---------PFNLNVI 301 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC---------CCCceEE Confidence 11111222345566665555432 333457999999999998776 6788888641 1467799 Q ss_pred ecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.||.+ .++||||+. |.++.|++++++++++. +|.+|++.||+.+|+|+++++++||++++-. T Consensus 302 ~s~~~p~~------~iifgDfs~-Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 302 ESTVQEAG------KVLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred ecCCCCcC------cEEEEeccc-EEEEEecccEEEeechh--------HhhcCCeEEEEEEEEcCEEecCceEEEEEEE Confidence 99999853 489999998 56899999999888763 6999999999999999999999999997655 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) . T Consensus 367 ~ 367 (381) T protein:vir:10 367 L 367 (381) T ss_pred e Confidence 4 No 89 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=8.3e-48 Score=278.69 Aligned_cols=265 Identities=11% Similarity=-0.025 Sum_probs=209.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) ..++||+|||+++.++|++.+++.|+++++|+++++++ ..++|+.++.+.+.|++|++.++ +++++|+++++.+||++ T Consensus 80 ~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 158 (381) T protein:vir:95 80 VNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) T ss_pred cCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-ceEEEEecCCcceeeecccccccccccccceeeeecceeEE Confidence 55688999999999999999999999999999999875 57999999999999999988776 56899999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccc-------c-----cccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK-------V-----TQKV 147 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~-------~-----~~~~ 147 (298) +++++|+|||+ |+.++++++|.+++++++++++|.+|++|+| ++. |.|+...... . .... T Consensus 159 ~~~~is~elL~---Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G--~~q---P~Gil~~~~~~~~~~~g~~~~~~~~~t 230 (381) T protein:vir:95 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQ---PIGLNRQVQKGVSVTEGAYPEKEEQGT 230 (381) T ss_pred eechhhHHHhh---cCHHHHHHHHHHHHHHHHHHHhhheeEeccC--CCC---ceeeeeccCcccccccccccccccccc Confidence 99999999996 5568999999999999999999999999964 233 3333211110 0 0000 Q ss_pred ccccccchhHHHHHHHhhhhhhc-------CCcccEEEEcHHHHHHHHHhh---ccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGV-------DADVTGIAINPSFRSALAKQK---DLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~-------~~~~~~~vm~~~~~~~L~~lk---d~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) .........++.+.+++..+... +.....|+|||.++..|++++ +++|+|+|.. -+|.+|+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---------~~g~~vv 301 (381) T protein:vir:95 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---------PFNLNVI 301 (381) T ss_pred cccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCCCCCceeecC---------CCCceEE Confidence 11111222345566665555432 333457999999999998776 6788888641 1467799 Q ss_pred ecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.||.+ .++||||+. |.++.|++++++++++. +|.+|++.||+.+|+|+++++++||++++-. T Consensus 302 ~s~~~p~~------~iifgDfs~-Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:95 302 ESTVQEAG------KVLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred ecCCCCcC------cEEEEeccc-EEEEEecccEEEeechh--------HhhcCCeEEEEEEEEcCEEecCceEEEEEEE Confidence 99999853 489999998 56899999999888763 6999999999999999999999999997655 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) . T Consensus 367 ~ 367 (381) T protein:vir:95 367 L 367 (381) T ss_pred e Confidence 4 No 90 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=2.8e-47 Score=275.83 Aligned_cols=265 Identities=10% Similarity=-0.025 Sum_probs=205.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) ...+||+|||+++.++|++.+++.|+++++|+++++++ ..++|+.+..+.+.|++|.++.+ +++++|+++++.+||++ T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~ 158 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLT 158 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeecccccccccCccceeEeecceeEE Confidence 67788999999999999999999999999999999864 57999999899999999987754 67899999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc----------- Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE----------- 148 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----------- 148 (298) +++++|+|||+ |+.++++++|.+++++++++++|.+|++|+| ++. |.|+............ T Consensus 159 a~i~is~elL~---Ds~~~le~~i~~~la~~~a~~~~~afi~GdG--~~q---P~Gil~~~~~~~~~~~g~~~~~~~~~~ 230 (381) T protein:vir:10 159 AFVVLPKDLND---FGPAWIERFVRVQIEEAFAVALETAFLKGTG--KDQ---PIGLNRQVQKGVSVTDGAYPEKEEQGT 230 (381) T ss_pred eeccccHHHHh---ccHHHHHHHHHHHHHHHHHHHhhceeEeccc--CCC---ceeeeecCCcccccccccccccccccc Confidence 99999999995 5668999999999999999999999999964 333 3333211111000000 Q ss_pred -cccccchhHHHHHHHhhhhhh-------cCCcccEEEEcHHHHHHHHHhh---ccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 149 -APRGIADPNGAIENAVELLTG-------VDADVTGIAINPSFRSALAKQK---DLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 149 -~~~~~~~~~~~i~~~~~~l~~-------~~~~~~~~vm~~~~~~~L~~lk---d~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) ........+..+.+++..+.. .+.....|+|||.++..|++++ |++|+|+|.. -+|+||+ T Consensus 231 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~l---------p~g~~vv 301 (381) T protein:vir:10 231 LTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---------PFNLNVI 301 (381) T ss_pred ccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceeecC---------CCCceeE Confidence 000111122333333322211 2233457999999999998765 8899998742 1478899 Q ss_pred ecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .++.||.+ .++||||+. |.++.|.+++++++++. +|.+|++.||+.+|+|+++++++||++++-. T Consensus 302 ~~~~~p~~------~i~fGDfs~-Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 302 ESTVQEAG------KVLTYVKGL-YDGYLAGGINVQKFKET--------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred EcCCCCcC------cEEEEEccc-EEEEEecccEEEeechh--------hhhcCceEEEEEEEEcCEEecCCcEEEEEEe Confidence 99999853 489999998 56899999999888753 6999999999999999999999999997654 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) - T Consensus 367 ~ 367 (381) T protein:vir:10 367 L 367 (381) T ss_pred e Confidence 3 No 91 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=6.1e-48 Score=279.43 Aligned_cols=258 Identities=13% Similarity=0.050 Sum_probs=202.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||++||++++.+||+.+++.++++++++++++++ ..+|+... .+++.|++|++.+++++++|+++++.+|+++ T Consensus 137 t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~ 214 (402) T protein:vir:93 137 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFK 214 (402) T ss_pred CCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeecceeee Confidence 34567899999999999999999999999999998864 57888664 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++|+++++++++..+|.+ +.++|.+.+.. .. ...........+++ T Consensus 215 ~~i~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~-g~g~g~p~g~~---~~------~~~~~~~~~~~~d~ 281 (402) T protein:vir:93 215 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAV-SPKSGLEHMSF---YN------GSVKEVEGADMYDA 281 (402) T ss_pred eechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhc-CCCccccceee---ec------cccccccccchHHH Confidence 99999999996 55689999999999999999877655422 23333322211 10 01111223445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+...|..++.|+||+.++..++++++.+|+++|. +.+++|+|+||++++.++ .++||||+ T Consensus 282 l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~-----~~~~~llG~PV~~t~~~~--------~i~~GDf~ 348 (402) T protein:vir:93 282 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 348 (402) T ss_pred HHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------ceeeechh Confidence 99999999999988899999999998887777777888874 346789999999998764 46899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.++...+ ..++++.||+..|+|+++++|+||++|+... T Consensus 349 ~~~~--~~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~ 395 (402) T protein:vir:93 349 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 395 (402) T ss_pred hhhh--hhhhhhhhhhhc----------ccCCceEEEEEEEeCcEEechhheEEEEeec Confidence 7653 333444332221 1258999999999999999999999886644 No 92 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.5e-47 Score=277.29 Aligned_cols=258 Identities=13% Similarity=0.065 Sum_probs=200.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEe-CCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFT-MDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||++||++++++||+.++++++|+++++++++++ ..+|+.. ..++++|++|++..++++++|+++++.+||++ T Consensus 122 t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 199 (387) T protein:vir:93 122 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFK 199 (387) T ss_pred cCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC--ceEEEEeecCCccccccCcccccccccccceeeeeheeee Confidence 33567899999999999999999999999999998864 5788865 45789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++++++++++++..+|. .++++|.+.+.. . +.. .........+++ T Consensus 200 ~~~~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~g~p~g~l---~--~~~----~~~v~~~~~~d~ 266 (387) T protein:vir:93 200 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKSGLDHMSF---Y--NGS----VKEVEGADMYDA 266 (387) T ss_pred eechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCccccceee---e--ccc----cccccccchHHH Confidence 99999999996 5558999999999999999998776552 223333322211 1 101 111223445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+...|...++|+||+.++..+.++++.+|+++|. +.+.+|+|+||++++.++ .++||||+ T Consensus 267 i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~-----~~~~~llG~PV~~~~~~~--------~~~~GDf~ 333 (387) T protein:vir:93 267 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 333 (387) T ss_pred HHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------ceeeeehh Confidence 99999999999999999999999987665444444445542 346789999999998764 46899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.+....+ +.++++.|+++.|+|+++++|+||++++..+ T Consensus 334 ~~~~--~~~~~~~~~~~~----------~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~ 380 (387) T protein:vir:93 334 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred hhhe--ehhhheeeeccc----------ccCCceeEEEEeeeCceeechhheEEEEeec Confidence 8653 344555543322 3468899999999999999999999987766 No 93 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=8.4e-48 Score=278.67 Aligned_cols=258 Identities=13% Similarity=0.064 Sum_probs=204.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||++||++++++||+.++++++|+++++++++++ ..+|+... .+++.|++|++.+++++++|+++++.+||++ T Consensus 122 ~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 199 (387) T protein:vir:26 122 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFK 199 (387) T ss_pred CCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheee Confidence 34557899999999999999999999999999998864 57888664 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++|+++++++++..+|. .+.++|.+.+.. . +.. .........+++ T Consensus 200 ~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~g~~~g~~---~--~~~----~~~~~~~~~~d~ 266 (387) T protein:vir:26 200 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKSGLEHMSF---Y--NGS----VKEVEGADMYDA 266 (387) T ss_pred eechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCccccceee---e--ccc----cccccccchHHH Confidence 99999999996 5558999999999999999987766552 233333322211 1 000 111223445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+..+|..++.|+||+.++..+.++++.+|+++|. +.+++|+|+||++++.++ .++||||+ T Consensus 267 i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------~~~~GDf~ 333 (387) T protein:vir:26 267 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 333 (387) T ss_pred HHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------ceeeechh Confidence 99999999999999999999999998887777777888874 346789999999998764 46899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.+....+ ..++++.||++.|+|+++++|+||++|+... T Consensus 334 ~~~~--~~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:26 334 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred hhhh--hhhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 7653 344454433222 2368899999999999999999999998754 No 94 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=8.4e-48 Score=278.67 Aligned_cols=258 Identities=13% Similarity=0.064 Sum_probs=204.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||++||++++++||+.++++++|+++++++++++ ..+|+... .+++.|++|++.+++++++|+++++.+||++ T Consensus 122 ~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 199 (387) T protein:vir:96 122 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFK 199 (387) T ss_pred CCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheee Confidence 34557899999999999999999999999999998864 57888664 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++|+++++++++..+|. .+.++|.+.+.. . +.. .........+++ T Consensus 200 ~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~g~~~g~~---~--~~~----~~~~~~~~~~d~ 266 (387) T protein:vir:96 200 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKSGLEHMSF---Y--NGS----VKEVEGADMYDA 266 (387) T ss_pred eechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCccccceee---e--ccc----cccccccchHHH Confidence 99999999996 5558999999999999999987766552 233333322211 1 000 111223445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+..+|..++.|+||+.++..+.++++.+|+++|. +.+++|+|+||++++.++ .++||||+ T Consensus 267 i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------~~~~GDf~ 333 (387) T protein:vir:96 267 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 333 (387) T ss_pred HHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------ceeeechh Confidence 99999999999999999999999998887777777888874 346789999999998764 46899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.+....+ ..++++.||++.|+|+++++|+||++|+... T Consensus 334 ~~~~--~~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:96 334 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred hhhh--hhhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 7653 344454433222 2368899999999999999999999998754 No 95 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=8.4e-48 Score=278.67 Aligned_cols=258 Identities=13% Similarity=0.064 Sum_probs=204.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -.++||++||++++++||+.++++++|+++++++++++ ..+|+... .+++.|++|++.+++++++|+++++.+||++ T Consensus 122 ~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~ 199 (387) T protein:vir:94 122 NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFK 199 (387) T ss_pred CCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--ceeeeeeccCCccccccccccccccccccceeeechheee Confidence 34557899999999999999999999999999998864 57888664 5789999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++||+|||+ |+.+++.++|.++|+++++++++..+|. .+.++|.+.+.. . +.. .........+++ T Consensus 200 ~~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~-~g~g~g~~~g~~---~--~~~----~~~~~~~~~~d~ 266 (387) T protein:vir:94 200 VFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALA-VSPKSGLEHMSF---Y--NGS----VKEVEGADMYDA 266 (387) T ss_pred eechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhh-cCCCccccceee---e--ccc----cccccccchHHH Confidence 99999999996 5558999999999999999987766552 233333322211 1 000 111223445889 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) |.+++..+..+|..++.|+||+.++..+.++++.+|+++|. +.+++|+|+||++++.++ .++||||+ T Consensus 267 i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-----~~~~~llG~PV~~~~~~~--------~~~~GDf~ 333 (387) T protein:vir:94 267 IINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-----TPAEKVFGKPVVFTDAAV--------KPIVGDFN 333 (387) T ss_pred HHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-----cCCccccccceEEecCCC--------ceeeechh Confidence 99999999999999999999999998887777777888874 346789999999998764 46899999 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++. .++++.+....+ ..++++.||++.|+|+++++|+||++|+... T Consensus 334 ~~~~--~~~~~~~~~~~~----------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:94 334 YFGI--NYDGTTYDTDKD----------VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred hhhh--hhhhhhheeccc----------ccCCceEEEEEEEeCcEeechhheEEEEeec Confidence 7653 344454433222 2368899999999999999999999998754 No 96 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=4.6e-46 Score=269.15 Aligned_cols=270 Identities=10% Similarity=-0.017 Sum_probs=209.8 Q ss_pred Ceecc-ccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNK-GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~g-g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) -+.+| +.+||+++.++|++.+++.+++++++++.|+++ ..++|+....+.+.|++|++.+++++++|++|++.+|+++ T Consensus 152 ~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~ 230 (466) T protein:vir:80 152 RAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVG 230 (466) T ss_pred hhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc-eeEeeeecCCcceeecccccccccccccccceeecceeee Confidence 23333 478999999999999999999999999999875 5789988888899999999999999999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc---------- Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA---------- 149 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~---------- 149 (298) ++++||+|||+ |+.+++.++|.++|++++++++|.++++|+| +|. +.|+.+..+..+..... T Consensus 231 ~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~ail~G~G--~~~---P~Gil~~~~~~~~~~~~~~~~~~~~~~ 302 (466) T protein:vir:80 231 GFIPIPNSTLE---DSDLNLADEILDAIGQAIGFALDKAILYGTG--TKM---PVGIVTRLAQTTQPPNWGTKAPAWTNL 302 (466) T ss_pred eehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhheeeccC--CCC---cceeeeccccccccccccccccccccc Confidence 99999999996 5558999999999999999999999999964 232 33332221111110000 Q ss_pred -----------ccccchhHHHHHHHhhhhhhcCCccc-EEEEcHHHHHHHHHhh---ccCCceeecccccccCcceecce Q lcl|Aclame:pro 150 -----------PRGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQK---DLQGNALFPELKWGATPDTINGL 214 (298) Q Consensus 150 -----------~~~~~~~~~~i~~~~~~l~~~~~~~~-~~vm~~~~~~~L~~lk---d~~G~~l~~~~~~~~~~~~l~G~ 214 (298) .......+.++...+..+...+.++. .|+||+.++..|.+++ +.+|.+++.+.. ...|+|+ T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~----~~~i~G~ 378 (466) T protein:vir:80 303 STTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNN----TMPIVGG 378 (466) T ss_pred chhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCC----ccccccc Confidence 00011112222223333344445544 5999999999999998 678888775421 2359999 Q ss_pred eeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Q lcl|Aclame:pro 215 PVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARV 294 (298) Q Consensus 215 PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l 294 (298) ||+++++||.+ .+++|||+.+ .++.|+++++..+++. .|.+|++.||+++|+|+++++|+||+++ T Consensus 379 pvv~s~~~~~~------~~~~g~~~~y-~i~~r~~~~i~~~~~~--------~f~~d~~~~r~~~r~dg~~~~~~afv~~ 443 (466) T protein:vir:80 379 DIVILDFIPDN------DIIGGYGSLY-LLAERADIKLAQSEHV--------RFIEDQTVFKGTARYDGKPVFGEGFVAV 443 (466) T ss_pred ceeecCccCcc------ceeeeccccE-EEEeecceEEEechhh--------hhhcCcEEEEEEEEEccEEeccCceEEE Confidence 99999999864 4788999875 5889999999887653 5899999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 295 TEAN 298 (298) Q Consensus 295 ~~a~ 298 (298) +.++ T Consensus 444 ~~~~ 447 (466) T protein:vir:80 444 NIAN 447 (466) T ss_pred EecC Confidence 9998 No 97 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.9e-45 Score=265.75 Aligned_cols=266 Identities=12% Similarity=-0.025 Sum_probs=203.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) =...||+|||+++..+|++.+.+.|+++++|+++++++ ..++|+.++.+.+.|++|+++++ +++++|+++++.+||++ T Consensus 83 ~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~ 161 (377) T protein:vir:96 83 GGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLT 161 (377) T ss_pred CCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-ceEEEEecCCcceeEeecccccccccCccceeEeeeeeeEE Confidence 34556899999999999999999999999999999864 58999999899999999988765 57999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc----------- Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE----------- 148 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----------- 148 (298) ++++||+|||+ |+.++++++|.+++++++++++|.++++|+| ++ .|.|+.......+.... T Consensus 162 ~~~~is~~ll~---ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G--~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 233 (377) T protein:vir:96 162 AFVVIPKDALK---FGPKWLKQFITEQLKEAIAVALELAIVKGNG--LL---QPVGLLKDLSQPTVDQSTGRDITTYKTD 233 (377) T ss_pred eechhhHHHhh---cchhhHHHHHHHHHHHHHHHHHhhceEeccC--CC---cceeeeeccccccccccccccccceeec Confidence 99999999996 5568999999999999999999999999964 22 33344322211110000 Q ss_pred ---cccccchhHHHHHHHhhhhhhcCC-----------cccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecce Q lcl|Aclame:pro 149 ---APRGIADPNGAIENAVELLTGVDA-----------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGL 214 (298) Q Consensus 149 ---~~~~~~~~~~~i~~~~~~l~~~~~-----------~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~ 214 (298) .........+.+.+++..+...+. ....|+|||.++..+ .|++.|.+ ..|.+.+++|+ T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~------~~~~~~~~--~~G~~~~~l~~ 305 (377) T protein:vir:96 234 KEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL------EAKFTSRN--QFGEYVTVLPH 305 (377) T ss_pred cccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc------cccccccC--CCCCceeccCC Confidence 000011112334444433333221 234699999987755 46666654 24566788888 Q ss_pred ee--EecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceE Q lcl|Aclame:pro 215 PV--DVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFA 292 (298) Q Consensus 215 PV--~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~ 292 (298) |+ +.++.||.+ .++||||+. |.++.|++++++.+++. +|.+|++.||+.+|+|+++++++||+ T Consensus 306 p~~v~~s~~~p~~------~i~fgdf~~-Y~i~~r~~~~i~~~~~~--------~~~~d~~~f~~~~r~dG~~~d~~a~~ 370 (377) T protein:vir:96 306 GITILESLAVETG------KAIAFVANR-YDAFMATASTIEEYDQT--------FAMEDLQLYLTKNYFYGKAKDNHTAA 370 (377) T ss_pred CceEEecCCCCcc------cEEEEEcCc-EEEEEecccEEEeehhh--------hhhcCCeEEEEEEEEcCEEecCCcEE Confidence 75 567788753 488999998 56899999999887653 68999999999999999999999999 Q ss_pred EEeecC Q lcl|Aclame:pro 293 RVTEAN 298 (298) Q Consensus 293 ~l~~a~ 298 (298) +|+-+= T Consensus 371 vl~l~~ 376 (377) T protein:vir:96 371 LLTLAG 376 (377) T ss_pred EEEEec Confidence 998777 No 98 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.9e-45 Score=265.78 Aligned_cols=273 Identities=11% Similarity=-0.049 Sum_probs=200.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccc-ccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKT-HGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~-~~~~~~~~v~l~~~k~~ 79 (298) -.++||+|||+++.++|++.+++.|+++++|+++|+++ ..++|+.++.+.+.|++|+++++ .++++|+++++.+||++ T Consensus 87 ~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~ 165 (383) T protein:vir:78 87 VGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLT 165 (383) T ss_pred CCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-ceEEEEEcCCcceEEeecccccccccCcceeeEeecceeeE Confidence 56677899999999999999999999999999999875 47999999999999999987764 67999999999999999 Q ss_pred EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc-----ccccccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV-----EAPRGIA 154 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 154 (298) ++++||+|||+ |+.++++++|.+++++++++++|.+|++|+| ++ .|.|+.......+... ....... T Consensus 166 ~~i~is~ell~---Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G--~~---qP~Gil~~~~~~~~~~~~~~~~~~~~~~ 237 (383) T protein:vir:78 166 AFVVVPKDLEK---FGPAWVKRFVVTQIEEAFAVALESAYIVGDG--ND---KPIGLNRKVGKGSTVVDGVYAEKAATGT 237 (383) T ss_pred eeccchHHHhh---ccHHHHHHHHHHHHHHHHHHHHhhheEeccC--CC---CceeeeeccCCcccccccccccccccch Confidence 99999999996 5668999999999999999999999999964 33 2334322111111000 0111222 Q ss_pred hhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhc---cCCceeeccc----ccccCcceeccee--eEecCccccc Q lcl|Aclame:pro 155 DPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKD---LQGNALFPEL----KWGATPDTINGLP--VDVNKTVSDM 225 (298) Q Consensus 155 ~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd---~~G~~l~~~~----~~~~~~~~l~G~P--V~~s~~~~~~ 225 (298) ..++++..+...+.. ++....|+||..++..+++++. ..+.+.|.+. ...|.+.+++|+| |+.++.||.+ T Consensus 238 ~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~ 316 (383) T protein:vir:78 238 LTFANPKTTVNELTD-VYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEK 316 (383) T ss_pred hhhhhhHHHHHHHHH-HHhccchhcccchhhhcCceEEEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCCCcc Confidence 334555555554432 3333345555555555555441 1111112111 1234445677776 5668888753 Q ss_pred cccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 226 SLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 226 ~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++||||+. |.++.|++++++++++. +|.+|++.||+.+|+|+++++++||++|+-+. T Consensus 317 ------~iifgdfs~-Y~i~~r~~~~i~~~~~~--------~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~ 374 (383) T protein:vir:78 317 ------KAISYVAER-YDALIGGPLDIGTYDQT--------LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNI 374 (383) T ss_pred ------cEEEeeccc-eEEEecccceEEecchh--------hhhcCceEEEEEEEEcCEEecCCeEEEEEEEe Confidence 478999998 56899999999887653 69999999999999999999999999987666 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=7.3e-41 Score=240.64 Aligned_cols=283 Identities=14% Similarity=0.095 Sum_probs=218.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEeCC----cceEEeeccccccccccceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKP-IPFNGEKVFTFTMD----SEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~-~~~~~~~ip~~~~~----~~a~~v~E~~~~~~~~~~~~~v~l~~ 75 (298) .-++||+|+|.++ +++|+.+++.+++++++++++ +++....||+...+ +...|.+|.+..++++++|+++++.+ T Consensus 17 ~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~ 95 (314) T protein:vir:41 17 PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEM 95 (314) T ss_pred ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCCccCCcccccccceeeee Confidence 4556889999886 689999999999999999985 56777899987643 33456777888899999999999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---ccccccccccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT---ASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~---~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) ||+...++||+|+|+++. ...+++++|...+|+++++.++..+++|+++..-. ...+.|........ ........ T Consensus 96 ~kl~~~v~is~e~L~D~a-~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~-~~~~~~~~ 173 (314) T protein:vir:41 96 KELVTKVVLEDEALEDNI-EQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQ-YTDAEPED 173 (314) T ss_pred EEEEEeecccHHHHHhhh-chhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccc-eeecCccc Confidence 999999999999997543 12489999999999999999999999997532110 11223332211111 11112233 Q ss_pred cchhHHHHHHHhhhhhhcCCc---ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDAD---VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~---~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) ...+.+.+.+++..++..|++ ..+|+||+.+..+++++++.+|+++|.+...++.+.+|+|+||+..+.||. .+.+ T Consensus 174 ~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~-~~~~ 252 (314) T protein:vir:41 174 ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDA-LGDD 252 (314) T ss_pred cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccc-cCCC Confidence 345677889999999998865 457999999999999999999999999988889999999999999999975 4445 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccc--eEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK--FARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a--~~~l~~a~ 298 (298) +..++||||++. .++.+..++++..++. ..+++.|.++.|+|+.+...+| .+.+++++ T Consensus 253 ~~~i~fgd~~nl-v~~~~~~ir~~~~~~a----------~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 253 KARALLTVPTNL-VYGFWRNIRIEPKRDA----------AMRRTEYIASLRADCNYEDENAAVAAVIDMSS 312 (314) T ss_pred CceEEEechhhe-EEEeeceeEEeecccC----------cCCeEEEEEEEEeceEEEEcCcEEEEEeeccC Confidence 778999999986 4677777766554432 4677899999999999876544 44557777 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=7e-40 Score=235.26 Aligned_cols=282 Identities=12% Similarity=0.042 Sum_probs=208.3 Q ss_pred Cee-ccccccchhHHHHHHHHHHhhchhhhhcceee-cCCCceEEEEEeCC----cceEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTDLISKVAGKSSIARLSAQKP-IPFNGEKVFTFTMD----SEIDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat-~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~-~~~~~~~ip~~~~~----~~a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) .+. +||+|+|++ .+++|+.+.+.|++++++++++ +.+....+++...+ ....|.+|.++.++++++|+++++. T Consensus 21 ~~d~~Gg~l~P~~-~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~ 99 (315) T protein:vir:41 21 VPDLGRGVLSVDR-FGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNTLY 99 (315) T ss_pred CcCCCCceechHH-HHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcCCCCCCccccceeeec Confidence 222 456666655 5789999999999999999864 55544556554321 2345888888999999999999999 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-cccccccccccccc-ccccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT-ASAVIGTNHFDSKV-TQKVEAPRG 152 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~-~~~~~~~~~~~~~~-~~~~~~~~~ 152 (298) +|++.+.++||+|+|+++.. .++++++|.+++++++++.++.++++|++..... ...+.|........ ........+ T Consensus 100 ~~~l~~~~~it~elL~D~~~-~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a 178 (315) T protein:vir:41 100 MREMVTKVVIHEDAIEDNIE-GKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESDVDPEA 178 (315) T ss_pred eeeeeeeccccHHHHHhhhc-cccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceeccccccccccccccc Confidence 99999999999999974431 2589999999999999999999999996431110 11223332211111 111112223 Q ss_pred cchhHHHHHHHhhhhhhcCCc---ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDAD---VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~---~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) .....+.+.+++..++..|++ +.+|+||++++.++|++||++|+|+|.+..+.+.+.+|+|+||+..+.||.... + T Consensus 179 ~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~-~ 257 (315) T protein:vir:41 179 EDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSILYDGRPVQYVPALEALND-G 257 (315) T ss_pred ccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCceecccceEecccccccCC-C Confidence 334567888999999998874 457999999999999999999999999999999999999999999999986543 4 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccc--eEEEee Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK--FARVTE 296 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a--~~~l~~ 296 (298) ...++||||++. .++.+.+++++..++. ..+.+.|.++.|+|+.+...++ ++.+|. T Consensus 258 ~~~ilf~d~~nl-~~~~~~~i~i~~~~~a----------~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 258 KSRALFVVPTQL-VYGFWRNIKVVPDYDA----------EMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred CccEEEecccce-EEEeccccEEEeeecC----------CCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 567899999985 4788888888766553 2355678888999997665443 444444 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.2e-36 Score=217.48 Aligned_cols=284 Identities=8% Similarity=0.014 Sum_probs=212.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeec-c-ccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAE-S-GKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E-~-~~~~~~~~~~~~v~l~~~k~ 78 (298) -..++|++||+++.++|++.+.+.|+++++++++++.+....+|....++.+.|+++ + ...+.++++|+++++.+|++ T Consensus 22 ~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~ 101 (321) T protein:vir:31 22 DDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKA 101 (321) T ss_pred cccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcccccccccccccccccceeeeeeeeeEEE Confidence 234568899999999999999999999999999999988889999887777788763 3 35667889999999999999 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-cccccccccccccccccccccccchhH Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTA-SAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) .+.++||+|+|.++. ...++.++|.+.+++++++.++..+++|++...-.. ....|........... .......... T Consensus 102 ~~~~~it~e~L~d~a-~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~-~~~~~~~~~~ 179 (321) T protein:vir:31 102 TVAWDLPREVVQENP-EGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVET-IDAADDILDN 179 (321) T ss_pred EeehhccHHHHHhhh-cchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccc-ccccccccCH Confidence 999999999996543 235899999999999999999999999964321100 0012221111111111 1122233446 Q ss_pred HHHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEE Q lcl|Aclame:pro 158 GAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 158 ~~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~ 235 (298) +.+.+++..++..|.+ ..+|+||+.++.++++.....+.++|.+...++.+.+|+|+||+.+++||++ .+++ T Consensus 180 d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~------~il~ 253 (321) T protein:vir:31 180 DLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD------KAMF 253 (321) T ss_pred HHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCCC------cEEE Confidence 7889999999888864 3379999999988775444455578888777778889999999999999864 5899 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++|++.+ ++.++++++++..+.... ....+.+......++|+.+.+++|++.+++.- T Consensus 254 t~~~nl~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~ 310 (321) T protein:vir:31 254 TDPQNLI-YALYRDLEIDVLTESDKV-----SERDLHARYFMRGDDDFAIENTEAVVLAEGLG 310 (321) T ss_pred eccccEE-EEEeeccEEEEeecCccc-----cccceeeEeeeeeecceeEeccccEEEEecCC Confidence 9999975 677888888776543211 11234444445567999999999999999755 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=8.4e-36 Score=212.88 Aligned_cols=270 Identities=13% Similarity=0.016 Sum_probs=186.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) ....+|++.|+++...+...+...+++++++++.+.+ ...+|..+....+.|+.||+.+|+++++|+++++.++++++ T Consensus 243 ~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~ 320 (517) T protein:vir:97 243 ERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYK 320 (517) T ss_pred cccccccccchHHHHHHHHhhhhhccceeeeeecccc--ceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhh Confidence 1223578899999999999988889988888765544 46777777777889999999999999999999999999999 Q ss_pred EEeecHHHhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHH Q lcl|Aclame:pro 81 GARISDEFMYASD-EEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGA 159 (298) Q Consensus 81 ~~~iS~ell~~~~-d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (298) ++++|+|||+++. |+...|.++|.++|+++++++++.++++|+|.+ ....+....... .........+...+ T Consensus 321 ~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg----~~~~gi~~~a~~---~~~~~~~~~~~~~d 393 (517) T protein:vir:97 321 YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG----VSETQIYPVVGD---AWATNVTGTTNIQE 393 (517) T ss_pred hhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC----cccccccccccc---cccccccccchHHH Confidence 9999999997543 334459999999999999999999999996532 222222221111 11111111122222 Q ss_pred HHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEEEeecc Q lcl|Aclame:pro 160 IENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA 239 (298) Q Consensus 160 i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~ 239 (298) +.+.+..... ....+.|+|||.+|..|++|||++|||||++...++.+.+++|.. +.+|..... ...++..+ T Consensus 394 ~i~~l~~a~~-~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~----~~~~~~~~~---~~~~~~~~ 465 (517) T protein:vir:97 394 LLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFN----RLVQSVAVD---EKTAVSLS 465 (517) T ss_pred HHHHHHHHhh-hccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCcc----ccccccccC---ceeEeecc Confidence 2222222111 123567999999999999999999999999988888888999842 223322211 12223344 Q ss_pred ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEe--ecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVT--EAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~--~a~ 298 (298) + |.+..+.++++ .+..+ +.+|+..|+.++|+++.|+.|++|++.. ..+ T Consensus 466 ~-y~i~~~~g~~~--~~~fd--------~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 466 G-YVTNGSRGMEF--EQGTI--------LVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) T ss_pred c-cEEEeecceee--eeeee--------cccCceeEeeeeeeccccccccceEEEEEcCCC Confidence 3 33444545442 22221 2468888999999999999999888753 333 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=5.4e-33 Score=197.49 Aligned_cols=257 Identities=14% Similarity=0.094 Sum_probs=198.4 Q ss_pred Ce---ecccc-ccchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MV---LNKGT-LFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 ma---t~gg~-lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) || |+.+. ++|+.+++.+++.+++.+.+.+++.+. ..++..++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99 55564 456667777888998888888877653 23345689999988899999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +.++++++.+.+|+++..+ +..++.+++.+++++++++++|..++.... +.. .. .. T Consensus 81 ~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~---~a~--------------~~----~~ 136 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDALS---KST--------------QT----VE 136 (272) T ss_pred EEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHhc---ccc--------------cc----cc Confidence 9999999999999998754 346799999999999999999999985421 100 00 01 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccC---CceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQ---GNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) ....++.|.++..++...+.....|+|||.++..|++.+..+ ......+....+..++++|+||+++++||.+ T Consensus 137 ~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~---- 212 (272) T protein:vir:30 137 ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKG---- 212 (272) T ss_pred cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcc---- Confidence 122478899999999988888899999999999998775221 1112223344556789999999999999853 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++++.+. .++.+..+++++++.+++. .++...++...|+++++.+|+++++++.+- T Consensus 213 --t~~~~~~-~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:30 213 --TAYMVRK-GALRIMLKRNTMVETDRDI----------TKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred --eEEEEcC-CeEEEEecCCceeeecccc----------ccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 3455443 4667788888888766543 234567888999999999999999998888 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=5.4e-33 Score=197.49 Aligned_cols=257 Identities=14% Similarity=0.094 Sum_probs=198.4 Q ss_pred Ce---ecccc-ccchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MV---LNKGT-LFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 ma---t~gg~-lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) || |+.+. ++|+.+++.+++.+++.+.+.+++.+. ..++..++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 99 55564 456667777888998888888877653 23345689999988899999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +.++++++.+.+|+++..+ +..++.+++.+++++++++++|..++.... +.. .. .. T Consensus 81 ~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~---~a~--------------~~----~~ 136 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDALS---KST--------------QT----VE 136 (272) T ss_pred EEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHhc---ccc--------------cc----cc Confidence 9999999999999998754 346799999999999999999999985421 100 00 01 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccC---CceeecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQ---GNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) ....++.|.++..++...+.....|+|||.++..|++.+..+ ......+....+..++++|+||+++++||.+ T Consensus 137 ~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~---- 212 (272) T protein:vir:98 137 ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKG---- 212 (272) T ss_pred cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcc---- Confidence 122478899999999988888899999999999998775221 1112223344556789999999999999853 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++++.+. .++.+..+++++++.+++. .++...++...|+++++.+|+++++++.+- T Consensus 213 --t~~~~~~-~a~~~~~~~~~~ve~~r~~----------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:98 213 --TAYMVRK-GALRIMLKRNTMVETDRDI----------TKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred --eEEEEcC-CeEEEEecCCceeeecccc----------ccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 3455443 4667788888888766543 234567888999999999999999998888 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.96 E-value=1.9e-33 Score=200.00 Aligned_cols=256 Identities=11% Similarity=0.006 Sum_probs=160.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecccccccc--ccceeeEEEe---e Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHG--GVTLAPQTMV---P 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~--~~~~~~v~l~---~ 75 (298) -.++.+..+|+++...+.......+++...++. ...+.....|++|....+.+ ..++.+.++. . T Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 282 (480) T protein:vir:40 214 NVVNSLGSITSKYARKSGIYDGAMKARFQGLTL-----------AEDGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMA 282 (480) T ss_pred cccccccccccchhhheeechhhhhhhhhccee-----------eeccccceeeeeeeecccccccccccccchhhHHHH Confidence 001111223333322222222222222221111 11234456788876554432 2234555555 4 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) ++++.+.+.|.++| +|+ .+|.++|.++|++.++++++.+|++|.+++.. .+.+.... .+ . .+.... T Consensus 283 ~~l~~~~k~t~~lL---DDa-~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~---~~~g~~~~---~~-~---~~~~~~ 348 (480) T protein:vir:40 283 EAYLQMDKATVRGV---NDS-GALSEYVMSEMVNRVIQKVEYNMILGSVDGSN---GFYGLKTA---TD-G---WTKQIE 348 (480) T ss_pred HHHHHhHHHHHHHh---hhh-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc---ccccceee---cc-c---ccccch Confidence 67777788888877 344 47999999999999999999999999543321 12222111 11 0 011111 Q ss_pred hHHHHHHHhhhhhhcCCccc-EEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC-ccccccccccceE Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK-TVSDMSLTQRDRA 233 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~~~-~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~-~~~~~~~~~~~~~ 233 (298) ..+.|.+++.++...+..++ .|+|||.+|..|++|||++|||||++..+.+.+.+|||+||++++ .+|. +.. T Consensus 349 ~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~------~~~ 422 (480) T protein:vir:40 349 YTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPK------DEV 422 (480) T ss_pred hHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeeccccC------Ccc Confidence 23455667777888887766 699999999999999999999999999999999999999988764 3442 233 Q ss_pred EEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 234 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .++.++.++.++.++ ++..+..+ +..|+..|+++.|+++.+.+|+|++.||.-= T Consensus 423 ~~~~~~~~~~~~d~~---~~~~~~~~--------~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~ 476 (480) T protein:vir:40 423 AVYNHDEYVLIGDLN---VENYNDFD--------LRYNVEQWLSETLVGGSIRGKNRSAYLKKKG 476 (480) T ss_pred eeeeCCccEEEEecc---cceecccc--------cccchhhhhhhhhhceeeEccccEEEEEecc Confidence 455666666666652 22222221 4578888999999999999999999997544 No 106 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.91 E-value=3.6e-26 Score=160.08 Aligned_cols=284 Identities=14% Similarity=0.136 Sum_probs=213.4 Q ss_pred Cee----ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccc-cceeeEEEee Q lcl|Aclame:pro 1 MVL----NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGG-VTLAPQTMVP 75 (298) Q Consensus 1 mat----~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~-~~~~~v~l~~ 75 (298) |++ ..+.+.|.++...|||.+.++|.+++..+..++.++.+.+++.+.-+.+.|+..++..+++. .+|.+++... T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l 104 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSEL 104 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeeecCCcceeeeccccccccCcceeeeeeech Confidence 332 25788999999999999999999999999888888889999999999999999999888765 4799999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +.+++.+.|.+++.. ...+..+...+-.+...|+++++++.++++|+.. +..|.|+................+.. T Consensus 105 ~~l~~~~~Vd~~iad-l~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~----~~~F~GL~~~~~~~q~i~tg~~gg~~ 179 (330) T protein:vir:94 105 TTLIGDAEVNGLIQA-TRSDFMDQTSVQVASKAKSIGRQYQASMITGDGT----GNSFQGMMGLVAASQTISAGANGGTL 179 (330) T ss_pred hhhhhhHHHHHHHHH-hcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC----CccccchhhcCCcccEEecCCCCCCC Confidence 999999999999852 2234567778888999999999999999999532 33555664433322222222234556 Q ss_pred hHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccc---ccccCcceecceeeEecCccccccc----c Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPEL---KWGATPDTINGLPVDVNKTVSDMSL----T 228 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~---~~~~~~~~l~G~PV~~s~~~~~~~~----~ 228 (298) ..+++..++.++......++.|+||+++..+|+.+++..|++-..+. ..+...-++.|+|++.++.+|.+.+ . T Consensus 180 T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~ 259 (330) T protein:vir:94 180 TFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTAT 259 (330) T ss_pred CHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCC Confidence 67889999988877777889999999999999999998887654332 2233345788999999999987532 3 Q ss_pred ccceEEEeeccce----EEEEee----cceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANG----FKWGYA----KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~----~~~~~~----~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.+.+++..|... ...+.. .+++++.--. . -.++.+.+|.+++++.++..|+|+++|+++. T Consensus 260 ~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~--~-------~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 260 NATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA--K-------ENADETITRVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred CceeEEEEeecccccccceEeecCCCCCcceeeeCCC--c-------cccceeeEEEEEeeeeEEechhheeeecccc Confidence 4566777666421 224443 2444432111 1 1345577899999999999999999999999 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.90 E-value=1.9e-25 Score=156.11 Aligned_cols=261 Identities=17% Similarity=0.142 Sum_probs=190.5 Q ss_pred Cee---ccccccchh-HHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVL---NKGTLFDPE-LVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat---~gg~lip~~-~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||. .-..+|-|| +.+-+.+.+.+...+.+++..... ++..++||.+...+++.++.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 883 335555455 555566788777777887765442 244689999988788999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++...+ +..++.+.+.++++..+++++|+.++.... +.. .... T Consensus 81 ~~i~~~~k~~~vtD~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~---~~~------------------~~~~ 136 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALS---GYGDPIGESNKQLGLSLANKVDDDLLSAAK---TTS------------------QTVS 136 (272) T ss_pred EeeehhhccccccHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHhc---ccc------------------cccc Confidence 9999999999999987543 335788999999999999999999885421 100 0011 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCc--eeecccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGN--ALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~--~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) ....++.|.++..++..++.....++|||.++..|+|..+-... ....+....+..++++|+||++++.||.+.+. . T Consensus 137 ~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~-~ 215 (272) T protein:vir:36 137 TKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL-M 215 (272) T ss_pred ccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCcee-E Confidence 23357889999999999888888999999999999875432211 11222233455679999999999999965432 1 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..++++ +.++.+...+.+++|..++.. +..-.+++..+++.++.+|+++++++.+. T Consensus 216 ~~~~~~--~gA~~~~~~~~~~vE~~R~~~----------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 216 FKIVSN--SPALKLVLKRGVQVETDRDIV----------TKTTVITADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred EEEEec--ccceeeeecCCcccccccchh----------hcCcEEEEEEEEEEEEEcCccEEEEeecC Confidence 223333 456666777777777555432 12235788899999999999999998888 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.89 E-value=1.1e-24 Score=152.00 Aligned_cols=256 Identities=14% Similarity=0.083 Sum_probs=189.6 Q ss_pred Ceecc---c-cccchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---G-TLFDPELVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... + .++|+-+.+-+.+.+.+...+.+++..... ++..++||++...+++.++.||+.++.++.++++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 77654 4 455666677778888888877777765422 234689999987788999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+....++++...++ ..++.+.+.+++++++++++|..++....... .. ... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-----------------~~---~~~ 137 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LT---VNA 137 (274) T ss_pred EEeeeecccccccHHHHHhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------cc---ccc Confidence 99999998999999976433 35688899999999999999999986531110 00 011 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCcee-----ecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNAL-----FPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l-----~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||.++..|+|.. .-+++ -.+....+..++++|+||++++.+|.. T Consensus 138 ~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~-- 213 (274) T protein:vir:93 138 DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG-- 213 (274) T ss_pred cccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhh--hhcccccccccccceeecccceecCeeEEEcCCCCcc-- Confidence 122478899999999888888889999999999997642 11111 011233456779999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +.++.. ..++.+..++.+.++..++.. +..-.+++..+++.++.+|+++++++.+- T Consensus 214 ----t~~l~~-~gai~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~ 269 (274) T protein:vir:93 214 ----TAILAK-KGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ----eEEEEe-CCeEEEEecCCcccccccchh----------hcccEEEEEEEEEEEEEcCCceEEEeeCc Confidence 344443 345667777777776555432 22346788899999999999999999999 No 109 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.88 E-value=1.2e-23 Score=146.30 Aligned_cols=256 Identities=13% Similarity=0.103 Sum_probs=187.3 Q ss_pred Ce---eccccccchh-HHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MV---LNKGTLFDPE-LVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 ma---t~gg~lip~~-~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) || |.-+.+|-|| +.+-+.+.+.+...+.+++..... ++..++||++...+++..+.|++.++..+.++++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 98 4436666555 455667777777777777654321 244689999987788889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++...+ +..++.+.+.+++++++++.+|..++......+ . .... T Consensus 81 ~~i~~~~~~~~i~D~~~~~---~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~-----------------~---~~~~ 137 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLS---GFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-----------------L---TVEA 137 (274) T ss_pred EEEEeeeceeeecHHHHHh---hcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----------------C---CcCc Confidence 9999988889999987543 335788999999999999999998886421100 0 0111 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceee-----cccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF-----PELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~-----~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||.++..|+|... .+++- .+....+..++++|++|++++.+|.. T Consensus 138 ~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~-- 213 (274) T protein:vir:96 138 DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSAS--DNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKG-- 213 (274) T ss_pred ccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhccc--ccccccccccccceeecccceecCeeEEEcCCCCcc-- Confidence 2234788999999998888888899999999999988641 11110 12223455789999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ .+++.+..++++++|..++.. +..-.+++..+|+.+++||+++++++++. T Consensus 214 ---t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:96 214 ---EALLAK--KGAVKLITKRDFFLEKDRDAS----------RKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) T ss_pred ---eEEEEe--CcceeeeecCCcccccccchh----------hcccEEEEeeEEEEEEEcCccEEEEEcCc Confidence 234444 456667777777776554322 23346788899999999999999999999 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.87 E-value=8.1e-24 Score=147.17 Aligned_cols=258 Identities=15% Similarity=0.094 Sum_probs=190.0 Q ss_pred Ce--ec-cccccchh-HHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MV--LN-KGTLFDPE-LVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 ma--t~-gg~lip~~-~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) || ++ =+.+|-|| +.+-+.+.+.+...+.+++.... .++..++||.+..-+++.++.||+.++..+.+.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 88 32 25556555 55556778888888878876543 2455689999988788999999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) ...++.+..+.++++....+. .+..+.+.++++.++++++|..++.-....+ ... .. T Consensus 81 a~i~~~~k~~~~tD~a~~~~~---~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~-----------------~~~---~~ 137 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGY---GDPQGEAVRQHGLAIANKVDNDVLEALRGTK-----------------LTV---SA 137 (276) T ss_pred EEeehccccccccHHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------ccc---cc Confidence 999999999999999765443 4677899999999999999998874321100 000 11 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCcee---ecccccccCcceecceeeEecCccccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNAL---FPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l---~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~ 229 (298) ....++.|.++..++..++....+++|||.++..|+|+.+.+-... -.+....+..++++|++|++++.+|.. T Consensus 138 ~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~---- 213 (276) T protein:vir:10 138 DIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEG---- 213 (276) T ss_pred cccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcc---- Confidence 1234788999999998888888899999999999998643221100 011223455679999999999999853 Q ss_pred cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 230 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ ..++.+...+++.+|.++... +..-.+++..+++.++.+|+++++++.+. T Consensus 214 -t~~l~~--~gAi~~~~~~~~~vE~dRd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (276) T protein:vir:10 214 -EAILAK--RGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKVTKGA 269 (276) T ss_pred -eEEEEe--ccceeeeecCCceeecccchh----------hcccEEEEeeEEEEEEEcCcceEEEecCC Confidence 234444 456667778888877666432 22345678889999999999999999999 No 111 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.86 E-value=2.2e-23 Score=144.83 Aligned_cols=258 Identities=12% Similarity=0.096 Sum_probs=187.7 Q ss_pred Ceecc--ccccchh-HHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNK--GTLFDPE-LVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~g--g~lip~~-~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l 73 (298) ||+.+ +.+|-|| +..-+.+.+.+...+.+++..... ++..++||.+...+++.++.||+.++..+.+.++.++ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~~ 82 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKRQA 82 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccceeeE Confidence 55443 4556555 555567788888888888765443 3446899999877889999999999999999999999 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) ..++.+..+.++++...++. .++...+.++++.++++++|..++.-.+..+ ... ... T Consensus 83 ~i~~~~~~~~i~D~~~~~~~---~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~-----------------~~~---~~~ 139 (275) T protein:vir:96 83 TIRKIGKGTVLTDEALLSGY---GDPKGEAVRQHGLAIANKVDNDVLEALQGAT-----------------LKV---EAD 139 (275) T ss_pred EeehhcccccccHHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------ccc---ccc Confidence 99999999999999764333 4678889999999999999999885422110 000 111 Q ss_pred chhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccC---CceeecccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 154 ADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQ---GNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 154 ~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~---G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) ...++.|.++..++..++.....++|||.++..|+|+.+.+ ....-.+....+..++++|++|++++.+|.+ T Consensus 140 ~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~----- 214 (275) T protein:vir:96 140 ITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEG----- 214 (275) T ss_pred ccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcc----- Confidence 23478899999999888878889999999999998863111 0000012233556789999999999999854 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ +.++.+..++.+++|..++.. +..-.+++..+++.++.+|+++++++..- T Consensus 215 t~~i~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 270 (275) T protein:vir:96 215 EAILAK--RGAVKLITKRDFFLETERHAS----------HKSTALFSDKHYVAYLYDESKVVKITKSA 270 (275) T ss_pred eEEEEe--ccceeeeecCCcccccccchh----------hcCcEEEEeEEEEEEEEcCccEEEEEecc Confidence 234554 456677777777777665432 22345778899999999999999997655 No 112 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.86 E-value=5.3e-23 Score=142.72 Aligned_cols=264 Identities=13% Similarity=0.059 Sum_probs=186.0 Q ss_pred Ce---ecccccc-chhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MV---LNKGTLF-DPELVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 ma---t~gg~li-p~~~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) || |.-+.++ |+-+.+-+.+.+++...+.+++..... ++..++||++...+++.++.|++.++..++++++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 66 5556555 555667777888887777777754322 234689999987788899999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++...+ +..++.+.+.+++++++++.+|..++..... .... ... ...... T Consensus 81 ~~i~~~~~a~~v~D~~~~~---~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~---a~~~------~~~-----~~t~~~ 143 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLS---GYGDPVEEAQKQIRMAIASKVDNDILEEALT---TTLE------VKG-----AINIGL 143 (278) T ss_pred EeeehhhccccccHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHHhc---cccc------ccc-----ccccch Confidence 9999988889999986543 3357889999999999999999988865311 1000 000 011122 Q ss_pred cchhHHHHHHHhhhhhhcCCcc-cEEEEcHHHHHHHHHhhccCC---ceeecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADV-TGIAINPSFRSALAKQKDLQG---NALFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~-~~~vm~~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) ....++.+.++..++..++... ..++|||..+..|+|....+. ..+-.+....+..++++|++|++++++|.+ T Consensus 144 ~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~--- 220 (278) T protein:vir:80 144 IDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADG--- 220 (278) T ss_pred hhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcc--- Confidence 3345677888888887766543 358899999999987642111 011122234556789999999999999853 Q ss_pred ccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ .+++.+...+.+++|..++.. +..-.+++..+++.++.||+++++|+++= T Consensus 221 --t~~l~~--~gAi~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~yg~~v~~~~~~v~it~~a 276 (278) T protein:vir:80 221 --NALAVK--AGALKTFLKRNLLAESGRDMD----------HKLTKFNADQHYAVALVDETKAVKVVPVA 276 (278) T ss_pred --eEEEEe--ccceeeeecCCcccccccchh----------hccceeeeeeEEEEEEEcCcceEEEeecc Confidence 234443 456667777777776555432 22346778899999999999999998877 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.85 E-value=1.9e-22 Score=139.62 Aligned_cols=256 Identities=13% Similarity=0.068 Sum_probs=186.3 Q ss_pred Ceecc---c-cccchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---G-TLFDPELVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... + .++|+-+.+-+.+.+++...+.+++..... ++..++||++...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 76633 3 345555666677888777777777765432 345689999887778889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+....++++...++ ..++.+.+.+++++++++.+|..++.-..... .. ... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-----------------~~---~~~ 137 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LT---VNA 137 (274) T ss_pred EEeeeecceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-----------------cc---ccc Confidence 99999988899999865433 35678899999999999999999885421100 00 011 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCcee-----ecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNAL-----FPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l-----~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||.++..|+|.. .-+++ -.+....+..++++|++|++++.+|.. T Consensus 138 ~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~-- 213 (274) T protein:vir:94 138 DITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG-- 213 (274) T ss_pred cccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhh--hhhccccCcccccceeccccceecCeeEEEcCCCCcc-- Confidence 123478899999999888888889999999999997641 11111 011233456789999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ ..++.++.++.+.+|..++.. +..-.+++..+++.++.+|+++++++.+. T Consensus 214 ---t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:94 214 ---TAILAK--KGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---eEEEEe--CcceEeeecCCceeccccchh----------hcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 234444 456667777887777655432 12235678889999999999999999998 No 114 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.85 E-value=1.9e-22 Score=139.62 Aligned_cols=256 Identities=13% Similarity=0.068 Sum_probs=186.3 Q ss_pred Ceecc---c-cccchhHHHHHHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---G-TLFDPELVTDLISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... + .++|+-+.+-+.+.+++...+.+++..... ++..++||++...+++..+.||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 76633 3 345555666677888777777777765432 345689999887778889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+....++++...++ ..++.+.+.+++++++++.+|..++.-..... .. ... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-----------------~~---~~~ 137 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-----------------LT---VNA 137 (274) T ss_pred EEeeeecceecccHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-----------------cc---ccc Confidence 99999988899999865433 35678899999999999999999885421100 00 011 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCcee-----ecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNAL-----FPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l-----~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||.++..|+|.. .-+++ -.+....+..++++|++|++++.+|.. T Consensus 138 ~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~-- 213 (274) T protein:vir:97 138 DITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAG-- 213 (274) T ss_pred cccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhh--hhhccccCcccccceeccccceecCeeEEEcCCCCcc-- Confidence 123478899999999888888889999999999997641 11111 011233456789999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++ ..++.++.++.+.+|..++.. +..-.+++..+++.++.+|+++++++.+. T Consensus 214 ---t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:97 214 ---TAILAK--KGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---eEEEEe--CcceEeeecCCceeccccchh----------hcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 234444 456667777887777655432 12235678889999999999999999998 No 115 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.84 E-value=5.1e-22 Score=137.32 Aligned_cols=256 Identities=13% Similarity=0.056 Sum_probs=185.7 Q ss_pred Ceecc---cccc-chhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---GTLF-DPELVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g~li-p~~~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... +.+| |+-+.+-+.+.+.+...+.+++..-. .++..++||.+...+++..+.|++.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 66522 4445 55555666777777777767665432 2355789999987788889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++...++ ..++.+.+.++++.++++.+|..++.-....+ ... .. T Consensus 81 ~~i~~~~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------------~~~---~~ 137 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------------LTV---EA 137 (274) T ss_pred EEeeeeecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------ccc---cc Confidence 99999888999999865433 34678899999999999999999885421110 000 11 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceee-----cccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF-----PELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~-----~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||..+..|++.. .-+++- .+....+..++++|++|++++.+|.. T Consensus 138 ~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~-- 213 (274) T protein:vir:95 138 DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG-- 213 (274) T ss_pred cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCc-- Confidence 123478899999999887778888999999999998742 111110 12234556789999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++| ..++.+...+.+++|..+... +..-.+++..+++.++.||++++++++.+ T Consensus 214 ---t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 214 ---TAILAK--KGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---eEEEEe--ccceeeeecCCcccccccccc----------cccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 245555 456667777787877665432 23345778899999999999999999999 No 116 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.84 E-value=5.1e-22 Score=137.32 Aligned_cols=256 Identities=13% Similarity=0.056 Sum_probs=185.7 Q ss_pred Ceecc---cccc-chhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---GTLF-DPELVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g~li-p~~~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... +.+| |+-+.+-+.+.+.+...+.+++..-. .++..++||.+...+++..+.|++.++..+.+.++.+ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 66522 4445 55555666777777777767665432 2355789999987788889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++...++ ..++.+.+.++++.++++.+|..++.-....+ ... .. T Consensus 81 ~~i~~~~~a~~i~D~~~~~~---~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-----------------~~~---~~ 137 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSG---YGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-----------------LTV---EA 137 (274) T ss_pred EEeeeeecceeehHHHHhhc---cchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----------------ccc---cc Confidence 99999888999999865433 34678899999999999999999885421110 000 11 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceee-----cccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF-----PELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~-----~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ....++.|.++..++..++.....++|||..+..|++.. .-+++- .+....+..++++|++|++++.+|.. T Consensus 138 ~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~-- 213 (274) T protein:vir:96 138 DITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAG-- 213 (274) T ss_pred cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCCCc-- Confidence 123478899999999887778888999999999998742 111110 12234556789999999999999853 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++| ..++.+...+.+++|..+... +..-.+++..+++.++.||++++++++.+ T Consensus 214 ---t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 214 ---TAILAK--KGAVKLITKRDFFLETDRDPS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ---eEEEEe--ccceeeeecCCcccccccccc----------cccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 245555 456667777787877665432 23345778899999999999999999999 No 117 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.83 E-value=1e-21 Score=135.69 Aligned_cols=255 Identities=13% Similarity=0.065 Sum_probs=186.0 Q ss_pred Ceecc---ccc-cchhHHHHHHHHHHhhchhhhhcceee----cCCCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK---GTL-FDPELVTDLISKVAGKSSIARLSAQKP----IPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g---g~l-ip~~~~~~ii~~~~~~s~i~~~~~~~~----~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||... +.+ +|+-+.+-+.+.+.+...+.+++.... .++..++||.+...+++..+.|++.++..+.+.++.+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 77643 444 454455666677777666666665532 2355789999987778889999999999999999999 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRG 152 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 152 (298) +..++.+..+.++++....+ ..++.+.+.++++.++++.+|..++.-...... . ... T Consensus 81 ~~i~~~~~~~~i~D~~~~~~---~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~-----------------~---~~~ 137 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSG---YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL-----------------T---VNA 137 (274) T ss_pred EEeeeecceeeecHHHHHhc---ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----------------c---ccc Confidence 99999999999999865433 246778999999999999999998865321110 0 011 Q ss_pred cchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh------ccCCceeecccccccCcceecceeeEecCcccccc Q lcl|Aclame:pro 153 IADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK------DLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 153 ~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk------d~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~ 226 (298) ....++.|.++..++..++.....++|||..+..|++.. ++++. .+....+..++++|++|++++.+|.. T Consensus 138 ~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~p~~- 213 (274) T protein:vir:12 138 DITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELG---DDIIVKGAFGEALGAIIVRSNKLEAG- 213 (274) T ss_pred cccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhcccccccc---ccceecccceeecCeeEEEeCCCCcc- Confidence 123478899999999888778888999999999998742 12111 12234556778999999999999853 Q ss_pred ccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 227 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...++| ..++.++..+.+++|.++... +..-.+++..+++.++.||+++++++.+. T Consensus 214 ----t~~l~~--~gA~~~~~~~~~~vE~~Rd~~----------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:12 214 ----TAILAK--KGAVKLILKRDFFLEVARDAS----------TKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred ----eEEEEe--ccceeeeecCCceeccccchh----------hcccEEEeeeEEEEEEEcCCceEEEEcCC Confidence 234555 356667777888877665432 12236788899999999999999999999 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.82 E-value=1.7e-21 Score=134.47 Aligned_cols=258 Identities=11% Similarity=0.072 Sum_probs=187.4 Q ss_pred Cee-ccccccchhHHHH-HHHHHHhhchhhhhcceeec----CCCceEEEEEeCCcceEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTD-LISKVAGKSSIARLSAQKPI----PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat-~gg~lip~~~~~~-ii~~~~~~s~i~~~~~~~~~----~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) ||. .-..+|-||+.++ +.+.+.+...+.+++..... ++..+++|.+...+++.-+.|++.++..+.+.++.... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 994 5577776666555 55677777777777765432 34568999998888899999999999999999999999 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIA 154 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (298) .++.+..+.++++....+. .+....+.++++..+++++|+.++.-.. +. . . ..... T Consensus 81 i~~~gk~~~itD~a~~~~~---~dp~~~~~~q~a~~~a~~~d~~li~~l~---~a-------~-------~----~~~~~ 136 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNV---NGTLQEASRQLAMSLADKVEIDYIAELN---KS-------K-------Q----TATVS 136 (270) T ss_pred eehhhCcceecHHHHhhhc---cchHHHHHHHHHHHHHHHHHHHHHHHhc---cc-------c-------c----ccccc Confidence 9999999999999654332 3567889999999999999998874311 10 0 0 00112 Q ss_pred hhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccceEE Q lcl|Aclame:pro 155 DPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) Q Consensus 155 ~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~ 234 (298) ..++.|.+++.++..+....++++|||.++..|+|...-.+...-.+...++..++++|++|+++++++... ...+ T Consensus 137 ~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~----~~~l 212 (270) T protein:vir:95 137 ADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSEN----TAFL 212 (270) T ss_pred cCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCCce----eEEE Confidence 346788999999988888889999999999999875421111111222344567899999999988876432 2344 Q ss_pred EeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 235 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) |+ ++++.+...+++++|.+++.. +..-.+.+..+++..+.+++++++++-+. T Consensus 213 ~~--~gAi~~~~~~~~~vEtdRd~~----------~~~d~i~~~~~y~v~~~~~skvv~~t~~~ 264 (270) T protein:vir:95 213 QR--YGAMEIVNKKKPEAYTDFDIL----------KRTHLLSTNYHYSVNLKDETGVVKVTFKP 264 (270) T ss_pred Ee--ccceeeeecCCceeeeccchh----------hcccEEEeeeEEEEEEEccceEEEEEecC Confidence 44 567778888888887666432 12235677789999999999999997655 No 119 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.80 E-value=1.8e-20 Score=128.82 Aligned_cols=284 Identities=12% Similarity=0.119 Sum_probs=198.3 Q ss_pred Ce-e---ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeec-----cccccccccceeeE Q lcl|Aclame:pro 1 MV-L---NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAE-----SGKKTHGGVTLAPQ 71 (298) Q Consensus 1 ma-t---~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E-----~~~~~~~~~~~~~v 71 (298) |. . ..+.+.+.++...|||.+.+.|.+++..+..++.++.+.+.|...-+.+.+.+. .+..+++..+|.++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 55 2 335788888999999999999999999999998888899999876555554433 34456788899999 Q ss_pred EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPR 151 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) +...+-+++.+.|.+.+..--.....+...+-.+..++++.++++..+++|+.. +..+.|+.+............. T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a----~n~F~GL~~~~~~~q~i~~~~~ 156 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA----GNEFAGLIQLCASGQKATTGAT 156 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC----CCcccchhhcCCccceeecCCC Confidence 999999999999987654211111234444445777899999999999999532 2346666555433322222233 Q ss_pred ccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh-ccCCceeeccc--ccccCcceecceeeEecCcccccc-- Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQGNALFPEL--KWGATPDTINGLPVDVNKTVSDMS-- 226 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G~~l~~~~--~~~~~~~~l~G~PV~~s~~~~~~~-- 226 (298) .+....+++..++..+......++.++|||++..+|+.+. ..+++.+++.. ..+...-++.|+|++.++.+|.+. T Consensus 157 gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~ 236 (310) T protein:vir:97 157 GSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTK 236 (310) T ss_pred CCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccc Confidence 4556678999999988777788899999999987777554 44455565433 233344689999999999998642 Q ss_pred --ccccceEEEeeccc-----eEEEEee----cceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEe Q lcl|Aclame:pro 227 --LTQRDRAIIGDFAN-----GFKWGYA----KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVT 295 (298) Q Consensus 227 --~~~~~~~~~gd~~~-----~~~~~~~----~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~ 295 (298) ..+.+.+++..|.. ++ .+.. .+++++.... . =......+|.+++++.++..|+|+++|+ T Consensus 237 ~~~~gtTsIya~r~Ge~~~~~Gv-~Gl~~~~~~glsVr~~G~--~-------~~~~v~~~~V~~Y~~~av~~~~A~a~L~ 306 (310) T protein:vir:97 237 GGTTGCTTIFAGTLDDGSRTHGI-AGLTATQAAGIQVVDVGE--S-------EDSDEHIWRVKWYCGLALFSEKGLACAD 306 (310) T ss_pred cccCCceeEEEEeeCccccccce-eccccCCccceeEEeCCc--c-------cCCcceeEEEEEeeeEEEecccceeeec Confidence 23455666655542 32 3332 2344432211 1 1345567899999999999999999999 Q ss_pred ecC Q lcl|Aclame:pro 296 EAN 298 (298) Q Consensus 296 ~a~ 298 (298) +++ T Consensus 307 ~V~ 309 (310) T protein:vir:97 307 GIT 309 (310) T ss_pred ccc Confidence 999 No 120 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.75 E-value=3.7e-20 Score=127.11 Aligned_cols=282 Identities=12% Similarity=0.072 Sum_probs=186.8 Q ss_pred Ceeccc-cccchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeccccccccccc---eeeEEEee Q lcl|Aclame:pro 1 MVLNKG-TLFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVT---LAPQTMVP 75 (298) Q Consensus 1 mat~gg-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~---~~~v~l~~ 75 (298) |+|-.+ .|||+-+++.++|.+.+-...-++...+.+.+| +..+|-.. .--++-++||++.|+.++. +++|+++. T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g-~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~ 152 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIG-IMRAYDVAEGQEIPEDSIDWQTHESPEIRV 152 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchh-eeeeccccccccccccchhhhcCCceeEEe Confidence 888766 778999999999988887777778777777443 45555433 4456779999999986554 78999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc--ccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH--FDSKVTQKVEAPRGI 153 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~--~~~~~~~~~~~~~~~ 153 (298) +|.+..+.+|+|++ +|+..++..+..+.+.+++++..|..++++... .|+ .-+.+... ....++-......++ T Consensus 153 gK~G~~Ia~SqEmI---sDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~-~gh-tvfDa~st~t~ahptGr~~~~~qNG 227 (393) T protein:vir:79 153 GKSGIRLRFTDEMI---SDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRS-HGH-TVFDNYSTNKLAHTTGLDKNGVQND 227 (393) T ss_pred chhhhhhhhHHHHh---hcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhc-ccc-eeeeccccCccceeecCCccccccc Confidence 99999999999998 478899999999999999999999999998421 111 00111000 000001111124456 Q ss_pred chhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCC---ceeecccc------cccCcceecc-----eeeEec Q lcl|Aclame:pro 154 ADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQG---NALFPELK------WGATPDTING-----LPVDVN 219 (298) Q Consensus 154 ~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G---~~l~~~~~------~~~~~~~l~G-----~PV~~s 219 (298) ....++|.++.-+++.+.+.++.++|||-.|..+.|-.--.+ ++.-+-.. +.-.|..|.| +.|+++ T Consensus 228 TlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~s 307 (393) T protein:vir:79 228 TFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLS 307 (393) T ss_pred cccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeEEEe Confidence 667899999999999999999999999999998866421111 11100000 0111222333 578889 Q ss_pred CccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEec-ccceEEEeecC Q lcl|Aclame:pro 220 KTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILD-ATKFARVTEAN 298 (298) Q Consensus 220 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~-~~a~~~l~~a~ 298 (298) +.+|-......-..+..|-...-.+-++.+++.+-.++ ..++..-++...|+|++|++ .+|++..++.+ T Consensus 308 Pfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~dd----------k~rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 308 PFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDE----------KARGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred cccccccccceeeEEEeecCCceEEEEecCcceecccc----------ccccceeeeeeeeeceeeeeCCceEEEEecce Confidence 98875433322233333333222233444444432222 23566778999999998886 67888888887 No 121 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.70 E-value=9.6e-19 Score=119.35 Aligned_cols=228 Identities=14% Similarity=0.097 Sum_probs=167.6 Q ss_pred cceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 31 SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKK 110 (298) Q Consensus 31 ~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~ 110 (298) -+-+.+ +..+++|.+ -+++.-++||++++..++++++.++..++.+..+.|++|...+.. .+...+..++++.+ T Consensus 1 ~~~~~~-Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~---gDp~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINL-ANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY---GDPIGESNKQLGLS 74 (231) T ss_pred CccccC-CceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhcc---CchHHHHHHHHHHH Confidence 111122 235889876 567889999999999999999999999999999999999765433 45678999999999 Q ss_pred HHHHHHHHHhcccccccccccccccccccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHh Q lcl|Aclame:pro 111 VARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQ 190 (298) Q Consensus 111 i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~l 190 (298) |++++|..++.-.. + ++. .......++.|.+++..+..++..+..++|||+.+..|||. T Consensus 75 iA~kvD~di~~~~~---~--------------a~l----~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~ 133 (231) T protein:vir:73 75 LANKVDDDLLKAAK---T--------------TSQ----TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKD 133 (231) T ss_pred HHHhhhHHHHHhhc---c--------------ccc----cccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhc Confidence 99999999884311 0 000 01123458899999999999888888999999999999996 Q ss_pred hccCCc--eeecccccccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhh Q lcl|Aclame:pro 191 KDLQGN--ALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLK 268 (298) Q Consensus 191 kd~~G~--~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f 268 (298) .+.+-. ..-.+....|.-++++|+||++|+.+|.+.+... .++ .-+.++.+...+++++|.+++.. T Consensus 134 ~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~-~~i--~~~gAl~~~~k~~~~vEtdRd~~--------- 201 (231) T protein:vir:73 134 ANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMF-KIV--SNSPALKLVLKRGVQVETDRDIV--------- 201 (231) T ss_pred cchhhhhhhhccceeeecccceEcceEEEEcCCCCCCceeee-eEE--eeccceeeeecccceeecccccc--------- Confidence 543221 1222334566778999999999999986543211 111 13577788888999988766543 Q ss_pred hcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 269 GYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 269 ~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +....+.+.++++..+.+|+++++++-+= T Consensus 202 -~k~~~i~~~~~y~v~l~~~~~vv~~t~~g 230 (231) T protein:vir:73 202 -TKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) T ss_pred -ccccEEEEeEEEEEEEEcCccEEEEEeec Confidence 22345678889999999999999985555 No 122 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.60 E-value=2.3e-16 Score=106.37 Aligned_cols=262 Identities=13% Similarity=0.053 Sum_probs=166.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) ||.+ .++|+.+..++++.+++.+++..++..- ...+..+.||+......+....++..++..+.+.+++++... T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 9995 6688889999999999998888887542 223446899997765566677788877777777787777775 Q ss_pred EE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 77 k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +. +.-+.|++.-.. ....++.. +.++.+++++.++|..++.-... .+ . ............ T Consensus 79 ~~~~~~~~i~d~d~~---~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~-a~--~------------~~~~~~~~~~~~ 139 (273) T protein:vir:10 79 QEKSIDFLVDDIDRV---QVAGSLEA-YTRAGATALATDTDKFIADMLVD-NG--T------------ALTGSAPTDADD 139 (273) T ss_pred eeeecceEeecHHHh---hhhccHHH-HHHHHHHHHHHHHHHHHHHHHhc-cc--c------------ccccccccchhH Confidence 53 445567653221 12234555 66778999999999877642100 00 0 001111122345 Q ss_pred hHHHHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHHhhccCCc-ee-e-cccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQGN-AL-F-PELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~lkd~~G~-~l-~-~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) .++.|.++..++...+.. .-.++++|.++..|++..+--.+ .. - ......+..++|.|++|+.++++|.+.+ T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~--- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc--- Confidence 688899999888887764 33589999999999765321111 11 0 1123456678999999999999996532 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+++.+. +.++.+.. +...++..+ ... .| ...+++.+.+|.+++||++++.|+..= T Consensus 217 ~~~~~~~-~~A~~~a~-q~~~~e~~r--~~~-----~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 217 EQFVAFH-PSAAAYVS-QIDTVEALR--DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred cEEEEEe-ccceeeee-eeehhhccc--CCC-----cc---eeeeeeeeeeeeeEeccceEEEEeccC Confidence 2344443 34443322 222333222 111 12 235788899999999999999997544 No 123 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.60 E-value=2.3e-16 Score=106.37 Aligned_cols=262 Identities=13% Similarity=0.053 Sum_probs=166.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) ||.+ .++|+.+..++++.+++.+++..++..- ...+..+.||+......+....++..++..+.+.+++++... T Consensus 1 MA~~--~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 9995 6688889999999999998888887542 223446899997765566677788877777777787777775 Q ss_pred EE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 77 k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +. +.-+.|++.-.. ....++.. +.++.+++++.++|..++.-... .+ . ............ T Consensus 79 ~~~~~~~~i~d~d~~---~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~-a~--~------------~~~~~~~~~~~~ 139 (273) T protein:vir:10 79 QEKSIDFLVDDIDRV---QVAGSLEA-YTRAGATALATDTDKFIADMLVD-NG--T------------ALTGSAPTDADD 139 (273) T ss_pred eeeecceEeecHHHh---hhhccHHH-HHHHHHHHHHHHHHHHHHHHHhc-cc--c------------ccccccccchhH Confidence 53 445567653221 12234555 66778999999999877642100 00 0 001111122345 Q ss_pred hHHHHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHHhhccCCc-ee-e-cccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQGN-AL-F-PELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~lkd~~G~-~l-~-~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) .++.|.++..++...+.. .-.++++|.++..|++..+--.+ .. - ......+..++|.|++|+.++++|.+.+ T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~--- 216 (273) T protein:vir:10 140 AFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--- 216 (273) T ss_pred HHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCc--- Confidence 688899999888887764 33589999999999765321111 11 0 1123456678999999999999996532 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+++.+. +.++.+.. +...++..+ ... .| ...+++.+.+|.+++||++++.|+..= T Consensus 217 ~~~~~~~-~~A~~~a~-q~~~~e~~r--~~~-----~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 217 EQFVAFH-PSAAAYVS-QIDTVEALR--DQD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred cEEEEEe-ccceeeee-eeehhhccc--CCC-----cc---eeeeeeeeeeeeeEeccceEEEEeccC Confidence 2344443 34443322 222333222 111 12 235788899999999999999997544 No 124 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.58 E-value=3.8e-16 Score=105.11 Aligned_cols=262 Identities=13% Similarity=0.043 Sum_probs=167.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) ||.+ .++|+.+..++++.+++.+.+.+++... ...+..++||+......+....++..++..+++.+++++... T Consensus 1 MA~~--~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:79 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccch--hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEe Confidence 9996 4789989999999999998888887442 223446899997765666678888888888888888888886 Q ss_pred EE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 77 k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +. ..-+.|+++-..+ ...++.+ +.+++++++++++|..++.-... .+ .. ........... T Consensus 79 ~~~~~~~~i~d~d~~~---~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~-a~--~~------------~~~~~~~~~~~ 139 (273) T protein:vir:79 79 QEKSIDFLVDDIDRVQ---VAGSLEA-YTRAGATALATDTDKFIADMLVD-NG--TA------------LTGSAPSDADD 139 (273) T ss_pred eecccceeeccHHHHh---hcccHHH-HHHHHHHHHHHHHHHHHHHHHhh-cc--cc------------cccccccchhh Confidence 63 4456677632222 2235655 56778899999999876532100 00 00 00111122234 Q ss_pred hHHHHHHHhhhhhhcCCc--ccEEEEcHHHHHHHHHhhcc--CCceee-cccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDL--QGNALF-PELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~--~~~~vm~~~~~~~L~~lkd~--~G~~l~-~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) .++.|.++..++..++.. .-.++++|..+..|++..+. +....- ......+..++|+|++|+.++.+|...+. T Consensus 140 ~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~-- 217 (273) T protein:vir:79 140 AFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE-- 217 (273) T ss_pred HHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCce-- Confidence 578888988888777763 23689999999998765421 111111 12234566789999999999999865331 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..+.+ .+.++.+..+ ...++..+. .. .| ...+++.+.+|.+++||++++.|+.+= T Consensus 218 -~~~a~-~~~A~~~a~~-~~~~e~~r~--~~-----~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g 272 (273) T protein:vir:79 218 -QFVAF-HPSAAAYVSQ-IDTVEALRD--QD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred -EEEEE-eccceeeeee-hhhhhcccC--cc-----cc---eeeeeeeeeeeeEEecCceEEEEeccC Confidence 22333 2344433222 223332221 11 12 245788899999999999999997544 No 125 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.57 E-value=2.8e-16 Score=105.81 Aligned_cols=277 Identities=13% Similarity=0.059 Sum_probs=164.6 Q ss_pred Ce--------eccccc-----cc-hhHH-HHHHHHHHhhchhhhhcceeecC-CCceEEEEEeC---CcceEEeeccccc Q lcl|Aclame:pro 1 MV--------LNKGTL-----FD-PELV-TDLISKVAGKSSIARLSAQKPIP-FNGEKVFTFTM---DSEIDVVAESGKK 61 (298) Q Consensus 1 ma--------t~gg~l-----ip-~~~~-~~ii~~~~~~s~i~~~~~~~~~~-~~~~~ip~~~~---~~~a~~v~E~~~~ 61 (298) |. .+++.+ +. |++. ..+.+.+++..+--.+.+..-.+ ++.+.+-+... ..++.-|+|++++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 32 233322 21 3433 34556666665555666655433 34445533221 3677789999999 Q ss_pred cccccceeeEEE-eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|Aclame:pro 62 THGGVTLAPQTM-VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFD 140 (298) Q Consensus 62 ~~~~~~~~~v~l-~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~ 140 (298) |.+.+.++.-.+ ..+|.+..+.||+|++. ....+..+...+++++++++..|+.++.-..++. +..- ... T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~---~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~-t~~~-----~~s 151 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMID---ENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPI-VPTL-----AVP 151 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccc-cccc-----cCC Confidence 999999977666 66899999999999875 5568888999999999999999998886421111 0000 000 Q ss_pred cccccccccccccchhHHHHHHHhh---------hhhhcCCcccEEEEcHHHHHHHHHhhc------cCCceeecccc-c Q lcl|Aclame:pro 141 SKVTQKVEAPRGIADPNGAIENAVE---------LLTGVDADVTGIAINPSFRSALAKQKD------LQGNALFPELK-W 204 (298) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~i~~~~~---------~l~~~~~~~~~~vm~~~~~~~L~~lkd------~~G~~l~~~~~-~ 204 (298) ..-.+.........+..+.+..+.. .-..-++.++.++|||..|..|++-++ .++.+++.... + T Consensus 152 ~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~t 231 (318) T protein:vir:10 152 TAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWT 231 (318) T ss_pred cCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhccccc Confidence 0000000000000001111111111 011335778899999999999955443 35555553332 3 Q ss_pred ccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeec--ccccccchhhhhcCc-EEEEEEEEE Q lcl|Aclame:pro 205 GATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQY--GDPDNSGLDLKGYNQ-VYIRAELFL 281 (298) Q Consensus 205 ~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~f~~n~-v~~r~~~r~ 281 (298) +.-+++++|+.|+.++++|.+ .+++.+=...-.+...++++.+-... .+++ .+.|+ ...|+.++- T Consensus 232 g~~~g~~lGl~vi~s~~~p~~------~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~------g~~~~s~~~~~~~~~ 299 (318) T protein:vir:10 232 GNFPGSVMGLNVIRSRTFPID------RVLIMERGTVGFYSDTRPLQFTALYPEGNGPN------GGPTESYRADASHKR 299 (318) T ss_pred ccccceeeceEEeecCccCCC------eeEEEecCCcceeeccccceeeecccCCCCCC------CCcchhhheehheee Confidence 444788999999999999864 34554433222244555555433221 1222 22333 567888889 Q ss_pred ccEEecccceEEEeecC Q lcl|Aclame:pro 282 GWGILDATKFARVTEAN 298 (298) Q Consensus 282 ~~~v~~~~a~~~l~~a~ 298 (298) ...|.+|+|+++|++.= T Consensus 300 ~~~V~~PkA~~~itgi~ 316 (318) T protein:vir:10 300 ALAVDQPKAALWLTGIV 316 (318) T ss_pred eeeeeCcceeEEEeecc Confidence 99999999999999987 No 126 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.55 E-value=1.1e-15 Score=102.65 Aligned_cols=283 Identities=11% Similarity=0.071 Sum_probs=178.4 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEE-eecccccc-ccccceeeEEE-eeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDV-VAESGKKT-HGGVTLAPQTM-VPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~-v~E~~~~~-~~~~~~~~v~l-~~~k 77 (298) ..+ +|.+++|+...++++.+++.++++++++++++.+...+|+++.-+....- ..|+...+ ..+.+..++.+ ..++ T Consensus 26 ~~l-~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~ 104 (360) T protein:vir:99 26 AEL-DGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDK 104 (360) T ss_pred ccc-CceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccccccCCCCCcCCcCccccCccccccc Confidence 444 46788999999999999999999999999999988888887655433222 22433222 24454555555 3345 Q ss_pred EEEEEeecHHHhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccccccccc-cccccccccccc--- Q lcl|Aclame:pro 78 VEYGARISDEFMYASD-EEKINILQAFNDGFAKKVARGIDLMAFHGVNPR-----LGTASAVIG-TNHFDSKVTQKV--- 147 (298) Q Consensus 78 ~~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~-----~g~~~~~~~-~~~~~~~~~~~~--- 147 (298) +.....+..+-++... -......+.|.+.+++++++-++...++|..+. ++....+.. ..+....+..-+ T Consensus 105 ~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~i 184 (360) T protein:vir:99 105 SYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSV 184 (360) T ss_pred eeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccccCcccchhhhhhHHHHHHhhcccchh Confidence 5666677776554322 112345688999999999999999999985332 111111111 111111110000 Q ss_pred -----------------------------ccccccchhHHHHHHHhhhhhhcCCcc----cEEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 148 -----------------------------EAPRGIADPNGAIENAVELLTGVDADV----TGIAINPSFRSALAKQKDLQ 194 (298) Q Consensus 148 -----------------------------~~~~~~~~~~~~i~~~~~~l~~~~~~~----~~~vm~~~~~~~L~~lkd~~ 194 (298) ............+.+++..|+..|.+. -.|+||+......+...... T Consensus 185 d~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R 264 (360) T protein:vir:99 185 DDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTER 264 (360) T ss_pred hccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhcc Confidence 000011123445788999999987642 27999999877666554333 Q ss_pred CceeecccccccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEE Q lcl|Aclame:pro 195 GNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVY 274 (298) Q Consensus 195 G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~ 274 (298) ..++-.....++..-+.+|+|++..+.+|++ .+++.++++. .++..++++++...+... + ...... T Consensus 265 ~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~------~~mlT~p~NL-i~g~~~~iri~~~~e~~~------~-~~~~~~ 330 (360) T protein:vir:99 265 EDPLGSAVIFGDSDITPFSYDLVGVNGFPDE------YMMFTDPNNL-AFGLYEEMELDQSTDTDK------V-HEQRLH 330 (360) T ss_pred CcccchhheecccccccceeeeEEcCCCCCC------ceEEeccCce-eEEeeeeeEEeecccchh------h-hhhcee Confidence 3344333344444556889999999999854 5889999987 488888888865433211 0 112222 Q ss_pred --EEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 275 --IRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 275 --~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .-.+..+|+.+.+++|+|++++.- T Consensus 331 ~~~~~~~~~D~~iee~~Av~~vt~~~ 356 (360) T protein:vir:99 331 SRNWLEGQFDFQIKEQQAGVLVTDLE 356 (360) T ss_pred eeEEEEEEeeEEEEecccEEEEecCC Confidence 224567999999999999998877 No 127 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.49 E-value=2.9e-15 Score=100.28 Aligned_cols=283 Identities=12% Similarity=0.029 Sum_probs=168.1 Q ss_pred Ce-ecccc-----------------ccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeccccc Q lcl|Aclame:pro 1 MV-LNKGT-----------------LFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKK 61 (298) Q Consensus 1 ma-t~gg~-----------------lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~ 61 (298) || +.+|. |.-+++..++.+...+.+.++.+.+++.+.+ ..+.||+ .+..++..+..|+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~-iG~~~~~~~~~G~~l 79 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPV-LGRTKAAYLQPGENL 79 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeee-ccceeEeeeecCcCC Confidence 55 22222 3447888999988889999999999877664 5688987 456777888888877 Q ss_pred cc--cccceeeEEEeeeEE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhc----ccccccccccccc Q lcl|Aclame:pro 62 TH--GGVTLAPQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFH----GVNPRLGTASAVI 134 (298) Q Consensus 62 ~~--~~~~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~----G~~~~~g~~~~~~ 134 (298) .. .++..++.++..-++ .....|.+- +...+..++.+.+.++.++++++..|+.++. +.+........+. T Consensus 80 ~~~~~~~~~~e~~ltID~~~y~~~~Vddi---D~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:94 80 DDKRKDMKHTEKTINIDGLLTADVLIYDI---EDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIA 156 (347) T ss_pred CCCcCCccccceEEEEcchhhhhhhhhhH---HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 54 467778777766654 222223221 1112335688899999999999999988863 2111111111111 Q ss_pred ccc-----ccccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhh-ccCCceeeccccccc Q lcl|Aclame:pro 135 GTN-----HFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQK-DLQGNALFPELKWGA 206 (298) Q Consensus 135 ~~~-----~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~ 206 (298) +.. .+....+............++.|.++...|-..+.... .++++|+.+..|.+.. +..+.+-.......+ T Consensus 157 g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G 236 (347) T protein:vir:94 157 GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTG 236 (347) T ss_pred cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccc Confidence 111 11111111122223344568889999899888877532 4677899998887643 333333333334456 Q ss_pred CcceecceeeEecCccccccccccce-------------------EEEeeccceEEEE---------eecceEEEEeecc Q lcl|Aclame:pro 207 TPDTINGLPVDVNKTVSDMSLTQRDR-------------------AIIGDFANGFKWG---------YAKEVPLEVIQYG 258 (298) Q Consensus 207 ~~~~l~G~PV~~s~~~~~~~~~~~~~-------------------~~~gd~~~~~~~~---------~~~~~~i~~~~~~ 258 (298) ..+++.|+||+.++++|......... -+=+||++.+.+. ...++++++..+. T Consensus 237 ~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~ 316 (347) T protein:vir:94 237 SIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA 316 (347) T ss_pred eeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech Confidence 77899999999999999654221110 1223444332221 1222233333221 Q ss_pred cccccchhhhhcCcEEEEEEEEEccEEecccceE--EEeec Q lcl|Aclame:pro 259 DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFA--RVTEA 297 (298) Q Consensus 259 ~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~--~l~~a 297 (298) . ++. ..+.+.+-+|..++||++.+ .++.| T Consensus 317 -------~-~~~--~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 317 -------N-FQA--DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred -------h-hhh--hhhhhhhhhcCcccccceeEEEEecCC Confidence 1 122 24567778999999998876 66788 No 128 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.48 E-value=1.8e-14 Score=95.91 Aligned_cols=269 Identities=15% Similarity=0.094 Sum_probs=163.7 Q ss_pred Ceecc-ccccchhHHHHHHH-HHHhhchhhhhc---------cee--ecCCCceEEEEEeC-CcceEEeecccccccccc Q lcl|Aclame:pro 1 MVLNK-GTLFDPELVTDLIS-KVAGKSSIARLS---------AQK--PIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat~g-g~lip~~~~~~ii~-~~~~~s~i~~~~---------~~~--~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~ 66 (298) ||++. +.+|-||+.+++++ ...+.+.+.+-+ ... ..++.-+++|.+.. ++++.-+.|++.++..+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 99877 78888887777654 444444443211 111 12445679998876 478888899999999999 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK 146 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 146 (298) +.++-....++.+..+.++++....+. .+.+..+.+++++.+++..++.+|.-.. |.-.......+ +.. T Consensus 81 ~t~~~~a~i~~~~k~~~~tD~a~~~sg---~dp~~~i~~q~a~~~~~~~~~~lia~l~---g~~~~~~~~~~-----~~d 149 (324) T protein:vir:59 81 NAGQDKAVLILRGNAWSSHDLAATLSG---SDPMQAIGSRVAAYWAREMQKIVFAELA---GVFSNDDMKDN-----KLD 149 (324) T ss_pred ccceeeEEEEeecCceeehhhhhhhcc---chHHHHHHHHHHHHHHHHHHHHHHHHHH---Hhhhccccccc-----eee Confidence 988877777777777888887543232 3667889999999999999988764311 00000000000 001 Q ss_pred cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccc Q lcl|Aclame:pro 147 VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 147 ~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~ 226 (298) .+.........+.|.++..++..+.....+|+|||.++..|++..-. .++... ..+..-++++|+||++++.||... T Consensus 150 vsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~~s-~~~~~i~~~~G~~VivdD~~p~~~ 226 (324) T protein:vir:59 150 ISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLI--EFVKDS-QSGIRFPTYMNKRVIVDDSMPVET 226 (324) T ss_pred eeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhh--hhcccc-ccCceeeeecccEEEEeCCCCccc Confidence 11111222345789999999988877888999999999999976421 112111 112345689999999999998643 Q ss_pred cccc----ceEEEeeccceEEEEee-cceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 227 LTQR----DRAIIGDFANGFKWGYA-KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 227 ~~~~----~~~~~gd~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .... .+.+|+ .+++.+... ..+.+|+++... .+.-.+..+.++ +++|.++..-+.+. T Consensus 227 ~~~~~~~y~s~l~~--~GAi~~~~~~~~v~vE~dRd~~----------~g~~~l~~r~~~---~~~p~G~s~~~~~~ 288 (324) T protein:vir:59 227 LEDGTKVFTSYLFG--AGALGYAEGQPEVPTETARNAL----------GSQDILINRKHF---VLHPRGVKFTENAM 288 (324) T ss_pred cCCCCceEEEEEEe--cCeEEEeecCCCcceecccCcc----------ccceEEEEeeEE---EeEeeeEEeccccc Confidence 3222 234444 456666553 345666665432 223334445554 34555554432221 No 129 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.47 E-value=5.6e-15 Score=98.71 Aligned_cols=287 Identities=11% Similarity=0.007 Sum_probs=165.7 Q ss_pred Ce-----------ecc-ccccchhHHHHHHHHHHhhchhhhhcceeec---CCCceEEEEEeCCcceEEeeccccccccc Q lcl|Aclame:pro 1 MV-----------LNK-GTLFDPELVTDLISKVAGKSSIARLSAQKPI---PFNGEKVFTFTMDSEIDVVAESGKKTHGG 65 (298) Q Consensus 1 ma-----------t~g-g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~---~~~~~~ip~~~~~~~a~~v~E~~~~~~~~ 65 (298) |+ ++. ..+||+-+..++++.+++.+++.++++..+. .+..++||+.. .+.+.-+.++..++..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 33 222 2478888899999999999888888765432 24468999864 66777778888888777 Q ss_pred cceeeEEEeeeE-EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIK-VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT 144 (298) Q Consensus 66 ~~~~~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 144 (298) ++-.++++...+ ...-..|+++-.. ....++...+.++.++++++++|..++.-.....+...... . . .. T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~---~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~----~-~-~~ 150 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEI---QASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNV----F-S-SS 150 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcc----c-c-Cc Confidence 777787787744 3556777775321 33457888999999999999999888753211111111100 0 0 00 Q ss_pred cccccccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhccCC-ceeecccccccCcceecceeeEecCc Q lcl|Aclame:pro 145 QKVEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDLQG-NALFPELKWGATPDTINGLPVDVNKT 221 (298) Q Consensus 145 ~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd~~G-~~l~~~~~~~~~~~~l~G~PV~~s~~ 221 (298) ............++.|.++...|...+... -.++++|..+..|++...-.. .+.-......+..++++|++|+.+++ T Consensus 151 ~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~ 230 (341) T protein:vir:94 151 NGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSL 230 (341) T ss_pred cccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecc Confidence 111112223345788888888887777643 257899999999976431111 11112234456678999999999999 Q ss_pred cccccccccceE------------E-----E----eeccc--eEEEEeecce-EEEEee-ccc----c-cccchhhhh-- Q lcl|Aclame:pro 222 VSDMSLTQRDRA------------I-----I----GDFAN--GFKWGYAKEV-PLEVIQ-YGD----P-DNSGLDLKG-- 269 (298) Q Consensus 222 ~~~~~~~~~~~~------------~-----~----gd~~~--~~~~~~~~~~-~i~~~~-~~~----~-~~~~~~~f~-- 269 (298) +|...+...... + + ++++. ++ .+.++.+ .+++.+ ... . .......|. T Consensus 231 lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl-~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~ 309 (341) T protein:vir:94 231 IGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATF-TGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENR 309 (341) T ss_pred ccccccccccccccceecccccccccccccccccccccccEEEE-EEecccccceeeecchhhhccccccccccccchhh Confidence 997543321100 0 0 01111 11 0111111 111110 000 0 000000011 Q ss_pred cCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 270 YNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 270 ~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) +-...+++.+-||.+++||++.+.|+.+- T Consensus 310 ~~~~~i~~~~~~G~~~lrp~~~v~~~~~~ 338 (341) T protein:vir:94 310 EQVWLMVGRQAYGARLYRPLHAVNIHTTG 338 (341) T ss_pred hhhhhhhhhhhhcccccCcceeEEEecCc Confidence 11234567778999999999998886555 No 130 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.46 E-value=1.7e-14 Score=96.11 Aligned_cols=279 Identities=11% Similarity=-0.039 Sum_probs=168.8 Q ss_pred Ceeccc-cccchhHHHHHHHHHHhhchhhhhcceeec---CCCceEEEEEeCCcceEEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKG-TLFDPELVTDLISKVAGKSSIARLSAQKPI---PFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~---~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) |+++.- .++|+.+..++++.+++.+++..++..... .+..++||+.. .+++..+.++..++..+++..++++... T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~~~~~~~~~~~itID 93 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVNLQARTDSEFTFTVT 93 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCcccccccCCceEEEEEe Confidence 665554 778888899999999998888888765433 23468899865 5678889999988888888888777774 Q ss_pred EE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccccccccccccccccccc Q lcl|Aclame:pro 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTNHFDSKVTQKVEAPRGI 153 (298) Q Consensus 77 k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) +. ..-..|+++-. .....++.+.+.+++++++++++|+.++.-....... .........+.............. T Consensus 94 ~~~~~~~~Idd~D~---~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~ 170 (381) T protein:vir:80 94 KYKESSFMIEDIVN---TQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPA 170 (381) T ss_pred eeeecceeechHHH---HhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccchh Confidence 43 33466766422 1233568889999999999999999887532111111 111111111111111122222334 Q ss_pred chhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhc-cCCceeecccccccCcceecceeeEecCcccccccccc Q lcl|Aclame:pro 154 ADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD-LQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) Q Consensus 154 ~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd-~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~ 230 (298) ...++.|.++...|..++... -.++++|..+..|++... .+-.+.-......+..++|+|++|+.++++|....... T Consensus 171 ~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~ 250 (381) T protein:vir:80 171 PLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVIVTTQIGINSLTGY 250 (381) T ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEEEeecccccccccce Confidence 456888999999888877643 368999999999976432 12223333445667788999999999999997543321 Q ss_pred ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEe-cccceEEEeecC Q lcl|Aclame:pro 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL-DATKFARVTEAN 298 (298) Q Consensus 231 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~-~~~a~~~l~~a~ 298 (298) .. ..|-... .. . .+.-.++.. -|.++..+++....+|..+. +...+-...+|. T Consensus 251 ~~-~agap~~---~~---~-~~~~~~~~g-------~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~ 304 (381) T protein:vir:80 251 VN-GQGAPTQ---PT---P-GVLGSPYLP-------DQAGTANVVNTGSASDLAVSLSYFGLPVFSGAG 304 (381) T ss_pred ee-ecccccc---cc---c-ccccccccc-------ccccceeeeeeeeeeceeeeeeeccceeeecce Confidence 11 1110000 00 0 000011110 13344456677777777774 555555555555 No 131 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.41 E-value=1.8e-14 Score=95.87 Aligned_cols=284 Identities=10% Similarity=-0.038 Sum_probs=163.5 Q ss_pred Ceeccc----cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeccccccc-cccceeeEEEe Q lcl|Aclame:pro 1 MVLNKG----TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTH-GGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg----~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~-~~~~~~~v~l~ 74 (298) ..-+.| .|.-+.+..++++...+.|.++.+.+...+.+ ..+.||+. +..++..+..|+.+.. .+++-+++++. T Consensus 17 ~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~~l~~~~~~~~~~~~l~ 95 (332) T protein:vir:78 17 ARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGTPIVGDAGIKANEKTLV 95 (332) T ss_pred ccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCCCCCCCCCCCCceEEEE Confidence 222223 25558889999999999999999998877664 56899986 4566666666666543 34665666665 Q ss_pred eeEEE-EEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIKVE-YGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) Q Consensus 75 ~~k~~-~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) .-+.- ....| +.+ +...+..++.+.+.++.++++++.+|..++.-.-.+.....+..+..+... ..-........ T Consensus 96 ID~~ky~~~~V-ddi--D~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~-~~~~~~~~~~~ 171 (332) T protein:vir:78 96 MDDLLVSSQFV-YSL--DEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFH-VNIGAGNTNDA 171 (332) T ss_pred EehhhhhHHHH-HhH--HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccccccc-cccCCccccCH Confidence 55421 11222 122 112334578899999999999999998876421000000000000000000 00011112234 Q ss_pred chhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhccC--Cc-eee-ccccccc-CcceecceeeEecCcccccc Q lcl|Aclame:pro 154 ADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQ--GN-ALF-PELKWGA-TPDTINGLPVDVNKTVSDMS 226 (298) Q Consensus 154 ~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~~--G~-~l~-~~~~~~~-~~~~l~G~PV~~s~~~~~~~ 226 (298) ...++.|.++...|...+.... .++++|..+..|.+.+|.. .+ +.- ......+ ..++++|++|+.++++|... T Consensus 172 ~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~ 251 (332) T protein:vir:78 172 QAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLY 251 (332) T ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEEEecCccccCc Confidence 4578889999999988888644 3577999999997754321 00 000 0111122 25789999999999999765 Q ss_pred ccccc--------eEEEeeccceEEE---------EeecceEEEEee-cccccccchhhhhcCcEEEEEEEEEccEEecc Q lcl|Aclame:pro 227 LTQRD--------RAIIGDFANGFKW---------GYAKEVPLEVIQ-YGDPDNSGLDLKGYNQVYIRAELFLGWGILDA 288 (298) Q Consensus 227 ~~~~~--------~~~~gd~~~~~~~---------~~~~~~~i~~~~-~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~ 288 (298) +.... ..+-|+|++...+ ....++++++.+ +... ..| .-.+++.+.+|.+++|| T Consensus 252 g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~-----~~~---~d~i~~~~~~G~~v~rP 323 (332) T protein:vir:78 252 GQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV-----QYQ---GDLIVGKLAMGCGSLRT 323 (332) T ss_pred ccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccch-----hhh---HhhhhhhhhhcCceecc Confidence 43221 1234444442211 111122232221 1111 112 13467778899999999 Q ss_pred cceEEEeec Q lcl|Aclame:pro 289 TKFARVTEA 297 (298) Q Consensus 289 ~a~~~l~~a 297 (298) ++++.|+-| T Consensus 324 e~~v~l~~a 332 (332) T protein:vir:78 324 SVAGSFQAA 332 (332) T ss_pred cceEEEeeC Confidence 999999999 No 132 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.39 E-value=1.3e-14 Score=96.68 Aligned_cols=284 Identities=12% Similarity=0.022 Sum_probs=164.8 Q ss_pred Ceec-cc------------------cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccc Q lcl|Aclame:pro 1 MVLN-KG------------------TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGK 60 (298) Q Consensus 1 mat~-gg------------------~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~ 60 (298) ||.. +| .|.-+++..++.....+.+.++.+.+++.+.+ +++.+|+. +..++..+..|++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCCC Confidence 7733 32 12237788999999999999999999888875 56889985 5677777777888 Q ss_pred cccc--ccceeeEEEeeeEE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----c---cccccc Q lcl|Aclame:pro 61 KTHG--GVTLAPQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----N---PRLGTA 130 (298) Q Consensus 61 ~~~~--~~~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~---~~~g~~ 130 (298) ...+ ++.-++++|..-++ .....|.+ + +.-.+..++.+.+.++.++++++..|+.++.-. . +.+... T Consensus 80 l~~t~~~~~~~e~~l~ID~~~y~~~~VdD-i--D~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~ 156 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGLLTADVLIYD-I--EDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENI 156 (344) T ss_pred CCCCCCCcccceEEEEEcchhhhhhhhhh-H--HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 7654 45667766665542 12222221 1 111333578899999999999999998886321 1 111111 Q ss_pred ccc-ccccccccc-cccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhc-cCCceeecccccc Q lcl|Aclame:pro 131 SAV-IGTNHFDSK-VTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQGNALFPELKWG 205 (298) Q Consensus 131 ~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd-~~G~~l~~~~~~~ 205 (298) .++ .+....... ..............++.|.++...|...+.... ..+++|..+..|.+-+. .+..+.-...... T Consensus 157 ~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~ 236 (344) T protein:vir:10 157 TGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEK 236 (344) T ss_pred ccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceee Confidence 110 000000000 011111112223467888888888888887533 46889999998865442 2222222222334 Q ss_pred cCcceecceeeEecCccccccccccceEE---------------Eeeccce---------EEEEeecceEEEEeeccccc Q lcl|Aclame:pro 206 ATPDTINGLPVDVNKTVSDMSLTQRDRAI---------------IGDFANG---------FKWGYAKEVPLEVIQYGDPD 261 (298) Q Consensus 206 ~~~~~l~G~PV~~s~~~~~~~~~~~~~~~---------------~gd~~~~---------~~~~~~~~~~i~~~~~~~~~ 261 (298) +..++++|+||+.++++|.+.......++ ..+|++. +......+++++..+. . T Consensus 237 G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~--~- 313 (344) T protein:vir:10 237 GSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARR--A- 313 (344) T ss_pred eEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccc--h- Confidence 55678999999999999864322211111 1133221 1111222223333221 1 Q ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 262 NSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 262 ~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..|. ..+++.+-+|.+++||++.+.++-++ T Consensus 314 ----~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~ 343 (344) T protein:vir:10 314 ----NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKT 343 (344) T ss_pred ----hHHH---HHHHHHhhcccceecccceEEEEeec Confidence 1222 24567778999999999998888888 No 133 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.38 E-value=2e-13 Score=90.25 Aligned_cols=284 Identities=13% Similarity=-0.038 Sum_probs=162.4 Q ss_pred Ceec-------------cc--cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccccc Q lcl|Aclame:pro 1 MVLN-------------KG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHG 64 (298) Q Consensus 1 mat~-------------gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~ 64 (298) |++- ++ .|.-+++..++.....+.+.++.+.+++.+.+ +.+.||+. +..++....-|+++..+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~~ 79 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVVQ 79 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCCC Confidence 4443 23 23337788999988888899999999998875 46899975 66778888778888877 Q ss_pred ccceeeEEEeeeEE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcc----cc--ccccccccc-ccc Q lcl|Aclame:pro 65 GVTLAPQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHG----VN--PRLGTASAV-IGT 136 (298) Q Consensus 65 ~~~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G----~~--~~~g~~~~~-~~~ 136 (298) .++-++.++....+ .....|.+- +.-.+..++.+.+.++++++++++.|.+++.. .. .+.....++ .|. T Consensus 80 ~~~~~~~~l~ID~~l~~~~~Vddi---D~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~ 156 (334) T protein:vir:80 80 KNVSDKLNLTVDTVLYARHFFDKF---DEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGI 156 (334) T ss_pred CcccCceEEEEeeeeehhhhHhhH---HHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCc Confidence 77778877777663 222223221 11133356889999999999999999987632 11 111101111 111 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCcc-----cEEEEcHHHHHHHHHhhccCCc-eeecc---cccccC Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV-----TGIAINPSFRSALAKQKDLQGN-ALFPE---LKWGAT 207 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-----~~~vm~~~~~~~L~~lkd~~G~-~l~~~---~~~~~~ 207 (298) ........+............+.+.++...+-..+... -..+++|+.+..|.+-+.--.+ |.-.+ ...++. T Consensus 157 ~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~ 236 (334) T protein:vir:80 157 LLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGR 236 (334) T ss_pred ceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccccccee Confidence 11111111111122223334456667777777776652 3579999999999764321111 11000 112344 Q ss_pred cceecceeeEecCcccccccccc-----ceEEEeeccceEEE-Eeecc--------eEEEEeecccccccchhhhhcCcE Q lcl|Aclame:pro 208 PDTINGLPVDVNKTVSDMSLTQR-----DRAIIGDFANGFKW-GYAKE--------VPLEVIQYGDPDNSGLDLKGYNQV 273 (298) Q Consensus 208 ~~~l~G~PV~~s~~~~~~~~~~~-----~~~~~gd~~~~~~~-~~~~~--------~~i~~~~~~~~~~~~~~~f~~n~v 273 (298) .++++|+||+.|+++|....++. ...+-|||+....+ ..++. ++.+..++ .. .|.. T Consensus 237 i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~--~~-----~~~d--- 306 (334) T protein:vir:80 237 IAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEE--KK-----DFGH--- 306 (334) T ss_pred EEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeec--hh-----hHHH--- Confidence 67899999999999997654322 12344565543222 22222 22222221 11 1111 Q ss_pred EEEEEEEEccEEecccceEEEe--ecC Q lcl|Aclame:pro 274 YIRAELFLGWGILDATKFARVT--EAN 298 (298) Q Consensus 274 ~~r~~~r~~~~v~~~~a~~~l~--~a~ 298 (298) .+++.+-+|.+++||+|++.++ ..+ T Consensus 307 ~i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 307 YLDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred HHHHHHHcCCceeccceEEEEEEeeec Confidence 1233345899999997776664 444 No 134 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.38 E-value=3e-14 Score=94.70 Aligned_cols=286 Identities=11% Similarity=0.015 Sum_probs=160.9 Q ss_pred Ce-ecccc-----------------ccchhHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeccccc Q lcl|Aclame:pro 1 MV-LNKGT-----------------LFDPELVTDLISKVAGKSSIARLSAQKPIP-FNGEKVFTFTMDSEIDVVAESGKK 61 (298) Q Consensus 1 ma-t~gg~-----------------lip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 61 (298) || +.+|. |.-+.+..++.....+.|.++.+.++.... +..+.||+.. ..++..+..|+.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG-~~t~~~~~~g~~l 79 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecc-ceeeeeecCCCCC Confidence 66 33333 333778888888888889999999987765 4568888854 5666766677766 Q ss_pred cc--cccceeeEEEeeeEEEE-EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-----c-cccccccc Q lcl|Aclame:pro 62 TH--GGVTLAPQTMVPIKVEY-GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV-----N-PRLGTASA 132 (298) Q Consensus 62 ~~--~~~~~~~v~l~~~k~~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~-----~-~~~g~~~~ 132 (298) +. .++...+.++..-+.-- ...|.+- +...+..++.+.+.++.++++++..|..++.-. . ........ T Consensus 80 ~~~~~~~~~~e~~ltiD~~~y~~~~Vddi---D~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:33 80 DDKRKDIKHTEKVIHIDGLLTADVLIYDI---EDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIE 156 (347) T ss_pred CCCCCCCccceEEEEechhhhhhHHHhhH---HHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 44 34555665555433211 1122111 111233568888999999999999999886210 0 00000000 Q ss_pred ccc----ccccccccccccccccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhc-cCCceeecccccc Q lcl|Aclame:pro 133 VIG----TNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD-LQGNALFPELKWG 205 (298) Q Consensus 133 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd-~~G~~l~~~~~~~ 205 (298) ..+ .......+.............++.|.++...|...+.+. -..+++|..+..|.+-.. .+..+.-...... T Consensus 157 ~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~ 236 (347) T protein:vir:33 157 GLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPER 236 (347) T ss_pred cccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccccccc Confidence 000 000111111111112233456888899999998888753 357999999999876542 2333432333445 Q ss_pred cCcceecceeeEecCccccccccccce----------------EEEeeccce--EE-----EEeecceEEEEeecccccc Q lcl|Aclame:pro 206 ATPDTINGLPVDVNKTVSDMSLTQRDR----------------AIIGDFANG--FK-----WGYAKEVPLEVIQYGDPDN 262 (298) Q Consensus 206 ~~~~~l~G~PV~~s~~~~~~~~~~~~~----------------~~~gd~~~~--~~-----~~~~~~~~i~~~~~~~~~~ 262 (298) +..++++|++|+.++++|......... ..-++|+.. +. ++..+.+.+++..+.+.. T Consensus 237 G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~- 315 (347) T protein:vir:33 237 GTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN- 315 (347) T ss_pred ceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchh- Confidence 667899999999999999754332111 111222211 11 111122222233222221 Q ss_pred cchhhhhcCcEEEEEEEEEccEEecccceEEEe--ecC Q lcl|Aclame:pro 263 SGLDLKGYNQVYIRAELFLGWGILDATKFARVT--EAN 298 (298) Q Consensus 263 ~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~--~a~ 298 (298) .| .-.+++.+.+|.+++||++.+.|+ +.. T Consensus 316 ----~~---~d~i~~~~~~G~~vlrP~~av~i~~~~~~ 346 (347) T protein:vir:33 316 ----YQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred ----hh---hHhhhhhhhcCCceecccceEEEecCCCC Confidence 11 234677788999999999988774 444 No 135 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.35 E-value=3.9e-13 Score=88.59 Aligned_cols=274 Identities=12% Similarity=0.044 Sum_probs=160.6 Q ss_pred Ce---eccccccchhHHHHHHH-HHHhhchhhhhcce---------eecCCCceEEEEEeC-CcceEEeeccc-cccccc Q lcl|Aclame:pro 1 MV---LNKGTLFDPELVTDLIS-KVAGKSSIARLSAQ---------KPIPFNGEKVFTFTM-DSEIDVVAESG-KKTHGG 65 (298) Q Consensus 1 ma---t~gg~lip~~~~~~ii~-~~~~~s~i~~~~~~---------~~~~~~~~~ip~~~~-~~~a~~v~E~~-~~~~~~ 65 (298) || |.-..+|-||+....++ ...+.+.+.+-+-. ..-++.-+++|.+.. ++++.-+.|++ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 99 66688888887777664 34444444332211 122455689999874 57788888885 688888 Q ss_pred cceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ 145 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 145 (298) .+-++-....++.+..+.++++.... +..+.+..+.+++++.+.+..+..++.-...-.+.... ........... T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~---~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~--~~~~~~~~~~~ 155 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVV---AGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTA--GEKGALEETHV 155 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhh---cchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhc--ccchhhhhhhe Confidence 88888777778878888888875432 33567788999999998888877766421100000000 00000000011 Q ss_pred ccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccc Q lcl|Aclame:pro 146 KVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDM 225 (298) Q Consensus 146 ~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~ 225 (298) ............+.+.++..++........+|+|||.++..|++.+--+ ++ +....+..-++++|++|++++.||.. T Consensus 156 ~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~~~~G~~VivdD~~p~~ 232 (330) T protein:vir:10 156 SDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQ--YI-QPTTATINIPTYLGYRVIIDDGIAPT 232 (330) T ss_pred ecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhh--hh-cccccCcccccccceEEEEeCCCCCC Confidence 1111222223457789999998888778889999999999998754111 11 11111234578999999999999864 Q ss_pred cccccceEEEeeccceEEEEee---cceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 226 SLTQRDRAIIGDFANGFKWGYA---KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 226 ~~~~~~~~~~gd~~~~~~~~~~---~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+. -.+.+|+ .+++.+... ..+.+|++++.. .++-.+-.+.++ +++|.++..-+.+. T Consensus 233 ~~~-yt~yl~~--~GAi~~~~~~~~~~v~~EtdRd~~----------~g~~~l~~r~~~---~~hp~G~s~~~~~~ 292 (330) T protein:vir:10 233 GDI-YTSYLFR--TGSIGLNTGNPSGLTTFETSREAA----------KGNDMIYTRRAL---VMHPYGVKWTGAEV 292 (330) T ss_pred CCc-eeEEEEe--cCceeeecccCCccccccccCCcc----------ccceEEEEeeEE---Eeeeeeeeeccccc Confidence 332 2233444 345545432 234555555432 122223333443 45666666654321 No 136 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.35 E-value=7e-13 Score=87.23 Aligned_cols=283 Identities=12% Similarity=0.015 Sum_probs=157.4 Q ss_pred Ceec-----------cc--cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVLN-----------KG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat~-----------gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |+.. +. .|.-+++..++.+.....+.++.+.+++.+.+ +++++|+. +..+++..--|+....+.+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~~~~ 79 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDASPT 79 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCCCCc Confidence 3322 22 33447788899998888999999999888775 46899996 4556666555555545566 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhcccc--cHHH-HHHHHHHHHHHHHHHHHHHHHhccc---c--cccccccccccc-c Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASDE--EKIN-ILQAFNDGFAKKVARGIDLMAFHGV---N--PRLGTASAVIGT-N 137 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~d--~~~~-l~~~i~~~la~~i~~~~d~~~l~G~---~--~~~g~~~~~~~~-~ 137 (298) .-++.++..-.+- +++.++.+.++ +..+ +.+++.+++++++++.+|..++.-. . ...+....+.+. . T Consensus 80 ~~~k~~itID~ll----~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 80 EFDKNRLVVDTTV----IARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred ccCcEEEEeccee----eechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 6677666665532 22222222222 2234 4578888999999999999886311 0 000000000000 0 Q ss_pred cc-ccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhccCC-ceee--cccccccCccee Q lcl|Aclame:pro 138 HF-DSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQG-NALF--PELKWGATPDTI 211 (298) Q Consensus 138 ~~-~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~~G-~~l~--~~~~~~~~~~~l 211 (298) +. ................+.+.|.++...|-+.+.+.. ..+++|..+..|.+-.+=-. .|.. ......+...++ T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEE Confidence 00 011111122222333456667788888888777544 57999999998876322000 0110 111234555789 Q ss_pred cceeeEecCcccccccccc-----------------ceEEEeeccc---------eEEEEeecceEEEEeecccccccch Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLTQR-----------------DRAIIGDFAN---------GFKWGYAKEVPLEVIQYGDPDNSGL 265 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~~~-----------------~~~~~gd~~~---------~~~~~~~~~~~i~~~~~~~~~~~~~ 265 (298) .|+||+.++++|...+... ..-..+|+.. ++......+++.++.++.. T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~------ 309 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKK------ 309 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccc------ Confidence 9999999999996433211 0001133322 2222222333333332211 Q ss_pred hhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 266 DLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 266 ~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .|. ..+.+.+-+|..++||++++.|+-++ T Consensus 310 -~~~---~~ida~~a~G~g~lRPeaa~~i~~~~ 338 (364) T protein:vir:10 310 -EKT---WYIDTFLAEGAIPDRWEAVAVVTAAD 338 (364) T ss_pred -eee---eeeeeehcccCcccCccceEEEEecC Confidence 011 23345667999999999999998777 No 137 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.34 E-value=1.2e-13 Score=91.46 Aligned_cols=273 Identities=11% Similarity=0.058 Sum_probs=158.2 Q ss_pred Ceecc-ccccchhHHHHHHH-HHHhhchhhhhcce---------eecCCCceEEEEEeC-CcceEEeeccccccccccce Q lcl|Aclame:pro 1 MVLNK-GTLFDPELVTDLIS-KVAGKSSIARLSAQ---------KPIPFNGEKVFTFTM-DSEIDVVAESGKKTHGGVTL 68 (298) Q Consensus 1 mat~g-g~lip~~~~~~ii~-~~~~~s~i~~~~~~---------~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~~~~~~~ 68 (298) ||++. +.+|-||+...+++ ...+.+.+.+-+-. ..-++.-+++|.+.. ++++.-+.|+..++..+.+- T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 99887 78888888777764 44444544332111 112345689998875 46888889999999998888 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) ++-....++.+..+.++++....+. .+.+..+.+++++.+++..++.+|.-...--+... . ...+.... ... T Consensus 81 ~~~~a~i~~~~kg~~~tD~a~~~sg---~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~-~-~~~~~~d~---t~~ 152 (351) T protein:vir:15 81 GKQQGIKFYQTKAYGYTDLGTMISG---APVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTK-I-ANSKVYDQ---TKV 152 (351) T ss_pred cceeEEEEeeccceehhhhhHhhcc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchh-h-cccceecc---ccc Confidence 7777777777777888887543222 36778899999999999998887753110000000 0 00001010 011 Q ss_pred cccccchhHHHHHHHhhhhhhcCCc-ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDAD-VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~-~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ........++.+.++..++..+... -++|+||+.++..|++.+--+ ++ +....+..-++++|++|++++.||.... T Consensus 153 ~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~t~~G~~VivdD~~p~~~~ 229 (351) T protein:vir:15 153 SPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE--TI-QPQNGATPFEAYNGLRIVLDDDIEIDLT 229 (351) T ss_pred cccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh--hc-cccccCcccceecceEEEEcCCCccccC Confidence 1222233467899999998886544 689999999999998754100 00 0001123357899999999999986544 Q ss_pred cccc----eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEee--c---- Q lcl|Aclame:pro 228 TQRD----RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE--A---- 297 (298) Q Consensus 228 ~~~~----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~--a---- 297 (298) .+.. +.+|+ .+++.+..+. ..+++.++..... ++-.+..+.++ +++|.++..-+. + T Consensus 230 ~~~~~~ytsyl~~--~GAi~~~~~~-~~ve~~rd~~~~~--------g~d~l~~r~~~---~~hp~G~s~~~~~~~~~~~ 295 (351) T protein:vir:15 230 DKTKPVSTSYIFA--PGAVRYSTNM-RSTETKYDPLING--------GQDVIVQKRVG---TIHVAGTSIKASFSPSKAS 295 (351) T ss_pred CCCCceeEEEEEe--cceeeeecCC-cCcceeecccCCC--------CceEEEEeeee---eeeeeeeeecccccccCcC Confidence 3322 33433 3455454443 3455555433221 11111222232 355555554211 1 Q ss_pred --C Q lcl|Aclame:pro 298 --N 298 (298) Q Consensus 298 --~ 298 (298) | T Consensus 296 sPt 298 (351) T protein:vir:15 296 FPT 298 (351) T ss_pred CcC Confidence 1 No 138 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.34 E-value=1.5e-13 Score=90.91 Aligned_cols=283 Identities=11% Similarity=0.021 Sum_probs=164.4 Q ss_pred Ceeccc-------------------cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccc Q lcl|Aclame:pro 1 MVLNKG-------------------TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGK 60 (298) Q Consensus 1 mat~gg-------------------~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~ 60 (298) ||...+ .|.-+++..++.....+.|.++.+.+++.+.+ +++.+|+. +..++..+..|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCCC Confidence 332221 34457888999999999999999999888775 56889985 6677888888887 Q ss_pred cccc--ccceeeEEEeeeEEEE-EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----c---cccccc Q lcl|Aclame:pro 61 KTHG--GVTLAPQTMVPIKVEY-GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----N---PRLGTA 130 (298) Q Consensus 61 ~~~~--~~~~~~v~l~~~k~~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~---~~~g~~ 130 (298) ...+ ++...+.+|..-++-- ...|.+ + +.-.+..++.+.+.+++++++++.+|+.++.-. . +.++.+ T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~Vdd-i--D~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~ 156 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYD-I--EDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI 156 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhh-H--HHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 7554 4666774454443211 112211 1 112344578899999999999999998887321 1 111111 Q ss_pred cccc-cccccccc-cccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhcc-CCceeecccccc Q lcl|Aclame:pro 131 SAVI-GTNHFDSK-VTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QGNALFPELKWG 205 (298) Q Consensus 131 ~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~-~G~~l~~~~~~~ 205 (298) .++. +....... ..............++.|.++...|...+.+.. ..+++|..+..|.+-+.- +..+.-...... T Consensus 157 ~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~ 236 (345) T protein:vir:22 157 EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEK 236 (345) T ss_pred cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccccccc Confidence 1110 00000000 001111122234568889999888888887654 579999999988654432 223332222334 Q ss_pred cCcceecceeeEecCccccccccc--------------------------cceEEEeeccceEEEEeecceEEEEeeccc Q lcl|Aclame:pro 206 ATPDTINGLPVDVNKTVSDMSLTQ--------------------------RDRAIIGDFANGFKWGYAKEVPLEVIQYGD 259 (298) Q Consensus 206 ~~~~~l~G~PV~~s~~~~~~~~~~--------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 259 (298) +..++++|++|+.++++|...... ....++. .+.++......+++++..+.. T Consensus 237 G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~-h~~A~~~v~~~~~~~e~~r~~- 314 (345) T protein:vir:22 237 GSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFM-HRSAVGTVKLRDLALERARRA- 314 (345) T ss_pred ceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEE-ehhheeeeeeecceeeeeech- Confidence 556789999999999998532211 0011111 122322333333444444321 Q ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 260 ~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..|. ..+++.+-+|.+++||++.+.|+--= T Consensus 315 ------~~~~---d~I~~~~a~G~~vlRPeaa~~i~~~~ 344 (345) T protein:vir:22 315 ------NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKV 344 (345) T ss_pred ------hHHH---HHHHHHHhcCCcccccceeEEEEEee Confidence 1122 24567778999999999988775433 No 139 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.33 E-value=5.2e-14 Score=93.42 Aligned_cols=284 Identities=12% Similarity=0.025 Sum_probs=159.3 Q ss_pred Ceecccccc-----------------chhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccc Q lcl|Aclame:pro 1 MVLNKGTLF-----------------DPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKT 62 (298) Q Consensus 1 mat~gg~li-----------------p~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~ 62 (298) ||-.++..+ -+++.+++.....+.|.++.+.++..+.+ ..+.||+. +..++..+..|+..+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERLS 79 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCcC Confidence 554443222 25677888888888888899999888764 46889986 567777777777765 Q ss_pred cc--ccceeeEEEeeeEEEEEEeecHHHhhc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-------cccccccc Q lcl|Aclame:pro 63 HG--GVTLAPQTMVPIKVEYGARISDEFMYA--SDEEKINILQAFNDGFAKKVARGIDLMAFHGV-------NPRLGTAS 131 (298) Q Consensus 63 ~~--~~~~~~v~l~~~k~~~~~~iS~ell~~--~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~-------~~~~g~~~ 131 (298) .+ ..+-.++++...++- +++.++.+ ...+..++.+.+.++.++++++.+|..++.-. ++..+... T Consensus 80 ~~~~~~~~~e~~itID~~~----~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 80 DKRKGIKHTEKVITIDGLL----TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred CCCCCCCcceEEEEecchh----hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 43 334445445444331 12222211 12333568888999999999999999886311 11111111 Q ss_pred cccccccccccc--ccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhc-cCCceeeccccccc Q lcl|Aclame:pro 132 AVIGTNHFDSKV--TQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQGNALFPELKWGA 206 (298) Q Consensus 132 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd-~~G~~l~~~~~~~~ 206 (298) ++ +........ +............++.|.++...|...+.... ..+++|..+..|.+-++ .+..+.-......+ T Consensus 156 g~-~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 234 (347) T protein:vir:94 156 GL-GTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETG 234 (347) T ss_pred CC-cccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccccccccc Confidence 11 111111111 11111111223456778888888877776432 57999999988855443 22223323334456 Q ss_pred CcceecceeeEecCccccccccccc------------eE--------EEeeccceEEEE-------eecceEEEEeeccc Q lcl|Aclame:pro 207 TPDTINGLPVDVNKTVSDMSLTQRD------------RA--------IIGDFANGFKWG-------YAKEVPLEVIQYGD 259 (298) Q Consensus 207 ~~~~l~G~PV~~s~~~~~~~~~~~~------------~~--------~~gd~~~~~~~~-------~~~~~~i~~~~~~~ 259 (298) ..++++|++|+.++++|....+... .. +-+||++...+. .-+.+.+++..+.+ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 6789999999999999964332110 11 223333222211 11111112211111 Q ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 260 ~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+ .|. ..+++.+.+|.+++||++.+.|+-.. T Consensus 315 ~~-----~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~ 345 (347) T protein:vir:94 315 VD-----AQG---DLIVGKYAMGHGGLRPEAAGALVFSP 345 (347) T ss_pred hh-----hHH---HHhhhhhhhcCcccccceeEEEEecC Confidence 11 122 35678889999999999998887666 No 140 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.33 E-value=1e-12 Score=86.34 Aligned_cols=282 Identities=9% Similarity=-0.009 Sum_probs=163.9 Q ss_pred Cee-----------ccc--cccchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVL-----------NKG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat-----------~gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |.+ +++ .|.-+++..++.......+.++.+.+++.+.++ ++.+|+. +..++....-|++...+.+ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~~ 79 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSRV 79 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCCCc Confidence 222 222 334488999999999999999999999888754 6899986 6677777777777766666 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhccc--ccHHHHHHHHHHHHHHHHHHHHHHHHhc----cccc--ccccccccccccc Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASD--EEKINILQAFNDGFAKKVARGIDLMAFH----GVNP--RLGTASAVIGTNH 138 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~--d~~~~l~~~i~~~la~~i~~~~d~~~l~----G~~~--~~g~~~~~~~~~~ 138 (298) .-++.++..-.+- +++.++.+.+ .+..++.+++.+++++++++..|.+++. +... .... .+..+... T Consensus 80 ~~~k~~itVD~ll----~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~-~~~~~~G~ 154 (335) T protein:vir:63 80 VNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDL-EDAFSPGV 154 (335) T ss_pred cccceEEEeccee----echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc-CCCcCCCc Confidence 6777777666532 3333332222 3445788999999999999999998762 2110 0000 01100010 Q ss_pred cccccccccccccccchhHHHHHHHhhhhhhcCCcc-----cEEEEcHHHHHHHHHhhccCCc-eee-c--ccccccCcc Q lcl|Aclame:pro 139 FDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV-----TGIAINPSFRSALAKQKDLQGN-ALF-P--ELKWGATPD 209 (298) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-----~~~vm~~~~~~~L~~lkd~~G~-~l~-~--~~~~~~~~~ 209 (298) .................+.+.+.++..+|...+.+. -..+++|..+..|.+-+.--.+ |.- . .+...+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeE Confidence 000000111111233344566777778887777652 3589999999999775322222 111 0 112334567 Q ss_pred eecceeeEecCccccccccc-----cceEEEeeccceEEEEee---------cceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 210 TINGLPVDVNKTVSDMSLTQ-----RDRAIIGDFANGFKWGYA---------KEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 210 ~l~G~PV~~s~~~~~~~~~~-----~~~~~~gd~~~~~~~~~~---------~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) +++|+||+.++++|....+. ....+-||+.....+..+ .++..++..+.. -|. ..+ T Consensus 235 ~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-------~~~---~~i 304 (335) T protein:vir:63 235 ILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-------KFS---WVL 304 (335) T ss_pred EeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc-------hhh---HHh Confidence 89999999999999766542 222344555433322222 222222222111 011 123 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+.+-+|.+++||++++.++-.. T Consensus 305 ~~~~a~G~g~lRPe~a~~i~~tg 327 (335) T protein:vir:63 305 DTFQMYNIGARRPDTAGAIELKG 327 (335) T ss_pred HHHHHcCCcccccceEEEEEEcC Confidence 44456999999999999998543 No 141 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.32 E-value=4.6e-14 Score=93.71 Aligned_cols=257 Identities=10% Similarity=0.021 Sum_probs=168.2 Q ss_pred Ceecc---ccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceE-------Eeeccccccccccceee Q lcl|Aclame:pro 1 MVLNK---GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEID-------VVAESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat~g---g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~-------~v~E~~~~~~~~~~~~~ 70 (298) -.-++ ...||+++..+.|+.+.+.+++..+....|.++..++||+.+...+.. .-.||...+..+++|+. T Consensus 132 ~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t 211 (410) T protein:vir:83 132 DHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDR 211 (410) T ss_pred ccCcccccccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeee Confidence 11112 235788899999999999999999999999998889999887666543 23588889999999999 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHH---HHhccccccccccccccccccccccccccc Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDL---MAFHGVNPRLGTASAVIGTNHFDSKVTQKV 147 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~---~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~ 147 (298) .+...++++++..+|++.+. .+.+..++...+.|..+++.+-+. ++|.++-. + .. .+ T Consensus 212 ~tA~ikTyGGyt~LSRQ~IE---Rs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t------~---~~-----a~--- 271 (410) T protein:vir:83 212 LTVNAKTLGGYVNVSRQAID---FSSPSALDLVVNGLGQQYAIETEALVGAALASTST------G---AV-----GY--- 271 (410) T ss_pred ccceeehhcCcccccceeee---cCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------h---hh-----hh--- Confidence 99999999999999999774 555778888889997777766653 44443210 0 00 00 Q ss_pred ccccccchhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhhccCCceeeccc-------ccccCcceecceeeEe Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQKDLQGNALFPEL-------KWGATPDTINGLPVDV 218 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~-------~~~~~~~~l~G~PV~~ 218 (298) ...+.+.+...|.++..++-.+ +.....+.++|.++..+.++- ..+++.+... ...+..+.+++.||++ T Consensus 272 -~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm 349 (410) T protein:vir:83 272 -GNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPVVM 349 (410) T ss_pred -hhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccceEE Confidence 0112234445566666666665 555567899999987765442 1122221110 0124567899999999 Q ss_pred cCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 219 NKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 219 s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) .+..+.+ ++.|-| +.++..+.+..-.+++.+.. .-+-+.+ |- .++...++.++++.-|.+. T Consensus 350 ~~~a~Ag------TA~f~~-~~Ai~~~eS~~gp~qL~d~~-i~nLt~~--------yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 350 SAALGSG------DAYLFS-TAAIECFEQRVGTLQVVEPS-VFGLQVA--------YA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred ecCCCcC------eeeEec-cceeeeeecCCceeEeeCCc-hhhhhhh--------he--eeeeeccccccceeeeccC Confidence 8887643 344444 34666666654334444322 1111111 11 4578889999999999999 No 142 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.32 E-value=3.1e-13 Score=89.17 Aligned_cols=284 Identities=11% Similarity=0.022 Sum_probs=157.1 Q ss_pred Ceecc---------------cc---ccchhHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeccccc Q lcl|Aclame:pro 1 MVLNK---------------GT---LFDPELVTDLISKVAGKSSIARLSAQKPIP-FNGEKVFTFTMDSEIDVVAESGKK 61 (298) Q Consensus 1 mat~g---------------g~---lip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 61 (298) ||..- |. +.-+.+..++....++.|.++.+.++.... ++.+.||+.. ..++..+..|+.+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCCC Confidence 44321 11 233667788888888889999999887765 4568998854 4667777777766 Q ss_pred cc--cccceeeEEEeeeEEEE-EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc------ccccccccc Q lcl|Aclame:pro 62 TH--GGVTLAPQTMVPIKVEY-GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV------NPRLGTASA 132 (298) Q Consensus 62 ~~--~~~~~~~v~l~~~k~~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~------~~~~g~~~~ 132 (298) +. ...+..+.++..-+.-. ...| +.+ +...+..++.+.+.++.++++++..|..++.-. .+.+..... T Consensus 80 ~~~~~~~~~~e~~ltID~~~~~~~~V-ddl--D~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:15 80 DDKRKDIKHTEKVIHIDGLLTADVLI-YDI--EDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIE 156 (347) T ss_pred CCCCCCCccceEEEEechhhhhhHHh-hhH--HHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 44 34566665555443211 1122 121 112334568888999999999999998887321 011111111 Q ss_pred cccccccccccc----cccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhccC-Cceeecccccc Q lcl|Aclame:pro 133 VIGTNHFDSKVT----QKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQ-GNALFPELKWG 205 (298) Q Consensus 133 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~~-G~~l~~~~~~~ 205 (298) ..|...+..... ............++.+.++...|...+.+.. ..+++|.++..|.+-.+-. ..+.-...... T Consensus 157 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~ 236 (347) T protein:vir:15 157 GLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHER 236 (347) T ss_pred ccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccc Confidence 111111111111 1111111223446777777778877777432 4688999999986654322 22222223345 Q ss_pred cCcceecceeeEecCccccccccccc--------eE--------EEeecc---------ceEEEEeecceEEEEeecccc Q lcl|Aclame:pro 206 ATPDTINGLPVDVNKTVSDMSLTQRD--------RA--------IIGDFA---------NGFKWGYAKEVPLEVIQYGDP 260 (298) Q Consensus 206 ~~~~~l~G~PV~~s~~~~~~~~~~~~--------~~--------~~gd~~---------~~~~~~~~~~~~i~~~~~~~~ 260 (298) +..++++|++|+.++++|....++.. .. .-++|+ .++.....+.+.++.. .+. T Consensus 237 G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~--~~~ 314 (347) T protein:vir:15 237 GTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERA--RRA 314 (347) T ss_pred eEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeec--ccc Confidence 66789999999999999965432210 00 111121 1111112222233332 221 Q ss_pred cccchhhhhcCcEEEEEEEEEccEEecccceEEEe--ecC Q lcl|Aclame:pro 261 DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVT--EAN 298 (298) Q Consensus 261 ~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~--~a~ 298 (298) . +-.-.+++.+.+|.+++||++.+.|+ +.. T Consensus 315 ~--------~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 315 N--------YQADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred h--------hhhhhhehhhhcCCceeccccEEEEecCCCC Confidence 1 12234677788999999999988774 444 No 143 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.30 E-value=1.7e-12 Score=85.09 Aligned_cols=282 Identities=10% Similarity=0.007 Sum_probs=162.1 Q ss_pred Cee-----------ccc--cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVL-----------NKG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat-----------~gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |.+ +++ .|.-+++..++.+.....+.++.+.+++.+.+ +++.+|+. +..++....-|++...+.+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~~~ 79 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERSRV 79 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCCCc Confidence 222 111 34458889999999999999999999988875 46899985 5667777666776666666 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhccc--ccHHHHHHHHHHHHHHHHHHHHHHHHhc----ccc--cccccccccccccc Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASD--EEKINILQAFNDGFAKKVARGIDLMAFH----GVN--PRLGTASAVIGTNH 138 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~--d~~~~l~~~i~~~la~~i~~~~d~~~l~----G~~--~~~g~~~~~~~~~~ 138 (298) .-++..+..-.+- +++.++.+.+ .+..++.+.+.+++++++++..|+.++. +.. +......++ +... T Consensus 80 ~~~k~~itID~ll----~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~-~~G~ 154 (335) T protein:vir:78 80 VNDKWNLTVDTLL----YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAF-SPGV 154 (335) T ss_pred ccCCeEEEeccee----echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCc-CCCc Confidence 6677666665532 3333332222 3445788999999999999999998762 211 111101111 0000 Q ss_pred cccccccccccccccchhHHHHHHHhhhhhhcCCcc-----cEEEEcHHHHHHHHHhhccCCc-eeec---ccccccCcc Q lcl|Aclame:pro 139 FDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADV-----TGIAINPSFRSALAKQKDLQGN-ALFP---ELKWGATPD 209 (298) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-----~~~vm~~~~~~~L~~lkd~~G~-~l~~---~~~~~~~~~ 209 (298) .................+.+.+.++...+...+... -..+++|+.+..|.+-+.--.+ |.-. .+...+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeE Confidence 000000111222334445666677777777665542 3589999999999775322121 1110 112345567 Q ss_pred eecceeeEecCccccccccccc-----eEEEeeccc---------eEEEEeecceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 210 TINGLPVDVNKTVSDMSLTQRD-----RAIIGDFAN---------GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 210 ~l~G~PV~~s~~~~~~~~~~~~-----~~~~gd~~~---------~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) .++|+||+.++++|.+..+... ...-+|++. ++......++..++..+.. -|. ..+ T Consensus 235 ~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-------~~~---~~i 304 (335) T protein:vir:78 235 ILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-------QFS---WVL 304 (335) T ss_pred EeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-------hhh---Hhh Confidence 8999999999999976544211 222234433 2222222222223222111 111 123 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+.+-+|.+++||++.+.|+-.. T Consensus 305 ~~~~a~G~g~lRPe~a~~i~~tg 327 (335) T protein:vir:78 305 DTFQMYNIGARRPDTAGAIELKG 327 (335) T ss_pred hHHHHcCCcccCcceEEEEEecC Confidence 44556999999999999887544 No 144 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.29 E-value=1.2e-12 Score=85.85 Aligned_cols=287 Identities=11% Similarity=0.038 Sum_probs=161.2 Q ss_pred Ceec-------------------cc--cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecc Q lcl|Aclame:pro 1 MVLN-------------------KG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAES 58 (298) Q Consensus 1 mat~-------------------gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~ 58 (298) |+.. +. .|.-+.+..++.....+.|.++.+.++..+.+ ++++||+. +..++..+.-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 2111 11 44557888999999999999999999888774 56889986 55666666555 Q ss_pred cccc---ccccceeeEEEeeeEE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----c--cc-c Q lcl|Aclame:pro 59 GKKT---HGGVTLAPQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----N--PR-L 127 (298) Q Consensus 59 ~~~~---~~~~~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~--~~-~ 127 (298) +++. ..+....+.++..-++ .....|.+ + +...+..++.+.+.++.++++++.+|+.++.-. . .+ + T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdD-i--D~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYD-L--DETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhh-H--HHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 5442 2244444444444432 11122221 1 112344578999999999999999998887321 0 00 0 Q ss_pred ccccccccccccccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhccCC----ceeecc Q lcl|Aclame:pro 128 GTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQG----NALFPE 201 (298) Q Consensus 128 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~~G----~~l~~~ 201 (298) +...-..|...+................+++.|.++..+|...+.... ..+++|..+..|.+-+|.+. .+.-.. T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~ 236 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSA 236 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccc Confidence 000001111111111122222223455678999999999988887532 57999999999977655331 111111 Q ss_pred cccccCcceecceeeEecCcccccccccc------------------------c-------eEEEeec---cc------- Q lcl|Aclame:pro 202 LKWGATPDTINGLPVDVNKTVSDMSLTQR------------------------D-------RAIIGDF---AN------- 240 (298) Q Consensus 202 ~~~~~~~~~l~G~PV~~s~~~~~~~~~~~------------------------~-------~~~~gd~---~~------- 240 (298) ....+..+++.|++|+.++++|...+... . .-+-+|| ++ T Consensus 237 ~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~ 316 (375) T protein:vir:10 237 LQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQ 316 (375) T ss_pred eeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEc Confidence 22334456899999999999996543210 0 0111233 11 Q ss_pred --eEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec-C Q lcl|Aclame:pro 241 --GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA-N 298 (298) Q Consensus 241 --~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a-~ 298 (298) +.....-.++++++++. +. --.+-...+.+.+-+|..+.||++.+.|+.. + T Consensus 317 ~~A~g~v~~~~~~~~~~~~---~~----~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 317 KEAAGVVEAIGPQVQVTNG---DV----SVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred hhheeeeeeeccccccccc---hh----hheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 11111112222222210 00 0112234567888999999999999988655 4 No 145 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.26 E-value=1.7e-12 Score=85.15 Aligned_cols=287 Identities=15% Similarity=0.047 Sum_probs=153.8 Q ss_pred Ceec-----------cc--cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVLN-----------KG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat~-----------gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |+.. +. .|.-+++.+++.......+.++.+.+++.+.+ +++.+|+. +..++...--|+..-.+.+ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg~~~ 79 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCCCCc Confidence 3322 12 33447788899998888999999999888765 46899986 5566666555555545556 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhcccccH--HH-HHHHHHHHHHHHHHHHHHHHHhccc---c--c--ccccccccccc Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASDEEK--IN-ILQAFNDGFAKKVARGIDLMAFHGV---N--P--RLGTASAVIGT 136 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~d~~--~~-l~~~i~~~la~~i~~~~d~~~l~G~---~--~--~~g~~~~~~~~ 136 (298) .-++.++..-.+- +++.++.+.++.. .+ +.+++.+++++++++.+|+.++.-. + + ..+......+. T Consensus 80 ~~~k~~ItID~lL----~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~ 155 (402) T protein:vir:97 80 QADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) T ss_pred ccccEEEEeCcee----echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccc Confidence 6667666665432 3333332223222 33 4578889999999999999876311 0 0 00001111111 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhccCC-ceeec--ccccccCccee Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDLQG-NALFP--ELKWGATPDTI 211 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~~G-~~l~~--~~~~~~~~~~l 211 (298) ......+............+.+.|.++...|-..+...+ ..+++|..+..|.+-.+--. .|... +....+...++ T Consensus 156 g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v 235 (402) T protein:vir:97 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) T ss_pred ccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEE Confidence 111111111111122334455667788888877776554 58999999999876432111 11111 11234556789 Q ss_pred cceeeEecCcccccccccc---------ce--EEEeeccceEEEE-eecce-EEEEee---cccccccchhhhhcCcEEE Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLTQR---------DR--AIIGDFANGFKWG-YAKEV-PLEVIQ---YGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~~~---------~~--~~~gd~~~~~~~~-~~~~~-~i~~~~---~~~~~~~~~~~f~~n~v~~ 275 (298) +|+||+.++++|..++... .. -+-+|+.....+. .++.+ +++..+ +..-+.. -|.. .+ T Consensus 236 ~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r---~~~~---~i 309 (402) T protein:vir:97 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK---EKTY---YI 309 (402) T ss_pred eceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchh---HHHH---HH Confidence 9999999999997542110 01 1225555332221 11111 222221 0000000 0000 11 Q ss_pred EEEEEEccEEecccceEEEee---cC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTE---AN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~---a~ 298 (298) -+.+-+|..++||++...+.- +| T Consensus 310 d~~~a~G~g~~RPeaa~vv~~~~~~t 335 (402) T protein:vir:97 310 DTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) T ss_pred HHHHHhCCcccCccceEEEEEecccc Confidence 233458999999999888832 22 No 146 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.23 E-value=7.1e-13 Score=87.18 Aligned_cols=284 Identities=11% Similarity=0.004 Sum_probs=161.8 Q ss_pred Ce---------eccc---------cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeccccc Q lcl|Aclame:pro 1 MV---------LNKG---------TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKK 61 (298) Q Consensus 1 ma---------t~gg---------~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~ 61 (298) || +..| .|.-+++..++.....+.|.++.+.++..+.+ ..+.||+. +...+..+..|+.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i-G~~~~~~~~~g~~l 79 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee-cceeeeeeccccCC Confidence 55 2211 23347788899888888899999999887664 56889874 45566666677665 Q ss_pred cc--cccceeeEEEeeeEE-EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----ccccccccccc Q lcl|Aclame:pro 62 TH--GGVTLAPQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----NPRLGTASAVI 134 (298) Q Consensus 62 ~~--~~~~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~~~~g~~~~~~ 134 (298) .. .++..+++++..-++ .....|.+- +.-....++.+.+.++.++++++..|+.++.-. .........+. T Consensus 80 ~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~---D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDI---EDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred CCCCCCCccceEEEEEechhhhhhhhhhH---HHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 44 356777777766654 222233221 111223467788999999999999999887321 10000001111 Q ss_pred cccc--cccccc--cccccccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhh-ccCCceeecccccccC Q lcl|Aclame:pro 135 GTNH--FDSKVT--QKVEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQK-DLQGNALFPELKWGAT 207 (298) Q Consensus 135 ~~~~--~~~~~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~~ 207 (298) |... ..+..+ ............++.|.++...|...+... -.++++|..+..|.+-+ .....+.-......+. T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcce Confidence 1100 000000 111111222344788888888887777643 35899999998886543 2333333333344566 Q ss_pred cceecceeeEecCcccccccccc-------------------ceEEEeeccceEEEEe---------ecceEEEEeeccc Q lcl|Aclame:pro 208 PDTINGLPVDVNKTVSDMSLTQR-------------------DRAIIGDFANGFKWGY---------AKEVPLEVIQYGD 259 (298) Q Consensus 208 ~~~l~G~PV~~s~~~~~~~~~~~-------------------~~~~~gd~~~~~~~~~---------~~~~~i~~~~~~~ 259 (298) .++++|++|+.++++|.+..... ..-+-+|+++...+.. ..++.+|..+.. T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~- 315 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP- 315 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech- Confidence 78999999999999985322110 0112334444322211 112223322211 Q ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 260 ~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) . .|. ..+++.+.+|.+++||++.+.|+-.- T Consensus 316 -~-----~~~---d~i~~~~~~G~~~~rPe~a~~~~~~~ 345 (347) T protein:vir:88 316 -E-----FQA---DQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) T ss_pred -h-----hHH---HHhhhhhhhcCceeccceEEEEEeCC Confidence 1 121 24678889999999999887776544 No 147 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=99.19 E-value=5.2e-12 Score=82.47 Aligned_cols=227 Identities=11% Similarity=0.048 Sum_probs=155.4 Q ss_pred Ceecc----------ccccchhHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeecccccccccccee Q lcl|Aclame:pro 1 MVLNK----------GTLFDPELVTDLISKVAGKSSIARLSAQKPIP-FNGEKVFTFTMDSEIDVVAESGKKTHGGVTLA 69 (298) Q Consensus 1 mat~g----------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~ 69 (298) |++=+ ..+-|......|||.+.+.++|+...+.+... +....+.+.++-|.++|+.=++..++++.++. T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt~ 80 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTTV 80 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccceeE Confidence 55442 23445556778999999999999999999875 34578889999999999999999999999999 Q ss_pred eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc----------- Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH----------- 138 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~----------- 138 (298) +++-..+-+++.+.|.+.+..... ...++...-.+...+++.+.+..++|+|+... .+..|.|+.. T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~G-n~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~--~p~~F~GL~~R~~~~s~~~a~ 157 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNG-NTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSV--NPQQFMGLSSRYSSLSAGNAQ 157 (328) T ss_pred EEEEEEEEEecceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccC--ChhhhcchhhhcCcccccccc Confidence 999999999999999998775443 23345455566789999999999999984210 0111111000 Q ss_pred -----------ccc------------------------------------------------------------c---cc Q lcl|Aclame:pro 139 -----------FDS------------------------------------------------------------K---VT 144 (298) Q Consensus 139 -----------~~~------------------------------------------------------------~---~~ 144 (298) .++ . .. T Consensus 158 qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~ 237 (328) T protein:vir:95 158 NIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIA 237 (328) T ss_pred ceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 000 0 00 Q ss_pred cc----cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh-ccCCceeecccccccCcceecceeeEec Q lcl|Aclame:pro 145 QK----VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQGNALFPELKWGATPDTINGLPVDVN 219 (298) Q Consensus 145 ~~----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~~~~~l~G~PV~~s 219 (298) .. ........+..+.+.+++.+++.......+|+||.+....|++.. +...-.+-.....+..+-.++|+||..+ T Consensus 238 NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~ 317 (328) T protein:vir:95 238 NIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRET 317 (328) T ss_pred cCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEE Confidence 00 011123445566777888888776666788999999999998764 4444444433344445567899999988 Q ss_pred CccccccccccceEEE Q lcl|Aclame:pro 220 KTVSDMSLTQRDRAII 235 (298) Q Consensus 220 ~~~~~~~~~~~~~~~~ 235 (298) +.+-... . .++ T Consensus 318 dai~~tE----~-~vv 328 (328) T protein:vir:95 318 DALLETE----A-RVV 328 (328) T ss_pred eeeecCc----c-ccC Confidence 8774321 1 122 No 148 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=99.18 E-value=6.4e-12 Score=81.94 Aligned_cols=275 Identities=8% Similarity=-0.032 Sum_probs=171.3 Q ss_pred Ceeccccccchh---HHHHHHHHHHhhchhhhhcceee-cCCC--ceEEEEEeCCcceEEeecc-ccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPE---LVTDLISKVAGKSSIARLSAQKP-IPFN--GEKVFTFTMDSEIDVVAES-GKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~---~~~~ii~~~~~~s~i~~~~~~~~-~~~~--~~~ip~~~~~~~a~~v~E~-~~~~~~~~~~~~v~l 73 (298) -|.++|.+.-.| +-+.+++...+.-..+++.++.. .+.+ .+.+++.+..+.+.|++.. ..+|..+..+++... T Consensus 5 ~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~~~~~ 84 (296) T protein:vir:10 5 KADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALATERQG 84 (296) T ss_pred chhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccceeEEE Confidence 466777777755 44677777666666666666442 3322 3566666677889998765 458888888999999 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc--ccccccc Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT--QKVEAPR 151 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~--~~~~~~~ 151 (298) ..+.++..+.++.+=|+.+.....++...-....++++++.+|+.+|+|... . +..|+.+.++... ....+.. T Consensus 85 ~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~-~----g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 85 KVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA-H----GIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred EEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc-c----cceeEeecCCCccccccCCccC Confidence 9999999888887655544445567888888899999999999999999421 1 2233333333211 1112222 Q ss_pred ccchhHHHHHHHhhhhhhc---CCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTGV---DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~~---~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) ....++||..++.++... ...+..++++|..+..|.+.-...|.-++.-........++...|.+.+ ..+. T Consensus 160 -~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~-----a~~~ 233 (296) T protein:vir:10 160 -PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLND-----YNGT 233 (296) T ss_pred -HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeecc-----CCCC Confidence 235688999999877653 2456789999999999977666666544433333333344555554432 2233 Q ss_pred ccceEEEeecc-ceEEEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEEEc-cEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELFLG-WGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r~~-~~v~~~~a~~~l~~a~ 298 (298) +++..++-+.+ ..+.+...+.++ ..+ . -.+++ ..++...|++ ..+.+|.|++++.|.| T Consensus 234 g~~~~v~~~~~~~~~~~~v~~~~~--~~~-~---------e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~ 294 (296) T protein:vir:10 234 GTSAAIAYEKDPNNMAIEIPEATN--ALP-A---------QPKDLHFKIPVTSKATGLIVYRPLTMAVMKGIT 294 (296) T ss_pred cceEEEEEEcCCceEEEEcCccee--eec-c---------cccCceEEEeeEeeEEEEEEECCceeEEEeeee Confidence 34444443332 222222222222 211 0 01222 4467788885 7788999999999999 No 149 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.15 E-value=1.1e-12 Score=86.15 Aligned_cols=274 Identities=12% Similarity=0.052 Sum_probs=167.5 Q ss_pred Ce-eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEE-eeccccccccccceeeEEEeeeEE Q lcl|Aclame:pro 1 MV-LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDV-VAESGKKTHGGVTLAPQTMVPIKV 78 (298) Q Consensus 1 ma-t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~-v~E~~~~~~~~~~~~~v~l~~~k~ 78 (298) +. ++.-..+|.-+...|-..++.+.+++++.++..++. +-+-+.....+-.| .--|+.+.++..+|..-++.|.-+ T Consensus 122 vt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~--l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~V 199 (400) T protein:vir:93 122 VTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA--LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMV 199 (400) T ss_pred cccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCc--eeeecchhhhcccceeccCCcccceeeeeeeeccCHHHH Confidence 22 344456788888888888899999988888776642 22222222233445 567899999999999999999988 Q ss_pred EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHH-HHHHHHhccccccc-ccccccccccccccccccccccccccchh Q lcl|Aclame:pro 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVAR-GIDLMAFHGVNPRL-GTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 79 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~-~~d~~~l~G~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) +.+..+.+-..+... +.-.|..++..+|...+-+ .++++++-|.|..+ ......+.+..+..-+..+.. +..... T Consensus 200 Yk~~~la~~~~~~~~-tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~--a~~~~~ 276 (400) T protein:vir:93 200 YKLQSLAERVKRLQM-SYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKS--AGKTPF 276 (400) T ss_pred HHHhhhhhhhhhccc-cHHHHHHHHHHHHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhh--cCCccH Confidence 888887544444333 3467899999999999885 57999998843211 000111111111111110000 111122 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecce-eeEecCccccccccccceEEE Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGL-PVDVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~-PV~~s~~~~~~~~~~~~~~~~ 235 (298) -+.+..++.-+.+...+..-++|+|..|+.|++|||++|.+.|+......+-.+=+|+ .+++...++. .+..+++ T Consensus 277 qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~----~kp~V~V 352 (400) T protein:vir:93 277 ADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKA----LKPTVLV 352 (400) T ss_pred HHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCC----CCceeee Confidence 2333344443334444555699999999999999999999999765555444455565 3344455543 2344444 Q ss_pred eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 236 GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 236 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) |-+ +.++.. + ++....+ -|.+|+-.|..+...++-+.-|++-+.++.+ T Consensus 353 -Dek--~~i~~~-~--~~t~~sf--------~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 353 -DQK--YHIDMQ-D--LTKVDAF--------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred -ehh--hhcccc-C--ceeccce--------eeeeccceEEeeeeeccceecccceeeEeeC Confidence 322 222111 1 1111111 1556777778889999999999999999999 No 150 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.13 E-value=3.2e-12 Score=83.58 Aligned_cols=253 Identities=11% Similarity=0.058 Sum_probs=137.5 Q ss_pred hcceeecCCCceEEEEEeCCcceEEeeccccccc--cccceeeEEEe--eeEEEEEEeecHHHhhcccccHHHHHHHHHH Q lcl|Aclame:pro 30 LSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTH--GGVTLAPQTMV--PIKVEYGARISDEFMYASDEEKINILQAFND 105 (298) Q Consensus 30 ~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~--~~~~~~~v~l~--~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~ 105 (298) +.+.+. ++++++||+. +..++..+.-|+++.. .++.=.+.++. -.++.. ..|-+- +.-.+..++.+...+ T Consensus 1 ~vr~i~-~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~-~~VdDi---D~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MTRTIT-SGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD-VLIYDI---EDAMNHYDVRSEYST 74 (324) T ss_pred Ceeeee-cCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh-hhhhhH---HHHhcCccchhHHHH Confidence 545443 3567899996 5667777666666533 33444443333 333222 112110 111233578899999 Q ss_pred HHHHHHHHHHHHHHhcc----cc---cccccccccccccccccccccccccccccchhHHHHHHHhhhhhhcCCccc--E Q lcl|Aclame:pro 106 GFAKKVARGIDLMAFHG----VN---PRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--G 176 (298) Q Consensus 106 ~la~~i~~~~d~~~l~G----~~---~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~ 176 (298) +.++++++.+|+.++.- .. +.+..+....|....................+++.|.++...|-..+.... . T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 99999999999887622 10 011111111111111111111112222334567888898888888887533 5 Q ss_pred EEEcHHHHHHHHHhh-ccCCceeecccccccCcceecceeeEecCccccccccccce-------------------EEEe Q lcl|Aclame:pro 177 IAINPSFRSALAKQK-DLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDR-------------------AIIG 236 (298) Q Consensus 177 ~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~-------------------~~~g 236 (298) .+++|..+..|.+-+ ..++.+.-......+..++++|++|+.++++|...+.+... -+-+ T Consensus 155 ~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~ 234 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTV 234 (324) T ss_pred EEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccc Confidence 799999998876543 23344443444556677899999999999999764432111 1223 Q ss_pred eccceEEEE---------eecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEe---ecC Q lcl|Aclame:pro 237 DFANGFKWG---------YAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVT---EAN 298 (298) Q Consensus 237 d~~~~~~~~---------~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~---~a~ 298 (298) |+++...+. ....++++..+ +. ..|. ..+++.+-+|.+++||++.+.++ ++| T Consensus 235 d~~~~~gl~~~~~a~~tv~~~~~~~e~~~--~~-----~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~ 298 (324) T protein:vir:99 235 GADNVVGLFVHRSAVATLKLKDMALERAR--RP-----EYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGET 298 (324) T ss_pred ccCceeEEEEehhheEEEeeecceeccee--ch-----hhHH---HhhhhhhhhcCcccccceEEEEEEccCcc Confidence 333222111 11111222221 11 1122 34577778999999999987775 222 No 151 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.11 E-value=1.7e-11 Score=79.58 Aligned_cols=290 Identities=12% Similarity=0.001 Sum_probs=156.4 Q ss_pred Ceeccc-------------cccchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVLNKG-------------TLFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat~gg-------------~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |+..+. .|.-+++..++.......+.++.+.+++.+.++ ++.+|+. +..++....-|++.-.+.+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~~~ 79 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCCCc Confidence 443321 344577888888888899999999999988764 6899986 6677888777777666666 Q ss_pred ceeeEEEeeeEE-EEEEeecHHHhhcccccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc----cccccccccccccc-- Q lcl|Aclame:pro 67 TLAPQTMVPIKV-EYGARISDEFMYASDEEKIN-ILQAFNDGFAKKVARGIDLMAFHGVN----PRLGTASAVIGTNH-- 138 (298) Q Consensus 67 ~~~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~-l~~~i~~~la~~i~~~~d~~~l~G~~----~~~g~~~~~~~~~~-- 138 (298) ..++..+..-.+ .+...|.+ + +...+..+ +-.++.+++++++++.+|+.++.-.- ..+....++.+... T Consensus 80 ~~dk~~ItIDtLL~a~~~V~d-l--Dd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 80 QADKNQLVIDATVIARNTVAH-L--HDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred ccCcEEEEeCceeeecchhhh-H--HHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 677776666553 22233321 0 01122244 56788889999999999987763210 01111111111111 Q ss_pred -cccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhc-cCCceeec--ccccccCcceec Q lcl|Aclame:pro 139 -FDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQGNALFP--ELKWGATPDTIN 212 (298) Q Consensus 139 -~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd-~~G~~l~~--~~~~~~~~~~l~ 212 (298) .....+...........+.+.+.++...+...+.+.+ ++++.|..+..|..-.- =|-.+... .....+...++. T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 1111111111112223344556777777776666543 56777777777653210 00011101 112234446799 Q ss_pred ceeeEecCccccccccc-----------cceEEEeeccceEEEE-eecc-eEEEEeecccccc-cchhhhhcCcEEEEEE Q lcl|Aclame:pro 213 GLPVDVNKTVSDMSLTQ-----------RDRAIIGDFANGFKWG-YAKE-VPLEVIQYGDPDN-SGLDLKGYNQVYIRAE 278 (298) Q Consensus 213 G~PV~~s~~~~~~~~~~-----------~~~~~~gd~~~~~~~~-~~~~-~~i~~~~~~~~~~-~~~~~f~~n~v~~r~~ 278 (298) |+||+.++++|...+.. ...-+-||++....+. .++. +.++..+-. .+. .-..-|. ..+-+. T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt-~~~~~d~r~~~---~~id~~ 312 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVI-GDIFYEKKEKT---YYIDTF 312 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccc-cccccchhhHH---HHHHHH Confidence 99999999999754221 1111236665433332 1111 223322210 000 0000011 112344 Q ss_pred EEEccEEecccceEEEeecC Q lcl|Aclame:pro 279 LFLGWGILDATKFARVTEAN 298 (298) Q Consensus 279 ~r~~~~v~~~~a~~~l~~a~ 298 (298) +-+|..++||+++..++-++ T Consensus 313 ~a~G~g~~RPeaa~vv~~~~ 332 (400) T protein:vir:10 313 MSEGAIPDRWEAVSVVTTKR 332 (400) T ss_pred HHhCCcccchhheEEEEecC Confidence 56999999999999999888 No 152 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.09 E-value=1.9e-11 Score=79.41 Aligned_cols=289 Identities=13% Similarity=0.010 Sum_probs=155.1 Q ss_pred Ceecc----------c---cccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeecccccccccc Q lcl|Aclame:pro 1 MVLNK----------G---TLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) Q Consensus 1 mat~g----------g---~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) |+.-+ | .|.-+++..++.......+.++.+.+++.+.+ +++.+|+. +..++....-|+....+.+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~~~~ 79 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCCCCc Confidence 44332 1 34557788888888889999999999998876 46899986 5677777666666655667 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhcccccH--HH-HHHHHHHHHHHHHHHHHHHHHhccc-----ccccccccccccc-- Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASDEEK--IN-ILQAFNDGFAKKVARGIDLMAFHGV-----NPRLGTASAVIGT-- 136 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~d~~--~~-l~~~i~~~la~~i~~~~d~~~l~G~-----~~~~g~~~~~~~~-- 136 (298) .-++..+..-.+- +++-++.+.++.. .+ +...+.+++++++++.+|..++.-. .........+.+. T Consensus 80 ~~dK~~ItID~lL----~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 80 QADKNQLVIDATV----IARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred ccccEEEEeCcee----ehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 7777666665432 2222222222222 33 4568889999999999998774321 1011111111111 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhc-cCCceeec--ccccccCccee Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQGNALFP--ELKWGATPDTI 211 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd-~~G~~l~~--~~~~~~~~~~l 211 (298) ...................+.+.+.++...+...+.+.. ++++.|..+..|..-.. =|-.|-.. +....+...++ T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~v 235 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSS 235 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEE Confidence 111111112222222333456678888888877777644 46667777766644210 00011100 11223445689 Q ss_pred cceeeEecCcccccccc-----------ccceEEEeeccceEEEE-eecc-eEEEEeeccccccc-chhhhhcCcEEEEE Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLT-----------QRDRAIIGDFANGFKWG-YAKE-VPLEVIQYGDPDNS-GLDLKGYNQVYIRA 277 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~-----------~~~~~~~gd~~~~~~~~-~~~~-~~i~~~~~~~~~~~-~~~~f~~n~v~~r~ 277 (298) .|+||+.++++|..+.. +...-+-||++....+. .++. +.++..+-. .+.- -..-|. ..+-+ T Consensus 236 aGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt-~~~~~d~r~~~---~~id~ 311 (401) T protein:vir:70 236 YNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVT-GDIFYEKKEKT---YYIDT 311 (401) T ss_pred eceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccc-cchhhhhhhhH---HHHHH Confidence 99999999999975422 11111225665433322 1111 223322210 0000 000011 11224 Q ss_pred EEEEccEEecccceEEEeecC Q lcl|Aclame:pro 278 ELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 278 ~~r~~~~v~~~~a~~~l~~a~ 298 (298) .+-+|..++||+|++.++-+. T Consensus 312 ~~a~g~g~~RPeaa~vv~~k~ 332 (401) T protein:vir:70 312 FMAEGAIPDRWEAVSVVTTKR 332 (401) T ss_pred HHHhCCcccchhheEEEeecC Confidence 455899999999999985544 No 153 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=99.03 E-value=1.3e-10 Score=74.85 Aligned_cols=275 Identities=10% Similarity=0.037 Sum_probs=163.6 Q ss_pred Ceeccccccchh---HHHHHHHHHHhhchhhhhccee-ecCCC--ceEEEEEeCCcceEEeecc-ccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPE---LVTDLISKVAGKSSIARLSAQK-PIPFN--GEKVFTFTMDSEIDVVAES-GKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~---~~~~ii~~~~~~s~i~~~~~~~-~~~~~--~~~ip~~~~~~~a~~v~E~-~~~~~~~~~~~~v~l 73 (298) =+.+.|.+...+ +-+++++...+....+++..+. +.+-+ .+.+...+..+.+.|++.+ ..+|..+..+++... T Consensus 27 a~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~ 106 (319) T protein:vir:10 27 AAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFG 106 (319) T ss_pred hhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEE Confidence 111224444433 3456777777777777777754 33323 3556666677889999875 457888888888888 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc----- Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE----- 148 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~----- 148 (298) ..+.++....+|.+=+........++...-....++++++.+|+.+|+|.. ..| ..|+.+..+......+ T Consensus 107 ~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~-~~g----~~GLlN~p~~~~~~~~~~~~~ 181 (319) T protein:vir:10 107 KVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSA-PHK----IVSVFNHPNITKITSGKWIDV 181 (319) T ss_pred EEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc-ccc----ceeEEeCCCceeeecCCCCCc Confidence 888888888888664544444556788888889999999999999999942 112 2333333332211111 Q ss_pred cccccchhHHHHHHHhhhhhhc--C-CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGV--D-ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDM 225 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~--~-~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~ 225 (298) .+.+....+++|..++.++... + ..+..++|+|+.+..|.......|..++.-........+|.+.|.+. .. T Consensus 182 ~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~-----~a 256 (319) T protein:vir:10 182 STMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELE-----DI 256 (319) T ss_pred cccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeec-----cc Confidence 1122345688899998888643 2 35678999999999997666666655543332232333455555443 22 Q ss_pred cccccceEEEeecc-ceEEEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEEEc-cEEecccceEEEeec Q lcl|Aclame:pro 226 SLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELFLG-WGILDATKFARVTEA 297 (298) Q Consensus 226 ~~~~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r~~-~~v~~~~a~~~l~~a 297 (298) .+.+.+.+++-..+ ..+.+...+.++ +.+- -.+++ ..+.+..|++ ..+.+|.|++++.|. T Consensus 257 g~~g~~~~v~y~~~~~~~~~~v~~~~~--~~~~----------e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 257 DGAGTKGVLVYEKNPMNMSIEIPEAFN--MLPA----------QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred CCCcceEEEEEecCCceEEEecCccee--eeee----------eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 23334444443332 222222222222 2210 01122 2344566665 556789999999999 No 154 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.03 E-value=8.9e-11 Score=75.69 Aligned_cols=279 Identities=10% Similarity=-0.007 Sum_probs=156.0 Q ss_pred CeeccccccchhHHHHHHHHHH-hhchhhhhcceeecCCCc--eEEEEEeCCcce------EEeeccc-ccccccc--ce Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA-GKSSIARLSAQKPIPFNG--EKVFTFTMDSEI------DVVAESG-KKTHGGV--TL 68 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~-~~s~i~~~~~~~~~~~~~--~~ip~~~~~~~a------~~v~E~~-~~~~~~~--~~ 68 (298) |+++=....-+++..++-.... +.|.++.-++...-..+. +..+........ .-.+.++ ..|.... .. T Consensus 13 Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~~~~~ 92 (322) T protein:vir:10 13 IAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNKPFAK 92 (322) T ss_pred eechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcccCCCccccccce Confidence 7776444444666666655554 445566665533322222 122221111110 1111111 2333333 33 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) ..+.+..+..+ ..|.+.-+. ....+..+...+..+.+++++.|..++.+.-... . .+..+. .+.....+. . T Consensus 93 r~~~~~d~~~~--~~VDd~D~~---k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a-~-~~~~gt-~v~~~ss~~-i 163 (322) T protein:vir:10 93 RRTNVDTYDTG--HVVEQEDIS---QMLLDPNSALITSQAYAMARKTDDLIIAGAWKPA-S-IKGTGQ-PVEFLATQE-I 163 (322) T ss_pred EEEeecccccc--eecchHHHH---HhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccc-c-cccccc-ccccCCCcc-c Confidence 44555555443 455444221 2234566778889999999999998887531111 1 111111 111111111 2 Q ss_pred cccccchhHHHHHHHhhhhhhcCCccc---EEEEcHHHHHHHHHhhc-cCCceeecccc-cccCcceecceeeEecCccc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDADVT---GIAINPSFRSALAKQKD-LQGNALFPELK-WGATPDTINGLPVDVNKTVS 223 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~~~---~~vm~~~~~~~L~~lkd-~~G~~l~~~~~-~~~~~~~l~G~PV~~s~~~~ 223 (298) ..+.....++.|+++...|..++.+.. .++.+|..|..|.+... ++..|.-.... ..+..++++|+.+..++++| T Consensus 164 ~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp 243 (322) T protein:vir:10 164 GDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIVSTRLD 243 (322) T ss_pred ccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEEeccCC Confidence 223445667889999999988887643 47999999999866543 33344433333 34667899999999999998 Q ss_pred cccccc------------cceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccce Q lcl|Aclame:pro 224 DMSLTQ------------RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) Q Consensus 224 ~~~~~~------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~ 291 (298) ...... .-.+++. .++++.+....++..++....+. .+...+++.+-+|..+++|++| T Consensus 244 ~~~~t~~~~~~~~~~~~~~~~~~a~-~k~Av~~a~~~dv~~~i~~~~~~---------~~a~~I~~~~~~Ga~ri~~~gV 313 (322) T protein:vir:10 244 KFDPTQWGMAAEDGPQGDEIWCIAM-TDMALGYHSCKDIWTKVAEDPSA---------SFAWRIYSAFTADCVRVEDEHI 313 (322) T ss_pred ccccccccccccCCCCccceeEEEE-ecCceeEEEeeeeeEEeeccCCc---------chhhhhhhhhhhCceEeccCcE Confidence 543321 1123333 35677788777777776543321 1223456667799999999999 Q ss_pred EEEeecC Q lcl|Aclame:pro 292 ARVTEAN 298 (298) Q Consensus 292 ~~l~~a~ 298 (298) +.+.--. T Consensus 314 v~i~~~e 320 (322) T protein:vir:10 314 FKLRLKN 320 (322) T ss_pred EEEEEec Confidence 9998866 No 155 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=99.02 E-value=2e-10 Score=73.74 Aligned_cols=275 Identities=11% Similarity=0.027 Sum_probs=164.1 Q ss_pred Ceeccccccchh----HHHHHHHHHHhhchhhhhccee-ecCCC--ceEEEEEeCCcceEEeeccc-cccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPE----LVTDLISKVAGKSSIARLSAQK-PIPFN--GEKVFTFTMDSEIDVVAESG-KKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~----~~~~ii~~~~~~s~i~~~~~~~-~~~~~--~~~ip~~~~~~~a~~v~E~~-~~~~~~~~~~~v~ 72 (298) |.++.-..++.+ +-+++++.+.+....+++..+. +.+-+ .+.++..+..+.+.|++.++ .+|..+..+++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 877765556644 3466777777777777776654 33323 34566666678889988755 4788888888888 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc------ Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK------ 146 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~------ 146 (298) ...+.++.-..++.+=|........++...-....++++++.+|+.+|+|... . +..|+.+..+..... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~----g~~GLlN~p~~~~~~~~~~~~ 155 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-Y----AIKGAFEATGIQIDVSPTTGV 155 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-c----cceeeecCCCcccccccCccc Confidence 88888888888887645444445567888888999999999999999999531 1 223333332221110 Q ss_pred ---ccc-ccccchhHHHHHHHhhhhhhc--C-CcccEEEEcHHHHHHHHHhh--ccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 147 ---VEA-PRGIADPNGAIENAVELLTGV--D-ADVTGIAINPSFRSALAKQK--DLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 147 ---~~~-~~~~~~~~~~i~~~~~~l~~~--~-~~~~~~vm~~~~~~~L~~lk--d~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) ..+ ..+....+++|..++.++... + ..+..++|+|+.+..|.+.+ +..|..++.-........+|...|-+ T Consensus 156 ~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L 235 (301) T protein:vir:80 156 GNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDL 235 (301) T ss_pred ccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEccee Confidence 011 123344688999999988653 2 24678999999999997544 55555544322222222344444443 Q ss_pred ecCccccccccccceEEEeecc-ceEEEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEEE-ccEEecccceEEE Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELFL-GWGILDATKFARV 294 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r~-~~~v~~~~a~~~l 294 (298) .+ ..+.+.+.+++-.-+ ..+.+...+.+ ...+ .-.+|. ....+..|+ |..+.+|.|++++ T Consensus 236 ~~-----~g~~g~~~~v~~~~~~d~~~~~v~~~~--~~~~----------~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~ 298 (301) T protein:vir:80 236 AG-----MGTAGSDSFAVIHDSNETAELIIPMDI--TRHP----------EEYSFPRTKVPFEERTAGVVVRFPAAIVRV 298 (301) T ss_pred cc-----CCCCcccEEEEEecCCcEEEEEecCce--eeec----------ceecCceeEeeeeeeeEEEEEEccceEEEE Confidence 32 222233433332211 11222222222 1111 112332 223455666 4577899999999 Q ss_pred eec Q lcl|Aclame:pro 295 TEA 297 (298) Q Consensus 295 ~~a 297 (298) +|. T Consensus 299 ~GI 301 (301) T protein:vir:80 299 DGI 301 (301) T ss_pred ecC Confidence 999 No 156 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=99.01 E-value=1.4e-10 Score=74.64 Aligned_cols=277 Identities=9% Similarity=-0.012 Sum_probs=148.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcc-------eeecCCCceEEEEEeC--Cc--ceEEeeccccccccccc-e Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSA-------QKPIPFNGEKVFTFTM--DS--EIDVVAESGKKTHGGVT-L 68 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~-------~~~~~~~~~~ip~~~~--~~--~a~~v~E~~~~~~~~~~-~ 68 (298) |+.+--.+..+.+..-.+|.+.+.....+.+. ..+..+.-+.+|-+.. +. +..-+.+.+..+.++.+ . T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~kitt~ 80 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVLKHL 80 (325) T ss_pred CchhhhhhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccceeccc Confidence 99999888899988888888776655544332 1223333346776542 11 33334455555555544 4 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) .++....++-.++.....+.+....+....+...|.+.+++...+.+-+.++.+.....+... ..+....+. . T Consensus 81 ~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~-----~~v~dis~~--~ 153 (325) T protein:vir:95 81 VDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVS-----DVVYDATAN--T 153 (325) T ss_pred cceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----cceeeeecc--c Confidence 455555544444444444433333444455666677777777666655555543211100000 001111110 0 Q ss_pred cccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) ...........+.++..++.+...+-+.|+||+.++..|.+.+-.+...++....... ..+++|++|++++.||..... T Consensus 154 ~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~-i~t~~G~~VIVdD~~p~~~~g 232 (325) T protein:vir:95 154 DAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNV-VRDPFGKLLVMTDSPNLFAAG 232 (325) T ss_pred CcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCccc-ccccCCcEEEEeCCCCCCCcc Confidence 1111112457889999999777778889999999999999877666655554433222 247899999999999865433 Q ss_pred ccc---eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec---C Q lcl|Aclame:pro 229 QRD---RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA---N 298 (298) Q Consensus 229 ~~~---~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a---~ 298 (298) ... +.+|| .+++.++..++......+.. .-++-...+|++.. -+++|.++.--+.+ + T Consensus 233 ~~~~ytty~lg--~GAi~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~t---f~lhp~G~sw~~s~~g~s 295 (325) T protein:vir:95 233 TPNVYHILGLV--PGGVLIGQNNDFDANEETKN--------GDENIIRTYQAEWS---YNIGVKGFAWDKANGGKS 295 (325) T ss_pred CceeEEEEEEe--cCeEEecCCCCccccccccC--------cccceeeeeeeeee---EEeecceeeeecccccCC Confidence 211 22333 35555555444332211110 01122234444332 25678887773221 1 No 157 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.98 E-value=1.1e-10 Score=75.24 Aligned_cols=274 Identities=9% Similarity=-0.026 Sum_probs=161.6 Q ss_pred Ceeccccccchh---HHHHHHHHHHhhchhhhhcceeec-CC--CceEEEEEeCCcceEEeeccc-cccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPE---LVTDLISKVAGKSSIARLSAQKPI-PF--NGEKVFTFTMDSEIDVVAESG-KKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~---~~~~ii~~~~~~s~i~~~~~~~~~-~~--~~~~ip~~~~~~~a~~v~E~~-~~~~~~~~~~~v~l 73 (298) -+.++|.+.-.| +-++|++...+.-..+++.++..- +. ..+.++..+..+.+.|++..+ .+|..+..+++... T Consensus 23 ~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~ 102 (314) T protein:vir:10 23 KADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQG 102 (314) T ss_pred chhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCcccccceeecccceeEE Confidence 233445555543 445677766555555555554321 21 245666677778899998754 58888888998888 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc--cccccccc Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV--TQKVEAPR 151 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~--~~~~~~~~ 151 (298) ..+.++..+.+|.+=++.......+|...-....++++++.+|+.+|+|... . +..|+.+.++.. .....+ . T Consensus 103 ~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~-~----g~~GLlN~p~v~~~~~~~~W-a 176 (314) T protein:vir:10 103 KVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAP-H----GIVSVFDQPNINNVVATPNW-S 176 (314) T ss_pred EEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc-c----cceeEeecCCCccccCCCCc-c Confidence 8899999888876544444444567888888889999999999999999421 1 223333322221 111122 2 Q ss_pred ccchhHHHHHHHhhhhhhc--C-CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTGV--D-ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~~--~-~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) +....++||..++.++... + ..+..++|+|..+..|.+.-+..|.-++.-......+-+|-+.|-+.+ ..+. T Consensus 177 T~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~-----ag~~ 251 (314) T protein:vir:10 177 VPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGLTIRFLQFLDN-----YDGA 251 (314) T ss_pred cHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCcEEEEcccccc-----cCCC Confidence 3345689999999988763 2 356789999999988866555556555433333333444555554332 2222 Q ss_pred ccceEEEeecc-ceEEEEeecceEEEEeecccccccchhhhhc-Cc-EEEEEEEEE-ccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY-NQ-VYIRAELFL-GWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~-n~-v~~r~~~r~-~~~v~~~~a~~~l~~a~ 298 (298) +.+.+++-.-+ ..+.+...+. ++..+ .|+ ++ ..+.+..|+ |..+.+|.|++++.|.| T Consensus 252 g~~~~v~y~~~~~~~~~~vp~~--~~~l~-----------~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~ 312 (314) T protein:vir:10 252 GGKAALAFEKSPLNMSIEIPEV--TNVLP-----------AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGIT 312 (314) T ss_pred cceEEEEEecCCcEEEEecCcc--ceeec-----------ceecCceEEEcceeeeEEEEEECcceeEeeeeee Confidence 33333332222 1111111111 22211 111 21 233456666 46677899999999999 No 158 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.97 E-value=3.5e-11 Score=77.88 Aligned_cols=279 Identities=11% Similarity=0.008 Sum_probs=157.6 Q ss_pred Ceec-----ccccc-chhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLN-----KGTLF-DPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~-----gg~li-p~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l 73 (298) |++- +..+| |+..+.+|+.-+++..+...+.++...+. ..++||++. .++..-+.+++.+.-..++-.++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg-~~tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVG-TPVVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccc-ccccccccCCCCcccccCCCceEEE Confidence 7653 34556 77778889888888877777777555443 458888854 4666666677776655566655555 Q ss_pred eeeE--EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhc----cccccccccccccccccccccccccc Q lcl|Aclame:pro 74 VPIK--VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFH----GVNPRLGTASAVIGTNHFDSKVTQKV 147 (298) Q Consensus 74 ~~~k--~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~----G~~~~~g~~~~~~~~~~~~~~~~~~~ 147 (298) ...+ +.++ .|+++.. ++..+|.....++.+++++...|..+.. |.....+.. .+ ..+.+.....+ T Consensus 80 ~IDq~KYfaf-~VdDD~~----Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~-~p---~vin~~~~~iv 150 (322) T protein:vir:31 80 ILRDEVYAGN-AISKKLR----QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQN-DP---NVINGVPHRFV 150 (322) T ss_pred EEehhhhhcc-ccchhHH----HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccC-Cc---ceecCCcccee Confidence 4443 4433 4777543 3446899999999999999988877632 211111110 01 11111122334 Q ss_pred ccccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhh-----ccCCceee--ccccccc--Ccceecceee Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQK-----DLQGNALF--PELKWGA--TPDTINGLPV 216 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lk-----d~~G~~l~--~~~~~~~--~~~~l~G~PV 216 (298) ..++.....|+.|.++..+|..++..-. ..|.+|.....|..+. -.++|... ......+ ..++++|+-| T Consensus 151 ~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V 230 (322) T protein:vir:31 151 GTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDL 230 (322) T ss_pred ccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceee Confidence 4455556678999999988888887632 3588899988775431 12333321 1111111 1578999999 Q ss_pred EecCcccccc-----ccccceEEEeeccceEE---------EEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEc Q lcl|Aclame:pro 217 DVNKTVSDMS-----LTQRDRAIIGDFANGFK---------WGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) Q Consensus 217 ~~s~~~~~~~-----~~~~~~~~~gd~~~~~~---------~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~ 282 (298) ++|+.++..- +........|-++.... .+-+++|. . .++..+ -.+..-.+|..+|+| T Consensus 231 ~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~--~-~e~~r~------~~~~~d~~~~~~~~g 301 (322) T protein:vir:31 231 FVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMP--T-TKSFID------DYNDDLNTATTARWG 301 (322) T ss_pred eeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhh--h-hhcccC------ccccccceeeeeeec Confidence 9999986421 00000111111111000 00111110 0 000000 011224578999999 Q ss_pred cEEecccceEEEe-ecC Q lcl|Aclame:pro 283 WGILDATKFARVT-EAN 298 (298) Q Consensus 283 ~~v~~~~a~~~l~-~a~ 298 (298) .++.||+..++|. .|+ T Consensus 302 ~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 302 NGLVRDENLVCVLANAD 318 (322) T ss_pred ceeecccceEEEEeccc Confidence 9999999998874 333 No 159 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.95 E-value=1e-10 Score=75.40 Aligned_cols=227 Identities=12% Similarity=0.025 Sum_probs=149.1 Q ss_pred Ceecc----------ccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeecccccccccccee Q lcl|Aclame:pro 1 MVLNK----------GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLA 69 (298) Q Consensus 1 mat~g----------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~ 69 (298) |++-+ -.+-|......|||.+.+.+.|++..+.+.-.+.. -...+.++-|++.|..=++..++++.++. T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceEE Confidence 66443 22334456678999999999999988877532221 12334567799999999999999999999 Q ss_pred eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc----------- Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH----------- 138 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~----------- 138 (298) +++...+-+++.+.|-+.+.+... ...++.....+...+++.+.+..++|+|+.. ..+..|.|+.. T Consensus 81 qvt~~l~ilgg~~eVDr~la~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGD~a--~~p~~F~GL~kR~~~~ta~~~~ 157 (330) T protein:vir:10 81 QVTDNCGMLEAYAEVDKALADLNG-NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDG--IAPAEFTGLSPRYNSLSAENKD 157 (330) T ss_pred EEEEEeEEecchhhhhhHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--CChhhccchhhhcCCCCCCchh Confidence 999999999999999888765433 3355666677789999999999999999421 11111111100 Q ss_pred --ccc-------------------------------------c------------------------------------- Q lcl|Aclame:pro 139 --FDS-------------------------------------K------------------------------------- 142 (298) Q Consensus 139 --~~~-------------------------------------~------------------------------------- 142 (298) +.. . T Consensus 158 qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvR 237 (330) T protein:vir:10 158 NVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVAR 237 (330) T ss_pred heeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEE Confidence 000 0 Q ss_pred c----ccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHh-hccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 143 V----TQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQ-KDLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 143 ~----~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~l-kd~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) . ...........++.+.++++...++.......+|+||......|++. .+.+.-.+-.+...+...-.++|+||. T Consensus 238 I~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir 317 (330) T protein:vir:10 238 VCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQ 317 (330) T ss_pred EeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEE Confidence 0 00000111222445667777778777777778899999999999986 455554443333434444578999999 Q ss_pred ecCccccccccccceEEE Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAII 235 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~ 235 (298) .++.+-... . .++ T Consensus 318 ~~Dail~tE----~-~vv 330 (330) T protein:vir:10 318 RTDALLNTE----S-RVV 330 (330) T ss_pred EEeeeecCc----c-ccC Confidence 988874321 1 122 No 160 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.93 E-value=5.3e-10 Score=71.43 Aligned_cols=227 Identities=13% Similarity=0.023 Sum_probs=149.3 Q ss_pred Ceecc-c--cc------c-chh-HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccce Q lcl|Aclame:pro 1 MVLNK-G--TL------F-DPE-LVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) Q Consensus 1 mat~g-g--~l------i-p~~-~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 68 (298) |++=+ + .| + |.. +...|||.+.+.++|++..+.+.-.+.. ....+.++-|.+.|..=++..++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 77632 1 11 2 222 4567999999999999999988754332 3456678889999999999999999999 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc---------- Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH---------- 138 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~---------- 138 (298) .+++-..+-+++.+.|.+.+.+... ...++...-.+.+.+++.+.+..++|+|+... .+..|.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~--~p~~F~GL~kR~~~~~a~~~ 157 (331) T protein:vir:98 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSI--DAEKFMGLTPRFNSLSAENG 157 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc--Chhhhccchhhccccccccc Confidence 9999999999999999998775444 33455566667789999999999999985210 0111111000 Q ss_pred ------------ccc------------------------------------------------------------c---c Q lcl|Aclame:pro 139 ------------FDS------------------------------------------------------------K---V 143 (298) Q Consensus 139 ------------~~~------------------------------------------------------------~---~ 143 (298) .+. . . T Consensus 158 ~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri 237 (331) T protein:vir:98 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) T ss_pred cceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEE Confidence 000 0 0 Q ss_pred ccc-----cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh-ccCC-ceeecccccccCcceecceee Q lcl|Aclame:pro 144 TQK-----VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQG-NALFPELKWGATPDTINGLPV 216 (298) Q Consensus 144 ~~~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G-~~l~~~~~~~~~~~~l~G~PV 216 (298) ... ...+.+..+..+.+.++..+++.......+|+||.+....|++.. +.+. ..+-.+...+...-.++|+|| T Consensus 238 ~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipi 317 (331) T protein:vir:98 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) T ss_pred eccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeE Confidence 000 001122234556677777787777666678999999999998863 4433 334333344445567999999 Q ss_pred EecCccccccccccceEEE Q lcl|Aclame:pro 217 DVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 217 ~~s~~~~~~~~~~~~~~~~ 235 (298) ..++.+-... . .++ T Consensus 318 r~~dai~~tE----~-~Vv 331 (331) T protein:vir:98 318 RRTDALLLTE----A-RVV 331 (331) T ss_pred EEeeeeecCc----c-ccC Confidence 9888774321 1 111 No 161 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.93 E-value=5.3e-10 Score=71.43 Aligned_cols=227 Identities=13% Similarity=0.023 Sum_probs=149.3 Q ss_pred Ceecc-c--cc------c-chh-HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccce Q lcl|Aclame:pro 1 MVLNK-G--TL------F-DPE-LVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) Q Consensus 1 mat~g-g--~l------i-p~~-~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 68 (298) |++=+ + .| + |.. +...|||.+.+.++|++..+.+.-.+.. ....+.++-|.+.|..=++..++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 77632 1 11 2 222 4567999999999999999988754332 3456678889999999999999999999 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc---------- Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH---------- 138 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~---------- 138 (298) .+++-..+-+++.+.|.+.+.+... ...++...-.+.+.+++.+.+..++|+|+... .+..|.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~--~p~~F~GL~kR~~~~~a~~~ 157 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSI--DAEKFMGLTPRFNSLSAENG 157 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc--Chhhhccchhhccccccccc Confidence 9999999999999999998775444 33455566667789999999999999985210 0111111000 Q ss_pred ------------ccc------------------------------------------------------------c---c Q lcl|Aclame:pro 139 ------------FDS------------------------------------------------------------K---V 143 (298) Q Consensus 139 ------------~~~------------------------------------------------------------~---~ 143 (298) .+. . . T Consensus 158 ~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri 237 (331) T protein:vir:10 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) T ss_pred cceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEE Confidence 000 0 0 Q ss_pred ccc-----cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh-ccCC-ceeecccccccCcceecceee Q lcl|Aclame:pro 144 TQK-----VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQG-NALFPELKWGATPDTINGLPV 216 (298) Q Consensus 144 ~~~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G-~~l~~~~~~~~~~~~l~G~PV 216 (298) ... ...+.+..+..+.+.++..+++.......+|+||.+....|++.. +.+. ..+-.+...+...-.++|+|| T Consensus 238 ~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipi 317 (331) T protein:vir:10 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) T ss_pred eccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeE Confidence 000 001122234556677777787777666678999999999998863 4433 334333344445567999999 Q ss_pred EecCccccccccccceEEE Q lcl|Aclame:pro 217 DVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 217 ~~s~~~~~~~~~~~~~~~~ 235 (298) ..++.+-... . .++ T Consensus 318 r~~dai~~tE----~-~Vv 331 (331) T protein:vir:10 318 RRTDALLLTE----A-RVV 331 (331) T ss_pred EEeeeeecCc----c-ccC Confidence 9888774321 1 111 No 162 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.93 E-value=5.3e-10 Score=71.43 Aligned_cols=227 Identities=13% Similarity=0.023 Sum_probs=149.3 Q ss_pred Ceecc-c--cc------c-chh-HHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccce Q lcl|Aclame:pro 1 MVLNK-G--TL------F-DPE-LVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) Q Consensus 1 mat~g-g--~l------i-p~~-~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 68 (298) |++=+ + .| + |.. +...|||.+.+.++|++..+.+.-.+.. ....+.++-|.+.|..=++..++++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 77632 1 11 2 222 4567999999999999999988754332 3456678889999999999999999999 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc---------- Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH---------- 138 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~---------- 138 (298) .+++-..+-+++.+.|.+.+.+... ...++...-.+.+.+++.+.+..++|+|+... .+..|.|+.. T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~G-n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~--~p~~F~GL~kR~~~~~a~~~ 157 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNG-NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSI--DAEKFMGLTPRFNSLSAENG 157 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc--Chhhhccchhhccccccccc Confidence 9999999999999999998775444 33455566667789999999999999985210 0111111000 Q ss_pred ------------ccc------------------------------------------------------------c---c Q lcl|Aclame:pro 139 ------------FDS------------------------------------------------------------K---V 143 (298) Q Consensus 139 ------------~~~------------------------------------------------------------~---~ 143 (298) .+. . . T Consensus 158 ~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri 237 (331) T protein:vir:10 158 QNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRI 237 (331) T ss_pred cceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEE Confidence 000 0 0 Q ss_pred ccc-----cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhh-ccCC-ceeecccccccCcceecceee Q lcl|Aclame:pro 144 TQK-----VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQG-NALFPELKWGATPDTINGLPV 216 (298) Q Consensus 144 ~~~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G-~~l~~~~~~~~~~~~l~G~PV 216 (298) ... ...+.+..+..+.+.++..+++.......+|+||.+....|++.. +.+. ..+-.+...+...-.++|+|| T Consensus 238 ~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipi 317 (331) T protein:vir:10 238 ANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPC 317 (331) T ss_pred eccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeE Confidence 000 001122234556677777787777666678999999999998863 4433 334333344445567999999 Q ss_pred EecCccccccccccceEEE Q lcl|Aclame:pro 217 DVNKTVSDMSLTQRDRAII 235 (298) Q Consensus 217 ~~s~~~~~~~~~~~~~~~~ 235 (298) ..++.+-... . .++ T Consensus 318 r~~dai~~tE----~-~Vv 331 (331) T protein:vir:10 318 RRTDALLLTE----A-RVV 331 (331) T ss_pred EEeeeeecCc----c-ccC Confidence 9888774321 1 111 No 163 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.82 E-value=2.6e-10 Score=73.16 Aligned_cols=257 Identities=10% Similarity=0.045 Sum_probs=140.6 Q ss_pred Ceecc----ccccchhHHH---HHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeccccccccccceee-- Q lcl|Aclame:pro 1 MVLNK----GTLFDPELVT---DLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLAP-- 70 (298) Q Consensus 1 mat~g----g~lip~~~~~---~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~-- 70 (298) ||..+ -.|.+++... .+-.-+.+-..++...|.+||..+ .+++|.+.....+.-|+||+++|-++.+.++ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~ 80 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDK 80 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeeee Confidence 88764 3455444222 222222222334444488899854 6899999988999999999999999999864 Q ss_pred -EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc Q lcl|Aclame:pro 71 -QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 71 -v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) .+++.+|.+.. +|.|.++.+. .-+-..+-.++|..+|+.++|..+|.-...++.+ . . T Consensus 81 t~t~kikK~rK~--tTdEAIqlsG--ygdpvgead~qL~~~ia~kId~D~~~~lktat~t-----------------~-t 138 (295) T protein:vir:99 81 DYTVKWFKKRRA--TTAEAIARHG--AARAITEADKRIMRELQNGIKDAFFTFLKTKPTK-----------------V-K 138 (295) T ss_pred eeEEEeeeeccc--ccHHHHHhcC--CCchhHHHHHHHHHHHHHhhhHHHHHHhccCcee-----------------e-e Confidence 77778887774 4999874333 1234578889999999999999999653211111 0 1 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccC-Cc-eeecccccccCcceeccee-eEecCcccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQ-GN-ALFPELKWGATPDTINGLP-VDVNKTVSDMS 226 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~-G~-~l~~~~~~~~~~~~l~G~P-V~~s~~~~~~~ 226 (298) .......++.+.+.+..+...+..+.+.++||.....||+-..-+ .+ -.|.-.. --.++|.- |+.+..+|.+. T Consensus 139 g~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~----L~nfLG~q~II~S~kv~~G~ 214 (295) T protein:vir:99 139 GVGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTL----LKNFLGMQNVIVMPSVPEGK 214 (295) T ss_pred hhhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhh----hhhhhccceEEEcccCCCce Confidence 111122345555556666666666778999999999987654221 11 1110000 01388996 88999998752 Q ss_pred ccccceEEEeeccceE--EEEee-cceEEEEeecccccccchhhhhcCcEEEE-----------EEEEEc--cEEecccc Q lcl|Aclame:pro 227 LTQRDRAIIGDFANGF--KWGYA-KEVPLEVIQYGDPDNSGLDLKGYNQVYIR-----------AELFLG--WGILDATK 290 (298) Q Consensus 227 ~~~~~~~~~gd~~~~~--~~~~~-~~~~i~~~~~~~~~~~~~~~f~~n~v~~r-----------~~~r~~--~~v~~~~a 290 (298) . +..--.+.. ....+ +++.=.+.... | ++|.|.+. -....+ +-..++++ T Consensus 215 ~------~aT~~~Ni~~ay~~~~~g~l~~~f~~~~--D-------~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dg 279 (295) T protein:vir:99 215 I------YSTAVENLVFASLNVKGGDLGGLFADFT--D-------ETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEG 279 (295) T ss_pred E------EEeeccceEEEEecCCchhhhhhhhhcc--C-------cccceEEEeccccceeeehhhhHhHHHhcccccce Confidence 2 211111110 01111 11110000000 1 01111110 000001 11345666 Q ss_pred eEEEeecC Q lcl|Aclame:pro 291 FARVTEAN 298 (298) Q Consensus 291 ~~~l~~a~ 298 (298) +++.+--. T Consensus 280 iv~~tI~~ 287 (295) T protein:vir:99 280 VVEATIEA 287 (295) T ss_pred EEEEEEec Confidence 66665522 No 164 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.80 E-value=2.7e-09 Score=67.59 Aligned_cols=276 Identities=11% Similarity=0.054 Sum_probs=161.9 Q ss_pred Ce--ec-cccccchh---HHHHHHHHHHhhchhhhhcceee-cCCC--ceEEEEEeCCcceEEeecc-ccccccccceee Q lcl|Aclame:pro 1 MV--LN-KGTLFDPE---LVTDLISKVAGKSSIARLSAQKP-IPFN--GEKVFTFTMDSEIDVVAES-GKKTHGGVTLAP 70 (298) Q Consensus 1 ma--t~-gg~lip~~---~~~~ii~~~~~~s~i~~~~~~~~-~~~~--~~~ip~~~~~~~a~~v~E~-~~~~~~~~~~~~ 70 (298) |. .+ .|.+.-.| +-+.|++...+....+++..... .+-+ .+.++..+..+.+.|++.. ..+|..+..+.+ T Consensus 29 ~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~ 108 (329) T protein:vir:79 29 AKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTS 108 (329) T ss_pred ceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCcccccceeecccce Confidence 22 12 23444433 45678887777777777776543 3222 4566667777889998865 568888888888 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc--- Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV--- 147 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~--- 147 (298) .....+.++....++.+=++.+.....++...-....++++++.+|+-+|+|.. ..+ ..|+.+..+..+... T Consensus 109 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~-~~g----~~GLlN~p~v~~~~~~~~ 183 (329) T protein:vir:79 109 EFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSK-PHK----IISVFEHPNLTTINSAGW 183 (329) T ss_pred eEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecc-ccc----ceeeecCCCccccccCCC Confidence 778888888888887654444444456788888889999999999999999942 111 233333333221111 Q ss_pred ----ccccccchhHHHHHHHhhhhhhc--C-CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC Q lcl|Aclame:pro 148 ----EAPRGIADPNGAIENAVELLTGV--D-ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK 220 (298) Q Consensus 148 ----~~~~~~~~~~~~i~~~~~~l~~~--~-~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~ 220 (298) -...+....+++|..++.++... + ..+..++|+|+.+..|.......|.-++.-......+-+|-+.|-+. T Consensus 184 ~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~el~-- 261 (329) T protein:vir:79 184 NNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMSYLDYFKQQNGGITIESISELE-- 261 (329) T ss_pred CCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCccHHHHHHHhCCCcEEEEccccc-- Confidence 11122344678899998888764 2 24678999999999887655566655543332222233444444332 Q ss_pred ccccccccccceEEEeeccc-eEEEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEEEc-cEEecccceEEEeec Q lcl|Aclame:pro 221 TVSDMSLTQRDRAIIGDFAN-GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELFLG-WGILDATKFARVTEA 297 (298) Q Consensus 221 ~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r~~-~~v~~~~a~~~l~~a 297 (298) ...+.+.+.+++-+.+. .+.+...+.+ ...+- . .+++ ..+....|++ ..+.+|.||+++.|. T Consensus 262 ---~ag~~g~~~~v~y~~~~~~~~~~vp~~~--~~l~~-q---------~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 262 ---DIDGAGTKAALVYEKDPMNMSIEIPEAF--NMLTA-Q---------PKDLHFKVPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred ---ccCCCCceEEEEEecCCceEEEecCcce--eeeec-e---------ecCceEEEceeeeEEEEEEECcceeeeeeee Confidence 22223445555544332 2222222222 22210 0 1121 2334556665 566789999999998 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) - T Consensus 327 ~ 327 (329) T protein:vir:79 327 V 327 (329) T ss_pred e Confidence 8 No 165 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.74 E-value=1.1e-08 Score=64.16 Aligned_cols=280 Identities=8% Similarity=-0.026 Sum_probs=150.2 Q ss_pred Ceecccc-------ccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCC-cceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGT-------LFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMD-SEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~-------lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~-~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||+-... ..-+++.++|...-....|+..+.......+.-.++...+-. +...-..||.+.+.....-.... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 7655543 344666666666656667887776665555444555554332 22234558876655433222211 Q ss_pred Eeee-EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-ccccc---ccccccccccc----- Q lcl|Aclame:pro 73 MVPI-KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPR-LGTAS---AVIGTNHFDSK----- 142 (298) Q Consensus 73 l~~~-k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~-~g~~~---~~~~~~~~~~~----- 142 (298) -..- -+...+.||.-+.........+...+-..+-..++.+-+|.++++|.... ++..+ ...|+...... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 1111 12334455554322111111222223233334558899999999996321 12121 22222111100 Q ss_pred ---------cccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccc---ccCc-- Q lcl|Aclame:pro 143 ---------VTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKW---GATP-- 208 (298) Q Consensus 143 ---------~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~---~~~~-- 208 (298) .+......+......++|.+++.++..++..+..++|+|.....|.++-..++.++..+... +... T Consensus 161 ~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~ 240 (317) T protein:vir:88 161 ANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDV 240 (317) T ss_pred cCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEE Confidence 01111112222345678999999999999999999999999999998854455555322111 0001 Q ss_pred -ceecc-eeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEe Q lcl|Aclame:pro 209 -DTING-LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL 286 (298) Q Consensus 209 -~~l~G-~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 286 (298) -+=+| +.++.+..||. +.+++.|++.. .+..-+++..+-... + -+......+..+++.+. T Consensus 241 ~~tdfG~v~ii~~r~lp~------~~~~~~D~~~~-~l~~Lr~~~~e~laK-----t------Gd~~k~~i~~E~tLe~~ 302 (317) T protein:vir:88 241 YESDFGKYTIRANRWFHE------NTLFVFDPKMH-SLCYLRPFFQHELAK-----T------GDSEKRQLLVEYTFRVN 302 (317) T ss_pred EEeCCeEEEEEeCCCCCC------CeEEEEccccc-ceeecccceeeccCC-----C------cccceeEEEEEEEEEEc Confidence 01123 47777888874 46788887742 333333443321111 0 12233455678999999 Q ss_pred cccceEEEeecC Q lcl|Aclame:pro 287 DATKFARVTEAN 298 (298) Q Consensus 287 ~~~a~~~l~~a~ 298 (298) +++|.+++.+-+ T Consensus 303 N~~a~a~i~~l~ 314 (317) T protein:vir:88 303 NEKSGALIRDVV 314 (317) T ss_pred CccceeEEEEec Confidence 999999999998 No 166 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.74 E-value=1.9e-09 Score=68.40 Aligned_cols=228 Identities=13% Similarity=0.024 Sum_probs=141.2 Q ss_pred Ceeccc----------cccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeecccccccccccee Q lcl|Aclame:pro 1 MVLNKG----------TLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLA 69 (298) Q Consensus 1 mat~gg----------~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~ 69 (298) |++-+- .+-+......|||.+.+.+.|++..+.+.-.+.. -...+.++-|.+.|..=++..++++.++. T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQTV 80 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceEE Confidence 665431 2233445667999999999999988877533221 12344577799999999999999999999 Q ss_pred eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc----------- Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNH----------- 138 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~----------- 138 (298) +++...+-+++.+.|-+.|.+... ...++...-.+...+++.+.+..++|+|+.. ..+..|.|+.. T Consensus 81 qvt~~l~ilgg~~eVDr~La~~~G-n~a~~ra~e~~~~ikam~q~~~~~~iyGDsa--~~p~~FdGL~kR~~~~st~~a~ 157 (335) T protein:vir:73 81 PVTDTTGMLYDLGFVDKALADRSN-NAAAFRVSENMGKLQGFNNKVARYSIYGNTD--AEPEAFMGLAPRFNTLSTSKAA 157 (335) T ss_pred EEEEEEEEecchhhhhHHHHhhcC-CHHHHHHHHHHHHHHHHHHHHHHHhccCCcC--CChhhccchhhhhcCccccccC Confidence 999999999999999887765444 3455666667778999999999999999421 11122222100 Q ss_pred -----ccc---------------------------------------------------------------------c-- Q lcl|Aclame:pro 139 -----FDS---------------------------------------------------------------------K-- 142 (298) Q Consensus 139 -----~~~---------------------------------------------------------------------~-- 142 (298) ++. . T Consensus 158 ~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vv 237 (335) T protein:vir:73 158 SAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSIS 237 (335) T ss_pred cccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEE Confidence 000 0 Q ss_pred -ccccccc-----ccccchhHHHHHHHhh--hhhhcCCcccEEEEcHHHHHHHHHhh-ccCCceeecccccccCcceecc Q lcl|Aclame:pro 143 -VTQKVEA-----PRGIADPNGAIENAVE--LLTGVDADVTGIAINPSFRSALAKQK-DLQGNALFPELKWGATPDTING 213 (298) Q Consensus 143 -~~~~~~~-----~~~~~~~~~~i~~~~~--~l~~~~~~~~~~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~~~~~l~G 213 (298) ......+ .....++.+.+++++. .++.-.....+|+||......|++.. +.....+-.+...+...-.++| T Consensus 238 RI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~g 317 (335) T protein:vir:73 238 RICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLG 317 (335) T ss_pred EEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECC Confidence 0000000 1111233333444432 23332333467999999999998764 4444444333333344456889 Q ss_pred eeeEecCccccccccccceEEEe Q lcl|Aclame:pro 214 LPVDVNKTVSDMSLTQRDRAIIG 236 (298) Q Consensus 214 ~PV~~s~~~~~~~~~~~~~~~~g 236 (298) +||..++.+-... ..++. T Consensus 318 ipir~~Dail~tE-----~~v~~ 335 (335) T protein:vir:73 318 IPIRRVDAILNTE-----SAVTA 335 (335) T ss_pred eEEEEEeeeecCc-----ccccC Confidence 9999988875321 11111 No 167 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.62 E-value=1.3e-09 Score=69.31 Aligned_cols=259 Identities=11% Similarity=0.028 Sum_probs=134.7 Q ss_pred Ceecccc----------c---cchhHHHHHHHHHHhhchhhhhcceeecCCCc-e-EEEEEeCCcceEEeeccccccccc Q lcl|Aclame:pro 1 MVLNKGT----------L---FDPELVTDLISKVAGKSSIARLSAQKPIPFNG-E-KVFTFTMDSEIDVVAESGKKTHGG 65 (298) Q Consensus 1 mat~gg~----------l---ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~-~ip~~~~~~~a~~v~E~~~~~~~~ 65 (298) |.++--+ | ..-++.+++-.-+.+-.-++...|.+||..++ + .+|.++....+.-|+||+.+|-++ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplsk 80 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccchhh Confidence 6666532 1 11223333323233333344445889998654 5 456688889999999999999999 Q ss_pred cceee---EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccc Q lcl|Aclame:pro 66 VTLAP---QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK 142 (298) Q Consensus 66 ~~~~~---v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 142 (298) .+.++ .+++.+|.+.-+ |.|.++.+- .-+-..+-.++|..+++.++|..+|.-...++++.. T Consensus 81 vt~~~~~t~t~~ikK~rK~t--TdEAIqlsG--yg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~----------- 145 (296) T protein:vir:98 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYG--SNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD----------- 145 (296) T ss_pred heeeecceEEEEeecccccc--CHHHHHhhc--CCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceee----------- Confidence 99864 778888877764 999874333 123456788999999999999999865322111100 Q ss_pred cccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcce-ecceeeEecCc Q lcl|Aclame:pro 143 VTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDT-INGLPVDVNKT 221 (298) Q Consensus 143 ~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~-l~G~PV~~s~~ 221 (298) ....+-.......+.++...+........+.++||.....+++-.. +-.+...+..... ++|.-|+.++. T Consensus 146 ----~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~-----it~qt~fG~tyl~nfLG~~II~S~k 216 (296) T protein:vir:98 146 ----ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAG-----ITTQTAFGLTYLVDFTGTVIISTND 216 (296) T ss_pred ----echhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCc-----cchhheechhhhhhccccEEEEcCc Confidence 0011111112233444445565554456678999999988754321 1111111222222 88988899999 Q ss_pred cccccccccceEEEeeccc--eEEEEee-cceEEEEeecccccccchhhhhcCcEEEEEE----------EEEcc---EE Q lcl|Aclame:pro 222 VSDMSLTQRDRAIIGDFAN--GFKWGYA-KEVPLEVIQYGDPDNSGLDLKGYNQVYIRAE----------LFLGW---GI 285 (298) Q Consensus 222 ~~~~~~~~~~~~~~gd~~~--~~~~~~~-~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~----------~r~~~---~v 285 (298) +|.+. ++..--.+ .+....+ +++.-....+.+. +|.|.+.-. ..+.+ -. T Consensus 217 V~~G~------~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~---------tglIGv~h~~~~~~~t~eT~~~~~~~lfp 281 (296) T protein:vir:98 217 VTKGE------IWATVPENIIFAYINPNNSELAKEFNLYGDP---------TGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) T ss_pred CCCce------EEEeeecceEEEeecccccchhhhhcccccc---------ccceEEEeccccceeeehhHhHhHHHhcc Confidence 98652 22211111 1111111 2222222222211 111111000 00111 12 Q ss_pred ecccceEEEeecC Q lcl|Aclame:pro 286 LDATKFARVTEAN 298 (298) Q Consensus 286 ~~~~a~~~l~~a~ 298 (298) .+++++++.+--= T Consensus 282 E~~dgiv~~tI~~ 294 (296) T protein:vir:98 282 ERIDGIVKVTLTP 294 (296) T ss_pred cccceEEEEEecC Confidence 3445555543311 No 168 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.46 E-value=1.8e-07 Score=57.58 Aligned_cols=261 Identities=9% Similarity=-0.010 Sum_probs=134.3 Q ss_pred Ceecccccc-chhHHHHHHHHHHhhchhhhhcceeec----C-CCceEEEEEeCCcceEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLF-DPELVTDLISKVAGKSSIARLSAQKPI----P-FNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~li-p~~~~~~ii~~~~~~s~i~~~~~~~~~----~-~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) ||+.-..++ |+.+..++++.++++.++.+++.+-.- . +..++||+... .-+.++..++-.+++-+++++. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecccCCccccccccceEEEE Confidence 999888777 666779999999999998888765321 2 24688887332 2223444444445555555555 Q ss_pred eeE-EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIK-VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) Q Consensus 75 ~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) ..+ ...-+.++++=+. -+..++.+.+.+..+++++..+|..++.-.. +.. ......... T Consensus 77 id~~k~~~~~itD~e~a---~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~---~a~--------------~~~gt~gt~ 136 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKT---LDIMQFSERYLKSGMVQIANQIDRSLALTLK---KAF--------------HSSGTPGVR 136 (418) T ss_pred EecccccceeechHHHh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcc--------------cccccCCcC Confidence 422 2334566655321 1224677778888899999999988764211 000 000111122 Q ss_pred chhHHHHHHHhhhhhhcCCccc--E-EEEcHHHHHHHHHhhccCCcee---ecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 154 ADPNGAIENAVELLTGVDADVT--G-IAINPSFRSALAKQKDLQGNAL---FPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 154 ~~~~~~i~~~~~~l~~~~~~~~--~-~vm~~~~~~~L~~lkd~~G~~l---~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ...++++.++..+|...+.... . .+++|..+..|.+ |...... -......+..+++.|+.|+.++++|.... T Consensus 137 ~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta 214 (418) T protein:vir:10 137 PGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSD--EVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV 214 (418) T ss_pred cchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhh--hccccccccccchhhheeeeeeeeceEEEEecCCCcccc Confidence 3348889999888888777532 3 5899998877753 2222111 11223456678999999999999995433 Q ss_pred cc-cc-eEEEeeccceEEEEeecc-----eEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQ-RD-RAIIGDFANGFKWGYAKE-----VPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~-~~-~~~~gd~~~~~~~~~~~~-----~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .. .. ..+.|-...+..+....+ -.+..-+.....+ .|.-|.+... ...++.-|++...++ T Consensus 215 g~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~g----v~~v~~~t~~-------~~~~~~~f~V~~~~~ 281 (418) T protein:vir:10 215 GDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGG----VFGVNPQNYE-------TTGLLQEFVVLEDVD 281 (418) T ss_pred cccccceeeecccccceeEEEeecceeeccceeeccEEEECc----eeeccccccc-------ccccceEEEEEeecc Confidence 21 12 223233222221111000 0011111111000 0000111000 001223333333221 No 169 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.43 E-value=1.3e-07 Score=58.26 Aligned_cols=276 Identities=14% Similarity=0.058 Sum_probs=133.4 Q ss_pred Ceecc-----ccccchhHHHHHHH-HHHhhchhhhhccee---------ecCCCceEEEEEeC-CcceEEeeccc---cc Q lcl|Aclame:pro 1 MVLNK-----GTLFDPELVTDLIS-KVAGKSSIARLSAQK---------PIPFNGEKVFTFTM-DSEIDVVAESG---KK 61 (298) Q Consensus 1 mat~g-----g~lip~~~~~~ii~-~~~~~s~i~~~~~~~---------~~~~~~~~ip~~~~-~~~a~~v~E~~---~~ 61 (298) |+.-. ..+|-||+....++ ...+.+.+.+-+-+. ..++.-+++|.+.. .++..-+.|.. .. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 88544 34566666666543 334555544332222 22344579998754 34444343433 23 Q ss_pred ccccccee-eEEEeeeEEEE--EEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHh---ccccccccccc-cc- Q lcl|Aclame:pro 62 THGGVTLA-PQTMVPIKVEY--GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAF---HGVNPRLGTAS-AV- 133 (298) Q Consensus 62 ~~~~~~~~-~v~l~~~k~~~--~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l---~G~~~~~g~~~-~~- 133 (298) +..+.+-+ ++-...+.-.+ ...++.++- ..+.++.|.+++++-..+...+.+| .|.=..+.... .. T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~ls------G~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~ 154 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELA------GSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATI 154 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhh------CchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhh Confidence 33344433 22222222222 234444431 1356778888888766655544443 34211000000 00 Q ss_pred ------------ccccccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecc Q lcl|Aclame:pro 134 ------------IGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPE 201 (298) Q Consensus 134 ------------~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~ 201 (298) ....++....+. .......-..+.+.++..++.....+-++++||+.++..|++++- =.|+ ++ T Consensus 155 ~~~~~~~a~~~~~~~~~~~Dis~~--t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~l--i~~i-~~ 229 (367) T protein:vir:80 155 KTRGRVPAEVLGTAGDMVIDISGQ--TNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDE--IEFI-PD 229 (367) T ss_pred hhhhccccccccccCceeeeeecc--CCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccc--cccc-cC Confidence 000111111110 011112233567888988888877788899999999999998751 0011 11 Q ss_pred cccccCcceecceeeEecCcccccccccc---ceEEEeeccceEEEEeecce-EEEEeecccccccchhhhhcCcEEEEE Q lcl|Aclame:pro 202 LKWGATPDTINGLPVDVNKTVSDMSLTQR---DRAIIGDFANGFKWGYAKEV-PLEVIQYGDPDNSGLDLKGYNQVYIRA 277 (298) Q Consensus 202 ~~~~~~~~~l~G~PV~~s~~~~~~~~~~~---~~~~~gd~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~f~~n~v~~r~ 277 (298) ......-++++|++|++++.||.....+. .+.+||. +++.|+..... .+++.++...... -++-.+.- T Consensus 230 sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~--GAi~~~~~~~~~~~E~~Rd~~~~~~------gG~d~L~~ 301 (367) T protein:vir:80 230 SKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG--AAFGYADGAPQVPVAVGRRELRGNG------SGLEYILE 301 (367) T ss_pred CCCccccceecceeEEEeCCCcccccCCCceEEEEEEec--ceeeecccCCccceecccchhhhcC------CceEEEEe Confidence 11123456899999999999996543222 2345552 44545443322 2455544321100 12222233 Q ss_pred EEEEccEEecccceEEEeec--------------------C Q lcl|Aclame:pro 278 ELFLGWGILDATKFARVTEA--------------------N 298 (298) Q Consensus 278 ~~r~~~~v~~~~a~~~l~~a--------------------~ 298 (298) +.| .+.+|.+|...+.+ | T Consensus 302 Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 302 RKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred eee---EEeecceeeecccccccccccccccccccccCCCC Confidence 333 36788887765421 1 No 170 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.42 E-value=1.3e-07 Score=58.28 Aligned_cols=272 Identities=11% Similarity=-0.022 Sum_probs=130.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee---ecC---CCceEEEEEeCCcceEEe-----ecccccccccccee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK---PIP---FNGEKVFTFTMDSEIDVV-----AESGKKTHGGVTLA 69 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~---~~~---~~~~~ip~~~~~~~a~~v-----~E~~~~~~~~~~~~ 69 (298) ||.+- ++|+.+..++++.+++..++.+++.+- ... +..++||+... ..+.++ +++..+...+.+-+ T Consensus 1 Ma~~~--~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) T protein:vir:99 1 MANAF--SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTED 77 (392) T ss_pred Ccccc--ccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccc Confidence 99543 778888899999999999888887532 222 23488887543 333332 34555555566666 Q ss_pred eEEEeeeE-EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc Q lcl|Aclame:pro 70 PQTMVPIK-VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 70 ~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) ++++...+ .+.-+.++++-.. ....++...+.++..++++.++|..++.-.. +..... . ... T Consensus 78 ~~~~~id~~k~~~~~i~d~e~~---~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~---~a~~~~---------~--~~~ 140 (392) T protein:vir:99 78 SFPVTLTDVAYHLGVLTDEELT---FDLESFATQILPRQVRGVADILEEGVRDMIV---GAPYEA---------A--GAV 140 (392) T ss_pred eEEEEEeeeeecceeechHHHh---hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh---cccccc---------c--ccc Confidence 66666533 3334566666332 2234567777888899999999988774311 000000 0 001 Q ss_pred cccccchhHHHHHHHhhhhhhcCCcc-cEEEEcHHHHHHHHHhhc-cCCceeec---ccccccCcceecceeeEecCccc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDADV-TGIAINPSFRSALAKQKD-LQGNALFP---ELKWGATPDTINGLPVDVNKTVS 223 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~vm~~~~~~~L~~lkd-~~G~~l~~---~~~~~~~~~~l~G~PV~~s~~~~ 223 (298) ........|+.|.++..+|...+... -.++++|..+..|.+... .+-.+.-. .....+..+++.|++|+.++++| T Consensus 141 ~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~ 220 (392) T protein:vir:99 141 HEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP 220 (392) T ss_pred cccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccc Confidence 11123345788989888887776643 257999999988865410 10011100 11234566899999999999987 Q ss_pred cccccccc-eE-EEe--------eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEE-------EEEEEccEEe Q lcl|Aclame:pro 224 DMSLTQRD-RA-IIG--------DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIR-------AELFLGWGIL 286 (298) Q Consensus 224 ~~~~~~~~-~~-~~g--------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r-------~~~r~~~~v~ 286 (298) ......-. .. .++ +......+..-..+...+....+.. +..+...+. .....+.... T Consensus 221 ~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t------~~s~~~~v~~~~g~~~v~~~~~~~~~ 294 (392) T protein:vir:99 221 HGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDST------ITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) T ss_pred cccceeeeccccccccccccccccccceeEEecccceecceeecccce------eeccccccceeEEEEEEeecccccee Confidence 65321100 00 000 0000000000001111111000000 000000000 0000000000 Q ss_pred cccc------eEEEeecC Q lcl|Aclame:pro 287 DATK------FARVTEAN 298 (298) Q Consensus 287 ~~~a------~~~l~~a~ 298 (298) .... -+.+.+.+ T Consensus 295 ~~~~~~~~~~~v~v~~v~ 312 (392) T protein:vir:99 295 RARKIHLIPGSIEVAPEA 312 (392) T ss_pred eeeeeeeecceeeeeeee Confidence 0000 00111111 No 171 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=98.40 E-value=4.5e-08 Score=60.87 Aligned_cols=288 Identities=9% Similarity=-0.051 Sum_probs=145.6 Q ss_pred Ceec--------cccccc------hh----HHHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccc Q lcl|Aclame:pro 1 MVLN--------KGTLFD------PE----LVTDLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESG 59 (298) Q Consensus 1 mat~--------gg~lip------~~----~~~~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~ 59 (298) |... ...|.| ++ +.+.+|+.+-.-..+.++..+...+.. ...+++.+..+.+.+++..+ T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~ 135 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGG 135 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEecccc Confidence 2221 011111 22 235666666555556666666554332 35666667778999999888 Q ss_pred cccccccceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|Aclame:pro 60 KKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHF 139 (298) Q Consensus 60 ~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~ 139 (298) ..|..+.......-..+.++..+.++.+=++.......++.+.-+...++++.+.+++-.|+|.++......++..--++ T Consensus 136 d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l 215 (379) T protein:vir:10 136 NMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNL 215 (379) T ss_pred CCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCC Confidence 88877766666566667777777776542333334556788888899999999999999999964332222222221111 Q ss_pred ccc---cccccc---c-ccccchhHHHHHHHhhhhhhcCC-------cccEEEEcHHHHHHHHHhhccCCceeecccccc Q lcl|Aclame:pro 140 DSK---VTQKVE---A-PRGIADPNGAIENAVELLTGVDA-------DVTGIAINPSFRSALAKQKDLQGNALFPELKWG 205 (298) Q Consensus 140 ~~~---~~~~~~---~-~~~~~~~~~~i~~~~~~l~~~~~-------~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~ 205 (298) ... ++.... + ..+....++||..++.++...-. .+..++|.|..+..|.+- +..|.-++.-.... T Consensus 216 ~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk~n 294 (379) T protein:vir:10 216 PAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQYMRES 294 (379) T ss_pred cccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHHHHHHh Confidence 111 111111 1 22333467888888887664321 223689999999998753 33343332211111 Q ss_pred cCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc-E Q lcl|Aclame:pro 206 ATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW-G 284 (298) Q Consensus 206 ~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~-~ 284 (298) +-++-++..+.+....+.+....++.+-....-....+.+..-+-.....-... ...-.....+..|.++ . T Consensus 295 -----~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve---~~~~~~~~~~~~rt~Gv~ 366 (379) T protein:vir:10 295 -----YPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVE---KKIKGYAEGYTNATAGAM 366 (379) T ss_pred -----cCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccce---ecCceeEeccccceeeee Confidence 223334444444332222233344433111000000000001010000000000 0001122345555555 4 Q ss_pred EecccceEEEeec Q lcl|Aclame:pro 285 ILDATKFARVTEA 297 (298) Q Consensus 285 v~~~~a~~~l~~a 297 (298) +.+|.||+++.|| T Consensus 367 ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 367 LKRPFATYRQTGA 379 (379) T ss_pred eecchhhheecCC Confidence 5579999999999 No 172 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=98.40 E-value=5e-08 Score=60.63 Aligned_cols=274 Identities=8% Similarity=-0.069 Sum_probs=147.7 Q ss_pred Ceecc-----------ccccchh----HHHHHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCcceEEeecccccc Q lcl|Aclame:pro 1 MVLNK-----------GTLFDPE----LVTDLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKT 62 (298) Q Consensus 1 mat~g-----------g~lip~~----~~~~ii~~~~~~s~i~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~~~ 62 (298) ||.++ ..-||.- +.++|++...+....+++.++.+.+. ..+.+++.+..+.+.+++.++..| T Consensus 35 ~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~P 114 (339) T protein:vir:94 35 YAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANG 114 (339) T ss_pred hhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCC Confidence 33333 2335422 22566666667777778887766654 246888888889999999988887 Q ss_pred ccc--cceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|Aclame:pro 63 HGG--VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFD 140 (298) Q Consensus 63 ~~~--~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~ 140 (298) ..+ .++.+.++....++-.+. ..|+-+ ......++.+.-+...++++.+.+++..|+|... .+..|+.+.+ T Consensus 115 l~~~~v~~~~~~v~~~~~g~~y~-~~E~~~-A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~-----~~~~GLlN~P 187 (339) T protein:vir:94 115 MSKANVNFESRQNYRYQTWTEYG-DLEMAT-YGEAGIDYVARQEISASLVMAKFANSSYLLGVAG-----IANYGLMNDP 187 (339) T ss_pred cccccceeeEEeEEEEEEEEeec-HHHHHH-HHhhCCChHHHHHHHHHHHHHHhhceEEeeeecc-----cceEEEEeCC Confidence 665 556666666655555443 244432 2234467888888888999999999999999532 1223333332 Q ss_pred ccc---cccccc-ccccchhHHHHHHHhhhhhhcCC------cccEEEEcHHHHHHHHHhhccCCceeecccccccCcce Q lcl|Aclame:pro 141 SKV---TQKVEA-PRGIADPNGAIENAVELLTGVDA------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDT 210 (298) Q Consensus 141 ~~~---~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~------~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 210 (298) +.. +....+ ..+....++||..++.++...-. .+..++|.|+.+..|.+- +..|.-++.-.... T Consensus 188 ~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n----- 261 (339) T protein:vir:94 188 SLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKIAQT----- 261 (339) T ss_pred CccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHHHHh----- Confidence 221 111222 23334457889888888865432 234699999999988653 44444343211111 Q ss_pred ecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc-EEeccc Q lcl|Aclame:pro 211 INGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW-GILDAT 289 (298) Q Consensus 211 l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~-~v~~~~ 289 (298) +.++.++..+.+.+ +++.....++-.. .....+.+.+......-... ...-.....+..|.++ .+++|. T Consensus 262 ~pnl~i~~~~el~~-a~g~~~~~~~~~~------~~~~~~~~~~p~~~~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ 331 (339) T protein:vir:94 262 YPNIQFVAVPEFDT-ASGRLVQLWVPEV------NGQPTGEVAFAEKLRSHSIE---RYSTTTRQKHSGATFGAVIYQPW 331 (339) T ss_pred cCCcEEEEcccccc-CCCceEEEEEEec------cCCcceEEEcchhhhccccE---EcCceEEecceeeeeeEEEEccc Confidence 12333443333322 1111111111111 00111222221111000000 0011133456667555 556899 Q ss_pred ceEEEeec Q lcl|Aclame:pro 290 KFARVTEA 297 (298) Q Consensus 290 a~~~l~~a 297 (298) ||+++.|. T Consensus 332 ai~~~~GI 339 (339) T protein:vir:94 332 AVTQELGV 339 (339) T ss_pred eeeeeecC Confidence 99999999 No 173 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=98.33 E-value=1.3e-06 Score=52.91 Aligned_cols=273 Identities=14% Similarity=0.121 Sum_probs=130.7 Q ss_pred Ceecc-ccccchh--HHHHHHH-HHHhhchhhhhccee---------ecCCCceEEEEEeC-CcceE--Eeecc--cccc Q lcl|Aclame:pro 1 MVLNK-GTLFDPE--LVTDLIS-KVAGKSSIARLSAQK---------PIPFNGEKVFTFTM-DSEID--VVAES--GKKT 62 (298) Q Consensus 1 mat~g-g~lip~~--~~~~ii~-~~~~~s~i~~~~~~~---------~~~~~~~~ip~~~~-~~~a~--~v~E~--~~~~ 62 (298) ||++. ..+|.+| +...+++ ...+.+.+.+-+-.. .-++.-+++|.+.. .++.. +-+.. +..+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 99887 6555544 4555543 334555555432222 12344578997653 33322 21211 2333 Q ss_pred ccccc-eeeEEEeeeEEEEE--EeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHh---cccccccccccccccc Q lcl|Aclame:pro 63 HGGVT-LAPQTMVPIKVEYG--ARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAF---HGVNPRLGTASAVIGT 136 (298) Q Consensus 63 ~~~~~-~~~v~l~~~k~~~~--~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l---~G~~~~~g~~~~~~~~ 136 (298) ..+.+ ..++-...++-.++ -.++.++- . .+.++.|.+++++-..+...+.+| .|.=..... +. T Consensus 81 ~~kit~~~~~a~~~~r~kaw~~~Dla~~ls---G---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~-----~~ 149 (349) T protein:vir:94 81 PRAIQTGEMMARVAYLNEGFGQADLTVELT---S---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVS-----AT 149 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhh---C---chHHHHHHHHHHHHHhhHHHHHHHHHHHhhhccccc-----cc Confidence 34433 33443444333332 23444431 1 256778888888877766655444 342100000 00 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhc-----CCcccEEEEcHHHHHHHHHhhccCCceeecccccccCccee Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGV-----DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTI 211 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~-----~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l 211 (298) .+............+........+.++..++... ...-++++||+.++..|++++-=. | +++......-.++ T Consensus 150 ~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty 226 (349) T protein:vir:94 150 DAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATY 226 (349) T ss_pred ccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--h-ccCcccCccccee Confidence 0000000000011111112234555665555443 345678999999999998875200 0 0111122234689 Q ss_pred cceeeEecCcccccccccc---ceEEEeeccceEEEEeecc-eEEEEeecccccccchhhhhcCcEEEEEEEEEccEEec Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLTQR---DRAIIGDFANGFKWGYAKE-VPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILD 287 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~~~---~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~ 287 (298) +|++|++++.||....+.. .+-+|| .+++.++.... ..+++.++...... .++-.+..+.|+ +.+ T Consensus 227 ~G~~VivDD~~Pv~~~g~~~~yttylfg--~GAi~~~~~~~~~~~E~~rd~~~g~~------~G~d~L~~R~~~---~~h 295 (349) T protein:vir:94 227 QGYRVIVDDSMTVVGQDTSRKFISIIFG--QGAIGYGEGNPEMPLEYEREASRANG------GGVETLWTRKTW---LLH 295 (349) T ss_pred cCcEEEEeCCCccccCCCCceEEEEEee--cceEEeecCCCCcceeeecccccCCc------ceeEEEEEeeEE---Eee Confidence 9999999999996543322 234555 45666655542 23555554332110 122333344444 567 Q ss_pred ccceEEEeec-------------C Q lcl|Aclame:pro 288 ATKFARVTEA-------------N 298 (298) Q Consensus 288 ~~a~~~l~~a-------------~ 298 (298) |.++...+.. | T Consensus 296 p~G~s~~~a~v~~~~~~~~~~sPt 319 (349) T protein:vir:94 296 PFGYSFTSAVITGNGTETIARSAS 319 (349) T ss_pred eeeeeecccccCCCccccccCCCC Confidence 7777665422 1 No 174 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.32 E-value=8.6e-07 Score=53.83 Aligned_cols=273 Identities=14% Similarity=0.136 Sum_probs=130.0 Q ss_pred Ceecc-ccccchh--HHHHHHH-HHHhhchhhhhccee---------ecCCCceEEEEEeC-Cc--ceEEeec--ccccc Q lcl|Aclame:pro 1 MVLNK-GTLFDPE--LVTDLIS-KVAGKSSIARLSAQK---------PIPFNGEKVFTFTM-DS--EIDVVAE--SGKKT 62 (298) Q Consensus 1 mat~g-g~lip~~--~~~~ii~-~~~~~s~i~~~~~~~---------~~~~~~~~ip~~~~-~~--~a~~v~E--~~~~~ 62 (298) ||++. ..+|.+| +...+++ ...+.+.+.+-+-.. .-++.-+++|.+.. .+ +..+-.. .+..+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 99887 6665554 4555543 334445544422222 12344578998753 33 3222122 22333 Q ss_pred ccccc-eeeEEEeeeEEEEE--EeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHh---cccccccccccccccc Q lcl|Aclame:pro 63 HGGVT-LAPQTMVPIKVEYG--ARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAF---HGVNPRLGTASAVIGT 136 (298) Q Consensus 63 ~~~~~-~~~v~l~~~k~~~~--~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l---~G~~~~~g~~~~~~~~ 136 (298) ..+.+ ..++-...++-.++ -.++.++- . .+.++.|.+++++-..+...+.++ .|.=... ..+. T Consensus 81 ~~kitt~~~~a~~~~r~kaw~~~Dla~~ls---G---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~-----~~a~ 149 (349) T protein:vir:78 81 PRAIQTGEMMARVAYLNEGFGQADLTVELT---S---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDN-----VSAT 149 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhh---C---chHHHHHHHHHHHHHhhHHHHHHHHHHHHhhccc-----cccc Confidence 33333 44444444443332 23344431 1 256788888888877666554444 3421000 0000 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhc-----CCcccEEEEcHHHHHHHHHhhccCCceeecccccccCccee Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGV-----DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTI 211 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~-----~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l 211 (298) ..............+........+.++..++... ...-++++||+.++..|++.+-= .|+ ++......-.++ T Consensus 150 ~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i-~~s~~~~~i~ty 226 (349) T protein:vir:78 150 DAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFI-RDAENNTMFATY 226 (349) T ss_pred chhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--hhc-cCcccCccccee Confidence 0000000000000111112334555655555443 34567899999999999876420 000 111122234689 Q ss_pred cceeeEecCcccccccccc---ceEEEeeccceEEEEeecc-eEEEEeecccccccchhhhhcCcEEEEEEEEEccEEec Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLTQR---DRAIIGDFANGFKWGYAKE-VPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILD 287 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~~~---~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~ 287 (298) +|++|++++.||....+.. .+.+|| .+++.++.... ..+++.++..... ..++-.+..+.|+ +.+ T Consensus 227 ~G~~VivDD~~Pv~~~g~~~~yttylfg--~GAi~~~~~~~~~~~et~rd~~~g~------~~G~d~l~~R~~~---~~h 295 (349) T protein:vir:78 227 QGYRVIVDDSMTVVGQGAQRKFISIIFG--QGAIGYGEGNPVMPLEYEREASRAN------GGGVETLWTRKTW---LLH 295 (349) T ss_pred cCeEEEEeCCCccccCCCCceEEEEEee--cceEEEccCCCccceeeecccccCC------cceeEEEEEeeEE---Eee Confidence 9999999999996543322 234555 45665654432 2355555432211 1123334444444 566 Q ss_pred ccceEEEeec-------------C Q lcl|Aclame:pro 288 ATKFARVTEA-------------N 298 (298) Q Consensus 288 ~~a~~~l~~a-------------~ 298 (298) |.++...+.+ | T Consensus 296 p~G~s~~~a~v~~~~~~~~~~sPt 319 (349) T protein:vir:78 296 PFGYRFTSAVITGNGTETIARSAS 319 (349) T ss_pred eeeeeeccccccCCccccccCCCC Confidence 7766665321 1 No 175 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.31 E-value=1.1e-08 Score=64.17 Aligned_cols=257 Identities=12% Similarity=-0.015 Sum_probs=132.7 Q ss_pred CeeccccccchhH--------HHHHHHHHHhhchhhhhcceeecCCC-ceE---EEEEeCCcceEEeeccccccccccce Q lcl|Aclame:pro 1 MVLNKGTLFDPEL--------VTDLISKVAGKSSIARLSAQKPIPFN-GEK---VFTFTMDSEIDVVAESGKKTHGGVTL 68 (298) Q Consensus 1 mat~gg~lip~~~--------~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~---ip~~~~~~~a~~v~E~~~~~~~~~~~ 68 (298) |+...+-..++.+ .+++-.-+.+-.-++...|.+||..+ .++ +|..+....+.-|+||+.+|.++.+. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 7766665544443 23332223333334444588898755 344 44445668899999999999999997 Q ss_pred e---eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccc Q lcl|Aclame:pro 69 A---PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ 145 (298) Q Consensus 69 ~---~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 145 (298) + ..+++.+|.+..+ |.|.++.+- .-+-..+-.++|..+|+.++++.+|.-...++++... T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsG--yg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~------------- 143 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHG--YDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKR------------- 143 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhc--CCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccccc------------- Confidence 5 5788889888755 999874332 1234567888899999999999998653222221111 Q ss_pred ccccccccchhHHHHHHHhhhh-------hhcCCcccEEEEcHHHHHHHHHhhccCCc-eeecccccccCcceecceeeE Q lcl|Aclame:pro 146 KVEAPRGIADPNGAIENAVELL-------TGVDADVTGIAINPSFRSALAKQKDLQGN-ALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 146 ~~~~~~~~~~~~~~i~~~~~~l-------~~~~~~~~~~vm~~~~~~~L~~lkd~~G~-~l~~~~~~~~~~~~l~G~PV~ 217 (298) +.......+.|..++... .+++ ...+.+|||.....+++-..-+.+ ..|. ... - -.++|.-|+ T Consensus 144 ----t~~t~~s~~glq~Al~~~~~kl~~~~ed~-~~~V~FvNP~Daa~yl~~A~i~~~~t~fG-~n~--L-~nfLG~~II 214 (303) T protein:vir:10 144 ----TNKTKLSAENLQGALSKGRANLSVLLDDE-ITPIAFVNPNDTAEYLANGFINSTGAQFG-VNL--L-TPYVGVKIV 214 (303) T ss_pred ----ccceeecHHHHHHHHHhhhhhcccccccc-ccEEEEEchHHHHHHhhcCCcchhhhhhh-hhh--h-hhhhcceEE Confidence 001111234444544433 2222 234799999999998753322111 1120 000 0 138899999 Q ss_pred ecCccccccccccceEEEeeccc--eEEEEeecceEEEEeecccccccchhhhhcCcEEEE-----------EEEEEc-- Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFAN--GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIR-----------AELFLG-- 282 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r-----------~~~r~~-- 282 (298) +++.+|.+.. +..--.+ ......++++.-.+....+ ++|.|.+. -....+ T Consensus 215 ~S~kv~~G~~------~~T~~~Ni~~ay~~~~g~l~~~f~~t~D---------~tglIGv~h~~~~~~~t~eT~~~~~~~ 279 (303) T protein:vir:10 215 EFADVPQGEV------WMTVAENLNVAYANPRGELSRAFAFATD---------ATGFVGVLHDIQPQRLTSDTIYASAIS 279 (303) T ss_pred EeccCCCceE------EEeeccceEEEEecCchhhhhhhhhccc---------cccceEEEeccccceeeehhHhHhHHH Confidence 9999987532 1111111 1011112222111111111 11111110 000011 Q ss_pred cEEecccceEEEee-cC Q lcl|Aclame:pro 283 WGILDATKFARVTE-AN 298 (298) Q Consensus 283 ~~v~~~~a~~~l~~-a~ 298 (298) +-..+++++++.+- +. T Consensus 280 lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 280 MFPENIDAVIKVTIKKD 296 (303) T ss_pred hcccccceEEEEEEecc Confidence 11345566666654 22 No 176 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.27 E-value=1.7e-06 Score=52.28 Aligned_cols=262 Identities=11% Similarity=-0.007 Sum_probs=128.1 Q ss_pred Cee---ccccccchhHHHHHHHHHHhhchhhh--hcc--eeecCCCceEEEEEeCCcce-EEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVL---NKGTLFDPELVTDLISKVAGKSSIAR--LSA--QKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat---~gg~lip~~~~~~ii~~~~~~s~i~~--~~~--~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~~~~v~ 72 (298) .|- +-..++=.+....+++.+.....+-. .++ .....++.++||+.+..+-. +-.+.+-....-+.++...+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~t 98 (319) T protein:vir:94 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYF 98 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEEE Confidence 111 11222223334455665554444332 222 33455678999998763222 22223322333344444555 Q ss_pred EeeeEEEEE-EeecHHHhhcccccHH--HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYG-ARISDEFMYASDEEKI--NILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 73 l~~~k~~~~-~~iS~ell~~~~d~~~--~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) +.-.|.-.+ +.--+. .+... .....+.+...+.++-.+|...+.-.-...+. .... T Consensus 99 idqdR~~~F~VD~~D~-----~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----------------~~~~ 157 (319) T protein:vir:94 99 LDQEKYWGRFVDALDR-----KDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----------------HLTV 157 (319) T ss_pred eecccccccccchhhH-----hhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----------------cccc Confidence 555543332 111111 11111 12233445555556666676554331100000 0111 Q ss_pred ccccchhHHHHHHHhhhhhhcCCccc-EEEEcHHHHHHHHHhhccCCce-eecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQGNA-LFPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~-~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ..+....|+.|.++..++...+.... .++|+|.++..|.+-..-.... +.......+..++|.|++|+.++. ... T Consensus 158 ~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps---~~~ 234 (319) T protein:vir:94 158 GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPT---KLL 234 (319) T ss_pred ccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecc---ccc Confidence 22345579999999999988776533 4799999999886654211111 122333456678999999986432 211 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ....+++|.- .+.. ...+--.+++.+.. .+ +....++.+..+|..|.+|++-.+...++ T Consensus 235 -k~in~i~~h~-~A~~-~~~k~~~~~~~~p~-~~--------~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:94 235 -QGLQAIAVVG-EVLA-SPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred -ccceEEEEcC-Ceee-eeeeeeeeeccCCC-cc--------ccceeeeeeeeeeeEEeccccceEEEeec Confidence 2233555543 3332 22222223322110 10 12256788899999999999666665544 No 177 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.27 E-value=1.7e-06 Score=52.28 Aligned_cols=262 Identities=11% Similarity=-0.007 Sum_probs=128.1 Q ss_pred Cee---ccccccchhHHHHHHHHHHhhchhhh--hcc--eeecCCCceEEEEEeCCcce-EEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVL---NKGTLFDPELVTDLISKVAGKSSIAR--LSA--QKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat---~gg~lip~~~~~~ii~~~~~~s~i~~--~~~--~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~~~~v~ 72 (298) .|- +-..++=.+....+++.+.....+-. .++ .....++.++||+.+..+-. +-.+.+-....-+.++...+ T Consensus 19 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~t 98 (319) T protein:vir:97 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYF 98 (319) T ss_pred hhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCCCcccCCcccceeEEE Confidence 111 11222223334455665554444332 222 33455678999998763222 22223322333344444555 Q ss_pred EeeeEEEEE-EeecHHHhhcccccHH--HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYG-ARISDEFMYASDEEKI--NILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 73 l~~~k~~~~-~~iS~ell~~~~d~~~--~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) +.-.|.-.+ +.--+. .+... .....+.+...+.++-.+|...+.-.-...+. .... T Consensus 99 idqdR~~~F~VD~~D~-----~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----------------~~~~ 157 (319) T protein:vir:97 99 LDQEKYWGRFVDALDR-----KDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----------------HLTV 157 (319) T ss_pred eecccccccccchhhH-----hhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----------------cccc Confidence 555543332 111111 11111 12233445555556666676554331100000 0111 Q ss_pred ccccchhHHHHHHHhhhhhhcCCccc-EEEEcHHHHHHHHHhhccCCce-eecccccccCcceecceeeEecCccccccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQGNA-LFPELKWGATPDTINGLPVDVNKTVSDMSL 227 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~-~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~ 227 (298) ..+....|+.|.++..++...+.... .++|+|.++..|.+-..-.... +.......+..++|.|++|+.++. ... T Consensus 158 ~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps---~~~ 234 (319) T protein:vir:97 158 GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPT---KLL 234 (319) T ss_pred ccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecc---ccc Confidence 22345579999999999988776533 4799999999886654211111 122333456678999999986432 211 Q ss_pred cccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 228 TQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 228 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ....+++|.- .+.. ...+--.+++.+.. .+ +....++.+..+|..|.+|++-.+...++ T Consensus 235 -k~in~i~~h~-~A~~-~~~k~~~~~~~~p~-~~--------~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~ 293 (319) T protein:vir:97 235 -QGLQAIAVVG-EVLA-SPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) T ss_pred -ccceEEEEcC-Ceee-eeeeeeeeeccCCC-cc--------ccceeeeeeeeeeeEEeccccceEEEeec Confidence 2233555543 3332 22222223322110 10 12256788899999999999666665544 No 178 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=98.23 E-value=7.3e-08 Score=59.72 Aligned_cols=274 Identities=11% Similarity=0.016 Sum_probs=147.1 Q ss_pred CeeccccccchhHHH-----HHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVT-----DLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~-----~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) |.+.+...|| ++.. .+++.+..-....++..+...+.. ...+++....+.+.+++.+...|..+..-...+ T Consensus 42 ~~~~~~~~i~-~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~ 120 (336) T protein:vir:10 42 LSSTGSSGIP-NYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQ 120 (336) T ss_pred cccCCCchhH-HHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceee Confidence 3333333344 3333 344444444455566665554432 345666666788899999889998887777777 Q ss_pred EeeeEEEEEEeec-HHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc---c-ccc Q lcl|Aclame:pro 73 MVPIKVEYGARIS-DEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV---T-QKV 147 (298) Q Consensus 73 l~~~k~~~~~~iS-~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~---~-~~~ 147 (298) -..+.++....++ .|+-+ ......++.+.-+...++++.+.+++-.++|... . ...|+.+-++.. + .+. T Consensus 121 ~~v~~~~~g~~yg~~El~~-A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~----~~yGllN~P~l~a~~t~~t~ 194 (336) T protein:vir:10 121 RQSYFFQTWTRWGERELEM-AGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-L----ENYGLINDPSLSAPITATTP 194 (336) T ss_pred eeEEEEEeeeeeCHHHHHH-HHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-c----ceEEEEeCCCCccccccCCC Confidence 7788888888899 55543 3345567888888888888999999888888521 1 122222222221 1 111 Q ss_pred cc-ccccchhHHHHHHHhhhhhhcC------CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC Q lcl|Aclame:pro 148 EA-PRGIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK 220 (298) Q Consensus 148 ~~-~~~~~~~~~~i~~~~~~l~~~~------~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~ 220 (298) -+ ..+....++||..++..+...- ..+..++|.|..+..|.+ +++.|.-++.-.... +-++-++..+ T Consensus 195 ~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n-----~Pnl~i~t~p 268 (336) T protein:vir:10 195 WSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-----FPKLEFVTIP 268 (336) T ss_pred cccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccC-CCccCccHHHHHHHh-----cCccEEEEcc Confidence 11 1223457899999999887743 236789999999888864 344444332211111 1122233323 Q ss_pred ccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE-ecccceEEEeec Q lcl|Aclame:pro 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI-LDATKFARVTEA 297 (298) Q Consensus 221 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v-~~~~a~~~l~~a 297 (298) .+. ++++.....++-+.. .....++.+......-... ...-.....+..|.++.+ ++|.||++++|. T Consensus 269 El~-~a~G~~~~l~~~~~~------~~~t~~~~~p~~~~~l~vq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 269 EYD-TASGRLVQLWAPRVE------GKDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccc-cCCCceEEEEEEecC------CCcceeeecchhhhcccee---ecCceeEeccccceeeeeeeccchheeeecC Confidence 332 122211111211111 0011122221111000000 001113345666666655 579999999999 No 179 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=98.17 E-value=1e-07 Score=58.97 Aligned_cols=274 Identities=11% Similarity=0.024 Sum_probs=145.7 Q ss_pred CeeccccccchhHHH-----HHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVT-----DLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~-----~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) |++.+..-|| ++.. .+++.+..-....++..+...+.. ...+++....+.+.+++.....|..+..-...+ T Consensus 42 ~~~~~~~~~~-~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~ 120 (336) T protein:vir:36 42 LSSTGSSGIP-NYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQ 120 (336) T ss_pred cccCCCcchH-HHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceee Confidence 2222222234 3333 344444444455566665554432 345666666788899999889998887777777 Q ss_pred EeeeEEEEEEeec-HHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc---c-ccc Q lcl|Aclame:pro 73 MVPIKVEYGARIS-DEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV---T-QKV 147 (298) Q Consensus 73 l~~~k~~~~~~iS-~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~---~-~~~ 147 (298) -..+.++....++ .|+.++ .....++.+.-+...++++.+.+++-.++|... . ...|+.+-++.. + .+. T Consensus 121 ~~v~~~~~g~~yg~~E~~~A-a~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~----~~yGllNdP~l~a~~t~~t~ 194 (336) T protein:vir:36 121 RQSYFFQTWTRWGERELEMA-GAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-L----ENYGLINDPSLSAPITATTP 194 (336) T ss_pred eeEEEEEeeeeeCHHHHHHH-HHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-c----ceEEEEecCCCccccccCCC Confidence 7788888888898 565543 344567778888888888999999888888521 1 122222222221 1 111 Q ss_pred cc-ccccchhHHHHHHHhhhhhhcC------CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC Q lcl|Aclame:pro 148 EA-PRGIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK 220 (298) Q Consensus 148 ~~-~~~~~~~~~~i~~~~~~l~~~~------~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~ 220 (298) -+ ..+....++||..++.++...- ..+..++|.|..+..|.+ +++.|.-++.-.... +-++-++..+ T Consensus 195 ~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~-~n~~g~Tvl~~lk~n-----~Pnl~i~t~p 268 (336) T protein:vir:36 195 WSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-----FPKLEFVTIP 268 (336) T ss_pred cccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccC-CCccCccHHHHHHHh-----cCccEEEEcc Confidence 11 1223457899999999887743 236679999999888864 344444332211111 1122233323 Q ss_pred ccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE-ecccceEEEeec Q lcl|Aclame:pro 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI-LDATKFARVTEA 297 (298) Q Consensus 221 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v-~~~~a~~~l~~a 297 (298) .+. ++++.....++-+..+ ....++.+......-... ...-.....+..|.++.+ ++|.||++++|. T Consensus 269 El~-~a~g~~~~l~~~~~~~------~~t~~~~~p~~~~~l~vq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 269 EYD-TASGRLVQLWAPRVEG------KDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccc-cCCCceEEEEEEecCC------Ccceeeecchhhhcccee---ecCceeEeccccceeeeeeeccchheeeecC Confidence 332 1221111112111110 011122221111000000 001113345666666655 579999999999 No 180 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=98.16 E-value=1.3e-07 Score=58.26 Aligned_cols=275 Identities=10% Similarity=0.003 Sum_probs=148.4 Q ss_pred CeeccccccchhHHH-----HHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVT-----DLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~-----~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) |+|-...-|| ++.. ++++.+.......++..+..++.. .+.+++.+..+.+.+++.+...|..+..-...+ T Consensus 42 ~~t~~~~g~~-~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~ 120 (336) T protein:vir:78 42 LSSTGSSGIP-NYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQ 120 (336) T ss_pred cccCCCcchH-HHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEE Confidence 2222222233 3333 444555455555666665554432 356777777889999999999999998888888 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc---c-cccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV---T-QKVE 148 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~---~-~~~~ 148 (298) -+.+.++..+.++.+=++.......++.+.-+...++++.+.+++-.++|.. +. ...|+.+-++.. + .... T Consensus 121 ~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~-~~----~~~GllN~P~l~a~~t~~~~~ 195 (336) T protein:vir:78 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-GL----ENYGLINDPSLSAPITATTPW 195 (336) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEecc-cc----ceEEEEeCCCCCcccccCcCc Confidence 8999999999999543333334556788888888888889999988888842 11 222333322221 1 1111 Q ss_pred cc-cccchhHHHHHHHhhhhhhcCC------cccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCc Q lcl|Aclame:pro 149 AP-RGIADPNGAIENAVELLTGVDA------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT 221 (298) Q Consensus 149 ~~-~~~~~~~~~i~~~~~~l~~~~~------~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~ 221 (298) +. .+....++||..++.++...-. .+..++|.|..+..|.+ ++..|.-++.-.... +-++-++..+. T Consensus 196 w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n-----~Pnl~i~t~pe 269 (336) T protein:vir:78 196 SGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-----FPKLEFVTIPE 269 (336) T ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHh-----cCccEEEEccc Confidence 11 2334578899999888855431 24479999999999865 344443332111111 11223433333 Q ss_pred cccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE-ecccceEEEeec Q lcl|Aclame:pro 222 VSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI-LDATKFARVTEA 297 (298) Q Consensus 222 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v-~~~~a~~~l~~a 297 (298) +.+ +++.....+.-+.. ....+++.+......-... .........+..|.++.+ .+|.||+++.|. T Consensus 270 l~~-Agg~~~~~~~~~~~------~~~t~~~~~p~~f~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 270 YDT-ASGRLVQLWAPRVE------GKDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccc-cCcceEEEEEeecc------CCcceeeecchhhhcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 321 22211112211111 0011222222111100000 001223345566666655 579999999999 No 181 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.11 E-value=1.6e-06 Score=52.29 Aligned_cols=261 Identities=11% Similarity=0.089 Sum_probs=117.1 Q ss_pred Ceeccc---cccchhHHHHHHHHHHhhchhhhhcce--eecCC----Cce-EEEEE-eCCcce-EEeeccccccccccc- Q lcl|Aclame:pro 1 MVLNKG---TLFDPELVTDLISKVAGKSSIARLSAQ--KPIPF----NGE-KVFTF-TMDSEI-DVVAESGKKTHGGVT- 67 (298) Q Consensus 1 mat~gg---~lip~~~~~~ii~~~~~~s~i~~~~~~--~~~~~----~~~-~ip~~-~~~~~a-~~v~E~~~~~~~~~~- 67 (298) |||+=- .+..+.+..-.+|.+.+...+++.+.. +.+.. +++ +.|-. .++... .=+...+.....+.+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit~ 80 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIAA 80 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceeccc Confidence 887652 345677777788887766555443321 11110 111 11111 111100 111122222223322 Q ss_pred eeeEEEeeeEEE-EEEee--cHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc Q lcl|Aclame:pro 68 LAPQTMVPIKVE-YGARI--SDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT 144 (298) Q Consensus 68 ~~~v~l~~~k~~-~~~~i--S~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 144 (298) ..++.. |++ +.-++ +.+.+....++.......|..+++.++.+.+-...+.+.-..- .+.. T Consensus 81 ~~dvaV---k~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai------~~~t------- 144 (315) T protein:vir:96 81 DEMVSV---KVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAI------GSNA------- 144 (315) T ss_pred ccceeE---EEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh------cccc------- Confidence 122222 222 22223 3333332233334444556666666666655555554421100 0000 Q ss_pred cccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccc---cccCcceecceeeEecCc Q lcl|Aclame:pro 145 QKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELK---WGATPDTINGLPVDVNKT 221 (298) Q Consensus 145 ~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~---~~~~~~~l~G~PV~~s~~ 221 (298) ...............+.++..++.....+-+.|+||+.++..|.+ +.= -..++.... .+..++ .+|+||++++. T Consensus 145 ~~~~~~~~a~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~L-~~~~~~~~~~~~~~~~~~-~lGkrViVdD~ 221 (315) T protein:vir:96 145 GMNVSGELATEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EAI-DNKLYEEAGVVVYGGTPG-TLGKPVLVTDQ 221 (315) T ss_pred cccccccccccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hhh-hhhcccccceeEecCcCc-ccccEEEEECC Confidence 000111222334577889999997777888999999999999977 321 123332221 122244 45999999999 Q ss_pred cccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 222 VSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 222 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ||.. -+||--.+++.++..+.+.. .++.. ++. ++=.+.+|++.- -.++|.+|..-+.+. T Consensus 222 ~P~~-------~~~gl~~GAi~~~~~~~~~~--~~~~~-~g~-----e~l~~~~r~e~t---f~l~p~G~sw~~~~~ 280 (315) T protein:vir:96 222 CPAT-------KIFGLVAGAVMITESQAPGM--RSYQI-DDQ-----ENLAIGFRAEGT---ANVEVLGYKWKTKTN 280 (315) T ss_pred CCcc-------eeeeeecceeeecCCCcccc--ccccC-CCc-----ceeEEEEeeeeE---eeeeeeeEEeecCCC Confidence 9952 13332234554544433211 11110 000 111123333322 246677666632211 No 182 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.10 E-value=4.5e-06 Score=49.90 Aligned_cols=263 Identities=12% Similarity=0.023 Sum_probs=123.8 Q ss_pred Ceec---cccccchhHHHHHHHHH-Hhhchhh-hhcc--eeecCCCceEEEEEeCCcceEE-eeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLN---KGTLFDPELVTDLISKV-AGKSSIA-RLSA--QKPIPFNGEKVFTFTMDSEIDV-VAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~---gg~lip~~~~~~ii~~~-~~~s~i~-~~~~--~~~~~~~~~~ip~~~~~~~a~~-v~E~~~~~~~~~~~~~v~ 72 (298) .|.. -+.+.=.+....+++.. ...+.-. .+++ .....++.++||+....+-... .+.+-....-+.++...+ T Consensus 30 ~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~t 109 (329) T protein:vir:10 30 FANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNATNEFDHPQIQETTYF 109 (329) T ss_pred hcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCCCccccccccceeEEE Confidence 1111 11122223233344433 2222111 1122 3445667899999875322222 223322223344445555 Q ss_pred EeeeEEEEEEeecHHHhhcccccH--HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEK--INILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~--~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) +.-.|.-.+. | +.+ +..+.. ......+.+...+.++..+|...+.-.-...+. ..... T Consensus 110 idqdR~~~F~-V-D~~--D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~----------------~~~~~ 169 (329) T protein:vir:10 110 LDQEKYWGRF-V-DAL--DRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK----------------HLTVG 169 (329) T ss_pred eecccceeee-c-chh--hHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc----------------ccccc Confidence 5555533322 1 111 001111 122233445555666667776655321000000 01112 Q ss_pred cccchhHHHHHHHhhhhhhcCCccc-EEEEcHHHHHHHHHhhccCCce-eecccccccCcceecceeeEecCcccccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQGNA-LFPELKWGATPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~~-~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~ 228 (298) .+....|+.|.++..+|...+.... .++++|.++..|.+...-.... ........+..++|.|+||+.++.. .. T Consensus 170 ~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~---~~- 245 (329) T protein:vir:10 170 SGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSK---ML- 245 (329) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecCC---cc- Confidence 2345578999999999888765433 5789999999887632111111 1122234566789999999865332 11 Q ss_pred ccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ....++++.-+ ++.. ..+--.++..+.. .+ ++...++.+..+|..|.+|++..+...++ T Consensus 246 k~in~ii~~~~-A~~~-~~K~~~~~~~~p~-~~--------~~a~~v~gr~yyd~~V~~~k~~~I~~~~~ 304 (329) T protein:vir:10 246 QGVEAMAVIGE-VMAS-PIQANEAKLNSNV-PG--------MFGTLAEQMLYTGAFVPEHLQKYIFTIGG 304 (329) T ss_pred cceeEEEEcCC-ceee-eeeeeeeeeeCCC-Cc--------cchheeeeeeeeeeEEEccccCEEEEecc Confidence 12234555433 3322 2222223332210 10 12246788899999999999766665444 No 183 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.10 E-value=2.8e-06 Score=51.05 Aligned_cols=270 Identities=10% Similarity=0.091 Sum_probs=146.6 Q ss_pred Ceeccccccc---hhHHHHHHHHHHhh--chhhhhcceeecC-CCceEEEEEeCCcceEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFD---PELVTDLISKVAGK--SSIARLSAQKPIP-FNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip---~~~~~~ii~~~~~~--s~i~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) +..+++. +| ....++.+...-+. ..+++++++..++ ....+..+..+.++..-|.|+++++.....=+..++. T Consensus 361 ~~hsTsD-Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~ 439 (652) T protein:vir:79 361 FTHSTSD-FGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIA 439 (652) T ss_pred hhcCcch-HHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCccceee Confidence 2222332 22 22223322222222 2467777766654 2344555667788889999999998776665677889 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc--cccc-cccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV--NPRL-GTASAVIGTNHFDSKVTQKVEAPR 151 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~--~~~~-g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) ..+++.++.||+|++- .+..+....|-..++++.++.+++.++.-. |+.- +....+.....=.+..+ .. T Consensus 440 l~tyG~~~~iTRqaiI---NDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~----~a- 511 (652) T protein:vir:79 440 LATYGELFSITRQAII---NDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLE----SA- 511 (652) T ss_pred eecccCeeeeehheee---ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccc----cc- Confidence 9999999999999762 333677888889999998888876655321 2111 11112221000001111 01 Q ss_pred ccchhHHHHHHHhhhhhh-------cCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecce-eeEecCccc Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTG-------VDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGL-PVDVNKTVS 223 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~-------~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~-PV~~s~~~~ 223 (298) ....+.+..+..++.. -+..|..|+..|......+++..+...+ ......+..+-+.|+ .|++++.+. T Consensus 512 --a~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~--~a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 512 --AMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVK--GADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred --cCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCc--ccccccccccccccccccccccccC Confidence 1112223333222211 1234667888888777776665332111 111122223335553 566666664 Q ss_pred cccccccceEEEeecc--ceEEEEeecce---EEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEee Q lcl|Aclame:pro 224 DMSLTQRDRAIIGDFA--NGFKWGYAKEV---PLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE 296 (298) Q Consensus 224 ~~~~~~~~~~~~gd~~--~~~~~~~~~~~---~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~ 296 (298) ++. ...-++.+-. ..+.++.-++. .++..+ -|..+-+.||++..+|.+++|--+++|.+. T Consensus 588 ~~s---~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~----------gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 588 DNS---QTTFYLAASKGSDTIEVAYLNGVDTPYIDQME----------GFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCC---cccEEEecCCCCCeEEEEEecCCCCCeeeecC----------CCCcceEEEEEEEeccCceeeccceeeecC Confidence 432 1223333222 12333322222 222111 278888999999999999999999999988 No 184 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.01 E-value=5e-06 Score=49.67 Aligned_cols=273 Identities=10% Similarity=0.069 Sum_probs=144.6 Q ss_pred Cee--ccccccch---hHHHHHHHHH--HhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVL--NKGTLFDP---ELVTDLISKV--AGKSSIARLSAQKPIP-FNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat--~gg~lip~---~~~~~ii~~~--~~~s~i~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) +|. +++. +|- .+.+..+... .....+++.++...++ ....+..+..+-++..-|.|+++++-....=..-+ T Consensus 394 ~a~~htTSD-Fp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~ 472 (693) T protein:vir:95 394 LAFTHTSSD-FGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQ 472 (693) T ss_pred HHHhcCcch-hHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceeeeecCCccce Confidence 221 2221 221 1122222111 1123455666655544 23344445556677788999999876555545567 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc--ccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV--NPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~--~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) +...+++.++.||+|++- .+..+....|-..++++.++.+++.++.=. |+.-.-...+....+- +.. .+ T Consensus 473 ~~l~tyG~~~~iTRqaiI---NDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~-Nl~-----tg 543 (693) T protein:vir:95 473 IILATYGELFSITRQAII---NDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHS-NLL-----TG 543 (693) T ss_pred eehhhcCCeeeecHHhhh---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccc-ccc-----cc Confidence 788889999999999863 333678888889999998888886665321 1111111111111110 000 01 Q ss_pred cccchhHHHHHHHhhhhhh------------cCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecce-eeE Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTG------------VDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGL-PVD 217 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~------------~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~-PV~ 217 (298) .......+.+..+..++.. -+..|..|+..+......+++-.+.-.+- .....+.++-+.|+ .|+ T Consensus 544 a~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~--a~~~~~~~NP~~~~~~vi 621 (693) T protein:vir:95 544 AASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPG--ADVNSGIVNPIRAFAQVI 621 (693) T ss_pred cccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccc--cccccccccchhcccccc Confidence 1111223334333333221 12456678888888877777764432221 01122223335554 556 Q ss_pred ecCccccccccccceEEEeecc-ceEEEEeecce---EEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEE Q lcl|Aclame:pro 218 VNKTVSDMSLTQRDRAIIGDFA-NGFKWGYAKEV---PLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFAR 293 (298) Q Consensus 218 ~s~~~~~~~~~~~~~~~~gd~~-~~~~~~~~~~~---~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~ 293 (298) +.+.+.+..+ ....++.|-. ..+.++.-++. .++..+ -|..+-+.||++..+|.+++|--+++| T Consensus 622 ~~prL~~~s~--~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~----------gf~~dG~~~kvr~D~G~~~iD~Rg~~k 689 (693) T protein:vir:95 622 GEPRLDDASA--TAWYMAAKKGSDTIEVAYLDGVDTPYLEQQE----------GFTVDGVASKVRIDAGVAPLDFRGLQK 689 (693) T ss_pred ccceecCCCC--CceEEecCCCCCeEEEEEecCCCCCeEeecC----------CCCcceEEEEEEEeccCceeecccccc Confidence 6666643221 1233333321 12333333222 232222 278888999999999999999999999 Q ss_pred Eeec Q lcl|Aclame:pro 294 VTEA 297 (298) Q Consensus 294 l~~a 297 (298) =.|| T Consensus 690 n~GA 693 (693) T protein:vir:95 690 SNGA 693 (693) T ss_pred CCCC Confidence 9999 No 185 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.93 E-value=1.7e-06 Score=52.25 Aligned_cols=277 Identities=9% Similarity=0.005 Sum_probs=148.5 Q ss_pred eccccccchhH---HHHHHHHHHhhchhhhhcceeec---CCCceEEEEEeCCcceE--Eeecc-ccccccccceeeEEE Q lcl|Aclame:pro 3 LNKGTLFDPEL---VTDLISKVAGKSSIARLSAQKPI---PFNGEKVFTFTMDSEID--VVAES-GKKTHGGVTLAPQTM 73 (298) Q Consensus 3 t~gg~lip~~~---~~~ii~~~~~~s~i~~~~~~~~~---~~~~~~ip~~~~~~~a~--~v~E~-~~~~~~~~~~~~v~l 73 (298) -++..++-.|+ -++|.+...+.-..+++..+... .-..+.+...+..+.+. |.+.+ ..+|..+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 56665555453 34455543344444555554322 12245565566566666 87654 678988888888888 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc---c--- Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK---V--- 147 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~---~--- 147 (298) ..+..+....+|.+=|+.+-....++.+.=.+...+++...+++..|.|..+..| ..|+.+.++..... . T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g----~~GllN~p~v~~~~~~~~~a~ 156 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSR----LTGLLNNKSVEVYAIKGAAQN 156 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccc----eEEEEeCCCcceeeecCCccC Confidence 8888888777775534333333445666666777778899999999999532122 22333333322111 1 Q ss_pred -cc-ccccchhHHHHHHHhhhhhhcC---CcccEEEEcHHHHHHHHHhh-ccCCceeecccccccCcceecceeeEe--- Q lcl|Aclame:pro 148 -EA-PRGIADPNGAIENAVELLTGVD---ADVTGIAINPSFRSALAKQK-DLQGNALFPELKWGATPDTINGLPVDV--- 218 (298) Q Consensus 148 -~~-~~~~~~~~~~i~~~~~~l~~~~---~~~~~~vm~~~~~~~L~~lk-d~~G~~l~~~~~~~~~~~~l~G~PV~~--- 218 (298) .+ +.+....+++|.+++.++...- ..+..++|.|+.+..|.... ...|.-++.-..... +. ..|.|+-+ T Consensus 157 ~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~-~~-~~g~~l~I~~v 234 (304) T protein:vir:52 157 TKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHL-SA-AAGRQVAIKAL 234 (304) T ss_pred CccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhc-cc-ccCCcceEEEe Confidence 11 2233446788888888886543 24667999999999886532 333333321111111 11 12444322 Q ss_pred cCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEE--EEEEEEccE-EecccceEEEe Q lcl|Aclame:pro 219 NKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI--RAELFLGWG-ILDATKFARVT 295 (298) Q Consensus 219 s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~--r~~~r~~~~-v~~~~a~~~l~ 295 (298) .......+..+++.+++-+.+.-+ +...-.+.+++.+ ..++|...| =++.|+++. +++|.+++++- T Consensus 235 ~~~~~~~g~~g~~r~vvY~~d~~~-~~~~vP~p~~~l~----------~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D 303 (304) T protein:vir:52 235 PSNYGTRVTDGKTRAMVYVNSKEH-VIFDVPMSPTVLD----------AQPKGLLAFESGLRMAFGGVTFMEPDSALYVD 303 (304) T ss_pred cccccccCCCCceEEEEEecChhh-eEEecCccccccc----------hhhcCCceEEecceeeeeeEEEEccceeeeec Confidence 111222233345555555444322 1111122222222 123454332 356666664 55799999998 Q ss_pred e Q lcl|Aclame:pro 296 E 296 (298) Q Consensus 296 ~ 296 (298) . T Consensus 304 ~ 304 (304) T protein:vir:52 304 Y 304 (304) T ss_pred C Confidence 8 No 186 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=97.93 E-value=4.6e-07 Score=55.30 Aligned_cols=275 Identities=10% Similarity=0.009 Sum_probs=142.9 Q ss_pred CeeccccccchhHHHHHH-----HHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLI-----SKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii-----~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) |+|-...-| +++....| +.+.....+.++..+...+.. ...++.....+.+.+.+.....|..+..-.... T Consensus 42 ~~t~~~~g~-~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~ 120 (336) T protein:vir:10 42 LSSTGSSGI-PNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQ 120 (336) T ss_pred cccCCCcch-HHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeee Confidence 222221213 34444444 333333334444444433221 245566666788889998889998888777777 Q ss_pred EeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccc---c-cccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKV---T-QKVE 148 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~---~-~~~~ 148 (298) -+.+.++..+.++.+=+........++.+.-+...++++.+.++.-.+.|... . ...|..+-++.. + .... T Consensus 121 ~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~----~~~GllN~P~l~a~~t~~~~~ 195 (336) T protein:vir:10 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-L----ENYGLINDPSLSAPITATTPW 195 (336) T ss_pred eeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecc-c----ceEEEeecCCCCcccccCcCc Confidence 78888888899985433333345567778888888888888888888888421 1 122333322221 1 1111 Q ss_pred cc-cccchhHHHHHHHhhhhhhcCC------cccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCc Q lcl|Aclame:pro 149 AP-RGIADPNGAIENAVELLTGVDA------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT 221 (298) Q Consensus 149 ~~-~~~~~~~~~i~~~~~~l~~~~~------~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~ 221 (298) +. .+....++||..++.++...-. .+..++|.|..+..|.+ ++..|.-++.-.... +-++-++..+. T Consensus 196 w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~-~n~~g~tv~~~lk~n-----~Pnl~i~t~pe 269 (336) T protein:vir:10 196 SGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-----FPKLEFVTIPE 269 (336) T ss_pred ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccC-CCccCccHHHHHHHh-----CCccEEEEccc Confidence 11 2334578899999988855431 24479999999999865 344443332111111 11233443333 Q ss_pred cccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE-ecccceEEEeec Q lcl|Aclame:pro 222 VSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI-LDATKFARVTEA 297 (298) Q Consensus 222 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v-~~~~a~~~l~~a 297 (298) +.+ +++.....+.-+.. ....+++.+......-.-. .........+..|.++.+ .+|.||+++.|. T Consensus 270 l~~-Agg~~~~~~~~~~~------~~~t~~~~~P~~f~~lpvq---~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 270 YDT-ASGRLVQLWAPRVE------GKDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ccc-cCCceEEEEEeccc------CCcceeeecChhhhcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 321 22211112211111 0011222221111100000 001223345566666655 479999999999 No 187 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.90 E-value=1.1e-05 Score=47.88 Aligned_cols=264 Identities=9% Similarity=0.029 Sum_probs=121.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee----ec-C--CCceEEEEEeCCcceE-E-eeccccccccccceeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PI-P--FNGEKVFTFTMDSEID-V-VAESGKKTHGGVTLAPQ 71 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~-~--~~~~~ip~~~~~~~a~-~-v~E~~~~~~~~~~~~~v 71 (298) ||.+=...||+.+..++++.++++.++.+++.+- .. . +..++||+... ..+. . .+.+..+...+..-.++ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~-~~v~d~~~~~~~~~~~~~~~e~~v 79 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQ-FKSERTETGDITGKDKNGLFSAKA 79 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCc-ceeecccCcCCCCcccccccccee Confidence 9977667789999999999999999988887652 11 1 33578887542 2222 2 11222333333443444 Q ss_pred --EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc Q lcl|Aclame:pro 72 --TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 72 --~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) .+.-+|.. -+.++.+=+.+. ..++.+++... +++++..+|..++...- .+. ....| . T Consensus 80 ~l~id~~k~~-a~~v~d~e~~l~---i~~~~~~l~~a-~~ala~~vd~~l~~~l~--~~a-~~~vg-------------t 138 (423) T protein:vir:35 80 TGKVGKYITV-AVEWTQIEEALK---LNQLDQILSPI-HERMVTDLETELAHFMM--NNG-ALSLG-------------S 138 (423) T ss_pred eEEeccceec-cceeCHHHHHhh---HHHHHHHHHHH-HHHHHHHHHHHHHHHHh--hcc-ccccc-------------c Confidence 44444433 345555422111 23455555544 57788888888864210 000 00000 0 Q ss_pred ccccchhHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhc--cCCceeeccccccc-CcceecceeeEecCcccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD--LQGNALFPELKWGA-TPDTINGLPVDVNKTVSD 224 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd--~~G~~l~~~~~~~~-~~~~l~G~PV~~s~~~~~ 224 (298) .......++++.++-.+|...+.... ..+++|.+...|.+-.. ....-.-......+ ..+++.|+.|+.|+++|. T Consensus 139 ~~t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~ 218 (423) T protein:vir:35 139 PNTAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLAS 218 (423) T ss_pred ccCCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCcc Confidence 11112347889998888877776532 46999999887753210 11110111112223 347999999999999996 Q ss_pred ccccc-cceEEEeeccc--eEEEEeecc----eEEEE-eecccccccchhhhhcCcEEEEEEEEEccEEecc-------- Q lcl|Aclame:pro 225 MSLTQ-RDRAIIGDFAN--GFKWGYAKE----VPLEV-IQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDA-------- 288 (298) Q Consensus 225 ~~~~~-~~~~~~gd~~~--~~~~~~~~~----~~i~~-~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~-------- 288 (298) ..... .....++--.. .......+. +.... ..+.. +-.-+.+.| .|...+++ T Consensus 219 ~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~-------l~~GD~~t~-----aGv~~v~~~t~~~~~~ 286 (423) T protein:vir:35 219 RKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGF-------LKAGDQLKF-----TSTHWLNQQSKQTLYN 286 (423) T ss_pred ccccccccceeeccccccccccccccccceeeeeeeeeccCCc-------EEecceEEe-----eeeeeccccccceeec Confidence 43322 11112110000 000000000 00000 00100 000111111 12111111 Q ss_pred ------cceEEEee-------cC Q lcl|Aclame:pro 289 ------TKFARVTE-------AN 298 (298) Q Consensus 289 ------~a~~~l~~-------a~ 298 (298) .=|+++.. ++ T Consensus 287 ~~t~~~~~~~V~~~~~~~a~g~~ 309 (423) T protein:vir:35 287 GSTAMSFTATVLEETNSTASGDV 309 (423) T ss_pred ccCCceeEEEEeccccccccCce Confidence 11111111 11 No 188 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=97.80 E-value=5.2e-06 Score=49.57 Aligned_cols=283 Identities=9% Similarity=0.029 Sum_probs=157.2 Q ss_pred CeeccccccchhHHHHHHHH-HHhhchhhhhcc-eeecCCC-ceEEEEEeCCcceEEeeccccccccccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISK-VAGKSSIARLSA-QKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~-~~~~s~i~~~~~-~~~~~~~-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k 77 (298) +..++-.+|-.|+..+.|+. +.+.-.--.+.+ +.-.+++ .+.||.. +.+...-..|..+..-.....++|++.... T Consensus 3 ~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~i~~ 81 (313) T protein:vir:95 3 LTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQITE 81 (313) T ss_pred ccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEEEEe Confidence 67777888887776665554 444432223444 3444444 4677663 345556667777777788888999999998 Q ss_pred EEEE-EeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 78 VEYG-ARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFH-GVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 78 ~~~~-~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~-G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +++- ..||++|-+++ ...-++.+++.-+-+|+|...++..+|. |.....+.+ +...+.+.....+...+...- T Consensus 82 Y~G~A~~vt~~LR~D~-~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~----~P~~vNG~PH~~V~~~T~~~~ 156 (313) T protein:vir:95 82 YKGDAWYVTDDLREDG-TDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANP----GPHNVNGFPHVIVSAETNGVF 156 (313) T ss_pred ecCChhhhhhhhhhcc-hhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCC----CCcccccccceEEeccCCcee Confidence 8775 48999976543 3344577777777788888888877774 322112211 222333333334444444444 Q ss_pred hHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhh------ccCCceeecccccccC--cceecceeeEecCcccc- Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQK------DLQGNALFPELKWGAT--PDTINGLPVDVNKTVSD- 224 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lk------d~~G~~l~~~~~~~~~--~~~l~G~PV~~s~~~~~- 224 (298) ...++..+-..+..++... -.++..|.....|..+. ..+|+-|.......+. ...+.|..+.+|+.+.. T Consensus 157 ~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 157 ALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 4556666555555555543 35899999999998885 2456666543322111 23577888888876642 Q ss_pred --ccccccceEEEeeccceEE-------EEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc-ceEEE Q lcl|Aclame:pro 225 --MSLTQRDRAIIGDFANGFK-------WGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT-KFARV 294 (298) Q Consensus 225 --~~~~~~~~~~~gd~~~~~~-------~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~-a~~~l 294 (298) +.+....-..+|++=..+. .+-++.|. .+.+.-.++=.++..+.| +|+|++++|-+ =++++ T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP-------~s~~~~~~~~~~~~~~~~--~R~G~Gi~R~~~L~~~~ 307 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMP-------KSEGERNKDRARDEHVVR--CRYGFGIQRLDTLGLLA 307 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeecccc-------ccccccccccccccceee--eeecccceeecceeEEE Confidence 1112222334444321110 11111111 000000001112333444 68888888755 45667 Q ss_pred eecC Q lcl|Aclame:pro 295 TEAN 298 (298) Q Consensus 295 ~~a~ 298 (298) ..|| T Consensus 308 ~~A~ 311 (313) T protein:vir:95 308 TSAT 311 (313) T ss_pred eccc Confidence 8888 No 189 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.67 E-value=3.1e-05 Score=45.31 Aligned_cols=275 Identities=7% Similarity=-0.020 Sum_probs=123.2 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhccee----ec---CCCceEEEEEeCCcceEEe-eccccccccccceee-- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PI---PFNGEKVFTFTMDSEIDVV-AESGKKTHGGVTLAP-- 70 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~---~~~~~~ip~~~~~~~a~~v-~E~~~~~~~~~~~~~-- 70 (298) ||.+=-..+|+.+..++++.+++..++.+++.+- .. .+..++||+.......... .++..+...+.+-++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 9977677789999999999999999988887652 21 1345777764432222222 122223333444444 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) +.+.-+|..++-.=..|+.+ .. .++.+.+++. .++++..+|..++.-. .+......+ .. T Consensus 81 l~id~~k~va~~v~d~E~~~-~i---~~~~~~l~~A-~~aLA~~vd~~ia~~~---~~~~~~~~g-------------t~ 139 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLEEAI-KL---NQLEEILAPV-RQRIVTDLETELAHFM---MNNGALSLG-------------SP 139 (423) T ss_pred EEeeceeeeeeeechHHHhc-Ch---hhHHHHHHHH-HHHHHHHHHHHHHHHH---hhccccccc-------------cC Confidence 45555554443333444332 22 3455545444 6889999998887431 000001000 01 Q ss_pred cccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhc--cCCceeeccccccc-CcceecceeeEecCccccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD--LQGNALFPELKWGA-TPDTINGLPVDVNKTVSDM 225 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd--~~G~~l~~~~~~~~-~~~~l~G~PV~~s~~~~~~ 225 (298) ......++++.++-.+|...+... -..+++|.....|.+-.. ......-......+ ..+++.|+.|+.++++|.. T Consensus 140 ~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~ 219 (423) T protein:vir:10 140 NTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASR 219 (423) T ss_pred CcccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccc Confidence 111234778888888887776653 357999998887754221 11111111122223 3478999999999999964 Q ss_pred ccccc-ceEEE--eeccceEEEEeecceEEEEeeccccc-ccc--hhhhhcCcEEEEEEEEEc------cEEecccceEE Q lcl|Aclame:pro 226 SLTQR-DRAII--GDFANGFKWGYAKEVPLEVIQYGDPD-NSG--LDLKGYNQVYIRAELFLG------WGILDATKFAR 293 (298) Q Consensus 226 ~~~~~-~~~~~--gd~~~~~~~~~~~~~~i~~~~~~~~~-~~~--~~~f~~n~v~~r~~~r~~------~~v~~~~a~~~ 293 (298) ..... .+... +-.-.+-......+..+.+......+ ... .+.|..+-+ .+.-+.. +.-.++.-|.+ T Consensus 220 T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv--~~v~~~tk~~~~~~~t~~~~~~~v 297 (423) T protein:vir:10 220 TQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNT--YWLQQQTKQALYNGATPISFTATV 297 (423) T ss_pred cccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecce--eeecccccccccccccCcceEEEE Confidence 32211 11111 00000000000011111111000000 000 000110000 0000000 01111122222 Q ss_pred EeecC Q lcl|Aclame:pro 294 VTEAN 298 (298) Q Consensus 294 l~~a~ 298 (298) +..++ T Consensus 298 ~a~~~ 302 (423) T protein:vir:10 298 TADAN 302 (423) T ss_pred Eeeee Confidence 22111 No 190 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=97.67 E-value=3.5e-06 Score=50.50 Aligned_cols=279 Identities=9% Similarity=-0.016 Sum_probs=138.2 Q ss_pred CeeccccccchhHHH----HHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVT----DLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~~~~----~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l 73 (298) .++.++.=||-++.+ .|++.+..-....++..+...+.. .+.+++.+..+.+.+++.++..|..+..-...+- T Consensus 74 ~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r 153 (388) T protein:vir:99 74 PTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERR 153 (388) T ss_pred ccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeee Confidence 233333225655543 444544444445556665554332 3466666777889999988888877666555555 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc---c--cc- Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT---Q--KV- 147 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~---~--~~- 147 (298) ..+.++....++.+=++.......++.+.-+...++++.+.+++-.|+|...... ....|+.+-++... . .. T Consensus 154 ~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~--~~~yGllNdP~l~a~v~at~~~~ 231 (388) T protein:vir:99 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNG--NRTFGFLNDPSLLPAIASTTPGG 231 (388) T ss_pred eEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCc--cceEEEeeCCCcccccccccCCc Confidence 5566666666765433333344567778888888888999999999999542211 11122222221111 0 01 Q ss_pred --cc-ccccchhHHHHHHHhhhhhhcCC---cc----cEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 148 --EA-PRGIADPNGAIENAVELLTGVDA---DV----TGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 148 --~~-~~~~~~~~~~i~~~~~~l~~~~~---~~----~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) .+ ..+....++||..++.++...-. .+ ..++|.|+.+..|.+- +..|.-++.-.... +.++-++ T Consensus 232 ~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n-----~Pnl~i~ 305 (388) T protein:vir:99 232 WVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT-----YPRVRVM 305 (388) T ss_pred CcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-CcCCccHHHHHHHh-----cCCcEEE Confidence 11 12334568889988888755432 12 2588999999888643 33343332111111 1122233 Q ss_pred ecCccc-cccccccceEEE-ee-ccceE--------EEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEEEcc-E Q lcl|Aclame:pro 218 VNKTVS-DMSLTQRDRAII-GD-FANGF--------KWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELFLGW-G 284 (298) Q Consensus 218 ~s~~~~-~~~~~~~~~~~~-gd-~~~~~--------~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r~~~-~ 284 (298) ....+. .+..++...+++ .+ +.... .+...-.+.+...+ ...++. ....+..|.++ . T Consensus 306 t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~----------vq~~~~~~~~~~~~rt~Gv~ 375 (388) T protein:vir:99 306 SAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLG----------VEKRVKNYVEAYSNATAGVM 375 (388) T ss_pred EecccccccccCCceeEEEEecccccccccCccCcceeEEeccccccccc----------ceecCceeEeccccceeeeE Confidence 222221 111112222221 11 00000 00000011111100 001111 22234445444 5 Q ss_pred EecccceEEEeec Q lcl|Aclame:pro 285 ILDATKFARVTEA 297 (298) Q Consensus 285 v~~~~a~~~l~~a 297 (298) +++|.||+++.|. T Consensus 376 ir~P~Ai~~~~GI 388 (388) T protein:vir:99 376 LKRPWAVVRLIGL 388 (388) T ss_pred EeccchhheeccC Confidence 5689999999999 No 191 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.66 E-value=3.2e-05 Score=45.24 Aligned_cols=269 Identities=8% Similarity=-0.007 Sum_probs=123.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceee-----c--CCCceEEEEEeCCcce-EEe-ecccccccccccee-- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKP-----I--PFNGEKVFTFTMDSEI-DVV-AESGKKTHGGVTLA-- 69 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~-----~--~~~~~~ip~~~~~~~a-~~v-~E~~~~~~~~~~~~-- 69 (298) ||.+=-..+|+.+..+.++.++++.++.+++.+.. . .+..++||+... ..+ ... ..+..+...+..-+ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~-~~~~~~~~~~~~~~~~~~l~e~~v 79 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQ-FSSLRTPTGDISGQNKNNLISGKA 79 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCc-ceeecccCcccCCcccCcccccee Confidence 99776777899999999999999999888776532 1 133577876332 222 111 12222333344333 Q ss_pred eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccc Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) .+.+.-+|..++-.=..|... +..++.+++++. .++++..+|..++.-. .+......+ . T Consensus 80 ~l~id~~k~va~~v~d~E~~~----~i~~~~~~l~~A-~~aLA~~vd~~ia~~~---~~~a~~~~g-------------t 138 (423) T protein:vir:17 80 TGRVGNYITVAVEYQQLEEAI----KLNQLEEILAPV-RQRIVTDLETELAHFM---MNNGALSLG-------------S 138 (423) T ss_pred EEEeeceeeeeeeecHHHHhc----ChhHHHHHHHHH-HHHHHHHHHHHHHHHH---hhccccccc-------------c Confidence 355555554443333344321 223454544444 6889999998876431 000000000 0 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhc--cCCceeeccccccc-CcceecceeeEecCcccc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD--LQGNALFPELKWGA-TPDTINGLPVDVNKTVSD 224 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd--~~G~~l~~~~~~~~-~~~~l~G~PV~~s~~~~~ 224 (298) .......++++.++-.+|...+... -..+++|.....|.+-.. ......-......+ ..+++.|+.|+.++++|. T Consensus 139 ~~t~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~ 218 (423) T protein:vir:17 139 PNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLAS 218 (423) T ss_pred CCcccccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCcc Confidence 0111134788888888887777653 357999999887754221 11011111112223 347899999999999996 Q ss_pred ccccccc-eEEE--eec-cceEEEEe-ecceEE--EEee-cccccccchhhhhcCcEEEE---EEEEE------ccEEec Q lcl|Aclame:pro 225 MSLTQRD-RAII--GDF-ANGFKWGY-AKEVPL--EVIQ-YGDPDNSGLDLKGYNQVYIR---AELFL------GWGILD 287 (298) Q Consensus 225 ~~~~~~~-~~~~--gd~-~~~~~~~~-~~~~~i--~~~~-~~~~~~~~~~~f~~n~v~~r---~~~r~------~~~v~~ 287 (298) ....... ++.. +.. ..+...+. ...+.+ .+.. +.... .-+.+.|- +..+. ++...+ T Consensus 219 ~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~-------~GD~~t~aGv~~v~~~tk~v~~~~~t~~ 291 (423) T protein:vir:17 219 RTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLK-------AGDQVKFTNTYWLQQQTKQALYNGATPI 291 (423) T ss_pred ccccceeceeeecccccccccccccccceeeeeeeeeeeccCcee-------ecceEEecceeeeccccccccccccccc Confidence 4332211 1111 100 00000000 000111 1100 10000 00111110 00011 111122 Q ss_pred ccceEEEeecC Q lcl|Aclame:pro 288 ATKFARVTEAN 298 (298) Q Consensus 288 ~~a~~~l~~a~ 298 (298) +.-|.+...++ T Consensus 292 ~~~~~v~~~~~ 302 (423) T protein:vir:17 292 SFTATVTADAN 302 (423) T ss_pred ceEEEEEeccc Confidence 33333332222 No 192 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=97.59 E-value=8.6e-06 Score=48.36 Aligned_cols=278 Identities=11% Similarity=-0.032 Sum_probs=137.2 Q ss_pred CeeccccccchhHH----HHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcceEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELV----TDLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~~~----~~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l 73 (298) .+|.++.=||.++. +.+++-+.+-...+++..+...+.. .+.+++.+..+.+.+++.++..|..+..-...+- T Consensus 70 ~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r 149 (382) T protein:vir:96 70 PVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERR 149 (382) T ss_pred ccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEE Confidence 44444444675554 4555655555556666666554432 3477777778899999988888876655444444 Q ss_pred eeeEEEEEEeec-HHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccc----cccc Q lcl|Aclame:pro 74 VPIKVEYGARIS-DEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT----QKVE 148 (298) Q Consensus 74 ~~~k~~~~~~iS-~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~----~~~~ 148 (298) ..+..+....++ .|+.++. ....++.+.-+...++++.+.+++-.|+|..++. .....|+.+-++... .... T Consensus 150 ~v~~~~~g~~yg~lE~~rAa-~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~--~~~~yGllNdP~l~a~~t~a~~~ 226 (382) T protein:vir:96 150 TIVRGELGLLVGTLEEGRAS-AIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGL--GNRTYGFLNDPNLPPFQTPPSQG 226 (382) T ss_pred EEEEEEEeeeecHHHHHHHH-hhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCc--CcceEEEEeCCCcccccccCCCC Confidence 445555555554 5655533 2346677777788888889999999999964321 112223333222211 1111 Q ss_pred c-ccccchhHHHHHHHhhhhhhcCC---c----ccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecC Q lcl|Aclame:pro 149 A-PRGIADPNGAIENAVELLTGVDA---D----VTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNK 220 (298) Q Consensus 149 ~-~~~~~~~~~~i~~~~~~l~~~~~---~----~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~ 220 (298) + ..+....++||..++.++...-. . +..++|.|+.+..|.+- ++.|.-++.-.... +-++-++..+ T Consensus 227 Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n-----~Pnl~i~t~p 300 (382) T protein:vir:96 227 WATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDWIEQT-----YPKMRIVSAP 300 (382) T ss_pred cccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHHHHHh-----cCCcEEEEcc Confidence 1 23344567888888888855432 1 23578999888888542 33333332111111 1122232222 Q ss_pred cccc-ccc--cccceEE-Eee-ccc--------eEEEEeecceEEEEeecccccccchhhhhcCc-EEEEEEEE-EccEE Q lcl|Aclame:pro 221 TVSD-MSL--TQRDRAI-IGD-FAN--------GFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQ-VYIRAELF-LGWGI 285 (298) Q Consensus 221 ~~~~-~~~--~~~~~~~-~gd-~~~--------~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~-v~~r~~~r-~~~~v 285 (298) .+.. +.+ ++.+..+ +.+ ... ...+..+-...+.+.. ...+.. ....+..| .|..+ T Consensus 301 eL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~----------ve~~~~~~~~~~s~~t~Gv~i 370 (382) T protein:vir:96 301 ELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLG----------VEKRAKSYVEDFSNGTAGALC 370 (382) T ss_pred ccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeecc----------ceeecceeEeccccceeeeEE Confidence 2211 111 1111111 111 000 0000000000000000 000111 11112223 45566 Q ss_pred ecccceEEEeec Q lcl|Aclame:pro 286 LDATKFARVTEA 297 (298) Q Consensus 286 ~~~~a~~~l~~a 297 (298) ++|.||+++.|. T Consensus 371 ~~P~ai~~~~GI 382 (382) T protein:vir:96 371 KRPWAVVRYLGI 382 (382) T ss_pred EcchhhhhccCC Confidence 789999999999 No 193 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=97.48 E-value=4.4e-05 Score=44.46 Aligned_cols=281 Identities=9% Similarity=0.039 Sum_probs=156.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeec--c-ccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAE--S-GKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E--~-~~~~~~~~~~~~v~l~~~ 76 (298) -..+...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.- + +..|.....++.-....+ T Consensus 25 ~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~ 104 (355) T protein:vir:18 25 DDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESNKYECN 104 (355) T ss_pred hHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeeccccCCCCCcccccccccCCCccEEE Confidence 1223345577888899999999999999999999987422 23333333444443321 1 223333344566667777 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc-c--------------c Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN-H--------------F 139 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~-~--------------~ 139 (298) +.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..-+ ...+.+.. + + T Consensus 105 qtn~dt~i~y~~LD~WA-~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV 183 (355) T protein:vir:18 105 QINFDFHLTYKRLDLWA-RFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARV 183 (355) T ss_pred EeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhh Confidence 77777889999885554 44678888888888887776666677785321111 11111100 0 0 Q ss_pred -ccc-------ccccc--ccccccchhHHHHHHHhhhh-hhcCCcc-c-EEEEcHHHHH-HHHHhhccCCceeecccc-c Q lcl|Aclame:pro 140 -DSK-------VTQKV--EAPRGIADPNGAIENAVELL-TGVDADV-T-GIAINPSFRS-ALAKQKDLQGNALFPELK-W 204 (298) Q Consensus 140 -~~~-------~~~~~--~~~~~~~~~~~~i~~~~~~l-~~~~~~~-~-~~vm~~~~~~-~L~~lkd~~G~~l~~~~~-~ 204 (298) ... ++... ...+.-....+...+++..+ ...+.+. . +.+|.....+ +--+|-...+.|-=.-.. . T Consensus 184 ~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~ 263 (355) T protein:vir:18 184 MSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNKQQENTESLAADI 263 (355) T ss_pred hccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhccCChHHHHHHHH Confidence 000 00000 01112223344455666543 4443332 2 5677766544 333333333333211000 0 Q ss_pred ccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccE Q lcl|Aclame:pro 205 GATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWG 284 (298) Q Consensus 205 ~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~ 284 (298) -....++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| -|.. T Consensus 264 i~s~k~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~Yv 327 (355) T protein:vir:18 264 IISQKRIGNLPAVRVPYFPAN------AVFVTTLENLSIYFMDESHRRSIDENPKKDR-VENYESMN---------IDYV 327 (355) T ss_pred HHHHHhhCCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---------ceee Confidence 111358999999999999976 4778888888777777666544433222111 11223333 4556 Q ss_pred EecccceEEEeecC Q lcl|Aclame:pro 285 ILDATKFARVTEAN 298 (298) Q Consensus 285 v~~~~a~~~l~~a~ 298 (298) |.++.++|.+++.+ T Consensus 328 VEd~~~~a~ieni~ 341 (355) T protein:vir:18 328 VEAYAAGCLLENIT 341 (355) T ss_pred eeccccEEEEeeee Confidence 78888888888777 No 194 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.37 E-value=8.3e-05 Score=42.96 Aligned_cols=268 Identities=8% Similarity=-0.048 Sum_probs=119.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceee----c---CCCceEEEEEeCCc---ceEEeeccccccccccce-- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKP----I---PFNGEKVFTFTMDS---EIDVVAESGKKTHGGVTL-- 68 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~----~---~~~~~~ip~~~~~~---~a~~v~E~~~~~~~~~~~-- 68 (298) ||.+=..|+|+-++.++++.+++..++.+++.+-. . .+..++||+..... ...+-..+. ...+..= T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~--~~~~l~e~~ 78 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGK--SKNSLISAK 78 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcc--cccccccce Confidence 99888889999999999999999999988876532 1 13457777643211 111111111 1112222 Q ss_pred eeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccc Q lcl|Aclame:pro 69 APQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 69 ~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) -++++.-+|...+-.=+.|+. ... .++.+++ +.-.++++..+|..+..... .......+. T Consensus 79 v~l~id~~k~~a~~v~d~E~~-l~i---~~~~~~l-~~A~~aLA~~vd~~ia~~~~---~~~~~~vgt------------ 138 (423) T protein:vir:10 79 ATGEVGNYITVAVEYRQIEEA-LKL---NQLDQIL-VPINERMVTDLETELALFMM---KHGALSLGS------------ 138 (423) T ss_pred EEEEecceeeeeeeeChHHHh-cCh---hHHHHHH-HHHHHHHHHHHHHHHHHHhh---hcccccccc------------ Confidence 234455555444333344433 222 3455544 44477889999888853210 000011100 Q ss_pred cccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHH-h---hccCCceeeccccccc-CcceecceeeEecCc Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAK-Q---KDLQGNALFPELKWGA-TPDTINGLPVDVNKT 221 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~-l---kd~~G~~l~~~~~~~~-~~~~l~G~PV~~s~~ 221 (298) .......++++.++-..|...+... -..+++|.....|.+ + ...++. -......+ ..+++.|+.++.|++ T Consensus 139 -~~t~~~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~~~i~G~~~GFdi~~Sn~ 215 (423) T protein:vir:10 139 -PNTPIKKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL--VRTAWENAQISGNFGGIRALMSNG 215 (423) T ss_pred -cccccccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc--chHHHHhcccceeecceEEEEecC Confidence 0111123678888877777766543 357999998888753 2 211111 01112223 347999999999999 Q ss_pred cccccccccc-eEEEeeccceEEE-----EeecceEEEEeecccccccchhhhhcCcEEEE---EEEEEccE------Ee Q lcl|Aclame:pro 222 VSDMSLTQRD-RAIIGDFANGFKW-----GYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIR---AELFLGWG------IL 286 (298) Q Consensus 222 ~~~~~~~~~~-~~~~gd~~~~~~~-----~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r---~~~r~~~~------v~ 286 (298) +|....+... .... +..+.+ ..-......... .+. ....-+..-+.+.|- +.-+.... -. T Consensus 216 vp~~T~g~~~ga~~~---~~~~~vt~a~~~~~~~~~~~~~~-~T~-s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~ 290 (423) T protein:vir:10 216 LASRTQGAFGGKLTV---KGTPEVNYDSVKDSYAFTATLTG-ATA-SKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASA 290 (423) T ss_pred Ccccccccccceeee---eeeeEEEecccccccccccceee-ccc-eeceeEEecceEeecceeeecccccceeecccCC Confidence 9853222111 0000 000000 000000000000 000 000000000111110 00011111 11 Q ss_pred cccceEEEeecC Q lcl|Aclame:pro 287 DATKFARVTEAN 298 (298) Q Consensus 287 ~~~a~~~l~~a~ 298 (298) ++.-|++...++ T Consensus 291 ~~~~~~V~~~~~ 302 (423) T protein:vir:10 291 LSFTATVMEDAN 302 (423) T ss_pred cceEEEEEeccc Confidence 222233322221 No 195 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=97.36 E-value=8.5e-05 Score=42.90 Aligned_cols=281 Identities=9% Similarity=-0.005 Sum_probs=155.2 Q ss_pred Ceecc-------ccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--ccccccccccceee Q lcl|Aclame:pro 1 MVLNK-------GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat~g-------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~~~~~~~~~~~ 70 (298) +|... ..-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. .+...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~ 95 (337) T protein:vir:10 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCC Confidence 22222 23366788888999999999999999999987422 2333333344443332 22233334455666 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccccccc------------ Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG--TASAVIGT------------ 136 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g--~~~~~~~~------------ 136 (298) -....++.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..- -...+.+. T Consensus 96 ~~Y~c~qtn~dt~i~y~~LD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re 174 (337) T protein:vir:10 96 NRYRCEKTDYDTAIPYRKLDMWA-KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeeeeeeccHHHHHHHh-cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHh Confidence 67777777778889999885544 4457888888888888877666667778532111 11112110 Q ss_pred ---ccc-cccc---ccccc-cccccchhHHHHHHHhhh-hhhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeecccc- Q lcl|Aclame:pro 137 ---NHF-DSKV---TQKVE-APRGIADPNGAIENAVEL-LTGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFPELK- 203 (298) Q Consensus 137 ---~~~-~~~~---~~~~~-~~~~~~~~~~~i~~~~~~-l~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~~~~- 203 (298) ..+ ...+ ..... ..+.-....+...+++.. +...+.+ +. +.+|.+...+. --.|-...+.|-=.-.. T Consensus 175 ~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~Aa~ 254 (337) T protein:vir:10 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) T ss_pred cchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHHHHHH Confidence 000 0000 00111 111222233345666654 3444443 22 46666665542 22222222232100000 Q ss_pred cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc Q lcl|Aclame:pro 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) Q Consensus 204 ~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~ 283 (298) .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| -|+ T Consensus 255 ~i~s~k~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~Y 318 (337) T protein:vir:10 255 LIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNLSIYYQEGARRRTLKEVPERDR-IENYESSN---------DAY 318 (337) T ss_pred HHHHhhhhCCceeEEccccCCC------ceEEeechhcEEEEecCcEEEEEEEcccccc-ccchhhcc---------cee Confidence 0111357999999999999976 4778888888777777666544433222111 11222233 456 Q ss_pred EEecccceEEEeecC Q lcl|Aclame:pro 284 GILDATKFARVTEAN 298 (298) Q Consensus 284 ~v~~~~a~~~l~~a~ 298 (298) .|.++.++|.+++.+ T Consensus 319 vVEd~~~~a~ienI~ 333 (337) T protein:vir:10 319 VVEDFGCGCVAENIE 333 (337) T ss_pred eeeccccEEEEecee Confidence 688999999998777 No 196 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=97.36 E-value=8.6e-05 Score=42.87 Aligned_cols=281 Identities=9% Similarity=-0.005 Sum_probs=155.1 Q ss_pred Ceecc-------ccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--ccccccccccceee Q lcl|Aclame:pro 1 MVLNK-------GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat~g-------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~~~~~~~~~~~ 70 (298) +|... ..-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. .+...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t~~~~R~~~~~~~l~~ 95 (337) T protein:vir:79 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecCCCCccccccccccCC Confidence 22222 23366788888999999999999999999987422 2333333344443332 22233334445666 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccccccc------------ Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG--TASAVIGT------------ 136 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g--~~~~~~~~------------ 136 (298) -....++.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..- -...+.+. T Consensus 96 ~~Y~c~qtn~dt~i~y~~LD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re 174 (337) T protein:vir:79 96 NRYRCEKTDYDTAIPYRKLDAWA-KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeeeeeeccHHHHHHHh-cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHh Confidence 67777777778889999885544 4457888888888888877666667778532111 11112110 Q ss_pred ---ccc-cccc---cccc-ccccccchhHHHHHHHhhh-hhhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeecccc- Q lcl|Aclame:pro 137 ---NHF-DSKV---TQKV-EAPRGIADPNGAIENAVEL-LTGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFPELK- 203 (298) Q Consensus 137 ---~~~-~~~~---~~~~-~~~~~~~~~~~~i~~~~~~-l~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~~~~- 203 (298) ..+ ...+ .... ...+.-....+...+++.. +...+.+ +. +.+|.+...+. --.|-...+.|-=.-.. T Consensus 175 ~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l~n~~~~ptE~~Aa~ 254 (337) T protein:vir:79 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) T ss_pred cchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHHhccCCCcHHHHHHH Confidence 000 0000 0011 1111222233345666654 3444443 22 46666665542 22222222232100000 Q ss_pred cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc Q lcl|Aclame:pro 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) Q Consensus 204 ~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~ 283 (298) .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| -|+ T Consensus 255 ~i~s~k~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~Y 318 (337) T protein:vir:79 255 LIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNLSIYYQEGARRRTLKEVPERDR-IENYESSN---------DAY 318 (337) T ss_pred HHHHhhhhCCceeEEccccCCC------ceEEeechhcEEEEecCcEEEEEEEcccccc-ccchhhcc---------cee Confidence 0111357999999999999976 4778888888777777666544433222111 11222233 456 Q ss_pred EEecccceEEEeecC Q lcl|Aclame:pro 284 GILDATKFARVTEAN 298 (298) Q Consensus 284 ~v~~~~a~~~l~~a~ 298 (298) .|.++.++|.+++.+ T Consensus 319 vVEd~~~~a~ienI~ 333 (337) T protein:vir:79 319 VVEDFGCGCVAENIE 333 (337) T ss_pred eeeccccEEEEecee Confidence 678999999998777 No 197 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=97.32 E-value=9.4e-05 Score=42.65 Aligned_cols=281 Identities=11% Similarity=0.034 Sum_probs=154.4 Q ss_pred Ce------e-ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--ccc-ccccccccee Q lcl|Aclame:pro 1 MV------L-NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESG-KKTHGGVTLA 69 (298) Q Consensus 1 ma------t-~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~-~~~~~~~~~~ 69 (298) +| . +...-|.|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. ... ..|..-..++ T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~l~ 95 (338) T protein:vir:11 16 LAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSALD 95 (338) T ss_pred HHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccCCCCCccccccccccC Confidence 22 1 2234477888899999999999999999999987422 2333333344444332 122 2222222455 Q ss_pred eEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc-c-------- Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN-H-------- 138 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~-~-------- 138 (298) .-....++.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..-+ ...+.+.. + T Consensus 96 ~~~Y~c~qtn~dt~i~y~~LD~WA-~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~R 174 (338) T protein:vir:11 96 NQRYECKHTDFDTAITYAMLDAWA-KFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQYR 174 (338) T ss_pred CCccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHH Confidence 556677777777889999885544 44578888888888888776666677785321111 11111100 0 Q ss_pred -------ccccc-ccccc---cc-cccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeeccc Q lcl|Aclame:pro 139 -------FDSKV-TQKVE---AP-RGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFPEL 202 (298) Q Consensus 139 -------~~~~~-~~~~~---~~-~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~~~ 202 (298) ..... +.... .. +.-....+...+++..+ ...+.+ +. +.+|.....+. --++-.....|-=.-. T Consensus 175 e~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~A 254 (338) T protein:vir:11 175 NNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQPATEKIA 254 (338) T ss_pred hhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCCChHHHHH Confidence 00000 00011 11 12223334455666543 444443 22 56777665542 2223332222211000 Q ss_pred c-cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEE Q lcl|Aclame:pro 203 K-WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL 281 (298) Q Consensus 203 ~-~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~ 281 (298) . .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| - T Consensus 255 a~~~~s~k~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e 318 (338) T protein:vir:11 255 TDLILSQKRMGGLPPVEVPYVPEK------GLMVTTLKNLSLYWQIGGRRRYLKEVPEKNR-IENYESSN---------D 318 (338) T ss_pred HHHHHHhhhhCCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---------c Confidence 0 0111357999999999999976 4778888888777777666544433222111 11222333 4 Q ss_pred ccEEecccceEEEeecC Q lcl|Aclame:pro 282 GWGILDATKFARVTEAN 298 (298) Q Consensus 282 ~~~v~~~~a~~~l~~a~ 298 (298) |..|.++.++|.+++.+ T Consensus 319 ~YvVEd~~~~a~ieni~ 335 (338) T protein:vir:11 319 AYVVEDYGLGCLVENIE 335 (338) T ss_pred ceeeeccccEEEeecce Confidence 56678999999998888 No 198 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=97.32 E-value=8.9e-05 Score=42.78 Aligned_cols=279 Identities=10% Similarity=-0.010 Sum_probs=144.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceE-EEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEK-VFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~-ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) ...+.-.-|.|.+.+.+.+.+.+.|-+++..+.+++.--... +....++..++-....+...+. ...+.-....++.- T Consensus 27 ~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~r~~t~~~~~~~-~~~~~~~Y~c~qTn 105 (343) T protein:vir:98 27 ALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYGAHDRRTPIQQR-WTRQVMSMNVSRQI 105 (343) T ss_pred hccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccCccccCCCcccc-ccCCCCccEEEEee Confidence 122223557888889999999999999999999988532112 2222223222221111111000 00011134555555 Q ss_pred EEEeecHHHhhcccccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc---------------cc-ccc Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKIN-ILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTN---------------HF-DSK 142 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~-l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~---------------~~-~~~ 142 (298) .-+.|+.+.|-+..- +.+ +...+++.+.++++.-.-.--|+|+.-..-+ ..+.+.. .+ ... T Consensus 106 ~dt~i~Y~~lD~WA~-~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T-~nPllqDVN~GWLQ~~Re~ap~rVm~~~ 183 (343) T protein:vir:98 106 QACLIPWAKLDQWGH-LKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT-SDPNLADVNKGWIQFVRENKATQILTQG 183 (343) T ss_pred eeeeccHHHHHHhhc-ChhHHHHHHHHHHHHHHhhccceecccceeeccCC-CCcchhhcchHHHHHHHhcchhhhhccc Confidence 567788888744432 344 6677777777777665556666776433222 2222210 00 000 Q ss_pred cc--cc--cccccccchhHHHHHHHhhhhhhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeeccccc--ccCcceecc Q lcl|Aclame:pro 143 VT--QK--VEAPRGIADPNGAIENAVELLTGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFPELKW--GATPDTING 213 (298) Q Consensus 143 ~~--~~--~~~~~~~~~~~~~i~~~~~~l~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~~~~~--~~~~~~l~G 213 (298) .+ .. ...++.-....+...++.+.+...+.+ +. +.+|.+...+. --++-...+++-...... -....++.| T Consensus 184 ~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGG 263 (343) T protein:vir:98 184 ATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIATEKAALNTHDLMKSFGG 263 (343) T ss_pred eeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChHHHHHHHHHHHHHhhCC Confidence 00 01 111112223334445566655554443 22 46666655443 223333334321110000 112357999 Q ss_pred eeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEE Q lcl|Aclame:pro 214 LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFAR 293 (298) Q Consensus 214 ~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~ 293 (298) +|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| -|..|.++.++|. T Consensus 264 l~a~~~PfFP~~------~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~YvVEd~~~~a~ 327 (343) T protein:vir:98 264 MPAMIVPNMPPR------AAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA-VRDSYYRN---------EAYAVEDCGKFMA 327 (343) T ss_pred CeeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---------ceeeeeccccEEE Confidence 999999999976 4778888888777777666544433222111 11223333 4556778888888 Q ss_pred EeecC Q lcl|Aclame:pro 294 VTEAN 298 (298) Q Consensus 294 l~~a~ 298 (298) +++.+ T Consensus 328 iE~i~ 332 (343) T protein:vir:98 328 VDFTK 332 (343) T ss_pred eeeee Confidence 88777 No 199 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=97.32 E-value=9.5e-05 Score=42.62 Aligned_cols=276 Identities=10% Similarity=0.043 Sum_probs=146.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) .+.+.-.-|.|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.-+ ..+ .++..+.-....++.- T Consensus 24 ~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~-R~~-~~~~l~~~~Y~c~qTn 101 (336) T protein:vir:37 24 VLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQTG-RNL-ANLDHTQNGFELAETD 101 (336) T ss_pred hccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccCCC-ccc-cccCcCCcccEEEEee Confidence 1222335678888999999999999999999999986421 233332333333322221 222 2245666667777777 Q ss_pred EEEeecHHHhhcccccHHHHH-HHHHHHHHHHHHHHHHHHHhccccccccccccccccc---------------cccc-c Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKINIL-QAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTN---------------HFDS-K 142 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~l~-~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~---------------~~~~-~ 142 (298) .-+.|+.+.|-+... +.+.. ..+...+.++++.-.-.--|+|+.-..-+ ..+.+.. .+.+ . T Consensus 102 ~dt~i~y~~LD~WA~-~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~T-dnPllqDVNkGWlQ~~Re~a~~~v~~~~ 179 (336) T protein:vir:37 102 SGIIVPWALFDSFAI-FKDRLVELYSEYFQNQVALDILQIGWNGQSVADNT-TKADLSDVNKGWLKLLQEQRAANFMTES 179 (336) T ss_pred eeeeecHHHHHHHhc-ChhHHHHHHHHHHHHHHhhchhhhcccceeeccCC-CCCcccccchhHHHHHHhccchhhcccc Confidence 778899998844432 22321 22222233334433344455675332222 1222210 0000 0 Q ss_pred c---cccc--ccccccchhHHHHHHHhhhhhhcCCcc-c-EEEEcHHHHH-HHHHhhccCC-ceeecccc-c--ccCcce Q lcl|Aclame:pro 143 V---TQKV--EAPRGIADPNGAIENAVELLTGVDADV-T-GIAINPSFRS-ALAKQKDLQG-NALFPELK-W--GATPDT 210 (298) Q Consensus 143 ~---~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~-~-~~vm~~~~~~-~L~~lkd~~G-~~l~~~~~-~--~~~~~~ 210 (298) + .... ..++.-....+...++++.+...+.+. . +.+|.+...+ ..-+|-..++ +|- +.. . -....+ T Consensus 180 ~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt--E~~Aa~~~~~~k~ 257 (336) T protein:vir:37 180 TKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT--EKAALGSHNLMGS 257 (336) T ss_pred cccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH--HHHHHHHHHHHHh Confidence 0 0001 111222233444566776665555432 2 4566665543 2233333333 231 110 0 112358 Q ss_pred ecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccc Q lcl|Aclame:pro 211 INGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK 290 (298) Q Consensus 211 l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a 290 (298) +.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| -|..|.++.+ T Consensus 258 iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~YvVEd~~~ 321 (336) T protein:vir:37 258 FGGMNAITPPNFPAR------AAAVTTLKNLSVYTEAESVRRSLRNDEDKKG-LVTSYYRQ---------EGYVVEDLGL 321 (336) T ss_pred hCCceeEEccccCCC------ceEEeechhcEEEEecCcEEEEEEEcccccc-ccchhhhc---------ceeeeecccc Confidence 999999999999976 4778888888777777666544433222111 11223333 4566889999 Q ss_pred eEEEeecC Q lcl|Aclame:pro 291 FARVTEAN 298 (298) Q Consensus 291 ~~~l~~a~ 298 (298) +|.+++.+ T Consensus 322 ~a~iE~i~ 329 (336) T protein:vir:37 322 MTAIDHTK 329 (336) T ss_pred EEEeeeee Confidence 99999988 No 200 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=97.32 E-value=8e-05 Score=43.03 Aligned_cols=281 Identities=10% Similarity=0.050 Sum_probs=153.8 Q ss_pred Ce---------eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeec--c-ccccccccc Q lcl|Aclame:pro 1 MV---------LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAE--S-GKKTHGGVT 67 (298) Q Consensus 1 ma---------t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E--~-~~~~~~~~~ 67 (298) +| .+...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.- + +..|..-.. T Consensus 16 ~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~ 95 (355) T protein:vir:98 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) T ss_pred HHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccccCCCCCCcccccccc Confidence 22 22334466788889999999999999999999987422 23333333444443321 1 223333344 Q ss_pred eeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--ccccccccc-c------ Q lcl|Aclame:pro 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG--TASAVIGTN-H------ 138 (298) Q Consensus 68 ~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g--~~~~~~~~~-~------ 138 (298) ++.-....++.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..- -...+.+.. + T Consensus 96 l~~~~Y~c~qtn~dt~i~y~~LD~WA-~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~ 174 (355) T protein:vir:98 96 LESSKYECNQINFDFHLKYKTLDLWA-RFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) T ss_pred cCCCccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHH Confidence 55566777777777889999885554 4467888888888888877666667778532111 111111100 0 Q ss_pred --------c-ccc-------ccccc--ccccccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 139 --------F-DSK-------VTQKV--EAPRGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRS-ALAKQKDLQGN 196 (298) Q Consensus 139 --------~-~~~-------~~~~~--~~~~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~-~L~~lkd~~G~ 196 (298) + ... ..... ...+.-....+...+++..+ ...+.+ +. +.+|.+...+ +--+|-..... T Consensus 175 ~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (355) T protein:vir:98 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) T ss_pred HHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccCC Confidence 0 000 00000 01122223334455566544 444333 22 5677766544 32333333333 Q ss_pred eeecc-cccccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 197 ALFPE-LKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 197 ~l~~~-~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) |-=.- ...-....++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| T Consensus 255 ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---- 323 (355) T protein:vir:98 255 NSESLAADIIISQKRIGNLPAVRVPYFPAN------AVLVTTLENLSIYFMDESHRRSIDENPKKDR-VENYESMN---- 323 (355) T ss_pred cHHHHHHHHHHHhhhhCCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---- Confidence 31100 000112358999999999999976 4778888888777777666544433222111 11223333 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -|..|.++.++|.+++.+ T Consensus 324 -----e~YvVEd~~~~a~ienI~ 341 (355) T protein:vir:98 324 -----IDYVVEVYAAGCLLENIT 341 (355) T ss_pred -----ceeeeeccccEEEeecee Confidence 455677888888888766 No 201 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=97.26 E-value=0.00011 Score=42.20 Aligned_cols=277 Identities=9% Similarity=0.033 Sum_probs=143.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVE 79 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~ 79 (298) .+.+.-.-|.|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.-+..... ...+.-....++.- T Consensus 24 ~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~r~r~~--~~l~~~~Y~c~qTn 101 (336) T protein:vir:37 24 VLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQTGRNLAT--LDHSQNGYELSETD 101 (336) T ss_pred hcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccCCCCCccc--cCCCCCccEEEEee Confidence 1222335678888999999999999999999999986421 23333333333333322222211 23445556666666 Q ss_pred EEEeecHHHhhcccccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc---------------cccc-c Q lcl|Aclame:pro 80 YGARISDEFMYASDEEKIN-ILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTN---------------HFDS-K 142 (298) Q Consensus 80 ~~~~iS~ell~~~~d~~~~-l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~---------------~~~~-~ 142 (298) .-+.|+.+.|-+... +.+ ....+...+.++++.-.-.--|+|+.-..-+. .+.+.. .+.+ . T Consensus 102 ~dt~i~y~~LD~WA~-~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td-nPllqDVNkGWlQ~~Re~a~~~v~~~~ 179 (336) T protein:vir:37 102 SGILVNWSLFDSFAI-FKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTT-KTDLSDVNKGWLKLLQEQRAANFMTES 179 (336) T ss_pred eeeeccHHHHHHHhc-ChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCC-CccccccchhHHHHHHhccchhhcccc Confidence 677899998844432 222 11222222333334333344556753322222 222210 0000 0 Q ss_pred c---cccc--ccccccchhHHHHHHHhhhhhhcCCcc-c-EEEEcHHHHHH-HHHhhccCC-ceeeccccc--ccCccee Q lcl|Aclame:pro 143 V---TQKV--EAPRGIADPNGAIENAVELLTGVDADV-T-GIAINPSFRSA-LAKQKDLQG-NALFPELKW--GATPDTI 211 (298) Q Consensus 143 ~---~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~-~-~~vm~~~~~~~-L~~lkd~~G-~~l~~~~~~--~~~~~~l 211 (298) + .... ..++.-....+...++++.+...+.+. . +.+|.+...+. .-+|-..++ +|- ..... -....++ T Consensus 180 ~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt-E~~Aa~~~~~~k~i 258 (336) T protein:vir:37 180 TKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT-EKAALGSHNLMGSF 258 (336) T ss_pred cccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH-HHHHHHHHHHHHhh Confidence 0 0001 111222233444566776665555432 2 45666655432 223333333 221 00000 1124579 Q ss_pred cceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccce Q lcl|Aclame:pro 212 NGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) Q Consensus 212 ~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~ 291 (298) .|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| -|..|.++.++ T Consensus 259 GGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~YvVEd~~~~ 322 (336) T protein:vir:37 259 GGMNAITPPNFPAR------AAAVTTLKNLSVYTEAESVRRSLRNDEDKKG-LVTSYYRQ---------EGYVVEDLGLM 322 (336) T ss_pred CCceEEEccccCCC------ceEEeeccccEEEEecCcEEEEEEEcccccc-ccchhhhc---------ceeeeeccccE Confidence 99999999999976 4778888888777777666544433222111 11223333 45668899999 Q ss_pred EEEeecC Q lcl|Aclame:pro 292 ARVTEAN 298 (298) Q Consensus 292 ~~l~~a~ 298 (298) |.+++.+ T Consensus 323 a~iE~i~ 329 (336) T protein:vir:37 323 TAIDHTK 329 (336) T ss_pred EEeeeee Confidence 9999988 No 202 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.18 E-value=0.00014 Score=41.73 Aligned_cols=269 Identities=8% Similarity=-0.018 Sum_probs=127.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcc------eeecCCCceEEEEEeCCcceEE-eec-cccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSA------QKPIPFNGEKVFTFTMDSEIDV-VAE-SGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~------~~~~~~~~~~ip~~~~~~~a~~-v~E-~~~~~~~~~~~~~v~ 72 (298) ||+ -. ..+.+++.+.+.+++.+....+.. ....++..++||+.+..+-... .+- +......+.++...+ T Consensus 1 MA~-~n--~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ 77 (299) T protein:vir:79 1 MAA-LN--YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKV 77 (299) T ss_pred Ccc-ch--hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEE Confidence 993 12 236677788888888776555432 2234456799999875333322 222 222223455666677 Q ss_pred EeeeEEEEEEeecHHHhhcccccH--HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDEEK--INILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d~~--~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) +.-.|.-.+. |- .+ +..++. ..+...+.+...+.++-.+|...+...- ++. ..... ...... T Consensus 78 ldqdr~~~f~-vD-~~--Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~--~~a-------~~~g~---~~~~~~ 141 (299) T protein:vir:79 78 LTNQRKWSTL-VH-PA--DINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIY--ADW-------TALGN---TADTTV 141 (299) T ss_pred eeccccceec-cc-hh--hHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHH--Hhh-------hhcCC---cccccc Confidence 7766643322 11 00 001111 1122223333444455555655443210 000 00000 011111 Q ss_pred cccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhc--cCCceeecccccccCcceecceeeEe--cCcccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD--LQGNALFPELKWGATPDTINGLPVDV--NKTVSD 224 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd--~~G~~l~~~~~~~~~~~~l~G~PV~~--s~~~~~ 224 (298) -+.+..|+.|.++..++..++... -.++++|.++..|.+.+. .............+..++|.|+||+. ++.|++ T Consensus 142 ~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t 221 (299) T protein:vir:79 142 LTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKT 221 (299) T ss_pred cCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCc Confidence 234567899999999999888754 357999999998876532 12222222233456678999999975 445543 Q ss_pred ccc---------ccc-ceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc--ce- Q lcl|Aclame:pro 225 MSL---------TQR-DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT--KF- 291 (298) Q Consensus 225 ~~~---------~~~-~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~--a~- 291 (298) ... +++ --+++.. ..+. +.+..--.+++++ .+.++. +-.++.-+.+.|.=|.+.+ ++ T Consensus 222 ~~~~~~G~~~~~~ak~in~ii~~-~~a~-~~~~K~~~~~~~~-P~~~~~-------~~~~~~~r~y~d~~v~~nk~~~i~ 291 (299) T protein:vir:79 222 AYDFTTGWKVGAGAKQIFMSLVH-PSAI-ITPVSYQFSKLDE-PTAVTE-------GKYFYFEESFEDVFILNKKADAIQ 291 (299) T ss_pred cceeccCccccCcccccceEEEc-CCee-eeeEeeeeEEeec-CCCCCc-------cceeeeeeeeeeeeeeccccCeEE Confidence 111 111 1234443 3332 2233222333332 122222 1122333345555555432 33 Q ss_pred EEEeecC Q lcl|Aclame:pro 292 ARVTEAN 298 (298) Q Consensus 292 ~~l~~a~ 298 (298) +-.+.|- T Consensus 292 ~~~~~a~ 298 (299) T protein:vir:79 292 FVVEGAG 298 (299) T ss_pred EEeeecC Confidence 3334444 No 203 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=97.08 E-value=0.00013 Score=41.80 Aligned_cols=281 Identities=14% Similarity=0.064 Sum_probs=154.0 Q ss_pred Ce---------eccc--cccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeec---cccccccc Q lcl|Aclame:pro 1 MV---------LNKG--TLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAE---SGKKTHGG 65 (298) Q Consensus 1 ma---------t~gg--~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E---~~~~~~~~ 65 (298) +| .+.+ .-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.- +...|..- T Consensus 16 ~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~ 95 (342) T protein:vir:10 16 QAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVASTTDTSGDGERKTTSI 95 (342) T ss_pred HHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCcccccccccCCCCCcccccc Confidence 22 2333 4577888899999999999999999999987422 23333333444443321 12233333 Q ss_pred cceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--ccccccc------- Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGT------- 136 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~------- 136 (298) ..++.-....++.-.-+.|+.+.|-+.. .+.++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+. T Consensus 96 ~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWl 174 (342) T protein:vir:10 96 AKLVKQTYHCQQINFDTHINYKQLDMWA-KFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGWL 174 (342) T ss_pred cccCCCccEEEEeeecccccHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHH Confidence 4556666677777777889999885443 44577778888888877766666667775322111 1111110 Q ss_pred ---------cccccccc--ccc-ccccccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeec Q lcl|Aclame:pro 137 ---------NHFDSKVT--QKV-EAPRGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFP 200 (298) Q Consensus 137 ---------~~~~~~~~--~~~-~~~~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~ 200 (298) ........ ... ..++.-....+...+++..+ ...+.+ +. +.+|.+...+. --.|-.....|-=. T Consensus 175 Q~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~ 254 (342) T protein:vir:10 175 QKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQQNAPTEE 254 (342) T ss_pred HHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCChHHH Confidence 00000110 101 11122223334455667543 444443 22 46666666542 22222222222100 Q ss_pred cc-ccccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 201 EL-KWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAEL 279 (298) Q Consensus 201 ~~-~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~ 279 (298) -. ..-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| T Consensus 255 ~Aa~~i~s~k~iGGl~a~~~PfFP~~------~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N-------- 319 (342) T protein:vir:10 255 LAADIVISQKRIGGLKAVRVPFFPAN------AILITKLENLAIYVQEGTTRKHIENVPKKDR-IETYESEN-------- 319 (342) T ss_pred HHHHHHHhhhhhcCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc-------- Confidence 00 00112357999999999999976 4778888887777777666544433222111 11222333 Q ss_pred EEccEEecccceEEEeecC Q lcl|Aclame:pro 280 FLGWGILDATKFARVTEAN 298 (298) Q Consensus 280 r~~~~v~~~~a~~~l~~a~ 298 (298) -|..|.++.++|.+++.+ T Consensus 320 -e~YvVEd~~~~a~iE~i~ 337 (342) T protein:vir:10 320 -IDYVVEDYGCAALIENIT 337 (342) T ss_pred -cceeeeccccEEEeecce Confidence 456678889999888777 No 204 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=97.08 E-value=5.1e-05 Score=44.13 Aligned_cols=197 Identities=11% Similarity=0.011 Sum_probs=95.2 Q ss_pred EEEEEeecHHHhhccc--ccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccch Q lcl|Aclame:pro 78 VEYGARISDEFMYASD--EEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) Q Consensus 78 ~~~~~~iS~ell~~~~--d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) +-. ..+|+-++.+.+ .+..++.+...+++++++++..|+.++.-.-.+.....+..+..+.... ......+..... T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~-~~~a~~t~~~~~ 78 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSV-NIGAGNTNNAQA 78 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcce-eccccccCCHHH Confidence 111 123444443322 3456688899999999999999998875221111111111111110010 011112234455 Q ss_pred hHHHHHHHhhhhhhcCCccc--EEEEcHHHHHHHHHhhcc-CCceeec---cccccc-CcceecceeeEecCcccccccc Q lcl|Aclame:pro 156 PNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QGNALFP---ELKWGA-TPDTINGLPVDVNKTVSDMSLT 228 (298) Q Consensus 156 ~~~~i~~~~~~l~~~~~~~~--~~vm~~~~~~~L~~lkd~-~G~~l~~---~~~~~~-~~~~l~G~PV~~s~~~~~~~~~ 228 (298) +++.|.++..+|-..+.... .++++|..+..|.+-.|. --+.-+. .....+ ..+++.|++|+.|+++|...++ T Consensus 79 l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt 158 (221) T protein:vir:17 79 IVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGT 158 (221) T ss_pred HHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccc Confidence 67889999999888887644 367799888777653221 1111111 111222 3567999999999999986655 Q ss_pred ccc-------------eEEEeeccceEEEEe-ecce-EEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccce Q lcl|Aclame:pro 229 QRD-------------RAIIGDFANGFKWGY-AKEV-PLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) Q Consensus 229 ~~~-------------~~~~gd~~~~~~~~~-~~~~-~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~ 291 (298) +.. ..+=|||++.+.+.. ++.+ +++..--.....-.+++| .++||+.- T Consensus 159 ~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~~~~~~~~---------------~~~~~~~~ 221 (221) T protein:vir:17 159 NLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRPPLVISMF---------------SIRRPDRR 221 (221) T ss_pred ccccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCCceeeeee---------------eccCCCCC Confidence 322 123455554332221 1111 222211111111111222 12333322 No 205 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=96.96 E-value=0.00023 Score=40.48 Aligned_cols=279 Identities=10% Similarity=0.015 Sum_probs=152.2 Q ss_pred Cee---------ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccceee Q lcl|Aclame:pro 1 MVL---------NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat---------~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~ 70 (298) +|. +...-|.|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.. ..+.....++. T Consensus 20 ~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt~t--r~~~~~~~l~~ 97 (358) T protein:vir:78 20 LAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRKKG--GRFKGKVGVDG 97 (358) T ss_pred HHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceecCC--CccccccccCC Confidence 222 3345578888899999999999999999999887422 23333233344443333 23333444555 Q ss_pred EEEeeeEEEEEEeecHHHhhcccc--cHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--ccccccc---------- Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDE--EKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGT---------- 136 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d--~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~---------- 136 (298) -....++.-.-+.|+.+.|-+..- +..++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+. T Consensus 98 ~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~ 177 (358) T protein:vir:78 98 NTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQLA 177 (358) T ss_pred CccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHH Confidence 566666666777889988744321 22357777777788877766666667775322111 1111111 Q ss_pred ------ccccccc-ccccccc----cccchhHHHHHHHhhh-hhhcCCcc-c-EEEEcHHHHH-HHHHhhccCCceeecc Q lcl|Aclame:pro 137 ------NHFDSKV-TQKVEAP----RGIADPNGAIENAVEL-LTGVDADV-T-GIAINPSFRS-ALAKQKDLQGNALFPE 201 (298) Q Consensus 137 ------~~~~~~~-~~~~~~~----~~~~~~~~~i~~~~~~-l~~~~~~~-~-~~vm~~~~~~-~L~~lkd~~G~~l~~~ 201 (298) ......+ +.....+ +.-....+...+++.. +...+.+. . +.+|.+...+ +--+|-...+.|-=. T Consensus 178 Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~pTE~- 256 (358) T protein:vir:78 178 REWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYSEATKPSEQ- 256 (358) T ss_pred HhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCCcHHH- Confidence 0000000 0111011 1222333444556643 34444332 2 4666666654 323333333333110 Q ss_pred cccccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEE Q lcl|Aclame:pro 202 LKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL 281 (298) Q Consensus 202 ~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~ 281 (298) ........++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| - T Consensus 257 ~Aa~~i~k~iGGlpa~~~PfFP~~------~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-iE~y~s~N---------e 320 (358) T protein:vir:78 257 IAAQQLAKSIAGRKAYIPPFFPGK------RMVVTTLDNLHCYTQRGTRKRKADDNQDSKS-FDNQYWRM---------E 320 (358) T ss_pred HHHHHHHHHhCCCeEEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---------c Confidence 001111257899999999999976 4778888888777777666544433222211 11223333 4 Q ss_pred ccEEecccceEEEeecC Q lcl|Aclame:pro 282 GWGILDATKFARVTEAN 298 (298) Q Consensus 282 ~~~v~~~~a~~~l~~a~ 298 (298) |..|.++.++|.+++.+ T Consensus 321 ~YvVEd~~~~a~iE~i~ 337 (358) T protein:vir:78 321 GYALGEHKAYGGFEEAD 337 (358) T ss_pred eeeeeccccEEEEeeee Confidence 56678999999998887 No 206 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=96.94 E-value=0.0002 Score=40.89 Aligned_cols=276 Identities=11% Similarity=0.045 Sum_probs=143.0 Q ss_pred Ceecc-------ccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNK-------GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~g-------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) +|... ..-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.- +..+ .++.++.-. T Consensus 20 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt-~R~~-r~~~l~~~~ 97 (341) T protein:vir:27 20 LAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAG-GRFT-KQVGVGGHK 97 (341) T ss_pred HHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCC-Ccee-cccccCCcc Confidence 22222 23467788899999999999999999999886422 22222223333333322 2222 223566666 Q ss_pred EeeeEEEEEEeecHHHhhcccc--cHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccccccc------------ Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDE--EKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG--TASAVIGT------------ 136 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d--~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g--~~~~~~~~------------ 136 (298) ...++.-.-+.|+.+.|-+... .+.++...+.+.+.++++.-.-.--|+|+.-..- -...+.+. T Consensus 98 Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re 177 (341) T protein:vir:27 98 YKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKN 177 (341) T ss_pred eEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHHh Confidence 7777777778899988754443 2567888888888888877666677778532111 11111110 Q ss_pred ---cccccccccccccccccchhHHHHHHHhhhh-hhcCCcc-c-EEEEcHHHHH-HHHHhhccCCceeecccccccCcc Q lcl|Aclame:pro 137 ---NHFDSKVTQKVEAPRGIADPNGAIENAVELL-TGVDADV-T-GIAINPSFRS-ALAKQKDLQGNALFPELKWGATPD 209 (298) Q Consensus 137 ---~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l-~~~~~~~-~-~~vm~~~~~~-~L~~lkd~~G~~l~~~~~~~~~~~ 209 (298) ..+...........+.-....+...+++..+ ...+.+. . +.+|.+...+ .--+|-+....|-=. .....-.. T Consensus 178 ~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~-~Aa~~i~k 256 (341) T protein:vir:27 178 RKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSEQ-IAAQKLDK 256 (341) T ss_pred hcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHHH-HHHHHHHH Confidence 0000000011111111222333455666543 4443332 2 4666665554 222332222222100 00011135 Q ss_pred eecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc Q lcl|Aclame:pro 210 TINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT 289 (298) Q Consensus 210 ~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~ 289 (298) ++.|+|.+..+++|.+ .+++.-|++.-.|...+...=.+.+...-+.-. + | +. ++.|.+.. T Consensus 257 ~iGGlpa~~~PffP~~------~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie-~-y--es---------~YvVEdyg 317 (341) T protein:vir:27 257 TIAGRPAYVPPFLPDN------AMVVTIPENLQVLTQHGTAQRKAKHESDRKRSK-T-H--TG---------AWKVTQWV 317 (341) T ss_pred hhCCCeEEEccccCCC------ceEEeeccceEEEEecCcEEEEEEecccccccc-c-h--hh---------hheeehhh Confidence 8999999999999976 477888888877777666654443322221110 1 1 22 23344444 Q ss_pred ceE-----EEeecC Q lcl|Aclame:pro 290 KFA-----RVTEAN 298 (298) Q Consensus 290 a~~-----~l~~a~ 298 (298) +|+ .+|..+ T Consensus 318 ~~~~~~~~~vkl~~ 331 (341) T protein:vir:27 318 CWKRSPLTTQKKST 331 (341) T ss_pred hhhhccccccccCc Confidence 444 344433 No 207 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=96.82 E-value=0.00032 Score=39.77 Aligned_cols=285 Identities=13% Similarity=0.081 Sum_probs=147.8 Q ss_pred Ceeccccccchh----HHHHHHHHHHhhchhhhhcceeecCCC---ceEEEEEeCCcce------EEeeccccc------ Q lcl|Aclame:pro 1 MVLNKGTLFDPE----LVTDLISKVAGKSSIARLSAQKPIPFN---GEKVFTFTMDSEI------DVVAESGKK------ 61 (298) Q Consensus 1 mat~gg~lip~~----~~~~ii~~~~~~s~i~~~~~~~~~~~~---~~~ip~~~~~~~a------~~v~E~~~~------ 61 (298) ..+..|.+=|+- +..+.+..+++.-.+.+++...|++.+ .+++.+...-+.+ +..++|++. T Consensus 12 ~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y 91 (401) T protein:vir:95 12 KSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLY 91 (401) T ss_pred cccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCcccccccCcccc Confidence 233334433321 235666666777788999999999854 2333332222221 122233321 Q ss_pred -----------------------cccccceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHH-HHHHH---HHHHHH Q lcl|Aclame:pro 62 -----------------------THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAF-NDGFA---KKVARG 114 (298) Q Consensus 62 -----------------------~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i-~~~la---~~i~~~ 114 (298) ....++-..++.+.++++.++.+|+++..-.+| ..+.+.+ .+.|. +..... T Consensus 92 ~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D--~~l~~h~s~ell~g~~~~t~d~ 169 (401) T protein:vir:95 92 GSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSD--DGLMEHLSRELMNGATQITEAV 169 (401) T ss_pred ccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcc--hHHHHHHHHHHhhhhhhhHHHH Confidence 112333345677899999999999997654444 3444433 22222 223344 Q ss_pred HHHHHhcccccccccccccccccccccccccccccccccchhHHHHHHHhhhhhhcCCc------------------cc- Q lcl|Aclame:pro 115 IDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDAD------------------VT- 175 (298) Q Consensus 115 ~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~------------------~~- 175 (298) +-+.+|++-+ ..--.|..... ++............++++..+...|..+... .+ T Consensus 170 i~~dll~ag~-----~viyAg~ats~--At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~ 242 (401) T protein:vir:95 170 LQKDLLAAAG-----TVLYAGAATSD--ATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATR 242 (401) T ss_pred HHHHHHhhcC-----eeecCCcccee--eeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccce Confidence 4456664310 00000111111 1111122333344567777777666542211 11 Q ss_pred EEEEcHHHHHHHHHhhccCCceeeccc--------ccccCcceecceeeEecCccc--------ccc------------c Q lcl|Aclame:pro 176 GIAINPSFRSALAKQKDLQGNALFPEL--------KWGATPDTINGLPVDVNKTVS--------DMS------------L 227 (298) Q Consensus 176 ~~vm~~~~~~~L~~lkd~~G~~l~~~~--------~~~~~~~~l~G~PV~~s~~~~--------~~~------------~ 227 (298) .-+||+.....|+.++|-.|.|-|.+. .+.+..+++.++.+++++.+- ... + T Consensus 243 va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~g 322 (401) T protein:vir:95 243 VMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQ 322 (401) T ss_pred EEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCC Confidence 258899999999999998888877433 345667888899999887643 111 1 Q ss_pred cccc---eEEEeeccceEEEEeec-ce----EEEEee--cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 228 TQRD---RAIIGDFANGFKWGYAK-EV----PLEVIQ--YGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 228 ~~~~---~~~~gd~~~~~~~~~~~-~~----~i~~~~--~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) ++.+ ..++|.-..+. ..... +. .+-+.. +...+ ...-|-|++.+.|+ +..++.+++++-.++|+-+ T Consensus 323 g~~dVyp~lV~G~dAf~~-~~l~g~g~~~~~~~ivk~pG~~~ad-~~DPlgQ~g~vgwK--~~~a~~vL~~e~m~~ies~ 398 (401) T protein:vir:95 323 EHYDVYPMLVVGDDSFTS-IGFQTDGKSLKFTVMTKMPGKETAD-RNDPYGETGFSSIK--WYYGILVKRPERLALIKTV 398 (401) T ss_pred CcceeeeeeEEcccccee-cccccCCccccceeEeecCCcCCCC-CCCcccceehhhhh--hhhhhheeccceeEEEEee Confidence 1111 34556543221 22221 11 222222 11111 11123455666666 3677888999999999888 Q ss_pred C Q lcl|Aclame:pro 298 N 298 (298) Q Consensus 298 ~ 298 (298) - T Consensus 399 a 399 (401) T protein:vir:95 399 A 399 (401) T ss_pred c Confidence 8 No 208 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=96.81 E-value=0.00025 Score=40.32 Aligned_cols=281 Identities=10% Similarity=0.038 Sum_probs=153.2 Q ss_pred Ce---------eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--ccccccccc-cc Q lcl|Aclame:pro 1 MV---------LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGKKTHGG-VT 67 (298) Q Consensus 1 ma---------t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~~~~~~-~~ 67 (298) +| .+...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. -+.+....+ .. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:20 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 22 22344577888899999999999999999999887422 2333333344443332 112222222 34 Q ss_pred eeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc-c------ Q lcl|Aclame:pro 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN-H------ 138 (298) Q Consensus 68 ~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~-~------ 138 (298) ++.-....++.-.-+.|+.+.|-+.. .+.++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+.. + T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:20 96 LASNKYECDQINFDFYIRYKTLDLWA-RYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeecccccHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 56666677777777889999885543 44677777888888877766666667775322111 11111110 0 Q ss_pred --------ccc-------c-ccccc--ccccccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 139 --------FDS-------K-VTQKV--EAPRGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRS-ALAKQKDLQGN 196 (298) Q Consensus 139 --------~~~-------~-~~~~~--~~~~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~-~L~~lkd~~G~ 196 (298) +.+ . .+... ..++.-....+...+++..+ ...+.+ +. +.+|.+...+ +--+|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:20 175 YRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQD 254 (357) T ss_pred HHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCC Confidence 000 0 00001 01122223334455667543 444443 22 4566665554 23333333333 Q ss_pred eeecccc-cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 197 ALFPELK-WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 197 ~l~~~~~-~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) |-=.-.. .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| T Consensus 255 ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~------~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-iE~y~s~N---- 323 (357) T protein:vir:20 255 NSEMLAADVIISQKRIGNLPAVRVPYFPAD------AMLITKLENLSIYYMDDSHRRVIEENPKLDR-VENYESMN---- 323 (357) T ss_pred hHHHHHHHHHHHhhhhCCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---- Confidence 3211000 0111357999999999999976 4778888888777777666544433222111 11223333 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -|..|.++.++|.+++.+ T Consensus 324 -----e~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:20 324 -----IDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -----ceeeeeccccEEEeeeee Confidence 455678888888888777 No 209 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=96.79 E-value=0.00026 Score=40.26 Aligned_cols=281 Identities=10% Similarity=0.029 Sum_probs=153.3 Q ss_pred Ce---------eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--cccc-ccccccc Q lcl|Aclame:pro 1 MV---------LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGK-KTHGGVT 67 (298) Q Consensus 1 ma---------t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~-~~~~~~~ 67 (298) +| .+...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. -+.+ .|..-.. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:56 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccccCCCCCCcccccccc Confidence 22 22344577888899999999999999999999887422 2333323334343332 1122 2222234 Q ss_pred eeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc-c------ Q lcl|Aclame:pro 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN-H------ 138 (298) Q Consensus 68 ~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~-~------ 138 (298) ++.-....++.-.-+.|+.+.|-+.. .+.++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+.. + T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:56 96 LASNKYECDQINFDFYIRYKTLDLWA-RYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeecccccHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 56666677777777889999885543 44677777888888877766666667775322111 11111110 0 Q ss_pred --------ccc--------cccccc--ccccccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 139 --------FDS--------KVTQKV--EAPRGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRS-ALAKQKDLQGN 196 (298) Q Consensus 139 --------~~~--------~~~~~~--~~~~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~-~L~~lkd~~G~ 196 (298) +.+ ..+... ..++.-....+...+++..+ ...+.+ +. +.+|.+...+ +--+|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:56 175 YRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNKEQD 254 (357) T ss_pred HHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhhccCC Confidence 000 000001 01122223334455667543 444443 22 4566665554 23333333333 Q ss_pred eeecccc-cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 197 ALFPELK-WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 197 ~l~~~~~-~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) |-=.-.. .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| T Consensus 255 pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~------~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-iE~y~s~N---- 323 (357) T protein:vir:56 255 NSEMLAADVIISQKRIGNLPAVRVPYFPAD------AMLITKLENLSIYYMDDSHRRVIEENPKLDR-VENYESMN---- 323 (357) T ss_pred hHHHHHHHHHHHhhhhCCceeEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---- Confidence 3211000 0111357999999999999976 4778888888777777666544433222111 11223333 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -|..|.++.++|.+++.+ T Consensus 324 -----e~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:56 324 -----IDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -----ceeeeeccccEEEeeeee Confidence 455678888999888877 No 210 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=96.79 E-value=0.00026 Score=40.20 Aligned_cols=281 Identities=10% Similarity=0.029 Sum_probs=152.8 Q ss_pred Ce---------eccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--cccc-ccccccc Q lcl|Aclame:pro 1 MV---------LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGK-KTHGGVT 67 (298) Q Consensus 1 ma---------t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~-~~~~~~~ 67 (298) +| .+...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. -+.+ .|..-.. T Consensus 16 ~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~ 95 (357) T protein:vir:60 16 VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSK 95 (357) T ss_pred HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcccccccccCCCCCcccccccc Confidence 22 22344577888899999999999999999999887422 2333333344443332 1122 2222234 Q ss_pred eeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc-c------ Q lcl|Aclame:pro 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN-H------ 138 (298) Q Consensus 68 ~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~-~------ 138 (298) ++.-....++.-.-+.|+.+.|-+.. .+.++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+.. + T Consensus 96 l~~~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~ 174 (357) T protein:vir:60 96 LASNKYECDQINFDFYIRYKTLDLWA-RYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQK 174 (357) T ss_pred cCCCccEEEEeeeeccccHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeeccCChhhCcCccccchhHHHH Confidence 56666677777777889999885543 44677777888888877766666667775321111 11111110 0 Q ss_pred --------cc-------c-cccccc--ccccccchhHHHHHHHhhhh-hhcCCc-cc-EEEEcHHHHH-HHHHhhccCCc Q lcl|Aclame:pro 139 --------FD-------S-KVTQKV--EAPRGIADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRS-ALAKQKDLQGN 196 (298) Q Consensus 139 --------~~-------~-~~~~~~--~~~~~~~~~~~~i~~~~~~l-~~~~~~-~~-~~vm~~~~~~-~L~~lkd~~G~ 196 (298) +. + ..+... ..++.-....+...+++..+ ...+.+ +. +.+|.+...+ +--+|-...+. T Consensus 175 ~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ 254 (357) T protein:vir:60 175 YRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVNREQD 254 (357) T ss_pred HHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhhcCCC Confidence 00 0 000001 01122223334455667543 444443 22 4566666554 22333333333 Q ss_pred eeecccc-cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEE Q lcl|Aclame:pro 197 ALFPELK-WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) Q Consensus 197 ~l~~~~~-~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~ 275 (298) |-=.-.. .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++..+=.+.+...-+. ..++..+| T Consensus 255 pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~------~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-iE~y~s~N---- 323 (357) T protein:vir:60 255 NSEMLAADVIISQKRIGNLPAVRVPYFPAD------AMLITKLENLSIYYMDDSHRRVIEENPKLDR-VENYESMN---- 323 (357) T ss_pred hHHHHHHHHHHHhhhhcCcceEEccccCCC------ceEEeeccccEEEEecCcEEEEEEecccccc-ccchhhhc---- Confidence 3210000 0111357999999999999976 4778888888777777666544433222111 11223333 Q ss_pred EEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -|..|.++.++|.+++.+ T Consensus 324 -----e~YvVEd~~~~a~iE~i~ 341 (357) T protein:vir:60 324 -----IDYVVEDYAAGCLVEKIK 341 (357) T ss_pred -----ceeeeeccccEEEeeeee Confidence 455678888888888777 No 211 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=96.73 E-value=0.00038 Score=39.35 Aligned_cols=281 Identities=9% Similarity=0.001 Sum_probs=154.2 Q ss_pred Ceecc-------ccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEeec--cccccccccceee Q lcl|Aclame:pro 1 MVLNK-------GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAE--SGKKTHGGVTLAP 70 (298) Q Consensus 1 mat~g-------g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~E--~~~~~~~~~~~~~ 70 (298) +|... ..-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+.- +...|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (337) T protein:vir:78 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) T ss_pred HHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecCCCcccccccccccCC Confidence 22222 23377888889999999999999999999887422 23333333343433322 2233333345666 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--ccccccc------------ Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGT------------ 136 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~------------ 136 (298) -....++.-.-+.|+.+.|-+.. .+.++...+.+.+.++++.-.-.--|+|+.-..-+ ...+.+. T Consensus 96 ~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re 174 (337) T protein:vir:78 96 NRYRCEKTDYDTAIPYRKLDMWA-KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) T ss_pred CccEEEEeceecccCHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHh Confidence 66677777777889999885443 44577788888888877766666667775322111 1111111 Q ss_pred ---ccc-ccccc---cc-cccccccchhHHHHHHHhhh-hhhcCCc-cc-EEEEcHHHHHH-HHHhhccCCceeecccc- Q lcl|Aclame:pro 137 ---NHF-DSKVT---QK-VEAPRGIADPNGAIENAVEL-LTGVDAD-VT-GIAINPSFRSA-LAKQKDLQGNALFPELK- 203 (298) Q Consensus 137 ---~~~-~~~~~---~~-~~~~~~~~~~~~~i~~~~~~-l~~~~~~-~~-~~vm~~~~~~~-L~~lkd~~G~~l~~~~~- 203 (298) ..+ ...+. .. ....+.-....+...+++.. +...+.+ +. +.+|.+...+. --.+-...+.|-=.-.. T Consensus 175 ~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~~ptE~~Aa~ 254 (337) T protein:vir:78 175 RAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) T ss_pred cchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCCCcHHHHHHH Confidence 000 00000 00 11112222334455666654 3444443 22 46666666542 22222232333110000 Q ss_pred cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc Q lcl|Aclame:pro 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) Q Consensus 204 ~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~ 283 (298) .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| -|+ T Consensus 255 ~i~s~k~iGGl~a~~~PfFP~~------~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~Y 318 (337) T protein:vir:78 255 LIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNLSIYYQEGARRRTLKEVPERDR-IENYESSN---------DAY 318 (337) T ss_pred HHHHhhhhcCcceEEccccCCC------ceEEeechhcEEEEecCcEEEEEEecccccc-ccchhhcc---------cee Confidence 0112357999999999999976 4778888888777777666544433222111 11222233 456 Q ss_pred EEecccceEEEeecC Q lcl|Aclame:pro 284 GILDATKFARVTEAN 298 (298) Q Consensus 284 ~v~~~~a~~~l~~a~ 298 (298) .|.++.++|.+++.+ T Consensus 319 vVEd~~~~a~iEnI~ 333 (337) T protein:vir:78 319 VVEDFGCGCVAENIE 333 (337) T ss_pred eeeccccEEEEecee Confidence 678999999998777 No 212 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=96.72 E-value=0.00039 Score=39.28 Aligned_cols=281 Identities=11% Similarity=0.033 Sum_probs=152.7 Q ss_pred Cee-------ccccccchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEeCCcceEEee--ccccccccccceee Q lcl|Aclame:pro 1 MVL-------NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat-------~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~~~~~~~~~~~ 70 (298) +|. +...-|-|.+.+.+.+.+.+.|-+++..+.+++.--. -.+-.-.+++-++-+. -++..|..-..++. T Consensus 16 ~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt~~~~R~~~~~~~l~~ 95 (339) T protein:vir:79 16 IAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDTTQQDRETSDISTMDG 95 (339) T ss_pred HHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccCCCCCcccccccccCC Confidence 222 2234477888899999999999999999999887422 2333333333343321 12222222235555 Q ss_pred EEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccccc----------- Q lcl|Aclame:pro 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGT--ASAVIGTN----------- 137 (298) Q Consensus 71 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~--~~~~~~~~----------- 137 (298) -....++.-.-+.|+.+.|-+.. .+.++...+++.+.++++.-.-.--|+|+.-..-+ ...+.+.. T Consensus 96 ~~Y~c~qTn~dt~i~Y~~lD~WA-~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re 174 (339) T protein:vir:79 96 RRYRCEQTNSDTHITYQKLDAWA-KFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNLRE 174 (339) T ss_pred CccEEEEeeeeceecHHHHHHHh-cChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcCccccchhHHHHHHh Confidence 66667777777889999885443 44577788888888877766666667775321111 11111110 Q ss_pred ----c-cccc---ccccc--ccccccchhHHHHHHHhhh-hhhcCCc-cc-EEEEcHHHHH-HHHHhhccCCceeecccc Q lcl|Aclame:pro 138 ----H-FDSK---VTQKV--EAPRGIADPNGAIENAVEL-LTGVDAD-VT-GIAINPSFRS-ALAKQKDLQGNALFPELK 203 (298) Q Consensus 138 ----~-~~~~---~~~~~--~~~~~~~~~~~~i~~~~~~-l~~~~~~-~~-~~vm~~~~~~-~L~~lkd~~G~~l~~~~~ 203 (298) . .... ..... ..++.-....+...+++.. +...+.+ +. +.+|.+...+ +--+|-.....|-=.-.. T Consensus 175 ~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa 254 (339) T protein:vir:79 175 QAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFPLVNRDRDPVQQIAA 254 (339) T ss_pred hhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhhHhhcCCChHHHHHH Confidence 0 0000 00111 1112223334455667753 4444443 22 4566666554 222332332233110000 Q ss_pred -cccCcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEc Q lcl|Aclame:pro 204 -WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) Q Consensus 204 -~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~ 282 (298) .-....++.|+|.+..+++|.+ .+++.-|++.-.|..++...=.+.+...-+. ..++..+| -| T Consensus 255 ~~i~s~k~iGGl~a~~~PfFP~~------~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r-ie~y~s~N---------e~ 318 (339) T protein:vir:79 255 DLIISQKRIGNLPAIRVPYFPAN------GLLVTRLDNLSIYYQEGGRRRTILDNAKRDR-IENYESSN---------DA 318 (339) T ss_pred HHHHHhhhhCCceeEEccccCCC------ceEEeechhcEEEEecCcEEEEEEecccccc-ccchhhcc---------ce Confidence 0111357999999999999976 4778888888777777666544433222111 11222333 45 Q ss_pred cEEecccceEEEeecC Q lcl|Aclame:pro 283 WGILDATKFARVTEAN 298 (298) Q Consensus 283 ~~v~~~~a~~~l~~a~ 298 (298) +.|.++.++|.+++.+ T Consensus 319 YvVEd~~~~a~iEni~ 334 (339) T protein:vir:79 319 YVIEDLACAAMAENIA 334 (339) T ss_pred eeeeccccEEEeeeee Confidence 6678888888888777 No 213 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=271 Identities=13% Similarity=0.082 Sum_probs=131.3 Q ss_pred Ceeccccc--cchhHHHHHHHHHH-hhchhhhhcceeecCCCceEEEEEeCCcce-EEeeccccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTL--FDPELVTDLISKVA-GKSSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~l--ip~~~~~~ii~~~~-~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~~~~v~l~~~ 76 (298) |.++...| +-..+...+.+... .....++++++.|..+..-++.++..-|.. .|.+| .+-.++.=...+++.+ T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~Ge---~~~~~l~~~~~~i~~~ 77 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWIGA---KVVKNLKAYKYVVENE 77 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccccc---eeeccccccceeEEee Confidence 77665432 11122222222222 223467788777755555666666665654 57654 4444555556778899 Q ss_pred EEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----ccccccccccccccccc------cccccc Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----NPRLGTASAVIGTNHFD------SKVTQK 146 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~~~~g~~~~~~~~~~~~------~~~~~~ 146 (298) +.+..+.||++.+ +++...+..-+.+.++++.++..|+.++.-. ++.-.....+....+.. +..... T Consensus 78 ~~g~~v~i~R~~i---~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~ 154 (302) T protein:vir:10 78 DFEATVEVDRNDI---EDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAP 154 (302) T ss_pred cccceecccHHhh---cccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchh Confidence 9999999999977 4556788889999999999998887776421 11111111222111110 000000 Q ss_pred --cccccccchhHHHHHHHhhhhhhcC-----CcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecc-eeeEe Q lcl|Aclame:pro 147 --VEAPRGIADPNGAIENAVELLTGVD-----ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTING-LPVDV 218 (298) Q Consensus 147 --~~~~~~~~~~~~~i~~~~~~l~~~~-----~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G-~PV~~ 218 (298) ..........+...+.++......+ ..|..++..|.....-+++-.. ++. . .+..+-+.| +.+++ T Consensus 155 ~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~-~~~-~-----~g~~Np~~g~~~~vv 227 (302) T protein:vir:10 155 LSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN-PKL-A-----DNTPNPYVGTAELVV 227 (302) T ss_pred hhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc-ccc-C-----CCCcceeccceEEEE Confidence 0011112223444444444443332 3355678777776665554311 111 1 111222333 35566 Q ss_pred cCccccccccccceEEEeecc--ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc------EEecccc Q lcl|Aclame:pro 219 NKTVSDMSLTQRDRAIIGDFA--NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW------GILDATK 290 (298) Q Consensus 219 s~~~~~~~~~~~~~~~~gd~~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~------~v~~~~a 290 (298) ++.+.. +..-.++.|.+ +.+.+.-+++.+++..+. |..+-+.+|.+..+|. +...+.. T Consensus 228 ~p~L~s----~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~----------~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~ 293 (302) T protein:vir:10 228 DGRIES----DTAWFLLDTTKPVKPFIFQPRKQPEFVSQVN----------LDSDDVFNLRKLKFGAEARAAAGYGFWQL 293 (302) T ss_pred eeccCC----CCceEEEecCCccceEEEcCccccEEEeccC----------CCCCceEEEEEEEEeeeeeeecchhhhhh Confidence 666532 22344444433 123344445555554433 3334455555555443 2222222 Q ss_pred eEEEeecC Q lcl|Aclame:pro 291 FARVTEAN 298 (298) Q Consensus 291 ~~~l~~a~ 298 (298) ...-+++- T Consensus 294 a~~s~g~~ 301 (302) T protein:vir:10 294 AYGSTGTG 301 (302) T ss_pred hhccCccC Confidence 22223322 No 214 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=95.10 E-value=0.0028 Score=34.60 Aligned_cols=268 Identities=10% Similarity=0.026 Sum_probs=130.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcc--eeecCCCceEEEEEeCCcc-eEEeeccccccccccceeeEEEeeeE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSA--QKPIPFNGEKVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTMVPIK 77 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~--~~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~~~~~v~l~~~k 77 (298) ||.+-. +.+++.+.+.++..+....+.. ..-.++..++||+.+..+- .+-.+.|-....-+.++...++.-.+ T Consensus 1 Main~a----~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl~qdR 76 (290) T protein:vir:78 1 MAINYV----DKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTIDFDR 76 (290) T ss_pred CchhHH----HHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEeeccc Confidence 887653 4566677777766665444443 3334556799999875332 23333433344445566666776666 Q ss_pred EEEE-EeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccchh Q lcl|Aclame:pro 78 VEYG-ARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) Q Consensus 78 ~~~~-~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) --.+ +.--+. +.......+...+.+...+.++-.+|...+.-.-...+ ..+ .....+.+.+.. T Consensus 77 ~~~F~vD~~Dv---DEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~-~~~------------~~~~~t~t~~n~ 140 (290) T protein:vir:78 77 DVEFFVDVMDV---DETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAK-TNS------------NSVAEEITKDNV 140 (290) T ss_pred cceeeccccch---hHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhh-ccC------------cccccccCHHHH Confidence 3332 211110 00011233445556666666777777665531100000 000 000111233467 Q ss_pred HHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCcee---ecccccccCcceecceeeEecC---ccc------- Q lcl|Aclame:pro 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNAL---FPELKWGATPDTINGLPVDVNK---TVS------- 223 (298) Q Consensus 157 ~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l---~~~~~~~~~~~~l~G~PV~~s~---~~~------- 223 (298) ++.|.++..++......+-.++|+|.++..|.+.+.=....- +......+..++|.|++|+..+ .+. T Consensus 141 ~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~ 220 (290) T protein:vir:78 141 FTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTD 220 (290) T ss_pred HHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcc Confidence 888888888886654444468999999998865432111110 1111224556889999987532 111 Q ss_pred ---cccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhc-CcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 224 ---DMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY-NQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 224 ---~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~-n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+..+--.++..-+ +. +.+..--.+.+++ .+.+ +. +...+.-+.+.|.=|.+.+.=.+...+. T Consensus 221 G~~~~~~ak~in~ii~~~~-a~-i~~~K~~~~~~~~-P~~~-------~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~ 289 (290) T protein:vir:78 221 GYKPAAGAKKLNFLLVNKG-SV-VGGAKHASIYLHA-PGSV-------GQGDGWLYQYRVYHDIFVLDQQKDGVIASTE 289 (290) T ss_pred cccccCCccceeEEEEcCC-ce-eeeeeeeEEEeeC-CCCC-------cCcceeeeeeeeeeeeeeeccccCeeEEEee Confidence 1111111123333322 22 2222222333332 1111 12 2244555567777777666555555555 No 215 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=289 Identities=12% Similarity=0.002 Sum_probs=125.5 Q ss_pred Cee-ccccccchhHHHHHHHHHHhhc--hhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVL-NKGTLFDPELVTDLISKVAGKS--SIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat-~gg~lip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) -|. .|+.|--+.+-.++-.+..... .+.+-....|..+.--++-.+.. ........|++-.+.+++++.+.... T Consensus 19 ~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~ 98 (470) T protein:vir:10 19 AAGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRIR 98 (470) T ss_pred HhhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhccccccccceeecccccCccCCCceEEEEEE Confidence 111 1223212222222222211111 12222233344433223322222 12233468999999999999999999 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------cccccccccccccccc-cccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL------GTASAVIGTNHFDSKV-TQKV 147 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~------g~~~~~~~~~~~~~~~-~~~~ 147 (298) .|=++.-..+|.-.++.......++++.+.++---.+++.+|.++|+|+..-+ .....+.|+.++.... ...+ T Consensus 99 ~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NV 178 (470) T protein:vir:10 99 PMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNV 178 (470) T ss_pred EEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhccCCCCccc Confidence 99898888888664311112234677778787788899999999999965322 1222344444433321 1112 Q ss_pred ccccccchhHHHHHHHhhhhh--hcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeE--ecCccc Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLT--GVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVD--VNKTVS 223 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~--~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~--~s~~~~ 223 (298) --........+.|..+...+. .++..++-++|+..+.+.|..--...-|.+.+...... ..|+||- ++.. T Consensus 179 iDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~~~N~~~~----~~G~~v~~f~sa~-- 252 (470) T protein:vir:10 179 LDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMTTADRRAG----LLGADAQSYIGVR-- 252 (470) T ss_pred cccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEEecCCCce----eeeeeccceeeee-- Confidence 222223334566777766663 46778889999999999998776666666554332211 1333331 0000 Q ss_pred cccccccceEEEeeccceEEEEeec--------ceEEEEeecc----c-ccccchhhhhc--CcEEEEEEEEEccEEecc Q lcl|Aclame:pro 224 DMSLTQRDRAIIGDFANGFKWGYAK--------EVPLEVIQYG----D-PDNSGLDLKGY--NQVYIRAELFLGWGILDA 288 (298) Q Consensus 224 ~~~~~~~~~~~~gd~~~~~~~~~~~--------~~~i~~~~~~----~-~~~~~~~~f~~--n~v~~r~~~r~~~~v~~~ 288 (298) ...... ...++.++.+..-...++ .++..++... . .+....++-.. +.-++++..+.|=. ++ T Consensus 253 G~I~L~-~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds--~s 329 (470) T protein:vir:10 253 GEHSLY-PSQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGES--AA 329 (470) T ss_pred eeeeec-ccccccchhhcCcccCCcccCCcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCC--Cc Confidence 000000 000111110000000000 0011111000 0 00000000000 11223333333222 22 Q ss_pred cceEE-EeecC Q lcl|Aclame:pro 289 TKFAR-VTEAN 298 (298) Q Consensus 289 ~a~~~-l~~a~ 298 (298) .++-. .+..+ T Consensus 330 ~~v~vt~t~~~ 340 (470) T protein:vir:10 330 KYIDVYIDSTE 340 (470) T ss_pred ceEEEEEeeeh Confidence 22211 10000 No 216 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=91.58 E-value=0.015 Score=30.56 Aligned_cols=275 Identities=11% Similarity=0.005 Sum_probs=134.6 Q ss_pred Cee----------ccccccchhHHHHHHHHHHhh--chhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccc Q lcl|Aclame:pro 1 MVL----------NKGTLFDPELVTDLISKVAGK--SSIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGG 65 (298) Q Consensus 1 mat----------~gg~lip~~~~~~ii~~~~~~--s~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~ 65 (298) |.+ ++|.|--+.+..+|-.+.... -.+.+-..+.|..+.--++-.... .+.+.+++|+...+.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:95 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 222 234554455555554443322 233344445555554334444332 25678999999999999 Q ss_pred cceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccccccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL----GTASAVIGTNHFDS 141 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~----g~~~~~~~~~~~~~ 141 (298) +++.+.....|=++....+|.-+-++ ....+.++.+.+.-.-.++..+|.++|+|+..-+ |..-.+.|+.+. T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~--n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~l-- 181 (463) T protein:vir:95 106 PNIRQKTVSMKYVSDTKNMSIASGLV--NNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKL-- 181 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhh--cccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhh-- Confidence 99999999999888887777765322 2335677888888888999999999999964322 122233333333 Q ss_pred cccccccc-ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccccc-------------- Q lcl|Aclame:pro 142 KVTQKVEA-PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA-------------- 206 (298) Q Consensus 142 ~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~-------------- 206 (298) ....... ........+.|..+-..+..++..++-++|+..+.+.|..---..-|.+.++...+. T Consensus 182 -Id~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G 260 (463) T protein:vir:95 182 -IDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) T ss_pred -cCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeee Confidence 2222222 222334456677777777778888999999999999987443222233322211100 Q ss_pred ----CcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEc Q lcl|Aclame:pro 207 ----TPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) Q Consensus 207 ----~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~ 282 (298) .+.++++.|-......+. ..+++... .++..+.. +.+...-+--......|++...-+ T Consensus 261 ~I~L~~s~~m~~~~il~~~~~~---------~p~ap~~~-------~~tatv~~--~~~~~~~~~~~~a~~~Y~vv~~s~ 322 (463) T protein:vir:95 261 FIKLHGSTVMENELILDESLQP---------LPNAPQPA-------KVTATVET--KQKGAFENEEDRAGLSYKVVVNSD 322 (463) T ss_pred eeeeCCceecCCcccccchhhc---------CCCCccCc-------eeEEEEee--ccCCCCCCcccccceEEEEEEECC Confidence 011222222222111100 00000000 00111110 000000000001111223322222 Q ss_pred cEEecccceEEEeecC Q lcl|Aclame:pro 283 WGILDATKFARVTEAN 298 (298) Q Consensus 283 ~~v~~~~a~~~l~~a~ 298 (298) ..=-.|+.++-.+.|. T Consensus 323 ~geS~pS~ivtaT~a~ 338 (463) T protein:vir:95 323 DAQSAPSEEVTATVSN 338 (463) T ss_pred CCCcccchheeeeeee Confidence 2222333333333221 No 217 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=91.58 E-value=0.015 Score=30.56 Aligned_cols=275 Identities=11% Similarity=0.005 Sum_probs=134.6 Q ss_pred Cee----------ccccccchhHHHHHHHHHHhh--chhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccc Q lcl|Aclame:pro 1 MVL----------NKGTLFDPELVTDLISKVAGK--SSIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGG 65 (298) Q Consensus 1 mat----------~gg~lip~~~~~~ii~~~~~~--s~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~ 65 (298) |.+ ++|.|--+.+..+|-.+.... -.+.+-..+.|..+.--++-.... .+.+.+++|+...+.++ T Consensus 26 ~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 105 (463) T protein:vir:99 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) T ss_pred hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCC Confidence 222 234554455555554443322 233344445555554334444332 25678999999999999 Q ss_pred cceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccccccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL----GTASAVIGTNHFDS 141 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~----g~~~~~~~~~~~~~ 141 (298) +++.+.....|=++....+|.-+-++ ....+.++.+.+.-.-.++..+|.++|+|+..-+ |..-.+.|+.+. T Consensus 106 ~~~~Rr~~~~K~l~~~~~VS~~~~l~--n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~l-- 181 (463) T protein:vir:99 106 PNIRQKTVSMKYVSDTKNMSIASGLV--NNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKL-- 181 (463) T ss_pred CceEEEEEEeeeeehhhhhhhHHHhh--cccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhh-- Confidence 99999999999888887777765322 2335677888888888999999999999964322 122233333333 Q ss_pred cccccccc-ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccccc-------------- Q lcl|Aclame:pro 142 KVTQKVEA-PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA-------------- 206 (298) Q Consensus 142 ~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~-------------- 206 (298) ....... ........+.|..+-..+..++..++-++|+..+.+.|..---..-|.+.++...+. T Consensus 182 -Id~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G 260 (463) T protein:vir:99 182 -IDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) T ss_pred -cCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeee Confidence 2222222 222334456677777777778888999999999999987443222233322211100 Q ss_pred ----CcceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEc Q lcl|Aclame:pro 207 ----TPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) Q Consensus 207 ----~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~ 282 (298) .+.++++.|-......+. ..+++... .++..+.. +.+...-+--......|++...-+ T Consensus 261 ~I~L~~s~~m~~~~il~~~~~~---------~p~ap~~~-------~~tatv~~--~~~~~~~~~~~~a~~~Y~vv~~s~ 322 (463) T protein:vir:99 261 FIKLHGSTVMENELILDESLQP---------LPNAPQPA-------KVTATVET--KQKGAFENEEDRAGLSYKVVVNSD 322 (463) T ss_pred eeeeCCceecCCcccccchhhc---------CCCCccCc-------eeEEEEee--ccCCCCCCcccccceEEEEEEECC Confidence 011222222222111100 00000000 00111110 000000000001111223322222 Q ss_pred cEEecccceEEEeecC Q lcl|Aclame:pro 283 WGILDATKFARVTEAN 298 (298) Q Consensus 283 ~~v~~~~a~~~l~~a~ 298 (298) ..=-.|+.++-.+.|. T Consensus 323 ~geS~pS~ivtaT~a~ 338 (463) T protein:vir:99 323 DAQSAPSEEVTATVSN 338 (463) T ss_pred CCCcccchheeeeeee Confidence 2222333333333221 No 218 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=91.33 E-value=0.016 Score=30.38 Aligned_cols=270 Identities=10% Similarity=0.027 Sum_probs=119.4 Q ss_pred ecccccc-chhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceE-----Eeeccccccccccceee--EEEe Q lcl|Aclame:pro 3 LNKGTLF-DPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEID-----VVAESGKKTHGGVTLAP--QTMV 74 (298) Q Consensus 3 t~gg~li-p~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~-----~v~E~~~~~~~~~~~~~--v~l~ 74 (298) -+.+-.+ .|.+-+=-+.+-.+..+-..+++..|++....+||+... .++. -++-++.....+++... +.+. T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~-~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~ 79 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDL-AQGFTVPETLVGRKSKPNEVEFSATDETGSTE 79 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeech-hhcccccchhhccCCCcceEeecccCceeeec Confidence 3334333 343333223332233334567888899877788887642 1211 12333333333333333 3444 Q ss_pred eeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIA 154 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 154 (298) .|-|. .+|..+-.++. +...+..+.-.+.+.+.|.+..|..+-.-..... .-..+... .. +.+..+....+ T Consensus 80 ~~~L~--~~i~~~~~~~a-~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a---~y~~~~k~--~L-sgt~~wsd~~S 150 (309) T protein:vir:99 80 DHGLD--APVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPN---SYAAGNKT--TL-SGADQWSDPTS 150 (309) T ss_pred cccee--ecCCchhhhhc-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh---hcCCCceE--Ee-cCccccCCCCC Confidence 44444 45555533322 2234556666666666666555543322110000 01111111 11 12233556778 Q ss_pred hhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHH---h----hccCCceeecccccccCcceecce-eeEecCcccccc Q lcl|Aclame:pro 155 DPNGAIENAVELLTGVDADVTGIAINPSFRSALAK---Q----KDLQGNALFPELKWGATPDTINGL-PVDVNKTVSDMS 226 (298) Q Consensus 155 ~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~---l----kd~~G~~l~~~~~~~~~~~~l~G~-PV~~s~~~~~~~ 226 (298) ++..+|.++..++ +..|+..+|..+.|.+|++ + |-+.+..- ..+...-..++|+ -|++.+..-..+ T Consensus 151 DPi~~i~~~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g---~it~~~la~l~~ve~V~vg~a~~n~a 224 (309) T protein:vir:99 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEARLNIA 224 (309) T ss_pred CcHHHHHHHHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCcccc---ccCHHHHHHHhCcceEEeecceeecc Confidence 8889999988765 7789999999999988764 2 22222110 0111111235554 454433221111 Q ss_pred c--ccc----------ceEEEeec-------cceE--EEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE Q lcl|Aclame:pro 227 L--TQR----------DRAIIGDF-------ANGF--KWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI 285 (298) Q Consensus 227 ~--~~~----------~~~~~gd~-------~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v 285 (298) . ... ..++++.. +.++ .|+.|..-++. +++.. ..+--.+|+.....-.+ T Consensus 225 ~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~-d~~~~---------~~g~~~vr~~~~~k~~i 294 (309) T protein:vir:99 225 RPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA-DPNIG---------LRGGQRVRVGESVKELV 294 (309) T ss_pred ccccccccccccCCcEEEEEcCCCCCCcccccccceeecccccCCcee-eeeec---------cCCceEEEEeccccchh Confidence 0 000 00111110 0011 11112111111 11111 12223466666666666 Q ss_pred ecccceEEEeecC Q lcl|Aclame:pro 286 LDATKFARVTEAN 298 (298) Q Consensus 286 ~~~~a~~~l~~a~ 298 (298) .-+++=..+++|- T Consensus 295 ~~~d~G~li~~~v 307 (309) T protein:vir:99 295 TAPDLGFFFENAV 307 (309) T ss_pred cchhcchhhhhcc Confidence 6666666777766 No 219 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=91.10 E-value=0.017 Score=30.22 Aligned_cols=275 Identities=10% Similarity=-0.013 Sum_probs=109.1 Q ss_pred Ceecccccc-----chhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcc-eEEeeccccccccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLF-----DPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~li-----p~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~~~~~v~l~ 74 (298) -|..+.+.. +..... ................ ..........+. ...-.++..+++-..++++++.. T Consensus 217 ~Al~gEA~t~~sTd~at~~~-------Gtt~t~~~~~lyt~~~-g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVt 288 (523) T protein:vir:59 217 SALYARLFFVTGSDFATVAG-------GTPSTQDLDLVYYIDA-RNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVA 288 (523) T ss_pred ccccccccccccccccccCC-------Cccccccccccccccc-ccchhhccccccccccccccccccceeeEEEeEEEe Confidence 111111000 000000 0000000000000000 001101111111 11124566788888889998888 Q ss_pred eeEEEEEEeecHHHhhcccc--cHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--cccccccccccccccccccccc- Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDE--EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL--GTASAVIGTNHFDSKVTQKVEA- 149 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d--~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~--g~~~~~~~~~~~~~~~~~~~~~- 149 (298) +|.=+=...+|-||.+|--. ...|.+++|..-|+-.|...+++-++.-...-+ +...+. ...++.......... T Consensus 289 AkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~-~~~g~~~~~~~~~~~~ 367 (523) T protein:vir:59 289 TKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGF-WSEVVGEYYDETSGNF 367 (523) T ss_pred eecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccc-cccceeeecccccchh Confidence 87766677889998765332 124455566666666666666665554321100 100110 011111111111000 Q ss_pred --cc-------ccchhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhhccCCceeeccccccc-Ccceec-ceee Q lcl|Aclame:pro 150 --PR-------GIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA-TPDTIN-GLPV 216 (298) Q Consensus 150 --~~-------~~~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~-~~~~l~-G~PV 216 (298) +. ....++-.|....+.+... +...+-++|+++....|...--=+++.--....++. ..|.|. |++| T Consensus 368 ~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~v 447 (523) T protein:vir:59 368 VAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRL 447 (523) T ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEE Confidence 00 0011222233333333332 235667999999999886432111111101111111 235566 4699 Q ss_pred EecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccc-----cchhhhhcCcEEEEEEEEEccEEecccce Q lcl|Aclame:pro 217 DVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDN-----SGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) Q Consensus 217 ~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~-----~~~~~f~~n~v~~r~~~r~~~~v~~~~a~ 291 (298) +++++.+. +.+++|--. .. +.. +-.+-..||....- ... -||- .+.|+ .|+++.|.+|-+. T Consensus 448 y~d~~~~~------dy~~~g~k~-~~--~~~-~~~~~y~Py~~l~~~~~~~dp~-s~qp-~~~~~--tRY~l~v~nP~~~ 513 (523) T protein:vir:59 448 YKNIYQNQ------PVIIMGNQD-LN--TPW-QTGAVYAPYVPLLFTPTIVDPV-NFSY-RRGLM--TRYALEVVRPEFY 513 (523) T ss_pred EecCCCCc------ceEEEEecc-cC--Ccc-cccceecccchhhcccccccCC-cccc-eeeee--eehhheecchhHh Confidence 99988653 445544211 00 000 01122223321100 001 1332 23333 5899888888766 Q ss_pred EEEeecC Q lcl|Aclame:pro 292 ARVTEAN 298 (298) Q Consensus 292 ~~l~~a~ 298 (298) .+|-.-- T Consensus 514 ~~~~~~~ 520 (523) T protein:vir:59 514 GLLYVKL 520 (523) T ss_pred hhhhhhh Confidence 5553222 No 220 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=287 Identities=11% Similarity=0.059 Sum_probs=133.7 Q ss_pred CeeccccccchhHHHHHHHHHHhhc--hhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccccceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKS--SIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~~~~v~l~~ 75 (298) =.+++|.|--+.+..+|-.+..... .+.+-..+.|..+.--++-.... .+...+++|+...+.+++++.+..... T Consensus 36 ~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~ 115 (462) T protein:vir:96 36 TQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEM 115 (462) T ss_pred cccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEE Confidence 2234455555555555544433322 23344445555554334443332 356789999999999999999999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccccccccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL----GTASAVIGTNHFDSKVTQKVEAPR 151 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~----g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) |=++.-..+|...-+++. ..+.++...++-.-.++..+|.++|+|+..-+ |....+.|+...... ..+.-.. T Consensus 116 k~l~~t~~vsi~~tl~n~--~~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~--~NViDar 191 (462) T protein:vir:96 116 KYVSDTKNLSIASTLVNN--IQDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDK--DNVIDAK 191 (462) T ss_pred EEEeeeeeechhhhhccc--hhhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCC--CceeecC Confidence 999988888876532222 35566888888888899999999999964322 222333343222211 1111222 Q ss_pred ccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeeccccc-------------cc-----Ccceecc Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKW-------------GA-----TPDTING 213 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~-------------~~-----~~~~l~G 213 (298) ......+.|..+-..+..++..++-++|+..+.+.|..---..-|.+.++... .. .+.++++ T Consensus 192 G~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~ 271 (462) T protein:vir:96 192 GESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVME 271 (462) T ss_pred CCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecC Confidence 23344566666666777778889999999999999874432222222222111 00 0122333 Q ss_pred eeeEecCcc------ccccccccc-----eEEEeeccceEEEEeecceEEEE---eecccccccch-h---hhhcCcEEE Q lcl|Aclame:pro 214 LPVDVNKTV------SDMSLTQRD-----RAIIGDFANGFKWGYAKEVPLEV---IQYGDPDNSGL-D---LKGYNQVYI 275 (298) Q Consensus 214 ~PV~~s~~~------~~~~~~~~~-----~~~~gd~~~~~~~~~~~~~~i~~---~~~~~~~~~~~-~---~f~~n~v~~ 275 (298) .|-.....+ |...+.... ...|+|-.+ ..++++++ +.++.+-.+.+ . .--.+.+.+ T Consensus 272 ~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~~~~d------~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~l 345 (462) T protein:vir:96 272 NELILDESLQPLPNAPQPATVKATVETGKKGLFTDEHD------RAELTYKVVVNSDDAQSAPSEAVTATVNNATDGVKL 345 (462) T ss_pred cccccccccccCCCCCCCCceeEEEEeCCCCCCCCccC------ceeEEEEEEEECCCCccccceeeEeeeecccccceE Confidence 333332221 111110000 001111110 11111111 11111100000 0 000000100 Q ss_pred EEEEEEccEEecccceEEEeec-C Q lcl|Aclame:pro 276 RAELFLGWGILDATKFARVTEA-N 298 (298) Q Consensus 276 r~~~r~~~~v~~~~a~~~l~~a-~ 298 (298) .... -...-..|+.+++-+.. + T Consensus 346 tIt~-~a~~~~~~~~~~IYRk~~~ 368 (462) T protein:vir:96 346 EISV-NAMYQQQPQFVSIYRQGRK 368 (462) T ss_pred EEEE-cCCccccceEEEEEeecCC Confidence 0000 00001112222222111 1 No 221 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=89.72 E-value=0.025 Score=29.39 Aligned_cols=264 Identities=12% Similarity=0.075 Sum_probs=120.3 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcc------eeecCCCceEEEEEeC--CcceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSA------QKPIPFNGEKVFTFTM--DSEIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~------~~~~~~~~~~ip~~~~--~~~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||.. .-+.....+.+.....+....+.. +...+++.++||+.++ +-..+-.+.|-...+.+.+++..+ T Consensus 1 Main----~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~t 76 (285) T protein:vir:79 1 MTVV----LDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVK 76 (285) T ss_pred Ccch----hhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEE Confidence 8766 234456667776666655554432 3445566899999853 333333444434444455556666 Q ss_pred EeeeEEEE-EEeecHHHhhcccccHHHHHHHHHHHHHHH-HHHHHHHHHhcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKVEY-GARISDEFMYASDEEKINILQAFNDGFAKK-VARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 73 l~~~k~~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~~-i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) |.-.+--. .+.--+ .++...--.+.+..++.+. ..=.+|...|.-.-...+ .....+ T Consensus 77 l~~DR~~~f~iD~mD-----vdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~----------------~~~~~~ 135 (285) T protein:vir:79 77 LTHEDWFGYDLDQFD-----MDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA----------------KKATDS 135 (285) T ss_pred eeccccceecccccc-----hhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc----------------cccccc Confidence 65554222 121111 1111111223333333332 233455443322100000 000111 Q ss_pred cccchhHHHHHHHhhhhhhcCCcc-cEEEEcHHHHHHHHHhhccCCceee-ccc---ccccCcceecc-eeeEe--cCcc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADV-TGIAINPSFRSALAKQKDLQGNALF-PEL---KWGATPDTING-LPVDV--NKTV 222 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~-~~~vm~~~~~~~L~~lkd~~G~~l~-~~~---~~~~~~~~l~G-~PV~~--s~~~ 222 (298) -+.+..++.|.+++.++...+... -.++|+|.++..|++.+.=....-. +.. .-....++|.| .|++. ++.| T Consensus 136 ~T~~nv~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~ 215 (285) T protein:vir:79 136 ITKDNALDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRL 215 (285) T ss_pred cCHHHHHHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhc Confidence 234567899999999998887643 3578999999988766531111111 111 11234578998 89975 4555 Q ss_pred ccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecc--cceEEEeecC Q lcl|Aclame:pro 223 SDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDA--TKFARVTEAN 298 (298) Q Consensus 223 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~--~a~~~l~~a~ 298 (298) +......+--.++..-+ +. +.+...-.+.+++ ...++. -|...+.-+.+.|.=|.+. +++..-.+|- T Consensus 216 kt~~~~k~Infiiv~~~-a~-i~~~K~~~~~~f~-P~~~~~------~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 216 KGLGITNHVNFILTPLS-AI-APIVKYDSVSVID-PSTDRS------GNRWTIKGLSYYDAIVLDNAKKGIYVAATAG 284 (285) T ss_pred cCcCcchhccEEEecCc-ee-ccceeeeeeEeEC-CCCCCC------cceeeeeeeeeeeeeehhhccceeeeeeccc Confidence 43222222222333222 22 2222222222221 111111 1223334445566655543 3444444444 No 222 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=88.40 E-value=0.0096 Score=31.65 Aligned_cols=270 Identities=13% Similarity=0.069 Sum_probs=117.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) -.|+.-..+|+.++--|-..+..+.++++..-+...+.-.++..- .+...+...-.|+.+++...+|.--++.+--++. T Consensus 41 tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~-~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~ 119 (318) T protein:vir:86 41 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYK 119 (318) T ss_pred eeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhhhh-hhhhhhhhhccCCccccceeeeeeechhHHHHHH Confidence 234445567777766666666677777665444333321112111 2235566677888888888888877777644333 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHHhccccccc-ccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVA-RGIDLMAFHGVNPRL-GTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~-~~~d~~~l~G~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ...+ -|+.++.--+...+..++..+|+.+|. +.+|.++.-|+|..+ ...........+...++..-+ .....+.. T Consensus 120 ~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaks--agttpfan 196 (318) T protein:vir:86 120 LQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKS--AGTTPFAN 196 (318) T ss_pred HHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhc--cCCCchhh Confidence 3233 334444445556678899999999987 899999998865322 111111111111111111111 11222334 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHH-HHHHHHhhccCCce--eecccccccCcceecce---eeEecCccccccccccce Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSF-RSALAKQKDLQGNA--LFPELKWGATPDTINGL---PVDVNKTVSDMSLTQRDR 232 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~-~~~L~~lkd~~G~~--l~~~~~~~~~~~~l~G~---PV~~s~~~~~~~~~~~~~ 232 (298) .|..++.-+.+-..+.- ++..... .+.|..|+-+..+. -...+.+. ..+--|+ -|+. +...-+.+ T Consensus 197 aieeavdfvrptagrry-livkaedrkalldelrqatanahvriknddte--iasevgvdeiivyt------gskalkpt 267 (318) T protein:vir:86 197 AIEEAVDFVRPTAGRRY-LIVKAEDRKALLDELRQATANAHVRIKNDDTE--IASEVGVDEIIVYT------GSKALKPT 267 (318) T ss_pred HHHHHHhhhccCCCceE-EEEeecchHHHHHHHHhhcccceeEEeccchh--hhhhcCcceeeeee------ccccccce Confidence 56666554443322222 2333333 33345555322221 11111110 0000111 1110 00001111 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) ++ .| +.+.+.|.+-...|. --|.+|.-.+..+....+-+.--+|=++++.. T Consensus 268 vl-vd----------qkyhidmqdltkvda---fewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 268 VL-VD----------QKYHIDMQDLTKVDA---FEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ee-ec----------cceecchhhhhhhhc---ceeccCCceEEEeecccCcceeecCceeEEeC Confidence 11 12 112222222111110 01444444444444444444433333333333 No 223 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=87.99 E-value=0.007 Score=32.39 Aligned_cols=268 Identities=14% Similarity=0.077 Sum_probs=118.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) -.|+.-..+|..++--|-..+..+.++++..-+..++.-.++.. +.+...|...-.|+.+++...+|.--++.+--++. T Consensus 123 tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~ 201 (400) T protein:vir:93 123 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYK 201 (400) T ss_pred ceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhhhhhhhccCCccccceeeeeeechhHHHHHH Confidence 23444556677766666666667777766544333321111111 12334566677788888888888877777644333 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHHhccccccc-ccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVA-RGIDLMAFHGVNPRL-GTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~-~~~d~~~l~G~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ...+ -|+.++.--+...+..++...|+.+|. +.+|.++.-|+|..+ ...........+...++..- ......+.+ T Consensus 202 ~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkak--sagktpfad 278 (400) T protein:vir:93 202 LQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAK--SAGKTPFAD 278 (400) T ss_pred HHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhh--hcCCCchhH Confidence 3233 334444445556678899999999987 899999998854222 00001111111111111111 112223445 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCce---eecccccccCcceecce---eeEecCccccccccccce Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNA---LFPELKWGATPDTINGL---PVDVNKTVSDMSLTQRDR 232 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~---l~~~~~~~~~~~~l~G~---PV~~s~~~~~~~~~~~~~ 232 (298) .|..++.-+.+-..+.--++-.....+.|..|+-+..+. +-.++.. ..+--|+ -|+. +...-+.+ T Consensus 279 aieeavdfvrptagrrylivktedrkalldelrqatanahvriknddae---iasevgvdeiivyt------gskalkpt 349 (400) T protein:vir:93 279 AIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAHVRIKNDDAE---IASEVGVDEIIVYT------GSKALKPT 349 (400) T ss_pred HHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhccccceEeecchhh---hhhhcCcceeeeee------ccccccce Confidence 677776655443333222333333334455555332221 1111110 0011111 1111 00001111 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhh--hhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDL--KGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) ++ .|- .+.+.|.+-.. ++. |.+|.-.+..+....+-+.-.+|-++++.. T Consensus 350 vl-vdq----------kyhidmqdltk-----vdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 350 VL-VDQ----------KYHIDMQDLTK-----VDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ee-ecc----------ccccchhhhhh-----hhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 11 121 11222222111 111 444444444444444444433333333333 No 224 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=269 Identities=16% Similarity=0.094 Sum_probs=122.0 Q ss_pred Ceeccc-cccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEee--c---ccccccccc-cee--eE Q lcl|Aclame:pro 1 MVLNKG-TLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVA--E---SGKKTHGGV-TLA--PQ 71 (298) Q Consensus 1 mat~gg-~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~--E---~~~~~~~~~-~~~--~v 71 (298) |.+-.. .++.|.+-+=.+.+-.+..+-..+++..|+.....+|++... ++--+. + +........ .++ .. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~--e~f~~~~t~ra~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGK--ESFRLYQTERALRAKSNRMNPEDIDSVDV 78 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCCcccccccccceeeecc--ccccccccccccCCCcceeeeeccccccc Confidence 666655 445555544334333333333457788888877778877632 111111 1 111111111 112 22 Q ss_pred EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccccccccccccccccccccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV-NPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~-~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) .+..|-+.. ++..+ ....+..++.+.-.+.+.+.|.+..|..+-.-. +..+. ..+ +.... +.+..+. T Consensus 79 ~~~~~~l~~--~id~r---~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y----~~~--~k~tL-sgt~~Ws 146 (307) T protein:vir:79 79 NLDEHDLEY--PIDYR---EDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSY----AAG--NKKQL-SATEKFT 146 (307) T ss_pred cccccchhh--cccch---hcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccccc----CCC--ceEEE-ccCcccC Confidence 333333332 22222 112223344455455555555555543332211 00111 011 11111 1233466 Q ss_pred cccchhHHHHHHHhhhhhhc-CCcccEEEEcHHHHHHHHH---hh---ccCCceeecccccccCcceeccee-eEecCcc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGV-DADVTGIAINPSFRSALAK---QK---DLQGNALFPELKWGATPDTINGLP-VDVNKTV 222 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~-~~~~~~~vm~~~~~~~L~~---lk---d~~G~~l~~~~~~~~~~~~l~G~P-V~~s~~~ 222 (298) ...+++..+|.+...++... +..|+.++|.++.|.+|++ ++ +..+.-+..+ ..-..++|+. |.+-+.. T Consensus 147 d~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~----~~la~l~~v~~V~vg~a~ 222 (307) T protein:vir:79 147 AANSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTV----DLLKEIFEVENIAVGEAI 222 (307) T ss_pred CCCCCcHHHHHHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCccccccCH----HHHHHHhCceeEEEeeee Confidence 67788999999998888754 6789999999999988764 21 1222212111 1112355543 4332222 Q ss_pred cccccc------ccceEEEe-------------eccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc Q lcl|Aclame:pro 223 SDMSLT------QRDRAIIG-------------DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) Q Consensus 223 ~~~~~~------~~~~~~~g-------------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~ 283 (298) -..... +.+.++.. ..+.+|.+. +++..+ ++.+ ........+|+.....- T Consensus 223 y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~-~~g~~~-~d~~---------~~~~~~~~vrv~~~~~~ 291 (307) T protein:vir:79 223 YADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLR-KKGNPV-VDTR---------IEDGKLELVRATDIFRP 291 (307) T ss_pred eecccccchhcCCCceEEEecccccCCCCCcccccccceeEE-ecCceE-Eecc---------cCCCceeEEeecccccc Confidence 111110 01111110 112233221 122211 1111 11223344677777777 Q ss_pred EEecccceEEEeecC Q lcl|Aclame:pro 284 GILDATKFARVTEAN 298 (298) Q Consensus 284 ~v~~~~a~~~l~~a~ 298 (298) .+.-+++=..|++|- T Consensus 292 ~i~~~~~G~li~~~v 306 (307) T protein:vir:79 292 YLLGADAGYLISGIN 306 (307) T ss_pred eeeccccchhhccCC Confidence 888888888888888 No 225 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=86.52 E-value=0.012 Score=31.14 Aligned_cols=268 Identities=13% Similarity=0.072 Sum_probs=117.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeccccccccccceeeEEEeeeEEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~~~v~l~~~k~~~ 80 (298) -.|+.-..+|+.++--|-..+..+.++++..-+...+.-.++.. +.+...|...-.|+.+++...+|.--++.+--++. T Consensus 116 tiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s-~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~ 194 (393) T protein:vir:16 116 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYK 194 (393) T ss_pred ceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhh-hhhhhhhhhhccCCccccceeeeeeechhHHHHHH Confidence 23444556677766666566667777766544333321111111 12234566677788888888888877776644333 Q ss_pred EEeecHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHHhccccccc-ccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 81 GARISDEFMYASDEEKINILQAFNDGFAKKVA-RGIDLMAFHGVNPRL-GTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 81 ~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~-~~~d~~~l~G~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ...+ -|+.++.--+...+..++...|+.+|. +.+|.++.-|+|..+ ...........+...++..- ......+.+ T Consensus 195 ~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkak--sagktpfad 271 (393) T protein:vir:16 195 LQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAK--SAGKTPFAD 271 (393) T ss_pred HHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhh--hcCCCchhH Confidence 2223 334444445556678899999999987 899999998854222 00011111111111111111 112223445 Q ss_pred HHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCC--c-eeecccccccCcceecce---eeEecCccccccccccce Q lcl|Aclame:pro 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQG--N-ALFPELKWGATPDTINGL---PVDVNKTVSDMSLTQRDR 232 (298) Q Consensus 159 ~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G--~-~l~~~~~~~~~~~~l~G~---PV~~s~~~~~~~~~~~~~ 232 (298) .|..++.-+.+-..+.--++-.....+.|..|+-+.. + -+-.++..- .+--|+ -|+. +...-+.+ T Consensus 272 aieeavdfvrptagrrylivktedrkalldelrqatananvriknddtei---asevgvdeiivyt------gskalkpt 342 (393) T protein:vir:16 272 AIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEI---ASEVGVDEIIVYT------GSKALKPT 342 (393) T ss_pred HHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchhh---hhhcCcceeeeee------ccccccce Confidence 6777766554433332223333333344455542221 1 111111100 001111 1111 00001111 Q ss_pred EEEeeccceEEEEeecceEEEEeecccccccchhh--hhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 233 AIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDL--KGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 233 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) ++ .|- .+.+.|.+-.. ++. |.+|.-.+..+....+-+.-.+|-++++.. T Consensus 343 vl-vdq----------kyhidmqdltk-----vdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 343 VL-VDQ----------KYHIDMQDLTK-----VDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred ee-ecc----------ccccchhhhhh-----hhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 11 121 11222222111 112 444444444444444444433333333333 No 226 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=86.30 E-value=0.047 Score=27.89 Aligned_cols=275 Identities=12% Similarity=0.016 Sum_probs=117.9 Q ss_pred Ceecc-ccc-cchhHHHHHHHHHHhhchh-----hhhcceeecCCCceEEEEEeCCcceEEeeccc-------ccccccc Q lcl|Aclame:pro 1 MVLNK-GTL-FDPELVTDLISKVAGKSSI-----ARLSAQKPIPFNGEKVFTFTMDSEIDVVAESG-------KKTHGGV 66 (298) Q Consensus 1 mat~g-g~l-ip~~~~~~ii~~~~~~s~i-----~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~-------~~~~~~~ 66 (298) .++.+ ..+ ++.. +. +.+.+.+ ....++..+.+..+++-|..++..+.-+++|. .++|..- T Consensus 71 ~a~a~~T~l~ve~~---~~---f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd 144 (418) T protein:vir:10 71 EAAADATVLTVENS---DG---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQ 144 (418) T ss_pred EEecCceEEEEcCc---ce---eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccccc Confidence 11111 111 1111 11 2223322 12334445556667777765544333332222 2333322 Q ss_pred ceeeEEEeeeEEEE-------EEeecHHHhhcccc-cHHH-HHHHHHHHHHHHHHHHHHHHHhcccccccccccc-cccc Q lcl|Aclame:pro 67 TLAPQTMVPIKVEY-------GARISDEFMYASDE-EKIN-ILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASA-VIGT 136 (298) Q Consensus 67 ~~~~v~l~~~k~~~-------~~~iS~ell~~~~d-~~~~-l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~-~~~~ 136 (298) ..+.....+..+.. .+.+|.-....... -..+ ++.+..+++-++ ..+|+++++|.....+...+ .... T Consensus 145 ~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~a--v~iEkalI~G~~~~~~~~~g~~R~m 222 (418) T protein:vir:10 145 RPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFHA--TEQETAIFFGQAFMGTYNGQPLHTT 222 (418) T ss_pred cCCcceecceeccchhhhhhhhhhhhhhhhhccccccCchHHHHHHHHHHHHH--HHHHHHHhcccccCCCcCCcchhhH Confidence 22222222222222 23333332110000 0011 233444444433 47899999995222222222 2333 Q ss_pred ccccccc-----ccccccccccchhHHHHHHHhhhhhhc----CCcc----cEEEEcHHHHHHHHHhhccCCceeecccc Q lcl|Aclame:pro 137 NHFDSKV-----TQKVEAPRGIADPNGAIENAVELLTGV----DADV----TGIAINPSFRSALAKQKDLQGNALFPELK 203 (298) Q Consensus 137 ~~~~~~~-----~~~~~~~~~~~~~~~~i~~~~~~l~~~----~~~~----~~~vm~~~~~~~L~~lkd~~G~~l~~~~~ 203 (298) .++.... +......+.+...++.+.+++...-.- +.+. -.+++++++..++.++- |........ T Consensus 223 ~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~~I~~~~~e 299 (418) T protein:vir:10 223 QGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---GEVTVTQRE 299 (418) T ss_pred HHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---hheeecccc Confidence 3332211 233333334455678888877665321 1111 13678999999888873 321111000 Q ss_pred cc-cC--------cc--eecceeeEecCccccccccccceEEEeeccceEEEEee--cceEEEEeeccc----------c Q lcl|Aclame:pro 204 WG-AT--------PD--TINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYA--KEVPLEVIQYGD----------P 260 (298) Q Consensus 204 ~~-~~--------~~--~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~--~~~~i~~~~~~~----------~ 260 (298) .. +. .+ .|.-.||+..=+||. +.+++-|... +.+.+- +++..+..-... + T Consensus 300 ~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~------g~mlVvD~~~-vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~ 372 (418) T protein:vir:10 300 TSYGMVFTEWKFFKGRLILKEHPLFSAIGISP------GFAVVVDVPA-VKLAYMDGRNAKVENYGQGGGENKSGATDYS 372 (418) T ss_pred eeeeEEEEEEEcceEEEEeecccccccccCCC------ceEEEEcccc-ceEEEeccccccchhcccCCCcccccccccc Confidence 00 00 00 112224444335554 4678888764 344444 555554432211 1 Q ss_pred cccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 261 DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 261 ~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ++.+++ -+.+++. -.+...+++|.+.+++++-- T Consensus 373 ~~~~~D-~~kG~iv----~E~tLe~~N~~a~avitgl~ 405 (418) T protein:vir:10 373 YGHGVD-AQGGSLT----SEWALELLNPQGCAVITGLQ 405 (418) T ss_pred cccccc-cccceEE----EEeeeeeecccceEEeeccc Confidence 111112 1334432 45777889999999998876 No 227 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=86.13 E-value=0.048 Score=27.82 Aligned_cols=281 Identities=14% Similarity=0.068 Sum_probs=125.1 Q ss_pred Ceecccc-c-cchhHHHHHHHHHHhhchh-----hhhcceeecCCCceEEEEEeCCcceEEeecc-------cccccccc Q lcl|Aclame:pro 1 MVLNKGT-L-FDPELVTDLISKVAGKSSI-----ARLSAQKPIPFNGEKVFTFTMDSEIDVVAES-------GKKTHGGV 66 (298) Q Consensus 1 mat~gg~-l-ip~~~~~~ii~~~~~~s~i-----~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~-------~~~~~~~~ 66 (298) -++.++. + ++..- . +++.+.+ ....++..+.+..+++-|...+-.+.-++.| ..++|..- T Consensus 71 ~~~a~~T~i~V~~~~---~---f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd 144 (418) T protein:vir:96 71 EALADATVLTVENSD---G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQ 144 (418) T ss_pred EEecCceEEEecCCc---c---cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcccccc Confidence 1221111 1 22111 1 2333332 2334455555666777775544333223222 23333333 Q ss_pred ceeeEEEeeeEEEEEEeecHHHhhcccccHH--------HHHHHHHHHHHHHHHHHHHHHHhcccccc---cccc----- Q lcl|Aclame:pro 67 TLAPQTMVPIKVEYGARISDEFMYASDEEKI--------NILQAFNDGFAKKVARGIDLMAFHGVNPR---LGTA----- 130 (298) Q Consensus 67 ~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~--------~l~~~i~~~la~~i~~~~d~~~l~G~~~~---~g~~----- 130 (298) ..+.....+..+..+..|-+|-..-+..+.. ++....+++|.+. ...+|.+++.|..-- +|.. T Consensus 145 ~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~t~ 223 (418) T protein:vir:96 145 RPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHTTQ 223 (418) T ss_pred cCCcceecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCccccccc Confidence 3333334444444444444443322222111 1222223444444 346688888875211 1111 Q ss_pred cccccccccccccccccccccccchhHHHHHHHhhhhhhc----CCcc----cEEEEcHHHHHHHHHhhccCCceeeccc Q lcl|Aclame:pro 131 SAVIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGV----DADV----TGIAINPSFRSALAKQKDLQGNALFPEL 202 (298) Q Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~----~~~~----~~~vm~~~~~~~L~~lkd~~G~~l~~~~ 202 (298) ....++..+. .+......+.....++.+.+++...... +... -.+++++++..+|.++-. +-++.-++. T Consensus 224 R~m~gI~~f~--~~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~~~~en 300 (418) T protein:vir:96 224 GIVDAIRQYA--PDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRET 300 (418) T ss_pred chhHHHHhhc--cccccccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEeccccc Confidence 1112222221 1122223333345577777776664331 1221 136889999999998753 233222211 Q ss_pred ccccCc---ceecc-eeeEecCccccccccccceEEEeeccceEEEEee--cceEEEEeeccc----------ccccchh Q lcl|Aclame:pro 203 KWGATP---DTING-LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYA--KEVPLEVIQYGD----------PDNSGLD 266 (298) Q Consensus 203 ~~~~~~---~~l~G-~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~--~~~~i~~~~~~~----------~~~~~~~ 266 (298) ..+... .+-+| ++++.++.+|..- -....+++-|.+.. .+.+- +++..+..-... +++.+++ T Consensus 301 ~~G~vv~~~~Td~G~v~ii~n~~~pad~-I~~g~mlVvD~~~v-kL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D 378 (418) T protein:vir:96 301 SYGMVFTEWKFFKGRLIIKEHPLFSAIG-ISPGFAVVVDVPAV-KLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVD 378 (418) T ss_pred eeceEEEEEEeeccEEEEEecCCCCccc-cCcceEEEEecCce-EEEEecCCCccchhcccCCCcccccccccccccccc Confidence 111111 12335 4888888887532 13345677777643 33332 444444332111 1111112 Q ss_pred hhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 267 LKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 267 ~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -+.+++. ..+.+.+++|++.+++++.- T Consensus 379 -~~~G~l~----~Eltle~~N~~a~a~itgl~ 405 (418) T protein:vir:96 379 -AQGGSLT----SEWALELLNPQGCAVITGLQ 405 (418) T ss_pred -cccCEEE----EEEEEEeecccccEEeeccc Confidence 1334433 35677789999999998876 No 228 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=85.98 E-value=0.049 Score=27.77 Aligned_cols=269 Identities=14% Similarity=0.065 Sum_probs=129.0 Q ss_pred Ceecc-ccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeecc--cc---ccccccc---eeeE Q lcl|Aclame:pro 1 MVLNK-GTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAES--GK---KTHGGVT---LAPQ 71 (298) Q Consensus 1 mat~g-g~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~--~~---~~~~~~~---~~~v 71 (298) |.+.. -.++.|.+-+--+.+-.+..+-..+++..|++....+||... .++.-+.+. +. ....++. .... T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~--~eaF~~~~t~r~~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFG--KESFRLYKTERALRARSNRMNPEDLGSIDI 78 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeEC--cccccchhhhcccCCCcceeeccccccccc Confidence 55555 455666665544555445545556788999887778888874 233212211 11 1111111 1123 Q ss_pred EEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc-ccccccccccccccccccccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV-NPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~-~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) .+..|-+..-+. .+ ...++..++.+...+.+.+.|.+..|..+-.-. +..+. ..+.. .. .+.+..+. T Consensus 79 ~~~~~~L~~~id--~r---~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y----~~~~k--~t-LsGt~~Ws 146 (307) T protein:vir:10 79 VLDEHDLEYPID--YR---EDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSY----AGGNK--KQ-LSATEKFT 146 (307) T ss_pred ccccccccccCC--hh---hcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCcccc----CCCce--EE-eccccccC Confidence 344444443222 22 222333455566666666666555544332110 11111 01111 11 12233566 Q ss_pred cccchhHHHHHHHhhhhhhc-CCcccEEEEcHHHHHHHHH---hh---ccCCceeecccccccCcceecce-eeEecCcc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGV-DADVTGIAINPSFRSALAK---QK---DLQGNALFPELKWGATPDTINGL-PVDVNKTV 222 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~-~~~~~~~vm~~~~~~~L~~---lk---d~~G~~l~~~~~~~~~~~~l~G~-PV~~s~~~ 222 (298) ...+++..+|.+...++... +..|+..+|.++.|.+|++ ++ +..+.-+..+ ..-..++|+ -|++.+.. T Consensus 147 d~~sDPi~di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~----~~la~ll~v~~i~vg~a~ 222 (307) T protein:vir:10 147 AAGSDPVGVIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTV----DLLKEIFEVENIAVGEAI 222 (307) T ss_pred CCCCCcHHHHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCccccccCH----HHHHHHhCceeEEEeeee Confidence 67888999999998888654 6789999999999988864 21 1222211111 111224442 23222211 Q ss_pred cccccc------ccceEEE-------------eeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEcc Q lcl|Aclame:pro 223 SDMSLT------QRDRAII-------------GDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) Q Consensus 223 ~~~~~~------~~~~~~~-------------gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~ 283 (298) -+.... +.+.++. +..+.++.+ .+++..+...++ ...+...+|+.....- T Consensus 223 ~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~-~~~g~~~~d~~~----------~~~~~~~~r~~~~~~~ 291 (307) T protein:vir:10 223 YADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL-RKKGNPVVDTRI----------EDGKLELVRSTDIFRP 291 (307) T ss_pred eeccCCccceeCCCceEEEecccccCCCCCcccccccceeE-EEcCCeEeecee----------cCCceeEEeccccccc Confidence 110000 0111111 011223333 233333321111 1223344677777777 Q ss_pred EEecccceEEEeecC Q lcl|Aclame:pro 284 GILDATKFARVTEAN 298 (298) Q Consensus 284 ~v~~~~a~~~l~~a~ 298 (298) .+.-+++=..|++|. T Consensus 292 ~i~~~~~G~li~~~~ 306 (307) T protein:vir:10 292 YLLGADAGYLISGIN 306 (307) T ss_pred eeecccccceeccCC Confidence 888888888999999 No 229 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=85.28 E-value=0.054 Score=27.53 Aligned_cols=287 Identities=13% Similarity=0.029 Sum_probs=129.8 Q ss_pred Ceecc-ccccc---hhHHHHHHHHHHhhchhhh-hccee---e------cCC---CceEEEEEeCCcceEEeeccc--cc Q lcl|Aclame:pro 1 MVLNK-GTLFD---PELVTDLISKVAGKSSIAR-LSAQK---P------IPF---NGEKVFTFTMDSEIDVVAESG--KK 61 (298) Q Consensus 1 mat~g-g~lip---~~~~~~ii~~~~~~s~i~~-~~~~~---~------~~~---~~~~ip~~~~~~~a~~v~E~~--~~ 61 (298) ||.+. ++-=| ...+..+.-...+.+.+.. +...- | +.. ..+++.-.. .-...+|-+++ +- T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~-~L~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSV-HLRGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeee-ecccCCcccCceeec Confidence 88654 33222 3345555555555555543 32210 0 000 112222111 11223443333 33 Q ss_pred cccccceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc----- Q lcl|Aclame:pro 62 THGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGT----- 136 (298) Q Consensus 62 ~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~----- 136 (298) .+..++|.+-++.+-.+..-+.....+-+ -....++...-++.|+.-+++..|..+|.=.....|....+... T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~--qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~ 157 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSR--KRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTG 157 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhh--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccc Confidence 46677777666666655554443222211 14457888889999999999999987773211112221111110 Q ss_pred ------------ccccc-ccccccccccccchhHHHHHHHhhhhhhcCCc----------------ccEEEEcHHHHHHH Q lcl|Aclame:pro 137 ------------NHFDS-KVTQKVEAPRGIADPNGAIENAVELLTGVDAD----------------VTGIAINPSFRSAL 187 (298) Q Consensus 137 ------------~~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~----------------~~~~vm~~~~~~~L 187 (298) ..+-. ..+.......+....++.|.++...+...+.. .-.++|||..+..| T Consensus 158 ~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~L 237 (364) T protein:vir:93 158 YAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDM 237 (364) T ss_pred ccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhh Confidence 00000 11111122223333466677776655443211 11589999999999 Q ss_pred HHhhc--------------cCCceeecccccccCcceecceeeEecCcccc----cccccc--c-eEEEeeccceEEEEe Q lcl|Aclame:pro 188 AKQKD--------------LQGNALFPELKWGATPDTINGLPVDVNKTVSD----MSLTQR--D-RAIIGDFANGFKWGY 246 (298) Q Consensus 188 ~~lkd--------------~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~----~~~~~~--~-~~~~gd~~~~~~~~~ 246 (298) +.-.| ...+|||. +.-+.+.|++|+-.+.++. +++.+. . .+++|--.-++.|+- T Consensus 238 r~~t~~~w~d~qk~A~~~~g~~nPlF~-----G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~ 312 (364) T protein:vir:93 238 RTAAGGTWIDFQKAAAAAEGRNNPIFK-----GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGT 312 (364) T ss_pred hhcCCHHHHHHHHHhhhcccccCCcee-----cCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeec Confidence 75332 12245664 4467889998875544431 122111 1 246666555566666 Q ss_pred ecceEEEEeecc-ccc-ccchhh-hhcCcEEEEEEEEEccEEecccceEEEee Q lcl|Aclame:pro 247 AKEVPLEVIQYG-DPD-NSGLDL-KGYNQVYIRAELFLGWGILDATKFARVTE 296 (298) Q Consensus 247 ~~~~~i~~~~~~-~~~-~~~~~~-f~~n~v~~r~~~r~~~~v~~~~a~~~l~~ 296 (298) ..++...+.++. +-. ...+.. +..++-..|... -|+++.=-...+++-. T Consensus 313 ~~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF~~-~DfGvi~idtaa~~~~ 364 (364) T protein:vir:93 313 ANGLRFDWEETVKDYGNEPAIAAGFIAGMKKARFNN-KDFGVISIDTAAKKHS 364 (364) T ss_pred CCCCCceeeecccCCCCchhhhhhhHhhhhhcccCC-ccceEEEecccccccC Confidence 556655444332 111 011110 122222222221 1333332222222222 No 230 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=82.55 E-value=0.076 Score=26.72 Aligned_cols=267 Identities=9% Similarity=-0.038 Sum_probs=118.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhh-------hcceeecCCCceEEEEEeC--CcceEEeeccccc-cccccceee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIAR-------LSAQKPIPFNGEKVFTFTM--DSEIDVVAESGKK-THGGVTLAP 70 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~-------~~~~~~~~~~~~~ip~~~~--~~~a~~v~E~~~~-~~~~~~~~~ 70 (298) ||.+-. +.+++.+.+.+...+.-.. -..+.-.++..++||+.+. +-..+-..-+-.. ..-+.++.. T Consensus 1 Mainya----~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et 76 (346) T protein:vir:10 1 MTINYA----EKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDS 76 (346) T ss_pred CcchhH----HHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeE Confidence 887653 3455666665555432211 1123345567899999863 2222222222211 233555666 Q ss_pred EEEeeeEEEEE-EeecHHHhhccccc--HHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccccccccccccccccccc Q lcl|Aclame:pro 71 QTMVPIKVEYG-ARISDEFMYASDEE--KINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTASAVIGTNHFDSKVTQK 146 (298) Q Consensus 71 v~l~~~k~~~~-~~iS~ell~~~~d~--~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~~~~~ 146 (298) .+|.-.+--.+ +.--+ .++. ...+...+.+...+..+=.+|...|.-.-. ..+... ... T Consensus 77 ~tl~qDR~~~F~vD~mD-----vDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~------------~~~ 139 (346) T protein:vir:10 77 YELKNERYWSTLVDPSD-----IDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHD------------GGI 139 (346) T ss_pred EEeeccccceecccccc-----hHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcc------------ccc Confidence 66665553222 11101 0111 112222222223333334556544321000 000000 000 Q ss_pred cccccccchhHHHHHHHhhhhhhcCCcc--cEEEEcHHHHHHHHHhhccCCc-eeecccccccCcceecceeeEe--cCc Q lcl|Aclame:pro 147 VEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDLQGN-ALFPELKWGATPDTINGLPVDV--NKT 221 (298) Q Consensus 147 ~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~vm~~~~~~~L~~lkd~~G~-~l~~~~~~~~~~~~l~G~PV~~--s~~ 221 (298) ....-+....++.|.++..++...+... -.++|+|.++..|++.+.=+.. .+.......+..++|.|+||+. ++. T Consensus 140 ~~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r 219 (346) T protein:vir:10 140 TTNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDL 219 (346) T ss_pred cccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhh Confidence 1111234567899999999998877643 3579999999988665421111 1112222355678999999974 445 Q ss_pred cccc---------cccccc-eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccce Q lcl|Aclame:pro 222 VSDM---------SLTQRD-RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKF 291 (298) Q Consensus 222 ~~~~---------~~~~~~-~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~ 291 (298) |++. ..+++. -.++.. ..+. +.+..--.++++.-. . . ..|...+.-+.+.|.=|.+.+.= T Consensus 220 ~~t~~~f~~G~~~~t~ak~INfiiv~-~~A~-ia~~K~~~~~if~P~-~--~-----~~g~~l~~~R~Y~D~fv~~nk~~ 289 (346) T protein:vir:10 220 MQTAYDFSDGSKIIDTAKQIEMFLIY-NGVQ-IAPEKYSFVGFDQPS-A--A-----TSGNYLYYEQSYDDVLLLNTKTK 289 (346) T ss_pred cccchhhccCccccCCccceeEEEEC-Ccee-eeeeeeeeeEeeCCC-C--C-----cccceeeeeeeeeeeeeeccccc Confidence 5421 111111 223332 2222 233333333333221 1 1 12333444556677766654432 Q ss_pred EE---EeecC Q lcl|Aclame:pro 292 AR---VTEAN 298 (298) Q Consensus 292 ~~---l~~a~ 298 (298) .+ ++.|. T Consensus 290 ~Iyv~~~~a~ 299 (346) T protein:vir:10 290 GIQFVVSDKP 299 (346) T ss_pred eEEEeeeccc Confidence 22 22222 No 231 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=81.82 E-value=0.082 Score=26.53 Aligned_cols=296 Identities=11% Similarity=0.036 Sum_probs=126.9 Q ss_pred CeeccccccchhHHHHHHHHHH-hhch-h-hhhcceeecCCCceEEEEE-eCCcc-eEEeecccccc-ccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA-GKSS-I-ARLSAQKPIPFNGEKVFTF-TMDSE-IDVVAESGKKT-HGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~-~~s~-i-~~~~~~~~~~~~~~~ip~~-~~~~~-a~~v~E~~~~~-~~~~~~~~v~l~ 74 (298) |++--..+-+.++ ..++..+. .... + ..+.+..++.+..+.+... ..... +.+++.+...+ ...-.++..+.. T Consensus 1 M~~i~d~f~~~~l-~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:96 1 MGLIYDKVTASNI-AGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDEQ 79 (348) T ss_pred CcchhhccCHHHH-HHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeeee Confidence 8875444444444 44554332 2322 3 3566766665544444332 22233 66888776544 344557777777 Q ss_pred eeEEEEEEeecHHHh------hccc-cc-HHHHHHHHHH---HHHHHHHHHHHHHHh----cccc--ccccccccc-ccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFM------YASD-EE-KINILQAFND---GFAKKVARGIDLMAF----HGVN--PRLGTASAV-IGT 136 (298) Q Consensus 75 ~~k~~~~~~iS~ell------~~~~-d~-~~~l~~~i~~---~la~~i~~~~d~~~l----~G~~--~~~g~~~~~-~~~ 136 (298) +-.++-...++.+=+ +.+. ++ ...+...+.+ .+.+.+.+.+|..+. +|.- .+.|..-.. .+. T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vdfg~ 159 (348) T protein:vir:96 80 MPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEeccC Confidence 766666555554211 1111 00 0112222222 233445555553333 3421 011110000 010 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHH---hhc----cCCce-eecccccccCc Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK---QKD----LQGNA-LFPELKWGATP 208 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~---lkd----~~G~~-l~~~~~~~~~~ 208 (298) .. ....+....++....+++.+|.++...+...+..++.++|++++|.+|++ .++ .++.. ...+......- T Consensus 160 ~~-~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 238 (348) T protein:vir:96 160 KA-DHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQNYV 238 (348) T ss_pred Cc-ccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHHHH Confidence 00 11122233566677788999999988888888888899999999999864 332 11111 11111111112 Q ss_pred ceecceeeEec-Ccccccccc-----ccceEEEeec-c-ceEEEEee-cceEEEEeeccccc----c--cchhhhhc-C- Q lcl|Aclame:pro 209 DTINGLPVDVN-KTVSDMSLT-----QRDRAIIGDF-A-NGFKWGYA-KEVPLEVIQYGDPD----N--SGLDLKGY-N- 271 (298) Q Consensus 209 ~~l~G~PV~~s-~~~~~~~~~-----~~~~~~~gd~-~-~~~~~~~~-~~~~i~~~~~~~~~----~--~~~~~f~~-n- 271 (298) .+++|+++++- ....+..+. ..+.+++.-- . +...|+.- ++...........+ + -.+..|.+ + T Consensus 239 ~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP 318 (348) T protein:vir:96 239 ADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDP 318 (348) T ss_pred hhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeeecCCC Confidence 34567777652 222111111 1112222111 1 11112210 00000000000000 0 00000111 1 Q ss_pred -cEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 272 -QVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 272 -~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++..+.==.+.+|+++.++|--. T Consensus 319 ~~~~~~~~s~plPv~~~~~~~~~a~Vl~ 346 (348) T protein:vir:96 319 VNVQTKVSMVALPSFERLGDVYMLTVIP 346 (348) T ss_pred ceEEEEEeeeeeccccCCCcEEEEEEec Confidence 122334333333445788888887777 No 232 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=81.35 E-value=0.086 Score=26.41 Aligned_cols=284 Identities=12% Similarity=0.010 Sum_probs=123.3 Q ss_pred Ce----------eccccccchhHHHHHHHHHHhh--chhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccc Q lcl|Aclame:pro 1 MV----------LNKGTLFDPELVTDLISKVAGK--SSIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGG 65 (298) Q Consensus 1 ma----------t~gg~lip~~~~~~ii~~~~~~--s~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~ 65 (298) |. +++|.|--+.+..+|-.+.... -.+.+-..+.|..+.--++-.... .+...+++|+...+.++ T Consensus 22 ~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d 101 (464) T protein:vir:80 22 FTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAPISD 101 (464) T ss_pred HHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheeeccCccccccccccccccccCC Confidence 22 2234444444555554433222 233333445555554334433332 25678999999999999 Q ss_pred cceeeEEEeeeEEEEEE--eecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGA--RISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-----GTASAVIGTNH 138 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~--~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~-----g~~~~~~~~~~ 138 (298) +++.+.....|=+..-- .+-.+|.+ +..+-++.+.++-.-.++..+|.++|+|+..-+ |..-.+.|+.. T Consensus 102 ~~~~Rr~~~~Kfl~~~r~vsia~~lvn----~~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~ 177 (464) T protein:vir:80 102 PNLRQKTVNMKYVSDTKNMSIATGLVN----NIEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAK 177 (464) T ss_pred CceEEEEEEeeeeecceeeeeehhhhc----chhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHh Confidence 99999888876554433 33333332 223555667777777899999999999974333 22223344332 Q ss_pred cccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHH-HHhhccCCceeecccccccCcceecceeeE Q lcl|Aclame:pro 139 FDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSAL-AKQKDLQGNALFPELKWGATPDTINGLPVD 217 (298) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L-~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~ 217 (298) .... ..+.-........+.|..+-..+...+..++-.+|+..+.+.+ ...-+.+=+.+. + .+.+...|+||- T Consensus 178 lI~~--~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~~-~----n~~~~~~G~~v~ 250 (464) T protein:vir:80 178 LIDK--HNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVIS-D----NGQNATMGFNVK 250 (464) T ss_pred hcCC--CceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEEc-C----CCCcceeeeecc Confidence 2211 1111222233445667777777777888899999999998876 444433222221 1 111123333331 Q ss_pred --ecCccccccccccceEEEeeccceE-----EEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecc-- Q lcl|Aclame:pro 218 --VNKTVSDMSLTQRDRAIIGDFANGF-----KWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDA-- 288 (298) Q Consensus 218 --~s~~~~~~~~~~~~~~~~gd~~~~~-----~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~-- 288 (298) ++.. .. ... ....++.+..... .-+.....++......+..+....-.......+++...-+.+---| T Consensus 251 ~f~sa~-G~-i~L-~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~ 327 (464) T protein:vir:80 251 GFNSAR-GF-IRL-HGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVVVSDDAESAPSD 327 (464) T ss_pred cccccc-cc-eec-cCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeEEEEEEECCCCccccce Confidence 0000 00 000 0000111110000 0000001111111111111110000000111222222222111112 Q ss_pred ---------------------------cceEEEeecC Q lcl|Aclame:pro 289 ---------------------------TKFARVTEAN 298 (298) Q Consensus 289 ---------------------------~a~~~l~~a~ 298 (298) ..+.+-..+= T Consensus 328 ~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~ 364 (464) T protein:vir:80 328 VASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGL 364 (464) T ss_pred eeeeeecCcccEEEEEEEeCCccccccceEEEEeecC Confidence 2222222110 No 233 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=78.78 E-value=0.11 Score=25.82 Aligned_cols=280 Identities=11% Similarity=0.000 Sum_probs=110.4 Q ss_pred CeeccccccchhHHHHHHHHHH---hhchhhhhcceeecCCCce-------EEEEEeCC------------cceEEe--- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA---GKSSIARLSAQKPIPFNGE-------KVFTFTMD------------SEIDVV--- 55 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~---~~s~i~~~~~~~~~~~~~~-------~ip~~~~~------------~~a~~v--- 55 (298) .+.+.+.---..+.+.++..+| .+.+...++.+.||++... .++..... +++.|- T Consensus 79 i~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~ 158 (521) T protein:vir:10 79 IAAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQG 158 (521) T ss_pred ccccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccccccccc Confidence 2222211111223334444444 3445567888888865421 11111000 011110 Q ss_pred ------------------------------------------------------------------e--------c---- Q lcl|Aclame:pro 56 ------------------------------------------------------------------A--------E---- 57 (298) Q Consensus 56 ------------------------------------------------------------------~--------E---- 57 (298) + | T Consensus 159 ~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~ 238 (521) T protein:vir:10 159 AAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQES 238 (521) T ss_pred cccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhcc Confidence 0 1 Q ss_pred -----cccccccccceeeEEEeeeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccc Q lcl|Aclame:pro 58 -----SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTA 130 (298) Q Consensus 58 -----~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~ 130 (298) +..+++-..++++++..+|.=+=...+|-||.++--. -..|.+++|..-|+-.|...+++-++.-..- ..-.. T Consensus 239 ~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 318 (521) T protein:vir:10 239 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGK 318 (521) T ss_pred CCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeee Confidence 1124445555566666555545556788998764322 1244566666666666666666666632110 00000 Q ss_pred cccc----ccccccccccccccc-ccccc----hhHHHHHHHhhhhhh--cCCcccEEEEcHHHHHHHHHhh-----ccC Q lcl|Aclame:pro 131 SAVI----GTNHFDSKVTQKVEA-PRGIA----DPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK-----DLQ 194 (298) Q Consensus 131 ~~~~----~~~~~~~~~~~~~~~-~~~~~----~~~~~i~~~~~~l~~--~~~~~~~~vm~~~~~~~L~~lk-----d~~ 194 (298) .+.+ ...++.......... ..... ..+-.|......+.. .....+-++|+++....|...- .+. T Consensus 319 ~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~ 398 (521) T protein:vir:10 319 SGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQ 398 (521) T ss_pred eeeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccc Confidence 0000 001111111111000 00000 012222222223222 2245667999999999887531 111 Q ss_pred C-ceeecccccccC-cceec-ceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccch Q lcl|Aclame:pro 195 G-NALFPELKWGAT-PDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGL 265 (298) Q Consensus 195 G-~~l~~~~~~~~~-~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~ 265 (298) | ..=|..+.+... .+.|. |++|+++++.+. +.+++|-- +...+ .-.+-..||.. .|.. T Consensus 399 ~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~K-G~~~~----~~glfyaPYv~l~~~~~~dp~-- 465 (521) T protein:vir:10 399 GLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQ------DYFTVGYK-GPNEM----DAGIYYAPYVALTPLRGSDPK-- 465 (521) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCc------ceEEEEEe-CCccc----ccceeeccccccccccccCCc-- Confidence 1 111222222211 24555 479999988753 34444421 00000 00111222211 1111 Q ss_pred hhhhcCcEEEEEEEEEccEEecccc-------eEEEeecC Q lcl|Aclame:pro 266 DLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) Q Consensus 266 ~~f~~n~v~~r~~~r~~~~v~~~~a-------~~~l~~a~ 298 (298) -||- .+.|+ .|+++.+ +|=+ ..+|++-+ T Consensus 466 -sfqP-~~g~~--tRY~l~~-NP~~~~~~~~~~~~i~~~~ 500 (521) T protein:vir:10 466 -NFQP-VMGFK--TRYGIGI-NPFAESAAQAPASRIQSGM 500 (521) T ss_pred -cccc-eeeee--eeeceee-cCcccccCCccceeecccc Confidence 1332 13333 3565543 3311 12333333 No 234 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=78.48 E-value=0.11 Score=25.76 Aligned_cols=278 Identities=11% Similarity=-0.007 Sum_probs=104.2 Q ss_pred CeeccccccchhH---HHHHHHHHHhhchh---------hhhcceeecCCCceEEEEEeCCcceEEeec---------cc Q lcl|Aclame:pro 1 MVLNKGTLFDPEL---VTDLISKVAGKSSI---------ARLSAQKPIPFNGEKVFTFTMDSEIDVVAE---------SG 59 (298) Q Consensus 1 mat~gg~lip~~~---~~~ii~~~~~~s~i---------~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E---------~~ 59 (298) +++..+...-..+ ...++...-...+- ........+..+.+ +..+.+-..-.+| +. T Consensus 178 ~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~---~~~~~gmsTa~aEal~~~g~ss~~ 254 (529) T protein:vir:10 178 QAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGEL---AEIAEGMATSIAELRQGFNGTTDN 254 (529) T ss_pred ceeeccccceeeecccccccccccccccccccccccCCccccccccccccccc---cccccccchhhhhccccCCCCccc Confidence 4433332211111 01111100000000 00000000111100 0011111111223 34 Q ss_pred cccccccceeeEEEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cccccccc-- Q lcl|Aclame:pro 60 KKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNPRL-GTASAVIG-- 135 (298) Q Consensus 60 ~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~-g~~~~~~~-- 135 (298) .+++-..++++++..++.=+=...+|-||.|+--.- ..|.+++|..-|+..|...+++-++.-..... -...+... T Consensus 255 ~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~ 334 (529) T protein:vir:10 255 PWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTV 334 (529) T ss_pred cccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccc Confidence 577888888888777776666678899987643221 24455666666666666666666665111000 00000000 Q ss_pred --ccccccccccccccc-ccc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHH--hhccCCcee----ec Q lcl|Aclame:pro 136 --TNHFDSKVTQKVEAP-RGI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAK--QKDLQGNAL----FP 200 (298) Q Consensus 136 --~~~~~~~~~~~~~~~-~~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~--lkd~~G~~l----~~ 200 (298) ..++........... -.. ...+-.|...-+.+... +...+.++|+++....|.. ++|.-+..- |. T Consensus 335 ~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~ 414 (529) T protein:vir:10 335 GSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLN 414 (529) T ss_pred cccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhccccccccccccccce Confidence 011111111111000 001 11233344444444332 2346679999999999863 222211111 11 Q ss_pred ccccc-cCcceec-ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEeecccccccchhhhhcCcE Q lcl|Aclame:pro 201 ELKWG-ATPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQV 273 (298) Q Consensus 201 ~~~~~-~~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v 273 (298) .+.+. -..+.|. |++|+++++.+. +.+++|-- ..+..|.+=-+++ +.+ ..|.. -||- .+ T Consensus 415 ~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfy~PYv~l~--~~~--~~dp~---sfqP-~~ 480 (529) T protein:vir:10 415 ADTTKGVFAGVLGGRYKVYIDQYARQ------DYFTMGYRGANNLDAGIYYCPYVALT--PLR--GSDPK---NFQP-VM 480 (529) T ss_pred eecCCceEEEEecCceEEEecCCCCc------ceEEEEEeCCcccccceeeccccccc--ccc--ccCCC---cccc-ee Confidence 11111 1134555 479999988753 34444421 1111111111111 111 11111 1332 13 Q ss_pred EEEEEEEEccEEecccc-------eEEEeecC Q lcl|Aclame:pro 274 YIRAELFLGWGILDATK-------FARVTEAN 298 (298) Q Consensus 274 ~~r~~~r~~~~v~~~~a-------~~~l~~a~ 298 (298) .|+ .|+++.+ +|=+ ..++.+.+ T Consensus 481 g~~--tRY~l~~-NP~~~~~~~~~~~r~~~g~ 509 (529) T protein:vir:10 481 GFK--TRYAIGV-NPFAESRTQAPTSRISNGM 509 (529) T ss_pred eee--eeeceee-cCccccccccccccccCCc Confidence 333 3555432 3311 11222222 No 235 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=76.75 E-value=0.13 Score=25.40 Aligned_cols=281 Identities=10% Similarity=0.030 Sum_probs=105.7 Q ss_pred CeeccccccchhHHHHHHHHHH---hhchhhhhcceeecCCCc-----eE--EEEEeC---C---------cceEEe--- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA---GKSSIARLSAQKPIPFNG-----EK--VFTFTM---D---------SEIDVV--- 55 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~---~~s~i~~~~~~~~~~~~~-----~~--ip~~~~---~---------~~a~~v--- 55 (298) .+.+...---....+.++..+| ...+...++.+.||++.. ++ |-.... . +++.|- T Consensus 87 ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~ 166 (534) T protein:vir:10 87 IASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRG 166 (534) T ss_pred ccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccccccccc Confidence 2211111111222334444444 344556777888886542 11 100000 0 000110 Q ss_pred -------------------------------------------------------------------------------e Q lcl|Aclame:pro 56 -------------------------------------------------------------------------------A 56 (298) Q Consensus 56 -------------------------------------------------------------------------------~ 56 (298) + T Consensus 167 ~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~A 246 (534) T protein:vir:10 167 AAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFA 246 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccccchhhH Confidence 1 Q ss_pred c---------cccccccccceeeEEEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc- Q lcl|Aclame:pro 57 E---------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNP- 125 (298) Q Consensus 57 E---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~~- 125 (298) | +..+++-..++++++..+|.=+=...+|-||.|+--.- ..|.+++|..-|+..|...+++-++.-... T Consensus 247 E~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~ 326 (534) T protein:vir:10 247 ELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINAT 326 (534) T ss_pred hhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhh Confidence 1 01244555566666666555455567888986542210 133445555555555555555555532111 Q ss_pred -ccccccccccc---cccccccccccc-cccccc----hhHHHHHHHhhhhhh--cCCcccEEEEcHHHHHHHHHhh--c Q lcl|Aclame:pro 126 -RLGTASAVIGT---NHFDSKVTQKVE-APRGIA----DPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK--D 192 (298) Q Consensus 126 -~~g~~~~~~~~---~~~~~~~~~~~~-~~~~~~----~~~~~i~~~~~~l~~--~~~~~~~~vm~~~~~~~L~~lk--d 192 (298) .-+......+. .++......... ...... .++-.|...-+.+.. .+...+-++|+++....|...- + T Consensus 327 a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~ 406 (534) T protein:vir:10 327 AKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLM 406 (534) T ss_pred hheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchh Confidence 00111110000 111111111110 000000 112222222222222 2235667999999999885421 1 Q ss_pred ---cCCcee-eccccccc-Ccceec-ceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecc------cc Q lcl|Aclame:pro 193 ---LQGNAL-FPELKWGA-TPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYG------DP 260 (298) Q Consensus 193 ---~~G~~l-~~~~~~~~-~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~------~~ 260 (298) ..|-.. ...+.+.. ..|.|. |++|+++++.+. +.+++|-- +...+ .-.+-..||. .. T Consensus 407 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~K-G~~~~----~~glfyaPYv~l~~~~~~ 475 (534) T protein:vir:10 407 TPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAVE------DYFTVGYK-GASEM----DAGLYYCPYVALTPLRGT 475 (534) T ss_pred ccccccccccccccCCCceEEEEecCceEEEecCCCCc------ceEEEEEe-CCccc----ccceeecccccccccccc Confidence 011111 11111111 234565 579999988763 34444421 00000 0011112221 11 Q ss_pred cccchhhhhcCcEEEEEEEEEccEEe------cccceEEEeecC Q lcl|Aclame:pro 261 DNSGLDLKGYNQVYIRAELFLGWGIL------DATKFARVTEAN 298 (298) Q Consensus 261 ~~~~~~~f~~n~v~~r~~~r~~~~v~------~~~a~~~l~~a~ 298 (298) |.. + ||- .+.|+ .|++..+- +.+-+.++.+.+ T Consensus 476 dp~--s-fqP-~~g~~--tRY~l~~NP~~~~~~~~~~~~i~~g~ 513 (534) T protein:vir:10 476 DPK--N-FQP-VLGFK--TRYGVKLHPMADATQNKGFAKISNGM 513 (534) T ss_pred CCc--c-ccc-eeeee--eeeceeecCcccccCCccccccccCC Confidence 211 1 332 13333 35554331 112223333322 No 236 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=75.91 E-value=0.14 Score=25.24 Aligned_cols=295 Identities=11% Similarity=0.028 Sum_probs=124.7 Q ss_pred CeeccccccchhHHHHHHHHH-Hhhchh--hhhcceeecCCCceEEEEEe-CCcc-eEEeeccccccc-cccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKV-AGKSSI--ARLSAQKPIPFNGEKVFTFT-MDSE-IDVVAESGKKTH-GGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~-~~~s~i--~~~~~~~~~~~~~~~ip~~~-~~~~-a~~v~E~~~~~~-~~~~~~~v~l~ 74 (298) |++--..+-+.++ ..++..+ .....+ ..+++..++.+....+.... ..+. +.+++.+.+.+. ..-.++..+.. T Consensus 1 M~~i~d~f~~~~l-~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:27 1 MGLIYDKVTASNI-AGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQ 79 (348) T ss_pred CcchhhhcCHHHH-HHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeee Confidence 8876444334444 3444333 333333 24666555554444333222 2232 567776655443 34456666666 Q ss_pred eeEEEEEEeecHHHh------hcccc--cHHHHHHHH---HHHHHHHHHHHHHHHHh----cccc--ccccccccc-ccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFM------YASDE--EKINILQAF---NDGFAKKVARGIDLMAF----HGVN--PRLGTASAV-IGT 136 (298) Q Consensus 75 ~~k~~~~~~iS~ell------~~~~d--~~~~l~~~i---~~~la~~i~~~~d~~~l----~G~~--~~~g~~~~~-~~~ 136 (298) +-.++-...++.+=+ +.... ..-.+...+ .+.+.+.+.+.+|.++. +|-- .+.|..-.. .+. T Consensus 80 ~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg~ 159 (348) T protein:vir:27 80 MPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeecC Confidence 666665555554322 11110 000111211 22334455555554443 3310 111110000 010 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHH---hhccC----C--ceeecccccccC Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK---QKDLQ----G--NALFPELKWGAT 207 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~---lkd~~----G--~~l~~~~~~~~~ 207 (298) . .....+....+...+.+++++|.+....+...+..+..++|++++|..|++ .++.- + ..+- +.....- T Consensus 160 ~-~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~-~~~~~~~ 237 (348) T protein:vir:27 160 K-PDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVT-KAELENY 237 (348) T ss_pred C-cccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccC-HHHHHHH Confidence 0 011122334567777888999999988888888888999999999999865 33211 1 1111 1111111 Q ss_pred cceecceeeEecC-cccccccc-----ccceEEE-eecc-ceEEEEee-cceEEEEeeccccc----cc--chhhhhc-C Q lcl|Aclame:pro 208 PDTINGLPVDVNK-TVSDMSLT-----QRDRAII-GDFA-NGFKWGYA-KEVPLEVIQYGDPD----NS--GLDLKGY-N 271 (298) Q Consensus 208 ~~~l~G~PV~~s~-~~~~~~~~-----~~~~~~~-gd~~-~~~~~~~~-~~~~i~~~~~~~~~----~~--~~~~f~~-n 271 (298) -+++.|+++++-+ ...+..+. ..+.+++ .+-. +...|+.- ++...........+ +. .+..|.+ + T Consensus 238 ~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 317 (348) T protein:vir:27 238 IADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTD 317 (348) T ss_pred HHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCC Confidence 1245677775422 22111111 1122222 2111 11222210 00000000000000 00 0000111 1 Q ss_pred --cEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 272 --QVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 272 --~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++..+.==.+.+|+++.++|--. T Consensus 318 P~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 346 (348) T protein:vir:27 318 PVNVQTKVSMVALPSFERLDDVYMLTVIP 346 (348) T ss_pred CceEEEEEeeeeeccccCCCcEEEEEEec Confidence 122333333333445788888877666 No 237 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=71.97 E-value=0.19 Score=24.55 Aligned_cols=272 Identities=12% Similarity=0.006 Sum_probs=97.3 Q ss_pred CeeccccccchhH---HHHHHHHHHhhchhhhhcceeecCCC-ceEEE--------EEeCCcceEEeec---------cc Q lcl|Aclame:pro 1 MVLNKGTLFDPEL---VTDLISKVAGKSSIARLSAQKPIPFN-GEKVF--------TFTMDSEIDVVAE---------SG 59 (298) Q Consensus 1 mat~gg~lip~~~---~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip--------~~~~~~~a~~v~E---------~~ 59 (298) -+...|...-.++ ..+.. ............-+.... ...+. ...+.+-..-.+| +. T Consensus 165 ~~~~~G~~~~~~~t~~~gd~~---~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~~ 241 (514) T protein:vir:56 165 GAATDGTPYKAEVTTSGGDVS---MRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNN 241 (514) T ss_pred ccccccccccccccccccccc---cccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCccc Confidence 0000011000000 00000 000000000000000000 00000 0000111111222 34 Q ss_pred cccccccceeeEEEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccccc Q lcl|Aclame:pro 60 KKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVN-----PRLGTASAV 133 (298) Q Consensus 60 ~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~-----~~~g~~~~~ 133 (298) .+++-..++++++..+|.=+=...+|-||.++--.- ..|.+++|..-|+..|...+++-++.-.. ...+...+ T Consensus 242 ~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~- 320 (514) T protein:vir:56 242 EWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG- 320 (514) T ss_pred ccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccccc- Confidence 577778888887777776666678899987643221 24455666666666666666666642211 00110011 Q ss_pred ccccccccccccccccccccchhHHHHHHHhhhh-------hh--cCCcccEEEEcHHHHHHHHHhh--c---cCC--ce Q lcl|Aclame:pro 134 IGTNHFDSKVTQKVEAPRGIADPNGAIENAVELL-------TG--VDADVTGIAINPSFRSALAKQK--D---LQG--NA 197 (298) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l-------~~--~~~~~~~~vm~~~~~~~L~~lk--d---~~G--~~ 197 (298) .+..++..........+. .-..+.++.++.++ .. .....+.++|+++....|...- + ..| .- T Consensus 321 ~~~~G~~d~~~~~d~~~~--~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~ 398 (514) T protein:vir:56 321 AGAAGVFDFSDAVDVKGA--RWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDG 398 (514) T ss_pred cccccccccccccccccc--hHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCcccc Confidence 111222222211111110 01122223322222 22 2345667999999999886411 0 111 00 Q ss_pred eeccccc-ccCcceec-ceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhh Q lcl|Aclame:pro 198 LFPELKW-GATPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKG 269 (298) Q Consensus 198 l~~~~~~-~~~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~ 269 (298) -+..+.. .-..|.|. |++|+++++.+. +.+++|-- ....+ .-.+-..||.. .|.. + || T Consensus 399 ~~~~d~~~~~~aG~l~~~~~vy~D~y~~~------dy~~vG~K-G~~~~----~~glfyaPYv~l~~~~~~dp~--s-fq 464 (514) T protein:vir:56 399 SMNTDTNQTVFAGVLGGRFKVYIDQYAVN------DYFTVGFK-GSTEM----DAGVFYSPYVPLTPLRGSDSK--N-FQ 464 (514) T ss_pred ccccccCcceEEEEecCceEEEecCCCCc------ceEEEEEe-cCcce----ecceeeccccccccccccCCc--c-cc Confidence 0111111 11124455 579999988763 34444421 00000 01122223321 1111 1 33 Q ss_pred cCcEEEEEEEEEccEEecccceEE----Eee--------cC Q lcl|Aclame:pro 270 YNQVYIRAELFLGWGILDATKFAR----VTE--------AN 298 (298) Q Consensus 270 ~n~v~~r~~~r~~~~v~~~~a~~~----l~~--------a~ 298 (298) - .+.|+ .|++..+ +| |+- ... |+ T Consensus 465 P-~~g~~--tRY~l~~-NP--y~~~~~~~~~~~~~~~~~a~ 499 (514) T protein:vir:56 465 P-VIGFK--TRYGVQV-NP--FADPTASATKVGNGAPVAAS 499 (514) T ss_pred c-eeeee--eeeceee-CC--CCCccccccccCCcchhhhc Confidence 2 13333 3555543 33 210 000 00 No 238 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=71.52 E-value=0.19 Score=24.48 Aligned_cols=288 Identities=9% Similarity=-0.042 Sum_probs=135.2 Q ss_pred Ceecc-ccccc---hhHHHHHHHHHHhhchhhhh----cceeecCC-CceEEEEEeC-CcceEEe-ecccccccccccee Q lcl|Aclame:pro 1 MVLNK-GTLFD---PELVTDLISKVAGKSSIARL----SAQKPIPF-NGEKVFTFTM-DSEIDVV-AESGKKTHGGVTLA 69 (298) Q Consensus 1 mat~g-g~lip---~~~~~~ii~~~~~~s~i~~~----~~~~~~~~-~~~~ip~~~~-~~~a~~v-~E~~~~~~~~~~~~ 69 (298) |=-.. ..|+- .+.+.++.+.+-+.++++.. ++..+.++ .++..|..-. .+++.|. ++..-...-.-.|. T Consensus 1 mp~~~lsel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~ 80 (321) T protein:vir:34 1 MPFPNISDIITTTIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDVIS 80 (321) T ss_pred CCCchHHHHHHHHHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhhcc Confidence 22210 01111 11233455555555555432 33344443 2455665544 7889995 56555556667789 Q ss_pred eEEEeeeEEEEEEeecHH-Hhhcc-cccHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccccccccccccccccccccccc Q lcl|Aclame:pro 70 PQTMVPIKVEYGARISDE-FMYAS-DEEKINILQAFNDGFAKKVARGIDLMAFH-GVNPRLGTASAVIGTNHFDSKVTQK 146 (298) Q Consensus 70 ~v~l~~~k~~~~~~iS~e-ll~~~-~d~~~~l~~~i~~~la~~i~~~~d~~~l~-G~~~~~g~~~~~~~~~~~~~~~~~~ 146 (298) +-++.++.++..+.||-. +++.+ ....++++.+=.+..-+.+...++..+.. |++.+.....++.+...+...+... T Consensus 81 ~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~p~tGtv 160 (321) T protein:vir:34 81 SAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPVDPTVGTY 160 (321) T ss_pred ccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhcccCCCCcee Confidence 999999999998888864 34333 34455555555555566677778877765 5543343333333332222111100 Q ss_pred ccc-------------ccccchhHHHHHHHhhhhh----hcCCcccEEEEcHHHHHHHHHhhccCCceeecccc-cccCc Q lcl|Aclame:pro 147 VEA-------------PRGIADPNGAIENAVELLT----GVDADVTGIAINPSFRSALAKQKDLQGNALFPELK-WGATP 208 (298) Q Consensus 147 ~~~-------------~~~~~~~~~~i~~~~~~l~----~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~-~~~~~ 208 (298) ... ...+......|..++.++. .....|..|++....+...++..-..-|+--.+.. .+-.. T Consensus 161 GGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~y~~y~~s~q~~qR~~~~~~a~~Gf~~ 240 (321) T protein:vir:34 161 GGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDAWTTYSNSLQVLQRFTSAEEANLGFRS 240 (321) T ss_pred ccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHHHHHHHHhhheeeeeccccccccccee Confidence 000 0000111233444444333 33346778999999998876644332332211111 11112 Q ss_pred ceecceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeecccccc-cchhhhhcCcEEEEEEEEEccEEec Q lcl|Aclame:pro 209 DTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDN-SGLDLKGYNQVYIRAELFLGWGILD 287 (298) Q Consensus 209 ~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~-~~~~~f~~n~v~~r~~~r~~~~v~~ 287 (298) -.+.|.-|+.+.+.... .+....+|-|-+ ++.+.+..+-.+.......... +.. ...-+|.++ ....+.+ T Consensus 241 Lky~~~div~D~~~g~~--~pan~~yfiNT~-yl~~r~h~~~~~~pi~p~r~~~~Nqd--A~~q~I~~~----GnL~~sn 311 (321) T protein:vir:34 241 LKFLSTDVVLDGGIGGF--AGANTMYFLNTK-YLHFRPHKDRNMVPLSPSRRAAFNQD--AEAQILAWA----GNLTCSG 311 (321) T ss_pred eeeeeEEEEEeCCCCCC--ccccceeeeecc-eEEEEEcCCCceeecCcccccccchh--HHhhhhhhh----heeeeec Confidence 34667777777654322 223345665544 4445544333332222111000 000 011112222 2223345 Q ss_pred ccceEEEeec Q lcl|Aclame:pro 288 ATKFARVTEA 297 (298) Q Consensus 288 ~~a~~~l~~a 297 (298) +.+=.+|+.- T Consensus 312 ~~~~~vL~~~ 321 (321) T protein:vir:34 312 AQFQGRLIAE 321 (321) T ss_pred ccceeEEeeC Confidence 5555555555 No 239 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=70.66 E-value=0.21 Score=24.35 Aligned_cols=265 Identities=14% Similarity=0.049 Sum_probs=100.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE-----EEeCCcceEEeec-cccccccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF-----TFTMDSEIDVVAE-SGKKTHGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip-----~~~~~~~a~~v~E-~~~~~~~~~~~~~v~l~ 74 (298) +...++.+.+....... +.. ....+..++. ..+...++-.-++ +..+++...++++++.. T Consensus 147 ~~~~~~~~~~~~g~~~~---------~~~-----~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVt 212 (462) T protein:vir:10 147 PTASSSAVNDAEGANPG---------LLN-----DSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVT 212 (462) T ss_pred cccccccccccccccce---------eec-----CCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEe Confidence 44444333221111100 000 0000111110 0111112111122 34678888888888877 Q ss_pred eeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccc--cccccccccccccccccccccccccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNP--RLGTASAVIGTNHFDSKVTQKVEAPR 151 (298) Q Consensus 75 ~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) ++.=+=...+|-||.++-.. -..|.+++|..-|+-.|...+++-++.-... .-+.... ....++.... ... T Consensus 213 AKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~-~~~~Gv~dl~-----~~~ 286 (462) T protein:vir:10 213 AKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIAN-TATDGIFDLD-----VDS 286 (462) T ss_pred eeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeeccc-ccccceeeec-----ccc Confidence 77666667889998764322 1234445555555555555555555532110 0011001 0111111110 001 Q ss_pred ccchhHHHHHHHhhhh---------hhcCCcccEEEEcHHHHHHHHHhh--c----cCCc-eee-cccccccCcceec-c Q lcl|Aclame:pro 152 GIADPNGAIENAVELL---------TGVDADVTGIAINPSFRSALAKQK--D----LQGN-ALF-PELKWGATPDTIN-G 213 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l---------~~~~~~~~~~vm~~~~~~~L~~lk--d----~~G~-~l~-~~~~~~~~~~~l~-G 213 (298) .+--..+..+.++.++ ..-....+-++|+++....|.-.- + .+++ .+. .++......|.|+ | T Consensus 287 ~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r 366 (462) T protein:vir:10 287 NGRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGR 366 (462) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCc Confidence 1111223333333333 222345667999999999883221 1 0111 111 1111122245666 4 Q ss_pred eeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEe- Q lcl|Aclame:pro 214 LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGIL- 286 (298) Q Consensus 214 ~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~- 286 (298) ++|+++.+...++ ..+.+++|-- +.-.+ +-.+-..||.. .|... ||- .+.|+ .|++..+. T Consensus 367 ~~vy~D~Y~~~ns--~~dy~~vG~K-G~~~~----~~glfy~PYv~l~~~~~~dp~s---fqP-~~g~~--tRY~l~~NP 433 (462) T protein:vir:10 367 IKVYVDPYSSNVA--DKHFYVAGYK-GTSPY----DAGLFYCPYVPLQQVRAINPNT---FQP-KIGFK--TRYGMVSNP 433 (462) T ss_pred eEEEEecccCCCc--ccceEEEEEe-CCccc----ccceeeccccccccccccCCcc---ccc-eeeee--eeeeeeecC Confidence 7999988764321 2345555421 10000 01112222221 12211 332 13333 34544321 Q ss_pred ------ccc---------ceEEEeecC Q lcl|Aclame:pro 287 ------DAT---------KFARVTEAN 298 (298) Q Consensus 287 ------~~~---------a~~~l~~a~ 298 (298) ++. -|-++..++ T Consensus 434 ~t~~~~~~~~~~~~~~n~y~r~~~v~~ 460 (462) T protein:vir:10 434 FSGGLTQGSGALTANANKYYRRVQVAN 460 (462) T ss_pred CCCCcCCccccccccCcceeeeEEeec Confidence 111 122222222 No 240 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=69.71 E-value=0.22 Score=24.20 Aligned_cols=280 Identities=13% Similarity=0.032 Sum_probs=123.7 Q ss_pred Ceecccccc------------chhHHHHHHHHHHhhc--hhhhhcceeecCCCceEEEEEe---CCcceEEeeccccccc Q lcl|Aclame:pro 1 MVLNKGTLF------------DPELVTDLISKVAGKS--SIARLSAQKPIPFNGEKVFTFT---MDSEIDVVAESGKKTH 63 (298) Q Consensus 1 mat~gg~li------------p~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~~~ 63 (298) =|.+.|+-+ -+.+-.++-.+..... .+.+-....|..+.-.++-... ..+...+++|+.-.+. T Consensus 43 ~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~ 122 (514) T protein:vir:10 43 SAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDV 122 (514) T ss_pred hhhccccccCCccccCccchhhhhhccceeEeeecCcchhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcC Confidence 112222222 2222222222211111 2222223334443322332222 2345678999999999 Q ss_pred cccceeeEEEeeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cccccccccccccccc Q lcl|Aclame:pro 64 GGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPR-LGTASAVIGTNHFDSK 142 (298) Q Consensus 64 ~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~-~g~~~~~~~~~~~~~~ 142 (298) +++.+.+..+..+=++.-..+|.-+-++ ....+.++...+.-.-.++..+|.++|+|+..- ++....+..+.++.+. T Consensus 123 ~d~~~~rk~~~~k~l~~~~~vS~~~~l~--n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~l 200 (514) T protein:vir:10 123 NNPNERQRTINIKYIVDTHVTSIALQRA--NTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKL 200 (514) T ss_pred CCcceEEEEEeeeeeeeeeeeeehhhhc--cchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHh Confidence 9999999999998888776666654222 233567777888888889999999999996431 1222222333333333 Q ss_pred ccccccc-ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccc-------------cCc Q lcl|Aclame:pro 143 VTQKVEA-PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWG-------------ATP 208 (298) Q Consensus 143 ~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~-------------~~~ 208 (298) ....... ........+.|..+-..+...+..++-++|+..+.+.|..-....-|-+.+....+ .+. T Consensus 201 I~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~ 280 (514) T protein:vir:10 201 IAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGS 280 (514) T ss_pred hcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccc Confidence 3222222 22223344556666556666677888999999999988766554444443322111 001 Q ss_pred ceecc-----eeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc-----c---cccchhhhhcC---c Q lcl|Aclame:pro 209 DTING-----LPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD-----P---DNSGLDLKGYN---Q 272 (298) Q Consensus 209 ~~l~G-----~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~---~~~~~~~f~~n---~ 272 (298) -.|.| .+-......+.+...+. ...+.+.+.++.. . +.++..++..+ . T Consensus 281 I~L~gs~im~~~n~L~~~~~~~~~Ap~----------------~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~ 344 (514) T protein:vir:10 281 IRIQGSTIMDSDNKLDFDRPVSPTAPT----------------APQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVE 344 (514) T ss_pred eeecCCeeecccccCccCCccCCcCCC----------------CCcceEEEecCcccccCccccccccccccccccccee Confidence 11222 11111111110000000 0001111111100 0 00000000000 0 Q ss_pred EEEEEEEEEcc-----------------------------EEecccceEEEeecC Q lcl|Aclame:pro 273 VYIRAELFLGW-----------------------------GILDATKFARVTEAN 298 (298) Q Consensus 273 v~~r~~~r~~~-----------------------------~v~~~~a~~~l~~a~ 298 (298) -.+++...-+. .-..|+.|.+-.... T Consensus 345 ~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv~IYR~~~ 399 (514) T protein:vir:10 345 QSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYVAIYRKSN 399 (514) T ss_pred EEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceEEEEeccC Confidence 01222111111 122233333333322 No 241 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=66.65 E-value=0.26 Score=23.75 Aligned_cols=280 Identities=11% Similarity=0.002 Sum_probs=111.8 Q ss_pred CeeccccccchhHHHHHHHHHH---hhchhhhhcceeecCCCce-------EEEEEeC------------Ccc------- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA---GKSSIARLSAQKPIPFNGE-------KVFTFTM------------DSE------- 51 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~---~~s~i~~~~~~~~~~~~~~-------~ip~~~~------------~~~------- 51 (298) .+.+.+.---..+.+.+|..+| .+.+...++.+.||++... +++.... .++ T Consensus 78 i~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~ 157 (528) T protein:vir:80 78 IAAGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLA 157 (528) T ss_pred ccccccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCcccccccccccccccccccccccc Confidence 1212211111122333444444 3445567888888865411 1100000 000 Q ss_pred -------------------------------------------------------------------------------e Q lcl|Aclame:pro 52 -------------------------------------------------------------------------------I 52 (298) Q Consensus 52 -------------------------------------------------------------------------------a 52 (298) . T Consensus 158 t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~ 237 (528) T protein:vir:80 158 AKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMA 237 (528) T ss_pred ccccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccc Confidence 0 Q ss_pred EEeec---------cccccccccceeeEEEeeeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 53 DVVAE---------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHG 122 (298) Q Consensus 53 ~~v~E---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G 122 (298) .-.+| +.++++-..++++++..+|.=+=...+|-||.|+--. -..|.+++|..-|+..|...+++-++.- T Consensus 238 Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~ 317 (528) T protein:vir:80 238 TSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDV 317 (528) T ss_pred hhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhh Confidence 00112 1224455555666666666555566788898654322 1245566666767777777777777532 Q ss_pred cccccccc--cccc----cccccccccccccccc-ccc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHH Q lcl|Aclame:pro 123 VNPRLGTA--SAVI----GTNHFDSKVTQKVEAP-RGI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAK 189 (298) Q Consensus 123 ~~~~~g~~--~~~~----~~~~~~~~~~~~~~~~-~~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~ 189 (298) .+. +... .... ...++..........+ -.. ..++-.|...-+.+... +...+.++|+++....|.. T Consensus 318 i~~-~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~ 396 (528) T protein:vir:80 318 INF-TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) T ss_pred hhh-eeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhh Confidence 110 0000 0000 0001111111100000 000 11122233333333332 2344679999999998865 Q ss_pred hh-----c-cCCceeeccccccc-Ccceec-ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEee Q lcl|Aclame:pro 190 QK-----D-LQGNALFPELKWGA-TPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQ 256 (298) Q Consensus 190 lk-----d-~~G~~l~~~~~~~~-~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~ 256 (298) .- + ......+..+.+.. ..|.|. |++|+++++.+. +.+++|-- ..+..|.+=-++.+... T Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfy~PYv~l~~~~~- 469 (528) T protein:vir:80 397 ADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQ------DYFTVGYKGDNEMDAGIYYAPYVALTPLRA- 469 (528) T ss_pred ccccccccccccccccccCCCCceEEEEecCceEEEecCCCCc------ceEEEEEeCCcccccceeecccccceeeEe- Confidence 31 1 11122222222222 145566 479999988753 34444421 11111211111211111 Q ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEeecC Q lcl|Aclame:pro 257 YGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) Q Consensus 257 ~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a-------~~~l~~a~ 298 (298) .|.. -||- .+.|+ .|+++.+ +|=+ .+++.+.+ T Consensus 470 ---~dp~---sfqP-~~g~~--tRY~l~~-NP~~~~~~~~~~~r~~~g~ 508 (528) T protein:vir:80 470 ---TDPQ---SFHP-VLGFK--TRYGIGI-NPFADSKSQAPSARITSGM 508 (528) T ss_pred ---eCCc---cccc-eeeee--eeeceee-cCcccccCCcccccccccc Confidence 1111 1332 23333 3565543 3311 22332222 No 242 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=60.79 E-value=0.37 Score=22.98 Aligned_cols=280 Identities=11% Similarity=-0.014 Sum_probs=111.0 Q ss_pred CeeccccccchhHHHHHHHHH---HhhchhhhhcceeecCCCce-------EEEEEeC------------CcceEE---- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKV---AGKSSIARLSAQKPIPFNGE-------KVFTFTM------------DSEIDV---- 54 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~---~~~s~i~~~~~~~~~~~~~~-------~ip~~~~------------~~~a~~---- 54 (298) .+.+.+.---.++.+.++..+ ....+...++.+.||++... +++.... .+++.| T Consensus 77 i~~~~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~ 156 (519) T protein:vir:10 77 IAAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQG 156 (519) T ss_pred cccccccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccc Confidence 222221111123333444444 33445567777777765321 1111100 000000 Q ss_pred -------------------------------------------------------------------------eec---- Q lcl|Aclame:pro 55 -------------------------------------------------------------------------VAE---- 57 (298) Q Consensus 55 -------------------------------------------------------------------------v~E---- 57 (298) .+| T Consensus 157 ~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~ 236 (519) T protein:vir:10 157 AAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEG 236 (519) T ss_pred cccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhcccc Confidence 011 Q ss_pred -----cccccccccceeeEEEeeeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-ccc Q lcl|Aclame:pro 58 -----SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-GTA 130 (298) Q Consensus 58 -----~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~-g~~ 130 (298) +.++++-..++++++..+|.=+=...+|-||.|+--. -..|.+++|..-|+-.|...+++-++.=.+-.. -.. T Consensus 237 lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~ 316 (519) T protein:vir:10 237 FNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGK 316 (519) T ss_pred CCCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcce Confidence 1124455555666666666555556788998664322 124556667777777777777777664211000 000 Q ss_pred cccccc----cccccccccccccccc-c----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhh--c---c- Q lcl|Aclame:pro 131 SAVIGT----NHFDSKVTQKVEAPRG-I----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQK--D---L- 193 (298) Q Consensus 131 ~~~~~~----~~~~~~~~~~~~~~~~-~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lk--d---~- 193 (298) .+.+.. .++............- . ...+-.|...-+.+... +...+-++|+++....|...- + . T Consensus 317 ~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~ 396 (519) T protein:vir:10 317 SGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQ 396 (519) T ss_pred eecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccc Confidence 000000 0111111111100000 0 11233344444444332 334467999999998886543 1 1 Q ss_pred CCceeeccccccc-Ccceec-ceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccch Q lcl|Aclame:pro 194 QGNALFPELKWGA-TPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGL 265 (298) Q Consensus 194 ~G~~l~~~~~~~~-~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~ 265 (298) ..+..+..+.+.. ..|.|. |++|+++++.+. +.+++|-- ....+ .-.+-..||.. .|.. T Consensus 397 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~K-G~~~~----~~glfyaPYv~l~~~~~~dp~-- 463 (519) T protein:vir:10 397 GLGQGFNVDTTKAVFAGVLGGKYRVYIDQYARS------DYFTIGYK-GSNEM----DAGIYYAPYVALTPLRGSDPK-- 463 (519) T ss_pred cccccccccCCCceEEEEecCceEEEecCCCCc------ceEEEEEe-cCccc----ccceeeccccccccccccCCc-- Confidence 1111122221111 134555 479999988763 34444421 00000 00111222221 1111 Q ss_pred hhhhcCcEEEEEEEEEccEEecccc-------eEEEeecC Q lcl|Aclame:pro 266 DLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) Q Consensus 266 ~~f~~n~v~~r~~~r~~~~v~~~~a-------~~~l~~a~ 298 (298) + ||- .+.|+ .|++..+ +|=+ -+++.+.. T Consensus 464 s-fqP-~~g~~--tRY~l~~-NP~~~~~~~~~~~~i~~g~ 498 (519) T protein:vir:10 464 N-FQP-VMGFK--TRYGIGI-NPFADPAAQAPTKRIQNGM 498 (519) T ss_pred c-ccc-eeeee--eeeceee-cCcccccccCccceeccCc Confidence 1 332 23333 3565543 3311 11222111 No 243 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=59.00 E-value=0.4 Score=22.76 Aligned_cols=271 Identities=12% Similarity=0.022 Sum_probs=118.0 Q ss_pred CeeccccccchhHHHHHHHHHHhhch--hhhhcc--eeecCCCceEEEEEeCCcce-EEeecccccc--ccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSS--IARLSA--QKPIPFNGEKVFTFTMDSEI-DVVAESGKKT--HGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~--i~~~~~--~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~--~~~~~~~~v~l 73 (298) ||.+=.+ .+.+.+.+-+.....+. .+.-.+ +.-.++..++||+.+..+-. +-...+.... +-+.+++..+| T Consensus 1 Mantl~y--a~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl 78 (312) T protein:vir:10 1 MANTLAY--GQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTM 78 (312) T ss_pred CCcchhH--HHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEe Confidence 9844222 24555555555444432 222111 22345567999998754322 2232232222 23445556666 Q ss_pred eeeEEEEE-EeecHHHhhccccc--HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 74 VPIKVEYG-ARISDEFMYASDEE--KINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 74 ~~~k~~~~-~~iS~ell~~~~d~--~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) ...+--.+ +.--+ .++. ...+...+.+...+...=.+|...|.-. ........+... .....+ T Consensus 79 ~qDR~~~F~vD~mD-----vDETn~~~s~anv~~ef~r~~vvPEiDayrfskl------a~~a~~~~~~~~---~~~~~~ 144 (312) T protein:vir:10 79 TQDRGRKFTLDAMD-----VDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRL------ATIAIGIKGDTN---VEYSYS 144 (312) T ss_pred eecccceeeccccc-----hhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHH------Hhhhhccccccc---cccccc Confidence 55553221 11111 1111 1223333333344445555666655321 000000000000 011122 Q ss_pred cccchhHHHHHHHhhhhhhcCCc-ccEEEEcHHHHHHHHHhhccCCcee---ecccccccCcceecceeeEe--cCcccc Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDAD-VTGIAINPSFRSALAKQKDLQGNAL---FPELKWGATPDTINGLPVDV--NKTVSD 224 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~-~~~~vm~~~~~~~L~~lkd~~G~~l---~~~~~~~~~~~~l~G~PV~~--s~~~~~ 224 (298) -+.+..++.|.+++.++...+.. +-.++|+|.+...|++.. ..+.. +....-....++|.|+||+. ++.|.. T Consensus 145 ~T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~--~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t 222 (312) T protein:vir:10 145 VNSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEKV--LEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYS 222 (312) T ss_pred cCHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhhh--hceecccccccceeeeeeeeecccEEEEchhhhccc Confidence 24456789999999999887765 335789998887776521 11111 12222244567899999974 233321 Q ss_pred cc------------------ccccc-eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEE Q lcl|Aclame:pro 225 MS------------------LTQRD-RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGI 285 (298) Q Consensus 225 ~~------------------~~~~~-~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v 285 (298) .. ..++. -.++..-+ +. +.+...-.+.+++ .+.++. -|...+.-+.+.|.=| T Consensus 223 ~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~-a~-i~~~K~~~~~if~-P~~~~~------~d~~~~~~R~Y~D~fv 293 (312) T protein:vir:10 223 SILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVD-VP-LAITKQDKMRIFD-PETNQT------ANAWSMDYRRYHDLWV 293 (312) T ss_pred eeeeccCcccccccCceeecCcccccceEEeCCc-ee-eceeeeeeeeeeC-CCCCCC------cceeeeeeeeeeeeee Confidence 10 00111 12222211 21 2222222233321 111111 1223444455666666 Q ss_pred ecc--cce-EEEeecC Q lcl|Aclame:pro 286 LDA--TKF-ARVTEAN 298 (298) Q Consensus 286 ~~~--~a~-~~l~~a~ 298 (298) .+. +++ +-++.|. T Consensus 294 ~~nk~~~Iyv~~k~a~ 309 (312) T protein:vir:10 294 TDNKANSVYANFKDAK 309 (312) T ss_pred eccccCeEEEEeeccc Confidence 653 333 5566666 No 244 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=58.73 E-value=0.41 Score=22.73 Aligned_cols=296 Identities=11% Similarity=0.025 Sum_probs=123.5 Q ss_pred CeeccccccchhHHHHHHHHHH-hhch-h-hhhcceeecCCCceEEEE-EeCCc-ceEEeecccccc-ccccceeeEEEe Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA-GKSS-I-ARLSAQKPIPFNGEKVFT-FTMDS-EIDVVAESGKKT-HGGVTLAPQTMV 74 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~-~~s~-i-~~~~~~~~~~~~~~~ip~-~~~~~-~a~~v~E~~~~~-~~~~~~~~v~l~ 74 (298) |++--..+-++++ ..++..+. .... + ..+.+..++......... ..... .+.+++.+.+.+ ...-.++..+.. T Consensus 1 M~~l~d~f~~~~l-~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:49 1 MGLIYDKVTASNI-AGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMHDEQ 79 (348) T ss_pred CcchhhhcCHHHH-HHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecccceeeeeee Confidence 8775433333443 33443322 2222 2 345665555544444333 23333 456777665544 344556666677 Q ss_pred eeEEEEEEeecHHHh------hccc--ccHHHHHHHHH---HHHHHHHHHHHHHHHh----cccc--ccccccccc-ccc Q lcl|Aclame:pro 75 PIKVEYGARISDEFM------YASD--EEKINILQAFN---DGFAKKVARGIDLMAF----HGVN--PRLGTASAV-IGT 136 (298) Q Consensus 75 ~~k~~~~~~iS~ell------~~~~--d~~~~l~~~i~---~~la~~i~~~~d~~~l----~G~~--~~~g~~~~~-~~~ 136 (298) +-.++-...++.+=+ .... ...-.+...|. +.+.+.+.+.+|.++. +|.- .+.|..-.. .+. T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~vdyg~ 159 (348) T protein:vir:49 80 MPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEEEeecC Confidence 666666555653321 1100 00011222222 2233445555554443 3310 011110000 010 Q ss_pred cccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHH---hh---cc-CCcee-ecccccccCc Q lcl|Aclame:pro 137 NHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK---QK---DL-QGNAL-FPELKWGATP 208 (298) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~---lk---d~-~G~~l-~~~~~~~~~~ 208 (298) . .....+....+.....+++.+|.+....+...+..++.++|++++|..|++ .+ +. ++... ..+......- T Consensus 160 ~-~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~ 238 (348) T protein:vir:49 160 K-PDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDNYI 238 (348) T ss_pred C-cccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHHHH Confidence 0 011122334567777889999999988888888888999999999998854 22 11 11110 1111111112 Q ss_pred ceecceeeEe-cCcccccccc-----ccceEEEeec-c-ceEEEEee-cceEEEEeeccccc----c--cchhhhhcC-- Q lcl|Aclame:pro 209 DTINGLPVDV-NKTVSDMSLT-----QRDRAIIGDF-A-NGFKWGYA-KEVPLEVIQYGDPD----N--SGLDLKGYN-- 271 (298) Q Consensus 209 ~~l~G~PV~~-s~~~~~~~~~-----~~~~~~~gd~-~-~~~~~~~~-~~~~i~~~~~~~~~----~--~~~~~f~~n-- 271 (298) .++.|+++++ +....+..+. ..+.++++-- . +...|+.- +............+ + ..+..|.++ T Consensus 239 ~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP 318 (348) T protein:vir:49 239 ADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDP 318 (348) T ss_pred HhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeeeecCCC Confidence 3455777754 2222111111 1112222111 1 11122211 00000000000000 0 000011111 Q ss_pred -cEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 272 -QVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 272 -~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ...+++..+.==.+.+|+++.++|--+ T Consensus 319 ~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 346 (348) T protein:vir:49 319 VNVQTKVSMVALPSFERLDDVYMLTVIP 346 (348) T ss_pred ceEEEEEeeeccccccCCCcEEEEEEec Confidence 122333333222345788888877777 No 245 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=55.50 E-value=0.48 Score=22.34 Aligned_cols=276 Identities=11% Similarity=-0.017 Sum_probs=97.8 Q ss_pred CeeccccccchhHHHHHHHHHHhh-chhhhhcceee----------cC-CCceEEEEEeCCcceEEeec---------cc Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGK-SSIARLSAQKP----------IP-FNGEKVFTFTMDSEIDVVAE---------SG 59 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~-s~i~~~~~~~~----------~~-~~~~~ip~~~~~~~a~~v~E---------~~ 59 (298) .++..|......+...-....... +....-...-| .. +....+. .+-..-.+| +. T Consensus 174 ~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~----~GmsTA~aEaL~~~g~ss~~ 249 (524) T protein:vir:98 174 TAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEIS----VGMATSVAELQENFNGSSAN 249 (524) T ss_pred cccccccccccccccccceeccccccCcccccccccccccccccccccccceeecc----cccchhhhhhhccCCCCccc Confidence 111111111000000000000000 00000000000 00 0001111 111111122 44 Q ss_pred cccccccceeeEEEeeeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cccccccc-- Q lcl|Aclame:pro 60 KKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-GTASAVIG-- 135 (298) Q Consensus 60 ~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~-g~~~~~~~-- 135 (298) .+++-..++++++..+|.=+=...+|-||.|+--. -..|.+++|..-|+-.|...+++-++.-..... -...+... T Consensus 250 ~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~ 329 (524) T protein:vir:98 250 PWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTV 329 (524) T ss_pred cccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeeccccc Confidence 57778888888777777666667889998764322 124455666666666666666666663211000 00000000 Q ss_pred --cccccccccccc-cccccc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhh----ccCCcee--ec Q lcl|Aclame:pro 136 --TNHFDSKVTQKV-EAPRGI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQK----DLQGNAL--FP 200 (298) Q Consensus 136 --~~~~~~~~~~~~-~~~~~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lk----d~~G~~l--~~ 200 (298) ..++........ ..+-.. ...+-.|...-+.+... +...+-++|+++....|..+- +..+.-- .. T Consensus 330 ~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~ 409 (524) T protein:vir:98 330 GSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLN 409 (524) T ss_pred ccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccc Confidence 001111111111 001011 11233344444444332 234667999999998887531 1111100 00 Q ss_pred ccccc-cCcceec-ceeeEecCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCc Q lcl|Aclame:pro 201 ELKWG-ATPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQ 272 (298) Q Consensus 201 ~~~~~-~~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~ 272 (298) .+.+. -..|.|. |++|+++++.+. +.+++|-- +...+ .-.+-..||.. .|.. + ||- . T Consensus 410 ~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~K-G~~~~----~~glfyaPYv~l~~~~~~dp~--s-fqP-~ 474 (524) T protein:vir:98 410 VDTTKAVFAGVLGGTYKVYIDQYARQ------DYFTVGFK-GDNEM----DAGIYYAPYVALTPLRGSDPK--N-FQP-V 474 (524) T ss_pred cCCccceEEEEecCceEEEecCCCCc------ceEEEEee-CCccc----ccceeeccccccccccccCCc--c-ccc-e Confidence 01111 0123444 579999988753 34444421 00000 00111122211 1111 1 332 1 Q ss_pred EEEEEEEEEccEEecccc--------eEEEeecC Q lcl|Aclame:pro 273 VYIRAELFLGWGILDATK--------FARVTEAN 298 (298) Q Consensus 273 v~~r~~~r~~~~v~~~~a--------~~~l~~a~ 298 (298) +.|+ .|+++.+ +|=+ -..+.+.+ T Consensus 475 ~g~~--tRY~l~~-NP~~~~~~~~~~~ri~~g~~ 505 (524) T protein:vir:98 475 MGFK--TRYGIGI-NPFANSRSQAPADRITSGMI 505 (524) T ss_pred eeee--eeeceee-cCcccccCCccccccccCcc Confidence 3333 3554432 2211 01111111 No 246 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=55.21 E-value=0.48 Score=22.31 Aligned_cols=291 Identities=11% Similarity=0.008 Sum_probs=119.5 Q ss_pred Ceeccc-cccchhHHHHHHHHHHhh-----chhhhhcceeecCCCceEEEEEe-CC-cceEEeeccccccccc-cceeeE Q lcl|Aclame:pro 1 MVLNKG-TLFDPELVTDLISKVAGK-----SSIARLSAQKPIPFNGEKVFTFT-MD-SEIDVVAESGKKTHGG-VTLAPQ 71 (298) Q Consensus 1 mat~gg-~lip~~~~~~ii~~~~~~-----s~i~~~~~~~~~~~~~~~ip~~~-~~-~~a~~v~E~~~~~~~~-~~~~~v 71 (298) |+..=- .++.+.....++..+... -....+.+..++.+-.+.+-... .. ..+.+++.+.+.+..+ ..++.. T Consensus 1 M~~~~~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~~ 80 (348) T protein:vir:98 1 MSWTLDTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAKV 80 (348) T ss_pred CcchhhhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeecccceee Confidence 666442 234444445555443211 12345666555543333332211 12 2356777776666433 457777 Q ss_pred EEeeeEEEEEEeecHH-HhhcccccHHHHHHHHH---HHHHHHHHHHHHH----HHhccccccccccccc-ccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDE-FMYASDEEKINILQAFN---DGFAKKVARGIDL----MAFHGVNPRLGTASAV-IGTNHFDSK 142 (298) Q Consensus 72 ~l~~~k~~~~~~iS~e-ll~~~~d~~~~l~~~i~---~~la~~i~~~~d~----~~l~G~~~~~g~~~~~-~~~~~~~~~ 142 (298) +..+-.++-...++.+ +++..-.....+...+. +.+.+.+.+.+|. ++.+|--.-.|..-.. .+.... .. T Consensus 81 ~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~-~~ 159 (348) T protein:vir:98 81 MGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGS-HS 159 (348) T ss_pred eeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcc-cc Confidence 7777777666666654 22110011112223333 3445556666653 3333421111111000 000000 11 Q ss_pred cccccccc-cccchhHHHHHHHhhhhhhc-CCcccEEEEcHHHHHHHHH---hhcc-------CCceeecccccccCcce Q lcl|Aclame:pro 143 VTQKVEAP-RGIADPNGAIENAVELLTGV-DADVTGIAINPSFRSALAK---QKDL-------QGNALFPELKWGATPDT 210 (298) Q Consensus 143 ~~~~~~~~-~~~~~~~~~i~~~~~~l~~~-~~~~~~~vm~~~~~~~L~~---lkd~-------~G~~l~~~~~~~~~~~~ 210 (298) .+....+. ....+++.+|.+...++... +..+..++|++++|..|++ +++. +..++..+.....-- . T Consensus 160 ~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~ 238 (348) T protein:vir:98 160 VVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTVL-S 238 (348) T ss_pred cccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHHH-H Confidence 22333343 34567889999998888765 6788899999999998863 3321 111222111111001 1 Q ss_pred ecce-eeEecC-ccccccccccceEEEeeccceEEEEeecc---------e-------EEEEeec--cc----ccccchh Q lcl|Aclame:pro 211 INGL-PVDVNK-TVSDMSLTQRDRAIIGDFANGFKWGYAKE---------V-------PLEVIQY--GD----PDNSGLD 266 (298) Q Consensus 211 l~G~-PV~~s~-~~~~~~~~~~~~~~~gd~~~~~~~~~~~~---------~-------~i~~~~~--~~----~~~~~~~ 266 (298) -+|. ++.+-+ .+... + ...-++.+ +.+.+...+. + ..+..+. .. ..+-.+. T Consensus 239 ~~g~~~i~~~d~~~~~~-g--~~~~~~p~--~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~ 313 (348) T protein:vir:98 239 SMGLPPIEVYDAKVAVD-G--VSTRITPA--NAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVAA 313 (348) T ss_pred hhCCeEEEEeeeEEEcC-C--ceeceecC--CeEEEEecCCcccccccccccceecccchhhhccccccceeccCceeee Confidence 2344 344322 22221 1 11112211 0111100000 0 0000000 00 0000000 Q ss_pred hhhc-C--cEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 267 LKGY-N--QVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 267 ~f~~-n--~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) .|.+ | ...+++..+.==.+.+|+++++++..= T Consensus 314 ~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 314 TWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 0111 1 233334444323345677777775544 No 247 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=54.31 E-value=0.51 Score=22.20 Aligned_cols=267 Identities=11% Similarity=0.016 Sum_probs=104.2 Q ss_pred Ceeccccccc--hhHH-HHHHHHHHhhchhhhhcceeecCCCceEEEE--EeCCcceE--Eeec-cccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFD--PELV-TDLISKVAGKSSIARLSAQKPIPFNGEKVFT--FTMDSEID--VVAE-SGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip--~~~~-~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~--~~~~~~a~--~v~E-~~~~~~~~~~~~~v~ 72 (298) .+..++.-.. +... ..+...+ . .++.+.+.. .++..++- +-+. +..+++-..++++++ T Consensus 194 ~a~~t~~~t~~~~~~~~~ai~s~~-~-------------~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvT 259 (522) T protein:vir:69 194 SAQVTISSSADDAAKLDAEIIKQM-E-------------AGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQV 259 (522) T ss_pred ccCCcCCCCCcccccccchhcccc-c-------------cccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEE Confidence 2211111111 1110 1111110 0 011111111 11112211 1111 346788888899888 Q ss_pred EeeeEEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhcccc-cccccccccc----cccccccccccc Q lcl|Aclame:pro 73 MVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVN-PRLGTASAVI----GTNHFDSKVTQK 146 (298) Q Consensus 73 l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~-~~~g~~~~~~----~~~~~~~~~~~~ 146 (298) ..+|.=+=...+|-||.|+--. -..|.+++|..-|+-.|...+++-++.-.. ...-...+.. ...++....... T Consensus 260 VtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~ 339 (522) T protein:vir:69 260 IEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPI 339 (522) T ss_pred EeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeeccccc Confidence 8777766677889998764322 124455666666666666666666663211 0000000000 001111111111 Q ss_pred ccc-cccc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHhh-----ccCC-ceeeccccccc-Ccceec Q lcl|Aclame:pro 147 VEA-PRGI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQK-----DLQG-NALFPELKWGA-TPDTIN 212 (298) Q Consensus 147 ~~~-~~~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~lk-----d~~G-~~l~~~~~~~~-~~~~l~ 212 (298) ..- +-.. ..++-.|...-..+... ....+-++|+++....|...- .+.| ..=|..+.+.. ..+.|. T Consensus 340 ~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~ 419 (522) T protein:vir:69 340 DIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLG 419 (522) T ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEec Confidence 100 0000 01122233333333332 234667999999999886531 1111 11122222221 124555 Q ss_pred -ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEe Q lcl|Aclame:pro 213 -GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL 286 (298) Q Consensus 213 -G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 286 (298) |++|+++++.+. +.+++|-- ..+..|.+=-+++ +.+ ..|.. + ||- .+.|+ .|+++.+ T Consensus 420 ~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfyaPYv~l~--~~~--~~dp~--s-fqP-~~g~~--tRY~l~v- 482 (522) T protein:vir:69 420 GKYRVYIDQYAKQ------DYFTVGYKGANEMDAGIYYAPYVALT--PLR--GSDPK--N-FQP-VMGFK--TRYGIGV- 482 (522) T ss_pred CceEEEecCCCCc------ceEEEEEeCCcccccceeeccccccc--ccc--ccCCc--c-ccc-eeeee--eeeceee- Confidence 479999988753 34454421 1111111111111 111 11211 1 332 23344 3565543 Q ss_pred cc-------cceEEEeecC Q lcl|Aclame:pro 287 DA-------TKFARVTEAN 298 (298) Q Consensus 287 ~~-------~a~~~l~~a~ 298 (298) +| +-.++|.+.+ T Consensus 483 NP~~~~~~~~~~~ri~~g~ 501 (522) T protein:vir:69 483 NPFAESSLQAPGARIQSGM 501 (522) T ss_pred cCcccccCCcccceeeccc Confidence 33 1123444444 No 248 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=53.19 E-value=0.37 Score=22.94 Aligned_cols=107 Identities=13% Similarity=0.059 Sum_probs=57.9 Q ss_pred EEcHHHHHHHHHh-------hccCCceeecccccccCcceecceeeEecCccccccccccceEEEeeccc------eEEE Q lcl|Aclame:pro 178 AINPSFRSALAKQ-------KDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN------GFKW 244 (298) Q Consensus 178 vm~~~~~~~L~~l-------kd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~~~~~gd~~~------~~~~ 244 (298) +++...|+.+... --.+.++++.+ +-+-+++|+.-+.+.++|.+...--|.-++|-|.+ .|.- T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG----~lpV~~~GltWl~tpnlpg~~a~vlDst~lGgmaDE~l~~Pgya~ 76 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTG----SLPVSAYGLTWVTSRHITGTDPWLFDVEQLGGMADEKLLSPEFAP 76 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEec----CcceeeeceeeeecCCCCCCccceeehhhhccccccccCCCcccC Confidence 2222223332221 11234555543 44557888888999999865433334444554432 1111 Q ss_pred EeecceEEEEeecccccccchhhhh--cCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 245 GYAKEVPLEVIQYGDPDNSGLDLKG--YNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 245 ~~~~~~~i~~~~~~~~~~~~~~~f~--~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ....+++++..++ -+ ++...+|++.-----+..|.|.++|++.- T Consensus 77 ~~~~Gvevkt~Re----------d~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g 122 (123) T protein:vir:78 77 AGNTGVEASTERA----------HQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTG 122 (123) T ss_pred CCCcceeEEeecc----------ccCCCCceEEeeeecceeEEecCccceEEeeec Confidence 1122233332221 12 57778888766666677999999999988 No 249 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=52.98 E-value=0.54 Score=22.05 Aligned_cols=265 Identities=13% Similarity=0.106 Sum_probs=100.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcc-eeec------CCCceEEEEEeCCc-ceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSA-QKPI------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~-~~~~------~~~~~~ip~~~~~~-~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||++=+.++-..+ .++|+.+....++.+.+. .+|. .+..+++|...... ..+|.--+ ..+ ...=+++. T Consensus 1 Ma~~~~~~lti~~-~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t~-~~~--~~~e~~v~ 76 (430) T protein:vir:21 1 MALNEGQIVTLAV-DEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTD-KAT--GLLELNVA 76 (430) T ss_pred CccccchhhHHHH-HHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccccccccC-CCc--cceeeeEe Confidence 9999777766555 999999999999888654 3332 23356777643322 22331111 111 22223333 Q ss_pred EeeeE---EEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIK---VEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTASAVIGTNHFDSKVTQKVE 148 (298) Q Consensus 73 l~~~k---~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) ++..+ +..... ++||. ..+ ..+++|+..+ ++++..+|..++.-... ++.. .+.... T Consensus 77 ~~~~~~~~V~~~~~-~kEl~--~~~---~~er~l~pAm-~~LA~~Vd~dl~~~~~~~~~~v-------------~~~~~~ 136 (430) T protein:vir:21 77 VNMGEPDNDFFQLR-ADDLR--DET---AYRRRIQSAA-RKLANNVELKVANMAAEMGSLV-------------ITSPDA 136 (430) T ss_pred EEEeeeccceEEee-hhHhc--Chh---hHHHHHHHHH-HHHHHHHHHHHHHHhhhhhhcc-------------ccccCC Confidence 32222 222222 56642 222 2344554444 77888888888743100 0000 000001 Q ss_pred cccccchhHHHHHHHhhhhhhcCCcc---cEEEEcHHHHHHHHH-h---hccCCceeecccccccCcc-eeccee-eEec Q lcl|Aclame:pro 149 APRGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK-Q---KDLQGNALFPELKWGATPD-TINGLP-VDVN 219 (298) Q Consensus 149 ~~~~~~~~~~~i~~~~~~l~~~~~~~---~~~vm~~~~~~~L~~-l---kd~~G~~l~~~~~~~~~~~-~l~G~P-V~~s 219 (298) ....+.+.+.++.++-..|....... -..+++|.....|.. + ...+.. . ......+.-+ .+.|+- +.-+ T Consensus 137 t~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~-~-~~A~r~g~i~r~~~Gfd~~~~s 214 (430) T protein:vir:21 137 IGTNTADAWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-P-EEAYRDGTIQRQVAGFDDVLRS 214 (430) T ss_pred CCCCCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccc-h-hHHHhhcccccccchhhhhhhc Confidence 11122223455555555554444432 257999988876633 2 121111 0 0011112122 255553 4445 Q ss_pred CccccccccccceE-EEeecc----------ceEEEEe-ecceEEEEeecccccccchhhhhcCcEEEEE---EEEEccE Q lcl|Aclame:pro 220 KTVSDMSLTQRDRA-IIGDFA----------NGFKWGY-AKEVPLEVIQYGDPDNSGLDLKGYNQVYIRA---ELFLGWG 284 (298) Q Consensus 220 ~~~~~~~~~~~~~~-~~gd~~----------~~~~~~~-~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~---~~r~~~~ 284 (298) +.+|.-..+..... +-|-.. .+..... .+...+.++.-.. +..-+.+.|-+ .-++... T Consensus 215 ~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~-------l~~GD~ftiaGV~~v~~itk~ 287 (430) T protein:vir:21 215 PKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTG-------MKRGDKISFAGVKFLGQMAKN 287 (430) T ss_pred CCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccc-------eecccEEEecceeeecccccc Confidence 55553222111111 111110 1111100 0111111111100 00001111100 0000000 Q ss_pred Ee-cccceEEEeecC Q lcl|Aclame:pro 285 IL-DATKFARVTEAN 298 (298) Q Consensus 285 v~-~~~a~~~l~~a~ 298 (298) +. ++.=|+++..++ T Consensus 288 ~~~~l~qf~V~a~~~ 302 (430) T protein:vir:21 288 VLAQDATFSVVRVVD 302 (430) T ss_pred ccCCcceEEEEEecC Confidence 00 111122222122 No 250 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=51.27 E-value=0.59 Score=21.86 Aligned_cols=270 Identities=10% Similarity=-0.001 Sum_probs=99.6 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeec---cccccccccceeeEEEeee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAE---SGKKTHGGVTLAPQTMVPI 76 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E---~~~~~~~~~~~~~v~l~~~ 76 (298) ..+.+.......... .. .+... -+... ....+..--....++-.++ +..+++-..++++++..++ T Consensus 141 ~~~~~~~~~~~~~gt--~~------~~~~~---~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAK 209 (457) T protein:vir:10 141 YDPGATGVTNDAEGT--NP------ALLND---SPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTAR 209 (457) T ss_pred ccccccccccccccc--cc------cccCc---cccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeee Confidence 111111111111000 00 00000 00000 0001111001112222222 2346777777788888777 Q ss_pred EEEEEEeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccc--cccccccccccccccccccccccccccc Q lcl|Aclame:pro 77 KVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNP--RLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) Q Consensus 77 k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) .=+=...+|-||.++--. -..|.+++|..-|+-.|...+++-++.-... .-+...+ ....++....... .+.+. T Consensus 210 SRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~-~~~~gv~dl~~~~--~g~~~ 286 (457) T protein:vir:10 210 ARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNN-TATAGVFDLDVDS--NGRWS 286 (457) T ss_pred ccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccc-cccceeeeeeccc--cchhh Confidence 766677889998664221 1133445555555555555555555432110 0011011 1111221211110 01111 Q ss_pred chh-----HHHHHHHhhh-hhhcCCcccEEEEcHHHHHHHHH--hhc------cCCceeecccccccCcceec-ceeeEe Q lcl|Aclame:pro 154 ADP-----NGAIENAVEL-LTGVDADVTGIAINPSFRSALAK--QKD------LQGNALFPELKWGATPDTIN-GLPVDV 218 (298) Q Consensus 154 ~~~-----~~~i~~~~~~-l~~~~~~~~~~vm~~~~~~~L~~--lkd------~~G~~l~~~~~~~~~~~~l~-G~PV~~ 218 (298) ... +...+.+-.. ...-....+.++|+++....|.. ..+ .+.+..-.+.......|.|+ |++|++ T Consensus 287 ~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~ 366 (457) T protein:vir:10 287 VEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYV 366 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEE Confidence 111 1111222111 12234456679999999988865 211 11111001112222245666 479999 Q ss_pred cCccccccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEecccceE Q lcl|Aclame:pro 219 NKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGILDATKFA 292 (298) Q Consensus 219 s~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~ 292 (298) +.+...+ ...+.+++|-- +...+ .-.+-..||.. .|.. -||- .+.|+ .|++. ..+|-... T Consensus 367 D~Ya~~n--s~~dy~~vG~K-G~~~~----~~glfy~PYv~l~~~~~~dp~---sfqP-~~g~~--tRY~l-~~NP~~~~ 432 (457) T protein:vir:10 367 DPYSANV--ADKHFYVAGYK-GTSPY----DAGLFYCPYVPLQQVRAINPD---TFQP-KIGFK--TRYGM-VSNPFAGG 432 (457) T ss_pred ecccccC--CccceEEEEEe-CCcce----ecceeecccccccccCccCCc---cccc-eeeee--eeeee-eecccccc Confidence 9776422 12345555421 10000 01111222221 1221 1332 23333 46766 55655332 Q ss_pred EE-------eecC Q lcl|Aclame:pro 293 RV-------TEAN 298 (298) Q Consensus 293 ~l-------~~a~ 298 (298) += ++.| T Consensus 433 ~~~~~~~~~~~~n 445 (457) T protein:vir:10 433 LTQGSGALTVNAN 445 (457) T ss_pred cccccccccccch Confidence 11 1222 No 251 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=51.22 E-value=0.59 Score=21.85 Aligned_cols=281 Identities=10% Similarity=0.007 Sum_probs=107.8 Q ss_pred CeeccccccchhHHHHHHHHHH---hhchhhhhcceeecCCCceEEE--E---EeCCc--------ceEEee-------- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA---GKSSIARLSAQKPIPFNGEKVF--T---FTMDS--------EIDVVA-------- 56 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~---~~s~i~~~~~~~~~~~~~~~ip--~---~~~~~--------~a~~v~-------- 56 (298) .+.+...---....+.++..+| ...+...++.+.||++...-|. | .+..+ +..|-+ T Consensus 69 i~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~ 148 (470) T protein:vir:10 69 SADATAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDT 148 (470) T ss_pred cccccccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCccccccccc Confidence 1111111111112233344443 4455677888889876532222 1 00000 000100 Q ss_pred -------------------------------------------------c------cccccccccceeeEEEeeeEEEEE Q lcl|Aclame:pro 57 -------------------------------------------------E------SGKKTHGGVTLAPQTMVPIKVEYG 81 (298) Q Consensus 57 -------------------------------------------------E------~~~~~~~~~~~~~v~l~~~k~~~~ 81 (298) | +.++++...++++++..++.=+=. T Consensus 149 ~~~~~~~a~~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLK 228 (470) T protein:vir:10 149 SGFTATGANNVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALK 228 (470) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeecccee Confidence 0 122344445555555555544445 Q ss_pred EeecHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccccccccccccccccccccccccccccchhHH Q lcl|Aclame:pro 82 ARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNP--RLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) Q Consensus 82 ~~iS~ell~~~~d-~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) ..+|-||.++-.. -..|.+++|..-|+..|...+++-++.-... .-+...+. ...++....... .+.-..+ T Consensus 229 AeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~-~~~Gv~Dl~~~~-----~gr~~~e 302 (470) T protein:vir:10 229 AEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANV-AAAGTFDLDTDS-----NGRWSVE 302 (470) T ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccc-cccceEEeeccc-----chhHHHH Confidence 6788898654322 1234555566666666666666555533211 00111111 111111110000 0000112 Q ss_pred HHHHHhhhh---------hhcCCcccEEEEcHHHHHHHHHhh--ccCC---ceeecccccccCcceec-ceeeEecCccc Q lcl|Aclame:pro 159 AIENAVELL---------TGVDADVTGIAINPSFRSALAKQK--DLQG---NALFPELKWGATPDTIN-GLPVDVNKTVS 223 (298) Q Consensus 159 ~i~~~~~~l---------~~~~~~~~~~vm~~~~~~~L~~lk--d~~G---~~l~~~~~~~~~~~~l~-G~PV~~s~~~~ 223 (298) .+..++.++ .......+.++|+++....|.-.- +..+ ..+-.+....-..|.|. |++|+++.++. T Consensus 303 ~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~ 382 (470) T protein:vir:10 303 KFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSA 382 (470) T ss_pred HHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccccccccCCCCceEEEEecCceEEEeecccc Confidence 222222222 122345567899999998883211 1000 01111111011134555 47999998775 Q ss_pred cccccccceEEEeeccceEEEEeecceEEEEeeccc------ccccchhhhhcCcEEEEEEEEEccEEe-----cccceE Q lcl|Aclame:pro 224 DMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGD------PDNSGLDLKGYNQVYIRAELFLGWGIL-----DATKFA 292 (298) Q Consensus 224 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~~~~~~~f~~n~v~~r~~~r~~~~v~-----~~~a~~ 292 (298) .+.....+.+++|-- +...+ .-.+-..||.. .|... ||- .+.|+ .|++..+- ..++.+ T Consensus 383 ~~~~a~~dy~~vG~K-G~~~~----~~glfy~PYv~l~~~~~~dp~s---fqP-~~g~~--tRY~l~~NP~~~~~~~~~~ 451 (470) T protein:vir:10 383 SGGAAATQYYVVGYK-GSSPY----DAGLFYCPYVPLQMVRAVGQDT---FQP-KIGFK--TRYGLVENPFSQGTTQGLG 451 (470) T ss_pred ccCcccccEEEEEEe-cCcce----ecceeeccccccccCCCCCCcc---ccc-eeeee--eeeceeecCcccCCCcccc Confidence 443444555665521 10000 01122223221 12211 332 23333 35555331 112222 Q ss_pred EE-eecC Q lcl|Aclame:pro 293 RV-TEAN 298 (298) Q Consensus 293 ~l-~~a~ 298 (298) ++ ++.| T Consensus 452 ~i~~~~n 458 (470) T protein:vir:10 452 TLTRNSN 458 (470) T ss_pred cccCCCC Confidence 22 2222 No 252 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=50.46 E-value=0.61 Score=21.76 Aligned_cols=270 Identities=13% Similarity=0.050 Sum_probs=112.1 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhh--c-ceee-cCCCceEEEEEeC------CcceEEeeccccccccccceee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARL--S-AQKP-IPFNGEKVFTFTM------DSEIDVVAESGKKTHGGVTLAP 70 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~--~-~~~~-~~~~~~~ip~~~~------~~~a~~v~E~~~~~~~~~~~~~ 70 (298) ||.+=.+ .+.+.+.+.+.....+.-..| . ..+. .++..++||+++- +-..+-...|-....-+.+++. T Consensus 1 Mantl~y--a~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~~~et 78 (302) T protein:vir:78 1 MANSLAL--AQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTLAWSD 78 (302) T ss_pred CCchhHH--HHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceeeeeee Confidence 8844222 144555565655555432222 1 1233 3446789999862 2112223222222223344555 Q ss_pred EEEeeeEEEE-EEeecHHHhhccccc--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc Q lcl|Aclame:pro 71 QTMVPIKVEY-GARISDEFMYASDEE--KINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV 147 (298) Q Consensus 71 v~l~~~k~~~-~~~iS~ell~~~~d~--~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~ 147 (298) .+|.-.+--. .+.--+ .++. ...+...+.+...+...=.+|...|.-. .....+..+. .... T Consensus 79 ~tlt~DR~~~f~vD~mD-----vdETn~~~~~ani~~ef~r~~vvPEiDayrfskl------a~~a~~~~~~----~~~~ 143 (302) T protein:vir:78 79 YTLDYDLAQSFQIDAMD-----VDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKL------ANDGTGVGGV----IDLS 143 (302) T ss_pred EEeeeccceeeeccccc-----hhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHH------HHhhhccCcc----cccc Confidence 5555444222 121111 1111 1112222222233334445555544210 0000000100 1111 Q ss_pred ccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceee---cccccccCcceecceeeEe--cCcc Q lcl|Aclame:pro 148 EAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF---PELKWGATPDTINGLPVDV--NKTV 222 (298) Q Consensus 148 ~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~---~~~~~~~~~~~l~G~PV~~--s~~~ 222 (298) ....+....++.|..++..+..++ +-.++|+|.+...|...+.-+...-. ....-....++|.|+||+. ++.| T Consensus 144 ~~~~t~~nvl~~i~~~~~~~~e~~--~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~ 221 (302) T protein:vir:78 144 KPDASAQALMGDIATAMELVDDSN--QLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYL 221 (302) T ss_pred ccchhHHHHHHHHHHHHHHhhccC--CeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhc Confidence 112234566788888888877753 44689999999888654322222111 1111234567899999974 3334 Q ss_pred cccc---------ccccc-eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc--c Q lcl|Aclame:pro 223 SDMS---------LTQRD-RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT--K 290 (298) Q Consensus 223 ~~~~---------~~~~~-~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~--a 290 (298) .... ..++. -.++..- .+. +.+...-.+.+++ .+..+. -|...+.-+.+.|.=|.+.+ + T Consensus 222 ~t~~~f~~G~~~~~~ak~INfiiv~~-~a~-ia~~K~~~~~if~-P~~~~~------gd~~l~~~R~Y~D~fV~~nk~~g 292 (302) T protein:vir:78 222 YDKVAPKVGVPDYTGAKKIPYMIFKR-DAP-TGIVKTDKVRVFE-PDTNQS------ADAYKVDLRLYHDLIVPKNQRPG 292 (302) T ss_pred ccceeccCCccccCCccceeEEEECC-Cee-eeeeeeeeeEeeC-CCCCCC------cceeeeeeeeEeeeeeeccccCe Confidence 3210 01111 1222222 221 2222222333332 111111 12234444556666666544 3 Q ss_pred eEEEeecC Q lcl|Aclame:pro 291 FARVTEAN 298 (298) Q Consensus 291 ~~~l~~a~ 298 (298) +.+=+.++ T Consensus 293 I~~~~~~~ 300 (302) T protein:vir:78 293 IIKASFGT 300 (302) T ss_pred EEEeeccc Confidence 33323333 No 253 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=43.12 E-value=0.86 Score=20.95 Aligned_cols=282 Identities=12% Similarity=0.053 Sum_probs=123.8 Q ss_pred CeeccccccchhHHHHHHHHHHhhch--hhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccccceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSS--IARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~--i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~~~~v~l~~ 75 (298) ==++|+.|--+.+..+|-.+...... +.+-..+.+..+.--++-.... .+...+++|+...+.+++++.+..... T Consensus 35 tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~ 114 (467) T protein:vir:80 35 TQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNM 114 (467) T ss_pred cccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEe Confidence 11233444445555555544433322 2333334444443234443332 256789999999999999999999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-----cccccccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTA-----SAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~-----~~~~~~~~~~~~~~~~~~~~ 150 (298) |=++.-..+|.-+-+... ..+.++...+.-.-.++..+|.++|+|+..-.-.+ -.+.|+..+.+. ..+-.. T Consensus 115 k~l~~~~~vs~~~~l~n~--i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~--enviDa 190 (467) T protein:vir:80 115 KFASDTKNISIAAGLVNN--IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDA 190 (467) T ss_pred eeeeeeeeehhhhhhhcc--hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC--Cceecc Confidence 999988778776533322 34566788888888899999999999964331111 122333322211 111122 Q ss_pred cccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHH-HHhhccCCceeecccccccCcceecceeeEecCccccccc-c Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSAL-AKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL-T 228 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L-~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~-~ 228 (298) ......-++|..+.......+..++-++|+..+.+.| ......+ +.+.. +.......|.||- ..+..... . T Consensus 191 ~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q--~~v~~---~n~~~~~~G~~v~--g~~sa~G~I~ 263 (467) T protein:vir:80 191 RGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ--TQLVR---DNGNNVSVGFNIQ--GFHSARGFIK 263 (467) T ss_pred CCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCce--EEEEc---CCCCceeeeeccc--ceecceeeee Confidence 2223345566666666666677788899999998877 2222111 11110 0011112233221 00000000 0 Q ss_pred ccceEEEeeccceEEEEeec-ceE-------EEEeecccccccchhhhh-cCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANGFKWGYAK-EVP-------LEVIQYGDPDNSGLDLKG-YNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~~~~~~~~-~~~-------i~~~~~~~~~~~~~~~f~-~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -....|+++... +.... ..+ +.-+.. .+..+...-. -....||+...-+.+=.-|+..+-++.+. T Consensus 264 l~gs~il~~~~~---l~~~~~~~~~Apsp~~vsaT~~--~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa 337 (467) T protein:vir:80 264 LHGSTVMENEQI---LDERILALPTAPQPAKVTATQE--AGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTA 337 (467) T ss_pred ecCceeeccccC---CCcccccccccccCCccceeee--cccCCcccCCCcceEEEEEEEECCCCccccccceEEEecC Confidence 000112222110 00000 000 000000 0000000000 00011222222221111222222221111 No 254 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=42.94 E-value=0.86 Score=20.93 Aligned_cols=282 Identities=12% Similarity=0.054 Sum_probs=123.5 Q ss_pred CeeccccccchhHHHHHHHHHHhhc--hhhhhcceeecCCCceEEEEEeC---CcceEEeeccccccccccceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKS--SIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~~~~v~l~~ 75 (298) ==++|+.|--+.+..+|-.+..... .+.+-..+.+..+.--++..... .+...+++|+...+.+++++.+..... T Consensus 36 ~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~ 115 (468) T protein:vir:63 36 TQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNM 115 (468) T ss_pred cccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEe Confidence 1122344444555555544433322 22333334444443234443332 256789999999999999999999999 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-----cccccccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTA-----SAVIGTNHFDSKVTQKVEAP 150 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~~g~~-----~~~~~~~~~~~~~~~~~~~~ 150 (298) |=++.-..+|.-+-+... ..+.++...+.-.-.++..+|.++|+|+..-.-.+ -.+.|+..+.+. ..+-.. T Consensus 116 k~l~~~~~vs~~~~l~n~--i~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~--enviDa 191 (468) T protein:vir:63 116 KFASDTKNISIAAGLVNN--IQDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDA 191 (468) T ss_pred eeeeeeeeehhhhhhhcc--hhhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC--Cceecc Confidence 999988788776533322 34566788888888899999999999964331111 122333322211 111122 Q ss_pred cccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHH-HHhhccCCceeecccccccCcceecceeeEecCccccccc-c Q lcl|Aclame:pro 151 RGIADPNGAIENAVELLTGVDADVTGIAINPSFRSAL-AKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSL-T 228 (298) Q Consensus 151 ~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L-~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~-~ 228 (298) ......-++|..+.......+..++-++|+..+.+.| ......+ +.+.. +.......|.||- ..+..... . T Consensus 192 ~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q--~~v~~---~n~~~~~~G~~v~--g~~sa~G~I~ 264 (468) T protein:vir:63 192 RGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ--TQLVR---DNGNNVSVGFNIQ--GFHSARGFIK 264 (468) T ss_pred CCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCce--EEEEc---CCCCceeeeeccc--ceecceeeee Confidence 2223345566666666666677788899999998877 2222111 11110 0011112233321 00000000 0 Q ss_pred ccceEEEeeccceEEEEeec-ceE-------EEEeecccccccchhhhh-cCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 229 QRDRAIIGDFANGFKWGYAK-EVP-------LEVIQYGDPDNSGLDLKG-YNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 229 ~~~~~~~gd~~~~~~~~~~~-~~~-------i~~~~~~~~~~~~~~~f~-~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) -....|+++... +.... ..+ +.-+.. .+..+...-. -....||+...-+.+=.-|+..+-++.+. T Consensus 265 l~gs~il~~~~~---l~~~~~~~~~Apsp~~vsaT~~--~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa 338 (468) T protein:vir:63 265 LHGSTVMENEQI---LDERILALPTAPQPAKVTATQE--AGKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTA 338 (468) T ss_pred ecCceeeccccC---CCcccccccccccCCccceeee--cccCCcccCCCcceEEEEEEEECCCCccccccceEEEecC Confidence 000112222110 00000 000 000000 0000000000 00011222222221111222222221111 No 255 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=41.39 E-value=0.93 Score=20.76 Aligned_cols=278 Identities=11% Similarity=0.020 Sum_probs=100.2 Q ss_pred Ce----eccccccchhHH--HH-HHHHHHhhchhhhhcceee--------cCCCceEEEEEeCCcceEEeec-------- Q lcl|Aclame:pro 1 MV----LNKGTLFDPELV--TD-LISKVAGKSSIARLSAQKP--------IPFNGEKVFTFTMDSEIDVVAE-------- 57 (298) Q Consensus 1 ma----t~gg~lip~~~~--~~-ii~~~~~~s~i~~~~~~~~--------~~~~~~~ip~~~~~~~a~~v~E-------- 57 (298) .+ +..|...-..+. .. +........+...-..... +..+. . +..+.+-..-.+| T Consensus 166 ~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~-~--y~~g~gm~Ta~aEal~~~g~s 242 (521) T protein:vir:72 166 LAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGA-L--VEIAEGMATSIAELQEGFNGS 242 (521) T ss_pred cccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCc-e--eeeecccchhhhhhhcccCCc Confidence 11 111111111110 00 0000000000000000000 00000 0 0111111111122 Q ss_pred -cccccccccceeeEEEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccccccc Q lcl|Aclame:pro 58 -SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTASAVI 134 (298) Q Consensus 58 -~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~~~~~ 134 (298) +..+++-..++++++..+|.=+=...+|-||.++--.- ..|.+++|..-|+-.|...+++-++.=..- ..-...+.+ T Consensus 243 s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t 322 (521) T protein:vir:72 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 23466777777887777776666678899987653221 244566666666766777777666632210 000000000 Q ss_pred ----ccccccccccccccc-ccccc----hhHHHHHHHhhhhhh--cCCcccEEEEcHHHHHHHHHhh--c---cCC-ce Q lcl|Aclame:pro 135 ----GTNHFDSKVTQKVEA-PRGIA----DPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK--D---LQG-NA 197 (298) Q Consensus 135 ----~~~~~~~~~~~~~~~-~~~~~----~~~~~i~~~~~~l~~--~~~~~~~~vm~~~~~~~L~~lk--d---~~G-~~ 197 (298) ...++.......... ..... ..+-.|......+.. .....+-++|+++....|...- | ++| .- T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:72 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 001111111101000 00000 012222222223222 2245667999999999887531 1 110 01 Q ss_pred eecccccc-cCcceec-ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEeecccccccchhhhhc Q lcl|Aclame:pro 198 LFPELKWG-ATPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY 270 (298) Q Consensus 198 l~~~~~~~-~~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~ 270 (298) =|..+.+. -..+.|. |++|+++++.+. +.+++|-- ..+..|.+=-+ +.+.+ ..|.. -||- T Consensus 403 g~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfyaPYv~--l~~~~--~~dp~---sfqP 469 (521) T protein:vir:72 403 GFSTDTTKSVFAGVLGGKYRVYIDQYAKQ------DYFTVGYKGPNEMDAGIYYAPYVA--LTPLR--GSDPK---NFQP 469 (521) T ss_pred cccccCCCceEEEEccCceEEEecCCCCc------ceEEEEEeCCcccccceeeccccc--ccccc--ccCCc---cccc Confidence 12112111 1124454 579999988753 34444421 11111111101 11111 11111 1332 Q ss_pred CcEEEEEEEEEccEEecc-------cceEEEeecC Q lcl|Aclame:pro 271 NQVYIRAELFLGWGILDA-------TKFARVTEAN 298 (298) Q Consensus 271 n~v~~r~~~r~~~~v~~~-------~a~~~l~~a~ 298 (298) .+.|+ .|+++.+ +| +...++++.+ T Consensus 470 -~~g~~--tRY~l~~-NP~~~~~~~~~a~~i~~~~ 500 (521) T protein:vir:72 470 -VMGFK--TRYGIGI-NPFAESAAQAPASRIQSGM 500 (521) T ss_pred -eeeee--eeeceee-cCcccccCcccceeecCcC Confidence 13333 3555533 33 2234444444 No 256 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=36.31 E-value=1.2 Score=20.19 Aligned_cols=221 Identities=11% Similarity=0.063 Sum_probs=107.2 Q ss_pred Ceeccccc--cchhHHHHHHHH-HH-hhchhhhhcceeecCCCceEEEEEeCCcce-EEeeccccccccccceeeEEEee Q lcl|Aclame:pro 1 MVLNKGTL--FDPELVTDLISK-VA-GKSSIARLSAQKPIPFNGEKVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVP 75 (298) Q Consensus 1 mat~gg~l--ip~~~~~~ii~~-~~-~~s~i~~~~~~~~~~~~~~~ip~~~~~~~a-~~v~E~~~~~~~~~~~~~v~l~~ 75 (298) |..+...| +-..+ +.++.. +. ..+...+++.++|-++..-+|.++..-|.. .|+||- .-.+++-..-++.- T Consensus 1 M~i~~~~l~~l~~~~-~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewiGer---~i~~l~~~~y~i~N 76 (305) T protein:vir:19 1 MIVTPASIKALMTSW-RKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWVGKR---TIQQMEAHGYSIAN 76 (305) T ss_pred CccCHHHHHHHHHHH-HHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhhcce---eeeeccccceeEee Confidence 77766554 11222 223322 22 224467777777766666677777766664 688653 34444445556777 Q ss_pred eEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccc----cccccccccccccccccccccccccccc Q lcl|Aclame:pro 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGV----NPRLGTASAVIGTNHFDSKVTQKVEAPR 151 (298) Q Consensus 76 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~----~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 151 (298) ++...-+.|+++-+ +|+...+..-+.+.++++.+.--|.-++.-. +..--....+....+- T Consensus 77 k~fe~tV~V~R~dI---eDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHp------------ 141 (305) T protein:vir:19 77 KTFEGTVGISRDDF---EDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHP------------ 141 (305) T ss_pred ccccceeccchhhc---cccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCC------------ Confidence 88889999999965 6777899999999999998887776665321 1000000111111110 Q ss_pred ccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceeecccccccCcceecceeeEecCccccccccccc Q lcl|Aclame:pro 152 GIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRD 231 (298) Q Consensus 152 ~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~s~~~~~~~~~~~~ 231 (298) ....+...+... ....++..++..|.+.| T Consensus 142 -----------v~~~~~~tg~~~--------~vsn~~~~~~~~g~~w~-------------------------------- 170 (305) T protein:vir:19 142 -----------VYPNVDGTGSAV--------NTSNIVEQDSFSGLPFY-------------------------------- 170 (305) T ss_pred -----------cccCCccccccc--------chhhhhcCCCCCCceee-------------------------------- Confidence 000000000000 00112233334443322 Q ss_pred eEEEeecc---ceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc-ceE---EEeecC Q lcl|Aclame:pro 232 RAIIGDFA---NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT-KFA---RVTEAN 298 (298) Q Consensus 232 ~~~~gd~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~-a~~---~l~~a~ 298 (298) +-|.+ +-++|-.|...++.-.+.. +..+.|.+++..|-++.|+..+---.. |+. .|.-++ T Consensus 171 ---Lld~~~~ikP~I~Q~Rk~~~~~~~~~~----~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls~~n 237 (305) T protein:vir:19 171 ---LLDCSRAVKPLIFQERRKPELVARTRI----DDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLTLDN 237 (305) T ss_pred ---eeecCCcceeEEEecccccceeeccCC----CchhhhhhceeeeeeeeeeeccccchhheecCCCCCCHHH Confidence 11111 1123445555554322222 122457788877777777654433211 111 111111 No 257 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=33.28 E-value=1.4 Score=19.84 Aligned_cols=272 Identities=11% Similarity=0.005 Sum_probs=98.3 Q ss_pred Ce-eccccccchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEE--EeCCcce--EEee-ccccccccccceeeEEE Q lcl|Aclame:pro 1 MV-LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFT--FTMDSEI--DVVA-ESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 ma-t~gg~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~ip~--~~~~~~a--~~v~-E~~~~~~~~~~~~~v~l 73 (298) .. ...|.-+. ........ ..... ....+.. +...+.. .++..++ .+-+ .+.++++-..++++++. T Consensus 197 fs~~~~g~~~~--~g~~~~~~-----~~~~~-~~~~~a~~~~~~~~~Gm~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tV 268 (529) T protein:vir:10 197 FLQNVSGASVT--VGTNETGE-----ALDKL-INAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTV 268 (529) T ss_pred ecccccccccc--cCccccCc-----ccccc-cccccccccccccccccchhhhhccccCCCcccccccceeeEEEEEEE Confidence 11 00011110 00000000 00000 0001111 1111111 1111111 1111 23467888888888888 Q ss_pred eeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cccccccccccccccccccc Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNPR------LGTASAVIGTNHFDSKVTQK 146 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~~~------~g~~~~~~~~~~~~~~~~~~ 146 (298) .+|.=+=...+|-||.++--.- ..|.+++|..-|+..|...+++-++.-...- .|...... ..++....... T Consensus 269 tAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~k~~g~~~~~~-~~Gv~d~~~~~ 347 (529) T protein:vir:10 269 EAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGS-ASGVFDFQDPI 347 (529) T ss_pred eeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhhhhhhccccccccc-ccceeecccCc Confidence 7777666678899986542211 1334455555555555555555555321100 01100000 01111111111 Q ss_pred ccccc-cc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHh--hccCCc----eee-cccccccCcceec Q lcl|Aclame:pro 147 VEAPR-GI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQ--KDLQGN----ALF-PELKWGATPDTIN 212 (298) Q Consensus 147 ~~~~~-~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~l--kd~~G~----~l~-~~~~~~~~~~~l~ 212 (298) ..... .. ...+-.|...-+.+... +...+.++|+++....|... ++.-+. .-| .+.......|.|. T Consensus 348 ~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~ 427 (529) T protein:vir:10 348 DVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILG 427 (529) T ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEec Confidence 10000 00 11233344444444332 33466799999999988632 111110 001 1111112234555 Q ss_pred -ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEe Q lcl|Aclame:pro 213 -GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL 286 (298) Q Consensus 213 -G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~ 286 (298) |++|+++++.+. +.+++|-- ..+..|.+=-++ .+.+ ..|.. + ||- .+.|+ .|++..+ T Consensus 428 ~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfy~PYv~l--~~~~--~~dp~--s-fqP-~~g~~--tRY~l~~- 490 (529) T protein:vir:10 428 GRYKVYIDQYARQ------DYFTMGYRGANNLDAGIYYCPYVAL--TPLR--GSDPK--N-FQP-VMGFK--TRYAIGV- 490 (529) T ss_pred CceEEEecCCCCc------ceEEEEEeCCcccccceeecccccc--cccc--ccCCC--c-ccc-eeeee--eeeceee- Confidence 479999988753 34444421 111111111111 1111 11111 1 332 13333 3555432 Q ss_pred cc------c-ceEE-EeecC Q lcl|Aclame:pro 287 DA------T-KFAR-VTEAN 298 (298) Q Consensus 287 ~~------~-a~~~-l~~a~ 298 (298) +| + ...+ +++.+ T Consensus 491 NP~~~~~~~~~~~r~~~g~~ 510 (529) T protein:vir:10 491 NPFAESRTQAPQGRITSGMP 510 (529) T ss_pred cCccccccccccccccCCcc Confidence 22 1 1112 22222 No 258 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=33.08 E-value=1.4 Score=19.82 Aligned_cols=280 Identities=11% Similarity=-0.003 Sum_probs=107.5 Q ss_pred CeeccccccchhHHHHHHHHHH---hhchhhhhcceeecCCCc------------------------------------- Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVA---GKSSIARLSAQKPIPFNG------------------------------------- 40 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~---~~s~i~~~~~~~~~~~~~------------------------------------- 40 (298) .+.+...---..+.+.+|..+| .+.+...++.+.||++.. T Consensus 78 i~es~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~ 157 (528) T protein:vir:66 78 IAAGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLA 157 (528) T ss_pred ccccccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCcccccccccccccccccccccccc Confidence 1111111001112223333332 233344555566654410 Q ss_pred -----------eEEEEE---------------------------------------------------------eCCcce Q lcl|Aclame:pro 41 -----------EKVFTF---------------------------------------------------------TMDSEI 52 (298) Q Consensus 41 -----------~~ip~~---------------------------------------------------------~~~~~a 52 (298) +.+... .+.+-. T Consensus 158 t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~ 237 (528) T protein:vir:66 158 AKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMA 237 (528) T ss_pred cccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccc Confidence 000000 000000 Q ss_pred EEeec---------cccccccccceeeEEEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 53 DVVAE---------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHG 122 (298) Q Consensus 53 ~~v~E---------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G 122 (298) .-.+| +.++++-..++++++..++.=+=...+|-||.|+--.- ..|.+++|..-|+..|...+++-++.- T Consensus 238 Ta~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~ 317 (528) T protein:vir:66 238 TSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDV 317 (528) T ss_pred hhhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhh Confidence 01112 12345666666777776666555677899986643221 244556666666666666666666532 Q ss_pred cccccccc--cccc----ccccccccccccccccc-----ccchhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHH Q lcl|Aclame:pro 123 VNPRLGTA--SAVI----GTNHFDSKVTQKVEAPR-----GIADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAK 189 (298) Q Consensus 123 ~~~~~g~~--~~~~----~~~~~~~~~~~~~~~~~-----~~~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~ 189 (298) .+. +... .... ...++..........+. .....+-.|...-+.+... +...+.++|+++....|.. T Consensus 318 i~~-~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~ 396 (528) T protein:vir:66 318 INF-TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILAS 396 (528) T ss_pred hhh-eeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhh Confidence 110 0000 0000 00011111111100000 0111222233333333332 2344679999999998865 Q ss_pred hh-----c-cCCceeeccccccc-Ccceec-ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEee Q lcl|Aclame:pro 190 QK-----D-LQGNALFPELKWGA-TPDTIN-GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQ 256 (298) Q Consensus 190 lk-----d-~~G~~l~~~~~~~~-~~~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~ 256 (298) .- + ......+..+.+.. ..|.|. |++|+++++.+. +.+++|-- ..+..|.+=-++.+... T Consensus 397 ~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfyaPYv~l~~~~~- 469 (528) T protein:vir:66 397 ADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQ------DYFTVGYKGDNEMDAGIYYAPYVALTPLRA- 469 (528) T ss_pred ccccccccccccccccccCCCCceeEEEecCceEEEecCCCCc------ceEEEEEeCCcccccceeecccccceeeEe- Confidence 31 1 11222222222221 135566 579999988753 34444421 11111211111211111 Q ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccc-------eEEEeecC Q lcl|Aclame:pro 257 YGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK-------FARVTEAN 298 (298) Q Consensus 257 ~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a-------~~~l~~a~ 298 (298) .|.. -||- .+.|+ .|+++.+ +|=+ -+++.+.+ T Consensus 470 ---~dp~---sfqP-~~g~~--tRY~l~v-NP~~~~~~~~~~~ri~~g~ 508 (528) T protein:vir:66 470 ---TDPQ---SFHP-VLGFK--TRYGIGI-NPFADSKSQEPSARITSGM 508 (528) T ss_pred ---eCCc---cccc-eeeee--eeeceee-cCcccccCccccccccccc Confidence 1111 1332 23333 3565543 3311 22332222 No 259 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=27.54 E-value=1.8 Score=19.14 Aligned_cols=293 Identities=13% Similarity=0.037 Sum_probs=109.4 Q ss_pred Ceec------------cccccchhHHHHHHHHHHhhchh-hhhcceeecCCCceEEEEEeC-Cc-ceEEeeccccccccc Q lcl|Aclame:pro 1 MVLN------------KGTLFDPELVTDLISKVAGKSSI-ARLSAQKPIPFNGEKVFTFTM-DS-EIDVVAESGKKTHGG 65 (298) Q Consensus 1 mat~------------gg~lip~~~~~~ii~~~~~~s~i-~~~~~~~~~~~~~~~ip~~~~-~~-~a~~v~E~~~~~~~~ 65 (298) |-.. =-.++.+.....+++.....+-+ ..+.+..++....+.+..... .+ .+.+++.+.+.+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~ 80 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTRNRQYPEMLGDTLFPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAEIGT 80 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHHhcCcchhhHhhcCCccccccceeEEEeeccCcceeeeeecCCCCcceec Confidence 1111 12222223333344333222222 235555554433333333221 22 255666665554333 Q ss_pred cceeeEEEeeeEEEEEEeecHH-Hh--hcc--cccHHHHHHHH---HHHHHHHHHHHHHHHHh----cccc--ccccccc Q lcl|Aclame:pro 66 VTLAPQTMVPIKVEYGARISDE-FM--YAS--DEEKINILQAF---NDGFAKKVARGIDLMAF----HGVN--PRLGTAS 131 (298) Q Consensus 66 ~~~~~v~l~~~k~~~~~~iS~e-ll--~~~--~d~~~~l~~~i---~~~la~~i~~~~d~~~l----~G~~--~~~g~~~ 131 (298) -.....+..+-.++-...++.+ ++ +.. +.....+...+ ...+.+.+.+.+|.++. +|.- .+.|..- T Consensus 81 r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~~v 160 (349) T protein:vir:10 81 REASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGIAI 160 (349) T ss_pred ccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcEEE Confidence 3333444444444444455533 22 110 11111222333 23333445555554333 3310 0011000 Q ss_pred c-cccccccccccccccccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHH------hhccCCceeecc-cc Q lcl|Aclame:pro 132 A-VIGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK------QKDLQGNALFPE-LK 203 (298) Q Consensus 132 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~------lkd~~G~~l~~~-~~ 203 (298) . .....+... .+....++....++.++|.+....+ +..+..++|++++|..|++ .-+.++...... .. T Consensus 161 D~g~~~~~~~~-lt~~~~Ws~~~adpi~Di~~~~~~~---g~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~ 236 (349) T protein:vir:10 161 DYGVPKKHQET-LSGTKTWDKSDASIIDNLQDWSDSL---DVTPTRALTSKKVLRILMRSTEIKEAIFGKDTGRVVGQAD 236 (349) T ss_pred ecccCccceeE-ecCcccCCCCCCCHHHHHHHHHHHh---CCCccEEEeCHHHHHHHhcCHHHHHHhcccccccccCHHH Confidence 0 000011101 1233445556677888888876654 6678889999999998853 222222221111 00 Q ss_pred cccCcceecceeeEecCc-cccccc---------cccceEEE-eecc-ceEEEEee-cceEEEEe--eccccccc-chhh Q lcl|Aclame:pro 204 WGATPDTINGLPVDVNKT-VSDMSL---------TQRDRAII-GDFA-NGFKWGYA-KEVPLEVI--QYGDPDNS-GLDL 267 (298) Q Consensus 204 ~~~~~~~l~G~PV~~s~~-~~~~~~---------~~~~~~~~-gd~~-~~~~~~~~-~~~~i~~~--~~~~~~~~-~~~~ 267 (298) ....-+.+.|.++++-+. ..+..+ -..+.+++ .+-. +...|+.- +...+.-. ........ .+.. T Consensus 237 ~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~ 316 (349) T protein:vir:10 237 LDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLISSNAQVSNVGNIMAKIY 316 (349) T ss_pred HHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeeccchhhhhcccccceeeccceEEEee Confidence 011112344555654221 111000 01122222 2211 11222211 10000000 00000000 0000 Q ss_pred h-hcC--cEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 268 K-GYN--QVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 268 f-~~n--~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) + +.| ...+++..+.=-.+.+|+++.+++.. T Consensus 317 ~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 317 ETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred eecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 0 111 23334444433344678888888888 No 260 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=26.55 E-value=1.9 Score=19.01 Aligned_cols=268 Identities=11% Similarity=-0.022 Sum_probs=96.9 Q ss_pred Ce--eccccccc-hhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEE--EeCCcce--EEee-ccccccccccceeeE Q lcl|Aclame:pro 1 MV--LNKGTLFD-PELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFT--FTMDSEI--DVVA-ESGKKTHGGVTLAPQ 71 (298) Q Consensus 1 ma--t~gg~lip-~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~ip~--~~~~~~a--~~v~-E~~~~~~~~~~~~~v 71 (298) .. +.++.... ... ............+..+ ...+.. .++..++ .|-+ -+.++++-..+++++ T Consensus 197 fs~~~~g~~~~~g~~~----------t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~ 266 (529) T protein:vir:10 197 FLQNVSGASVTVGTNE----------TGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQ 266 (529) T ss_pred eeccccccccccCccc----------cCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEE Confidence 11 11111100 000 0000000000011111 111111 1111111 1111 234578888888888 Q ss_pred EEeeeEEEEEEeecHHHhhccccc-HHHHHHHHHHHHHHHHHHHHHHHHhccccc------ccccccccccccccccccc Q lcl|Aclame:pro 72 TMVPIKVEYGARISDEFMYASDEE-KINILQAFNDGFAKKVARGIDLMAFHGVNP------RLGTASAVIGTNHFDSKVT 144 (298) Q Consensus 72 ~l~~~k~~~~~~iS~ell~~~~d~-~~~l~~~i~~~la~~i~~~~d~~~l~G~~~------~~g~~~~~~~~~~~~~~~~ 144 (298) +..+|.=+=...+|-||.++--.- ..|.+++|..-|+..|...+++-++.-... ..|+..... ..++..... T Consensus 267 tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~-~~Gv~d~~~ 345 (529) T protein:vir:10 267 TVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGS-ASGVFDFQD 345 (529) T ss_pred EEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhcccccccccc-ccceeeccc Confidence 887777666678899986542210 133444555555555555555544432110 001110000 011111111 Q ss_pred ccccccc-cc----chhHHHHHHHhhhhhhc--CCcccEEEEcHHHHHHHHHh--h-------ccCCceeecccccccCc Q lcl|Aclame:pro 145 QKVEAPR-GI----ADPNGAIENAVELLTGV--DADVTGIAINPSFRSALAKQ--K-------DLQGNALFPELKWGATP 208 (298) Q Consensus 145 ~~~~~~~-~~----~~~~~~i~~~~~~l~~~--~~~~~~~vm~~~~~~~L~~l--k-------d~~G~~l~~~~~~~~~~ 208 (298) ....... .. ...+-.|...-+.+... +...+.++|+++....|... + ...|- ..+....... T Consensus 346 ~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~--~~d~~~~~~~ 423 (529) T protein:vir:10 346 PIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGL--NADTTKGVFA 423 (529) T ss_pred CccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhccccccccccccccc--ccccCCceEE Confidence 1110000 00 11233344444444332 23466799999999988742 1 11111 0111111123 Q ss_pred ceec-ceeeEecCccccccccccceEEEeec-----cceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEc Q lcl|Aclame:pro 209 DTIN-GLPVDVNKTVSDMSLTQRDRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLG 282 (298) Q Consensus 209 ~~l~-G~PV~~s~~~~~~~~~~~~~~~~gd~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~ 282 (298) |.|. |++|+++++.+. +.+++|-- ..+..|.+=-+++ +.+ ..|.. + ||- .+.|+ .|++ T Consensus 424 G~l~~~~~vy~D~y~~~------dy~~vG~KG~~~~~~glfy~PYv~l~--~~~--~~dp~--s-fqP-~~g~~--tRY~ 487 (529) T protein:vir:10 424 GILGGRYKVYIDQYARQ------DYFTMGYRGANNLDAGIYYCPYVALT--PLR--GFDPK--N-FQP-VMGFK--TRYA 487 (529) T ss_pred EEecCceEEEecCCCCc------ceEEEEEeCCcccccceeeccccccc--ccc--ccCCC--c-ccc-eeeee--eeec Confidence 4555 479999988753 34444421 1111111111111 111 11111 1 332 13333 3454 Q ss_pred cEEecccc-------eEE-EeecC Q lcl|Aclame:pro 283 WGILDATK-------FAR-VTEAN 298 (298) Q Consensus 283 ~~v~~~~a-------~~~-l~~a~ 298 (298) ..+ +|=+ ..+ +++.+ T Consensus 488 l~~-NP~~~~~~~~~~~r~~~g~~ 510 (529) T protein:vir:10 488 IGV-NPFAESRTQAPQGRITSGMP 510 (529) T ss_pred eee-cCccccccccccccccCCcc Confidence 432 2211 112 22222 No 261 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=265 Identities=13% Similarity=0.069 Sum_probs=97.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcce-eec------CCCceEEEEEeCCc-ceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQ-KPI------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~-~~~------~~~~~~ip~~~~~~-~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||++=+.+++ -+..++++.++...++.+.+.+ +|. .+..+++|...... ..+|.--++ + +...=.++. T Consensus 1 MAn~l~~~~~-ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~--~-~~i~e~~v~ 76 (430) T protein:vir:92 1 MALNEGQIVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK--A-TGLLELNVA 76 (430) T ss_pred CccchhhHHH-HHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCC--C-CccccceEE Confidence 9999666544 3778999999999998886543 332 12346666643322 222211111 0 111112222 Q ss_pred EeeeEE--EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKV--EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 73 l~~~k~--~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) ++..+. -.+--=++||. ..+ +..++|+..+ ++++..+|..++.-... ++... +..... T Consensus 77 ~~v~~~k~V~~~~~~kel~--~~~---~~~~~i~~Am-~~LA~~Vd~dl~~~~~~~~~~v~-------------~~~~~t 137 (430) T protein:vir:92 77 VNMGEPDNDFFQLRADDLR--DET---AYRHRIQSAA-RKLANNVELKVANMAAEMGSLVI-------------TSPDAI 137 (430) T ss_pred EEEeeeccceEEechhHhc--Chh---HHHHHhHHHH-HHHHHHHHHHHHHHhhhcccccc-------------cccccC Confidence 222221 11122256642 222 2456665554 68888888888743210 00000 000011 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcc---cEEEEcHHHHHHHHH-h---hccC--CceeecccccccCcc-eeccee-eEe Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK-Q---KDLQ--GNALFPELKWGATPD-TINGLP-VDV 218 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~---~~~vm~~~~~~~L~~-l---kd~~--G~~l~~~~~~~~~~~-~l~G~P-V~~ 218 (298) ...+.+.+.++.++-..|....... -..+++|.....|.. + -..+ +.-.|. .+.-+ .+.|+- +.- T Consensus 138 ~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r----~g~i~~~~~Gfd~~~~ 213 (430) T protein:vir:92 138 GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYR----DGTIQRQVAGFDDVLR 213 (430) T ss_pred CCcCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHh----hccccccchhhhhhhh Confidence 1122223455555555554444432 257999988777632 2 1211 111111 11111 122221 111 Q ss_pred cCcccccccc---------------------------------------ccceEEEeec--------------------c Q lcl|Aclame:pro 219 NKTVSDMSLT---------------------------------------QRDRAIIGDF--------------------A 239 (298) Q Consensus 219 s~~~~~~~~~---------------------------------------~~~~~~~gd~--------------------~ 239 (298) ++.+|.-.++ ....+-.||. . T Consensus 214 ~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~ 293 (430) T protein:vir:92 214 SPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDA 293 (430) T ss_pred cCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCcc Confidence 2222210000 0011112221 0 Q ss_pred ceEEEEeecceEEEEeecccc-cccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~-~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..........-++.+.+-... +... .....++...+.-.+.+..++..|..+. T Consensus 294 ~F~Vt~~~~atsv~I~paii~~~~~~------~~~~~~~y~nVsaspa~~aavTvv~~a~ 347 (430) T protein:vir:92 294 TFSVVRVVDGTHVEITPKPVALDDVS------LSPEQRAYANVNTSLADAMAVNILNVKD 347 (430) T ss_pred EEEEEEecCCceeEEecccccccccc------ccccccccceeccccccCceeEEeccCC Confidence 000001111111222221100 0000 0000111111222223333333333332 No 262 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=265 Identities=13% Similarity=0.069 Sum_probs=97.9 Q ss_pred CeeccccccchhHHHHHHHHHHhhchhhhhcce-eec------CCCceEEEEEeCCc-ceEEeeccccccccccceeeEE Q lcl|Aclame:pro 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQ-KPI------PFNGEKVFTFTMDS-EIDVVAESGKKTHGGVTLAPQT 72 (298) Q Consensus 1 mat~gg~lip~~~~~~ii~~~~~~s~i~~~~~~-~~~------~~~~~~ip~~~~~~-~a~~v~E~~~~~~~~~~~~~v~ 72 (298) ||++=+.+++ -+..++++.++...++.+.+.+ +|. .+..+++|...... ..+|.--++ + +...=.++. T Consensus 1 MAn~l~~~~~-ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G~~~t~~--~-~~i~e~~v~ 76 (430) T protein:vir:10 1 MALNEGQIVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTDK--A-TGLLELNVA 76 (430) T ss_pred CccchhhHHH-HHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccccCcccCCC--C-CccccceEE Confidence 9999666544 3778999999999998886543 332 12346666643322 222211111 0 111112222 Q ss_pred EeeeEE--EEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccccccccccccccccccccccc Q lcl|Aclame:pro 73 MVPIKV--EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNP-RLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 73 l~~~k~--~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~d~~~l~G~~~-~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) ++..+. -.+--=++||. ..+ +..++|+..+ ++++..+|..++.-... ++... +..... T Consensus 77 ~~v~~~k~V~~~~~~kel~--~~~---~~~~~i~~Am-~~LA~~Vd~dl~~~~~~~~~~v~-------------~~~~~t 137 (430) T protein:vir:10 77 VNMGEPDNDFFQLRADDLR--DET---AYRHRIQSAA-RKLANNVELKVANMAAEMGSLVI-------------TSPDAI 137 (430) T ss_pred EEEeeeccceEEechhHhc--Chh---HHHHHhHHHH-HHHHHHHHHHHHHHhhhcccccc-------------cccccC Confidence 222221 11122256642 222 2456665554 68888888888743210 00000 000011 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcc---cEEEEcHHHHHHHHH-h---hccC--CceeecccccccCcc-eeccee-eEe Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADV---TGIAINPSFRSALAK-Q---KDLQ--GNALFPELKWGATPD-TINGLP-VDV 218 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~---~~~vm~~~~~~~L~~-l---kd~~--G~~l~~~~~~~~~~~-~l~G~P-V~~ 218 (298) ...+.+.+.++.++-..|....... -..+++|.....|.. + -..+ +.-.|. .+.-+ .+.|+- +.- T Consensus 138 ~~~~~~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r----~g~i~~~~~Gfd~~~~ 213 (430) T protein:vir:10 138 GTNTADAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYR----DGTIQRQVAGFDDVLR 213 (430) T ss_pred CCcCCcchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHh----hccccccchhhhhhhh Confidence 1122223455555555554444432 257999988777632 2 1211 111111 11111 122221 111 Q ss_pred cCcccccccc---------------------------------------ccceEEEeec--------------------c Q lcl|Aclame:pro 219 NKTVSDMSLT---------------------------------------QRDRAIIGDF--------------------A 239 (298) Q Consensus 219 s~~~~~~~~~---------------------------------------~~~~~~~gd~--------------------~ 239 (298) ++.+|.-.++ ....+-.||. . T Consensus 214 ~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~ 293 (430) T protein:vir:10 214 SPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDA 293 (430) T ss_pred cCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCcc Confidence 2222210000 0011112221 0 Q ss_pred ceEEEEeecceEEEEeecccc-cccchhhhhcCcEEEEEEEEEccEEecccceEEEeecC Q lcl|Aclame:pro 240 NGFKWGYAKEVPLEVIQYGDP-DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) Q Consensus 240 ~~~~~~~~~~~~i~~~~~~~~-~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a~ 298 (298) ..........-++.+.+-... +... .....++...+.-.+.+..++..|..+. T Consensus 294 ~F~Vt~~~~atsv~I~paii~~~~~~------~~~~~~~y~nVsaspa~~aavTvv~~a~ 347 (430) T protein:vir:10 294 TFSVVRVVDGTHVEITPKPVALDDVS------LSPEQRAYANVNTSLADAMAVNILNVKD 347 (430) T ss_pred EEEEEEecCCceeEEecccccccccc------ccccccccceeccccccCceeEEeccCC Confidence 000001111111222221100 0000 0000111111222223333333333332 No 263 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=23.65 E-value=2.3 Score=18.62 Aligned_cols=280 Identities=11% Similarity=-0.013 Sum_probs=111.0 Q ss_pred Ceecccccc---chhHHHHHHHHHHhhchhhhhcc--e-eecCCCceEEEEEeCCcc-eEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLF---DPELVTDLISKVAGKSSIARLSA--Q-KPIPFNGEKVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~li---p~~~~~~ii~~~~~~s~i~~~~~--~-~~~~~~~~~ip~~~~~~~-a~~v~E~~~~~~~~~~~~~v~l 73 (298) |-++.-.+- -+.+++.+-+.+...+.-..+.. . +-.++..++||+.+..+- .+-...|-....-+.+++..+| T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~tl 80 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYTM 80 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEEe Confidence 655554432 23344444444444322111111 1 212456799999875332 2233333332333445555555 Q ss_pred eeeEEEE-EEeecHHHhhcccccH--HHHHHHHHHHHHHHHHHHHHHHHhccc-cccccccccccccccccccccccccc Q lcl|Aclame:pro 74 VPIKVEY-GARISDEFMYASDEEK--INILQAFNDGFAKKVARGIDLMAFHGV-NPRLGTASAVIGTNHFDSKVTQKVEA 149 (298) Q Consensus 74 ~~~k~~~-~~~iS~ell~~~~d~~--~~l~~~i~~~la~~i~~~~d~~~l~G~-~~~~g~~~~~~~~~~~~~~~~~~~~~ 149 (298) .-.+--. .+.--+ .++.. ..+...+.+...+...=.+|...+.-. ...++........ ........... T Consensus 81 ~~DR~~~f~vD~mD-----vdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~--~~~~~~~~~~~ 153 (311) T protein:vir:99 81 GQDRDVEFYLDRQD-----VDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEG--TLLAKTHKTEE 153 (311) T ss_pred eeccceeeecchhc-----hhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccch--hhhcccccccc Confidence 5554222 111111 01111 111111222222223334443333110 0000000000000 00001111122 Q ss_pred ccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCCceee----cccccccCcceecceeeEe---cCcc Q lcl|Aclame:pro 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALF----PELKWGATPDTINGLPVDV---NKTV 222 (298) Q Consensus 150 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G~~l~----~~~~~~~~~~~l~G~PV~~---s~~~ 222 (298) .-+.+..++.|..++.++......+-.++|+|.++..|...+.= .|-+- ....-....++|.|+||+- ++.| T Consensus 154 ~lt~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~-~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~ 232 (311) T protein:vir:99 154 TLDETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEF-TRNITNQNVGTTALESRITSIDGVQLIEVYESNRF 232 (311) T ss_pred ccCHHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhh-heeeecccccccccccccceecCeEEEEecCchhh Confidence 23444567888888888866544455689999998877654311 11111 1111234467899999753 3333 Q ss_pred ccc------c---ccccc-eEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEeccc--c Q lcl|Aclame:pro 223 SDM------S---LTQRD-RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT--K 290 (298) Q Consensus 223 ~~~------~---~~~~~-~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~--a 290 (298) ... . .+++. -.++-. ..+. +.+...-.+.+++ .+.++. -+...+.-+.+.|.=|.+.+ + T Consensus 233 ~t~~~ft~G~~~~~~ak~INfiiv~-~~a~-i~~~K~~~v~~f~-P~~~~~------gd~~l~~~R~Y~D~fv~~nk~~~ 303 (311) T protein:vir:99 233 MTKYDFTDGAKPTEDAKAINFLVVA-KPAV-ISIVKENAVFLFA-PGQHTD------GDGYLYQNRLYHDLFIKKHKRDG 303 (311) T ss_pred cchhhhcCCccccCcccccceEEeC-CCee-eeeeeeeeeeeeC-CCCCCC------cceeeeeeeeeeeeeeeccccCe Confidence 311 0 01111 122222 2222 2222222333332 111111 12234444556666666433 3 Q ss_pred -eEEEeec Q lcl|Aclame:pro 291 -FARVTEA 297 (298) Q Consensus 291 -~~~l~~a 297 (298) ++-+|.| T Consensus 304 Iyv~~k~A 311 (311) T protein:vir:99 304 IFVSVKKA 311 (311) T ss_pred EEEeeecC Confidence 3555777 No 264 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=22.64 E-value=2.4 Score=18.48 Aligned_cols=264 Identities=15% Similarity=0.093 Sum_probs=106.7 Q ss_pred Ceecccccc------chhHHHHHHHHHHhhchhhhhcceeecCCCceEEEE-EeCCcceEEeeccccccccccceeeEEE Q lcl|Aclame:pro 1 MVLNKGTLF------DPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFT-FTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) Q Consensus 1 mat~gg~li------p~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ip~-~~~~~~a~~v~E~~~~~~~~~~~~~v~l 73 (298) +|.+|-.+. |..++..|-..+-...++.+..-+..+ +.+-+.+ .++..++...-+|+.+++...+++--++ T Consensus 35 laengvtitdttfqlprklvesintallntnpvfkvfhvtnv--gallvsrsfdssneaqvhkdgqtkteqaatltidtl 112 (318) T protein:vir:94 35 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV--GALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTL 112 (318) T ss_pred hhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhh--hheeeeccccccchhhhhcccccccccceeeeeccc Confidence 565554433 333333333333344455444433332 2344444 3445667777788888888777776667 Q ss_pred eeeEEEEEEeecHHHhhcccccHHHHHHHHHHHHHHHHHHHH-HHHHhccccccccccccccccccccc------ccccc Q lcl|Aclame:pro 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGI-DLMAFHGVNPRLGTASAVIGTNHFDS------KVTQK 146 (298) Q Consensus 74 ~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~~i~~~~-d~~~l~G~~~~~g~~~~~~~~~~~~~------~~~~~ 146 (298) .|--+...-.+-..+.+- --+...+...|.-.+..+|..++ |.++..|.|. .++..+..... .++.. T Consensus 113 epvmvyklqslaervkrl-qmsyselynlivaeltqaivnkivdlalvegdgt-----ngfksidkeadvkkikkittka 186 (318) T protein:vir:94 113 EPVMVYKLQSLAERVKRL-QMSYSELYNLIVAELTQAIVNKIVDLALVEGDGT-----NGFKSIDKEADVKKIKKITTKA 186 (318) T ss_pred chhHHHHHHHHHHHHHHH-hhhHHHHHHHHHHHHHHHHHhhhhheeeeecCCc-----chhhhhchhhhHHHHHHhhhhh Confidence 665555544444443211 12234566667777777766544 6666666431 11221111111 11111 Q ss_pred cccccccchhHHHHHHHhhhhhhcCCcccEEEEcHHHHHHHHHhhccCC--c-eeecccccccCcceecce---eeEecC Q lcl|Aclame:pro 147 VEAPRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQG--N-ALFPELKWGATPDTINGL---PVDVNK 220 (298) Q Consensus 147 ~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~vm~~~~~~~L~~lkd~~G--~-~l~~~~~~~~~~~~l~G~---PV~~s~ 220 (298) -......+.+.|..++.-+..-..+.--++-.....+.|..|+-+.. + -+-.++..- .+--|+ -|+. T Consensus 187 --ksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddtei---asevgvdeiivyt-- 259 (318) T protein:vir:94 187 --KSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEI---ASEVGVDEIIVYT-- 259 (318) T ss_pred --hhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhcccceEEeccchhh---hhhcCcceeEEee-- Confidence 11122234456777766554433332223333333334445542221 1 111111100 000111 1111 Q ss_pred ccccccccccceEEEeeccceEEEEeecceEEEEeecccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeec Q lcl|Aclame:pro 221 TVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) Q Consensus 221 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~f~~n~v~~r~~~r~~~~v~~~~a~~~l~~a 297 (298) +...-+.+++ .|- .+.+.|.+-...|. --|.+|.-.+..+..-.+-+.-.+|=+.++.. T Consensus 260 ----gskavkptvl-vdq----------kyhidmqdltkvda---fewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 260 ----GSKAVKPTVL-VDQ----------KYHIDMQDLTKVDA---FEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ----ccccccceeE-ecc----------ceecchhhhhhhhc---eeeccCCceEEEEecccCcceeecCceeEEeC Confidence 0000011111 121 12222222111110 01444444444444444444433333333333 Done!