Query lcl|Aclame:protein:vir:96762|NCBI_annot:putative phage-related protein|genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Match_columns 632 No_of_seqs 252 out of 1817 Neff 11.0 Searched_HMMs 1612 Date Mon Dec 2 14:09:43 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_69 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_69_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:96762 Length: 632 100.0 4E-133 2E-136 746.5 52.0 632 1-632 1-632 (632) 2 protein:vir:93616 Length: 645 100.0 1.5E-87 9E-91 496.7 46.9 586 14-632 1-638 (645) 3 protein:vir:97397 Length: 517 100.0 4.4E-63 2.7E-66 362.4 37.1 501 28-632 1-513 (517) 4 protein:vir:4074 Length: 480 # 100.0 5.2E-56 3.2E-59 323.7 29.7 461 32-632 1-476 (480) 5 protein:vir:6242 Length: 390 # 100.0 1.7E-51 1.1E-54 298.9 35.2 378 213-632 1-388 (390) 6 protein:vir:1328 Length: 392 # 100.0 5.9E-51 3.6E-54 296.0 36.1 380 213-632 1-390 (392) 7 protein:vir:485 Length: 407 # 100.0 1.3E-50 7.8E-54 294.2 35.8 372 218-632 1-399 (407) 8 protein:vir:95376 Length: 425 100.0 5.7E-50 3.5E-53 290.6 38.8 410 181-632 1-420 (425) 9 protein:vir:1433 Length: 435 # 100.0 1.1E-50 6.8E-54 294.5 32.5 398 197-632 1-432 (435) 10 protein:vir:100247 Length: 425 100.0 7.2E-50 4.5E-53 290.0 36.9 396 186-632 1-423 (425) 11 protein:vir:4456 Length: 401 # 100.0 9.7E-50 6E-53 289.3 35.4 372 218-632 1-400 (401) 12 protein:vir:105038 Length: 428 100.0 4.9E-50 3.1E-53 290.9 33.2 391 197-632 1-427 (428) 13 protein:vir:80376 Length: 435 100.0 6.9E-50 4.3E-53 290.1 33.6 397 197-632 1-432 (435) 14 protein:vir:7855 Length: 497 # 100.0 6.7E-49 4.2E-52 284.7 37.5 415 181-632 1-492 (497) 15 protein:vir:101650 Length: 497 100.0 6.7E-49 4.2E-52 284.7 37.5 415 181-632 1-492 (497) 16 protein:vir:8102 Length: 543 # 100.0 3.6E-48 2.2E-51 280.7 41.0 467 155-632 1-541 (543) 17 protein:vir:10364 Length: 390 100.0 8.8E-49 5.5E-52 284.0 36.3 380 218-631 1-390 (390) 18 protein:vir:100135 Length: 418 100.0 1.6E-48 9.9E-52 282.6 37.6 403 197-632 1-414 (418) 19 protein:vir:97053 Length: 390 100.0 1E-48 6.3E-52 283.7 36.2 380 218-631 1-390 (390) 20 protein:vir:81070 Length: 390 100.0 1.7E-48 1E-51 282.5 36.2 380 218-631 1-390 (390) 21 protein:vir:4339 Length: 395 # 100.0 8.4E-48 5.2E-51 278.7 37.0 381 218-632 1-394 (395) 22 protein:vir:4511 Length: 409 # 100.0 8.8E-48 5.4E-51 278.6 35.6 386 213-632 1-405 (409) 23 protein:vir:104256 Length: 458 100.0 6.4E-47 4E-50 273.8 40.2 427 181-632 1-457 (458) 24 protein:vir:81227 Length: 413 100.0 2.1E-47 1.3E-50 276.5 37.5 387 213-632 1-409 (413) 25 protein:vir:1886 Length: 385 # 100.0 1.1E-47 6.8E-51 278.1 35.8 372 222-632 1-383 (385) 26 protein:vir:191 Length: 385 # 100.0 1.1E-47 6.8E-51 278.1 35.8 372 222-632 1-383 (385) 27 protein:vir:8420 Length: 477 # 100.0 2.1E-47 1.3E-50 276.5 35.4 425 183-632 1-470 (477) 28 protein:vir:4092 Length: 390 # 100.0 2.1E-47 1.3E-50 276.5 33.8 355 244-632 1-367 (390) 29 protein:vir:5739 Length: 366 # 100.0 2.2E-48 1.4E-51 281.9 26.2 337 266-632 1-365 (366) 30 protein:vir:7771 Length: 330 # 100.0 3.8E-48 2.4E-51 280.5 26.3 278 349-632 1-322 (330) 31 protein:vir:102119 Length: 404 100.0 1.5E-45 9.3E-49 266.3 36.0 379 217-632 1-399 (404) 32 protein:vir:41 Length: 299 # N 100.0 3.5E-47 2.2E-50 275.2 26.2 273 352-632 1-297 (299) 33 protein:vir:4600 Length: 415 # 100.0 3.9E-45 2.4E-48 264.1 36.7 387 208-632 1-403 (415) 34 protein:vir:4700 Length: 415 # 100.0 3.9E-45 2.4E-48 264.1 36.7 387 208-632 1-403 (415) 35 protein:vir:9410 Length: 415 # 100.0 4.7E-45 2.9E-48 263.6 36.3 384 218-632 1-403 (415) 36 protein:vir:81100 Length: 415 100.0 5.3E-45 3.3E-48 263.3 36.5 387 208-632 1-403 (415) 37 protein:vir:79987 Length: 415 100.0 5.3E-45 3.3E-48 263.3 36.5 387 208-632 1-403 (415) 38 protein:vir:98339 Length: 415 100.0 5.3E-45 3.3E-48 263.3 36.5 387 208-632 1-403 (415) 39 protein:vir:97148 Length: 324 100.0 2.2E-46 1.4E-49 270.9 27.6 295 324-632 1-314 (324) 40 protein:vir:94673 Length: 419 100.0 7.5E-45 4.6E-48 262.5 35.5 391 197-632 1-416 (419) 41 protein:vir:94142 Length: 304 100.0 1.9E-46 1.2E-49 271.3 26.4 278 349-632 1-304 (304) 42 protein:vir:105905 Length: 304 100.0 1.9E-46 1.2E-49 271.3 26.4 278 349-632 1-304 (304) 43 protein:vir:6212 Length: 434 # 100.0 1.3E-44 8E-48 261.2 34.3 408 183-632 1-430 (434) 44 protein:vir:101607 Length: 379 100.0 2.1E-44 1.3E-47 260.1 35.1 365 222-632 1-378 (379) 45 protein:vir:9574 Length: 300 # 100.0 6.3E-46 3.9E-49 268.4 26.4 269 358-632 1-299 (300) 46 protein:vir:1025 Length: 408 # 100.0 4.2E-44 2.6E-47 258.4 35.2 370 208-632 1-392 (408) 47 protein:vir:78830 Length: 324 100.0 1.7E-45 1.1E-48 266.0 27.1 295 324-632 1-314 (324) 48 protein:vir:96392 Length: 324 100.0 1.7E-45 1.1E-48 266.0 27.1 295 324-632 1-314 (324) 49 protein:vir:9309 Length: 324 # 100.0 2.6E-45 1.6E-48 265.0 28.1 295 329-632 1-314 (324) 50 protein:vir:80684 Length: 315 100.0 1.2E-45 7.2E-49 266.9 25.6 271 357-632 1-305 (315) 51 protein:vir:95763 Length: 297 100.0 1.8E-45 1.1E-48 265.9 26.5 277 349-632 1-295 (297) 52 protein:vir:4997 Length: 397 # 100.0 9E-44 5.6E-47 256.6 35.4 360 218-632 1-384 (397) 53 protein:vir:4953 Length: 397 # 100.0 9.6E-44 5.9E-47 256.4 35.5 363 218-632 1-384 (397) 54 protein:vir:2344 Length: 397 # 100.0 1.9E-45 1.2E-48 265.8 25.9 277 347-632 1-305 (397) 55 protein:vir:4226 Length: 326 # 100.0 2.3E-45 1.4E-48 265.3 26.2 287 316-632 1-322 (326) 56 protein:vir:80128 Length: 466 100.0 1.4E-43 8.4E-47 255.6 35.4 419 183-632 1-447 (466) 57 protein:vir:99749 Length: 324 100.0 5.8E-45 3.6E-48 263.1 26.9 295 324-632 1-314 (324) 58 protein:vir:3991 Length: 404 # 100.0 2.8E-43 1.7E-46 253.9 35.8 369 213-632 1-392 (404) 59 protein:vir:7409 Length: 408 # 100.0 3.4E-43 2.1E-46 253.4 36.1 369 208-632 1-392 (408) 60 protein:vir:2430 Length: 318 # 100.0 6E-45 3.7E-48 263.0 26.3 282 344-632 1-312 (318) 61 protein:vir:103955 Length: 324 100.0 8E-45 5E-48 262.3 26.7 295 324-632 1-314 (324) 62 protein:vir:1268 Length: 397 # 100.0 3.5E-43 2.2E-46 253.3 35.2 375 213-632 1-396 (397) 63 protein:vir:4830 Length: 397 # 100.0 7.4E-43 4.6E-46 251.6 35.9 360 218-632 1-384 (397) 64 protein:vir:96223 Length: 324 100.0 1.8E-44 1.1E-47 260.4 26.9 295 324-632 1-314 (324) 65 protein:vir:81160 Length: 371 100.0 4.7E-43 2.9E-46 252.6 34.7 345 217-632 1-370 (371) 66 protein:vir:98635 Length: 377 100.0 2.3E-44 1.4E-47 259.8 27.3 346 245-632 1-376 (377) 67 protein:vir:100632 Length: 381 100.0 6.2E-44 3.8E-47 257.5 28.6 344 250-632 1-367 (381) 68 protein:vir:8187 Length: 311 # 100.0 2.9E-44 1.8E-47 259.2 25.7 268 359-632 1-309 (311) 69 protein:vir:95963 Length: 395 100.0 6.3E-43 3.9E-46 251.9 32.0 349 245-632 1-375 (395) 70 protein:vir:9759 Length: 303 # 100.0 6.3E-44 3.9E-47 257.4 26.2 268 359-632 1-302 (303) 71 protein:vir:101291 Length: 381 100.0 2.9E-43 1.8E-46 253.8 29.5 344 250-632 1-367 (381) 72 protein:vir:9509 Length: 381 # 100.0 2.9E-43 1.8E-46 253.8 29.5 344 250-632 1-367 (381) 73 protein:vir:78223 Length: 333 100.0 1.2E-43 7.2E-47 256.0 27.2 282 346-632 1-331 (333) 74 protein:vir:107593 Length: 392 100.0 3.2E-42 2E-45 248.1 34.1 357 217-632 1-383 (392) 75 protein:vir:102873 Length: 392 100.0 3.2E-42 2E-45 248.1 34.1 357 217-632 1-383 (392) 76 protein:vir:105004 Length: 392 100.0 3.2E-42 2E-45 248.1 34.1 357 217-632 1-383 (392) 77 protein:vir:102082 Length: 392 100.0 3.2E-42 2E-45 248.1 34.1 357 217-632 1-383 (392) 78 protein:vir:104085 Length: 320 100.0 1.4E-43 8.4E-47 255.6 26.1 282 344-632 1-316 (320) 79 protein:vir:2504 Length: 305 # 100.0 1.3E-43 7.9E-47 255.7 25.9 272 357-632 1-297 (305) 80 protein:vir:3845 Length: 395 # 100.0 1.3E-41 8.3E-45 244.7 36.4 362 213-632 1-382 (395) 81 protein:vir:78523 Length: 338 100.0 3.6E-43 2.2E-46 253.3 27.2 282 348-632 1-334 (338) 82 protein:vir:1383 Length: 421 # 100.0 8.8E-42 5.5E-45 245.7 34.5 368 211-632 1-382 (421) 83 protein:vir:9643 Length: 377 # 100.0 1.3E-42 8.3E-46 250.2 29.8 346 245-632 1-376 (377) 84 protein:vir:93881 Length: 387 100.0 5.5E-42 3.4E-45 246.8 32.7 373 218-632 1-380 (387) 85 protein:vir:1638 Length: 298 # 100.0 5.5E-43 3.4E-46 252.3 26.2 266 361-632 1-298 (298) 86 protein:vir:9361 Length: 402 # 100.0 5.2E-42 3.2E-45 246.9 31.5 389 197-632 1-395 (402) 87 protein:vir:2685 Length: 387 # 100.0 6.9E-42 4.3E-45 246.3 31.4 374 218-632 1-380 (387) 88 protein:vir:96978 Length: 387 100.0 6.9E-42 4.3E-45 246.3 31.4 374 218-632 1-380 (387) 89 protein:vir:94424 Length: 387 100.0 6.9E-42 4.3E-45 246.3 31.4 374 218-632 1-380 (387) 90 protein:vir:3870 Length: 400 # 100.0 4E-41 2.5E-44 242.1 34.9 381 186-632 1-398 (400) 91 protein:vir:78640 Length: 352 100.0 6.6E-42 4.1E-45 246.3 29.5 341 248-632 1-345 (352) 92 protein:vir:9704 Length: 394 # 100.0 4.2E-41 2.6E-44 241.9 33.7 377 182-632 1-389 (394) 93 protein:vir:100884 Length: 389 100.0 9.1E-41 5.7E-44 240.1 34.8 360 218-632 1-381 (389) 94 protein:vir:100172 Length: 394 100.0 8.1E-41 5E-44 240.4 34.3 363 218-632 1-383 (394) 95 protein:vir:94771 Length: 298 100.0 2.5E-42 1.5E-45 248.7 25.9 266 361-632 1-298 (298) 96 protein:vir:78350 Length: 383 100.0 1.2E-41 7.7E-45 244.9 29.5 350 247-632 1-374 (383) 97 protein:vir:99920 Length: 311 100.0 4.8E-42 3E-45 247.1 25.8 271 358-632 1-311 (311) 98 protein:vir:4856 Length: 293 # 100.0 5.2E-42 3.2E-45 246.9 25.9 259 353-632 1-280 (293) 99 protein:vir:1084 Length: 437 # 100.0 1.2E-39 7.6E-43 233.9 37.6 407 181-632 1-426 (437) 100 protein:vir:962 Length: 397 # 100.0 3.9E-39 2.4E-42 231.2 35.0 380 197-632 1-396 (397) 101 protein:vir:4197 Length: 314 # 100.0 2.9E-33 1.8E-36 198.9 23.8 282 344-632 1-312 (314) 102 protein:vir:4159 Length: 315 # 100.0 3.7E-33 2.3E-36 198.4 22.4 285 333-630 1-315 (315) 103 protein:vir:79548 Length: 652 100.0 1E-31 6.3E-35 190.5 25.2 545 63-630 1-652 (652) 104 protein:vir:3158 Length: 321 # 100.0 4.7E-30 2.9E-33 181.4 24.4 289 333-632 1-310 (321) 105 protein:vir:95512 Length: 693 99.9 2.8E-29 1.7E-32 177.1 24.4 579 1-631 14-693 (693) 106 protein:vir:9820 Length: 272 # 99.9 6.4E-28 3.9E-31 169.7 23.9 257 358-632 1-268 (272) 107 protein:vir:3033 Length: 272 # 99.9 6.4E-28 3.9E-31 169.7 23.9 257 358-632 1-268 (272) 108 protein:vir:93742 Length: 274 99.8 1.4E-21 8.7E-25 134.9 21.5 258 358-632 1-269 (274) 109 protein:vir:8324 Length: 410 # 99.8 3.2E-22 2E-25 138.4 13.6 384 127-631 1-410 (410) 110 protein:vir:103886 Length: 302 99.8 6.5E-21 4E-24 131.3 16.7 270 357-632 1-301 (302) 111 protein:vir:96123 Length: 274 99.8 6.3E-20 3.9E-23 125.8 21.0 258 358-632 1-269 (274) 112 protein:vir:3613 Length: 272 # 99.8 4.7E-20 2.9E-23 126.5 19.6 257 358-632 1-271 (272) 113 protein:vir:80930 Length: 278 99.8 1.3E-19 7.9E-23 124.2 20.7 265 357-632 1-276 (278) 114 protein:vir:105334 Length: 276 99.8 1.3E-19 8E-23 124.1 20.4 258 358-632 1-269 (276) 115 protein:vir:97433 Length: 274 99.8 2.8E-19 1.7E-22 122.3 21.8 258 358-632 1-269 (274) 116 protein:vir:94494 Length: 274 99.8 2.8E-19 1.7E-22 122.3 21.8 258 358-632 1-269 (274) 117 protein:vir:96833 Length: 275 99.8 1.5E-19 9.1E-23 123.8 20.1 259 355-632 1-270 (275) 118 protein:vir:1239 Length: 274 # 99.7 4.1E-18 2.6E-21 115.9 20.9 258 358-632 1-269 (274) 119 protein:vir:95898 Length: 274 99.7 7.1E-18 4.4E-21 114.6 21.2 258 358-632 1-269 (274) 120 protein:vir:96262 Length: 274 99.7 7.1E-18 4.4E-21 114.6 21.2 258 358-632 1-269 (274) 121 protein:vir:94933 Length: 330 99.6 2E-16 1.3E-19 106.6 16.7 293 309-632 1-328 (330) 122 protein:vir:79928 Length: 393 99.5 3E-15 1.9E-18 100.2 20.2 340 258-632 1-380 (393) 123 protein:vir:95107 Length: 270 99.5 2.2E-15 1.4E-18 100.9 19.2 255 360-632 1-264 (270) 124 protein:vir:739 Length: 231 # 99.5 3E-15 1.9E-18 100.2 16.5 219 393-632 1-230 (231) 125 protein:vir:108211 Length: 318 99.3 9.7E-14 6E-17 91.9 15.2 272 353-632 1-316 (318) 126 protein:vir:7990 Length: 273 # 99.3 5E-13 3.1E-16 88.0 18.5 257 358-632 1-272 (273) 127 protein:vir:102605 Length: 273 99.3 1.5E-12 9E-16 85.5 18.8 257 363-632 1-272 (273) 128 protein:vir:105822 Length: 273 99.3 1.5E-12 9E-16 85.5 18.8 257 363-632 1-272 (273) 129 protein:vir:99424 Length: 360 99.2 1.2E-11 7.3E-15 80.5 20.4 293 329-632 1-356 (360) 130 protein:vir:97255 Length: 310 99.2 1.9E-11 1.2E-14 79.4 20.7 267 357-632 1-309 (310) 131 protein:vir:94622 Length: 341 99.1 4.5E-12 2.8E-15 82.8 15.0 280 347-632 1-338 (341) 132 protein:vir:80180 Length: 381 99.0 1.8E-10 1.1E-13 73.9 17.2 279 347-632 1-303 (381) 133 protein:vir:94576 Length: 347 98.9 1.1E-10 7E-14 75.1 15.9 285 343-632 1-346 (347) 134 protein:vir:78739 Length: 332 98.9 1.3E-10 8.1E-14 74.8 15.1 284 345-631 1-332 (332) 135 protein:vir:100057 Length: 375 98.9 1E-09 6.2E-13 69.9 18.8 285 347-632 1-369 (375) 136 protein:vir:8885 Length: 347 # 98.9 3.2E-10 2E-13 72.6 15.7 284 343-632 1-345 (347) 137 protein:vir:80213 Length: 334 98.8 3E-10 1.8E-13 72.8 14.6 285 345-632 1-331 (334) 138 protein:vir:3364 Length: 347 # 98.8 4.7E-10 2.9E-13 71.7 15.5 287 343-632 1-344 (347) 139 protein:vir:2201 Length: 345 # 98.8 1E-09 6.3E-13 69.9 16.6 282 347-632 1-344 (345) 140 protein:vir:5974 Length: 324 # 98.8 2.5E-09 1.5E-12 67.8 18.4 262 357-632 1-288 (324) 141 protein:vir:103323 Length: 364 98.8 9.2E-09 5.7E-12 64.6 20.5 281 347-632 1-338 (364) 142 protein:vir:94711 Length: 347 98.7 6.2E-10 3.9E-13 71.1 13.0 284 343-632 1-345 (347) 143 protein:vir:10450 Length: 344 98.7 1.5E-09 9.3E-13 69.0 14.9 286 339-632 1-343 (344) 144 protein:vir:1991 Length: 305 # 98.7 2.8E-10 1.7E-13 73.0 10.6 209 347-571 1-305 (305) 145 protein:vir:3136 Length: 322 # 98.7 4.7E-09 2.9E-12 66.3 17.1 268 357-632 1-317 (322) 146 protein:vir:6324 Length: 335 # 98.7 7.8E-09 4.9E-12 65.0 18.2 281 347-632 1-327 (335) 147 protein:vir:78935 Length: 335 98.7 8.9E-09 5.5E-12 64.7 17.8 280 347-632 1-327 (335) 148 protein:vir:102944 Length: 330 98.7 1.6E-08 9.8E-12 63.4 18.9 265 358-632 1-295 (330) 149 protein:vir:1583 Length: 351 # 98.7 1E-08 6.3E-12 64.4 17.7 262 357-632 1-292 (351) 150 protein:vir:99675 Length: 324 98.6 5.1E-09 3.2E-12 66.1 15.8 235 392-632 1-295 (324) 151 protein:vir:1541 Length: 347 # 98.6 5E-09 3.1E-12 66.1 15.8 285 334-632 1-344 (347) 152 protein:vir:102655 Length: 322 98.5 4.5E-08 2.8E-11 60.9 18.3 278 352-632 1-320 (322) 153 protein:vir:93858 Length: 400 98.5 2.4E-07 1.5E-10 56.8 20.3 377 230-631 1-400 (400) 154 protein:vir:103285 Length: 296 98.3 3E-07 1.9E-10 56.3 17.9 269 357-631 1-296 (296) 155 protein:vir:9927 Length: 295 # 98.3 2.5E-07 1.5E-10 56.8 16.5 253 357-632 1-287 (295) 156 protein:vir:80068 Length: 301 98.3 7.2E-07 4.5E-10 54.3 18.7 269 359-631 1-301 (301) 157 protein:vir:97031 Length: 402 98.2 9.8E-08 6.1E-11 59.0 13.5 282 347-632 1-332 (402) 158 protein:vir:106647 Length: 303 98.2 4E-07 2.5E-10 55.7 15.8 259 354-632 1-295 (303) 159 protein:vir:7019 Length: 401 # 98.1 5.3E-07 3.3E-10 55.0 14.1 282 347-632 1-332 (401) 160 protein:vir:105645 Length: 400 98.1 3.8E-07 2.3E-10 55.8 13.3 282 347-632 1-332 (400) 161 protein:vir:99075 Length: 392 98.0 1.4E-06 8.4E-10 52.8 16.0 259 358-632 1-303 (392) 162 protein:vir:107687 Length: 319 97.9 6.9E-06 4.3E-09 48.9 17.9 291 326-631 1-319 (319) 163 protein:vir:95451 Length: 313 97.9 2.5E-06 1.6E-09 51.3 15.4 271 357-632 1-310 (313) 164 protein:vir:9875 Length: 296 # 97.9 6.8E-06 4.2E-09 48.9 17.0 260 341-632 1-294 (296) 165 protein:vir:104342 Length: 314 97.8 5E-06 3.1E-09 49.6 16.0 287 326-631 1-314 (314) 166 protein:vir:8843 Length: 317 # 97.8 1.4E-05 8.8E-09 47.2 17.5 270 355-632 1-314 (317) 167 protein:vir:79642 Length: 329 97.6 2.4E-05 1.5E-08 45.9 16.7 296 321-632 1-327 (329) 168 protein:vir:108303 Length: 418 97.6 3.6E-05 2.2E-08 44.9 19.5 258 361-632 1-315 (418) 169 protein:vir:96792 Length: 315 97.6 4.4E-05 2.7E-08 44.5 18.0 261 357-632 1-280 (315) 170 protein:vir:95318 Length: 328 97.0 0.00012 7.1E-08 42.2 14.1 224 342-579 1-328 (328) 171 protein:vir:5255 Length: 304 # 97.0 0.00015 9.6E-08 41.5 14.7 263 363-630 1-304 (304) 172 protein:vir:94800 Length: 319 96.9 0.00027 1.7E-07 40.1 18.9 278 309-632 1-295 (319) 173 protein:vir:97331 Length: 319 96.9 0.00027 1.7E-07 40.1 18.9 278 309-632 1-295 (319) 174 protein:vir:80446 Length: 367 96.8 0.00035 2.2E-07 39.5 16.8 269 354-632 1-321 (367) 175 protein:vir:95131 Length: 325 96.7 0.0004 2.5E-07 39.2 17.3 260 358-632 1-292 (325) 176 protein:vir:3525 Length: 423 # 96.5 0.00056 3.5E-07 38.4 17.4 258 358-632 1-300 (423) 177 protein:vir:98525 Length: 331 96.3 0.00072 4.5E-07 37.8 14.0 225 347-579 1-331 (331) 178 protein:vir:107826 Length: 331 96.3 0.00072 4.5E-07 37.8 14.0 225 347-579 1-331 (331) 179 protein:vir:107388 Length: 331 96.3 0.00072 4.5E-07 37.8 14.0 225 347-579 1-331 (331) 180 protein:vir:99228 Length: 304 96.2 9.9E-05 6.1E-08 42.5 8.8 209 347-572 1-304 (304) 181 protein:vir:79246 Length: 304 96.2 9.9E-05 6.2E-08 42.5 8.8 209 347-572 1-304 (304) 182 protein:vir:103759 Length: 330 96.2 0.00089 5.5E-07 37.3 13.8 225 347-579 1-330 (330) 183 protein:vir:78387 Length: 349 95.7 0.0016 9.8E-07 35.9 20.1 264 358-632 1-306 (349) 184 protein:vir:107120 Length: 329 95.7 0.0017 1E-06 35.8 17.9 289 317-632 1-306 (329) 185 protein:vir:174 Length: 423 # 95.6 0.0019 1.1E-06 35.6 18.2 257 358-632 1-305 (423) 186 protein:vir:1781 Length: 221 # 95.5 0.0012 7.6E-07 36.5 12.1 178 441-632 1-201 (221) 187 protein:vir:7324 Length: 335 # 95.4 0.0021 1.3E-06 35.2 13.3 225 342-579 1-335 (335) 188 protein:vir:95875 Length: 401 95.1 0.0028 1.7E-06 34.6 14.8 282 346-632 1-399 (401) 189 protein:vir:94989 Length: 349 94.8 0.0034 2.1E-06 34.1 20.9 264 358-632 1-306 (349) 190 protein:vir:105374 Length: 423 93.8 0.0062 3.9E-06 32.7 17.8 260 358-632 1-305 (423) 191 protein:vir:101557 Length: 336 93.6 0.0068 4.2E-06 32.4 13.6 307 299-631 1-336 (336) 192 protein:vir:3643 Length: 336 # 93.6 0.0071 4.4E-06 32.4 12.5 306 299-631 1-336 (336) 193 protein:vir:96079 Length: 382 91.9 0.014 8.6E-06 30.8 11.6 322 270-631 1-382 (382) 194 protein:vir:94070 Length: 339 91.6 0.015 9.3E-06 30.6 13.8 303 305-631 1-339 (339) 195 protein:vir:78558 Length: 336 91.2 0.017 1.1E-05 30.3 13.0 307 299-631 1-336 (336) 196 protein:vir:270 Length: 341 # 87.3 0.039 2.4E-05 28.3 15.6 289 326-632 1-333 (341) 197 protein:vir:105522 Length: 423 87.3 0.04 2.5E-05 28.3 19.0 258 358-632 1-331 (423) 198 protein:vir:1153 Length: 338 # 86.8 0.043 2.7E-05 28.1 17.9 289 330-632 1-335 (338) 199 protein:vir:106734 Length: 336 84.6 0.059 3.7E-05 27.3 12.0 307 299-631 1-336 (336) 200 protein:vir:79008 Length: 299 84.1 0.063 3.9E-05 27.2 19.6 261 357-632 1-297 (299) 201 protein:vir:79157 Length: 339 83.9 0.065 4E-05 27.1 17.4 289 330-632 1-339 (339) 202 protein:vir:1829 Length: 355 # 80.9 0.091 5.6E-05 26.3 17.7 291 326-632 1-347 (355) 203 protein:vir:99576 Length: 388 80.2 0.097 6E-05 26.1 8.6 331 270-631 1-388 (388) 204 protein:vir:78777 Length: 358 79.2 0.11 6.7E-05 25.9 16.1 295 326-632 1-345 (358) 205 protein:vir:107732 Length: 379 77.0 0.13 8E-05 25.5 13.5 316 270-631 1-379 (379) 206 protein:vir:104011 Length: 337 76.1 0.14 8.6E-05 25.3 18.4 288 330-632 1-336 (337) 207 protein:vir:98566 Length: 355 75.1 0.15 9.3E-05 25.1 17.7 289 330-632 1-347 (355) 208 protein:vir:79171 Length: 337 74.9 0.15 9.5E-05 25.1 18.3 288 330-632 1-336 (337) 209 protein:vir:78186 Length: 337 72.7 0.18 0.00011 24.7 17.4 288 330-632 1-336 (337) 210 protein:vir:6061 Length: 357 # 70.7 0.21 0.00013 24.3 16.7 291 330-632 1-349 (357) 211 protein:vir:5694 Length: 357 # 69.3 0.22 0.00014 24.1 16.5 291 330-632 1-349 (357) 212 protein:vir:861 Length: 318 # 66.1 0.27 0.00017 23.7 10.0 297 309-631 1-318 (318) 213 protein:vir:100331 Length: 342 65.4 0.28 0.00018 23.6 17.5 288 330-632 1-337 (342) 214 protein:vir:3746 Length: 336 # 65.3 0.29 0.00018 23.6 18.3 287 329-632 1-329 (336) 215 protein:vir:2016 Length: 357 # 65.2 0.29 0.00018 23.6 16.5 291 330-632 1-349 (357) 216 protein:vir:348 Length: 321 # 63.5 0.32 0.0002 23.3 14.5 275 326-632 1-320 (321) 217 protein:vir:79712 Length: 285 63.1 0.32 0.0002 23.3 18.2 254 363-632 1-284 (285) 218 protein:vir:98856 Length: 343 60.6 0.37 0.00023 22.9 18.5 292 326-632 1-332 (343) 219 protein:vir:93966 Length: 400 60.4 0.37 0.00023 22.9 13.8 376 230-631 1-400 (400) 220 protein:vir:1663 Length: 393 # 59.1 0.4 0.00025 22.8 12.5 367 217-631 1-393 (393) 221 protein:vir:100603 Length: 529 53.5 0.53 0.00033 22.1 18.9 339 245-632 1-528 (529) 222 protein:vir:99311 Length: 463 52.9 0.54 0.00034 22.0 15.3 292 309-632 1-338 (463) 223 protein:vir:95603 Length: 463 52.9 0.54 0.00034 22.0 15.3 292 309-632 1-338 (463) 224 protein:vir:106286 Length: 534 50.0 0.62 0.00039 21.7 19.3 343 252-632 1-533 (534) 225 protein:vir:103463 Length: 521 49.0 0.65 0.0004 21.6 19.4 341 250-632 1-520 (521) 226 protein:vir:3783 Length: 336 # 47.7 0.69 0.00043 21.5 19.4 287 329-632 1-329 (336) 227 protein:vir:6901 Length: 522 # 46.6 0.73 0.00045 21.3 18.4 342 238-632 1-521 (522) 228 protein:vir:98143 Length: 524 39.7 1 0.00062 20.6 17.8 343 251-632 1-523 (524) 229 protein:vir:80986 Length: 528 39.6 1 0.00063 20.6 19.4 339 251-632 1-527 (528) 230 protein:vir:102823 Length: 470 38.7 1.1 0.00065 20.5 16.4 284 322-632 1-325 (470) 231 protein:vir:78920 Length: 290 36.7 1.2 0.00072 20.2 18.5 256 363-632 1-289 (290) 232 protein:vir:80835 Length: 464 33.7 1.3 0.00083 19.9 12.1 291 317-632 1-331 (464) 233 protein:vir:5670 Length: 514 # 31.2 1.5 0.00094 19.6 18.3 332 263-632 1-513 (514) No 1 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=3.9e-133 Score=746.48 Aligned_cols=632 Identities=100% Similarity=1.441 Sum_probs=518.7 Q ss_pred CCCccccccchhccccceeEEEEEEEeecccCCCcEEEEEEecCcceecCCCeEEEEecchhhhhhhccCCcEEEeeCCC Q lcl|Aclame:pro 1 MPQPTKKTTVLRTIEGRELQRELRVLSDSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRLKNGAPLLDSHSL 80 (632) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~H~~ 80 (632) |||++++.+..|+++++.++|.+++++.++|+++|||+++|||++||+||+|+|+|+++++++|+++++++.||||+||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~e~l~~~~~~~~~~~~~~~~~~l~~H~~ 80 (632) T protein:vir:96 1 MPQPTKKTTVLRTIEGRELQRELRVLSDSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRLKNGAPLLDSHSL 80 (632) T ss_pred CCCcCCCCCccccccCceeeeEEeeeeccccccccEEEEEEecCCccccccCcccccccccccchhhccCCCeeeccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEeeeecccceEEEEEeCCChhHHHHHHHHhcCCcceeeeeEEEeecccccCCCCeeEEEEEEeeeeccCccc Q lcl|Aclame:pro 81 REQIGVVEEVWLDDDRRLRARVRFSRSAKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLIS 160 (632) Q Consensus 81 ~~~iG~~~~~~~e~~~gl~~~~~~~~~~~~~~~~~~v~~G~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~l~EvS~v~ 160 (632) ++|||+|.++++|+++||+++++|++++.++++|++|++|.|++|||||+|++|+++..++++++|++++|+|+|||+|+ T Consensus 81 ~~~iG~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~~~EiS~v~ 160 (632) T protein:vir:96 81 REQIGVVEEVWLDDDRRLRARVRFSRSAKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLIS 160 (632) T ss_pred CCcceEEEEEEEeCCceEEEEEEeCCChhHHHHHHHHhcCcccceeeeeeeeeeeeecCCCCcceEEEEEEEEEEEEEee Confidence 99999999999999999999999999999999999999999999999999999999988888999999999999999999 Q ss_pred ccccccceeeeeeccchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 161 VPADPTVGVGRSIDIGNITIRGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQ 240 (632) Q Consensus 161 ~pa~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 240 (632) +||||+|.|+++...........+.............................+...............++...+..... T Consensus 161 ~pAd~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~ 240 (632) T protein:vir:96 161 VPADPTVGVGRSIDIGNITIRGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQ 240 (632) T ss_pred cCCCCcceeeeeccccccccccccccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHH Confidence 99999999988776554443333333222222111111111111111111112222222233333445566666777777 Q ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 241 QFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGD 320 (632) Q Consensus 241 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (632) +++..++.++++..+...+..+....+..............................................+....+. T Consensus 241 ~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~ 320 (632) T protein:vir:96 241 QFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGD 320 (632) T ss_pred HhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccc Confidence 77777888889999988888888777766555444333332222222222222212111122222222222233333333 Q ss_pred hhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCc Q lcl|Aclame:pro 321 WSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLV 400 (632) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 400 (632) ...............+..++...........++.+++..++.++||++||++++.+.|++.+++.+++.+++++.+++.. T Consensus 321 ~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~ 400 (632) T protein:vir:96 321 WSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLV 400 (632) T ss_pred hhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCC Confidence 33333333444455555555555556667778889999999999999999999999999999999999999899999988 Q ss_pred eeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 401 GDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT 480 (632) Q Consensus 401 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~ 480 (632) +.+.+|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+.++++++|.+.|+.++++++|.++|+ T Consensus 401 g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~ 480 (632) T protein:vir:96 401 GDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT 480 (632) T ss_pred cceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeeccccc Q lcl|Aclame:pro 481 GTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEV 560 (632) Q Consensus 481 g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~~l 560 (632) |+|++++|.||++.+++.+++.+.+.++++.+.++..++...+.+..++.|+|++....++...+++|.+|+|+|.+++| T Consensus 481 G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l 560 (632) T protein:vir:96 481 GTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEV 560 (632) T ss_pred ccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecCCee Confidence 99998999999999999888888888999999999999999988888889999999998898888999999999999999 Q ss_pred cCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 561 NGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 561 ~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +|+||++++++|.++++||||+.|.+++++++++.++++++|.+|++.|+++.|+|++|++|++|+++|++| T Consensus 561 ~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 561 NGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred cccceEeccccccCcEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.5e-87 Score=496.66 Aligned_cols=586 Identities=17% Similarity=0.196 Sum_probs=349.9 Q ss_pred cccceeEEEEEEEeecccCCCcEEEEEEecCcceecCCCeEEEEecchhhhhhhccCCcEEEeeCCCCCceEEEEEeeee Q lcl|Aclame:pro 14 IEGRELQRELRVLSDSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRLKNGAPLLDSHSLREQIGVVEEVWLD 93 (632) Q Consensus 14 ~~~~~~~~~~~~~~~~~d~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~iG~~~~~~~e 93 (632) |+++ +....+..+++|+++|+|+||+++++ ++|||.. +.|++++.+ +.+||||+||+++|||+|.. . + T Consensus 1 m~~~--~~~~~~~~k~~~~~~~~~~g~as~~~-~d~~gd~----i~~~~~~~~---~~~~~l~~H~~~~~iG~~~~-~-~ 68 (645) T protein:vir:93 1 MTLK--RACSLLTVKSFSEDERVITGIASTPS-PDRDGDI----LEPEGAEFG---SALPFLWQHDHSRPVGQCTV-R-R 68 (645) T ss_pred Cccc--ceeceeeEEeeecCceEEEEEEecCC-ccccCce----echhhhccc---CCceeeeccCCCCceeEEEE-E-e Confidence 3332 22223556788999999999999876 6888843 678998754 45789999999999999973 3 6 Q ss_pred cccceEEEEEeCCC---------hhHHHHHHHHhcCCcceeeeeEEEeecccccCCCCeeEEEEEEeeeeccCccccccc Q lcl|Aclame:pro 94 DDRRLRARVRFSRS---------AKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLISVPAD 164 (632) Q Consensus 94 ~~~gl~~~~~~~~~---------~~~~~~~~~v~~G~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~ 164 (632) +++||++++++..+ +.++++|++||+|.|++|||||++++|++.+.++ +++++|+|+|||+|++||| T Consensus 69 ~~~gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~G~~~~~SiG~~~~~~~~~~~~~----~~i~~~~l~EiS~V~~pAn 144 (645) T protein:vir:93 69 VSEGLEITATLAKPVPDMPSQLAARLDEAWAAIKTGLVRGLSVGFRPHEYTFLDGGG----LHFLRWELMEVSAVTVPAN 144 (645) T ss_pred cCCceEEEEEecccccccccchHHHHHHHHHHHhcCcccceeeeeEEeeeeeecCCC----eEEEEEEEEEEeeeccCCC Confidence 77899999999532 4799999999999999999999999998876544 6799999999999999999 Q ss_pred ccceeeeeeccchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 165 PTVGVGRSIDIGNITIRGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAP-AASGANENDILSRERTRISEITAIGQQFS 243 (632) Q Consensus 165 ~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~~~~~~~~~~~~ 243 (632) |+|.|...+......................................... ........+..+...+++..........+ T Consensus 145 ~~a~v~~~ks~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g 224 (645) T protein:vir:93 145 AECTIRTIKSYDRQFSAASGNRKPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEG 224 (645) T ss_pred CcchhhhhhhccchhhhhhhhhcchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhc Confidence 99999876543221111110000000000000000000000000000000 00000000000000000000000000000 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhh--hhhhhhhhHHHHh----hhhhhhhh---hhHHHHHhhhhhhhhhh Q lcl|Aclame:pro 244 QRSLAQEAIQKGHTVDQFRALVLERMNPGQPGN--FEKPGAGDLPGKP----AIHSARDL---GIQHKELQQYSLMRAIN 314 (632) Q Consensus 244 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~----~~~~~~~~---~~~~~~~~~~~~~~~~~ 314 (632) ..+.. ......+.....+........... ............. ........ ................+ T Consensus 225 -~~l~a---ee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~ 300 (645) T protein:vir:93 225 -RTLDV---EEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAK 300 (645) T ss_pred -cccCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHH Confidence 00000 000001111111000000000000 0000000000000 00000000 00000000000000000 Q ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcce Q lcl|Aclame:pro 315 AAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR 394 (632) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~ 394 (632) +.... ...........+..+...................+ +...+|.++.++.+...|++.+++.+++++++.+ T Consensus 301 al~~~-----~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~-~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~ 374 (645) T protein:vir:93 301 SLAAA-----KGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTT-DPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQG 374 (645) T ss_pred HHHhc-----ccchhHHHHHHHhhcccchhhhhhhhhhhhccccc-cccccCCccCchhhHHHHHHhhhhhhhHHhhccc Confidence 00000 00000000111111000000000000111111222 2223355566666778899999999999999877 Q ss_pred eeccC---ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 MLPGL---VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIG 471 (632) Q Consensus 395 ~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a 471 (632) .++.. .+.+++|+.++.+.+.||+|++.+++++++|+++++++++++++++||+|+|.|+.++++++|.+.++++++ T Consensus 375 ~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia 454 (645) T protein:vir:93 375 GIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNALAEAVV 454 (645) T ss_pred cccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHH Confidence 66643 346789999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcCCCc---cccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc Q lcl|Aclame:pro 472 VALDLAMLTGTGLA---NDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD 548 (632) Q Consensus 472 ~~~~~~~~~g~g~~---~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 548 (632) +++|.++++|++++ ..|.|+++... . ...+....+++..++..+..++....++.|+|++..... +.+++| T Consensus 455 ~~~d~a~l~g~g~~~~~~~p~gi~~~~~--~--~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~--L~~lkd 528 (645) T protein:vir:93 455 ARLDTDFVDPKKAAVADVSPASITHDVK--G--TASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALA--LSMRKN 528 (645) T ss_pred HHHHHHhhcCCCcccCCccccceecccc--c--cccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHH--HHhccc Confidence 99999999987654 45778765332 1 222233456777887777777666667889999887654 468899 Q ss_pred cCCceeec-----cccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEeccc----------------------c Q lcl|Aclame:pro 549 NTGERIWQ-----NNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYT----------------------K 601 (632) Q Consensus 549 ~~g~~~~~-----~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~----------------------~ 601 (632) .+|+|+|. .++|+|+||++++++|+ .++||||+.+.++.++++.+..+++. . T Consensus 529 ~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~-~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~l 607 (645) T protein:vir:93 529 ALGQKEYPDMTLLGGSFQGLPVIVSQYVGD-QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSM 607 (645) T ss_pred cCCceeecCCCCCCceeeceeeEEeccCCc-ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhH Confidence 99999873 35899999999999986 47789999999999999988776542 2 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |.+|++.||+..|+||+++||+||++|+-.= T Consensus 608 f~~d~vaira~~r~d~~~~~p~a~~~lt~~~ 638 (645) T protein:vir:93 608 FQTGSVAIRAERWINWRRRRTAAVAVITGVN 638 (645) T ss_pred hhcCceEEEEEEEEcceeeCccceEEEeccc Confidence 8899999999999999999999999988433 No 3 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=4.4e-63 Score=362.44 Aligned_cols=501 Identities=9% Similarity=-0.005 Sum_probs=301.4 Q ss_pred ecccCCCcEEEEEEecCcceecCCCeEEEEecchhhhhhhc-cCCcEEEeeCCCCCceEEEEEeeeecccceEEEEEeCC Q lcl|Aclame:pro 28 DSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRL-KNGAPLLDSHSLREQIGVVEEVWLDDDRRLRARVRFSR 106 (632) Q Consensus 28 ~~~d~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~H~~~~~iG~~~~~~~e~~~gl~~~~~~~~ 106 (632) =+.+.+.++|+||++.+..+++|+.. +.|+||+.+.. .+.+||||+||+++|||++.- ..++ +||+++++|++ T Consensus 1 ~~~~~~~~~~~g~a~~~~~~d~~~~~----~~~gaf~~~~~~~~~~~~l~~Hd~~~~ig~~~~-~~~~-~Gl~~~~~~~~ 74 (517) T protein:vir:97 1 MSGTFKDGVLIGKLVDYGSIDSYNTV----FEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKV-IGRE-DGIYIEAKPNN 74 (517) T ss_pred CccccCceEEEEEEEecCCCCCCCce----EccchHHHHHhcCCCeEEeecCCCCCceEEEEE-EEec-CceEEEEeeCc Confidence 23456677999999999888887743 68999988743 356889999999999999863 3344 49999999999 Q ss_pred ChhHHHHHHHHhcCCcceeeeeEEEeecccccCCCCeeEEEEEEeeeeccCcccccccccceeeeeeccchhhhhhhhhh Q lcl|Aclame:pro 107 SAKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLISVPADPTVGVGRSIDIGNITIRGAEMP 186 (632) Q Consensus 107 ~~~~~~~~~~v~~G~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~v~~~~~~~~~~~~~~~~~ 186 (632) ++.|++++.+|++| .+|||||++.+.. + . + .++.+++++|+|||+|++|||++|.|...+........... T Consensus 75 ~~~~~~~~~~~~~g--~~~S~gf~~~~~~-~-~-~--~~~~~~~~~l~EvS~v~~pa~~~a~I~~vke~~~~e~~~~~-- 145 (517) T protein:vir:97 75 DIAYKRMKEAIDKG--AGLSVTFQPVEAS-E-V-D--GVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMT-- 145 (517) T ss_pred hHHHHHHHHHHHcC--CceEEEEEeeccc-C-C-C--CceEEEEEeeeeeeecchhhhhhhhhhhhhhhhhhhhhhhh-- Confidence 99999999999999 4999999998742 1 1 1 24678899999999999999999998754322110000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHh Q lcl|Aclame:pro 187 DKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVL 266 (632) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 266 (632) . .... ..... ....+ ..++.. ..+....... T Consensus 146 ---------~-------~~a~----------~ee~~----e~~~k------------~~el~a-------~l~~~~~~~~ 176 (517) T protein:vir:97 146 ---------F-------DQNL----------MQELL----DAKKL------------AADLNA-------KLKERENGGD 176 (517) T ss_pred ---------h-------hhhh----------hhhhh----hhhhh------------HHHHHH-------HHHHHHHHHH Confidence 0 0000 00000 00000 000000 0000000000 Q ss_pred hhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhh Q lcl|Aclame:pro 267 ERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFY 346 (632) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (632) ....+.......... ........ ........ ......... ........... .. T Consensus 177 ~~~~e~~~~l~a~~~-------~~~~~~~~---------~~~~~~~~-----~~~~~~~~~-~~~~~~~~~~~--~~--- 229 (517) T protein:vir:97 177 NAALKTVSELAANLM-------KQRESEKI---------LGVEALKV-----TPEATEFLK-TREAEVAYMSA--SL--- 229 (517) T ss_pred HHHHhhhhhhhhhHH-------HHHHhhhh---------cccccccc-----cchhhHHHH-HHHHHHHHHHh--cc--- Confidence 000000000000000 00000000 00000000 000000000 00000000000 00 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) .................+++..|..+. ..+...+...+.+... ++..+. .....+..+....+.|+.||+.+|++ T Consensus 230 -~~~~~~~~~~~~~~~~~~~~~~p~~~~-~~i~~~~~~~~~i~~~-~~~~~i--~~~~~~~~~~~~~a~~~~eG~~kp~s 304 (517) T protein:vir:97 230 -TKDPKAAWTAELKERGISGMPAPAGIL-KRIQDAVNDEGSLLPF-IRHENL--PTLVVGGDNALTQGTGHTTGTDKTES 304 (517) T ss_pred -cccccceeeeecccccccccccchHHH-HHHHHhhhhhccceee-eeeccc--cceeeecccccceeeeeecCCccccc Confidence 000000001111122234555555543 3444445444444433 122111 23344555566678899999999999 Q ss_pred cccceeeeeeeeeeeeeehhhHHHhhcChhH----HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAGAVPVTRKLRKQSSIH----VENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY 502 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~----~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~ 502 (632) +++|+++++.++++++++.+|+|+|.|+.++ +++||.+.|..+++++++.+|++|+|++.++.|++..+....... T Consensus 305 ~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~ 384 (517) T protein:vir:97 305 NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN 384 (517) T ss_pred ccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccccccccccccccccc Confidence 9999999999999999999999999987776 999999999999999999999999999888888876554222211 Q ss_pred cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCCCCCcc Q lcl|Aclame:pro 503 PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQIPADT 575 (632) Q Consensus 503 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~~~~~~ 575 (632) ....+.+.+++..+...+....++.|+||+.++.. +.++||++|+|+|++. +++|...+ .+.++.+. T Consensus 385 ---~~~~~~~~d~i~~l~~a~~~a~~a~~vmn~~t~~~--I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~-~~~~~~~~ 458 (517) T protein:vir:97 385 ---VTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAA--IRFLKDKNGNYVFPVGVSNQTIATHFGFNRL-VQSVAVDE 458 (517) T ss_pred ---ccccchHHHHHHHHHHHhhhccCCEEEECHHHHHH--HHHhhcCCCCeeccCcCCcccccccCCcccc-ccccccCc Confidence 12224455666666666665556789998887654 4688999999999763 34553111 22344455 Q ss_pred EEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 576 WIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 576 ~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+++.+.|.+..+.|+....+ +++.+|+..|+.++|.++.|+.|++|+++.+.- T Consensus 459 ~~~~~~~~y~i~~~~g~~~~~~--fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p 513 (517) T protein:vir:97 459 KTAVSLSGYVTNGSRGMEFEQG--TILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) T ss_pred eeEeeccccEEEeecceeeeee--eecccCceeEeeeeeeccccccccceEEEEEcC Confidence 5666778888888888776543 455678999999999999999999999988776 No 4 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=5.2e-56 Score=323.66 Aligned_cols=461 Identities=12% Similarity=0.032 Sum_probs=259.8 Q ss_pred CCCcEEEEEEecCcceecCCCeEEEEecchhhhhhhccCCcEEEeeCCCCCceEEEEEeeeecccceEEEEEeCCChhHH Q lcl|Aclame:pro 32 QEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRLKNGAPLLDSHSLREQIGVVEEVWLDDDRRLRARVRFSRSAKAE 111 (632) Q Consensus 32 ~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~H~~~~~iG~~~~~~~e~~~gl~~~~~~~~~~~~~ 111 (632) -+.|||+||++++..+++||+. +.+++|+ +..++|||+|| .|||+|... .|++ + +.+++.|+ T Consensus 1 ~~~~~~~G~a~~~~~~d~~gd~----~~~~a~~----~~~~~~l~~H~--~~iG~~~~~-~~~~-~------~~~t~~~~ 62 (480) T protein:vir:40 1 MKVKAVRGIANPLGTIDAHGTV----IESIANA----GDGVDILNRHR--EKIGSGFVH-LEGD-N------VILTGYVD 62 (480) T ss_pred CcceEEEEEEecCCCCCCcchh----ecccccC----CcCceeeeeCC--ceeeEEEEe-ecCC-C------CccchhHH Confidence 6789999999999889998863 4577775 34678999996 799998653 3443 3 34799999 Q ss_pred HHHHHHhcCCcceeeeeEEEeecccccCCCCeeEEEEEEeeeeccCcccccccccceeeeeeccchhhhhhhhhhhhhhh Q lcl|Aclame:pro 112 ELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLISVPADPTVGVGRSIDIGNITIRGAEMPDKDKQ 191 (632) Q Consensus 112 ~~~~~v~~G~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~v~~~~~~~~~~~~~~~~~~~~~~ 191 (632) ++|++|++|.|++|||||++.++++...++ ++.+++++|+|||+|++|||++|.|...+......... T Consensus 63 ~~~~~~k~g~~~~~Sigf~~~~~~~~~~~~---~~~~~~~~l~EvS~v~~pa~~~a~v~~vks~~~~~e~~--------- 130 (480) T protein:vir:40 63 EEQYTAEKIEETGLSVGFNANGVKAREIDG---VGYYKDVTITEVSLTPLPSNKGAKVTKVREENKGEQEQ--------- 130 (480) T ss_pred HHHHHHHcCCccceeeeeeeeecccccCCC---eEEEEEEEEEEeEEeecccchhhhhhhhhhhhhhhhhh--------- Confidence 999999999999999999999876654333 47788999999999999999999986432211000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHh Q lcl|Aclame:pro 192 TQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNP 271 (632) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 271 (632) ... ....+ .. .+................+.... T Consensus 131 -------~~~-----------------~e~~e----~~-------------------~e~~e~~~~~~el~akl~el~k~ 163 (480) T protein:vir:40 131 -------MGA-----------------NETQE----IM-------------------KQAIEAGVKVRELEAKVEELNKE 163 (480) T ss_pred -------hhh-----------------HHHHH----HH-------------------HhhhhhhhhhhhHHHHHHHHHhH Confidence 000 00000 00 00000000000000000000000 Q ss_pred hhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhH Q lcl|Aclame:pro 272 GQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEV 351 (632) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 351 (632) ..+.. ...... ........ . ........ ............. .. T Consensus 164 ----~ee~k------------~~~~~~-----~~~~~~~~---~-------~~~e~r~~-~~~~~~~~e~~~~-----~~ 206 (480) T protein:vir:40 164 ----REELK------------KEREAS-----IPSEKPED---A-------ERKFMREL-GSKMAEMPEQGFL-----RE 206 (480) T ss_pred ----HHHHh------------hhhhhh-----ccccchhh---h-------hhHHHHHH-HHHhccchhhhhh-----hh Confidence 00000 000000 00000000 0 00000000 0000000000000 00 Q ss_pred HhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCc--cc Q lcl|Aclame:pro 352 LVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSD--FD 429 (632) Q Consensus 352 ~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~--~~ 429 (632) ...+ ...... ..+..+++.+.....+ ......+...... ....+.....|++|....+... .. T Consensus 207 ~~~~-~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~------------~~~~g~~~~~~~~e~~~~~~~~~~~~ 271 (480) T protein:vir:40 207 FANG-ADLNVV-NSLGSITSKYARKSGI-YDGAMKARFQGLT------------LAEDGVDDTFISGTFKAGTDKNKSQT 271 (480) T ss_pred hhhh-cccccc-ccccccccchhhheee-chhhhhhhhhcce------------eeeccccceeeeeeeecccccccccc Confidence 0001 111111 1222333333222111 1111111111100 1111222344555543332211 11 Q ss_pred ceeeee---eeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-ccccceecccccccccccc Q lcl|Aclame:pro 430 FTTLSF---SPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAN-DPVGLLNMTGVPALTYPAG 505 (632) Q Consensus 430 ~~~~~~---~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~-~~~Gil~~a~~~~~~~~~~ 505 (632) +.+..+ .++++.....+|+++|.|+ .++++||.++|+..++++++.+|++|+|++. .+.|+..... ..+.. T Consensus 272 ~~~~~~~~~~v~~l~~~~k~t~~lLDDa-~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~----~~~~~ 346 (480) T protein:vir:40 272 ATKRSLRPQMAEAYLQMDKATVRGVNDS-GALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATD----GWTKQ 346 (480) T ss_pred cccchhhHHHHHHHHHhHHHHHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecc----ccccc Confidence 222222 2578888889999999765 4899999999999999999999999976653 3555543322 11222 Q ss_pred chhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEc-CCCCCccEE Q lcl|Aclame:pro 506 GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEAS-NQIPADTWI 577 (632) Q Consensus 506 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~-~~~~~~~~~ 577 (632) ....+.|.++++++...|+. ..+.|+||+.++.. +.++||++|+|||++ .+|+|+||+++ ..+|.+... T Consensus 347 ~~~~d~id~L~~al~~~y~~-~a~~~vmn~~t~~~--I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~ 423 (480) T protein:vir:40 347 IEYTDLFEGITDAVAECSIS-DAITIVMSPQTFAE--LRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVA 423 (480) T ss_pred chhHHHHHHHHHhhhHHhhC-CCCEEEECHHHHHH--HHHhhcCCCCeeccCcccccCcceecccceeeeeccccCCcce Confidence 23345566788999888865 33368898887654 568899999999986 47999998765 466767666 Q ss_pred EEehhhE-EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 578 FGDWSQI-VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 578 ~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+.++.| .++++ +++. .+...+..+...|..+.|+|+.+..|++|+++|+++ T Consensus 424 ~~~~~~~~~~~d~-~~~~--~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~ 476 (480) T protein:vir:40 424 VYNHDEYVLIGDL-NVEN--YNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKG 476 (480) T ss_pred eeeCCccEEEEec-ccce--ecccccccchhhhhhhhhhceeeEccccEEEEEecc Confidence 6666654 55655 4443 334455678899999999999999999999999999 No 5 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.7e-51 Score=298.91 Aligned_cols=378 Identities=16% Similarity=0.179 Sum_probs=241.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) +.........+......+++..... .....++.++.. ...+....+.....+....... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~---~~~~~~lt~e~~---~~~~~l~~e~~~l~~~i~~~~~--------------- 59 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTD---EFAGKEMTDEAR---EKEERLITAVSDYDARIKRGIE--------------- 59 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHH---HhhcccccHHHH---HHHHHHHHHHHHHHHHHHHHHH--------------- Confidence 1111111111111111111111000 000000000000 0111111111100000000000 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechh Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATE 372 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~ 372 (632) ............ ...... . ........... ....+........ ...........+...++.+++++ T Consensus 60 ~~~~~~~~~~~~-----~~~~~~----~-~~~~~~~~~~~---~~~~r~~~~~~~r-~~~~~~~~~~~t~~~~g~~~~~~ 125 (390) T protein:vir:62 60 AIKAIDPVTSLL-----SGLQGS----G-SGAQRSADVDD---DATLRAGNLGEAR-SFEFAPEKRDGTKAGNPNVLSRT 125 (390) T ss_pred HHHHHHHHHHHH-----hhcccc----c-ccchhhcchHH---HHHHhhhhhhhhH-HHHhhhhhhcccccCCCcccccc Confidence 000000000000 000000 0 00000000000 0000000000000 00001111223444556778888 Q ss_pred hhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhh Q lcl|Aclame:pro 373 LLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRK 452 (632) Q Consensus 373 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~ 452 (632) +....|.+.++..++++.++....+.....+.+|+.++.+.+.|++|++++|+++++|+++++.+++++++++||+|+|. T Consensus 126 ~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ 205 (390) T protein:vir:62 126 LYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFAT 205 (390) T ss_pred chHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHh Confidence 88888888888888887764443333445688999999999999999999999999999999999999999999999999 Q ss_pred cChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccc---cccccchhHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 453 QSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL---TYPAGGVDWASVVDMETKISTFNADAGRL 529 (632) Q Consensus 453 d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 529 (632) |+.+++.++|.+.++++++.++|.++++|+| +|.||++....... .+....+++++++++++++...|+. ++ T Consensus 206 ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G---~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~--~a 280 (390) T protein:vir:62 206 DQVLDLVGFLVSDAGPAIGDAMGRHFITGTG---QPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRA--NA 280 (390) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhhhccCC---ccccccccccccccceecccccccchHHHHHHHHhhhhhhhc--CC Confidence 9999999999999999999999999999987 48999887654332 2334567899999999999887763 57 Q ss_pred eEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEeccccc Q lcl|Aclame:pro 530 AYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKA 602 (632) Q Consensus 530 ~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~ 602 (632) .|+||+....++ .+++|.+|+|+|+++ +|+|+||++++.+|++.++||||+.|.++.++++.+.++.+.+| T Consensus 281 ~~vmn~~~~~~L--~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~ 358 (390) T protein:vir:62 281 KYVVNDLRAAQM--RKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKF 358 (390) T ss_pred EEEEchHHHHHH--HHhhccCCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEEeeccccc Confidence 899999887554 578999999999763 69999999999999999999999999999999999999999999 Q ss_pred ccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 603 ASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 603 ~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+|++.|+++.|+||++++|+||++|+++| T Consensus 359 ~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~ 388 (390) T protein:vir:62 359 STDQIVYRFLQRADGLLVDARGAKVLTVTP 388 (390) T ss_pred cCCcEEEEEEEEeCcEeechhheEEEEeec Confidence 999999999999999999999999999999 No 6 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=5.9e-51 Score=295.97 Aligned_cols=380 Identities=16% Similarity=0.164 Sum_probs=242.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) +.........+..+...+++........ ..++.++... ..+....+.....+..... .. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~---~~~~~~e~~~---~~~~l~~e~~~l~~~i~~~---------------~e 59 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFA---GKEMTAEARE---KEERLLTAVADFDGRIKRG---------------ID 59 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhh---cccccHHHHH---HHHHHHHHHHHHHHHHHHH---------------HH Confidence 1111111111111111111111111100 0111111000 0011111110000000000 00 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechh Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATE 372 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~ 372 (632) .................. ..... ........... +.+...... ...........+...++.+++++ T Consensus 60 ~~~~~~~~~~~~~~~~~~--------~~~~~-~~~~~~~~~~~----r~g~~~~~~-~~~~~~~~~~~t~~~~g~~~~~~ 125 (392) T protein:vir:13 60 AIKATDAVTSLLSGLQGS--------GSGAQ-RSADHDDDAVL----RAGNLGEAR-SFEFAPEKRDGTKAGNPNVLSRT 125 (392) T ss_pred HHHHHHHHHHHhcccCCc--------ccchh-hhhhHHHHHHH----hccchhhhH-HHHhhhhhhcccccCCCcccccc Confidence 000000000000000000 00000 00000000000 000000000 00001111223344456677888 Q ss_pred hhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhh Q lcl|Aclame:pro 373 LLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRK 452 (632) Q Consensus 373 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~ 452 (632) +..+.|.+.....++++.+.....+.....+.+++.++.+.+.|++|++++|+++++|+++++.+++++++++||+|+|. T Consensus 126 ~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ 205 (392) T protein:vir:13 126 LYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFAT 205 (392) T ss_pred chHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 77777777777777777764433334455688999999999999999999999999999999999999999999999999 Q ss_pred cChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccc---ccccchhHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 453 QSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALT---YPAGGVDWASVVDMETKISTFNADAGRL 529 (632) Q Consensus 453 d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 529 (632) |+.++++++|.+.++++++++++.++++|+|+ ++|.||++.+...+.. .....++++.+.+++..+...|+. ++ T Consensus 206 ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt-~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~--~a 282 (392) T protein:vir:13 206 DQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGT-GQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRK--NA 282 (392) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcccCC-ccccccccccccccccccccccccccHHHHHHHHHhhhhhhhc--CC Confidence 99999999999999999999999999999996 5799999876544332 334567899999999999887754 57 Q ss_pred eEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEeccccc Q lcl|Aclame:pro 530 AYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKA 602 (632) Q Consensus 530 ~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~ 602 (632) .|+||+..... +.+++|.+|+|+|++ .+|+|+||++++++|+++++||||+.|.++.++++.+..+.+.+| T Consensus 283 ~~v~n~~~~~~--l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~ 360 (392) T protein:vir:13 283 KFVVNDLRAAQ--MRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKF 360 (392) T ss_pred EEEEcHHHHHH--HHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccc Confidence 89999887654 457899999999975 369999999999999999999999999999999999999999999 Q ss_pred ccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 603 ASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 603 ~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+|++.||++.|+|++++||+||++++++| T Consensus 361 ~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~ 390 (392) T protein:vir:13 361 STDQIVYRFLQRADGLLVDARGAKVLTVTP 390 (392) T ss_pred cCCcEEEEEEEEeccEEecccceEEEEeec Confidence 999999999999999999999999988888 No 7 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.3e-50 Score=294.16 Aligned_cols=372 Identities=17% Similarity=0.199 Sum_probs=242.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) .....+ ..+.+.........+. +. .....+..+..... ...................... T Consensus 1 l~~~k~----l~~~i~e~~~~~~~~k--~~------~~~~~~~~e~~~~~--------l~~~~e~~~~~~~~~e~~~~~~ 60 (407) T protein:vir:48 1 MADVKD----VEQVAQELQRKFDDFK--EK------NDKRIDAIEQEKGK--------LAGEVETLNGKLAELENLKSDL 60 (407) T ss_pred CchHHH----HHHHHHHHHHHHHHHH--HH------HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHH Confidence 000000 0000000000000000 00 00000000000000 0000000000000000000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) .... ....+.. .............+..... +. .........+.+++..++..+||++||.++ .+. T Consensus 61 ---~~~~-----~~~~~~~---~~~~~~~~~e~~~a~~~~l-~~--g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~-~~~ 125 (407) T protein:vir:48 61 ---EAEL-----AEVKRPA---GGTQNKVASEHKEAFIGFM-RK--GREDGLRELERKALQVGNDEDGGYAIPEEL-DRT 125 (407) T ss_pred ---HHHH-----HHhhccc---cccccchhhHHHHHHHHHH-hc--cchhhhhHHHHHhhhcccCCCCcccccHhH-HHH Confidence 0000 0000000 0000000000011111111 11 111222344567788888888888887665 667 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHhhcChh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLRKQSSI 456 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l~d~~~ 456 (632) |++.++..++++.+ +++++..+..+.+++..+.+.+.|++|++.++++ .++|+++++.+++++++++||+|+|.|+.+ T Consensus 126 I~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~ 204 (407) T protein:vir:48 126 ILTLLKDEVVMRQE-ATVITLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFF 204 (407) T ss_pred HHHHHHhhhhhhhh-ceeeecCCCceEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchH Confidence 88999999999886 6777877778999999999999999999999986 479999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccc-------------cccccchhHHHHHHHHHHHHhhc Q lcl|Aclame:pro 457 HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL-------------TYPAGGVDWASVVDMETKISTFN 523 (632) Q Consensus 457 ~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~-------------~~~~~~~~~~~i~~~~~~~~~~~ 523 (632) +++++|.+.|+++++++++.++++|+|+ ++|.||++....... ....+.+++++|.++++.+...| T Consensus 205 ~l~~~i~~~l~~~i~~~~~~a~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~ 283 (407) T protein:vir:48 205 NVEDWINSELALEFAEQEEIAFTSGDGS-KKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAH 283 (407) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCC-CccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhh Confidence 9999999999999999999999999998 579999976654322 23345678999999999999887 Q ss_pred cccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCC-----ccEEEEehhh-EEEEEec Q lcl|Aclame:pro 524 ADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA-----DTWIFGDWSQ-IVIAMWG 590 (632) Q Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~-----~~~~~gd~s~-~~~~~~~ 590 (632) +. ++.|+||+..+. .+.+++|.+|+|+|++ ++|+|+||++++++|. ..++||||+. |.++++. T Consensus 284 ~~--~a~~v~n~~~~~--~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~ 359 (407) T protein:vir:48 284 RS--GAKFMMNNSSLF--AIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRI 359 (407) T ss_pred hc--CCEEEEcHHHHH--HHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEee Confidence 64 578999888765 4567899999999975 3799999999999985 3488999986 7889999 Q ss_pred ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 591 VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 591 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++++..+++ +.+|++.|+++.|+|+++++|+||++++++| T Consensus 360 ~~~i~~d~~--~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~a 399 (407) T protein:vir:48 360 GTRILRDPY--TNKPFVGFYTTKRTGGMLVDSQAIKLMKIGA 399 (407) T ss_pred ceEEEeecc--ccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 999887654 5789999999999999999999999999999 No 8 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=5.7e-50 Score=290.57 Aligned_cols=410 Identities=14% Similarity=0.136 Sum_probs=241.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Q lcl|Aclame:pro 181 RGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQ 260 (632) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 260 (632) +. .+................+........ ....+...................+..+.........+. T Consensus 1 ~~-----~~~~~~~~el~~~~~~l~el~~~~~el-------~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~ 68 (425) T protein:vir:95 1 MA-----LRQLMLTKKIEQRKAALDELVKREQEL-------QAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNE 68 (425) T ss_pred Cc-----hHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000000000000000000000000 000000000000000000000000000000000000000 Q ss_pred HHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK 340 (632) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (632) ....+...... ....... ..... . .........+..... .............. T Consensus 69 ~~~~le~~~~~-----------~~~~l~~----~~~~~------~----~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 121 (425) T protein:vir:95 69 KKSKLEGEIAQ-----------LEDELEQ----INSKQ------P----SNQSRQKMQGSKGDV--VEMNRLQVREMLKT 121 (425) T ss_pred HHHHHHHHHHH-----------HHHHHHH----hhhhc------c----chhhhhhhhhhhhhH--HHHHHHHHHHHHhh Confidence 00000000000 0000000 00000 0 000000000000000 00000000000000 Q ss_pred hhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccC Q lcl|Aclame:pro 341 EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGED 420 (632) Q Consensus 341 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 420 (632) ....................+...++.++|. .+.+.|++.+++.++++.+ +++++.. +.+.+|+.++.+.+.|++|+ T Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~-~~~~~Ii~~l~~~~~i~~~-~~~~~~~-g~~~ip~~~~~~~a~~v~E~ 198 (425) T protein:vir:95 122 GEYYKRSEVVEFYEKFRNLRAVAGGELTIPE-VVVNRIMDIMGDYTTLYPL-VDKIRVK-GTTRILVDTDTSPATWIEQS 198 (425) T ss_pred hhhhhhhHHHHHHHHHHhhcccccCceeccH-HHHHHHHHHHHhhhhHHHh-hceeecC-ceeEEEEecCCccccccccc Confidence 0000000000111111122233445555555 4667789999999999988 5556653 46789999999999999999 Q ss_pred cccccCc-ccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc-cccccceeccccc Q lcl|Aclame:pro 421 EDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA-NDPVGLLNMTGVP 498 (632) Q Consensus 421 ~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~-~~~~Gil~~a~~~ 498 (632) +++++++ ++|+++++.+++++++++||+|+|.|+..+++++|.+.++.++++++|.++++|+|++ ++|.||++..... T Consensus 199 ~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~ 278 (425) T protein:vir:95 199 GALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPE 278 (425) T ss_pred cccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc Confidence 9999887 6899999999999999999999999999999999999999999999999999999985 5799998754432 Q ss_pred -cccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHH--HHHHHhhcccCCceeecc-----ccccCcceEEcCC Q lcl|Aclame:pro 499 -ALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRG--AAKKAQVFDNTGERIWQN-----NEVNGYRAEASNQ 570 (632) Q Consensus 499 -~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~d~~g~~~~~~-----~~l~G~pv~~~~~ 570 (632) +........+++++.+++..+...+.....+.|+|++.+.. ...+..++|.+|+|+|+. ++|+|+||++++. T Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~ 358 (425) T protein:vir:95 279 NQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNF 358 (425) T ss_pred cccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCc Confidence 33344567789999999999888888777888999988743 234567899999999963 4799999999999 Q ss_pred CCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 571 IPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 571 ~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +|.+.++||||+.|.++.++++++.++++.+|.+|++.||++.|+|+++++|+||+++++.. T Consensus 359 ~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~ 420 (425) T protein:vir:95 359 LDDDTVLFGEFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITD 420 (425) T ss_pred CCCccEEEEecccEEEEeecceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.1e-50 Score=294.47 Aligned_cols=398 Identities=21% Similarity=0.281 Sum_probs=243.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEIT---AIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQ 273 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~---~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 273 (632) +....... ..+...++..... .....+...+ ....+....++........ T Consensus 1 M~i~eL~e-------------------~r~~~~~~~~~l~~~~~e~~~lt~ee--------~~~~~~l~~ei~~l~~~I~ 53 (435) T protein:vir:14 1 MNVNELRR-------------------ERAAVNQRVQALAQIEVGGTALSVEQ--------QAEFDQLSSKFSELTAQIE 53 (435) T ss_pred CCHHHHHH-------------------HHHHHHHHHHHHHHHHhccCCCCHHH--------HHHHHHHHHHHHHHHHHHH Confidence 00000000 0000000000000 0000000000 0001111111100000000 Q ss_pred hhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhh---hhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 274 PGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQ---YSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHE 350 (632) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (632) .. +....... ............... ............................................ T Consensus 54 ~~--e~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 125 (435) T protein:vir:14 54 RA--EAAERMAA------AAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFG 125 (435) T ss_pred HH--HHHHHHHH------hhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhh Confidence 00 00000000 000000000000000 00000000000000000000000000000000000000000111 Q ss_pred HHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccc Q lcl|Aclame:pro 351 VLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDF 430 (632) Q Consensus 351 ~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 430 (632) ......+...+..+||.++|.++ ...|++.+++.+++++++++.+++.++.+++|+.++.+.+.|++|++.+++++++| T Consensus 126 ~~~~~~~~~~t~~~gg~~vP~~~-~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f 204 (435) T protein:vir:14 126 EEVAMSLNTLSPGAGGVLVPENL-SSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQF 204 (435) T ss_pred hhhhhhcccCCcCCCccccchhH-HHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccce Confidence 22234455666677777777654 56789999999999998888888888889999999999999999999999999999 Q ss_pred eeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccc-- Q lcl|Aclame:pro 431 TTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGG-- 506 (632) Q Consensus 431 ~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~-- 506 (632) +++++.+++++++++||+|+|.|+. ++++++|...+++++++++|.+|++|+|++++|.||++.+...++...... T Consensus 205 ~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~ 284 (435) T protein:vir:14 205 DDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDAST 284 (435) T ss_pred eEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccc Confidence 9999999999999999999999984 569999999999999999999999999999999999987665544333222 Q ss_pred --hhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec---cccccCcceEEcCCCCCc------- Q lcl|Aclame:pro 507 --VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ---NNEVNGYRAEASNQIPAD------- 574 (632) Q Consensus 507 --~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~---~~~l~G~pv~~~~~~~~~------- 574 (632) ....++.+++..+...+....+..|+|++.++..+ .+++|.+|+|+|. +++|+|+||++++.+|.+ T Consensus 285 ~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L--~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~ 362 (435) T protein:vir:14 285 LQKIETDLGKVILALENADANLTQPGWIMAPRTFRFL--EGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKE 362 (435) T ss_pred hhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHH--HHhhccCCceeccCCCCCeeecceeEeeccccccccCCCcc Confidence 23456677777777665555567899988877554 5789999999995 368999999999999863 Q ss_pred -cEEEEehhhEEEEEecceEEEEeccc-----------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 -TWIFGDWSQIVIAMWGVLDLKVDPYT-----------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 -~~~~gd~s~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.|.++.++++++.++++. .|.+|++.||++.|+||++++|+||++|+-++ T Consensus 363 ~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:14 363 SEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVA 432 (435) T ss_pred ceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCC Confidence 58999999999999999999999874 38899999999999999999999999999999 No 10 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=7.2e-50 Score=290.01 Aligned_cols=396 Identities=17% Similarity=0.200 Sum_probs=240.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 186 PDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALV 265 (632) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 265 (632) ...+. .....+...... .... .....+..+.....+.......... ..++.++ .....+..+... T Consensus 1 ~~~~~------~~~~~~~~~~~~-~~~~----~~~l~e~ra~~~~e~~~l~~~~~~~-~~~~k~~---~~~~~~~~~~~~ 65 (425) T protein:vir:10 1 MSKKL------LIAVLTAALTGP-VGAV----PRGIISVRAEGPTEVKALIENLQKA-FHDFKAE---HTKQLDAVKAGL 65 (425) T ss_pred CchhH------HHHhhHHHhhhh-hhhh----hHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH---HHHHHHHHHhhh Confidence 00000 000110000000 0000 0000000000000000000000000 0000000 000000000000 Q ss_pred hhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 266 LERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGF 345 (632) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (632) ... .. ...................+. ...... . ....... ......+..+.+..... T Consensus 66 ~~~-----e~-~~~~~~~~~ei~~~~~~~~~~---~~~~~~----~----~~~~~~~----~~~~~~~~~~af~~~l~-- 122 (425) T protein:vir:10 66 PTS-----DA-LAKVDKVSADLEALQAAVDEA---NIKIAA----A----QMGANGV----KPLRDPEYTEAFKAHVK-- 122 (425) T ss_pred ccH-----HH-HHHHHHHHHHHHHHHHHHHHH---HHHHHh----h----hcccccc----cccccHHHHHHHHHHhh-- Confidence 000 00 000000000000000000000 000000 0 0000000 00000111111111110 Q ss_pred hhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccccc Q lcl|Aclame:pro 346 YMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQD 425 (632) Q Consensus 346 ~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~ 425 (632) .....+++..++..+||+++|+++ ...|++.++..++++++ +++++..+...++|+.++.+.+.|++|++.+|+ T Consensus 123 ----~~e~~~al~~~t~~~gG~lvP~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~ 196 (425) T protein:vir:10 123 ----RGDVQAALNKGEDSEGGYLTPIEW-DRTITNKLVLISPMRQL-CRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQ 196 (425) T ss_pred ----hhhhHHHhhcCcCCCCceeccHhH-HHHHHHHHHhhhhhhhh-ceeeeccCCceEEEEEcCCcceeeecccccccc Confidence 112345566677777888887665 56789999999999997 566777777888999999999999999999998 Q ss_pred Cc-ccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccc---- Q lcl|Aclame:pro 426 SD-FDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL---- 500 (632) Q Consensus 426 ~~-~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~---- 500 (632) +. ++|+++++.+++++++++||+|+|.|+.++++++|.+.|++++++++|.++++|+|+ ++|.||++....... T Consensus 197 ~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~-~~p~Gil~~~~~~~~~~~~ 275 (425) T protein:vir:10 197 TNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGT-NKPNGLLTYIAGGANAAKH 275 (425) T ss_pred ccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCC-CCcceeeeccccccccccc Confidence 75 799999999999999999999999999999999999999999999999999999996 489999986654332 Q ss_pred ---------cccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcc Q lcl|Aclame:pro 501 ---------TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYR 564 (632) Q Consensus 501 ---------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~p 564 (632) +...+.+++++|.+++..+...|+ .++.|+||+..+.. +.+++|.+|+|+|++ ++|+|+| T Consensus 276 ~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~--L~~lkD~~G~~l~~~~~~~g~~~~l~G~P 351 (425) T protein:vir:10 276 PFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQ--VRKLKDGQGNYLWQPSYVAGQPATLAGYP 351 (425) T ss_pred cccccccccccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHH--HHHhhcCCCceeeccCccCCCCceeccee Confidence 123456789999999999988876 46789999988654 457899999999965 3799999 Q ss_pred eEEcCCCCC-----ccEEEEehhh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 565 AEASNQIPA-----DTWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 565 v~~~~~~~~-----~~~~~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |++++++|. ..++||||+. |.++++.++++..++ ++.+|++.|+++.|+|+++++|+||++++++| T Consensus 352 V~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~--~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~a 423 (425) T protein:vir:10 352 VTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDP--YTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAA 423 (425) T ss_pred eEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecc--cccCCcEEEEEEEEeccEeecccceEEEEeec Confidence 999999984 4589999998 678889998877655 46789999999999999999999999999999 No 11 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=9.7e-50 Score=289.28 Aligned_cols=372 Identities=16% Similarity=0.171 Sum_probs=240.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) +...................... +..++ ..+..+. .......................... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k-----~~~~~------~~~~~e~--------~~~~l~~~~~~l~~~~~~~~~~~~~~ 61 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFK-----AKNDK------RVEAIEQ--------EKGKLAGQVETLNGKLSELENLKSDL 61 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHH-----HHHHH------HHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000000000000 00000 0000000 00000000000000000000000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhh-hhhhhhhhHHhhhhhcccccccccceechhhhhH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEA-RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSE 376 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~ 376 (632) ....... . +........ ...+..+.+.... ..........+.+++..++..+||++||.++ .+ T Consensus 62 --~~~~~~~---~---~~~~~~~~~-------~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~-~~ 125 (401) T protein:vir:44 62 --EKELLEL---K---RPARGAQNK-------VAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEEL-DR 125 (401) T ss_pred --HHHHHHh---h---ccccccccc-------hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhH-HH Confidence 0000000 0 000000000 0000111111111 0111222344567777777778888887665 56 Q ss_pred HHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHhhcCh Q lcl|Aclame:pro 377 EFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLRKQSS 455 (632) Q Consensus 377 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l~d~~ 455 (632) .|++.++..++++++ +++++..+..+.+++..+.+.+.|++|+++++.. .++|+++++.+++++++++||+|+|.|+. T Consensus 126 ~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~ 204 (401) T protein:vir:44 126 SILSLLKDEVVMRQE-ATVITVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAF 204 (401) T ss_pred HHHHHHHhhhhhhhh-ceeeecCCCceEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcch Confidence 788999999998887 6667777778889999888999999999999874 58999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccc-------------cccccchhHHHHHHHHHHHHhh Q lcl|Aclame:pro 456 IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL-------------TYPAGGVDWASVVDMETKISTF 522 (632) Q Consensus 456 ~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~-------------~~~~~~~~~~~i~~~~~~~~~~ 522 (632) ++++++|.+.|+++++++++.++++|+|+ ++|.||++....... +...+.++++++.++++.+... T Consensus 205 ~~l~~~i~~~la~ai~~~~~~~~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~ 283 (401) T protein:vir:44 205 FNVEAWINSELATEFAEQEEIAFTTGDGT-KKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKA 283 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCC-CccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchh Confidence 99999999999999999999999999998 689999976654321 2244567899999999999887 Q ss_pred ccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCC-----ccEEEEehhh-EEEEEe Q lcl|Aclame:pro 523 NADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA-----DTWIFGDWSQ-IVIAMW 589 (632) Q Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~-----~~~~~gd~s~-~~~~~~ 589 (632) |+. ++.|+|++..+.. +.+++|.+|+|+|++ ++|+|+||++++.+|. ..++||||+. |.++++ T Consensus 284 ~~~--~a~~v~n~~~~~~--L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~ 359 (401) T protein:vir:44 284 HRT--GAKFMMNNNSLFA--IRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDR 359 (401) T ss_pred hhc--CCEEEEcHHHHHH--HHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEe Confidence 764 5789999987654 457899999999965 3699999999999884 2378999986 788999 Q ss_pred cceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 590 GVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 590 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++++.++++ +.+|++.|+++.|+|+++++|+||++++++| T Consensus 360 ~~~~~~~~~~--~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~a 400 (401) T protein:vir:44 360 IGTRILRDPY--TNKPFVGFYTTKRTGGMLVDSQAIKLLKIAA 400 (401) T ss_pred cceEEeeecc--ccCCcEEEEEEEEeccEEecccceEEEEeec Confidence 9999877654 6789999999999999999999999999999 No 12 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=4.9e-50 Score=290.91 Aligned_cols=391 Identities=21% Similarity=0.324 Sum_probs=235.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIG---QQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQ 273 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~---~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 273 (632) +..... ..+..+....++....... ......+. .+........+....++....... T Consensus 1 M~kl~~------------------L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~-~~~~~l~~e~~~l~~~i~~~e~~e- 60 (428) T protein:vir:10 1 MPQIEE------------------LRRQRAGINEQIQALATIEATNGTLTAEQL-TEFAGLQQQFTDISAKMDRMEATE- 60 (428) T ss_pred CchHHH------------------HHHHHHHHHHHHHHHHHHHhccCCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHH- Confidence 000000 0000000000000000000 00000000 000000000111100000000000 Q ss_pred hhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHH--hhhhh--hhhhh Q lcl|Aclame:pro 274 PGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASG--KEARG--FYMPH 349 (632) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~ 349 (632) ...... ..... ... . ...... .............. .........+ ..... ..... T Consensus 61 ~~~~~~----~~~~~-------~~~--~---~~~~~~----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 119 (428) T protein:vir:10 61 RAAALV----AKPVK-------ATQ--H---GPAVIV----KAEPKQYTGAGMTR-MVMSIAAAQGNLQDAAKFASDELN 119 (428) T ss_pred HHHHHH----hhhhh-------chh--h---cccccc----ccccchhhhHHHHH-HHHHHHHhhhhHHHHHHHhhhhhh Confidence 000000 00000 000 0 000000 00000000000000 0000000000 00000 00001 Q ss_pred hHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCccc Q lcl|Aclame:pro 350 EVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFD 429 (632) Q Consensus 350 ~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 429 (632) .....+.. ..+..+||.+||.++ .+.|++.+++.+++++++++++++.++.+.+|+.++.+.+.|++|++.+++++++ T Consensus 120 ~~~~~~~~-~~~~~~gg~liP~~~-~~~ii~~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~ 197 (428) T protein:vir:10 120 DQSVSMAI-STAAGSGGVLIPQNI-HSEVIELLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEAR 197 (428) T ss_pred hhhHhhhh-cccccCCccccchhH-HHHHHHHHhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccc Confidence 11122222 233346677777665 4678899999999999988888988888999999999999999999999999999 Q ss_pred ceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc---cccc Q lcl|Aclame:pro 430 FTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY---PAGG 506 (632) Q Consensus 430 ~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~---~~~~ 506 (632) |+++++.+++++++++||+|+|.|+.++++++|.+.|++++++++|.++++|+|++++|.||++.+...+... .... T Consensus 198 f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 277 (428) T protein:vir:10 198 FDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAA 277 (428) T ss_pred eeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999998776544322 2233 Q ss_pred hhHHHHHHHHHHHHh----hccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccccCcceEEcCCCCCc----- Q lcl|Aclame:pro 507 VDWASVVDMETKIST----FNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEVNGYRAEASNQIPAD----- 574 (632) Q Consensus 507 ~~~~~i~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l~G~pv~~~~~~~~~----- 574 (632) .+.+.+......+.. ......++.|+|++..+.. +.+++|.+|+|+|.+ ++|+|+||++++++|.+ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~--L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~ 355 (428) T protein:vir:10 278 VNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMK--LFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGG 355 (428) T ss_pred ccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHH--HHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCC Confidence 444444443333322 1222345788998887754 467899999999954 58999999999999864 Q ss_pred ---cEEEEehhhEEEEEecceEEEEeccc-----------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ---TWIFGDWSQIVIAMWGVLDLKVDPYT-----------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ---~~~~gd~s~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.|.++.++++++..+++. .|.+|++.||++.|+||++.+|+||++++-.. T Consensus 356 ~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~ 427 (428) T protein:vir:10 356 KESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVL 427 (428) T ss_pred ccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccC Confidence 48999999999999999999998873 58899999999999999999999999999999 No 13 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=6.9e-50 Score=290.12 Aligned_cols=397 Identities=22% Similarity=0.313 Sum_probs=243.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITA---IGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQ 273 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~---~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 273 (632) +...... +..+...+++..... ....+...+ ..+........+....++...... + T Consensus 1 M~l~eL~-------------------~~r~~~~~~~~~l~~~~~e~~~l~~ee-~~~~~~l~~ei~~l~~~i~~~e~~-e 59 (435) T protein:vir:80 1 MNVNELR-------------------RERAAVNQRVQALAQIEVGGTALSVEQ-QAEFDQLSSKFNELTAQIERAEAA-E 59 (435) T ss_pred CCHHHHH-------------------HHHHHHHHHHHHHHHHHhccCCCCHHH-HHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 0000000 000000000000000 000000000 000000000011111111000000 0 Q ss_pred hhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHH--HHhh--hhhhhhhh Q lcl|Aclame:pro 274 PGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADA--SGKE--ARGFYMPH 349 (632) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~~~~ 349 (632) . .................... ........... .......... ....... .... ........ T Consensus 60 ~----~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 124 (435) T protein:vir:80 60 R----MAAAAAVPVDPNPAAVTASA-------AAPVYAQPKAP---EVKGAKMARM-VRALAAARGDAQLASKLAIERGF 124 (435) T ss_pred H----HHHhhcccccchhhhhcccc-------ccccccccchh---hhhHHHHHHH-HHHHHhccchhHHHHHHHHhhhh Confidence 0 00000000000000000000 00000000000 0000000000 0000000 0000 00000011 Q ss_pred hHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCccc Q lcl|Aclame:pro 350 EVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFD 429 (632) Q Consensus 350 ~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 429 (632) ......++...+...||.++|.+ +.+.|++.+++.+++++++++.+++.+..+.+|+.++.+.+.|++|++.+++++++ T Consensus 125 ~~~~~~~~~~~~~~~gg~lvP~~-~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~ 203 (435) T protein:vir:80 125 GEEVAMSLNTLSPGAGGVLVPEN-LSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQ 203 (435) T ss_pred hhhhhhhhcccCCCCCccccchh-HHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccc Confidence 11122334555666677777766 45789999999999999888889988888999999999999999999999999999 Q ss_pred ceeeeeeeeeeeeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccch Q lcl|Aclame:pro 430 FTTLSFSPKTIAGAVPVTRKLRKQSSI--HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGV 507 (632) Q Consensus 430 ~~~~~~~~~t~~~~~~iSre~l~d~~~--~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~ 507 (632) |+++++.+++++++++||+|+|.|+.. +++++|.+.++++++++++.++++|+|++++|.||++.+...+........ T Consensus 204 f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 283 (435) T protein:vir:80 204 FDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGS 283 (435) T ss_pred eeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeeccccc Confidence 999999999999999999999999854 799999999999999999999999999999999999988766654443333 Q ss_pred h----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec---cccccCcceEEcCCCCCc------ Q lcl|Aclame:pro 508 D----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ---NNEVNGYRAEASNQIPAD------ 574 (632) Q Consensus 508 ~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~---~~~l~G~pv~~~~~~~~~------ 574 (632) + ..++.+++..+...+....++.|+|++.++.. +.+++|.+|+|+|. +++|+|+||++++.+|.+ T Consensus 284 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~--L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~ 361 (435) T protein:vir:80 284 TLQKIETDLGKAILALENADANLTQPGWIMAPRTFRF--LEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGEAGK 361 (435) T ss_pred chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHH--HHhhhccCCceeccCCCCCeEeeeeeEEeccccccccCCCC Confidence 3 34566666666666555567889998888754 46789999999985 578999999999999863 Q ss_pred --cEEEEehhhEEEEEecceEEEEeccc-----------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 --TWIFGDWSQIVIAMWGVLDLKVDPYT-----------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 --~~~~gd~s~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.|.+++++++++.++++. .|.+|++.||++.|+||++++|+||++|+-.+ T Consensus 362 ~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~ 432 (435) T protein:vir:80 362 ESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVA 432 (435) T ss_pred cceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccC Confidence 58999999999999999999998875 38899999999999999999999999999988 No 14 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=6.7e-49 Score=284.69 Aligned_cols=415 Identities=14% Similarity=0.070 Sum_probs=226.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 181 RGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEI----TAIGQQFSQRSLAQEAIQKGH 256 (632) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~----~~~~~~~~~~~~~~~a~~~~~ 256 (632) ++..........+....... .. .......+......... ..............+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~-------------~~---~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 64 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKD-------------IN---ADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLG 64 (497) T ss_pred CCcchHHHHHHHHHHHHHHH-------------HH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000000000 00 00000000000000000 000000000000000000000 Q ss_pred HHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHH Q lcl|Aclame:pro 257 TVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIAD 336 (632) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (632) ........+........... ....... ............. ........................... T Consensus 65 ~~~a~~~~~~~~~~~~e~~~---~~~~~~~-------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 131 (497) T protein:vir:78 65 GADAAKDGLDNDIPEVEVRN---LKQIRKH-------LARAVIMNPELKN-ATSFEKGTKFDVSFNVSAKAADPGTAA-- 131 (497) T ss_pred HHHHHHHHHHHHHHHHHhhh---hhhHHHH-------HHHHHhhhHHHHh-hhhhhhhhhhhhhhhhhhhhhhhHHHH-- Confidence 00000000000000000000 0000000 0000000000000 000000000000000000000000000 Q ss_pred HHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC-Ccccc Q lcl|Aclame:pro 337 ASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFY 415 (632) Q Consensus 337 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~ 415 (632) .................+....++++++|+++|+++. ..|++.+++.++++.+ +++++.+...+.+|+.++ .+.+. T Consensus 132 -~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~-~~ii~~~~~~~~i~~l-~~~~~~~~~~~~~~~~~~~~~~a~ 208 (497) T protein:vir:78 132 -AELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAA 208 (497) T ss_pred -HHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhh-HHHHHHHHhhhhHHhh-ccccccCCCceEEEEEcCCCCcce Confidence 0000000111111223344556667778888888765 5678888888888887 456666667789998765 46899 Q ss_pred ccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecc Q lcl|Aclame:pro 416 WIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMT 495 (632) Q Consensus 416 ~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a 495 (632) ||+|++.+|+++++|+++++.+++++++++||+|+|.|+ .+++++|.+.|++++++++|.+|++|+|++ +|.||++.+ T Consensus 209 wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~~ 286 (497) T protein:vir:78 209 AVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRS 286 (497) T ss_pred eeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-ccccccccc Confidence 999999999999999999999999999999999999876 579999999999999999999999999985 699999876 Q ss_pred ccccccccccc-------------------------------------------------------hhHHHHHHHHHHHH Q lcl|Aclame:pro 496 GVPALTYPAGG-------------------------------------------------------VDWASVVDMETKIS 520 (632) Q Consensus 496 ~~~~~~~~~~~-------------------------------------------------------~~~~~i~~~~~~~~ 520 (632) +.......... .....+..+...+. T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (497) T protein:vir:78 287 TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ 366 (497) T ss_pred ccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhh Confidence 64433211100 00111222222222 Q ss_pred hhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------------ccccCcceEEcCCCCCccEEEEehhh--EE Q lcl|Aclame:pro 521 TFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------------NEVNGYRAEASNQIPADTWIFGDWSQ--IV 585 (632) Q Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------------~~l~G~pv~~~~~~~~~~~~~gd~s~--~~ 585 (632) ..+. .....|+||+.++. .+.+++|.+|+|+|.+ .+|+|+||++++.+|.++++||||+. |. T Consensus 367 ~~~~-~~~~~~vmn~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~ 443 (497) T protein:vir:78 367 LTLF-QTPNAVVMNPRDWE--LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQ 443 (497) T ss_pred hhcc-cCCCeEEEchHHHH--HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEE Confidence 2221 12235888877655 4568899999999964 27999999999999999999999997 55 Q ss_pred EEEecceEEEEecc--cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 586 IAMWGVLDLKVDPY--TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 586 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++++.++++.++++ .+|.+|++.||++.|+|+.|++|+||++++++| T Consensus 444 i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:78 444 TARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred EEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 78899999999987 459999999999999999999999999999999 No 15 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=6.7e-49 Score=284.69 Aligned_cols=415 Identities=14% Similarity=0.070 Sum_probs=226.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 181 RGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEI----TAIGQQFSQRSLAQEAIQKGH 256 (632) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~----~~~~~~~~~~~~~~~a~~~~~ 256 (632) ++..........+....... .. .......+......... ..............+...... T Consensus 1 ~~~~~~l~~~~~~~~~~~~~-------------~~---~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 64 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKD-------------IN---ADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLG 64 (497) T ss_pred CCcchHHHHHHHHHHHHHHH-------------HH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000000000 00 00000000000000000 000000000000000000000 Q ss_pred HHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHH Q lcl|Aclame:pro 257 TVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIAD 336 (632) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 336 (632) ........+........... ....... ............. ........................... T Consensus 65 ~~~a~~~~~~~~~~~~e~~~---~~~~~~~-------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 131 (497) T protein:vir:10 65 GADAAKDGLDNDIPEVEVRN---LKQIRKH-------LARAVIMNPELKN-ATSFEKGTKFDVSFNVSAKAADPGTAA-- 131 (497) T ss_pred HHHHHHHHHHHHHHHHHhhh---hhhHHHH-------HHHHHhhhHHHHh-hhhhhhhhhhhhhhhhhhhhhhhHHHH-- Confidence 00000000000000000000 0000000 0000000000000 000000000000000000000000000 Q ss_pred HHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC-Ccccc Q lcl|Aclame:pro 337 ASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFY 415 (632) Q Consensus 337 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~ 415 (632) .................+....++++++|+++|+++. ..|++.+++.++++.+ +++++.+...+.+|+.++ .+.+. T Consensus 132 -~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~-~~ii~~~~~~~~i~~l-~~~~~~~~~~~~~~~~~~~~~~a~ 208 (497) T protein:vir:10 132 -AELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFL-PGIVEQLFYELSLADL-ISSRPVTSPNLSYLTESAAHNNAA 208 (497) T ss_pred -HHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhh-HHHHHHHHhhhhHHhh-ccccccCCCceEEEEEcCCCCcce Confidence 0000000111111223344556667778888888765 5678888888888887 456666667789998765 46899 Q ss_pred ccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecc Q lcl|Aclame:pro 416 WIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMT 495 (632) Q Consensus 416 ~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a 495 (632) ||+|++.+|+++++|+++++.+++++++++||+|+|.|+ .+++++|.+.|++++++++|.+|++|+|++ +|.||++.+ T Consensus 209 wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~-~p~Gil~~~ 286 (497) T protein:vir:10 209 AVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYP-GVNGLLQRS 286 (497) T ss_pred eeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-ccccccccc Confidence 999999999999999999999999999999999999876 579999999999999999999999999985 699999876 Q ss_pred ccccccccccc-------------------------------------------------------hhHHHHHHHHHHHH Q lcl|Aclame:pro 496 GVPALTYPAGG-------------------------------------------------------VDWASVVDMETKIS 520 (632) Q Consensus 496 ~~~~~~~~~~~-------------------------------------------------------~~~~~i~~~~~~~~ 520 (632) +.......... .....+..+...+. T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (497) T protein:vir:10 287 TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ 366 (497) T ss_pred ccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhh Confidence 64433211100 00111222222222 Q ss_pred hhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------------ccccCcceEEcCCCCCccEEEEehhh--EE Q lcl|Aclame:pro 521 TFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------------NEVNGYRAEASNQIPADTWIFGDWSQ--IV 585 (632) Q Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------------~~l~G~pv~~~~~~~~~~~~~gd~s~--~~ 585 (632) ..+. .....|+||+.++. .+.+++|.+|+|+|.+ .+|+|+||++++.+|.++++||||+. |. T Consensus 367 ~~~~-~~~~~~vmn~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~ 443 (497) T protein:vir:10 367 LTLF-QTPNAVVMNPRDWE--LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQ 443 (497) T ss_pred hhcc-cCCCeEEEchHHHH--HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEE Confidence 2221 12235888877655 4568899999999964 27999999999999999999999997 55 Q ss_pred EEEecceEEEEecc--cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 586 IAMWGVLDLKVDPY--TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 586 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++++.++++.++++ .+|.+|++.||++.|+|+.|++|+||++++++| T Consensus 444 i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~ 492 (497) T protein:vir:10 444 TARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKK 492 (497) T ss_pred EEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecC Confidence 78899999999987 459999999999999999999999999999999 No 16 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.6e-48 Score=280.67 Aligned_cols=467 Identities=15% Similarity=0.133 Sum_probs=245.9 Q ss_pred ccCcccccccccce-----eeeeeccchhhhhhhhhhhhhhhhh------hhhhhhhhhhhh-hhhh------------- Q lcl|Aclame:pro 155 EISLISVPADPTVG-----VGRSIDIGNITIRGAEMPDKDKQTQ------TAGSQQTETRGA-ETGA------------- 209 (632) Q Consensus 155 EvS~v~~pa~~~a~-----v~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~-~~~~------------- 209 (632) =-.+-+.|..|... +.++....+.........+...... ............ .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~l~~~~~~~~~e~~~~~~~~ 80 (543) T protein:vir:81 1 MNTLDTLPVHPRTGLRAIGMGKRGPIWPVMGASDDHKDDAPTLTYSQARNRADEVHARMEQIAELDKPTDEENEEFRALG 80 (543) T ss_pred CCccccCcCChhHHHHHHHhhccCccchhcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00112233333321 1111111000000000000000000 000000000000 0000 Q ss_pred -hhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhh------hhh---------hhhhhhhhhhhHHHHHHHHHHhhhhHh Q lcl|Aclame:pro 210 -KNPAPAASGANENDILSRERTRISEITAI--GQQ------FSQ---------RSLAQEAIQKGHTVDQFRALVLERMNP 271 (632) Q Consensus 210 -~~~~~~~~~~~~~~~~~~~~~r~~~~~~~--~~~------~~~---------~~~~~~a~~~~~~~~~~~~~~~~~~~~ 271 (632) ............. ......+....... ... ... .....+...........+......... T Consensus 81 ~e~~el~~~~~~l~--~~e~~~~~~e~~~~~~~~~~~~~~e~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e 158 (543) T protein:vir:81 81 AEFDSLVNHMSRLE--RAAELARVRSTHEQIGKPQSGGQRRMRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDPWNLSE 158 (543) T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHHHHHHH Confidence 0000000000000 00000000000000 000 000 000000000000000000000000000 Q ss_pred hhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhh------hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHH--hhhh Q lcl|Aclame:pro 272 GQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYS------LMRAINAAATGDWSKAGFEREVSLAIADASG--KEAR 343 (632) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 343 (632) ....... ................+............ ......... ..................... .... T Consensus 159 ~k~~~e~-~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~-~~~~~~~~~~~~~~a~~~~~~~~~~~~ 236 (543) T protein:vir:81 159 MRTFGRD-AEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTL-ARQCLATSSPAYLRAWSKMARNPHAAI 236 (543) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhhhhhhhhhhhhHHHHHHHhhHHHH Confidence 0000000 00000000000000000000000000000 000000000 000000000000000000000 0000 Q ss_pred hhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccc Q lcl|Aclame:pro 344 GFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDV 423 (632) Q Consensus 344 ~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 423 (632) ...............+.+..++|.+||.++....|...++..+++..+. ++.++ ++.+.+++.++.+.+.|++|++.+ T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~-~~~~~-~g~~~~~~~~~~~~a~~v~Eg~~~ 314 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFA-RQVVA-TGDVWHGVSSAAVQWSWDAEFEEV 314 (543) T ss_pred hhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhc-ccccC-CcceEEEEecCCcceeecccCccc Confidence 0111111222223344566778889998888888888888888888874 34443 456788999899999999999999 Q ss_pred ccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccc---c Q lcl|Aclame:pro 424 QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPA---L 500 (632) Q Consensus 424 ~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~---~ 500 (632) ++++++|+.+++.+++++++++||+++|.|+ +++.++|.+.|+++++++++.+|++|+|++++|.||++..+... . T Consensus 315 ~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~ 393 (543) T protein:vir:81 315 SDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIA 393 (543) T ss_pred cccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccc Confidence 9999999999999999999999999999876 79999999999999999999999999999999999987655322 2 Q ss_pred cccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCc Q lcl|Aclame:pro 501 TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPAD 574 (632) Q Consensus 501 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~ 574 (632) +.+...++++++.++...+...|.. ++.|+|++..+.. +.+++|++|+|+|.+ ++|+|+||++++++|.+ T Consensus 394 ~~~~~~~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~--l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~ 469 (543) T protein:vir:81 394 PVTAETFALADVYAVYEQLAARHRR--QGAWLANNLIYNK--IRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDAN 469 (543) T ss_pred ccccccccHHHHHHHHHhhhccccC--CcEEEEcHHHHHH--HHHhhcCCCceeccCcCCCCCccccceeeEEecccccc Confidence 3455678899999999999887753 5689999888654 457899999999964 47999999999998864 Q ss_pred c----------EEEEehhhEEEEEecceEEEEeccc----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 T----------WIFGDWSQIVIAMWGVLDLKVDPYT----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ~----------~~~gd~s~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . ++||||+.|.++.++++++.++++. .|.+|++.|+++.|+|+++.+|+||++++++| T Consensus 470 ~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~ 541 (543) T protein:vir:81 470 WNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVET 541 (543) T ss_pred ccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEecc Confidence 2 8999999999999999999998875 45678999999999999999999999999999 No 17 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=8.8e-49 Score=284.04 Aligned_cols=380 Identities=14% Similarity=0.084 Sum_probs=239.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) .....+. ..++.......... ..+++.......++.+........+... ........+... T Consensus 1 m~e~~~~---l~~~~~~~~~~~~~-----~~e~~~~~~~~~~e~~~~~~~~~~e~~~-l~~~i~~~~~~~---------- 61 (390) T protein:vir:10 1 MTDITSK---LEATLANVTDSLRA-----FGERAVRDGELNASARSKVDELFATVGN-LSAEVQAARQRV---------- 61 (390) T ss_pred ChHHHHH---HHHHHHHHHHHHHH-----HHHHHHhhcccCHHHHHHHHHHHHHHHH-HHHHHHHHHHHH---------- Confidence 0000000 00000000000000 0000000000000000100000000000 000000000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) ................ ... .. ...... ..................... .....+.+...+|.++++++. .. T Consensus 62 --~~~~~~~~~~~~~~~~--~~~-~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~~~~~~~-~~ 132 (390) T protein:vir:10 62 --AELEGNGAGGDVQHVS--VGD-LF-VASEQF-QASAGRWNDRSARATMNIKAA-LNTASTDAAGSAGALTTPNRL-PG 132 (390) T ss_pred --HHHHhhcccccccccc--hhh-hh-hhhHHH-HHHHHhhhhhhhhhhhHHHHH-HHhhhcccccccccccchhHH-HH Confidence 0000000000000000 000 00 000000 000000000000111111111 222334445556677777765 56 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEecCC-ccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSI 456 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~ 456 (632) +++.++..++++.+ +++++.....+++++.++. +.+.|++|++++++++++|+++++.+++++++++||+++|.|+ . T Consensus 133 ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~ 210 (390) T protein:vir:10 133 FITQPDARLTVRDL-IGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-P 210 (390) T ss_pred HHHHHHhhchhhhh-cceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhH-H Confidence 78888888888887 6677777777888987764 6799999999999999999999999999999999999999876 4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccc-ccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 457 HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP-AGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 457 ~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) ++.++|.+.|+++++++++.++++|+|+++.|.||++.++......+ .+...++.+.+++..+...+.. ...|+|++ T Consensus 211 ~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~~~~v~n~ 288 (390) T protein:vir:10 211 QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYP--ASGIVINP 288 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCC--CCEEEEcH Confidence 89999999999999999999999999999999999998876655433 3445678899999999877654 56789988 Q ss_pred hHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEecc-cccccCcE Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPY-TKAASDGL 607 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~-~~~~~~~~ 607 (632) +.+. .+.+++|.+|+|+|++ ++|+|+||++++.+|.++++||||+. |.++.+.++.+.++.+ .+|.+|++ T Consensus 289 ~~~~--~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 366 (390) T protein:vir:10 289 IDWA--AIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMV 366 (390) T ss_pred HHHH--HHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcE Confidence 8765 4568899999999975 47999999999999999999999997 6788999999998775 68999999 Q ss_pred EEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 608 VLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 608 ~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .||++.|+|+++++|+||+++++| T Consensus 367 ~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 367 TVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEEeeccEEeccccEEEEEeC Confidence 999999999999999999999999 No 18 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.6e-48 Score=282.62 Aligned_cols=403 Identities=11% Similarity=0.060 Sum_probs=240.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGN 276 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (632) ..+............. .....+......+.............. +...+........++............ ... T Consensus 1 ~~~~~~~~~~~~~~~~-----~~el~~~~~e~~~~l~~~~~e~~~~~e-~~~~e~~~~~~~~~e~~~~~~~l~~~~-~~l 73 (418) T protein:vir:10 1 MSHMNEPRQFGRKSGG-----DSHPEQVLETVTKELKRIGDEVKSAGE-KALAEAKRAGDLGVETKATVDELLIKQ-GEL 73 (418) T ss_pred CCCchhHHHHHHHhcc-----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhhhhHHHHHHHHHHHHHH-HHH Confidence 0000000000000000 000000000000000000000000000 000000000000000000000000000 000 Q ss_pred hhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhh Q lcl|Aclame:pro 277 FEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQ 356 (632) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 356 (632) .......+.. .............+.... .... ............................. T Consensus 74 ~~~~~~~e~~--------------~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (418) T protein:vir:10 74 QARLLEAEQK--------------LARGGGSAELETPKTLGQ-LVTE----SEEMKGMDGSARKSVRVRVDRKSIMNVPA 134 (418) T ss_pred HHHHHHHHHH--------------HhhcccccccchhhhhhH-Hhhh----HHHHHHHHHHHhhhhhhhhHHHHHHHhhh Confidence 0000000000 000000000000000000 0000 00000000000001101111111122223 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC-CccccccccCcccccCcccceeeee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSF 435 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~ 435 (632) ....+..+++.+||+++ ...|++.+++.+.++++ +++++.....+.+++.+. .+.+.|++|++++++++++|+++++ T Consensus 135 ~~~~~~~~~g~lvp~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~ 212 (418) T protein:vir:10 135 TVGSGVSGSNSLVVADR-QAGIIAPPQRKMTIRDL-LMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQ 212 (418) T ss_pred hccCCCCCCccccchhH-HHHHHHHHhhhhhHHhh-cceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEE Confidence 34445556667777665 56688999999999987 566676666788888765 5788999999999999999999999 Q ss_pred eeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccc-ccchhHHHHHH Q lcl|Aclame:pro 436 SPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP-AGGVDWASVVD 514 (632) Q Consensus 436 ~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~-~~~~~~~~i~~ 514 (632) .+++++++++||+++|.++ .++.++|.+.+++++++++|.++++|+|++.+|.||++.++......+ .+..+++++.+ T Consensus 213 ~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~ 291 (418) T protein:vir:10 213 PVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRL 291 (418) T ss_pred eeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHHHH Confidence 9999999999999999876 589999999999999999999999999999999999998876555443 33466888999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec------cccccCcceEEcCCCCCccEEEEehhh-EEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ------NNEVNGYRAEASNQIPADTWIFGDWSQ-IVIA 587 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~------~~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~ 587 (632) ++..+...+. ....|+|++.++. .+.+++|.+|+|+|. +++|+|+||++++.+|.++++||||+. |.++ T Consensus 292 ~~~~~~~~~~--~~~~~v~n~~~~~--~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~ 367 (418) T protein:vir:10 292 ALLQAVLAEF--PATGIVLNPIDWA--SIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIF 367 (418) T ss_pred HHHhhccccC--CCCEEEEcHHHHH--HHHHhhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEE Confidence 9888876654 3457999888865 456789999999995 357999999999999999999999997 7788 Q ss_pred EecceEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 588 MWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 588 ~~~~~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++++++.++++. +|.+|++.||++.|+||++++|+||+++++++ T Consensus 368 ~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~ 414 (418) T protein:vir:10 368 DRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVE 414 (418) T ss_pred EecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 9999999998876 48999999999999999999999999999999 No 19 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1e-48 Score=283.71 Aligned_cols=380 Identities=14% Similarity=0.083 Sum_probs=241.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) .....+... .......... ..+.+.........++.+........+.. .........+.... T Consensus 1 m~~~~~~l~---~~~~~~~~~~-----~~~~e~~~~~~~~~~e~~~~~~~~~~e~~-~l~~~i~~~e~~~~--------- 62 (390) T protein:vir:97 1 MTDITAKLE---ATLANVTDSL-----KAFGERAVRDGELNASARSKVDELFATVG-NLSAEVQAARQRVA--------- 62 (390) T ss_pred ChHHHHHHH---HHHHHHHHHH-----HHHHHHHHhhcCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH--------- Confidence 000000000 0000000000 00000000000000000000000000000 00000000000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) ................ .. ....... ............... ........+...+.+..++|.++|+++. .. T Consensus 63 ---~~~~~~~~~~~~~~~~--~~--~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~lip~~~~-~~ 132 (390) T protein:vir:97 63 ---ELEGNGAGGDVQHVSV--GD--MFVASEQ-FQASTGRWNDRSARA-TMNIKAALNTASTDAAGSAGALTTPNRL-PG 132 (390) T ss_pred ---HHHhcccccccccccc--hh--hhhhhHH-HHHHHHHhhhhhhhh-hhHHHHHHHhhhcccccccccccchhhh-HH Confidence 0000000000000000 00 0000000 000111111111111 1111222334455566677888888765 56 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEecC-CccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSI 456 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~ 456 (632) |++.++..++++.+ ++.++.....+.+++.++ .+.+.|++|++++++++++|+++++.+++++++++||+|++.|+ . T Consensus 133 ii~~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~ 210 (390) T protein:vir:97 133 FITPPDARLTVRDL-IGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-P 210 (390) T ss_pred HHHHHhhhhhhHhh-cceeeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-H Confidence 88888888888887 567777777788898866 46899999999999999999999999999999999999999876 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc-cccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 457 HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY-PAGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 457 ~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) ++.++|.+.+++++++++|.++++|+|+++.|.||++.++..+... ..+...++.+.+++..+...+.. ...|+|++ T Consensus 211 ~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~--~~~~v~n~ 288 (390) T protein:vir:97 211 QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYP--ASGIVINP 288 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCC--CCEEEEcH Confidence 8999999999999999999999999999999999999887665543 34456778899999999887764 46789988 Q ss_pred hHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEecc-cccccCcE Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPY-TKAASDGL 607 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~-~~~~~~~~ 607 (632) .++. .+.+++|.+|+|+|.+ ++|+|+||++++.+|.++++||||+. |.++.+.++.+..+.+ .+|.+|++ T Consensus 289 ~~~~--~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~ 366 (390) T protein:vir:97 289 IDWA--AIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMV 366 (390) T ss_pred HHHH--HHHHhhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcE Confidence 8755 4557899999999964 47999999999999999999999997 7788999999998765 67999999 Q ss_pred EEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 608 VLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 608 ~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .||+..|+|+++.+|+|||++++| T Consensus 367 ~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 367 TVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEEEEeeccEEeccccEEEEEeC Confidence 999999999999999999999999 No 20 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.7e-48 Score=282.53 Aligned_cols=380 Identities=15% Similarity=0.110 Sum_probs=240.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) .....+.. ............ ...+.........++.+........+. ..........+.... T Consensus 1 m~~l~~~l---~~~~~~~~~~~~-----~~~e~~~~~~~~~~e~~~~~~~l~~e~-~~l~~~i~~~e~~~~--------- 62 (390) T protein:vir:81 1 MTDITSKL---EATLANVTDSLR-----AFGERAVRDGELNASARSKVDELFATV-GNLSAEVQAARQRVA--------- 62 (390) T ss_pred ChHHHHHH---HHHHHHHHHHHH-----HHHHHHHhhcCcCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH--------- Confidence 00000000 000000000000 000000000000000011100000000 000000000000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) .................... ..... .................... .......+.+..++|.++++++. .. T Consensus 63 -----~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~~~~~-~~ 132 (390) T protein:vir:81 63 -----ELEGNGAGGDVQHVSVGDMF--VASEQ-FQASAGRWNDRSARATMNIK-AALNTASTDAAGSAGALTTPNRL-PG 132 (390) T ss_pred -----HHHhcccccccccccchhhh--hhhHH-HHHHHHHHhhhhhhhhhHHH-HHHHhhccccccCCcceechhhh-HH Confidence 00000000000000000000 00000 00000000000001111111 11222334455677778888765 56 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEecCC-ccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSI 456 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~ 456 (632) +++.++..++++.+ +++++.....+.+++.++. +.+.|++|++++++++++|+++++.+++++++++||+|+|.|+ . T Consensus 133 ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~ 210 (390) T protein:vir:81 133 FITPPDARLTVRDL-IGSGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-P 210 (390) T ss_pred HHHHHhhhhhhhhh-cceeeccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhH-H Confidence 78888888888887 5667777778888988664 5789999999999999999999999999999999999999986 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc-cccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 457 HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY-PAGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 457 ~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) ++.++|.+.|++++++++|.++++|+|+++.|.||++.++...... ......++.+.+++..+...+.. ...|+|++ T Consensus 211 ~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~~~ 288 (390) T protein:vir:81 211 QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYN--PSGIVINP 288 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCC--CCEEEEcH Confidence 8999999999999999999999999999999999998877655543 34456788899999999877653 45789988 Q ss_pred hHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEecc-cccccCcE Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPY-TKAASDGL 607 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~-~~~~~~~~ 607 (632) ..+. .+.+++|.+|+|+|.+ ++|+|+||++++.+|.++++||||+. |.++.+.++.+.++++ .+|.+|++ T Consensus 289 ~~~~--~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v 366 (390) T protein:vir:81 289 IDWA--AIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMI 366 (390) T ss_pred HHHH--HHHHhhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcE Confidence 8765 4558899999999974 47999999999999999999999998 6788899999998875 67999999 Q ss_pred EEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 608 VLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 608 ~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .||++.|+|+++.+|+|||++++| T Consensus 367 ~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 367 TVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEEEeeccEEecccceEEEEeC Confidence 999999999999999999999999 No 21 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=8.4e-48 Score=278.67 Aligned_cols=381 Identities=14% Similarity=0.093 Sum_probs=238.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) +....+......+............. .....+........++.+...... ........... ... T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~-e~~~~~~~~~~~~~~e~~~~~~~~--------~~~~~~~~~~~-------~~~ 64 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQA-EQVNTQIANFGEMNKETRAKVDEL--------LTAQGELQARL-------SAA 64 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhHHHHHHHHHH--------HHHHHHHHHHH-------HHH Confidence 00000000000000000000000000 000000000000000000000000 00000000000 000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) ...... ........... .............+.+...... .......+...+.+..++|.++|+++. .. T Consensus 65 --~~~~~~-----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~g~~vp~~~~-~~ 132 (395) T protein:vir:43 65 --EQAMLA-----NEKRDGGEEAP-KTAGQMVAESLKEQGVTSSLRG---SHRVSMPRSAITSIDGSGGALVAPDRR-PG 132 (395) T ss_pred --HHHHHh-----hhccccccchh-hhHHHHHHHHHHHHHHHHHhhh---hhhhhhhhhhhcccCCCCccccchhhH-HH Confidence 000000 00000000000 0000000001111111111111 111122234445566677888888764 66 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEecC-CccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSI 456 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~ 456 (632) |++.+++.+.++.+ +++.+.....+.+++.++ .+.+.|++|++++++++++|+++++.+++++++++||+++|.++ . T Consensus 133 ii~~~~~~~~l~~l-~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~ 210 (395) T protein:vir:43 133 VVAAPQRRLTIRDL-VAPGTTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA-S 210 (395) T ss_pred HHHHHHhhhhHHhh-ccceecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-H Confidence 88999999999887 555666666788888765 46899999999999999999999999999999999999999876 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc---cccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 457 HVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY---PAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 457 ~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~---~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) ++.++|.+.|++++++++|.++++|+|+++.|.||++.++...... ......++.+.+++..+...+.. ...|+| T Consensus 211 ~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~vm 288 (395) T protein:vir:43 211 ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFP--ASGIVL 288 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCC--CcEEEE Confidence 7999999999999999999999999999999999998877655443 23345678888888888877653 467999 Q ss_pred ehhHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEeccc--cccc Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPYT--KAAS 604 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~~--~~~~ 604 (632) ++.++.. +.+++|.+|+|+|.+ ++|+|+||++++.+|.++++||||+. |.++++.++.+..+++. +|.+ T Consensus 289 n~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~ 366 (395) T protein:vir:43 289 NPIDWAL--IELNKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFEN 366 (395) T ss_pred cHHHHHH--HHHhhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhc Confidence 9887654 457899999999853 57999999999999999999999998 66888999999888765 5899 Q ss_pred CcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |++.||++.|+||++++|+||++++++| T Consensus 367 ~~~~~r~~~r~d~~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 367 NMVTIRAEERLAFAVYRPEAFVTGSLTA 394 (395) T ss_pred CcEEEEEEEeeccEEecccceEEEEecc Confidence 9999999999999999999999999999 No 22 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=8.8e-48 Score=278.57 Aligned_cols=386 Identities=14% Similarity=0.140 Sum_probs=240.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) +. ..+..+..+....+...... ......+.++. ....+.....+....+.. ...+........ T Consensus 1 M~---l~eL~e~r~~l~~e~~~l~~---k~~~~~~t~e~---~~~~~~~~~e~~~l~~~i--------~~~e~~~~~~~~ 63 (409) T protein:vir:45 1 MK---LHELKQKRNTIATDMRALNE---KIGDNAWTEEQ---RTEWNKAKSELEALDERI--------AREEELRRQDQA 63 (409) T ss_pred CC---HHHHHHHHHHHHHHHHHHHH---HhhcCCCCHHH---HHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHH Confidence 00 00000011111111111000 00000000000 001111111111100000 000000000000 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh--hhhhhhhhhhHHhhhhhcccccccccceec Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK--EARGFYMPHEVLVQRQLEKKTAGKGGELVA 370 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~ 370 (632) ...... . ........... .. ............... ..............+++..++..+||++|| T Consensus 64 ~~~~~~--~--~~~~~~~~~~~-----~~----~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP 130 (409) T protein:vir:45 64 YIESNE--E--EQRQNLDPENN-----SQ----QDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVP 130 (409) T ss_pred HHhhhh--h--hhcccCCCCCc-----ch----hhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceecc Confidence 000000 0 00000000000 00 000000000000000 000001111112445666677777888888 Q ss_pred hhhhhHHHHHHHhhhhhhhhhcceeeccCce-eEEEEEecC-CccccccccCcccccCcccceeeeeeeeeee-eeehhh Q lcl|Aclame:pro 371 TELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSFSPKTIA-GAVPVT 447 (632) Q Consensus 371 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~-~~~~iS 447 (632) .++ .+.|++.+++.++++.+ +++++.... .+.+++... ...+.|++|+++++++.+.|..+++.+++++ ++++|| T Consensus 131 ~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is 208 (409) T protein:vir:45 131 ETF-LAKVVEKMKSYGGIASV-AQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVS 208 (409) T ss_pred HhH-HHHHHHHHHhhhhhhhh-ceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhh Confidence 775 46788999999999887 455555433 455555544 3467899999999999999999999998875 678999 Q ss_pred HHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccceeccccccccccccchhHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 448 RKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNAD 525 (632) Q Consensus 448 re~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 525 (632) +|+|.|+.++++++|.+.++++++.+++.+|++|+|++ .+|.||++...........+.+++++|.+++..+...|+. T Consensus 209 ~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~ 288 (409) T protein:vir:45 209 NELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRR 288 (409) T ss_pred HHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhcc Confidence 99999999999999999999999999999999998864 5799999887766666677788999999999999988875 Q ss_pred cccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCC-----ccEEEEehhhEEEEEecceE Q lcl|Aclame:pro 526 AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA-----DTWIFGDWSQIVIAMWGVLD 593 (632) Q Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~-----~~~~~gd~s~~~~~~~~~~~ 593 (632) . +.|+|..+...+..+.+++|.+|+|+|++ .+|+|+||++++++|. ..++||||+.|.+..++++. T Consensus 289 ~--a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~ 366 (409) T protein:vir:45 289 G--PKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMI 366 (409) T ss_pred C--CeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceE Confidence 4 44544444444556678999999999975 3799999999999985 45899999999999999999 Q ss_pred EEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 594 LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 594 ~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +..+.+.+|.+|++.||+..|+|+++++|+||++++++| T Consensus 367 ~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~ 405 (409) T protein:vir:45 367 LKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKG 405 (409) T ss_pred EEEeecccccCCcEEEEEEEEeccEeechhheEEEEecc Confidence 998888889999999999999999999999999999988 No 23 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=6.4e-47 Score=273.83 Aligned_cols=427 Identities=13% Similarity=0.110 Sum_probs=234.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Q lcl|Aclame:pro 181 RGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQ 260 (632) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 260 (632) +..+......+....+........... ....+..... .+.......+..... .+ ...+.... .....++ T Consensus 1 ~~~~~~~~~~e~~~~e~a~~~~~~~~~-~k~~e~~~~~---ke~~~~~l~~~~e~~---~k-~~~E~~~~---le~~~ee 69 (458) T protein:vir:10 1 MTIDINKLKEELGLGDLAKSLEGLTAA-QKAQEAERMR---KEQEEKELARMNDLV---SK-AVGEDRKR---LEEALEL 69 (458) T ss_pred CccchhhhhhhhchhhHHHHHHHHHHH-HHHHHHHHHH---HHHHHHHHHHHHHHH---HH-HHHHHHHH---HHHHHHH Confidence 111111111100000000000000000 0000000000 000000000000000 00 00000000 0000000 Q ss_pred HHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh Q lcl|Aclame:pro 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK 340 (632) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (632) .+.. .+........................... ..................... .. ...................+ T Consensus 70 ~k~l-~ee~~~~~~~~a~~~e~~~~~~~~~~~~~-~~~~~~~e~~~~~~~~~~~~~-~~-~~~~~~~~~e~~~~~~~~~~ 145 (458) T protein:vir:10 70 VKSL-DEKSKKSNELFAQTVEKQQETIVGLQDEI-KSLLTAREGRSFVGDSVAKAL-YG-TQENFEDEVEKLVLLSYVME 145 (458) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhhhhhhhhhccc-hh-hhhhHHHHHHHHHHHHHHHh Confidence 0000 00000000000000000000000000000 000000000000000000000 00 00000000000000000000 Q ss_pred hhhhhhhhhhHHhhhhhcc-cccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccccccc Q lcl|Aclame:pro 341 EARGFYMPHEVLVQRQLEK-KTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGE 419 (632) Q Consensus 341 ~~~~~~~~~~~~~~~a~~~-~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 419 (632) ..............+... .+...++.++|+ .+.+.|++.+++.++++.+ +++++..+....+++.+..+.+.|++| T Consensus 146 -~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~-~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~a~~v~e 222 (458) T protein:vir:10 146 -KGVFETEHGQRHLKAVNQSSSVEVSSESYET-IFSQRIIRDLQKELVVGAL-FEELPMSSKILTMLVEPDAGKATWVAA 222 (458) T ss_pred -hccchhhhhhhhhhhhhhcccCccccceehh-hHhHHHHHHHHhhhhHHhh-cceeecCCcceEEEEecCCcceeeccc Confidence 000011111111222222 233445556655 4567788999999998887 556777777788889988999999999 Q ss_pred CcccccC------cccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccee Q lcl|Aclame:pro 420 DEDVQDS------DFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLN 493 (632) Q Consensus 420 ~~~~~~~------~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~ 493 (632) ++..+++ .++|+++++.+++++++++||+++|.|+.+++.++|.+.|++++++++|.+|++|+|+ ++|.||++ T Consensus 223 ~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~-~~p~Gi~~ 301 (458) T protein:vir:10 223 STYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGS-GKPKGLLT 301 (458) T ss_pred ccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-Cccceeee Confidence 9988864 4689999999999999999999999999999999999999999999999999999997 58999999 Q ss_pred cccccccc-------ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--------- Q lcl|Aclame:pro 494 MTGVPALT-------YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--------- 557 (632) Q Consensus 494 ~a~~~~~~-------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--------- 557 (632) .+...+.. .....+++++|.++++.+...|. .++.|+||+..+. .+.+++|.+|+|+|.+ T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~ 377 (458) T protein:vir:10 302 LASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDSVKLQG 377 (458) T ss_pred cccccccceeecccccccccccHHHHHHHHHhhhhhhc--CCCEEEEcHHHHH--HHHhhcccCCceeeccccccccccC Confidence 87644322 22345689999999999988775 4578999988765 4568899999999853 Q ss_pred --ccccCcceEEcCCCCCc----cEEEEehhh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEe Q lcl|Aclame:pro 558 --NEVNGYRAEASNQIPAD----TWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) Q Consensus 558 --~~l~G~pv~~~~~~~~~----~~~~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~ 630 (632) ++|+|+||++++++|++ .++||||+. |.++++.++++..+++ +.++++.|+++.|+|+.+++|+||++.++ T Consensus 378 ~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~--~~~~~~~~~~~~r~~~~v~~~~a~v~~~~ 455 (458) T protein:vir:10 378 QVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQ--AGKQRDAYYVTQRVNLQRYFANGVVSGTY 455 (458) T ss_pred cCceecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeecc--cCCCceEEEEEEEecceEecccceEEEee Confidence 36999999999999874 589999976 7789999999887665 56899999999999999999999999999 Q ss_pred cC Q lcl|Aclame:pro 631 GA 632 (632) Q Consensus 631 ~A 632 (632) +| T Consensus 456 aa 457 (458) T protein:vir:10 456 AA 457 (458) T ss_pred cc Confidence 99 No 24 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.1e-47 Score=276.46 Aligned_cols=387 Identities=14% Similarity=0.081 Sum_probs=233.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) +.......... ...++........ ++........+............. .. .......... T Consensus 1 ~~ke~~~~~~~---~~~~~~~e~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~----~~----~~~~~~~~~~ 60 (413) T protein:vir:81 1 MVKEAGDAPTN---AQVAEIAEVKSMV---------EQFKADEDAKRERAKSVKANQDFL----RE----LQEATAGSVD 60 (413) T ss_pred ChhhHHHHHHH---HHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHH----HH----HHHHHHhHHh Confidence 00000000000 0000000000000 000000000000000000000000 00 0000000000 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechh Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATE 372 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~ 372 (632) ...................... ........... ...........................++...++.++|.+ T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~ 133 (413) T protein:vir:81 61 SEKSGELTRKGEGYKSIGEFFA------KRAGDQIKQQA-GGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTT 133 (413) T ss_pred HHHhhhHhhhhhhhhhhhhhhh------hhhhhHHHHHH-HHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchh Confidence 0000000000000000000000 00000000000 0000000000111111112222333444555666777665 Q ss_pred hhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC----CccccccccCcccccCc-ccceeeeeeeeeeeeeehhh Q lcl|Aclame:pro 373 LLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS----GANFYWIGEDEDVQDSD-FDFTTLSFSPKTIAGAVPVT 447 (632) Q Consensus 373 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~a~~v~E~~~~~~~~-~~~~~~~~~~~t~~~~~~iS 447 (632) +.+.|++.+++.++++.+ +.+.+.....+.+++... ...+.|++|++++++++ ++|+.+++.+++++++++|| T Consensus 134 -~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS 211 (413) T protein:vir:81 134 -WNRNIIYRRREKLVVADL-MDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKIT 211 (413) T ss_pred -hHHHHHHHHhhhhhHHhh-cceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhh Confidence 567788999999998887 456666666677776654 34679999999999987 68999999999999999999 Q ss_pred HHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 448 RKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAG 527 (632) Q Consensus 448 re~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 527 (632) +|+|.|++ .+.++|.+.+++++++++|.++++|+|++..|.||++.++....+...+...++.+.+++..+........ T Consensus 212 ~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 290 (413) T protein:vir:81 212 DEMIEDYD-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQA 290 (413) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCC Confidence 99998875 59999999999999999999999999999999999999888777666666667777787777655433222 Q ss_pred cceEEeehhHHHHHHHHhhcccCCceeecc--------------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecce Q lcl|Aclame:pro 528 RLAYLTSVTQRGAAKKAQVFDNTGERIWQN--------------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVL 592 (632) Q Consensus 528 ~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--------------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~ 592 (632) ..|+||+.++. .+.+++|.+|+|+|.+ ++|+|+||++++++|.++++||||+. |.++.++++ T Consensus 291 -~~~vmn~~~~~--~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~ 367 (413) T protein:vir:81 291 -DALVINPLDYQ--ELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGV 367 (413) T ss_pred -cEEEEcHHHHH--HHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecce Confidence 34889888765 4568899999999953 36999999999999999999999997 677888999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 593 DLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 593 ~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++. +|.+|++.||++.|+|+++.+|+||+++++++ T Consensus 368 ~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 409 (413) T protein:vir:81 368 RIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAE 409 (413) T ss_pred EEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecC Confidence 99988875 58999999999999999999999999999999 No 25 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=1.1e-47 Score=278.05 Aligned_cols=372 Identities=14% Similarity=0.059 Sum_probs=240.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHH Q lcl|Aclame:pro 222 NDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQH 301 (632) Q Consensus 222 ~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (632) ........++..........+. + ......... ......................... .. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~-----~----------~~~~e~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~-----~~ 59 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLF-----D----------AQKAEIEST-GQVSKQLQSDLMKVQEELTKSGTRL-----FD 59 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----H----------HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH-----HH Confidence 1111111111111111110000 0 000000000 0000000000000000000000000 00 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHH Q lcl|Aclame:pro 302 KELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDI 381 (632) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~ 381 (632) .... ... ....... .............+..............+.....+..++|.++++++ ...|++. T Consensus 60 --~~~~----~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~ 127 (385) T protein:vir:18 60 --LEQK----LAS----GAENPGE-KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ-IPGIIMP 127 (385) T ss_pred --HHHH----hhc----cccccch-hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh-hhHHHHH Confidence 0000 000 0000000 00000011111111111111222223334444455566677777775 4668888 Q ss_pred HhhhhhhhhhcceeeccCceeEEEEEecC-CccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHH Q lcl|Aclame:pro 382 LRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVEN 460 (632) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~ 460 (632) ++..++++.+ +++.+.....+.+++.+. .+.+.|++|++++++++++|+++++.+++++++++||+++|.|+ .++++ T Consensus 128 ~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~ 205 (385) T protein:vir:18 128 GLRRLTIRDL-LAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQS 205 (385) T ss_pred hhhccchhhh-cceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHH Confidence 8888888887 455666666788898765 56889999999999999999999999999999999999999865 57999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc-cccchhHHHHHHHHHHHHhhccccccceEEeehhHHH Q lcl|Aclame:pro 461 LIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY-PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRG 539 (632) Q Consensus 461 ~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (632) +|.+.|++++++++|.++++|+|+++.|.||++.++...... ..+...++.|.+++..+...+.. ...|+|++..+. T Consensus 206 ~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~ 283 (385) T protein:vir:18 206 YINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWH 283 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHH Confidence 999999999999999999999999999999998877655443 33456789999999999776643 467899888765 Q ss_pred HHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEecc--cccccCcEEEE Q lcl|Aclame:pro 540 AAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPY--TKAASDGLVLR 610 (632) Q Consensus 540 ~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 610 (632) . +.+++|.+|+|+|.+ ++|+|+||++++.+|+++++||||+. |.++.+.++++..+++ .+|.+|++.|+ T Consensus 284 ~--l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 361 (385) T protein:vir:18 284 N--IALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTIL 361 (385) T ss_pred H--HHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEE Confidence 4 457899999999863 58999999999999999999999997 7788999999887765 45899999999 Q ss_pred EEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 611 VFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 611 ~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.|+|+++.+|+||++++++| T Consensus 362 ~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 362 CEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred EEEeeccEEecccceEEEEecc Confidence 9999999999999999999999 No 26 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=1.1e-47 Score=278.05 Aligned_cols=372 Identities=14% Similarity=0.059 Sum_probs=240.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHH Q lcl|Aclame:pro 222 NDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQH 301 (632) Q Consensus 222 ~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (632) ........++..........+. + ......... ......................... .. T Consensus 1 M~~l~el~~~~~~~~~e~~~l~-----~----------~~~~e~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~-----~~ 59 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLF-----D----------AQKAEIEST-GQVSKQLQSDLMKVQEELTKSGTRL-----FD 59 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----H----------HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH-----HH Confidence 1111111111111111110000 0 000000000 0000000000000000000000000 00 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHH Q lcl|Aclame:pro 302 KELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDI 381 (632) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~ 381 (632) .... ... ....... .............+..............+.....+..++|.++++++ ...|++. T Consensus 60 --~~~~----~~~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~ 127 (385) T protein:vir:19 60 --LEQK----LAS----GAENPGE-KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ-IPGIIMP 127 (385) T ss_pred --HHHH----hhc----cccccch-hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh-hhHHHHH Confidence 0000 000 0000000 00000011111111111111222223334444455566677777775 4668888 Q ss_pred HhhhhhhhhhcceeeccCceeEEEEEecC-CccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHH Q lcl|Aclame:pro 382 LRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVEN 460 (632) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~ 460 (632) ++..++++.+ +++.+.....+.+++.+. .+.+.|++|++++++++++|+++++.+++++++++||+++|.|+ .++++ T Consensus 128 ~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~ 205 (385) T protein:vir:19 128 GLRRLTIRDL-LAQGRTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQS 205 (385) T ss_pred hhhccchhhh-cceecccCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHH Confidence 8888888887 455666666788898765 56889999999999999999999999999999999999999865 57999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc-cccchhHHHHHHHHHHHHhhccccccceEEeehhHHH Q lcl|Aclame:pro 461 LIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY-PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRG 539 (632) Q Consensus 461 ~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (632) +|.+.|++++++++|.++++|+|+++.|.||++.++...... ..+...++.|.+++..+...+.. ...|+|++..+. T Consensus 206 ~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~ 283 (385) T protein:vir:19 206 YINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWH 283 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHH Confidence 999999999999999999999999999999998877655443 33456789999999999776643 467899888765 Q ss_pred HHHHHhhcccCCceeecc------ccccCcceEEcCCCCCccEEEEehhh-EEEEEecceEEEEecc--cccccCcEEEE Q lcl|Aclame:pro 540 AAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADTWIFGDWSQ-IVIAMWGVLDLKVDPY--TKAASDGLVLR 610 (632) Q Consensus 540 ~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 610 (632) . +.+++|.+|+|+|.+ ++|+|+||++++.+|+++++||||+. |.++.+.++++..+++ .+|.+|++.|+ T Consensus 284 ~--l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 361 (385) T protein:vir:19 284 N--IALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTIL 361 (385) T ss_pred H--HHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEE Confidence 4 457899999999863 58999999999999999999999997 7788999999887765 45899999999 Q ss_pred EEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 611 VFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 611 ~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.|+|+++.+|+||++++++| T Consensus 362 ~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 362 CEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred EEEeeccEEecccceEEEEecc Confidence 9999999999999999999999 No 27 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=2.1e-47 Score=276.45 Aligned_cols=425 Identities=14% Similarity=0.162 Sum_probs=232.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 183 AEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFR 262 (632) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 262 (632) ++....+......+................+..... ..........+ .+ ....++....-...+... T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e-~~~~~~~~e~~----------e~--~a~~~el~~ei~~le~~~ 67 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAE-ERAALSADETA----------EF--RAKSASIKAELDKVEDLD 67 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhhhhHHHHH----------HH--HHHHHHHHHHHHHHHHHH Confidence 000000000000000000000000000000000000 00000000000 00 000000000000000000 Q ss_pred HHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHH--h Q lcl|Aclame:pro 263 ALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASG--K 340 (632) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 340 (632) ..+........... ........................... ....+........... ............ . T Consensus 68 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 139 (477) T protein:vir:84 68 EQIRELESEIERSG-KLEAETKTVRKATVEVNEALTYEKGNG-----QSYFRDLAMQTVGMAD--EPAKERLRRHMVDVE 139 (477) T ss_pred HHHHHHHHHHHHhh-cchhhhhhhcccccccccchhhhhhHH-----HHHHHHHHHHHhhhhh--hHHHHHHHHHHhhhh Confidence 00000000000000 000000000000000000000000000 0000000000000000 000000000000 0 Q ss_pred hhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhh-cceeeccCceeEEEEEecCCc-cccccc Q lcl|Aclame:pro 341 EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDVDIPKKTSGA-NFYWIG 418 (632) Q Consensus 341 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~a~~v~ 418 (632) ..............+...+++..+||++++++++.+.|++.+++.+++.++ ....+++...++.+|+..+.+ .+.|++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~ 219 (477) T protein:vir:84 140 SDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAA 219 (477) T ss_pred hhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeec Confidence 000111111122233344556677889999999889999999999888775 344556777788999876554 567899 Q ss_pred cCcc-----cccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccee Q lcl|Aclame:pro 419 EDED-----VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLN 493 (632) Q Consensus 419 E~~~-----~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~ 493 (632) |++. +++++++|+.+++++++++++++||+|+|.|+.+++.++|.++|+++++.++|.+|++|+|++++|.||++ T Consensus 220 Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~ 299 (477) T protein:vir:84 220 DNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRA 299 (477) T ss_pred cCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeee Confidence 9864 56788999999999999999999999999999999999999999999999999999999999899999999 Q ss_pred ccccccccccccchhH-------HHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--------- Q lcl|Aclame:pro 494 MTGVPALTYPAGGVDW-------ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--------- 557 (632) Q Consensus 494 ~a~~~~~~~~~~~~~~-------~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--------- 557 (632) .++.+.+..+.+..++ +.|.++...+...+.. ....|+|++..+. .+.+++|.+|+|+|.+ T Consensus 300 ~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~v~~~~~~~--~l~~lkd~~G~~l~~~~~~~~~~~~ 376 (477) T protein:vir:84 300 TAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFL-EPEVIVMHPRRWA--SFHAIFAGDDRPLIVPSGPGFNNLG 376 (477) T ss_pred ccccccccccccccchhhHHHHHHHHHHHHhhccccccC-CccEEEEcHHHHH--HHHHhhccCCCeeeecCcccccccc Confidence 8887766555444443 3444444444443332 2346888887754 4567899999999964 Q ss_pred -----------ccccCcceEEcCCCCCc--------cEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcE Q lcl|Aclame:pro 558 -----------NEVNGYRAEASNQIPAD--------TWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAG 618 (632) Q Consensus 558 -----------~~l~G~pv~~~~~~~~~--------~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 618 (632) ++|+|+||++++.+|.+ .++||||+.+.++. .++.+..+++.++.++++.|+++.++++. T Consensus 377 ~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 455 (477) T protein:vir:84 377 VLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFT 455 (477) T ss_pred cccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhh Confidence 37999999999999964 48999999998876 57899999999999999999999888875 Q ss_pred E-ecccceEEEEecC Q lcl|Aclame:pro 619 V-RRKEAFCIAKKGA 632 (632) Q Consensus 619 v-~~~~a~~~~~~~A 632 (632) . ++|+||+.++..| T Consensus 456 ~~r~~~afv~~t~~~ 470 (477) T protein:vir:84 456 AARFPQSVVEIGGTA 470 (477) T ss_pred hhccccceEEeeccc Confidence 5 5699999999999 No 28 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=2.1e-47 Score=276.52 Aligned_cols=355 Identities=11% Similarity=0.085 Sum_probs=234.4 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 244 QRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSK 323 (632) Q Consensus 244 ~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (632) ..++.+... .....+..+.......... .+............ . ...... ........ ... T Consensus 1 ik~L~e~~~----e~~e~~~~~~~~~~~~~~~-~e~~~~~~~~~~~~---~-~~~~~~--~~~~~~~~-~~~-------- 60 (390) T protein:vir:40 1 MNNLDKKDS----ETLNISTAFLNAIKEGATE-AEQVTAFTNMAEQI---Q-NNIIAQ--ARKEVNRE-MND-------- 60 (390) T ss_pred CchHHHHHH----HHHHHHHHHHHHHhhhhhH-HHHHHHHHHHHHHH---H-HHHHHH--HHHHHHHH-HHH-------- Confidence 111111111 1111111111111110000 00000000000000 0 000000 00000000 000 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ..... .+...................++..++|+++|.++ .+.|++.++..++++++ +++++...... T Consensus 61 -------~~~~~---~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~~~~~s~i~~~-~~~~~~~~~~~ 128 (390) T protein:vir:40 61 -------NNVLA---SRGANALTSDESKYYNEVIAGNGFAGVTALLPPTV-FERVFEDLTVEHPLLSK-INFVNTTATTE 128 (390) T ss_pred -------HHHHH---hcCchhccHHHHHHHHHHHhccCcccCcccccHHH-HHHHHHHHHhhhhhhhh-ceeeecCCcee Confidence 00000 00000000000000111223344556666666555 57788999999998887 78888877788 Q ss_pred EEEEecCCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT 482 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~ 482 (632) .+++.++.+.+.|++|++++++ ++++|+++++.+++++++++||+|+|.|+.++++++|.+.|+++++.+++.++++|+ T Consensus 129 ~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~ 208 (390) T protein:vir:40 129 WIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGS 208 (390) T ss_pred EEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 8999999999999999999875 689999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccceeccccccc----cccccchhHHHHHHHHHHHHhhccc-----cccceEEeehhHHH-HH-HHHhhcccCC Q lcl|Aclame:pro 483 GLANDPVGLLNMTGVPAL----TYPAGGVDWASVVDMETKISTFNAD-----AGRLAYLTSVTQRG-AA-KKAQVFDNTG 551 (632) Q Consensus 483 g~~~~~~Gil~~a~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-~~-~~~~~~d~~g 551 (632) |+ ++|.||++..+.... ..+...++..++.++...+...+.. ..++.|+|++.+.. .+ .+..++|.+| T Consensus 209 G~-~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G 287 (390) T protein:vir:40 209 GK-DQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQG 287 (390) T ss_pred CC-CccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCC Confidence 96 479999986543322 1233446666667766666554422 34677999988742 12 3346899999 Q ss_pred ceeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 552 ERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 552 ~~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) +|+|.. .++|+||++++++|+++++||||+.|.+++++++++.++++.+|.+|++.||+..|+|+++++++||++++++ T Consensus 288 ~~v~~~-~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 288 VWVTGI-LPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDIT 366 (390) T ss_pred cccccc-CCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEee Confidence 999864 4579999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred C Q lcl|Aclame:pro 632 A 632 (632) Q Consensus 632 A 632 (632) | T Consensus 367 ~ 367 (390) T protein:vir:40 367 G 367 (390) T ss_pred c Confidence 9 No 29 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.2e-48 Score=281.86 Aligned_cols=337 Identities=21% Similarity=0.357 Sum_probs=233.5 Q ss_pred hhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 266 LERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGF 345 (632) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (632) +.+... .... ............. . .. ...........+...... ...... ....... T Consensus 1 ~a~~~a----~~~~--~~~~~~~~~~~~~--~--~~--~kg~~~~~~~~a~a~~~g---~~~~a~-~~a~~~~------- 57 (366) T protein:vir:57 1 MAAAVA----VPVK--AHSVAPGIIIKEE--L--QQ--YKGAGMTRMVMSIAAGKG---NLADAA-KFAATEL------- 57 (366) T ss_pred Cccccc----cccc--ccccccccccccc--c--cc--ccchhHHHHHHHHHhccc---chhHHH-HHHHHhh------- Confidence 000000 0000 0000000000000 0 00 000000000000000000 000000 0000000 Q ss_pred hhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccccc Q lcl|Aclame:pro 346 YMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQD 425 (632) Q Consensus 346 ~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~ 425 (632) ......+++. .+..+||.+||.+ +.+.|++.+++.+++++++++.++..++.+.+|+.++.+.+.|++|++++++ T Consensus 58 ---~~~~~~~a~~-~~~~~Gg~lvP~~-~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~ 132 (366) T protein:vir:57 58 ---GDTGLSMAIS-TAAGSGGALIPQN-MQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVA 132 (366) T ss_pred ---cchhhhhhcc-ccccCCccccchh-HHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeeccCccccc Confidence 0111122333 3344677777766 4677999999999999998999998888899999999999999999999999 Q ss_pred CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccc--ccc Q lcl|Aclame:pro 426 SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPAL--TYP 503 (632) Q Consensus 426 ~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~--~~~ 503 (632) ++++|+++++.+++++++++||+|+|.|+.++++++|.+.|++++++++|.++++|+|++++|.||++.++..+. ..+ T Consensus 133 s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~ 212 (366) T protein:vir:57 133 TGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWT 212 (366) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeecc Confidence 999999999999999999999999999999999999999999999999999999999999999999987765443 233 Q ss_pred ccchhHHHHHHHHHHHHhhc----cccccceEEeehhHHHHHHHHhhcccCCceeec---cccccCcceEEcCCCCCc-- Q lcl|Aclame:pro 504 AGGVDWASVVDMETKISTFN----ADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ---NNEVNGYRAEASNQIPAD-- 574 (632) Q Consensus 504 ~~~~~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~---~~~l~G~pv~~~~~~~~~-- 574 (632) +...+++.+......+...+ .....+.|+|++..+.. +.+++|.+|+|+|. .++|+|+||++++++|.+ T Consensus 213 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~--L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~ 290 (366) T protein:vir:57 213 GTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMT--LFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLG 290 (366) T ss_pred ccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHH--HHhhhccCCceeccCCCCCeecceeeEEccccccccc Confidence 34455555444443333322 23456789998887654 45789999999995 368999999999999863 Q ss_pred ------cEEEEehhhEEEEEecceEEEEeccc-----------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ------TWIFGDWSQIVIAMWGVLDLKVDPYT-----------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ------~~~~gd~s~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.|.+++++++++..+++. .|.+|++.||++.|+||++.||+||++++-.= T Consensus 291 ~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~ 365 (366) T protein:vir:57 291 DDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVI 365 (366) T ss_pred cCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEeccc Confidence 48999999999999999999887762 47789999999999999999999999998766 No 30 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3.8e-48 Score=280.54 Aligned_cols=278 Identities=16% Similarity=0.211 Sum_probs=231.8 Q ss_pred hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcc Q lcl|Aclame:pro 349 HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDF 428 (632) Q Consensus 349 ~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 428 (632) ....+.++....+..+++.++++++. +.+++.+++.++++++ ++.++.....+.+|+.++.+.+.|++|+++++++++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~-~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQS-QDYFAEIEKTSIVQRI-ARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKG 78 (330) T ss_pred CcccccchhhccccCCCcceechhHH-HHHHHHHHhccchhhh-cceeeccCCceEEEEEcCCcceeEecCCCccccccc Confidence 12223445555556667778888765 6788999999999988 456777777789999999999999999999999999 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccc------- Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALT------- 501 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~------- 501 (632) +|+++++.+++++++++||+|+|.|+..+++++|.+.|++++++++|.++++|+|+++.+.|+++........ T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTT 158 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccc Confidence 9999999999999999999999999999999999999999999999999999999999999988755322211 Q ss_pred -ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc------------ccccCcceEEc Q lcl|Aclame:pro 502 -YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN------------NEVNGYRAEAS 568 (632) Q Consensus 502 -~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~------------~~l~G~pv~~~ 568 (632) .+.....++++.+++..+...+. ....|+|++..+..+ .+++|.+|+|+|++ ++|+|+||+++ T Consensus 159 ~~~~~~~~~~~l~~~~~~~~~~~~--~~~~~vmn~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~ 234 (330) T protein:vir:77 159 ASGPQGNAYLAVNNALSLLVNSGK--KWTGTLLDNVTEPIL--NTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVA 234 (330) T ss_pred cccccchhHHHHHHHHHhhhhcCC--CccEEEEcHHHHHHH--HHHhccCCceeecCccccccccccCCceecceeeEEe Confidence 12233446777777777776654 346799998887554 57899999999964 37899999999 Q ss_pred CCCCCcc------EEEEehhhEEEEEecceEEEEecccc------------------cccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 569 NQIPADT------WIFGDWSQIVIAMWGVLDLKVDPYTK------------------AASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 569 ~~~~~~~------~~~gd~s~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +++|+++ ++||||+.+.+++++++++..+++.+ |.+|++.||+..|+|+++.+|+| T Consensus 235 ~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 314 (330) T protein:vir:77 235 DNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDA 314 (330) T ss_pred ccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccc Confidence 9999754 89999999999999999998877643 78899999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.++ T Consensus 315 ~~~i~~~~ 322 (330) T protein:vir:77 315 FVKLTDQV 322 (330) T ss_pred eEEEEecc Confidence 99999999 No 31 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=1.5e-45 Score=266.31 Aligned_cols=379 Identities=13% Similarity=0.139 Sum_probs=235.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+...+.................... ...++........+......... ....... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~--~~~ee~~~~~~e~~~l~~~i~~~---------~~~~~~~------------ 57 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDG--VTAEELNKTSNEIDILQAKIEAQ---------KRKENIE------------ 57 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcC--CCHHHHHHHHHHHHHHHHHHHHH---------HHHHHHH------------ Confidence 000000000000000000000000000 00000000000000000000000 0000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhH Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSE 376 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~ 376 (632) ............... .. ...............................+.+++..++..++|.++|.++. . T Consensus 58 -----~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~-~ 128 (404) T protein:vir:10 58 -----NNFNEDNVKSLNTGK--EE-NVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQ-T 128 (404) T ss_pred -----HHHhhhhcccccccc--ch-hhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHH-H Confidence 000000000000000 00 00000000000011111111111122223345566777777788887776664 6 Q ss_pred HHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCcccccC--cccceeeeeeeeeeeeeehhhHHHhhc Q lcl|Aclame:pro 377 EFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS--DFDFTTLSFSPKTIAGAVPVTRKLRKQ 453 (632) Q Consensus 377 ~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~--~~~~~~~~~~~~t~~~~~~iSre~l~d 453 (632) .|++.++..+++..+.. ..++...+.+.+++.++.+.+.|++|++..+.+ +++|+++++++++++++++||+|+|.| T Consensus 129 ~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d 208 (404) T protein:vir:10 129 KINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKF 208 (404) T ss_pred HHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhh Confidence 68899999888888732 234455667888998899999999999999875 588999999999999999999999999 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH-HHHhhccccccceEE Q lcl|Aclame:pro 454 SSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADAGRLAYL 532 (632) Q Consensus 454 ~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~ 532 (632) +..+++++|.+.+++++++++|.++++|+|+++.|.|+++.++....+. .+..+++++.+++. .+...+. .++.|+ T Consensus 209 s~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~--~~~~~v 285 (404) T protein:vir:10 209 ADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITL-PKSPALKDFKKCKNVELLNVFK--ATSSWI 285 (404) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeec-cccccHHHHHHHHHhhhhcccc--CCCEEE Confidence 9999999999999999999999999999999999999998777655543 34567788887775 4554443 356789 Q ss_pred eehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEc-CCCCC-----ccEEEEehhh-EEEEEecceEEEEec Q lcl|Aclame:pro 533 TSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEAS-NQIPA-----DTWIFGDWSQ-IVIAMWGVLDLKVDP 598 (632) Q Consensus 533 ~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~-~~~~~-----~~~~~gd~s~-~~~~~~~~~~~~~~~ 598 (632) ||+..+.. +.+++|.+|+|+|.+ ++|+|+||++. +.++. ..++||||+. |.++.+.++++.+++ T Consensus 286 ~n~~~~~~--L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~ 363 (404) T protein:vir:10 286 VNQDGFNY--LDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTN 363 (404) T ss_pred EcHHHHHH--HHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEec Confidence 98888654 457899999999975 37999999854 44443 3489999997 678889999998876 Q ss_pred c--cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 599 Y--TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 599 ~--~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) + ..|.+|++.|+++.|+|+++.+|+||+++++++ T Consensus 364 ~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 399 (404) T protein:vir:10 364 IGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPV 399 (404) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 5 568999999999999999999999999999999 No 32 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=3.5e-47 Score=275.25 Aligned_cols=273 Identities=17% Similarity=0.209 Sum_probs=233.9 Q ss_pred HhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccce Q lcl|Aclame:pro 352 LVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFT 431 (632) Q Consensus 352 ~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 431 (632) .-..+....+..+++.+||++ +.+.|++.+++.++++++ ++.++..+....+++.+ .+.+.|++|++++++++++|+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~-~~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPIN-ISEQIITGVKNGSAAMKL-AKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchh-HHHHHHHHHHhcchhhhh-ceeeecCCCcEEEEEEc-CCceeeeecCcccccccccee Confidence 233445555556667777665 457789999999999988 56677777777788775 477999999999999999999 Q ss_pred eeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHH Q lcl|Aclame:pro 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWAS 511 (632) Q Consensus 432 ~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~ 511 (632) ++++.+++++++++||+|++.++..+++++|.+.+++++++++|.++++|+|+ ++|.|+++.+............++++ T Consensus 78 ~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~-~~~~gil~~~~~~~~~~~~~~~~~~~ 156 (299) T protein:vir:41 78 KAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVES-PYNWNILKSATDASNLVEETANKYDD 156 (299) T ss_pred EEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-cccccccccccccceeeccccccHHH Confidence 99999999999999999999999999999999999999999999999999987 47889988766555556666788999 Q ss_pred HHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCcc----EEEEeh Q lcl|Aclame:pro 512 VVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPADT----WIFGDW 581 (632) Q Consensus 512 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~~----~~~gd~ 581 (632) |.+++.++...+.. .+.|+|++..+.. +.+++|.+|+|+|++ ++|+|+||++++.+|.+. ++|||| T Consensus 157 l~~~~~~l~~~~~~--~~~~v~n~~~~~~--L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdf 232 (299) T protein:vir:41 157 LNEAIGLIEAEDLE--PNGIATIRKQRVK--YRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDW 232 (299) T ss_pred HHHHHHhhhcccCC--cCEEEEcHHHHHH--HHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEec Confidence 99999999877653 5679998888655 457899999999975 479999999999999876 999999 Q ss_pred hhEEEEEecceEEEEecccc--------------cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 582 SQIVIAMWGVLDLKVDPYTK--------------AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 582 s~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+.++.++++++..+++.. |.+|++.||++.|+|+++.+|+||+++|.+| T Consensus 233 s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 233 NQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKA 297 (299) T ss_pred ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 99999999999998877643 7899999999999999999999999999999 No 33 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=3.9e-45 Score=264.06 Aligned_cols=387 Identities=11% Similarity=0.051 Sum_probs=226.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) .+...+-........ ................+..++........+.....+......... ..... T Consensus 1 mk~~~em~~~l~el~-------~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~--------~~~~~ 65 (415) T protein:vir:46 1 MKTKEELQSEISDIK-------RQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKD 65 (415) T ss_pred CchHHHHHHHHHHHH-------HHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH Confidence 000000000000000 000000000000000000000000111111111111100000000 00000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ............... . ........... ................................+...++. T Consensus 66 ~~~~~~~~~~~~~~~-----------~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~ 131 (415) T protein:vir:46 66 RTSENNQQSVEVNEA-----------R--TYRNQANINDL-GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV 131 (415) T ss_pred Hhhhhcccccccchh-----------h--hhHHHHHHHHH-HHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc Confidence 000000000000000 0 00000000000 000000000000000000000011111122223334445 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCccccc-Ccccceeeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~ 444 (632) ++| +.+.+.|++.+++.++++.+ +++++.. ...+.+++.+..+.+.|++|++++++ +.++|+++++.++++++++ T Consensus 132 ~iP-~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:46 132 VIP-EEIVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccc-HHHHHHHHHHHHhhhhhhhh-cceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeee Confidence 555 55567789999999999887 4555544 34455556677778999999999997 5689999999999999999 Q ss_pred hhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 445 PVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNA 524 (632) Q Consensus 445 ~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 524 (632) +||+++|.|+.+++.++|.+.++++++++++.++++|+|++..+.+........+.....+..++++|.+++..+...+. T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:46 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999987766665554544555556677889999999999988775 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc-----cEEEEehhh-EEEEEecc Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD-----TWIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~-----~~~~gd~s~-~~~~~~~~ 591 (632) . ++.|+||+..+..+ .+++|.+|+|+|.+ ++|+|+||++++++|.+ .++||||+. |.++.+.+ T Consensus 290 ~--~~~~v~n~~~~~~L--~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 365 (415) T protein:vir:46 290 E--HNVAIVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred C--CCEEEEcHHHHHHH--HHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecc Confidence 3 57899999886654 57899999999965 47999999999988854 489999998 66788999 Q ss_pred eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+.+++ |.++++.+++++|+|+++.+|+||+++++.+ T Consensus 366 ~~v~~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:46 366 YQASWTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEeec---cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9988776 5667788999999999999999999999988 No 34 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=3.9e-45 Score=264.06 Aligned_cols=387 Identities=11% Similarity=0.051 Sum_probs=226.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) .+...+-........ ................+..++........+.....+......... ..... T Consensus 1 mk~~~em~~~l~el~-------~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~--------~~~~~ 65 (415) T protein:vir:47 1 MKTKEELQSEISDIK-------RQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKD 65 (415) T ss_pred CchHHHHHHHHHHHH-------HHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH Confidence 000000000000000 000000000000000000000000111111111111100000000 00000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ............... . ........... ................................+...++. T Consensus 66 ~~~~~~~~~~~~~~~-----------~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~ 131 (415) T protein:vir:47 66 RTSENNQQSVEVNEA-----------R--TYRNQANINDL-GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV 131 (415) T ss_pred Hhhhhcccccccchh-----------h--hhHHHHHHHHH-HHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc Confidence 000000000000000 0 00000000000 000000000000000000000011111122223334445 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCccccc-Ccccceeeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~ 444 (632) ++| +.+.+.|++.+++.++++.+ +++++.. ...+.+++.+..+.+.|++|++++++ +.++|+++++.++++++++ T Consensus 132 ~iP-~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:47 132 VIP-EEIVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccc-HHHHHHHHHHHHhhhhhhhh-cceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeee Confidence 555 55567789999999999887 4555544 34455556677778999999999997 5689999999999999999 Q ss_pred hhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 445 PVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNA 524 (632) Q Consensus 445 ~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 524 (632) +||+++|.|+.+++.++|.+.++++++++++.++++|+|++..+.+........+.....+..++++|.+++..+...+. T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:47 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999987766665554544555556677889999999999988775 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc-----cEEEEehhh-EEEEEecc Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD-----TWIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~-----~~~~gd~s~-~~~~~~~~ 591 (632) . ++.|+||+..+..+ .+++|.+|+|+|.+ ++|+|+||++++++|.+ .++||||+. |.++.+.+ T Consensus 290 ~--~~~~v~n~~~~~~L--~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 365 (415) T protein:vir:47 290 E--HNVAIVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred C--CCEEEEcHHHHHHH--HHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecc Confidence 3 57899999886654 57899999999965 47999999999988854 489999998 66788999 Q ss_pred eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+.+++ |.++++.+++++|+|+++.+|+||+++++.+ T Consensus 366 ~~v~~~~---~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:47 366 YQASWTD---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEeec---cccCceEEEEEEEeccEEeccccEEEEEeec Confidence 9988776 5667788999999999999999999999988 No 35 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=4.7e-45 Score=263.63 Aligned_cols=384 Identities=10% Similarity=0.034 Sum_probs=226.6 Q ss_pred hhhhhhhhhh---hhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhh Q lcl|Aclame:pro 218 GANENDILSR---ERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSA 294 (632) Q Consensus 218 ~~~~~~~~~~---~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (632) .+...+..+. ...+..............+..++........+.....+......... ............ T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~--------~~~~~~~~~~~~ 72 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKDGTSENNQ 72 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHhhhhcc Confidence 0000000000 00000000000000000000000000000111111111000000000 000000000000 Q ss_pred hhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhh Q lcl|Aclame:pro 295 RDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELL 374 (632) Q Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~ 374 (632) ....... ....... ............. .........+...............+..+++.++|. .+ T Consensus 73 ~~~~~~~--------~~~~~~~---~~~~~~~~~~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~-~~ 137 (415) T protein:vir:94 73 QSVEVNE--------ASTYRNQ---ANINDLGISIQNT---KVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPE-EI 137 (415) T ss_pred ccccccc--------hhhHHHH---HHHHHHHhhhhhh---hhhHHHHHHHHHHhhhhhhhhhhccccccccccCcH-HH Confidence 0000000 0000000 0000000000000 000000000000001111112223334445556654 45 Q ss_pred hHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHh Q lcl|Aclame:pro 375 SEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) Q Consensus 375 ~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l 451 (632) ...+++.+++.+++.++ +++++ .....+.+++.++.+.+.|++|++++++. .++|+.+++.+++++++++||+|++ T Consensus 138 ~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell 216 (415) T protein:vir:94 138 VTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI 216 (415) T ss_pred HHHHHHHHHhhhhhhhh-cceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHH Confidence 67789999999998887 34444 44556777788888899999999999975 6899999999999999999999999 Q ss_pred hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceE Q lcl|Aclame:pro 452 KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAY 531 (632) Q Consensus 452 ~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 531 (632) .|+++++.++|.+.|+++++++++.++++|+|++..+.+........+.....+..++++|.+++.++...+.. ++.| T Consensus 217 ~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~ 294 (415) T protein:vir:94 217 EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYE--HNVA 294 (415) T ss_pred hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccC--CCEE Confidence 99999999999999999999999999999999876665554444444444555678899999999998877653 5779 Q ss_pred EeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCcc-----EEEEehhh-EEEEEecceEEEEec Q lcl|Aclame:pro 532 LTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPADT-----WIFGDWSQ-IVIAMWGVLDLKVDP 598 (632) Q Consensus 532 ~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~~-----~~~gd~s~-~~~~~~~~~~~~~~~ 598 (632) +||+..+..+ .+++|.+|+|+|.+ ++|+|+||++++++|.+. ++||||+. |.++.+.++++.+++ T Consensus 295 vmn~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:94 295 IVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred EEcHHHHHHH--HHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec Confidence 9998886544 57899999999964 369999999999988654 89999998 677889999998776 Q ss_pred ccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 599 YTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 599 ~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |.++++.||++.|+|+++.+|+||+++++.+ T Consensus 373 ---~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 403 (415) T protein:vir:94 373 ---YMHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred ---cccCceEEEEEEEeccEEeccccEEEEEEec Confidence 4567788999999999999999999999998 No 36 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=5.3e-45 Score=263.34 Aligned_cols=387 Identities=11% Similarity=0.035 Sum_probs=227.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) +....+..... .....+...........-..+...+........+.....+......... ..... T Consensus 1 mk~~~el~~~l-------~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~--------~~~~~ 65 (415) T protein:vir:81 1 MKTKEELQSEI-------SDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKD 65 (415) T ss_pred CchHHHHHHHH-------HHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH Confidence 00000000000 0000000000000000000000000000000111111111100000000 00000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ................... ........ ..............+.+.. .............+...|+. T Consensus 66 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~gg~ 131 (415) T protein:vir:81 66 GTSENNQQSVEVNEARTYR--NQANINDL-----GISIQNTKVTSQEVRDFTE-------YLETRNDIQGGSLKTDSGFV 131 (415) T ss_pred hhhhhcccccccchhhhHH--HHHHHHHH-----hhhhhhhhhHHHHHHHHHH-------HHhhhhhhhhcccccccccc Confidence 0000000000000000000 00000000 0000000000000000100 00011111112223334555 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~ 444 (632) ++|.+ +.+.|++.+++.++++.+ +++++ .....+.+++.++.+.+.|++|++++++. .++|+.+++.++++++++ T Consensus 132 ~iP~~-~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:81 132 VIPEE-IVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccchH-HHHHHHHHHHhhhhhhhh-eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 56554 567788999999988887 44444 44556777788888889999999999975 689999999999999999 Q ss_pred hhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 445 PVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNA 524 (632) Q Consensus 445 ~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 524 (632) +||+|++.|+.+++.++|.+.++++++++++.++++|+|++..+.+........+.....+..++++|.+++.++...+. T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:81 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999987666555555555555566677899999999999987765 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCCCCCcc-----EEEEehhh-EEEEEecc Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQIPADT-----WIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~~~~~~-----~~~gd~s~-~~~~~~~~ 591 (632) .++.|+||+..+..+ .+++|.+|+|+|.++ +|+|+||++++++|.+. ++||||+. |.++.+.+ T Consensus 290 --~~~~~v~n~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~ 365 (415) T protein:vir:81 290 --EHNVAIVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred --CCCEEEEcHHHHHHH--HHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecc Confidence 356799998886554 578999999999753 79999999999888543 89999998 66788999 Q ss_pred eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.++++ .++++.+++++|+|+++.+|+||+++++.+ T Consensus 366 ~~v~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 366 YQASWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 99988764 456678999999999999999999999999 No 37 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=5.3e-45 Score=263.34 Aligned_cols=387 Identities=11% Similarity=0.035 Sum_probs=227.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) +....+..... .....+...........-..+...+........+.....+......... ..... T Consensus 1 mk~~~el~~~l-------~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~--------~~~~~ 65 (415) T protein:vir:79 1 MKTKEELQSEI-------SDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKD 65 (415) T ss_pred CchHHHHHHHH-------HHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH Confidence 00000000000 0000000000000000000000000000000111111111100000000 00000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ................... ........ ..............+.+.. .............+...|+. T Consensus 66 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~gg~ 131 (415) T protein:vir:79 66 GTSENNQQSVEVNEARTYR--NQANINDL-----GISIQNTKVTSQEVRDFTE-------YLETRNDIQGGSLKTDSGFV 131 (415) T ss_pred hhhhhcccccccchhhhHH--HHHHHHHH-----hhhhhhhhhHHHHHHHHHH-------HHhhhhhhhhcccccccccc Confidence 0000000000000000000 00000000 0000000000000000100 00011111112223334555 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~ 444 (632) ++|.+ +.+.|++.+++.++++.+ +++++ .....+.+++.++.+.+.|++|++++++. .++|+.+++.++++++++ T Consensus 132 ~iP~~-~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:79 132 VIPEE-IVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccchH-HHHHHHHHHHhhhhhhhh-eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 56554 567788999999988887 44444 44556777788888889999999999975 689999999999999999 Q ss_pred hhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 445 PVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNA 524 (632) Q Consensus 445 ~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 524 (632) +||+|++.|+.+++.++|.+.++++++++++.++++|+|++..+.+........+.....+..++++|.+++.++...+. T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:79 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999987666555555555555566677899999999999987765 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCCCCCcc-----EEEEehhh-EEEEEecc Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQIPADT-----WIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~~~~~~-----~~~gd~s~-~~~~~~~~ 591 (632) .++.|+||+..+..+ .+++|.+|+|+|.++ +|+|+||++++++|.+. ++||||+. |.++.+.+ T Consensus 290 --~~~~~v~n~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~ 365 (415) T protein:vir:79 290 --EHNVAIVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred --CCCEEEEcHHHHHHH--HHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecc Confidence 356799998886554 578999999999753 79999999999888543 89999998 66788999 Q ss_pred eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.++++ .++++.+++++|+|+++.+|+||+++++.+ T Consensus 366 ~~v~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 366 YQASWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 99988764 456678999999999999999999999999 No 38 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=5.3e-45 Score=263.34 Aligned_cols=387 Identities=11% Similarity=0.035 Sum_probs=227.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) +....+..... .....+...........-..+...+........+.....+......... ..... T Consensus 1 mk~~~el~~~l-------~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~--------~~~~~ 65 (415) T protein:vir:98 1 MKTKEELQSEI-------SDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDK--------LKEKD 65 (415) T ss_pred CchHHHHHHHH-------HHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH Confidence 00000000000 0000000000000000000000000000000111111111100000000 00000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ................... ........ ..............+.+.. .............+...|+. T Consensus 66 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~gg~ 131 (415) T protein:vir:98 66 GTSENNQQSVEVNEARTYR--NQANINDL-----GISIQNTKVTSQEVRDFTE-------YLETRNDIQGGSLKTDSGFV 131 (415) T ss_pred hhhhhcccccccchhhhHH--HHHHHHHH-----hhhhhhhhhHHHHHHHHHH-------HHhhhhhhhhcccccccccc Confidence 0000000000000000000 00000000 0000000000000000100 00011111112223334555 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~ 444 (632) ++|.+ +.+.|++.+++.++++.+ +++++ .....+.+++.++.+.+.|++|++++++. .++|+.+++.++++++++ T Consensus 132 ~iP~~-~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~ 209 (415) T protein:vir:98 132 VIPEE-IVTDILKLKEVEFNLDKY-VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF 209 (415) T ss_pred ccchH-HHHHHHHHHHhhhhhhhh-eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeee Confidence 56554 567788999999988887 44444 44556777788888889999999999975 689999999999999999 Q ss_pred hhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 445 PVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNA 524 (632) Q Consensus 445 ~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 524 (632) +||+|++.|+.+++.++|.+.++++++++++.++++|+|++..+.+........+.....+..++++|.+++.++...+. T Consensus 210 ~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:98 210 RISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999987666555555555555566677899999999999987765 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCCCCCcc-----EEEEehhh-EEEEEecc Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQIPADT-----WIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~~~~~~-----~~~gd~s~-~~~~~~~~ 591 (632) .++.|+||+..+..+ .+++|.+|+|+|.++ +|+|+||++++++|.+. ++||||+. |.++.+.+ T Consensus 290 --~~~~~v~n~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~ 365 (415) T protein:vir:98 290 --EHNVAIVSQTMFAKL--DKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQ 365 (415) T ss_pred --CCCEEEEcHHHHHHH--HHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecc Confidence 356799998886554 578999999999753 79999999999888543 89999998 66788999 Q ss_pred eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.++++ .++++.+++++|+|+++.+|+||+++++.+ T Consensus 366 ~~v~~~~~---~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 366 YQASWTDY---MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred eEEEEecc---ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 99988764 456678999999999999999999999999 No 39 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.2e-46 Score=270.89 Aligned_cols=295 Identities=13% Similarity=0.164 Sum_probs=236.8 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) .........++. .+... . ......++.......+++.++|.++ ...|++.+++.++++++ ++.++..+..+ T Consensus 1 ~~~~~~~~~~~~-~f~~~---~---~~~~~~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLKLNLQ-HFASN---N---VKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEVMENSKIMQL-GKYEPMEGTEK 71 (324) T ss_pred CccchhHHHHHH-HHHHh---h---hhhhhhccccccccCCCcceechhH-HHHHHHHHHhhcchhhh-cceeeccCCce Confidence 000000011110 11000 0 0111233444445556677777655 56788999999999987 56777777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) .+|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+.++++++|.+.+++++++++|.++++|+| T Consensus 72 ~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g 151 (324) T protein:vir:97 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.|.|+++.....+ ....+.+++++|.++..++...+.. ...|+|++..+.. +.+++|.+|+|++.+ ++| T Consensus 152 ~~~~~~gi~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~n~~~~~~--L~~lkd~~g~~~~~~~~~~tl 226 (324) T protein:vir:97 152 NNPFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSL--LRKIVDPETKERIYDRNSDTL 226 (324) T ss_pred CCccCccccccccccc-eeccccCCHHHHHHHHHhhhhccCC--CCEEEEcHHHHHH--HHHhhcCCCceeecCCCCccc Confidence 9988999887655444 3455678899999999999887754 4578998888654 458899999999864 589 Q ss_pred cCcceEEcCCCC--CccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQIP--ADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~~--~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++..+ .+.++||||+.+.+++++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a 306 (324) T protein:vir:97 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Confidence 999999988754 5679999999999999999999888763 388999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+- T Consensus 307 ~~~l~~~~ 314 (324) T protein:vir:97 307 FAKLVPAD 314 (324) T ss_pred eEEEEecc Confidence 99999987 No 40 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=7.5e-45 Score=262.50 Aligned_cols=391 Identities=13% Similarity=0.048 Sum_probs=230.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGN 276 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (632) +........ ..+...++.......... ..+..++.. ...+.......... T Consensus 1 m~~~~~lee------------------~~a~l~~~~~~~~~~~~~--~~~~~~e~~---~~~~~~~~~~~~~~------- 50 (419) T protein:vir:94 1 MPPTPTLEE------------------QRAALLARLDDTSLTTEQ--VQEIVAEAR---GLADALQAESDRAA------- 50 (419) T ss_pred CCHHHHHHH------------------HHHHHHHHHHHHHHHHHH--HHHHHHHHH---HHHHHHHHHHHHHH------- Confidence 000000000 000000000000000000 000000000 00000000000000 Q ss_pred hhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhh Q lcl|Aclame:pro 277 FEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQ 356 (632) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 356 (632) ..................... . .........+................ ......... ............... T Consensus 51 -~~~~~~~~~~~~~~~~~~~~~---~--~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~ 122 (419) T protein:vir:94 51 -ARAALLRTAPPAPKGPADGGT---P--LTPAEAGTFRSLAQRFADSDGLREYR-ARDKRGQFQ-VEMRDIDPNRLLSRD 122 (419) T ss_pred -HHHHHHHHHHHHHHHHhhhhc---c--ccccccccccchhhhhhhHHHHHHHH-Hhhhhhhhh-HHHHHHHHHHhhccc Confidence 000000000000000000000 0 00000000000000000000000000 000000000 000011111122222 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC--------CccccccccCcccccCcc Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS--------GANFYWIGEDEDVQDSDF 428 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~a~~v~E~~~~~~~~~ 428 (632) ....+...++..++++.....+.........++.+ +++.+.....+.+++.++ .+.+.|++|++.++++++ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 201 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL-LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTL 201 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhc-ceeeeccCCceeeeeeccccccccccCcccceecCCcccccccc Confidence 33444455666778888888888888777777776 566777666777776543 345789999999999999 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc------ Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY------ 502 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~------ 502 (632) +|+++++.+++++++++||+|+|.|+ .++.++|...++++++.++|.+||+|+|++ +|.|+++.+++..... T Consensus 202 ~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~aii~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~ 279 (419) T protein:vir:94 202 SFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAP 279 (419) T ss_pred ceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceecccccccccccccccc Confidence 99999999999999999999999875 589999999999999999999999999975 7999998877655432 Q ss_pred cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCc-eeecc-------ccccCcceEEcCCCCCc Q lcl|Aclame:pro 503 PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGE-RIWQN-------NEVNGYRAEASNQIPAD 574 (632) Q Consensus 503 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~-~~~~~-------~~l~G~pv~~~~~~~~~ 574 (632) ......+++|.++++.+...+.. +..|+|++.++..+ .++++.+|+ +++++ ++|+|+||++++.+|++ T Consensus 280 ~t~~~~~~~l~~~~~~~~~~~~~--~~~~v~n~~~~~~l--~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~ 355 (419) T protein:vir:94 280 ATDEPPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESI--ELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) T ss_pred cccchhHHHHHHHHHhhhhccCC--CCEEEEcHHHHHHH--HHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCc Confidence 22345678999999999877653 45799999886655 466776555 45543 48999999999999999 Q ss_pred cEEEEehhh-EEEEEecceEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 TWIFGDWSQ-IVIAMWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ~~~~gd~s~-~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++||||+. |.++++.++++..+++. +|.+|++.||++.|+|+++++|+|||+++++| T Consensus 356 ~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~a 416 (419) T protein:vir:94 356 TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAA 416 (419) T ss_pred cEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEecc Confidence 999999998 67788999999988865 49999999999999999999999999999999 No 41 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=1.9e-46 Score=271.27 Aligned_cols=278 Identities=15% Similarity=0.179 Sum_probs=232.6 Q ss_pred hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcc Q lcl|Aclame:pro 349 HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDF 428 (632) Q Consensus 349 ~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 428 (632) .......+....++.+++.++|+++ .+.+++.+++.++++++ +++++..+..+++|+.++.+.+.|++|+++++++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQ-GTLIMKDIMANSAIMKL-AKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhH-HHHHHHHHHhccchhhh-cceeeccCCceEEEEEeCCcceEEeecCcccccccc Confidence 1112223445555666677777765 57788999999999887 566777777788999998999999999999999999 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc----cccceeccccccccccc Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND----PVGLLNMTGVPALTYPA 504 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~----~~Gil~~a~~~~~~~~~ 504 (632) +|+++++.+++++++++||+|++.|+.++++++|.+.|++++++++|.++++|+|++.. +.+++..+.......+. T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999987532 23344444444555556 Q ss_pred cchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCCc----cEEE Q lcl|Aclame:pro 505 GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPAD----TWIF 578 (632) Q Consensus 505 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~~----~~~~ 578 (632) +..++++|.+++.++...+.. +..|+|++..+..+ .+++|.+|+|+|++ ++|+|+||++++++|.. .++| T Consensus 159 ~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L--~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~ 234 (304) T protein:vir:94 159 TNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKM--RNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALM 234 (304) T ss_pred ccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHH--HHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEE Confidence 677899999999999887654 45789988887655 47899999999986 68999999999999854 5999 Q ss_pred EehhhEEEEEecceEEEEeccc----------------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 579 GDWSQIVIAMWGVLDLKVDPYT----------------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 579 gd~s~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |||+.+.+++++++++..+++. .|.+|++.||++.|+|+++.+|+||++||.+= T Consensus 235 gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 235 GDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999999999999999887763 48999999999999999999999999999988 No 42 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=1.9e-46 Score=271.27 Aligned_cols=278 Identities=15% Similarity=0.179 Sum_probs=232.6 Q ss_pred hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcc Q lcl|Aclame:pro 349 HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDF 428 (632) Q Consensus 349 ~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 428 (632) .......+....++.+++.++|+++ .+.+++.+++.++++++ +++++..+..+++|+.++.+.+.|++|+++++++++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQ-GTLIMKDIMANSAIMKL-AKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhH-HHHHHHHHHhccchhhh-cceeeccCCceEEEEEeCCcceEEeecCcccccccc Confidence 1112223445555666677777765 57788999999999887 566777777788999998999999999999999999 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc----cccceeccccccccccc Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND----PVGLLNMTGVPALTYPA 504 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~----~~Gil~~a~~~~~~~~~ 504 (632) +|+++++.+++++++++||+|++.|+.++++++|.+.|++++++++|.++++|+|++.. +.+++..+.......+. T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999987532 23344444444555556 Q ss_pred cchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCCc----cEEE Q lcl|Aclame:pro 505 GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPAD----TWIF 578 (632) Q Consensus 505 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~~----~~~~ 578 (632) +..++++|.+++.++...+.. +..|+|++..+..+ .+++|.+|+|+|++ ++|+|+||++++++|.. .++| T Consensus 159 ~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L--~~lkd~~G~~l~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~ 234 (304) T protein:vir:10 159 TNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKM--RNALDANDRPLFDANGNEIMGLPLSYTGADVYDKKKSLALM 234 (304) T ss_pred ccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHH--HHhhccCCcEeecCCCccccceeeEEecccccCCCCcEEEE Confidence 677899999999999887654 45789988887655 47899999999986 68999999999999854 5999 Q ss_pred EehhhEEEEEecceEEEEeccc----------------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 579 GDWSQIVIAMWGVLDLKVDPYT----------------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 579 gd~s~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |||+.+.+++++++++..+++. .|.+|++.||++.|+|+++.+|+||++||.+= T Consensus 235 gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 235 GDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999999999999999887763 48999999999999999999999999999988 No 43 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.3e-44 Score=261.21 Aligned_cols=408 Identities=15% Similarity=0.152 Sum_probs=227.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 183 AEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFR 262 (632) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 262 (632) +. ....... ................ .. ....... ........ .++..+.....+..+... T Consensus 1 M~---l~el~~~--~~~~~~~~~a~l~~~~-----~~--~~~~~ee---~~~~~~e~-----~~l~~~~~~l~~~i~~le 60 (434) T protein:vir:62 1 MN---LKEILNA--SLTRTKSRLAELQGKV-----EK--NEVRSEE---LAAVKAEV-----EQLTKEIQTISEELAKLE 60 (434) T ss_pred CC---HHHHHHH--HHHHHHHHHHHHHHHH-----hc--cCccHHH---HHHHHHHH-----HHHHHHHHHHHHHHHHHH Confidence 00 0000000 0000000000000000 00 0000000 00000000 000000000000000000 Q ss_pred HHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhh Q lcl|Aclame:pro 263 ALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEA 342 (632) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 342 (632) ........ .............................................. .......+..+.+.... T Consensus 61 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~~~e~r~a~~~~l 131 (434) T protein:vir:62 61 EKEKEEDP--------AKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKG-HRTNKETEIRSVFANYI 131 (434) T ss_pred HHHHHHHH--------HhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhcc-ccchHHHHHHHHHHHHh Confidence 00000000 0000000000000000000000000000000000000000000000 00001111111111111 Q ss_pred hhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccccc---cc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWI---GE 419 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v---~E 419 (632) .. .....+.+++.. ++.+||++||.+ +.+.|++.+++.+++++++. +++.. ..+.+|+....+.+.|+ +| T Consensus 132 ~~---~~~~~e~~a~~~-~t~~GG~lvP~~-~~~~Ii~~l~~~~~i~~~~~-~~~~~-~~~~~p~~~~~~~a~~~~~~~e 204 (434) T protein:vir:62 132 VG---NIDEKEARALGL-VTGNGSVTIPDF-LSKEIITYAQEENFLRRLGT-GVKTK-ENIKYPVLVKKAEAQGHKNERT 204 (434) T ss_pred cc---ccchhhhhhhcc-cccccceecchh-hHHHHHHhhhhhhhhhhhcc-eeccC-CceEEEEEecCCcccceecccc Confidence 10 011112333333 334567777655 56778899999999998854 44443 35677777766666654 56 Q ss_pred CcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccc Q lcl|Aclame:pro 420 DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPA 499 (632) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~ 499 (632) ++..+.++++|+++++.+++++++++||+|+|.|+.++++++|.+.|+++++++++.++++|+|+++.+.|+++.+++.. T Consensus 205 ~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~ 284 (434) T protein:vir:62 205 NNEMPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEF 284 (434) T ss_pred cccccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccc Confidence 77889999999999999999999999999999999999999999999999999999999999999988899987766533 Q ss_pred ccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---------ccccCcceEEcCC Q lcl|Aclame:pro 500 LTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---------NEVNGYRAEASNQ 570 (632) Q Consensus 500 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---------~~l~G~pv~~~~~ 570 (632) .+....++++|.++..++...|+ .++.|+||+..+.. +.+++|.+|+|+|++ .+|+|+||++++. T Consensus 285 --~~~~~~~~d~l~~l~~~l~~~~~--~~a~~v~n~~~~~~--L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~ 358 (434) T protein:vir:62 285 --KTDEKNLYDALVKMKNTPVKEVR--KKARWVLNTAALTK--IETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDA 358 (434) T ss_pred --cccccchhhHHHHHHhhcchhhh--cCCEEEEcHHHHHH--HHHhhccCCCEeeccCCCccCCCCceecceeeEEecC Confidence 33456789999999999988776 46789998888755 457899999999964 2699999999999 Q ss_pred CCCcc------EEEEehhhEEEEEec-ceEEEEecccccccCcEEEEEEEEeCcEEec-ccceEE--EEecC Q lcl|Aclame:pro 571 IPADT------WIFGDWSQIVIAMWG-VLDLKVDPYTKAASDGLVLRVFQDVDAGVRR-KEAFCI--AKKGA 632 (632) Q Consensus 571 ~~~~~------~~~gd~s~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~-~~a~~~--~~~~A 632 (632) +|... ++|||||.|.++++. ++.+..+.+.+|.+|++.|+++.|+|+++++ |.++.+ ++.++ T Consensus 359 ~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~ 430 (434) T protein:vir:62 359 IDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKA 430 (434) T ss_pred ccCccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEecc Confidence 98543 889999999888775 5789999999999999999999999999876 776544 45344 No 44 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=2.1e-44 Score=260.07 Aligned_cols=365 Identities=12% Similarity=0.061 Sum_probs=216.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHH Q lcl|Aclame:pro 222 NDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQH 301 (632) Q Consensus 222 ~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (632) .+. ....+............. ....++... ..+.....+.... .....+.... ............. T Consensus 1 m~~-~e~~~~~~~~~~~l~~~~-~~~~~e~~~---~~e~~~~~~~~~~-------~~~~~e~~~~--~~~l~~~~~~~e~ 66 (379) T protein:vir:10 1 MEA-LEIKVALEAIKGQVDSKS-SAQALEVKG---LIEALEAKMTSEK-------DLAVNELKSD--MAALQAHADKLDV 66 (379) T ss_pred CCH-HHHHHHHHHHHHHHHHHH-HHHHHHHHH---HHHHHHhHhhHHH-------HHHHHHHHHH--HHHHHHHHHHHHH Confidence 000 000000000000000000 000000000 0000000000000 0000000000 0000000000000 Q ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHH Q lcl|Aclame:pro 302 KELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDI 381 (632) Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~ 381 (632) .. ........... .................. ........+....+..+++..+|+.+ ...|++. T Consensus 67 -~~--------~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ip~~~-~~~ii~~ 130 (379) T protein:vir:10 67 -KL--------KEKAKSEDKSD-----SLVKSITENFNDIKEVRN-GKSIQVKAVGDMTLPVNLTGAQPKDY-NFDVVLN 130 (379) T ss_pred -HH--------Hhcccccccch-----hHHHHHHHHHHhHHHHHh-hhhhhhhhhcccccCCCCccccchhh-hhHHHHh Confidence 00 00000000000 000000000000000000 00001111112223334444566554 5667788 Q ss_pred HhhhhhhhhhcceeeccCceeEEEEEecCC--ccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHH Q lcl|Aclame:pro 382 LRNKAIIGQMGARMLPGLVGDVDIPKKTSG--ANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVE 459 (632) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~ 459 (632) ++..+.++.+ +++++..+..+.+++.++. +.+.|++|++.+|+++++|+++++.+++|+++++||+|+|.|+. ++. T Consensus 131 ~~~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~ 208 (379) T protein:vir:10 131 PSQMLNVSDI-VGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLT 208 (379) T ss_pred HHhhhhHHhh-ceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHH Confidence 8888888887 5677777778888887643 46788999999999999999999999999999999999998764 799 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHH Q lcl|Aclame:pro 460 NLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRG 539 (632) Q Consensus 460 ~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (632) ++|.+.|+++++++++.+++.|++++. +.+ ........+++++.++++.+...+.. +..|+||+..+. T Consensus 209 ~~i~~~la~~~~~~~~~~~~~g~~~~~-~~~---------~~~~~~~~~~d~i~~~~~~~~~~~~~--~~~~vmn~~~~~ 276 (379) T protein:vir:10 209 SFIPNALRRDYAKAENAAFNAVLAANA-TAS---------TEIITNKNKVEMLINEIAKQENLDFP--VTAIVLRPTDYY 276 (379) T ss_pred HHHHHHHHHHHHHHHHHHHhccccccc-ccc---------cccccCcccHHHHHHHHHhhhhccCC--CCEEEEcHHHHH Confidence 999999999999999999998887542 111 11223344567889988888776653 456899887754 Q ss_pred HHHHHhhcccCCceeeccc---------cccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecc--cccccCcEE Q lcl|Aclame:pro 540 AAKKAQVFDNTGERIWQNN---------EVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPY--TKAASDGLV 608 (632) Q Consensus 540 ~~~~~~~~d~~g~~~~~~~---------~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~--~~~~~~~~~ 608 (632) .+.+++|.+|+|+|+++ +|+|+||++++.+|+++++||||+.+.+..+.++.+..+.+ .+|.+|++. T Consensus 277 --~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 354 (379) T protein:vir:10 277 --DILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNIT 354 (379) T ss_pred --HHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEE Confidence 45688999999999753 69999999999999999999999999888888888777665 469999999 Q ss_pred EEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 609 LRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 609 ~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ||++.|+|++|++|+|||+++++| T Consensus 355 ~r~~~R~~~~v~~p~a~v~~~~~~ 378 (379) T protein:vir:10 355 ARIEAQVALAVEQPAALIFGDFTA 378 (379) T ss_pred EEEEEEeccEEecCccEEEEEecC Confidence 999999999999999999999999 No 45 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=6.3e-46 Score=268.38 Aligned_cols=269 Identities=12% Similarity=0.134 Sum_probs=221.3 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSP 437 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 437 (632) +..+..++|.++|+++ ...+++.++..++++++ +++++.....+.+|+.++.+.+.|++|++++++++++|+++++++ T Consensus 1 ma~~t~~~G~lip~~~-~~~ii~~l~~~s~i~~l-~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLSKGNLFNPEL-VTKVINKVKGHSSIAKL-SPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVP 78 (300) T ss_pred CcccccCCcceechhh-HHHHHHHHHhhhhhhhh-cceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeee Confidence 4445556677888775 66788999999999887 455666666788999998999999999999999999999999999 Q ss_pred eeeeeeehhhHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHHhhcC----CCccccccceecccccc-ccccccchhH Q lcl|Aclame:pro 438 KTIAGAVPVTRKLR---KQSSIHVENLIREDLIEGIGVALDLAMLTGT----GLANDPVGLLNMTGVPA-LTYPAGGVDW 509 (632) Q Consensus 438 ~t~~~~~~iSre~l---~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~----g~~~~~~Gil~~a~~~~-~~~~~~~~~~ 509 (632) ++++++++||+|+| .++.++++++|.+++++++++++|.++++|+ |++..+.|....++... .........+ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 99999999999999 4667899999999999999999999999984 44444555544443332 2233456678 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec-------cccccCcceEEcCCCCCcc------E Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-------NNEVNGYRAEASNQIPADT------W 576 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~-------~~~l~G~pv~~~~~~~~~~------~ 576 (632) +.|.++...+...+.. ...|+||+.... .+.+++|.+|+|+|. +++|+|+||++++.+|.+. + T Consensus 159 ~~i~~~~~~~~~~~~~--~~~~vmn~~~~~--~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~ 234 (300) T protein:vir:95 159 ESMEDAVGMIDGSERD--ITGAILDPIFTT--ALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTA 234 (300) T ss_pred HHHHHHHHHhhhcCCC--ccEEEECHHHHH--HHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEE Confidence 8999999988876643 456899888765 456889999999985 3689999999999998643 7 Q ss_pred EEEehhhE-EEEEecceEEEEeccc--------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 577 IFGDWSQI-VIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 577 ~~gd~s~~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++|||+.+ .++.+.++++.++++. +|.+|++.||++.|+|+++.+|+||++||.+| T Consensus 235 ~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 299 (300) T protein:vir:95 235 IVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTG 299 (300) T ss_pred EEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCC Confidence 88999974 5888999999888763 38999999999999999999999999999999 No 46 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=4.2e-44 Score=258.42 Aligned_cols=370 Identities=13% Similarity=0.090 Sum_probs=222.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPG 287 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (632) +........ ..+.......... ....+. .+...+.....+........+....... ...... T Consensus 1 m~~~m~l~e----l~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~ee~~~~~~~~~~~~~~~--------~~~~~~- 62 (408) T protein:vir:10 1 MGVKLTVNQ----LNEAWIASGDKVT---DFNDQI--NMALNDDNFSAEAMSELKNKRDNEKVRR--------DALREQ- 62 (408) T ss_pred CCccccHHH----HHHHHHHHHHHHH---HHHHHH--HHHhhcccccHHHHHHHHHHHHHHHHHH--------HHHHHH- Confidence 000000000 0000000000000 000000 0000000000000000000000000000 000000 Q ss_pred HhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccc Q lcl|Aclame:pro 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGE 367 (632) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 367 (632) ... ......... ... .................+.+..............+.+++..++..+||+ T Consensus 63 ------~~~--~~~~~~~~~--~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~ 126 (408) T protein:vir:10 63 ------LVE--AQAEQVVNM--REE------EKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGL 126 (408) T ss_pred ------HHH--HHHHHHhcc--ccc------cccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCce Confidence 000 000000000 000 0000000000011111122222222222223344567777777777888 Q ss_pred eechhhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEecC-CccccccccCcccccC-cccceeeeeeeeeeeee Q lcl|Aclame:pro 368 LVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTS-GANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGA 443 (632) Q Consensus 368 ~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~ 443 (632) +||.++ .+.|++.+++.++++.+ +++++ .....+.+++..+ .+.+.|++|++++++. .++|+++++.+++++++ T Consensus 127 ~vP~~~-~~~Ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~ 204 (408) T protein:vir:10 127 TIPQDI-RTMINTLVRQYDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGI 204 (408) T ss_pred eccHhH-HHHHHHHHHhhchhhhh-cceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEee Confidence 877655 56788999999998887 34444 3445566665544 4678999999999975 58999999999999999 Q ss_pred ehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHhh Q lcl|Aclame:pro 444 VPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKISTF 522 (632) Q Consensus 444 ~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~~ 522 (632) ++||+++|.|+.+++.++|.+.|+++++++++..|++|+|++... .+..+++++.+++ ..+... T Consensus 205 ~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~---------------~~~~~~~~l~~~~~~~~~~~ 269 (408) T protein:vir:10 205 ITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAVDPA 269 (408) T ss_pred ehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---------------cccccHHHHHHHHHHhhhhh Confidence 999999999999999999999999999999999999999875432 1234577888876 457666 Q ss_pred ccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcC--CCCCc-----cEEEEehhh-EEEE Q lcl|Aclame:pro 523 NADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN--QIPAD-----TWIFGDWSQ-IVIA 587 (632) Q Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~--~~~~~-----~~~~gd~s~-~~~~ 587 (632) |+ .++.|+|++..+..+ .+++|.+|+|+|++ .+|+|+||++++ .+|.. .++||||+. |.++ T Consensus 270 ~~--~~a~~v~n~~~~~~l--~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~ 345 (408) T protein:vir:10 270 II--ATSSLLTNQSGLNKL--ALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLF 345 (408) T ss_pred hc--cCCEEEEcHHHHHHH--HHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEE Confidence 65 357899998886554 57899999999975 379999999965 34542 389999997 6789 Q ss_pred EecceEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 588 MWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 588 ~~~~~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++++++.++++. .|.+|++.||++.|+|+++.+|+||+++++++ T Consensus 346 ~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~ 392 (408) T protein:vir:10 346 DRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) T ss_pred EecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeec Confidence 9999999998874 48999999999999999999999999999999 No 47 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.7e-45 Score=265.96 Aligned_cols=295 Identities=13% Similarity=0.163 Sum_probs=234.4 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ........... +.+... . ......++....+..+++.++|.++ ...|++.+++.++++++ +++++..+..+ T Consensus 1 ~~~~~~~~~~~-~~~~~~---~---~~~~~~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~ 71 (324) T protein:vir:78 1 MEQTQKLKLNL-QHFASN---N---VKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEVMENSKIMQL-GKYEPMEGTEK 71 (324) T ss_pred CCcchhhhHHH-HHHHHH---h---hhhhhhccccccccCcCccccchhH-HHHHHHHHHhhchhhhh-cceeeccCCce Confidence 00000000000 001000 0 0011223334444556667777665 47788889999999987 56677777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) ++|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|++.|+.+++.++|.+.+++++++++|.++++|+| T Consensus 72 ~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:78 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.|.|+.+.....+. ...+..++++|.++..++...+.. ...|+|++..+.. +.+++|.+|+|++.. ++| T Consensus 152 ~~~~~~gi~~~~~~~~~-~~~~~~t~~~i~~~~~~l~~~~~~--~~~~vmn~~~~~~--L~~l~d~~G~~~~~~~~~~~l 226 (324) T protein:vir:78 152 NNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSL--LRKIVDPETKERIYDRNSDSL 226 (324) T ss_pred CCCcCccccccccccce-eccccccHHHHHHHHHhhhhccCC--CCEEEEcHHHHHH--HHHhhccCCCeeecCCCCCcc Confidence 99888998876554443 344667899999999999887654 4578998887654 457899999999863 579 Q ss_pred cCcceEEcCCC--CCccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQI--PADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~--~~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++.. +.+.++||||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A 306 (324) T protein:vir:78 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 99999998764 45679999999999999999999887763 389999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+- T Consensus 307 ~~~l~~a~ 314 (324) T protein:vir:78 307 FAKLVPAD 314 (324) T ss_pred eEEEeccc Confidence 99999865 No 48 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.7e-45 Score=265.96 Aligned_cols=295 Identities=13% Similarity=0.163 Sum_probs=234.4 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ........... +.+... . ......++....+..+++.++|.++ ...|++.+++.++++++ +++++..+..+ T Consensus 1 ~~~~~~~~~~~-~~~~~~---~---~~~~~~~a~~~~~~~~~~~~iP~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~ 71 (324) T protein:vir:96 1 MEQTQKLKLNL-QHFASN---N---VKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEVMENSKIMQL-GKYEPMEGTEK 71 (324) T ss_pred CCcchhhhHHH-HHHHHH---h---hhhhhhccccccccCcCccccchhH-HHHHHHHHHhhchhhhh-cceeeccCCce Confidence 00000000000 001000 0 0011223334444556667777665 47788889999999987 56677777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) ++|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|++.|+.+++.++|.+.+++++++++|.++++|+| T Consensus 72 ~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g 151 (324) T protein:vir:96 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.|.|+.+.....+. ...+..++++|.++..++...+.. ...|+|++..+.. +.+++|.+|+|++.. ++| T Consensus 152 ~~~~~~gi~~~~~~~~~-~~~~~~t~~~i~~~~~~l~~~~~~--~~~~vmn~~~~~~--L~~l~d~~G~~~~~~~~~~~l 226 (324) T protein:vir:96 152 NNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSL--LRKIVDPETKERIYDRNSDSL 226 (324) T ss_pred CCCcCccccccccccce-eccccccHHHHHHHHHhhhhccCC--CCEEEEcHHHHHH--HHHhhccCCCeeecCCCCCcc Confidence 99888998876554443 344667899999999999887654 4578998887654 457899999999863 579 Q ss_pred cCcceEEcCCC--CCccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQI--PADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~--~~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++.. +.+.++||||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A 306 (324) T protein:vir:96 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 99999998764 45679999999999999999999887763 389999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+- T Consensus 307 ~~~l~~a~ 314 (324) T protein:vir:96 307 FAKLVPAD 314 (324) T ss_pred eEEEeccc Confidence 99999865 No 49 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=2.6e-45 Score=264.99 Aligned_cols=295 Identities=12% Similarity=0.143 Sum_probs=235.4 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe Q lcl|Aclame:pro 329 EVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK 408 (632) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (632) +......+...+...... ......++....+..+++.++|+++ .+.+++.+++.++++++ ++.++..+..+.+|+. T Consensus 1 ~~~~~~~~~~~~~f~~~~--~~~~~~~a~~~~~~~~~~~liP~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~ip~~ 76 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNN--VKPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVMENSKIMQL-GKYEPMEGTEKKFTFW 76 (324) T ss_pred CchhHHHHHHHHHHHHhh--hhhhhcccccccccCCCcceechhH-HHHHHHHHHhhchhhhh-cceeeccCCceEEEEE Confidence 000011111111111111 1111223444445555666777665 57788999999999998 5667777777889999 Q ss_pred cCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc Q lcl|Aclame:pro 409 TSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDP 488 (632) Q Consensus 409 ~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~ 488 (632) +..+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+..++.++|.+.+++++++++|.++++|+|+++.+ T Consensus 77 ~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~ 156 (324) T protein:vir:93 77 ADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) T ss_pred ecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred ccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec---cccccCcce Q lcl|Aclame:pro 489 VGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ---NNEVNGYRA 565 (632) Q Consensus 489 ~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~---~~~l~G~pv 565 (632) .|++......+. ...+.+++++|.+++..+...+.. ...|+|++..+..+ .+++|.+|+|++. +++|+|+|| T Consensus 157 ~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~n~~~~~~L--~~l~d~~G~~~~~~~~~~~l~G~PV 231 (324) T protein:vir:93 157 KSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSLL--RKIVDPETKERIYDRNSDSLDGLPV 231 (324) T ss_pred ccccccccccce-eccccccHHHHHHHHHhhhhccCC--CCEEEEcHHHHHHH--HHhhCCCCCeeecCCCCCcccceee Confidence 888876554433 344567899999999999887654 45789988886654 5789999999986 468999999 Q ss_pred EEcCC--CCCccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccceEEEE Q lcl|Aclame:pro 566 EASNQ--IPADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEAFCIAK 629 (632) Q Consensus 566 ~~~~~--~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~ 629 (632) ++++. .+.+.+++|||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+||++|+ T Consensus 232 v~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~ 311 (324) T protein:vir:93 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) T ss_pred EeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEe Confidence 99776 445679999999999999999999988764 38899999999999999999999999999 Q ss_pred ecC Q lcl|Aclame:pro 630 KGA 632 (632) Q Consensus 630 ~~A 632 (632) .+. T Consensus 312 ~a~ 314 (324) T protein:vir:93 312 PAD 314 (324) T ss_pred ccc Confidence 877 No 50 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.2e-45 Score=266.94 Aligned_cols=271 Identities=14% Similarity=0.153 Sum_probs=218.8 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~ 436 (632) |...++.++|+++|.++ ...|++.+++.++++++ +++++.....+++|+.++.+.+.|++|++++++++++|+++++. T Consensus 1 Ma~~~~~~gg~~vP~~~-~~~ii~~l~~~s~i~~l-~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADDFLSAGKLELPGSM-IGAVRDRAIDSGVLAKL-SPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CCCCcCCcCceEcchHH-HHHHHHHHHhhchhhhh-cceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEee Confidence 55666677788877665 57788999999999998 56677777788999999999999999999999999999999999 Q ss_pred eeeeeeeehhhHHHhhcChhH----HHHHHHHHHHHHHHHHHHHHHhhcCCC--ccccccceeccccccccccccchhHH Q lcl|Aclame:pro 437 PKTIAGAVPVTRKLRKQSSIH----VENLIREDLIEGIGVALDLAMLTGTGL--ANDPVGLLNMTGVPALTYPAGGVDWA 510 (632) Q Consensus 437 ~~t~~~~~~iSre~l~d~~~~----~~~~i~~~l~~a~a~~~~~~~~~g~g~--~~~~~Gil~~a~~~~~~~~~~~~~~~ 510 (632) +++++++++||+|++.++..+ ++++|.+.+++++++++|.++++|++. +..+.|+.+...........+...++ T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATA 158 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchH Confidence 999999999999999887765 678999999999999999999998753 33445544433333333334445678 Q ss_pred HHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhccc-----CCceeec------cccccCcceEEcCCCCCc----- Q lcl|Aclame:pro 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDN-----TGERIWQ------NNEVNGYRAEASNQIPAD----- 574 (632) Q Consensus 511 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~-----~g~~~~~------~~~l~G~pv~~~~~~~~~----- 574 (632) ++.+++.++....... ...|+|++.+...+. ++++. +|+|+|. +++|+|+||++++++|.+ T Consensus 159 d~~~~~~~~~~~~~~~-~~~~imn~~~~~~L~--~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~ 235 (315) T protein:vir:80 159 DLVKAVGLIAGAGLQV-PNGVALDPAFSFALS--TEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSP 235 (315) T ss_pred HHHHHHHHHhhccCcc-ceEEEEcHHHHHHHH--HHhhccCCcccccccccccccCCCceecceeeEecCcCCccccccc Confidence 8888888876543332 346999988876664 55544 4566663 358999999999999854 Q ss_pred ----cEEEEehhhEEEEEecceEEEEeccc--------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ----TWIFGDWSQIVIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ----~~~~gd~s~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.+.++.++++++.++++. .|.+|++.||+..|+|++|++|+||++||.+| T Consensus 236 ~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 236 ASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA 305 (315) T ss_pred ccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc Confidence 37899999999999999999887763 48999999999999999999999999999888 No 51 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.8e-45 Score=265.94 Aligned_cols=277 Identities=12% Similarity=0.202 Sum_probs=230.8 Q ss_pred hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcc Q lcl|Aclame:pro 349 HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDF 428 (632) Q Consensus 349 ~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 428 (632) .......+....+..+++.+||+++ .+.|++.+++.+++++++............+++....+.+.|++|+++++++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~-~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEF-TDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKP 79 (297) T ss_pred CCccccccccccccCCCcceechhH-HHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccccccc Confidence 1222234445555566677777665 577889999999999985443333344567888888899999999999999999 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchh Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) +|+++++.+++++++++||+|++.|+..++.++|.+.+++++++++|.++++|+|+ +.|.|++......+ ....+.++ T Consensus 80 ~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~-~~~~gi~~~~~~~~-~~~~~~~t 157 (297) T protein:vir:95 80 EVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDT-PFANSVAKAAKDAN-KVIGGPIN 157 (297) T ss_pred ceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCC-cccccccccccccc-eecccccC Confidence 99999999999999999999999999999999999999999999999999999986 46888887665443 34456789 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCC--CCCccEEEEehhhE Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQ--IPADTWIFGDWSQI 584 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~--~~~~~~~~gd~s~~ 584 (632) +++|.+++.++...+.. ...|+|++..+..+ .+++|.+|+|+|++ ++|+|+||++++. ++.+.++||||+.+ T Consensus 158 ~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L--~~l~d~~G~~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~ 233 (297) T protein:vir:95 158 YDNILKLQDALYDADVE--PNAFVSKIQNRSAL--REARDGNKVSIYDKAANTIDGITTVDLKSARFEKGDLLAGDFDNL 233 (297) T ss_pred HHHHHHHHHHhhhccCC--cCEEEEcHHHHHHH--HHhhccCCceeecCCCCcccceeeEeecCCCCCCceEEEEecccE Confidence 99999999999887653 46789988876655 57899999999975 6899999998654 56788999999999 Q ss_pred EEEEecceEEEEecccc--------------cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 585 VIAMWGVLDLKVDPYTK--------------AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 585 ~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++.++++++..+++.. |.+|++.||+..|+|+++++|+||++||.|. T Consensus 234 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at 295 (297) T protein:vir:95 234 IYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAE 295 (297) T ss_pred EEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecC Confidence 99999999998877643 8899999999999999999999999999999 No 52 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=9e-44 Score=256.58 Aligned_cols=360 Identities=14% Similarity=0.131 Sum_probs=221.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAI----QKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHS 293 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (632) ++...+ ..++............ +..++.. ...+..+.....+...... ...... . T Consensus 1 Mk~~~e----L~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~--------~~~~~~-------~ 59 (397) T protein:vir:49 1 MKTSNE----LHDLWIAQGDKVENLN--EKLNVAMLDDSVSAEELQAIKNERDTAKMK--------RDLFKE-------Q 59 (397) T ss_pred CchHHH----HHHHHHHHHHHHHHHH--HHHHHHHhcchhhHHHHHHHHHHHHHHHHH--------HHHHHH-------H Confidence 110110 0000000000000000 0000000 0000000000000000000 000000 0 Q ss_pred hhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 294 ARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ..... ...... .....+ ...............+.+..... .......++....+..+||.+||.++ T Consensus 60 ~~~~~--~~~~~~--~~~~~~------~~~~~~~~~~~~~~~~~~~~~l~----~~~~~~~~~~~~~t~~~gg~~iP~~~ 125 (397) T protein:vir:49 60 YTEAR--ANEVAN--MSEEEK------KPLTKNEEEVKANFVKDFKNLVR----GRYQNLLDSKTDGSGSDAGLTIPQDI 125 (397) T ss_pred HHHHH--Hhhhhc--cccccc------ccccchhhHHHHHHHHHHHHHhh----cchhhHHHhhhccCCccCcceecHHH Confidence 00000 000000 000000 00000000001111111111111 11122234455556667777777655 Q ss_pred hhHHHHHHHhhhhhhhhhc-ceeeccCceeEEEEEecC-CccccccccCcccccCc-ccceeeeeeeeeeeeeehhhHHH Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMG-ARMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKL 450 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~ 450 (632) ...|++.+++.++++.+. ...++.....+.+++..+ .+.+.|++|+++++++. ++|+.+++.+++++++++||+++ T Consensus 126 -~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el 204 (397) T protein:vir:49 126 -RTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSL 204 (397) T ss_pred -HHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHH Confidence 567889999999988863 233445556677776654 46789999999999865 79999999999999999999999 Q ss_pred hhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccce Q lcl|Aclame:pro 451 RKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLA 530 (632) Q Consensus 451 l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 530 (632) |.|+.++++++|.+.+++++++++|.++++|+|++... ...+++++|.+++..+...+.. ++. T Consensus 205 l~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~~---------------~~~~~~d~i~~~~~~l~~~~~~--~a~ 267 (397) T protein:vir:49 205 LADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPNK---------------PTLAKWDDIIDLQAKVDPAIKQ--TSL 267 (397) T ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---------------ccccCHHHHHHHHHhhhhhhcC--CCE Confidence 99999999999999999999999999999999875432 2335688999999999887764 578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcC--CCC-----CccEEEEehhh-EEEEEecceEEE Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN--QIP-----ADTWIFGDWSQ-IVIAMWGVLDLK 595 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~--~~~-----~~~~~~gd~s~-~~~~~~~~~~~~ 595 (632) |+|++..+..+ .+++|.+|+|+|.+ ++|+|+||++++ .+| ...++||||+. |.+++++++++. T Consensus 268 ~v~n~~~~~~l--~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 345 (397) T protein:vir:49 268 FLTNTSGFTAL--KKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLL 345 (397) T ss_pred EEEcHHHHHHH--HHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEE Confidence 99998886544 57899999999965 379999998854 334 34589999997 778999999999 Q ss_pred Eeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++++. +|.+|++.||++.|+|+++++|+||++++++| T Consensus 346 ~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 346 STNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKA 384 (397) T ss_pred EeccccchhhcCeeeEEEEEeeccEEecccceEEEEecc Confidence 99865 59999999999999999999999999999999 No 53 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=9.6e-44 Score=256.43 Aligned_cols=363 Identities=13% Similarity=0.141 Sum_probs=220.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) ++...+..+................. ....+.....+..+.....+.......+. .. ...... T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~ee~~~~~~~i~~~~~~~e~--------~~-------~~~~~~ 63 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLN--VAMLDDSVSAEELQAIKNERDTAKMKRDM--------FK-------EQYTEA 63 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHH--HHHhhhhcCHHHHHHHHHHHHHHHHHHHH--------HH-------HHHHHH Confidence 11011000000000000000000000 00000000000000000000000000000 00 000000 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) . ...... ... .. ................+.+..... ........+....+..+||.++|.++ .+. T Consensus 64 ~--~~~~~~--~~~--~~----~~~~~~~~~~~~~~~~~~~~~~l~----~~~~~~~~~~~~~t~~~gg~~vP~~~-~~~ 128 (397) T protein:vir:49 64 R--ANEVAN--MSE--EE----KKPLTKSEEEVKAGFVKDFKNLVR----GRYQNLLDSKTDASGSDAGLTIPQDI-QTA 128 (397) T ss_pred H--HHhhhc--ccc--cc----ccccccchhHHHHHHHHHHHHHHh----cchhHHHHHhhccccccCcccccHhH-HHH Confidence 0 000000 000 00 000000000000011111111111 01111222344455566777776655 567 Q ss_pred HHHHHhhhhhhhhhccee--eccCceeEEEEEecC-CccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhc Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARM--LPGLVGDVDIPKKTS-GANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQ 453 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d 453 (632) |++.+++.++++.+ +++ ++.....+.+++... .+.+.|++|++++++ +.++|+++++.++++++.++||+++|.| T Consensus 129 ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d 207 (397) T protein:vir:49 129 IHTLVSQYDSLQEY-VNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLAD 207 (397) T ss_pred HHHHHHhhhhHHhh-hceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh Confidence 88999999998887 344 344555666776554 467999999999997 5799999999999999999999999999 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 454 SSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 454 ~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) +.+++.++|.+.++++++++++.++++|+|++..+.| ..+++.|.++++.+...+.. ++.|+| T Consensus 208 s~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~~~---------------~~~~d~i~~~~~~l~~~~~~--~a~~vm 270 (397) T protein:vir:49 208 SAENILAWLSGWIAKKVVVTRNKAILEAIAALPTKPT---------------LTKWDDIIDLEAKVDPAIKQ--TSFFLT 270 (397) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------------cccHHHHHHHHHhhhhhhcC--CCEEEE Confidence 9999999999999999999999999999987654322 34678999999999988764 578999 Q ss_pred ehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCC--CCC-----ccEEEEehhh-EEEEEecceEEEEec Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQ--IPA-----DTWIFGDWSQ-IVIAMWGVLDLKVDP 598 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~--~~~-----~~~~~gd~s~-~~~~~~~~~~~~~~~ 598 (632) ++.++..+ .+++|.+|+|+|.+ ++|+|+||++.+. +|. ..++||||+. |.++.+.++++.+++ T Consensus 271 n~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~ 348 (397) T protein:vir:49 271 NTSGFTAL--KKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTN 348 (397) T ss_pred cHHHHHHH--HHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEec Confidence 99886544 57899999999975 3799999988543 333 3489999997 678899999999988 Q ss_pred cc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 599 YT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 599 ~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +. +|.+|++.||++.|+|+++.+|+||++++++| T Consensus 349 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 349 IGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred cccchhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 65 69999999999999999999999999999999 No 54 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.9e-45 Score=265.75 Aligned_cols=277 Identities=17% Similarity=0.163 Sum_probs=226.8 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +..... .+.+...+.++++.++++++. ..+++.+++.++++++ ++.++..+..+++|+.+..+.+.|++|+++++++ T Consensus 1 ~g~~~e-~~~~~~~~t~~~~g~l~~~~~-~~ii~~l~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFSAD-HSQIAQTKDTMFTGYLDPVQA-KDYFAEAEKTSIVQRV-AQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPIT 77 (397) T ss_pred CCcCHH-HHHHhhccCCCCccccchhHH-HHHHHHHHhccchhhh-cceeeccCCceEEEEEcCCcceEEecCCcccccc Confidence 111222 223333344455566777765 5677777888888887 4667777777899999999999999999999999 Q ss_pred cccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccc Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGG 506 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~ 506 (632) +++|+++++.+++++++++||+|+|.++.++++++|.+.+++++++++|.++++|+|++..+.++......... .... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~--~~~~ 155 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQS--ISPN 155 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceee--eccc Confidence 99999999999999999999999999999999999999999999999999999999987777666655443222 2344 Q ss_pred hhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeeccc------------cccCcceEEcCCCCCc Q lcl|Aclame:pro 507 VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN------------EVNGYRAEASNQIPAD 574 (632) Q Consensus 507 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~------------~l~G~pv~~~~~~~~~ 574 (632) ..++.+.++...+...+.. .+.|+|++..+.. +.+++|.+|+|+|.++ +|+|+||++++++|.+ T Consensus 156 ~~~~~~~~~~~~l~~~~~~--~a~~vmn~~~~~~--L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g 231 (397) T protein:vir:23 156 AYQGLGVSGLTKLVTDGKK--WTHTLLDDTVEPV--LNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEG 231 (397) T ss_pred chhHHHHHHHHhhhhcccC--CCEEEEcHHHHHH--HHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCC Confidence 5667788888888877653 5779998887654 4578999999999753 6899999999999987 Q ss_pred c--EEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 T--WIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ~--~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . ++||||+.+.++.++++.+..+++. .|.+|++.||++.|+|+++++|+||++++..+ T Consensus 232 ~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~ 305 (397) T protein:vir:23 232 DVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDP 305 (397) T ss_pred ceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecc Confidence 6 4799999999999999999887653 38899999999999999999999999999988 No 55 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=2.3e-45 Score=265.28 Aligned_cols=287 Identities=17% Similarity=0.155 Sum_probs=221.0 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhccee Q lcl|Aclame:pro 316 AATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM 395 (632) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~ 395 (632) +..+..+ . .......+.+++...+. +++.++|+++ .+.+++.+++.++++++ +++ T Consensus 1 ~~~~~~r-------------------~---~~~~~~~e~~a~~~~~~-~~g~~ip~~~-~~~ii~~~~~~s~i~~~-~~~ 55 (326) T protein:vir:42 1 MAVNPDR-------------------T---TPFLGVNDPKVAQTGDS-MFEGYLEPEQ-AQDYFAEAEKISIVQQF-AQK 55 (326) T ss_pred CCCCccc-------------------h---hhhcCcchhhheecccc-CCcceechhh-HHHHHHHHHhcchhhhh-cce Confidence 0000000 0 00011223455554443 3455677665 56788889999998887 566 Q ss_pred eccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 LPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALD 475 (632) Q Consensus 396 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~ 475 (632) ++.....+++|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|++.++..+++++|.+.+++++++++| T Consensus 56 ~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d 135 (326) T protein:vir:42 56 IPMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFD 135 (326) T ss_pred eeccCCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHH Confidence 77777788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCCccccccceeccccccccc-----cccchhHHHH--HHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc Q lcl|Aclame:pro 476 LAMLTGTGLANDPVGLLNMTGVPALTY-----PAGGVDWASV--VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD 548 (632) Q Consensus 476 ~~~~~g~g~~~~~~Gil~~a~~~~~~~-----~~~~~~~~~i--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 548 (632) .++++|+|+ ++|.|+++......... .....+..++ .++...+. ......+.|+|++..+..+ .+++| T Consensus 136 ~a~l~G~gs-~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~n~~~~~~L--~~lkd 210 (326) T protein:vir:42 136 NAAINGTDS-PFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLV--NAGKKWTHTLLDDITEPIL--NGAKD 210 (326) T ss_pred HHhhcccCC-CccccccccccccceeecccccccccchhHHHHHHHHHhhhh--hhccCccEEEEeHHHHHHH--HHhhc Confidence 999999996 47888876554322211 1122233333 23333333 3334567899988887655 47899 Q ss_pred cCCceeeccc------------cccCcceEEcCCCCCcc--EEEEehhhEEEEEecceEEEEecccc------------- Q lcl|Aclame:pro 549 NTGERIWQNN------------EVNGYRAEASNQIPADT--WIFGDWSQIVIAMWGVLDLKVDPYTK------------- 601 (632) Q Consensus 549 ~~g~~~~~~~------------~l~G~pv~~~~~~~~~~--~~~gd~s~~~~~~~~~~~~~~~~~~~------------- 601 (632) .+|+|+|++. +++|+||++++.+|+++ ++||||+.+.++.++++.+..+++.. T Consensus 211 ~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~ 290 (326) T protein:vir:42 211 KSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVS 290 (326) T ss_pred cCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchh Confidence 9999999752 58999999999999886 46899999999999999998877643 Q ss_pred -cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 -AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 -~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) |.+|++.||+..|+|+++.+|+||++|+.++ T Consensus 291 ~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~ 322 (326) T protein:vir:42 291 LWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVD 322 (326) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEeecc Confidence 7889999999999999999999999999999 No 56 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.4e-43 Score=255.60 Aligned_cols=419 Identities=14% Similarity=0.117 Sum_probs=216.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 183 AEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFR 262 (632) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 262 (632) +.-...+. ................... ..........................+ ..+... T Consensus 1 ~~~~~~~l---~~~~~~~~~~l~el~e~~~-------~l~k~~~el~~~l~ea~~~ee~~~~ee----------~i~~l~ 60 (466) T protein:vir:80 1 MALRQLML---AKKIEQRKAALAELLEQEK-------ALQKRSEELEAAIDEANTDEEIAVVED----------EINKLE 60 (466) T ss_pred CchHHHHH---HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhHHHHHHHHH----------HHHHHH Confidence 00000000 0000000000000000000 000000000000000000000000000 000000 Q ss_pred HHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhh Q lcl|Aclame:pro 263 ALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEA 342 (632) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 342 (632) ....+..+. ...........+...................... ...... ..................+....... T Consensus 61 ~~~~el~e~-~~~l~~ei~~le~el~e~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (466) T protein:vir:80 61 GEKTELEEK-KSKLEGEIKELENELEQLNNKEPKNNSEPAQVSG-ARTQQF---VGGETRMKGFFRNMPYEQRAALIARS 135 (466) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhccCchhHHHHh-hhhhHH---hhHHHHHHHHHHhhhhhhHHHHHHHH Confidence 000000000 0000000000000000000000000000000000 000000 00000000000000000000000000 Q ss_pred hhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 422 (632) .... ..... ..........+++..+.|+.+.+.|++.++..++++.. +++.+.. ....+++.+..+.+.|++|+++ T Consensus 136 ~~~~-~~~~~-~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~-~~v~~~~-g~~~~~~~~~~~~a~wv~E~~~ 211 (466) T protein:vir:80 136 EVKE-FLAQV-RTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISK-VRLRPLK-GTARQNIAGAIPEGVWTEAVAN 211 (466) T ss_pred HHHH-HHHHH-HHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhh-eeeeecC-ceeEeeeecCCcceeecccccc Confidence 0000 00000 01111122233444445566677788888888888876 4455443 3467777888888999999999 Q ss_pred cccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc Q lcl|Aclame:pro 423 VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY 502 (632) Q Consensus 423 ~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~ 502 (632) +++++++|+++++.+++++++++||+++|.|+..+++++|..+|+++++.+++.+|++|+|++ +|.||++..+...... T Consensus 212 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~-~P~Gil~~~~~~~~~~ 290 (466) T protein:vir:80 212 LNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTK-MPVGIVTRLAQTTQPP 290 (466) T ss_pred cccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCC-Ccceeeeccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999975 6899997654322211 Q ss_pred c-------ccchhHHHHH--------------HH---HHHHHhhccccccceEEeehhHHHHHHHHhh-cccCCceeecc Q lcl|Aclame:pro 503 P-------AGGVDWASVV--------------DM---ETKISTFNADAGRLAYLTSVTQRGAAKKAQV-FDNTGERIWQN 557 (632) Q Consensus 503 ~-------~~~~~~~~i~--------------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~g~~~~~~ 557 (632) . ...++...+. ++ ...+...+ ......|+|+......+....+ .+.+|.+++.+ T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~ 369 (466) T protein:vir:80 291 NWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANY-SNGMKFWAMSSNTHAVLMSKAITFNSAGALVASL 369 (466) T ss_pred ccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccc-cCCceeEEecchhHHHhhcccccccCCccccccC Confidence 1 1111111111 11 12222222 2334457777776554432222 26677887765 Q ss_pred c---cccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 558 N---EVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 558 ~---~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) + +++|+||++++++|.+.++||||+.|.++++.++++.++++.+|.+|++.||+..|+||++++++||++++++. T Consensus 370 ~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~ 447 (466) T protein:vir:80 370 NNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIAN 447 (466) T ss_pred CCcccccccceeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecC Confidence 3 58999999999999999999999999999999999999999999999999999999999999999999999888 No 57 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=5.8e-45 Score=263.11 Aligned_cols=295 Identities=12% Similarity=0.164 Sum_probs=234.8 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ............ .+... . ......++....+..+++.++|.++ .+.|++.+++.++++++ ++.++..+..+ T Consensus 1 ~~k~~~~~~~~~-~~~~~----~--~~~~~~~a~~~~~~~~~~~lip~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~ 71 (324) T protein:vir:99 1 MEQTQKLKLNLQ-HFASN----N--VKPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVMENSKIMRL-GKYEPMEGTEK 71 (324) T ss_pred CCCchHhhHHHH-HHHHH----h--hhhhhccccceeccCCCcceechhH-HHHHHHHHHhhchhhhh-cceeeccCCce Confidence 000000000000 01000 0 0111223344444455566777665 57888999999999987 56677777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) .+|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|++.|+..+++++|.+.+++++++++|.++++|+| T Consensus 72 ~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) T protein:vir:99 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 99999888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.+.|+++.....+ ....+.++++++.++...+...+.. ...|+|++..+..+ .+++|.+|++++.+ ++| T Consensus 152 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~n~~~~~~L--~~l~d~~g~~~~~~~~~~~l 226 (324) T protein:vir:99 152 NNPFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSLL--RKIVDPETKERIYDRNSDTL 226 (324) T ss_pred CCccCccccccccccc-eeccccCCHHHHHHHHHhhhhccCC--CCEEEEcHHHHHHH--HHhhcCCCceeecCCCCccc Confidence 9888888887655433 3445678899999999999887654 45789988886654 57899999999864 579 Q ss_pred cCcceEEcCCCC--CccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQIP--ADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~~--~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++..+ .+.+++|||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 306 (324) T protein:vir:99 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 999999998766 4569999999999999999999888763 388999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+. T Consensus 307 ~~~lt~a~ 314 (324) T protein:vir:99 307 FAKLVPAD 314 (324) T ss_pred eEEEEecc Confidence 99999988 No 58 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=2.8e-43 Score=253.87 Aligned_cols=369 Identities=13% Similarity=0.085 Sum_probs=220.8 Q ss_pred hhhhh-hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhh Q lcl|Aclame:pro 213 APAAS-GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAI 291 (632) Q Consensus 213 ~~~~~-~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (632) +.... .....+................. ....+.....+...+......... .+......... T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~ee~~~~~~~~~~~~--------~~~~~~~~~~~--- 64 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQIN-----MALNDDNFSAEAMSELKNKRDNEK--------VRRDALREQLV--- 64 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhccccccHHHHHHHHHHHHHHH--------HHHHHHHHHHH--- Confidence 00000 00000000000000000000000 000000000000000000000000 00000000000 Q ss_pred hhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceech Q lcl|Aclame:pro 292 HSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVAT 371 (632) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~ 371 (632) .. .............. ..............+.+..............+.+++..++..+||+++|. T Consensus 65 ----~~---~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~ 130 (404) T protein:vir:39 65 ----EA---QAEQVVNMREEEKG-------PLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQ 130 (404) T ss_pred ----HH---HHHHHhcccccccc-------ccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccH Confidence 00 00000000000000 00000000111111222222222222333446667777777777777776 Q ss_pred hhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEec-CCccccccccCccccc-Ccccceeeeeeeeeeeeeehhh Q lcl|Aclame:pro 372 ELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKT-SGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVT 447 (632) Q Consensus 372 ~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iS 447 (632) ++ ...|++.+++.++++.+. +..+ .....+.+++.. ..+.+.|++|++++++ +.++|+++++.+++++++++|| T Consensus 131 ~~-~~~ii~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS 208 (404) T protein:vir:39 131 DI-RTMINTLVRQYDSLQQYV-RVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITAT 208 (404) T ss_pred HH-HHHHHHHHHhhhhHHhhc-ceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhH Confidence 55 567889888988888873 4444 444555566554 4467899999999997 5799999999999999999999 Q ss_pred HHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH-HHHhhcccc Q lcl|Aclame:pro 448 RKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADA 526 (632) Q Consensus 448 re~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~ 526 (632) ++++.|+.+++.++|.+.|++++++++|.++++|+|++... ....+++++.+++. .+...+. T Consensus 209 ~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~---------------~~~~~~~~i~~~~~~~~~~~~~-- 271 (404) T protein:vir:39 209 NTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK---------------PTIAKFDDVITMINTSVDPAII-- 271 (404) T ss_pred HHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---------------cccccHHHHHHHHHHhhhhhhc-- Confidence 99999999999999999999999999999999999875322 12345677777765 4555554 Q ss_pred ccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCC--CCC-----ccEEEEehhh-EEEEEecc Q lcl|Aclame:pro 527 GRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQ--IPA-----DTWIFGDWSQ-IVIAMWGV 591 (632) Q Consensus 527 ~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~--~~~-----~~~~~gd~s~-~~~~~~~~ 591 (632) .++.|+|++..+..+ .+++|.+|+|+|.+ ++|+|+||+++++ +|. ..++||||+. |.++.+.+ T Consensus 272 ~~a~~v~n~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 349 (404) T protein:vir:39 272 ATSSLLTNQSGLNKL--ALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDREN 349 (404) T ss_pred cCCEEEEcHHHHHHH--HHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecc Confidence 356799998886554 57899999999965 3799999999754 332 3489999997 67888999 Q ss_pred eEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 592 LDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 592 ~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.++++. +|.+|++.||++.|+|+.+.+|+||+++++++ T Consensus 350 ~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 392 (404) T protein:vir:39 350 MSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTA 392 (404) T ss_pred eEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeec Confidence 999998875 68999999999999999999999999999888 No 59 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=3.4e-43 Score=253.39 Aligned_cols=369 Identities=14% Similarity=0.085 Sum_probs=223.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhH Q lcl|Aclame:pro 208 GAKNPAPAASGANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDL 285 (632) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (632) +...... ................... ..+..+.....+...+........... ...... T Consensus 1 m~~~m~i-----------~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~--------~~~~~~ 61 (408) T protein:vir:74 1 MGVKLTV-----------NQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVR--------RDALRE 61 (408) T ss_pred CChhhhH-----------HHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHH--------HHHHHH Confidence 0000000 0000000000000000000 000000000000000000000000000 000000 Q ss_pred HHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccc Q lcl|Aclame:pro 286 PGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKG 365 (632) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 365 (632) . .......... ....... .. .............+.+..............+.+++..++..+| T Consensus 62 ~----------~~~~~~~~~~-~~~~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g 124 (408) T protein:vir:74 62 Q----------LVEAQAEQVV-NMREEEK----GP--LNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAA 124 (408) T ss_pred H----------HHHHHHHHHh-hcccccc----cc--ccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCC Confidence 0 0000000000 0000000 00 0000000011111222222222222334456667777777778 Q ss_pred cceechhhhhHHHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecC-CccccccccCccccc-Ccccceeeeeeeeeeee Q lcl|Aclame:pro 366 GELVATELLSEEFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTS-GANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAG 442 (632) Q Consensus 366 ~~~i~~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~ 442 (632) |.+||.++. ..|++.+++.++++.+.. ..++.....+.+++..+ .+.+.|++|++++++ +.++|+++++.++++++ T Consensus 125 g~~vP~~~~-~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~ 203 (408) T protein:vir:74 125 GLTIPQDIR-TMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAG 203 (408) T ss_pred ceeechhHh-hHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEe Confidence 888876664 678899999988888732 22344455666776654 456789999999997 56999999999999999 Q ss_pred eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHh Q lcl|Aclame:pro 443 AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKIST 521 (632) Q Consensus 443 ~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~ 521 (632) +++||+|++.|+.++++++|.+.|++++++++|.++++|+|++.... ...+++++.+++ ..+.. T Consensus 204 ~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~~---------------~~~~~~~i~~~~~~~l~~ 268 (408) T protein:vir:74 204 IITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKP---------------TIANFDDVITMINTSVDP 268 (408) T ss_pred eehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------------ccccHHHHHHHHHHhhhh Confidence 99999999999999999999999999999999999999998764322 234567888876 46777 Q ss_pred hccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCC--CCC-----ccEEEEehhh-EEE Q lcl|Aclame:pro 522 FNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQ--IPA-----DTWIFGDWSQ-IVI 586 (632) Q Consensus 522 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~--~~~-----~~~~~gd~s~-~~~ 586 (632) .|+. ++.|+|++..+..+ .+++|.+|+|+|.+ ++|+|+||+++++ +|. ..++||||+. |.+ T Consensus 269 ~~~~--~a~~v~n~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~ 344 (408) T protein:vir:74 269 AIIA--TSSLLTNQSGLNKL--ALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITL 344 (408) T ss_pred hhcC--CCEEEEcHHHHHHH--HHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEE Confidence 6653 57799988886544 57899999999975 4799999998753 442 3489999997 678 Q ss_pred EEecceEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.++++++.++++. .|.+|++.||++.|+|+++++|+||+++++++ T Consensus 345 ~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 392 (408) T protein:vir:74 345 FDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTA 392 (408) T ss_pred EEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeec Confidence 89999999998874 58999999999999999999999999999988 No 60 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=6e-45 Score=263.03 Aligned_cols=282 Identities=15% Similarity=0.137 Sum_probs=226.4 Q ss_pred hhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccc Q lcl|Aclame:pro 344 GFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDV 423 (632) Q Consensus 344 ~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 423 (632) ......-..+.+.+...++.+++.++|+++ ...+++.+++.++++++ +++++..+..+++|+.++.+.+.|++|++++ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~ 78 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQ-AKDYFAEAEKTSIVQQF-AQKVPMGTTGQKIPHWVGDVSAQWIGEGDMK 78 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhH-HHHHHHHHHhhchhhhh-cceeeccCCceEEEEEeCCcceEEecCCccc Confidence 111111122334445555566677777655 56778899999999998 5677877778899999999999999999999 Q ss_pred ccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccc Q lcl|Aclame:pro 424 QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP 503 (632) Q Consensus 424 ~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~ 503 (632) ++++++|+++++.+++++++++||+|+|.|+..+++++|.+.|++++++++|.++++|+|++ .|.|++........... T Consensus 79 ~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~-~~~~~~~~~~~~~~~~~ 157 (318) T protein:vir:24 79 PITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSP-FPTYIGQTTKAISIADT 157 (318) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCC-CCccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999864 67787765543333322 Q ss_pred c--cchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeeccc------------cccCcceEEcC Q lcl|Aclame:pro 504 A--GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN------------EVNGYRAEASN 569 (632) Q Consensus 504 ~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~------------~l~G~pv~~~~ 569 (632) . .....+.+.++...+...+ ..+..|+|++..+..+ .+++|.+|+|+|.++ +++|+||++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~ 233 (318) T protein:vir:24 158 TGATTVYDQVAVNGLSLLVNDG--KKWTHTLLDDITEPIL--NGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSD 233 (318) T ss_pred ccccchHHHHHHHHHHhhcccc--CCCCEEEEcHHHHHHH--HHhhccCCceeecCccccCccccccCceEEEEeeEEeC Confidence 2 2233345566666665544 4457899999887655 478999999999753 58899999999 Q ss_pred CCCCcc--EEEEehhhEEEEEecceEEEEecccc--------------cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 570 QIPADT--WIFGDWSQIVIAMWGVLDLKVDPYTK--------------AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 570 ~~~~~~--~~~gd~s~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+|.++ ++||||+.+.+++++++.+..+++.. |.+|++.||+..|+|+++.+|+||++|+.++ T Consensus 234 ~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~ 312 (318) T protein:vir:24 234 HVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVV 312 (318) T ss_pred CCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeec Confidence 999765 58999999999999999998877643 8899999999999999999999999999988 No 61 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=8e-45 Score=262.33 Aligned_cols=295 Identities=12% Similarity=0.165 Sum_probs=234.9 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ..........+. .+.... ......++....+..+++.++|.++ .+.|++.+++.++++++ ++.++..+..+ T Consensus 1 ~~~~~~~~~~~~-~f~~~~------~~~~~~~a~~~~~~~~~~~liP~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~ 71 (324) T protein:vir:10 1 MEQTQKLKLNLQ-HFASNN------VKPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVMENSKIMQL-GKYEPMEGTEK 71 (324) T ss_pred CCCchHHHHHHH-HHHHHh------hccceecccceeccCCCcceechhH-HHHHHHHHHhhchhhhh-cceeeccCCce Confidence 000000000110 111000 0011123334444555566777666 57888999999999987 56677777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) .+|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|++.|+..+++++|.+.+++++++++|.++++|+| T Consensus 72 ~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g 151 (324) T protein:vir:10 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 99999888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.|.|+++.....+ ....+.+++++|.++...+...+.. ...|+|++..+.. +.+++|.+|+|++.+ ++| T Consensus 152 ~~~~~~~i~~~~~~~~-~~~~~~~t~~~i~~~~~~l~~~~~~--~~~~v~n~~~~~~--L~~l~d~~g~~~~~~~~~~~l 226 (324) T protein:vir:10 152 NNPFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSL--LRKIVDPETKERIYDRNSDTL 226 (324) T ss_pred CCccCccccccccccc-eeccccCCHHHHHHHHHhhhhccCC--CCEEEEcHHHHHH--HHHhhccCCceeecCCCCccc Confidence 9988999887655443 3445678899999999999887654 4578998888664 457899999999864 579 Q ss_pred cCcceEEcCCCC--CccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQIP--ADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~~--~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++..+ .+.+++|||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A 306 (324) T protein:vir:10 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 999999988755 4569999999999999999999888763 388999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+. T Consensus 307 ~~~l~~a~ 314 (324) T protein:vir:10 307 FAKLVPAD 314 (324) T ss_pred eEEEEecc Confidence 99999988 No 62 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3.5e-43 Score=253.32 Aligned_cols=375 Identities=12% Similarity=0.069 Sum_probs=221.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) ++....+...+..+............. ..+..++........+.....+....+.......... . T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~----~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~----~------- 65 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKAL----QEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLP----G------- 65 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHh----hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H------- Confidence 000000000000000000000000000 0000000000000111111100000000000000000 0 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHH--hhhhhhhhhhhHHhhhhhcccccccccceec Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASG--KEARGFYMPHEVLVQRQLEKKTAGKGGELVA 370 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~ 370 (632) .............. . . ...............+..+... ...............+++...+..+||.+|| T Consensus 66 ~~~~~~~~~~~~~~---~---~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP 136 (397) T protein:vir:12 66 GVNFVPEQERNPEG---Q---R---SQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIP 136 (397) T ss_pred Hhhhhhhhhhhhcc---c---c---cccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCc Confidence 00000000000000 0 0 0000000000000000000000 0000011111223456666667777777776 Q ss_pred hhhhhHHHHHHHhhhhhhhhhcce-eeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhH Q lcl|Aclame:pro 371 TELLSEEFIDILRNKAIIGQMGAR-MLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTR 448 (632) Q Consensus 371 ~~~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSr 448 (632) .++ .+.|++.+++.++++.+... .++.....+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+ T Consensus 137 ~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~ 215 (397) T protein:vir:12 137 EDI-GRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSN 215 (397) T ss_pred hhH-HHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhH Confidence 554 67789999999988887432 23445567888888888999999999999975 6899999999999999999999 Q ss_pred HHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH-HHHhhccccc Q lcl|Aclame:pro 449 KLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADAG 527 (632) Q Consensus 449 e~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~ 527 (632) |++.|+.+++.++|.+.|++++++++|.+|++|+|++ .|.|+ ++++.+.+++. .+...+. . T Consensus 216 e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~-~~~g~---------------~~~~~i~~~~~~~l~~~~~--~ 277 (397) T protein:vir:12 216 SMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASL-KKVDI---------------DGLDGIKKALNVTLDPMVA--P 277 (397) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccc---------------ccHHHHHHHHhhccchhhh--C Confidence 9999999999999999999999999999999999874 34443 34567777664 6665554 4 Q ss_pred cceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCC-CCC-----ccEEEEehhh-EEEEEecceE Q lcl|Aclame:pro 528 RLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQ-IPA-----DTWIFGDWSQ-IVIAMWGVLD 593 (632) Q Consensus 528 ~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~-~~~-----~~~~~gd~s~-~~~~~~~~~~ 593 (632) ++.|+|++..+..+ .+++|.+|+|+|++ ++|+|+||+++++ +|. ..++||||+. |.++.+.++. T Consensus 278 ~a~~~~n~~~~~~L--~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 355 (397) T protein:vir:12 278 GSIVLTNQDGYDWL--DTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQS 355 (397) T ss_pred CCEEEEcHHHHHHH--HHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceE Confidence 57899998886554 57899999999975 3799999987654 332 2489999998 4678899999 Q ss_pred EEEecc--cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 594 LKVDPY--TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 594 ~~~~~~--~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.++++ ..|.+|++.||++.|+|+++++|+||++++++| T Consensus 356 i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~ 396 (397) T protein:vir:12 356 IASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITV 396 (397) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEee Confidence 887654 468999999999999999999999999999999 No 63 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=7.4e-43 Score=251.57 Aligned_cols=360 Identities=13% Similarity=0.110 Sum_probs=218.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQ----KGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHS 293 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (632) ++... ....+............ +....... ..+..+.....+....... .... .. T Consensus 1 Mk~~~----el~~~~~~~~~~i~~~~--~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~--------~~~~-------~~ 59 (397) T protein:vir:48 1 MKTSN----ELHDLWVAQGDKVENLN--EKLNVAMLDDSVTAEELQAIKNERDTAKMKR--------DMFK-------EQ 59 (397) T ss_pred CchHH----HHHHHHHHHHHHHHHHH--HHHHHhhcchhhhHHHHHHHHHHHHHHHHHH--------HHHH-------HH Confidence 00000 00000000000000000 00000000 0000000000000000000 0000 00 Q ss_pred hhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 294 ARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ........ ........ .............+..+.+...... .......+...++..++|.+||.++ T Consensus 60 ~~~~~~~~------~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~t~~~gg~~iP~~~ 125 (397) T protein:vir:48 60 YTEARANE------VVNMSEEE----KKPLTKSEEEVKAGFVKDFKNLVRG----RYQNLLDSKTDASGSDAGLTIPQDI 125 (397) T ss_pred HHHHHHhh------hhhhhhhc----cccccchhhHHHHHHHHHHHHHHhh----hhhHHHHHhhccCCccccccccHHH Confidence 00000000 00000000 0000000000111111111111111 1111122233445556777777666 Q ss_pred hhHHHHHHHhhhhhhhhhcce-eeccCceeEEEEEe-cCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHH Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGAR-MLPGLVGDVDIPKK-TSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKL 450 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~ 450 (632) .+.|++.+++.++++.+... .+++....+.++.. +..+.+.|++|+++++++ .++|+++++++++++++++||+++ T Consensus 126 -~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el 204 (397) T protein:vir:48 126 -QTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL 204 (397) T ss_pred -HHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH Confidence 56789999999888887432 24444445555544 445678999999999986 589999999999999999999999 Q ss_pred hhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccce Q lcl|Aclame:pro 451 RKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLA 530 (632) Q Consensus 451 l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 530 (632) |.|+.+++.++|.+.+++++++++|.++++|+|++.... +..+++.|.+++.++...+.. ++. T Consensus 205 l~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~~---------------~~~~~d~i~~~~~~l~~~~~~--~a~ 267 (397) T protein:vir:48 205 LADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPTKP---------------TLTKWDDIIDLQAKVDPAIKQ--TSF 267 (397) T ss_pred HhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------------ccccHHHHHHHHHHhhhhhcC--CCE Confidence 999999999999999999999999999999998764322 235678999999999888764 578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCC--C-----CCccEEEEehhh-EEEEEecceEEE Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQ--I-----PADTWIFGDWSQ-IVIAMWGVLDLK 595 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~--~-----~~~~~~~gd~s~-~~~~~~~~~~~~ 595 (632) |+|++..+.. +.+++|.+|+|+|++ ++|+|+||++.+. + +...++||||+. |.++.++++++. T Consensus 268 ~v~n~~~~~~--L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 345 (397) T protein:vir:48 268 FLTNTSGFTA--LKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLL 345 (397) T ss_pred EEECHHHHHH--HHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEE Confidence 9999888654 457899999999975 3799999987543 2 244689999997 568899999999 Q ss_pred Eeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+++. +|.+|++.||++.|+|+++++|+||++++++| T Consensus 346 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:48 346 STNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKA 384 (397) T ss_pred EeccchhhhhcCceeEEEEeeeccEEecccceEEEEecc Confidence 88865 69999999999999999999999999999999 No 64 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.8e-44 Score=260.45 Aligned_cols=295 Identities=12% Similarity=0.168 Sum_probs=231.8 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) ............ .+... .. .....++.......+++.++|+++ .+.|++.+++.++++++ +++++..+..+ T Consensus 1 ~~~~~~~~~~~~-~f~~~----~~--~~~~~~a~~~~~~~~~~~lip~~~-~~~ii~~~~~~s~l~~l-~~~~~~~~~~~ 71 (324) T protein:vir:96 1 MEQTQKLKLNLQ-HFASN----NV--KPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVMENSKIMQL-GKYEPMEGTEK 71 (324) T ss_pred CCcchhhhHHHH-HHHHh----hh--hhhhcccccccccCCCcceechhH-HHHHHHHHHhhchhhhh-cceeeccCCce Confidence 000000000000 00000 00 001122333334445566777665 57788899999998887 56677777788 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) .+|+.++.+.+.|++|++++++++++|+++++.+++++++++||+|+|.|+..++.++|.+.+++++++++|.++++|+| T Consensus 72 ~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g 151 (324) T protein:vir:96 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred EEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 99999988999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc---ccc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEV 560 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l 560 (632) +++.+.|++......+ ....+.+++++|.+++.++...+.. ...|+|++..+.. +.+++|.+|+|++.+ ++| T Consensus 152 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~i~~~~~~--~~~~i~n~~~~~~--L~~lkd~~G~~~~~~~~~~~l 226 (324) T protein:vir:96 152 NNPFGKSIAQSIKKTN-KVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSL--LRKIVDPETKERIYDRNSDSL 226 (324) T ss_pred CCCcCccccccccccc-eecccccchHHHHHHHHhhhhccCC--CCEEEEcHHHHHH--HHHhhCCCCCeeecCCCCCcc Confidence 9888888876544333 3444567899999999999877653 4578998888654 457899999999864 589 Q ss_pred cCcceEEcCCC--CCccEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 561 NGYRAEASNQI--PADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 561 ~G~pv~~~~~~--~~~~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) +|+||++++.. +.+.++||||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+| T Consensus 227 ~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a 306 (324) T protein:vir:96 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred cceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Confidence 99999997764 45679999999999999999999888763 488999999999999999999999 Q ss_pred eEEEEecC Q lcl|Aclame:pro 625 FCIAKKGA 632 (632) Q Consensus 625 ~~~~~~~A 632 (632) |++|+.+- T Consensus 307 ~~~l~~a~ 314 (324) T protein:vir:96 307 FAKLVPAD 314 (324) T ss_pred eEEEeccc Confidence 99999877 No 65 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=4.7e-43 Score=252.65 Aligned_cols=345 Identities=13% Similarity=0.110 Sum_probs=219.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+... ...........+.......+..+ ..+....++ .............. T Consensus 1 M~k~l~----~l~e~~~~~~~e~~~~~~~~~~e-------~~~~~~~ei---------------~~l~~~i~~~~~~~-- 52 (371) T protein:vir:81 1 MPKELR----ELLEQINNKKEEARKLLAENKIE-------EAKKLKEEI---------------VALQEKFDVAKELY-- 52 (371) T ss_pred CcHHHH----HHHHHHHHHHHHHHHHhhHHHHH-------HHHHHHHHH---------------HHHHHHHHHHHHHH-- Confidence 000000 00000000000000000000000 000000000 00000000000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhH Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSE 376 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~ 376 (632) .......... .. . ...........+.+.... .....+++..++..+||.++|.++ .. T Consensus 53 -----~~~~~~~~~~-~~--------~-~~~~~~~~~~~~~~~~~l-------~~~~~~a~~~~t~~~gg~~vP~~~-~~ 109 (371) T protein:vir:81 53 -----EEQKQTIEDK-EP--------L-KPTVQVKENEVEAFVNHI-------RTRFRNAMSEGSNQDGGYTVPQDI-QT 109 (371) T ss_pred -----HHHHHhhccc-cc--------c-ccchhhHHHHHHHHHHHH-------HHHHHHhhccCCCccCceeecHhH-HH Confidence 0000000000 00 0 000000001111111111 111335566667777778887765 56 Q ss_pred HHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcC Q lcl|Aclame:pro 377 EFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQS 454 (632) Q Consensus 377 ~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~ 454 (632) .|++.+++.++++++.. ..+++....+.+++.++.+.+.|++|++++++ +.++|+++++++++++++++||+|+|.|+ T Consensus 110 ~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds 189 (371) T protein:vir:81 110 RINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDS 189 (371) T ss_pred HHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhh Confidence 78899999999888732 23344556677788888889999999999986 57999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH-HHHhhccccccceEEe Q lcl|Aclame:pro 455 SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADAGRLAYLT 533 (632) Q Consensus 455 ~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~ 533 (632) .+++.++|.+.+++++++++|.++++|+|++. |.| ..+++.+..+.. .+...++ .++.|+| T Consensus 190 ~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~-~~~---------------~~~~~~i~~~~~~~l~~~~~--~~a~~vm 251 (371) T protein:vir:81 190 TEAIVNTLVRWIGDESRVTRNGLIINVLNTKA-KTA---------------IADLDGLKQIINVQLDPVFR--STSSVIV 251 (371) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccc---------------cccHHHHHHHHHhhcchhhh--cCCEEEE Confidence 99999999999999999999999999988642 222 234566666553 5555554 4578999 Q ss_pred ehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCC------------ccEEEEehhh-EEEEEecceE Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA------------DTWIFGDWSQ-IVIAMWGVLD 593 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~------------~~~~~gd~s~-~~~~~~~~~~ 593 (632) |+..+.. +.+++|.+|+|+|.+ ++|+|+||++++++|. ..++||||+. |.++.+.+++ T Consensus 252 n~~~~~~--L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~ 329 (371) T protein:vir:81 252 NQDAFNW--LDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTE 329 (371) T ss_pred cHHHHHH--HHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceE Confidence 9888655 457899999999975 4799999999998873 3589999998 6778899999 Q ss_pred EEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 594 LKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 594 ~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.++++. .|.+|++.|+++.|+|+++.+|+||+++++++ T Consensus 330 i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~ 370 (371) T protein:vir:81 330 IMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQL 370 (371) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEec Confidence 9998875 58899999999999999999999999999999 No 66 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=2.3e-44 Score=259.83 Aligned_cols=346 Identities=13% Similarity=0.058 Sum_probs=228.1 Q ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 245 RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKA 324 (632) Q Consensus 245 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (632) +....+. . ....+....+.+........ .+.....+... .....+..... ....+. T Consensus 1 M~i~~k~--~-~~~~~~~~~l~~~~~~~~~~-ee~~~~~~~~~--------~~~~~~~~~~~---~~e~~~--------- 56 (377) T protein:vir:98 1 MAINLKE--L-PKYREAVAELSAKISAGATS-EEQEKLFEAAF--------TTMGDEILAKN---EEEMER--------- 56 (377) T ss_pred CCCcHHH--H-HHHHHHHHHHHHHHHhhhhh-HHHHHHHHHHH--------HhHHHHHHHHH---HHHHHH--------- Confidence 1111100 0 01111111111111110000 00000000000 00000000000 000000 Q ss_pred hhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEE Q lcl|Aclame:pro 325 GFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVD 404 (632) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (632) ... ..+...................++...+|++||.++ .+.|++.+...++++.+ +++.+.. .... T Consensus 57 --------~~~--~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~-~~~I~~~l~~~s~i~~~-~~v~~~~-~~~~ 123 (377) T protein:vir:98 57 --------MFD--LRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEET-MVQVFDDLVAEHPLLKV-INFKNTS-LRLK 123 (377) T ss_pred --------HHH--hccCCcccCHHHHHHHHHHHhccCCCCCccccCHHH-HHHHHHHHHHhhhhhhh-eeeEecC-cceE Confidence 000 000000000000001111233455666777787665 55677778888888887 4555554 3568 Q ss_pred EEEecCCccccccccCcccc-cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 405 IPKKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 405 ~~~~~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) +++.++.+.+.|++|+++.+ .++++|+++++.++++++.++||+++|.|+.++++++|.+.++++++++++.+|++|+| T Consensus 124 ~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G 203 (377) T protein:vir:98 124 ALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDG 203 (377) T ss_pred EEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccC Confidence 89999999999999999876 46899999999999999999999999999999999999999999999999999999999 Q ss_pred Cccccccceeccccccccc------cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTY------PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN 557 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~ 557 (632) ++ +|.||++..+...... .......+.+.++...++..++. ++.|+|+..+.. .+.+++|.+|+|+|.. T Consensus 204 ~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--~a~~~m~~~t~~--~~~klkd~~G~~i~~~ 278 (377) T protein:vir:98 204 LL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPK--KLVPVMKHLSVN--DKKRPLKIAGQVKLIL 278 (377) T ss_pred CC-cceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHH--HHHHHHHHHHHH--HHhhhhccCCceEEEe Confidence 75 8999998654332211 12223346777888888777654 467787776654 4568899999999931 Q ss_pred ---------------------ccccCcc--eEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEE Q lcl|Aclame:pro 558 ---------------------NEVNGYR--AEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQD 614 (632) Q Consensus 558 ---------------------~~l~G~p--v~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 614 (632) .+++|+| |+.++.+|+++++||||+.|.+++++++++..+++.+|.+|++.|++..| T Consensus 279 n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r 358 (377) T protein:vir:98 279 NPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNY 358 (377) T ss_pred cccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEE Confidence 2577777 67788999999999999999999999999999999999999999999999 Q ss_pred eCcEEecccceEEEEecC Q lcl|Aclame:pro 615 VDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 615 ~~~~v~~~~a~~~~~~~A 632 (632) +|+++++++||++|+++. T Consensus 359 ~dg~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 359 FYGKAKDNHTAALLTLAG 376 (377) T ss_pred EcCEEeccCcEEEEEEec Confidence 999999999999999999 No 67 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=6.2e-44 Score=257.47 Aligned_cols=344 Identities=14% Similarity=0.070 Sum_probs=224.4 Q ss_pred hhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 250 EAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFERE 329 (632) Q Consensus 250 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (632) -.++..+...+.+................. ...... .... ... ...... T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-------------~~~~-----~~~-~~~~~~----------- 49 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQN-ELYGDM-------------INQL-----FEE-TKLQAK----------- 49 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHH-HHHHHH-------------HHhh-----hhh-HHHHHH----------- Confidence 111111122222222221111110000000 000000 0000 000 000000 Q ss_pred HHHHHHHHH-HhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe Q lcl|Aclame:pro 330 VSLAIADAS-GKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK 408 (632) Q Consensus 330 ~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (632) .+..+.. .+.........+.....++...+.++||+++|.++ .+.|++.+...++++++ +++.+.+ ...++++. T Consensus 50 --~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~-~~~I~~~l~~~spir~~-a~v~~~~-~~~~i~~~ 124 (381) T protein:vir:10 50 --AEAERVSSLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEET-IDRIFEDLTTNHPLLAD-LGIKNAG-LRLKFLKS 124 (381) T ss_pred --HHHHHHHHhcccccccCHHHHHHHHHHhhcCCCCCceecCHHH-HHHHHHHHHhhcceeee-eeeEecC-cceEEEee Confidence 0000000 00000001111122233556667778888887765 56678888888999887 5566654 45678888 Q ss_pred cCCccccccccCcccc-cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc Q lcl|Aclame:pro 409 TSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND 487 (632) Q Consensus 409 ~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~ 487 (632) +..+.+.|++|.++.+ ..+++|+++++.+++++++++||+++|.|+.++++++|...++++++++++.+|++|+|++ + T Consensus 125 ~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~-q 203 (381) T protein:vir:10 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-Q 203 (381) T ss_pred cCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCC-C Confidence 8889999999998876 5689999999999999999999999999999999999999999999999999999999974 8 Q ss_pred cccceeccccccc-c-------ccccchhHHHHHHHHH-------HHHhh-----ccccccceEEeehhHHHHHHH-Hhh Q lcl|Aclame:pro 488 PVGLLNMTGVPAL-T-------YPAGGVDWASVVDMET-------KISTF-----NADAGRLAYLTSVTQRGAAKK-AQV 546 (632) Q Consensus 488 ~~Gil~~a~~~~~-~-------~~~~~~~~~~i~~~~~-------~~~~~-----~~~~~~~~~~~~~~~~~~~~~-~~~ 546 (632) |.||++....... . .+...+++.++..+.. .+... .....++.|+|++.+...+.. ... T Consensus 204 P~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~ 283 (381) T protein:vir:10 204 PIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred ceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcccccc Confidence 9999874332111 1 1111233333332222 22111 012345678999887654432 234 Q ss_pred cccCCceeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceE Q lcl|Aclame:pro 547 FDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFC 626 (632) Q Consensus 547 ~d~~g~~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~ 626 (632) .+.+|+|+|..+ .|+||++++.+|+++++||||+.|.++++.++++.++++.+|.+|++.|++..|+|++++|++||+ T Consensus 284 ~~~~G~~v~~lp--~g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~ 361 (381) T protein:vir:10 284 LNANGVYVTALP--FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) T ss_pred CCCCCceeecCC--CCceeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEE Confidence 578899998643 588999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecC Q lcl|Aclame:pro 627 IAKKGA 632 (632) Q Consensus 627 ~~~~~A 632 (632) ++++++ T Consensus 362 v~~l~~ 367 (381) T protein:vir:10 362 VWKLDL 367 (381) T ss_pred EEEEee Confidence 999997 No 68 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.9e-44 Score=259.25 Aligned_cols=268 Identities=14% Similarity=0.135 Sum_probs=216.3 Q ss_pred ccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeee Q lcl|Aclame:pro 359 KKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPK 438 (632) Q Consensus 359 ~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~ 438 (632) ..+..+||+++|.++ .+.|++.+++.+++++++ ++++......++|+.++.+.+.|++|++++++++++|+++++.++ T Consensus 1 mat~~~gg~lvP~~~-~~~ii~~~~~~s~i~~~~-~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~ 78 (311) T protein:vir:81 1 MVALATGTFQLPKHL-VPGVWQKAQGQSVLARLS-MAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPR 78 (311) T ss_pred CceecCCceEcchhH-HHHHHHHHHhcchhhhhc-ceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeE Confidence 334446677777665 588999999999999984 567777778999999999999999999999999999999999999 Q ss_pred eeeeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHhhcCC--Cccccccceecccccc----ccccccchhH Q lcl|Aclame:pro 439 TIAGAVPVTRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTG--LANDPVGLLNMTGVPA----LTYPAGGVDW 509 (632) Q Consensus 439 t~~~~~~iSre~l~---d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g--~~~~~~Gil~~a~~~~----~~~~~~~~~~ 509 (632) +++++++||+|+|. ++..+++++|.+.+++++++++|.++++|++ ++..+.|+++...... .+.......+ T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:81 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) T ss_pred EEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHH Confidence 99999999999995 5668899999999999999999999999964 4556677765432221 1112222333 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc-------- Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD-------- 574 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~-------- 574 (632) ..+.++...+...+. ....|+||+.....+ .+++|.+|+|+|.+ ++|+|+||++++.+|.+ T Consensus 159 ~~i~~~~~~~~~~~~--~~~~~vmn~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~ 234 (311) T protein:vir:81 159 LAVEAAVGLVLGDNL--SPDGVALDNTFSFML--ATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTAST 234 (311) T ss_pred HHHHHHHHHhhhcCC--CceEEEEcHHHHHHH--HhhhccCCCeeecCccccCCCceecceeEEeccccccccccccccc Confidence 455566655554332 233589988887554 57899999999963 68999999999988753 Q ss_pred ----------cEEEEehhhEEEEEecceEEEEeccc-------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ----------TWIFGDWSQIVIAMWGVLDLKVDPYT-------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ----------~~~~gd~s~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+.|.++.+.++++..+++. .|.+|++.||+..|+|++|.+|+||++||.+. T Consensus 235 ~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~ 309 (311) T protein:vir:81 235 GVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) T ss_pred chhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeec Confidence 36899999999999999999888763 48999999999999999999999999999998 No 69 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.3e-43 Score=251.95 Aligned_cols=349 Identities=13% Similarity=0.109 Sum_probs=221.3 Q ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHh-hhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 245 RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKP-AIHSARDLGIQHKELQQYSLMRAINAAATGDWSK 323 (632) Q Consensus 245 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (632) +.............++.+..+............ ........... ........ ....... T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~---~~e~~~~---------------- 60 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEE-QSKAFGAMFDALSNDLQEEI---TAEINNR---------------- 60 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHH-HHHHHHHHHHHHHHHHHHHH---HHHHHHH---------------- Confidence 111111111111112222222211111100000 00000000000 00000000 0000000 Q ss_pred hhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeE Q lcl|Aclame:pro 324 AGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDV 403 (632) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 403 (632) .. ... ....+... ...........++...+.++||++||.++ .+.|++.+...++++++ +++.+.+. .. T Consensus 61 -----~~-~~~-~~~~r~~~-~l~~ee~~~~~~~~~~t~~~gG~liP~~~-~~~Ii~~l~~~s~i~~~-~~v~~~~~-~~ 129 (395) T protein:vir:95 61 -----VV-DNG-ILAKRSQD-PLTSEERKFFNDINYDVGYTDEKILPETV-VERVFDDLQKDHPLLSK-INFQNAGI-KT 129 (395) T ss_pred -----HH-HHH-HHhhcCcc-ccchHHHHHHHHHhhccCCCCceeccHHH-HHHHHHHHHhhhhhhhh-ceeEecCC-ce Confidence 00 000 00000000 00011111123445556677788887665 56788888899999987 55666543 56 Q ss_pred EEEEecCCccccccccCcccc-cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT 482 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~ 482 (632) .+++.++.+.+.|+.|.++.+ .++++|+++++.+++++++++||+|+|.|+..+++++|.+.++++++.+++.+|++|+ T Consensus 130 ~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~ 209 (395) T protein:vir:95 130 RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGG 209 (395) T ss_pred EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeecc Confidence 889999999999999988775 5789999999999999999999999999999999999999999999999999999999 Q ss_pred CCcc-ccccceeccccccccc----cccchhHHHHHHHHHHHHhhc------------cccccceEEeehhHHHHHHHHh Q lcl|Aclame:pro 483 GLAN-DPVGLLNMTGVPALTY----PAGGVDWASVVDMETKISTFN------------ADAGRLAYLTSVTQRGAAKKAQ 545 (632) Q Consensus 483 g~~~-~~~Gil~~a~~~~~~~----~~~~~~~~~i~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~ 545 (632) |+++ +|.||++.....+... ....++++.+..+...+...+ ....+..|+|++.+. T Consensus 210 G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~------- 282 (395) T protein:vir:95 210 GAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDS------- 282 (395) T ss_pred CCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhh------- Confidence 9874 7999997654333221 122233343433333332211 122356788887653 Q ss_pred hcccCCceeecc-----cccc--CcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcE Q lcl|Aclame:pro 546 VFDNTGERIWQN-----NEVN--GYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAG 618 (632) Q Consensus 546 ~~d~~g~~~~~~-----~~l~--G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 618 (632) .+..|+|+|++ .+++ |+||++++.+|+++++||||+.|.+++++++++.++++.+|.+|++.|++..|+|++ T Consensus 283 -~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~ 361 (395) T protein:vir:95 283 -WDVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQ 361 (395) T ss_pred -hhcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCE Confidence 35678888875 2455 556899999999999999999999999999999999999999999999999999999 Q ss_pred EecccceEEEEecC Q lcl|Aclame:pro 619 VRRKEAFCIAKKGA 632 (632) Q Consensus 619 v~~~~a~~~~~~~A 632 (632) ++|++||++|++.. T Consensus 362 ~~~~~A~~~l~i~~ 375 (395) T protein:vir:95 362 PDDNKASAVYDLKV 375 (395) T ss_pred EeccccEEEEEeec Confidence 99999999999985 No 70 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=6.3e-44 Score=257.41 Aligned_cols=268 Identities=12% Similarity=0.176 Sum_probs=219.7 Q ss_pred ccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeee Q lcl|Aclame:pro 359 KKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPK 438 (632) Q Consensus 359 ~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~ 438 (632) ..+.+++|+++|+++ ...|++.+++.++++++ +++++..+...++|+.++.+.+.|++|++++|+++++|+++++.++ T Consensus 1 m~t~t~gg~liP~~~-~~~ii~~l~~~s~i~~l-~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MGTETSKASLFDKHL-VSDLINKVKGHSSLAKL-SSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPI 78 (303) T ss_pred CcccCCCCeEcchhH-HHHHHHHHHhhchhhhh-cceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeE Confidence 335556777777665 57789999999999998 5667777778899999999999999999999999999999999999 Q ss_pred eeeeeehhhHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc----cccccc--eeccccccccccccchhH Q lcl|Aclame:pro 439 TIAGAVPVTRKLR---KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA----NDPVGL--LNMTGVPALTYPAGGVDW 509 (632) Q Consensus 439 t~~~~~~iSre~l---~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~----~~~~Gi--l~~a~~~~~~~~~~~~~~ 509 (632) ++++++++|+|+| .++.+++.++|.+.+++++++++|.++++|+++. ..+.|. +..........+.+...+ T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (303) T protein:vir:97 79 KVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDAD 158 (303) T ss_pred EEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchH Confidence 9999999999999 4677899999999999999999999999986432 222222 222223333444556778 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--------ccccCcceEEcCCCCCc------- Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--------NEVNGYRAEASNQIPAD------- 574 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--------~~l~G~pv~~~~~~~~~------- 574 (632) ++|.+++.++...+. ....|+|++.....+ .+++|.+|+|+|.+ ++|+|+||++++++|.. T Consensus 159 ~~i~~~~~~~~~~~~--~~~~~vmn~~~~~~L--~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~ 234 (303) T protein:vir:97 159 ANIEAAVNLIQGAEG--VVTGLAMDTEFSTAL--AKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESK 234 (303) T ss_pred HHHHHHHHHHhhcCC--CccEEEEcHHHHHHH--HHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCc Confidence 999999988876554 345699988876544 57899999999864 37999999999998853 Q ss_pred -cEEEEehhh-EEEEEecceEEEEeccc--------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 -TWIFGDWSQ-IVIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 -~~~~gd~s~-~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+. |.++.+.++++..+++. .|.+|++.||++.|+|++|++|+||++||.+= T Consensus 235 ~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~ 302 (303) T protein:vir:97 235 DLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGE 302 (303) T ss_pred cEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCC Confidence 389999965 78999999999887764 38999999999999999999999999999888 No 71 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.9e-43 Score=253.79 Aligned_cols=344 Identities=13% Similarity=0.052 Sum_probs=224.1 Q ss_pred hhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 250 EAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFERE 329 (632) Q Consensus 250 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (632) -.+.......+.+.+........... .......... .......... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~-~~~~~~~~~~------------------~~~~~~~~~~-~~~----------- 49 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQ-ERQNELYGDM------------------INQLFEETKL-QAK----------- 49 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh-HHHHHHHHHH------------------HHhhhhhHHH-HHH----------- Confidence 11111111111121111111110000 0000000000 0000000000 000 Q ss_pred HHHHHHHHHH-hhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe Q lcl|Aclame:pro 330 VSLAIADASG-KEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK 408 (632) Q Consensus 330 ~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (632) .+...... ..........+.....++..++.+.||+++|.++ .+.|++.+...++++++ +++.+.+ ....+++. T Consensus 50 --~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~l~~~s~i~~~-~~v~~~~-~~~~i~~~ 124 (381) T protein:vir:10 50 --AEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEET-IDRIFEDLTTNHPLLAD-LGIKNAG-LRLKFLKS 124 (381) T ss_pred --HHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHH-HHHHHHHHHhhccceeh-eeeEecC-cceEEEEe Confidence 00000000 0000000011122233455666777888887765 56788888888888887 4566654 35788999 Q ss_pred cCCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc Q lcl|Aclame:pro 409 TSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND 487 (632) Q Consensus 409 ~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~ 487 (632) ++.+.+.|++|+++++. ++++|+++++.+++++++++||+++|.|+..+++++|...++++++.+++.+|++|+|++ + T Consensus 125 ~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~-q 203 (381) T protein:vir:10 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-Q 203 (381) T ss_pred cCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCC-C Confidence 99999999999998874 589999999999999999999999999999999999999999999999999999999974 8 Q ss_pred cccceeccccccccc---------------cccchhHHHHHHHHHHHHhhcc-----ccccceEEeehhHHHHHHH-Hhh Q lcl|Aclame:pro 488 PVGLLNMTGVPALTY---------------PAGGVDWASVVDMETKISTFNA-----DAGRLAYLTSVTQRGAAKK-AQV 546 (632) Q Consensus 488 ~~Gil~~a~~~~~~~---------------~~~~~~~~~i~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-~~~ 546 (632) |.||++..+...... ......++.+.++...+...+. ...++.|+|++.+...+.. ... T Consensus 204 P~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:10 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred ceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 999987543211111 1111223444555444433211 1345678999887654432 234 Q ss_pred cccCCceeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceE Q lcl|Aclame:pro 547 FDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFC 626 (632) Q Consensus 547 ~d~~g~~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~ 626 (632) .+.+|+|+|..+ .|.+|++++.+|++.++||||+.|.++++.++++.++++.+|.+|++.|++..|+|+++++++||+ T Consensus 284 ~~~~G~~v~~l~--~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~ 361 (381) T protein:vir:10 284 LNANGVYVTALP--FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) T ss_pred CCCCCceeecCC--CCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEE Confidence 567899987633 467799999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecC Q lcl|Aclame:pro 627 IAKKGA 632 (632) Q Consensus 627 ~~~~~A 632 (632) +++++. T Consensus 362 v~~l~~ 367 (381) T protein:vir:10 362 VWKLDL 367 (381) T ss_pred EEEEEe Confidence 988777 No 72 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.9e-43 Score=253.79 Aligned_cols=344 Identities=13% Similarity=0.052 Sum_probs=224.1 Q ss_pred hhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 250 EAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFERE 329 (632) Q Consensus 250 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (632) -.+.......+.+.+........... .......... .......... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~-~~~~~~~~~~------------------~~~~~~~~~~-~~~----------- 49 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQ-ERQNELYGDM------------------INQLFEETKL-QAK----------- 49 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh-HHHHHHHHHH------------------HHhhhhhHHH-HHH----------- Confidence 11111111111121111111110000 0000000000 0000000000 000 Q ss_pred HHHHHHHHHH-hhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe Q lcl|Aclame:pro 330 VSLAIADASG-KEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK 408 (632) Q Consensus 330 ~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (632) .+...... ..........+.....++..++.+.||+++|.++ .+.|++.+...++++++ +++.+.+ ....+++. T Consensus 50 --~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~l~~~s~i~~~-~~v~~~~-~~~~i~~~ 124 (381) T protein:vir:95 50 --AEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEET-IDRIFEDLTTNHPLLAD-LGIKNAG-LRLKFLKS 124 (381) T ss_pred --HHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHH-HHHHHHHHHhhccceeh-eeeEecC-cceEEEEe Confidence 00000000 0000000011122233455666777888887765 56788888888888887 4566654 35788999 Q ss_pred cCCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc Q lcl|Aclame:pro 409 TSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND 487 (632) Q Consensus 409 ~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~ 487 (632) ++.+.+.|++|+++++. ++++|+++++.+++++++++||+++|.|+..+++++|...++++++.+++.+|++|+|++ + T Consensus 125 ~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~-q 203 (381) T protein:vir:95 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKD-Q 203 (381) T ss_pred cCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCC-C Confidence 99999999999998874 589999999999999999999999999999999999999999999999999999999974 8 Q ss_pred cccceeccccccccc---------------cccchhHHHHHHHHHHHHhhcc-----ccccceEEeehhHHHHHHH-Hhh Q lcl|Aclame:pro 488 PVGLLNMTGVPALTY---------------PAGGVDWASVVDMETKISTFNA-----DAGRLAYLTSVTQRGAAKK-AQV 546 (632) Q Consensus 488 ~~Gil~~a~~~~~~~---------------~~~~~~~~~i~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-~~~ 546 (632) |.||++..+...... ......++.+.++...+...+. ...++.|+|++.+...+.. ... T Consensus 204 P~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:95 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred ceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 999987543211111 1111223444555444433211 1345678999887654432 234 Q ss_pred cccCCceeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceE Q lcl|Aclame:pro 547 FDNTGERIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFC 626 (632) Q Consensus 547 ~d~~g~~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~ 626 (632) .+.+|+|+|..+ .|.+|++++.+|++.++||||+.|.++++.++++.++++.+|.+|++.|++..|+|+++++++||+ T Consensus 284 ~~~~G~~v~~l~--~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~ 361 (381) T protein:vir:95 284 LNANGVYVTALP--FNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAA 361 (381) T ss_pred CCCCCceeecCC--CCceEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEE Confidence 567899987633 467799999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecC Q lcl|Aclame:pro 627 IAKKGA 632 (632) Q Consensus 627 ~~~~~A 632 (632) +++++. T Consensus 362 v~~l~~ 367 (381) T protein:vir:95 362 VWKLDL 367 (381) T ss_pred EEEEEe Confidence 988777 No 73 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=1.2e-43 Score=255.98 Aligned_cols=282 Identities=16% Similarity=0.136 Sum_probs=221.8 Q ss_pred hhhhhHHhhhhhcccccc------cccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccccccc Q lcl|Aclame:pro 346 YMPHEVLVQRQLEKKTAG------KGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGE 419 (632) Q Consensus 346 ~~~~~~~~~~a~~~~~~~------~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 419 (632) .. ...+.++...++.. .++.++| +.+.+.|++.+++.++++++ +++++..+..+.+|+.+..+.+.|++| T Consensus 1 ~a--~l~el~~~~~~~~~~g~~~~~~~~liP-~~~~~~ii~~l~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MA--TLNELLPNSAGSNHQGRLAHVPSDLLP-KEIVGPIFDKAQESSLVLRM-GEQIPISYGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred Cc--hhHHhhhhcccccccCceecCCccccc-hhHHHHHHHHHHhhchhhhh-cceeeccCCceEEEEEeCCceeEeecC Confidence 00 01111222222222 2223454 55568899999999999988 566777777889999998888877766 Q ss_pred C--------cccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccc Q lcl|Aclame:pro 420 D--------EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPV 489 (632) Q Consensus 420 ~--------~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~ 489 (632) + +.+++++++|+++++.+++++++++||+|++.++..+++++|.+.|++++++++|.++|+|+|++ ..+. T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~ 156 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccc Confidence 5 56788899999999999999999999999999999999999999999999999999999998864 4566 Q ss_pred cceeccccccc-----cccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHH-HHhhcccCCceeecc------ Q lcl|Aclame:pro 490 GLLNMTGVPAL-----TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK-KAQVFDNTGERIWQN------ 557 (632) Q Consensus 490 Gil~~a~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~g~~~~~~------ 557 (632) |+.+....... ....+.++++.|.+++..+...+. .....|+|++..+..+. +.+++|.+|+|+|.+ T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~ 235 (333) T protein:vir:78 157 GIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTD-VEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQ 235 (333) T ss_pred cccccccccccccccccccccchhHHHHHHHHHhhccccc-cCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCC Confidence 77665443322 223345678888888888765542 33456999998876553 456899999999964 Q ss_pred -ccccCcceEEcCCCCCc---------cEEEEehhhEEEEEecceEEEEeccc-----------ccccCcEEEEEEEEeC Q lcl|Aclame:pro 558 -NEVNGYRAEASNQIPAD---------TWIFGDWSQIVIAMWGVLDLKVDPYT-----------KAASDGLVLRVFQDVD 616 (632) Q Consensus 558 -~~l~G~pv~~~~~~~~~---------~~~~gd~s~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~r~~ 616 (632) ++|+|+||++++++|.+ .++||||+.|.++.++++++..+++. .|.+|++.||++.|+| T Consensus 236 ~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d 315 (333) T protein:vir:78 236 TGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFG 315 (333) T ss_pred CceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEc Confidence 47999999999999864 48999999999999999999998863 4889999999999999 Q ss_pred cEEecccceEEEEecC Q lcl|Aclame:pro 617 AGVRRKEAFCIAKKGA 632 (632) Q Consensus 617 ~~v~~~~a~~~~~~~A 632 (632) +++.+|+||++|+.++ T Consensus 316 ~~v~~~~a~~~l~~~~ 331 (333) T protein:vir:78 316 WLLGDKQAFVKFVDDE 331 (333) T ss_pred cEEecccceEEEeccC Confidence 9999999999999998 No 74 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=3.2e-42 Score=248.07 Aligned_cols=357 Identities=15% Similarity=0.130 Sum_probs=217.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+...+..+ +..............+...+ .+....+........+. .....+ T Consensus 1 M~k~l~el~~----~~~~~~~e~~~~~~~~~~~e-------~~~~~~e~~~l~~~i~~--------~~~~~~-------- 53 (392) T protein:vir:10 1 MSKELRELLA----KLEGKKEEVRSLMGEDKVAE-------AEQMMEEVRSLQKKIDL--------QRSLDE-------- 53 (392) T ss_pred CcHHHHHHHH----HHHHHHHHHHHHhhHHHHHH-------HHHHHHHHHHHHHHHHH--------HHHHHH-------- Confidence 0000011011 10000000000000000000 00000000000000000 000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh---hhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK---EARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ..............+. ............+.... ............+.+.+...+..+||.++|.++ T Consensus 54 ---~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~ 122 (392) T protein:vir:10 54 ---AETEERNNGREVETRN--------VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDI 122 (392) T ss_pred ---HHHHHhhccccccccC--------ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhH Confidence 0000000000000000 00000000000000000 000011111223344555566667777777655 Q ss_pred hhHHHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHh Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l 451 (632) ...|++.+++.++++.+.. ..+++....+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+|+| T Consensus 123 -~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell 201 (392) T protein:vir:10 123 -QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL 201 (392) T ss_pred -HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHH Confidence 5678899999998888633 234455666778888888899999999999975 6899999999999999999999999 Q ss_pred hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHhhccccccce Q lcl|Aclame:pro 452 KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKISTFNADAGRLA 530 (632) Q Consensus 452 ~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~ 530 (632) .|+.+++.++|.+.++++++++++.+|++|+|++.. .+..+++++.+++ ..+...++ .++. T Consensus 202 ~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~~--~~a~ 263 (392) T protein:vir:10 202 QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAIS--PNAI 263 (392) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhhc--cCCE Confidence 999999999999999999999999999999886432 1235677888877 46666665 3578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEE-cCCC-C--------CccEEEEehhh-EEEEEecce Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEA-SNQI-P--------ADTWIFGDWSQ-IVIAMWGVL 592 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~-~~~~-~--------~~~~~~gd~s~-~~~~~~~~~ 592 (632) |+|++..+..+ .+++|.+|+|+|.+ ++|+|+|+++ .+.+ + ...++||||+. |.++.+.++ T Consensus 264 ~vm~~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 264 LLTNQDGFNYL--DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred EEEcHHHHHHH--HHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 99999886554 57899999999965 3799986554 3322 1 23479999998 678899999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 593 DLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 593 ~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++. +|.+|++.|+++.|+|+++++|+||+++++++ T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 99998864 68999999999999999999999999998877 No 75 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=3.2e-42 Score=248.07 Aligned_cols=357 Identities=15% Similarity=0.130 Sum_probs=217.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+...+..+ +..............+...+ .+....+........+. .....+ T Consensus 1 M~k~l~el~~----~~~~~~~e~~~~~~~~~~~e-------~~~~~~e~~~l~~~i~~--------~~~~~~-------- 53 (392) T protein:vir:10 1 MSKELRELLA----KLEGKKEEVRSLMGEDKVAE-------AEQMMEEVRSLQKKIDL--------QRSLDE-------- 53 (392) T ss_pred CcHHHHHHHH----HHHHHHHHHHHHhhHHHHHH-------HHHHHHHHHHHHHHHHH--------HHHHHH-------- Confidence 0000011011 10000000000000000000 00000000000000000 000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh---hhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK---EARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ..............+. ............+.... ............+.+.+...+..+||.++|.++ T Consensus 54 ---~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~ 122 (392) T protein:vir:10 54 ---AETEERNNGREVETRN--------VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDI 122 (392) T ss_pred ---HHHHHhhccccccccC--------ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhH Confidence 0000000000000000 00000000000000000 000011111223344555566667777777655 Q ss_pred hhHHHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHh Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l 451 (632) ...|++.+++.++++.+.. ..+++....+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+|+| T Consensus 123 -~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell 201 (392) T protein:vir:10 123 -QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL 201 (392) T ss_pred -HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHH Confidence 5678899999998888633 234455666778888888899999999999975 6899999999999999999999999 Q ss_pred hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHhhccccccce Q lcl|Aclame:pro 452 KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKISTFNADAGRLA 530 (632) Q Consensus 452 ~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~ 530 (632) .|+.+++.++|.+.++++++++++.+|++|+|++.. .+..+++++.+++ ..+...++ .++. T Consensus 202 ~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~~--~~a~ 263 (392) T protein:vir:10 202 QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAIS--PNAI 263 (392) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhhc--cCCE Confidence 999999999999999999999999999999886432 1235677888877 46666665 3578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEE-cCCC-C--------CccEEEEehhh-EEEEEecce Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEA-SNQI-P--------ADTWIFGDWSQ-IVIAMWGVL 592 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~-~~~~-~--------~~~~~~gd~s~-~~~~~~~~~ 592 (632) |+|++..+..+ .+++|.+|+|+|.+ ++|+|+|+++ .+.+ + ...++||||+. |.++.+.++ T Consensus 264 ~vm~~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 264 LLTNQDGFNYL--DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred EEEcHHHHHHH--HHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 99999886554 57899999999965 3799986554 3322 1 23479999998 678899999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 593 DLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 593 ~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++. +|.+|++.|+++.|+|+++++|+||+++++++ T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 99998864 68999999999999999999999999998877 No 76 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=3.2e-42 Score=248.07 Aligned_cols=357 Identities=15% Similarity=0.130 Sum_probs=217.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+...+..+ +..............+...+ .+....+........+. .....+ T Consensus 1 M~k~l~el~~----~~~~~~~e~~~~~~~~~~~e-------~~~~~~e~~~l~~~i~~--------~~~~~~-------- 53 (392) T protein:vir:10 1 MSKELRELLA----KLEGKKEEVRSLMGEDKVAE-------AEQMMEEVRSLQKKIDL--------QRSLDE-------- 53 (392) T ss_pred CcHHHHHHHH----HHHHHHHHHHHHhhHHHHHH-------HHHHHHHHHHHHHHHHH--------HHHHHH-------- Confidence 0000011011 10000000000000000000 00000000000000000 000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh---hhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK---EARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ..............+. ............+.... ............+.+.+...+..+||.++|.++ T Consensus 54 ---~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~ 122 (392) T protein:vir:10 54 ---AETEERNNGREVETRN--------VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDI 122 (392) T ss_pred ---HHHHHhhccccccccC--------ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhH Confidence 0000000000000000 00000000000000000 000011111223344555566667777777655 Q ss_pred hhHHHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHh Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l 451 (632) ...|++.+++.++++.+.. ..+++....+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+|+| T Consensus 123 -~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell 201 (392) T protein:vir:10 123 -QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL 201 (392) T ss_pred -HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHH Confidence 5678899999998888633 234455666778888888899999999999975 6899999999999999999999999 Q ss_pred hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHhhccccccce Q lcl|Aclame:pro 452 KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKISTFNADAGRLA 530 (632) Q Consensus 452 ~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~ 530 (632) .|+.+++.++|.+.++++++++++.+|++|+|++.. .+..+++++.+++ ..+...++ .++. T Consensus 202 ~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~~--~~a~ 263 (392) T protein:vir:10 202 QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAIS--PNAI 263 (392) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhhc--cCCE Confidence 999999999999999999999999999999886432 1235677888877 46666665 3578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEE-cCCC-C--------CccEEEEehhh-EEEEEecce Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEA-SNQI-P--------ADTWIFGDWSQ-IVIAMWGVL 592 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~-~~~~-~--------~~~~~~gd~s~-~~~~~~~~~ 592 (632) |+|++..+..+ .+++|.+|+|+|.+ ++|+|+|+++ .+.+ + ...++||||+. |.++.+.++ T Consensus 264 ~vm~~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 264 LLTNQDGFNYL--DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred EEEcHHHHHHH--HHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 99999886554 57899999999965 3799986554 3322 1 23479999998 678899999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 593 DLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 593 ~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++. +|.+|++.|+++.|+|+++++|+||+++++++ T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 99998864 68999999999999999999999999998877 No 77 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=3.2e-42 Score=248.07 Aligned_cols=357 Identities=15% Similarity=0.130 Sum_probs=217.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARD 296 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (632) ..+...+..+ +..............+...+ .+....+........+. .....+ T Consensus 1 M~k~l~el~~----~~~~~~~e~~~~~~~~~~~e-------~~~~~~e~~~l~~~i~~--------~~~~~~-------- 53 (392) T protein:vir:10 1 MSKELRELLA----KLEGKKEEVRSLMGEDKVAE-------AEQMMEEVRSLQKKIDL--------QRSLDE-------- 53 (392) T ss_pred CcHHHHHHHH----HHHHHHHHHHHHhhHHHHHH-------HHHHHHHHHHHHHHHHH--------HHHHHH-------- Confidence 0000011011 10000000000000000000 00000000000000000 000000 Q ss_pred hhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHh---hhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 297 LGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGK---EARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) ..............+. ............+.... ............+.+.+...+..+||.++|.++ T Consensus 54 ---~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~ 122 (392) T protein:vir:10 54 ---AETEERNNGREVETRN--------VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDI 122 (392) T ss_pred ---HHHHHhhccccccccC--------ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhH Confidence 0000000000000000 00000000000000000 000011111223344555566667777777655 Q ss_pred hhHHHHHHHhhhhhhhhhcc-eeeccCceeEEEEEecCCccccccccCcccccC-cccceeeeeeeeeeeeeehhhHHHh Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGA-RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRKLR 451 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre~l 451 (632) ...|++.+++.++++.+.. ..+++....+.+++.++.+.+.|++|+++++++ .++|+++++.+++++++++||+|+| T Consensus 123 -~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell 201 (392) T protein:vir:10 123 -QTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLL 201 (392) T ss_pred -HHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHH Confidence 5678899999998888633 234455666778888888899999999999975 6899999999999999999999999 Q ss_pred hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH-HHHHhhccccccce Q lcl|Aclame:pro 452 KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME-TKISTFNADAGRLA 530 (632) Q Consensus 452 ~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~ 530 (632) .|+.+++.++|.+.++++++++++.+|++|+|++.. .+..+++++.+++ ..+...++ .++. T Consensus 202 ~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~----------------~~~~~~d~i~~~~~~~l~~~~~--~~a~ 263 (392) T protein:vir:10 202 QDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK----------------QAIKSLDDIKDVLNVKLDPAIS--PNAI 263 (392) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------cCccCHHHHHHHHHHhhhhhhc--cCCE Confidence 999999999999999999999999999999886432 1235677888877 46666665 3578 Q ss_pred EEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEE-cCCC-C--------CccEEEEehhh-EEEEEecce Q lcl|Aclame:pro 531 YLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEA-SNQI-P--------ADTWIFGDWSQ-IVIAMWGVL 592 (632) Q Consensus 531 ~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~-~~~~-~--------~~~~~~gd~s~-~~~~~~~~~ 592 (632) |+|++..+..+ .+++|.+|+|+|.+ ++|+|+|+++ .+.+ + ...++||||+. |.++.+.++ T Consensus 264 ~vm~~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 264 LLTNQDGFNYL--DKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred EEEcHHHHHHH--HHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 99999886554 57899999999965 3799986554 3322 1 23479999998 678899999 Q ss_pred EEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 593 DLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 593 ~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++. +|.+|++.|+++.|+|+++++|+||+++++++ T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 99998864 68999999999999999999999999998877 No 78 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1.4e-43 Score=255.60 Aligned_cols=282 Identities=16% Similarity=0.147 Sum_probs=218.9 Q ss_pred hhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccc Q lcl|Aclame:pro 344 GFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDV 423 (632) Q Consensus 344 ~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 423 (632) ...........+++...++..++.+||+++ .+.+++.+++.++++++ +++++..+..+++|+.++.+.+.|++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~-~~~ii~~~~~~s~l~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQ-AKDYFAEAEKTSIVQQF-AQKVPMGTTGQKIPHWIGDVSAQWIGEGDMK 78 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHH-HHHHHHHHHhccchhhh-cceeeccCCceEEEEEeCCcceEEecCCccc Confidence 111111112334455555566667777765 56678888888888887 5667766777899999999999999999999 Q ss_pred ccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc- Q lcl|Aclame:pro 424 QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY- 502 (632) Q Consensus 424 ~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~- 502 (632) |+++++|+++++.+++++++++||+|+|.|+..+++++|.+.+++++++++|.++++|+|++ .+.++.......+... T Consensus 79 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~-~~~~~~~~~~~~~~~~~ 157 (320) T protein:vir:10 79 PITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSP-FPTYLAQTTKSVSLADP 157 (320) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-CCcccccccccccceec Confidence 99999999999999999999999999999999999999999999999999999999999864 3444332222111111 Q ss_pred ---cccchh--HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc------------ccccCcce Q lcl|Aclame:pro 503 ---PAGGVD--WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN------------NEVNGYRA 565 (632) Q Consensus 503 ---~~~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~------------~~l~G~pv 565 (632) +...++ .+.+.++...+...+. .++.|+||+..+..+ .+++|.+|+|+|.+ ++++|+|| T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L--~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv 233 (320) T protein:vir:10 158 GGATASDLTAYDAVAVNGLSLLVNAKK--KWTHTLLDDIVEPIL--NGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT 233 (320) T ss_pred ccccccccccHHHHHHHHHhhhhcccC--CCcEEEEcHHHHHHH--HHhhccCCceeeccccccCccccccCceeeeeee Confidence 111111 1235555666655543 467899988886555 57899999999964 36899999 Q ss_pred EEcCCCCCcc--EEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEeCcEEecccceEEEE Q lcl|Aclame:pro 566 EASNQIPADT--WIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDVDAGVRRKEAFCIAK 629 (632) Q Consensus 566 ~~~~~~~~~~--~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~ 629 (632) ++++.+|.++ ++||||+.+.++.++++++..+++. .|.+|++.||++.|+|+++.+|+||++|+ T Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~ 313 (320) T protein:vir:10 234 ILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLT 313 (320) T ss_pred EecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEE Confidence 9999999886 5799999999999999999888764 38899999999999999999999999999 Q ss_pred ecC Q lcl|Aclame:pro 630 KGA 632 (632) Q Consensus 630 ~~A 632 (632) .++ T Consensus 314 ~~~ 316 (320) T protein:vir:10 314 NVV 316 (320) T ss_pred ecc Confidence 766 No 79 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.3e-43 Score=255.74 Aligned_cols=272 Identities=16% Similarity=0.173 Sum_probs=216.9 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCccc-----ccCcccce Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDV-----QDSDFDFT 431 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~-----~~~~~~~~ 431 (632) +...+..+++.++|.++ .+.|++.+++.++++++ +++++..+..+.+|+.+..+.+.|++|++.. +.++++|+ T Consensus 1 ma~~t~~~gg~liP~~~-~~~Ii~~~~~~s~l~~l-~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~ 78 (305) T protein:vir:25 1 MADISRAEVASLIQEAY-SDTLLAAAKQGSTVLSA-FQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWA 78 (305) T ss_pred CCCccCCccceecCHHH-HHHHHHHHHhhchhhhh-cceeeccCCcEEEEEEeCCcceEEeeccccccccccccccccee Confidence 66666677777887665 57788999999999888 5677877778999999999999999999864 45678999 Q ss_pred eeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc--ccceecccc--ccccccccch Q lcl|Aclame:pro 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDP--VGLLNMTGV--PALTYPAGGV 507 (632) Q Consensus 432 ~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~--~Gil~~a~~--~~~~~~~~~~ 507 (632) ++++.+++++++++||+|++.|+..+++++|.+.+++++++++|.++++|+|++... .+++..... .....+.... T Consensus 79 ~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (305) T protein:vir:25 79 NRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVA 158 (305) T ss_pred eEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccch Confidence 999999999999999999999999999999999999999999999999999864322 222222211 1222333344 Q ss_pred hHHHHHHHHHHHHhhcccc--ccceEEeehhHHHHHHHHhhcccCCceeeccccccCcceEEcCCCCC----ccEEEEeh Q lcl|Aclame:pro 508 DWASVVDMETKISTFNADA--GRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNGYRAEASNQIPA----DTWIFGDW 581 (632) Q Consensus 508 ~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~~l~G~pv~~~~~~~~----~~~~~gd~ 581 (632) ...++.+++..+...+... ....|+|++..... +.+++|.+|+|+|++++|+|+||++++.+|. ..++|||| T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~--l~~lkd~~G~~i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~ 236 (305) T protein:vir:25 159 NESDIVGATNRAAKAVASAGWAPDTLLSSLALRYE--VANIRDANGNPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADS 236 (305) T ss_pred hhhHHHHHHHHHHHhhhhcccccceeEecHHHHHH--HHHhhccCCceeecCCcccccceEEcCccCCCCCccEEEEEec Confidence 4555555555554443322 22348888876654 4578999999999999999999999999874 36899999 Q ss_pred hhEEEEEecceEEEEeccc----------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 582 SQIVIAMWGVLDLKVDPYT----------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 582 s~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.|.++.++++++..+++. .|.+|++.+|++.|+|+++.||+||++++..- T Consensus 237 s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~ 297 (305) T protein:vir:25 237 SRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTP 297 (305) T ss_pred ceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccc Confidence 9999999999999887763 48889999999999999999999999988853 No 80 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=1.3e-41 Score=244.66 Aligned_cols=362 Identities=13% Similarity=0.093 Sum_probs=213.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 213 APAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 213 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) +..... .+.......++.......................+..+.....+..... ... . T Consensus 1 M~~~eL---~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~---------------~~~---~ 59 (395) T protein:vir:38 1 MNINQL---KDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKM---------------AQE---L 59 (395) T ss_pred CCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH---------------HHH---H Confidence 000000 0000000000000000000000000000000000000000000000000 000 0 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechh Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATE 372 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~ 372 (632) ................... .... ..............+.. ...........+.+++|.++|.+ T Consensus 60 ~~~~~~~~~~~~~~~~~~~-----~~~~---~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~gg~~vP~~ 122 (395) T protein:vir:38 60 AKSAYEDARANLNAEPVNK-----KPLP---VKDGKPDAQAMKNQFVK---------DFKNLVTSGTTGTGNAGLTIPED 122 (395) T ss_pred HHHHHHHHHhhhhhccccc-----cccc---hhhhhHHHHHHHHHHHH---------HHHHHHhhccCccCCCceecchh Confidence 0000000000000000000 0000 00000000000000100 11111122334455677777766 Q ss_pred hhhHHHHHHHhhhhhhhhhcce-eeccCceeEEEEEecC-CccccccccCcccccC-cccceeeeeeeeeeeeeehhhHH Q lcl|Aclame:pro 373 LLSEEFIDILRNKAIIGQMGAR-MLPGLVGDVDIPKKTS-GANFYWIGEDEDVQDS-DFDFTTLSFSPKTIAGAVPVTRK 449 (632) Q Consensus 373 ~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~-~~~~~~~~~~~~t~~~~~~iSre 449 (632) + .+.|++.+++.++++.++.. .++.....+.++...+ .+.+.|++|+++++++ .++|+++++++++++++++||++ T Consensus 123 ~-~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e 201 (395) T protein:vir:38 123 I-QLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNT 201 (395) T ss_pred H-hhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHH Confidence 6 46789999999999887432 2334455666665544 4678899999999976 58999999999999999999999 Q ss_pred HhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH-HHHhhcccccc Q lcl|Aclame:pro 450 LRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KISTFNADAGR 528 (632) Q Consensus 450 ~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~ 528 (632) ++.|+++++.++|.+.|+++++++++.+|++|+|++....| ..+++++.+++. .+...++ .+ T Consensus 202 ll~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~---------------~~~~~~i~~~~~~~l~~~~~--~~ 264 (395) T protein:vir:38 202 LLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKKPT---------------ISQFDNIKDLENNTLDPAIE--ST 264 (395) T ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------cccHHHHHHHHHHhhhhhhc--CC Confidence 99999999999999999999999999999999987654332 234567777765 5555554 35 Q ss_pred ceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCC------CccEEEEehhh-EEEEEecceEE Q lcl|Aclame:pro 529 LAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIP------ADTWIFGDWSQ-IVIAMWGVLDL 594 (632) Q Consensus 529 ~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~------~~~~~~gd~s~-~~~~~~~~~~~ 594 (632) +.|+|++..+.. +.+++|.+|+|+|++ .+|+|+||+++++++ ...++||||+. |.++.+.++.+ T Consensus 265 a~~v~n~~~~~~--L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i 342 (395) T protein:vir:38 265 SSFITNQSGYNI--LSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQI 342 (395) T ss_pred CEEEEcHHHHHH--HHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEE Confidence 779999888654 457899999999964 379999999987643 23489999997 77899999999 Q ss_pred EEecc--cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 595 KVDPY--TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 595 ~~~~~--~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++++ .+|.+|++.||++.|+|+++.+|+||+++++++ T Consensus 343 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (395) T protein:vir:38 343 DTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKT 382 (395) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEeec Confidence 88874 569999999999999999999999999999998 No 81 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=3.6e-43 Score=253.26 Aligned_cols=282 Identities=15% Similarity=0.134 Sum_probs=219.1 Q ss_pred hhhHHhhhhhcccc------cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCc--------c Q lcl|Aclame:pro 348 PHEVLVQRQLEKKT------AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGA--------N 413 (632) Q Consensus 348 ~~~~~~~~a~~~~~------~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~ 413 (632) .-...+.++...++ .+.++.++|.+ +.+.|++.+++.++++++ +++++..+..+++|+.+..+ . T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~-~~~~ii~~~~~~s~l~~l-~~~~~~~~~~~~ip~~~~~~~a~~v~~~~ 78 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKE-IVGPIFDKAQESSLVLRL-GENIPISYGETIIPTTVKRPEVGQVGVGT 78 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchH-HHHHHHHHHHhhchhhhh-cceeeccCCceEEEEEecCccceeecccc Confidence 00011111111111 22233455555 557889999999999998 46677777788888877554 4 Q ss_pred ccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCC--ccccccc Q lcl|Aclame:pro 414 FYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGL--ANDPVGL 491 (632) Q Consensus 414 a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~--~~~~~Gi 491 (632) +.|++|++++++++++|+++++.+++++++++||+|+|.|+..+++++|.+.+++++++++|.++++|+|+ ++.|.|+ T Consensus 79 ~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi 158 (338) T protein:vir:78 79 SNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (338) T ss_pred cccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccc Confidence 56778999999999999999999999999999999999999999999999999999999999999999885 4567787 Q ss_pred eecccccccc-----ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHH-HHHhhcccCCceeecc-------c Q lcl|Aclame:pro 492 LNMTGVPALT-----YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAA-KKAQVFDNTGERIWQN-------N 558 (632) Q Consensus 492 l~~a~~~~~~-----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~g~~~~~~-------~ 558 (632) ++.+...... .+.....++.+.++...+.... ......|+|++.....+ .+..++|.+|+|+|.+ + T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~ 237 (338) T protein:vir:78 159 DTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANT-DVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAG 237 (338) T ss_pred ccccccccccccccccccchhhHHHHHHHHHHhhhhc-cccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCc Confidence 7654443322 2223455677777776665432 34456799999887655 4457899999999864 4 Q ss_pred cccCcceEEcCCCCCc---------cEEEEehhhEEEEEecceEEEEeccc--------------ccccCcEEEEEEEEe Q lcl|Aclame:pro 559 EVNGYRAEASNQIPAD---------TWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAASDGLVLRVFQDV 615 (632) Q Consensus 559 ~l~G~pv~~~~~~~~~---------~~~~gd~s~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~r~ 615 (632) +|+|+||++++++|.+ .++||||+.|.+++++++.+.++++. .|.+|++.||++.|+ T Consensus 238 ~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 317 (338) T protein:vir:78 238 DLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF 317 (338) T ss_pred eeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Confidence 7999999999998852 38899999999999999999888763 388999999999999 Q ss_pred CcEEecccceEEEEecC Q lcl|Aclame:pro 616 DAGVRRKEAFCIAKKGA 632 (632) Q Consensus 616 ~~~v~~~~a~~~~~~~A 632 (632) |+++.||+||++|+.++ T Consensus 318 d~~v~~~~a~~~l~~~~ 334 (338) T protein:vir:78 318 GWLLGDKQAFVKFVDDE 334 (338) T ss_pred ccEeecccceEEEeccc Confidence 99999999999999998 No 82 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=8.8e-42 Score=245.67 Aligned_cols=368 Identities=11% Similarity=0.104 Sum_probs=218.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhh Q lcl|Aclame:pro 211 NPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPA 290 (632) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (632) +. -........+.......+.................++........+.....+... .. T Consensus 1 Mn-~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~---------------~~----- 59 (421) T protein:vir:13 1 MN-LFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEII---------------EE----- 59 (421) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHH---------------HH----- Confidence 00 0000000000000000000000000000000000000000000000000000000 00 Q ss_pred hhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceec Q lcl|Aclame:pro 291 IHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVA 370 (632) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~ 370 (632) ............... ... ......................+....... ....+.++. .+..+||.+|| T Consensus 60 --~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ra~--~t~~~gg~liP 127 (421) T protein:vir:13 60 --EIESVMTAIDEERKN-TNF----TGGRVIINGDSKEEKRSLQLSAMSKTIRGI---QLSEEERDI--MSSTNNGAVIP 127 (421) T ss_pred --HHHHHHHHHHHHHhh-hcc----cccccccccchhHHHHHHHHHHHHHhhhcc---chhHHHhhc--cccCCcceecc Confidence 000000000000000 000 000000000000000011111111111111 111122333 33445677777 Q ss_pred hhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCc--cccccccCcccccCcccceeeeeeeeeeeeeehhhH Q lcl|Aclame:pro 371 TELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGA--NFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTR 448 (632) Q Consensus 371 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSr 448 (632) .++. ..|++.+++.++++++ +++++..+....+++....+ .+.|++|+++++.+.++|+.+++.+++++++++||+ T Consensus 128 ~~~~-~~Ii~~~~~~~~l~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ 205 (421) T protein:vir:13 128 QEFV-NEFEKLKEGYPSLKEH-CHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDN 205 (421) T ss_pred hhhH-HHHHHHHHhhhhhhhh-ceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhH Confidence 6554 6688999999998887 56666666666666655443 467799999999999999999999999999999999 Q ss_pred HHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcccccc Q lcl|Aclame:pro 449 KLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGR 528 (632) Q Consensus 449 e~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 528 (632) |+|.|+..++.++|.+.|+++++++++..+++ .|.|+++.++ ..++++|.+++..+...+.. + T Consensus 206 ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~------~~~g~~~~~~---------~~~~d~i~~~~~~l~~~~~~--~ 268 (421) T protein:vir:13 206 SLLEDSEINFLEFVNEEFAEFAVNTENAEIVK------QAKAVLAEET---------INDYAGLVKTINSLVPNARK--R 268 (421) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhh------hhhhcccccc---------ccchHHHHHHHHHhhhhhcC--C Confidence 99999999999999999999999999988763 4666664332 34678999999999887754 5 Q ss_pred ceEEeehhHHHHHHHHhhcccCCceeecc------ccccCcceEEcCCCCCc-----cEEEEehhh-EEEEEecceEEEE Q lcl|Aclame:pro 529 LAYLTSVTQRGAAKKAQVFDNTGERIWQN------NEVNGYRAEASNQIPAD-----TWIFGDWSQ-IVIAMWGVLDLKV 596 (632) Q Consensus 529 ~~~~~~~~~~~~~~~~~~~d~~g~~~~~~------~~l~G~pv~~~~~~~~~-----~~~~gd~s~-~~~~~~~~~~~~~ 596 (632) +.|+|++..+..+ .+++|.+|+|+|.+ ++|+|+||++++++|.. .++||||+. |.+++++++++.+ T Consensus 269 a~~v~n~~~~~~l--~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~ 346 (421) T protein:vir:13 269 AIIVTNSDGRAYL--DGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQ 346 (421) T ss_pred CEEEEcHHHHHHH--HHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEe Confidence 7899988876554 57899999999975 47999999999998854 479999998 7789999999999 Q ss_pred ecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 597 DPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 597 ~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.+|.+|++.||+..|+|+++++++||+.++..- T Consensus 347 ~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 347 SKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred ecccccccCeeEEEEEeeecceeecchhhheeeecc Confidence 999999999999999999999999999977665553 No 83 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.3e-42 Score=250.16 Aligned_cols=346 Identities=14% Similarity=0.076 Sum_probs=220.3 Q ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 245 RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKA 324 (632) Q Consensus 245 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (632) +....+. . ....+.+..+........... +.....+..... ...+. .......... T Consensus 1 M~i~~~~--~-~~~~e~~~~l~~~~~~~~~~e-~~~~~~~~~~~~---~~~~~---~~~~~~e~~~-------------- 56 (377) T protein:vir:96 1 MAINLKE--L-PKYREAVAELSAKISAGATPE-EQEKLFEAAFTT---MGDEI---LAKNEEEMER-------------- 56 (377) T ss_pred CCccHHH--H-HHHHHHHHHHHHHHhhcccHH-HHHHHHHHHHHH---HHHHH---HHHHHHHHHH-------------- Confidence 1111100 0 011111111111111110000 000000000000 00000 0000000000 Q ss_pred hhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEE Q lcl|Aclame:pro 325 GFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVD 404 (632) Q Consensus 325 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (632) ..... +...................++...+|++||.++ .+.|++.+.+.++++++ +++.+.. .... T Consensus 57 --------~~~~~--~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~l~~~s~i~~~-~~v~~~~-~~~~ 123 (377) T protein:vir:96 57 --------MFDLR--DKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEET-MVQVFDDLVAEHPLLKV-INFKNTS-LRLK 123 (377) T ss_pred --------HHHhc--cCCcccCHHHHHHHHHHHhcCCCCCCceecCHHH-HHHHHHHHHhhhhhhhh-ceeEecC-CceE Confidence 00000 0000000000000011223345556677777665 55677778888888887 4555554 4578 Q ss_pred EEEecCCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 405 IPKKTSGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 405 ~~~~~~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) +++.++.+.+.|++|+++.+. ++++|+++++.+++++++++||+++|.|+.++++++|.+.++++++++++.+|++|+| T Consensus 124 i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G 203 (377) T protein:vir:96 124 ALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNG 203 (377) T ss_pred EEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccC Confidence 888888899999999998764 6899999999999999999999999999999999999999999999999999999999 Q ss_pred Cccccccceeccccccccc-----------------cccchhHHHHHHHHHHHHhhcccc---------ccceEEeehhH Q lcl|Aclame:pro 484 LANDPVGLLNMTGVPALTY-----------------PAGGVDWASVVDMETKISTFNADA---------GRLAYLTSVTQ 537 (632) Q Consensus 484 ~~~~~~Gil~~a~~~~~~~-----------------~~~~~~~~~i~~~~~~~~~~~~~~---------~~~~~~~~~~~ 537 (632) ++ +|.||++......... ....++.+.+.++++.|...++.. .++.|+|++.+ T Consensus 204 ~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t 282 (377) T protein:vir:96 204 LL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) T ss_pred CC-cceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhh Confidence 75 8999998554332211 112345577778777776665422 35679999877 Q ss_pred HHHHHH-HhhcccCCceeeccccccCcc--eEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEE Q lcl|Aclame:pro 538 RGAAKK-AQVFDNTGERIWQNNEVNGYR--AEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQD 614 (632) Q Consensus 538 ~~~~~~-~~~~d~~g~~~~~~~~l~G~p--v~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 614 (632) ...+.. ....+.+|+| .+++|+| ++.++.+|.+.++||||++|.+++++++++..+++.+|.+|++.|++..| T Consensus 283 ~~~~~~~~~~~~~~G~~----~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r 358 (377) T protein:vir:96 283 RWTLEAKFTSRNQFGEY----VTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNY 358 (377) T ss_pred HHhccccccccCCCCCc----eeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEE Confidence 533210 1223344543 3577776 66788999999999999999999999999999999999999999999999 Q ss_pred eCcEEecccceEEEEecC Q lcl|Aclame:pro 615 VDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 615 ~~~~v~~~~a~~~~~~~A 632 (632) +|++++|++||++|+++= T Consensus 359 ~dG~~~d~~a~~vl~l~~ 376 (377) T protein:vir:96 359 FYGKAKDNHTAALLTLAG 376 (377) T ss_pred EcCEEecCCcEEEEEEec Confidence 999999999999999988 No 84 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=5.5e-42 Score=246.78 Aligned_cols=373 Identities=11% Similarity=0.031 Sum_probs=217.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-h---HHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQK-G---HTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHS 293 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (632) ++...+ ............... .+...+.... . +................ ..........+. . T Consensus 1 Mk~l~e----l~~~~~e~~~~~~~~--~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~-~~l~~~~~~~e~-------~ 66 (387) T protein:vir:93 1 MPTLYE----LKQSLGMIGQQLKNK--NDELSQKATDPNIDMEDIKQLETEKAGLQQRF-NIVERQVKDIEE-------K 66 (387) T ss_pred CchHHH----HHHHHHHHHHHHHHH--HHHHHHHHhccCcCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH-------H Confidence 110000 000000000000000 0000000000 0 00000000000000000 000000000000 0 Q ss_pred hhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhh Q lcl|Aclame:pro 294 ARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATEL 373 (632) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~ 373 (632) .. ..... . ..... ..... ...........+...................+++..++.+.||++||.++ T Consensus 67 ~~------~~~~~-~-~~~~~-~~~~~---~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~ 134 (387) T protein:vir:93 67 EK------AKVKD-T-GEAYQ-SLNDH---EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTL 134 (387) T ss_pred HH------Hhhhh-c-cccCC-Ccchh---hHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhH Confidence 00 00000 0 00000 00000 00000000001100000000111112233456677777777888887765 Q ss_pred hhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe-cCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhh Q lcl|Aclame:pro 374 LSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK-TSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRK 452 (632) Q Consensus 374 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~ 452 (632) .+.|++.+++.++++.+ +++.+..+. .+|+. .+.+.+.|++|++..++++++|+++++.+++++++++||+|+|. T Consensus 135 -~~~Ii~~~~~~~~l~~~-~~v~~~~~~--~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ 210 (387) T protein:vir:93 135 -SKEIVSEPFAKNQLREK-ARLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIH 210 (387) T ss_pred -HHHHHHHHHhhchhhhh-eeeeecCCc--eEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHh Confidence 46788888888888887 445555443 34543 34567899999999999999999999999999999999999999 Q ss_pred cChhHHHHHHHHHHHHHHHHHHHHHHh-hcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceE Q lcl|Aclame:pro 453 QSSIHVENLIREDLIEGIGVALDLAML-TGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAY 531 (632) Q Consensus 453 d~~~~~~~~i~~~l~~a~a~~~~~~~~-~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 531 (632) |+.++++++|.+.++++++++++..++ .|+|+ +.|.|++..+++..++ +..+++.|+++++.+...|+. ++.| T Consensus 211 Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~p~g~l~~~~~~~v~---~~~~~d~i~~~~~~l~~~~~~--~a~~ 284 (387) T protein:vir:93 211 GSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLDHMSFYNGSVKEVE---GADMYDAIINALADLHEDYRD--NATI 284 (387) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChhhhc--CCEE Confidence 999999999999999999999877554 55555 4688988776654432 345689999999999988874 5789 Q ss_pred EeehhHHHHHHHHhhcccCCceee-ccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEE Q lcl|Aclame:pro 532 LTSVTQRGAAKKAQVFDNTGERIW-QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLR 610 (632) Q Consensus 532 ~~~~~~~~~~~~~~~~d~~g~~~~-~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (632) +|+..+...+ +.+++|.+|.+++ .+++|+|+||++++.++ +++||||+.|++. +.++.+.. +..+.++.+.|+ T Consensus 285 ~mn~~t~~~~-~~~~~d~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~-~~~~~~~~--~~~~~~~~~~~~ 358 (387) T protein:vir:93 285 YMRYADYVKI-ISVLSNGTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDT--DKDVKKGEYLFV 358 (387) T ss_pred EEechHHHHH-HHHHhcCCCcccccCCccccccceEEecCCC--ceeeeehhhhhee-hhhheeee--cccccCCceeEE Confidence 9998775443 3456777666655 35689999999998765 5899999998765 44454444 445678999999 Q ss_pred EEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 611 VFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 611 ~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.|+|+++++|+||++++++| T Consensus 359 ~~~r~d~~v~~~eA~~~l~~k~ 380 (387) T protein:vir:93 359 LTAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred EEeeeCceeechhheEEEEeec Confidence 9999999999999999999988 No 85 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=5.5e-43 Score=252.28 Aligned_cols=266 Identities=12% Similarity=0.162 Sum_probs=212.8 Q ss_pred ccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeee Q lcl|Aclame:pro 361 TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTI 440 (632) Q Consensus 361 ~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~ 440 (632) ...++|.++|+++. ..|++.+++.+++++++ ++++..+...++|+.++.+.+.|++|++++++++++|+++++.++++ T Consensus 1 ma~~gG~lvp~~~~-~~ii~~~~~~s~i~~l~-~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:16 1 MVLNKGTLFDPTLV-TDLISKVAGKSSIARLS-AQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred CcccCcceechhHH-HHHHHHHHhhhhhhhhc-ceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeE Confidence 33466788888765 56789999999999884 56677667789999999999999999999999999999999999999 Q ss_pred eeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHhhcCC----Cccccccceecccccc--c-cccccchhHH Q lcl|Aclame:pro 441 AGAVPVTRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTG----LANDPVGLLNMTGVPA--L-TYPAGGVDWA 510 (632) Q Consensus 441 ~~~~~iSre~l~---d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g----~~~~~~Gil~~a~~~~--~-~~~~~~~~~~ 510 (632) +++++||+|+|. ++..+++++|.+++++++++++|.++++|.+ ....+.|.....+... . ........++ T Consensus 79 a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:16 79 EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHH Confidence 999999999994 5568999999999999999999999999853 2223333322221111 1 1122223456 Q ss_pred HHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc------cEE Q lcl|Aclame:pro 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD------TWI 577 (632) Q Consensus 511 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~------~~~ 577 (632) ++.+++.++...+.+ ...|+||+..+..+ .+++|.+|+|+|++ ++|+|+||++++.+|.. .++ T Consensus 159 ~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l--~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~ 234 (298) T protein:vir:16 159 AIENAVELLTGVDAD--VTGIAINPSFRSAL--AKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) T ss_pred HHHHHHHHhhhcCCC--ccEEEEcHHHHHHH--HHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEE Confidence 788888888776543 45699988877544 57899999999975 57999999999999853 589 Q ss_pred EEehhhE-EEEEecceEEEEeccc--------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 578 FGDWSQI-VIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 578 ~gd~s~~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ||||+.+ .++.+.++++.++++. +|.+|++.||++.|+|+++++|+||++||.+= T Consensus 235 ~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 9999984 5788999998887652 48999999999999999999999999998777 No 86 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=5.2e-42 Score=246.92 Aligned_cols=389 Identities=10% Similarity=0.001 Sum_probs=213.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhHHHHHHHHHHhhhhHhhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQFRALVLERMNPGQP 274 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 274 (632) +........ .... ...+... .................. .+.........+................ . T Consensus 1 ~~~~~~~~~-~~~g-----~~mk~l~----el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~-~ 69 (402) T protein:vir:93 1 MRNFKNDNE-LLGG-----NEMPTLY----ELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRF-N 69 (402) T ss_pred Ccchhhhhh-cCCC-----CCChHHH----HHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHH-H Confidence 000000000 0000 0000000 000000000000000000 0000000000000000000000000000 0 Q ss_pred hhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhh Q lcl|Aclame:pro 275 GNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQ 354 (632) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (632) .........+.. .......... ........ ...........+................... T Consensus 70 ~l~~~~~~~e~~--------------~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 130 (402) T protein:vir:93 70 IVERQVQDIEEK--------------EKAKVKDKGE--AYQSLSDN---EKMVKAKAEFYRHAILPNEFEKPSMEAQRLL 130 (402) T ss_pred HHHHHHHHHHHH--------------HHhhhhhccc--cCCCCchh---HHHHHHHHHHHHHHHhhhhHHHHHHhHHHHH Confidence 000000000000 0000000000 00000000 0000000000000000000011111122344 Q ss_pred hhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCcccccCcccceee Q lcl|Aclame:pro 355 RQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQDSDFDFTTL 433 (632) Q Consensus 355 ~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~ 433 (632) +++..++..+||++||.++ ...|++.++..++++++. ++++..+. .+|+.. ..+.+.|++|++..++++++|+++ T Consensus 131 ~a~~~~t~~~GG~lIP~~~-~~~Ii~~~~~~~~l~~~~-~v~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 131 HALPTGNDSGGDKLLPKTL-SKEIVSEPFAKNQLREKA-RLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred hhhccCCCcCCccccchhH-HHHHHHhHHhhhhhhhhc-eeeecCCc--eeeeeeccCCcccccccccccccccccccee Confidence 5666777777888887765 567888888888888874 44555433 345443 456789999999999999999999 Q ss_pred eeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHH-hhcCCCccccccceeccccccccccccchhHHHH Q lcl|Aclame:pro 434 SFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAM-LTGTGLANDPVGLLNMTGVPALTYPAGGVDWASV 512 (632) Q Consensus 434 ~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~-~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i 512 (632) ++.+++++++++||+|+|.|+.+++.++|.+.|+++++++++..+ ..|+|+ +.|.|++..+++..++ +...+++| T Consensus 207 ~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~p~g~~~~~~~~~~~---~~~~~d~l 282 (402) T protein:vir:93 207 KFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAI 282 (402) T ss_pred eecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHH Confidence 999999999999999999999999999999999999999987654 455554 4788988877655543 34468899 Q ss_pred HHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceee--ccccccCcceEEcCCCCCccEEEEehhhEEEEEec Q lcl|Aclame:pro 513 VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIW--QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWG 590 (632) Q Consensus 513 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~--~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~ 590 (632) +++++.+...|+. ++.|+|+..+...+. .+++|. |+++| .+++|+|+||++++.++ +++||||+.|++... T Consensus 283 ~~~~~~l~~~y~~--na~~imn~~t~~~~~-~~~~d~-~~~~~~~~~~~llG~PV~~t~~~~--~i~~GDf~~~~~~~~- 355 (402) T protein:vir:93 283 INALADLHEDYRD--NATIYMRYADYVKII-SVLSNG-TTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINYD- 355 (402) T ss_pred HHHHhccChhhhc--CCEEEEechHHHHHH-HHHhcC-CCcccccCCccccccceEEecCCC--ceeeechhhhhhhhh- Confidence 9999999888864 578999988765433 344554 55555 45789999999999765 689999998765433 Q ss_pred ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 591 VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 591 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.+. ...+..+|++.|++..|+|++|++|+||++|+++| T Consensus 356 ~~~~~--~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~ 395 (402) T protein:vir:93 356 GTTYD--TDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 395 (402) T ss_pred hhhhh--hhhcccCCceEEEEEEEeCcEEechhheEEEEeec Confidence 33333 33344569999999999999999999999999988 No 87 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=6.9e-42 Score=246.26 Aligned_cols=374 Identities=11% Similarity=0.020 Sum_probs=214.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSAR 295 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (632) ++ .......+............. .++........+................ ...... .+... T Consensus 1 Mk----~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~-~~l~~~---~~~~e-------- 64 (387) T protein:vir:26 1 MP----TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRF-NIVERQ---VQDIE-------- 64 (387) T ss_pred Cc----hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHH-HHHHHH---HHHHH-------- Confidence 00 000111111110000000000 0000000000000000000000000000 000000 00000 Q ss_pred hhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhh Q lcl|Aclame:pro 296 DLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLS 375 (632) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~ 375 (632) .......... ..... ................+...................+++..++.++||++||.++ . T Consensus 65 ---~~~~~~~~~~--~~~~~---~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~-~ 135 (387) T protein:vir:26 65 ---EKEKAKVKDK--GEAYQ---SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTL-S 135 (387) T ss_pred ---HHHHhhhhhc--cccCC---CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhH-H Confidence 0000000000 00000 0000000000000000000000000111112223445566667777788777665 5 Q ss_pred HHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC Q lcl|Aclame:pro 376 EEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS 454 (632) Q Consensus 376 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~ 454 (632) ..|++.++..++++.+. ++.+..+. .+|+.. ..+.+.|++|++..++++++|+++++.+++++++++||+|+|.|+ T Consensus 136 ~~Ii~~~~~~~~l~~~~-~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds 212 (387) T protein:vir:26 136 KEIVSEPFAKNQLREKA-RLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS 212 (387) T ss_pred HHHHHHHHhhchhhhhc-eeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh Confidence 77888888888888874 45555443 344433 456789999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHh-hcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 455 SIHVENLIREDLIEGIGVALDLAML-TGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 455 ~~~~~~~i~~~l~~a~a~~~~~~~~-~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) .++++++|.+.|+++++++++..++ .|+|+ +.|.|++..+++..++ +..++++|.++++.+...|+. ++.|+| T Consensus 213 ~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~y~~--na~~im 286 (387) T protein:vir:26 213 DVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHEDYRD--NATIYM 286 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChhhhc--CCEEEE Confidence 9999999999999999999776554 55554 4688988776655443 345689999999999988864 578999 Q ss_pred ehhHHHHHHHHhhcccCCceee--ccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEE Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIW--QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRV 611 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~--~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 611 (632) +..+...+. .++++ .|+++| .+.+|+|+||++++.++ +++||||+.|++.. .++.+..+ .+..+|++.|++ T Consensus 287 n~~t~~~~~-~~~~~-~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~--~~~~~~~~~~~~ 359 (387) T protein:vir:26 287 RYADYVKII-SVLSN-GTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTD--KDVKKGEYLFVL 359 (387) T ss_pred echHHHHHH-HHHhc-CCCcccccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhheec--ccccCCceEEEE Confidence 887754432 34444 566666 35689999999999765 68999999976643 34444433 344579999999 Q ss_pred EEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 612 FQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 612 ~~r~~~~v~~~~a~~~~~~~A 632 (632) ..|+|+++++|+||++++++| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:26 360 TAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred EEEeCcEeechhheEEEEeec Confidence 999999999999999999998 No 88 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=6.9e-42 Score=246.26 Aligned_cols=374 Identities=11% Similarity=0.020 Sum_probs=214.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSAR 295 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (632) ++ .......+............. .++........+................ ...... .+... T Consensus 1 Mk----~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~-~~l~~~---~~~~e-------- 64 (387) T protein:vir:96 1 MP----TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRF-NIVERQ---VQDIE-------- 64 (387) T ss_pred Cc----hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHH-HHHHHH---HHHHH-------- Confidence 00 000111111110000000000 0000000000000000000000000000 000000 00000 Q ss_pred hhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhh Q lcl|Aclame:pro 296 DLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLS 375 (632) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~ 375 (632) .......... ..... ................+...................+++..++.++||++||.++ . T Consensus 65 ---~~~~~~~~~~--~~~~~---~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~-~ 135 (387) T protein:vir:96 65 ---EKEKAKVKDK--GEAYQ---SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTL-S 135 (387) T ss_pred ---HHHHhhhhhc--cccCC---CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhH-H Confidence 0000000000 00000 0000000000000000000000000111112223445566667777788777665 5 Q ss_pred HHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC Q lcl|Aclame:pro 376 EEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS 454 (632) Q Consensus 376 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~ 454 (632) ..|++.++..++++.+. ++.+..+. .+|+.. ..+.+.|++|++..++++++|+++++.+++++++++||+|+|.|+ T Consensus 136 ~~Ii~~~~~~~~l~~~~-~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds 212 (387) T protein:vir:96 136 KEIVSEPFAKNQLREKA-RLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS 212 (387) T ss_pred HHHHHHHHhhchhhhhc-eeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh Confidence 77888888888888874 45555443 344433 456789999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHh-hcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 455 SIHVENLIREDLIEGIGVALDLAML-TGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 455 ~~~~~~~i~~~l~~a~a~~~~~~~~-~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) .++++++|.+.|+++++++++..++ .|+|+ +.|.|++..+++..++ +..++++|.++++.+...|+. ++.|+| T Consensus 213 ~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~y~~--na~~im 286 (387) T protein:vir:96 213 DVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHEDYRD--NATIYM 286 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChhhhc--CCEEEE Confidence 9999999999999999999776554 55554 4688988776655443 345689999999999988864 578999 Q ss_pred ehhHHHHHHHHhhcccCCceee--ccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEE Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIW--QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRV 611 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~--~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 611 (632) +..+...+. .++++ .|+++| .+.+|+|+||++++.++ +++||||+.|++.. .++.+..+ .+..+|++.|++ T Consensus 287 n~~t~~~~~-~~~~~-~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~--~~~~~~~~~~~~ 359 (387) T protein:vir:96 287 RYADYVKII-SVLSN-GTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTD--KDVKKGEYLFVL 359 (387) T ss_pred echHHHHHH-HHHhc-CCCcccccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhheec--ccccCCceEEEE Confidence 887754432 34444 566666 35689999999999765 68999999976643 34444433 344579999999 Q ss_pred EEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 612 FQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 612 ~~r~~~~v~~~~a~~~~~~~A 632 (632) ..|+|+++++|+||++++++| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:96 360 TAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred EEEeCcEeechhheEEEEeec Confidence 999999999999999999998 No 89 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=6.9e-42 Score=246.26 Aligned_cols=374 Identities=11% Similarity=0.020 Sum_probs=214.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQ--RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSAR 295 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (632) ++ .......+............. .++........+................ ...... .+... T Consensus 1 Mk----~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~-~~l~~~---~~~~e-------- 64 (387) T protein:vir:94 1 MP----TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRF-NIVERQ---VQDIE-------- 64 (387) T ss_pred Cc----hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHH-HHHHHH---HHHHH-------- Confidence 00 000111111110000000000 0000000000000000000000000000 000000 00000 Q ss_pred hhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhh Q lcl|Aclame:pro 296 DLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLS 375 (632) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~ 375 (632) .......... ..... ................+...................+++..++.++||++||.++ . T Consensus 65 ---~~~~~~~~~~--~~~~~---~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~-~ 135 (387) T protein:vir:94 65 ---EKEKAKVKDK--GEAYQ---SLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTL-S 135 (387) T ss_pred ---HHHHhhhhhc--cccCC---CCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhH-H Confidence 0000000000 00000 0000000000000000000000000111112223445566667777788777665 5 Q ss_pred HHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC Q lcl|Aclame:pro 376 EEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS 454 (632) Q Consensus 376 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~ 454 (632) ..|++.++..++++.+. ++.+..+. .+|+.. ..+.+.|++|++..++++++|+++++.+++++++++||+|+|.|+ T Consensus 136 ~~Ii~~~~~~~~l~~~~-~~~~~~~~--~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds 212 (387) T protein:vir:94 136 KEIVSEPFAKNQLREKA-RLTNIKGL--EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS 212 (387) T ss_pred HHHHHHHHhhchhhhhc-eeeecCCc--eeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh Confidence 77888888888888874 45555443 344433 456789999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHh-hcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 455 SIHVENLIREDLIEGIGVALDLAML-TGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 455 ~~~~~~~i~~~l~~a~a~~~~~~~~-~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) .++++++|.+.|+++++++++..++ .|+|+ +.|.|++..+++..++ +..++++|.++++.+...|+. ++.|+| T Consensus 213 ~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~-g~~~g~~~~~~~~~~~---~~~~~d~i~~~~~~l~~~y~~--na~~im 286 (387) T protein:vir:94 213 DVDLVNWVENALQSGLAAKERKDALAVSPKS-GLEHMSFYNGSVKEVE---GADMYDAIINALADLHEDYRD--NATIYM 286 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc-cccceeeecccccccc---ccchHHHHHHHHhccChhhhc--CCEEEE Confidence 9999999999999999999776554 55554 4688988776655443 345689999999999988864 578999 Q ss_pred ehhHHHHHHHHhhcccCCceee--ccccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEE Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIW--QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRV 611 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~--~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 611 (632) +..+...+. .++++ .|+++| .+.+|+|+||++++.++ +++||||+.|++.. .++.+..+ .+..+|++.|++ T Consensus 287 n~~t~~~~~-~~~~~-~~~~~~~~~~~~llG~PV~~~~~~~--~~~~GDf~~~~~~~-~~~~~~~~--~~~~~~~~~~~~ 359 (387) T protein:vir:94 287 RYADYVKII-SVLSN-GTTNFFDTPAEKVFGKPVVFTDAAV--KPIVGDFNYFGINY-DGTTYDTD--KDVKKGEYLFVL 359 (387) T ss_pred echHHHHHH-HHHhc-CCCcccccCCccccccceEEecCCC--ceeeechhhhhhhh-hhhhheec--ccccCCceEEEE Confidence 887754432 34444 566666 35689999999999765 68999999976643 34444433 344579999999 Q ss_pred EEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 612 FQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 612 ~~r~~~~v~~~~a~~~~~~~A 632 (632) ..|+|+++++|+||++++++| T Consensus 360 ~~r~Dg~v~~~~A~~~l~~ka 380 (387) T protein:vir:94 360 TAWYDQQRTLDSAFRIAKAKE 380 (387) T ss_pred EEEeCcEeechhheEEEEeec Confidence 999999999999999999998 No 90 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=4e-41 Score=242.07 Aligned_cols=381 Identities=11% Similarity=0.011 Sum_probs=210.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHH Q lcl|Aclame:pro 186 PDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALV 265 (632) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 265 (632) .+... .-...... ..+. ............ ......................+.....+ T Consensus 1 ~~l~e--~i~e~~~~----------l~el-------~~~~~~~~~e~r---~~~e~~~~~~~~~~~~e~~~~~~~l~~ei 58 (400) T protein:vir:38 1 MTLDE--KLAAVKKQ----------LDEK-------RSALPAMKTELR---SLLEGEDSEENLKKAEGVRAKYDKAGKEI 58 (400) T ss_pred CChHH--HHHHHHHH----------HHHH-------HHHHHHHHHHHH---HHHHhhccchHHHHHHHHHHHHHHHHHHH Confidence 00000 00000000 0000 000000000000 00000000000000000000111111111 Q ss_pred hhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhh-HHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 266 LERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGI-QHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARG 344 (632) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 344 (632) ....+.... ................. ............. .............. ........ T Consensus 59 ~~l~e~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~--~~~~~~~~- 120 (400) T protein:vir:38 59 KDLEEKRDL--------YEAALKGNEQSSGKKPDHPEEHSYRDALNAY-------LHTRGRNTDGVNFE--KTDVGTFA- 120 (400) T ss_pred HHHHHHHHH--------HHHHHHHHhhcccccccchhhhhHHHHHHHH-------HhhHHHHHHHHHHH--HHHHHHHh- Confidence 110000000 00000000000000000 0000000000000 00000000000000 00000000 Q ss_pred hhhhhhHHhhhhhcc-cccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCcc Q lcl|Aclame:pro 345 FYMPHEVLVQRQLEK-KTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDED 422 (632) Q Consensus 345 ~~~~~~~~~~~a~~~-~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~ 422 (632) ...........++.. .+..+||++||.++ .+.|++.+++.+.++.+ +++++.+.....+|+.. ..+.+.|++|+++ T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~ 198 (400) T protein:vir:38 121 VLRAVPTDASDAVNAGVKAADAASTIPETI-SNTPQRELQTVVDLKPF-TNVFQASTQKGTYPTVANATTKMVTVAELEK 198 (400) T ss_pred hhhhhhHHHHHHHhhcccccCCcccccHHH-HHHHHHHHHhhhhhhhc-ceeEeccCcceEEEEEecCCCcccccccccc Confidence 001111111222222 23445666666655 56789999999998886 56666666666777665 4467899999999 Q ss_pred ccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccc Q lcl|Aclame:pro 423 VQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALT 501 (632) Q Consensus 423 ~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~ 501 (632) ++. +.++|+.+++.+++++++++||+|+|.|+.++++++|.+.++++++.+++.++++|.|++. + T Consensus 199 ~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~-~------------- 264 (400) T protein:vir:38 199 NPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT-A------------- 264 (400) T ss_pred ccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-c------------- Confidence 986 6799999999999999999999999999999999999999999999999999999887532 1 Q ss_pred ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc Q lcl|Aclame:pro 502 YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD 574 (632) Q Consensus 502 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~ 574 (632) .+..+++.+.++....... ..++.|+|++..+.. +.+++|.+|+|+|.+ ++|+|+||++++++|.. T Consensus 265 --~~~~~~~~~~~~~~~~~~~---~~~a~~v~~~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~ 337 (400) T protein:vir:38 265 --KTISSVDDLKHINNVDLDP---AYSRVIIASQSFYNF--LDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLG 337 (400) T ss_pred --cccccHHHHHHHHHhhhhh---hhCcEEEEcHHHHHH--HHHhhccCCCeeeecCcCCCCccccccceeEEecccccC Confidence 1233566777766644332 335789998888655 457899999999975 37999999999988753 Q ss_pred -----cEEEEehhh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 -----TWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 -----~~~~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+. |.++.+.++.+.++++..|. ..|++++|+|+++.+|+||++|++++ T Consensus 338 ~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~r~d~~~~~~~a~~~l~~~~ 398 (400) T protein:vir:38 338 AAGEAHAFLGDIKRAILFANRADFMVRWVDDQIYG---QFLQAGMRFGVSVADEKAGYFLTYTP 398 (400) T ss_pred CCCceEEEEEeccccEEEEeecceEEEEecccccc---eeEEEEEEeccEEecccceEEEEeec Confidence 389999998 67788999999998876654 47999999999999999999999988 No 91 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=6.6e-42 Score=246.34 Aligned_cols=341 Identities=12% Similarity=0.050 Sum_probs=214.9 Q ss_pred hhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 248 AQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFE 327 (632) Q Consensus 248 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (632) .++........+..+...............+........ . ..........+ .. T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~------~----------~~~~~~~~~~~-----------~~ 53 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDK------G----------EAYQSLNDNEK-----------LV 53 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc------c----------ccccccchhhh-----------HH Confidence 111111111111111111000000000000000000000 0 00000000000 00 Q ss_pred hHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE Q lcl|Aclame:pro 328 REVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK 407 (632) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (632) ...........................+++..++...||++||.++ .+.|++.++..++++.+ +++.+..+. .+|+ T Consensus 54 ~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~-~~~Ii~~l~~~s~l~~~-~~v~~~~~~--~~p~ 129 (352) T protein:vir:78 54 KAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTL-SKEIVSEPFAKNQLREK-ARLTNIKGL--EIPR 129 (352) T ss_pred HHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhH-HHHHHHHHHhhcchhhh-eeeEecCCc--eEEE Confidence 0000000000000011111112223445566667777788887665 56788888899998887 445555443 3454 Q ss_pred ec-CCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHH-HhhcCCCc Q lcl|Aclame:pro 408 KT-SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLA-MLTGTGLA 485 (632) Q Consensus 408 ~~-~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~-~~~g~g~~ 485 (632) .. ..+.+.|++|++.+++++++|+++++.+++++++++||+|+|.|+.++++++|.+.|+++++++++.. +..|+|+ T Consensus 130 ~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~- 208 (352) T protein:vir:78 130 VSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKS- 208 (352) T ss_pred EecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCC- Confidence 43 45679999999999999999999999999999999999999999999999999999999999986664 4456655 Q ss_pred cccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec--cccccCc Q lcl|Aclame:pro 486 NDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ--NNEVNGY 563 (632) Q Consensus 486 ~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~--~~~l~G~ 563 (632) +.|.|++..+++..++ +..+++.|.+++..+...|+. ++.|+|++.+...+ ..+++.+|+++|. +++|+|+ T Consensus 209 ~~~~g~l~~~~~~~~t---~~~~~d~i~~~~~~l~~~~~~--~a~~~mn~~t~~~l--~~~~~~~~~~~~~~~~~~llG~ 281 (352) T protein:vir:78 209 GLEHMSFYNGSVKEVE---GANMYDAIINALADLHEDYRD--NATIYMRYADYVKI--ISVLSNGTTNFFDTPAEKVFGK 281 (352) T ss_pred cccccceecccccccc---ccchHHHHHHHHhccChhhhc--CCEEEEehHHHHHH--HHHHhccCCcccccCCcccccc Confidence 4677888776655543 234589999999999888865 57899988775543 4556777888874 5689999 Q ss_pred ceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 564 RAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 564 pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ||++++.++ +++||||+.|++. +.++.+.. .....+|++.|+++.|+|+++++|+||++++++| T Consensus 282 PV~~~~~~~--~~~~Gdf~~~~~~-~~~~~~~~--~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a 345 (352) T protein:vir:78 282 PVVFTDAAV--KPIVGDFNYFGIN-YDGTTYDT--DKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKE 345 (352) T ss_pred ceEEecCCC--ceeEeehhhhhhh-hhhheeee--eccccCCeeEEEEEeeeCceeechhheEEEEeec Confidence 999998764 5899999998664 34444433 3445679999999999999999999999999999 No 92 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=4.2e-41 Score=241.93 Aligned_cols=377 Identities=11% Similarity=0.044 Sum_probs=209.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH Q lcl|Aclame:pro 182 GAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQF 261 (632) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 261 (632) +.+....+...... ........ ...... .... .+..++........+.. T Consensus 1 M~~~~l~el~~~l~------e~~~~i~~------------------~~~e~~------~~~~-~~~~~~~~~l~~eie~l 49 (394) T protein:vir:97 1 MFEEKIKEIKATIA------DLNNTIVT------------------KTAQVK------NALE-SDDLEAARSIKAEVEQA 49 (394) T ss_pred CcHHHHHHHHHHHH------HHHHHHHH------------------HHHHHH------Hhhc-hhhHHHHHHHHHHHHHH Confidence 00000000000000 00000000 000000 0000 00000000000001111 Q ss_pred HHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhh Q lcl|Aclame:pro 262 RALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKE 341 (632) Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 341 (632) ...+..................... .............. .......+...... .......... T Consensus 50 ~~ei~~l~~~~~~~e~~~e~~~~~~-----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~---------~~~~~~~~~~-- 112 (394) T protein:vir:97 50 KANLVEAENDLKLYESSVEVGGAEN-----IGGKEVTQEEKTYR-ESVNDFIRSKGKIV---------NDSLRFEGKD-- 112 (394) T ss_pred HHHHHHHHHHHHHHHHHhhhhcccc-----ccccccchhhHHHH-HHHHHHHHHHHHHh---------hhhhhhhhHH-- Confidence 1111000000000000000000000 00000000000000 00000000000000 0000000000 Q ss_pred hhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccC Q lcl|Aclame:pro 342 ARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGED 420 (632) Q Consensus 342 ~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~ 420 (632) ...................+..+||+++|.++ .+.|++.+++.++++.+ +++++.......+|+.. ..+.+.|++|+ T Consensus 113 ~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 190 (394) T protein:vir:97 113 EVLMPINETTPVEPQKDGIKKENAKPVSSEEI-LYTPAREVKTVVDLKPF-TTVYQAKKASGKYPVLQRATTKMVTVAEL 190 (394) T ss_pred HHHHHHHhhhhhhhhccccccccccccChHHH-HHHHHHHhhhhhhhhhh-ceeeeccCcceEEEEEecCCCccceeccc Confidence 00000011111122233345556677777665 56788899999988887 56666666666777654 44678999999 Q ss_pred ccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccc Q lcl|Aclame:pro 421 EDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPA 499 (632) Q Consensus 421 ~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~ 499 (632) +++++ +.++|+.+++.+++++++++||+|+|.|+.+++.++|.+.++++++++++.++++|.+++. T Consensus 191 ~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~------------- 257 (394) T protein:vir:97 191 EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT------------- 257 (394) T ss_pred ccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------- Confidence 99997 5799999999999999999999999999999999999999999999999999998876431 Q ss_pred ccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcC--C Q lcl|Aclame:pro 500 LTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN--Q 570 (632) Q Consensus 500 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~--~ 570 (632) ..+..+++++.+++..... ...++.|+||+..+.. +.+++|.+|+|+|.+ ++|+|+||++++ . T Consensus 258 ---~~~~~~~~~~~~~~~~~~~---~~~~a~~v~n~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~ 329 (394) T protein:vir:97 258 ---TKTVKNLDEIKALLNGGFD---PAYNVSLIVSQSFYQT--LDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV 329 (394) T ss_pred ---ccccccHHHHHHHHHhhhh---hhhCCEEEEcHHHHHH--HHHhhccCCCeeeecCcCCCCCceeccceeEEecccc Confidence 1223456777777765432 2345789999888654 567899999999975 379999999955 4 Q ss_pred CCCccEEEEehhh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 571 IPADTWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 571 ~~~~~~~~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++++||||+. |.++.+.++.+.++++..+ ...|++++|+|+++.+|+||+++++.+ T Consensus 330 ~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 330 LGANKAFIGDFKRGVLFADRKDLGLRWADNEIY---GQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) T ss_pred cCCccEEEeeccccEEEEEecceEEEEeccccc---ceeEEEEEEEccEEecccceEEEEecc Confidence 6677899999997 6788899999998876654 357999999999999999999999988 No 93 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=9.1e-41 Score=240.09 Aligned_cols=360 Identities=10% Similarity=0.049 Sum_probs=209.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) .....+......++......... +...+.....+..+.....+....... ......... . T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~-----~~~~~~~~~~e~~~~l~~ei~~~~~~~--------~~l~~~~~~-------~ 60 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLN-----AKLQDENASVDDFQKIKDDLTAAKARR--------DAINDQIKA-------L 60 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHH-----HHHHhHhhhHHHHHHHHHHHHHHHHHH--------HHHHHHHHH-------H Confidence 00000000000000000000000 000000000000000000000000000 000000000 0 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) ... ..... ........ ................+.+.... .......+++...+..+||++||.++ ... T Consensus 61 ~~~-~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~l-----r~~~~~~~~~~~~t~~~gg~~vP~~~-~~~ 128 (389) T protein:vir:10 61 EAE-KPAEP-KTEPKDDG----SKKGTDLSKKPIDAKKKAINDFI-----HSHGKVIDATSKVTSTEAGVLIPEEI-IYD 128 (389) T ss_pred HHH-HHhhh-hccccccc----cccccccchhHHHHHHHHHHHHh-----hcchhhhhhhcccccCCcceeehHHH-HHH Confidence 000 00000 00000000 00000000000000011111111 11122334555666677778777665 567 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcCh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSS 455 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~ 455 (632) |++.+++.++++.+ +++++..+....+++.. ....+.|++|+++++. +.++|+++++.+++++++++||+|+|.|+. T Consensus 129 i~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~ 207 (389) T protein:vir:10 129 PTAEVNSVVDLSTL-VTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSA 207 (389) T ss_pred HHHHHHhhhhHHhh-cceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhh Confidence 88999999999887 56666665556666654 3456789999999985 789999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHH-HHhhccccccceEEee Q lcl|Aclame:pro 456 IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETK-ISTFNADAGRLAYLTS 534 (632) Q Consensus 456 ~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~~~~~~ 534 (632) +++.++|.+.|+++++++++.+|++|.+++. +. ...+..+++.+.+++.. +... .++.|+|| T Consensus 208 ~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-~~------------~~~~~~~~d~l~~~~~~~~~~~----~~a~~~~n 270 (389) T protein:vir:10 208 VDLTALVGQSIKEKSVNTYNAMIAPVLQSFT-AK------------KTTTDTLVDSLKHILNVDLDPA----YSRALVVT 270 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-cc------------cccccccHHHHHHHHHhhhhhh----hCcEEEec Confidence 9999999999999999999999998887532 11 12234567778877653 3332 34679998 Q ss_pred hhHHHHHHHHhhcccCCceeecc-----------ccccCcceEEcCCC-CCc-----cEEEEehhh-EEEEEecceEEEE Q lcl|Aclame:pro 535 VTQRGAAKKAQVFDNTGERIWQN-----------NEVNGYRAEASNQI-PAD-----TWIFGDWSQ-IVIAMWGVLDLKV 596 (632) Q Consensus 535 ~~~~~~~~~~~~~d~~g~~~~~~-----------~~l~G~pv~~~~~~-~~~-----~~~~gd~s~-~~~~~~~~~~~~~ 596 (632) +..+. .+.+++|.+|+|+|++ ++|+|+||++.++. +.. .++||||+. |.+++++++++.+ T Consensus 271 ~~~~~--~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 348 (389) T protein:vir:10 271 QSLFN--TLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAW 348 (389) T ss_pred HHHHH--HHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEe Confidence 88865 4557899999999964 26999999876542 322 389999998 7889999999999 Q ss_pred ecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 597 DPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 597 ~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++..|.+ .++++.|+|+++++|+||+++++.+ T Consensus 349 ~~~~~~~~---~~~~~~r~d~~~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 349 EDSKIYGK---YLGAAFRFGVQKADSKAGYFVTNTD 381 (389) T ss_pred eccccccc---eEEEEEEeccEEecccceEEEEeec Confidence 98877765 5889999999999999999999887 No 94 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=8.1e-41 Score=240.38 Aligned_cols=363 Identities=10% Similarity=0.046 Sum_probs=209.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhh Q lcl|Aclame:pro 218 GANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDL 297 (632) Q Consensus 218 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (632) +....+......++.......... ...+.....+................ ..........+.. . T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~-----~~~~~~~~~ee~~~~~~~~~~~~~~~-~~l~~~i~~~e~~-------~--- 64 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNA-----KLQDENASVDDFQKIKDDLTAAKARR-DAINDQIKDLEAE-------N--- 64 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHH-----HHhhhhccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH-------H--- Confidence 111010000001100000000000 00000000000000000000000000 0000000000000 0 Q ss_pred hhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 298 GIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) .. ...........................+.+..... ........+....+..+||.++|.++ ... T Consensus 65 -----~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~----~~~~~~~~~~~~~t~~~gg~~vP~~~-~~~ 130 (394) T protein:vir:10 65 -----KA----NSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIH----SHGKVIDNAAGHVTSTEAGVLIPEEI-IYD 130 (394) T ss_pred -----Hh----hcchhhhhhhhcccccchhhhHHHHHHHHHHHHHh----ccchhhhhhhcccccccCceeccHHH-HHH Confidence 00 00000000000000000000000111111111111 11112233455566667777777665 567 Q ss_pred HHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcCh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSS 455 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~ 455 (632) |++.+++.++++++ +++++.......++... ..+.+.|++|++++++ +.++|+++++.+++++++++||+|+|.|+. T Consensus 131 ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~ 209 (394) T protein:vir:10 131 PTAEVNSVVDLSTL-VTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSA 209 (394) T ss_pred HHHHHHhhhhhhhh-ceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhh Confidence 89999999999887 45555554455555544 3467899999999997 679999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 456 IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 456 ~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) +++.++|.+.|++++++++|.++++|.|++. +.+ ..+..+++.|.+++...... ..++.|+||+ T Consensus 210 ~~l~~~i~~~la~~~~~~~~~~il~g~g~~~-~~~------------~~~~~~~d~l~~~~~~~~~~---~~~a~~vmn~ 273 (394) T protein:vir:10 210 VDLTSLVGQSINEKSVNTYNAMIAPVLQSFT-AKA------------TTTDTLVDSLKHILNVDLDP---AYSRALVVTQ 273 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccc------------ccccccHHHHHHHHHhhhhh---hccCEEEecH Confidence 9999999999999999999999999887642 111 12234567777776543322 2257899998 Q ss_pred hHHHHHHHHhhcccCCceeecc-----------ccccCcceEEcCCC--CC----ccEEEEehhh-EEEEEecceEEEEe Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQN-----------NEVNGYRAEASNQI--PA----DTWIFGDWSQ-IVIAMWGVLDLKVD 597 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~~-----------~~l~G~pv~~~~~~--~~----~~~~~gd~s~-~~~~~~~~~~~~~~ 597 (632) ..+.. +.+++|.+|+|+|++ ++|+|+||++++.. +. ..++||||+. |.++.+.++++.++ T Consensus 274 ~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~ 351 (394) T protein:vir:10 274 SLFNT--LDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWE 351 (394) T ss_pred HHHHH--HHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEe Confidence 88654 557899999999864 36999999987643 32 2389999998 67788899999988 Q ss_pred cccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 598 PYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 598 ~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++..|.+ .|+++.|+|+++++|+||+++++.+ T Consensus 352 ~~~~~~~---~~~~~~r~d~~~~~~~ai~~~~~~~ 383 (394) T protein:vir:10 352 DSKIYGR---YLGAAFRFGVKQADSNAGYFVTNTD 383 (394) T ss_pred cccccce---eEEEEEEeccEEeccccEEEEEeec Confidence 8777665 5899999999999999999999988 No 95 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=2.5e-42 Score=248.70 Aligned_cols=266 Identities=13% Similarity=0.170 Sum_probs=213.8 Q ss_pred ccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeee Q lcl|Aclame:pro 361 TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTI 440 (632) Q Consensus 361 ~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~ 440 (632) ...++|.++|+++ ...|++.+++.++++++ ++.++..+..+++|+.++.+.+.|++|++++++++++|+++++.++++ T Consensus 1 ma~~gG~lip~~~-~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:94 1 MVLNKGTLFDPEL-VTDLISKVAGKSSIARL-SAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred CeeccccccChhH-HHHHHHHHHhhchhhhh-cceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEE Confidence 3336677787765 55688889999999998 566777777789999999999999999999999999999999999999 Q ss_pred eeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHhhcCC----Cccccccceecccc-cc--ccccccchhHH Q lcl|Aclame:pro 441 AGAVPVTRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTG----LANDPVGLLNMTGV-PA--LTYPAGGVDWA 510 (632) Q Consensus 441 ~~~~~iSre~l~---d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g----~~~~~~Gil~~a~~-~~--~~~~~~~~~~~ 510 (632) +++++||+|+|. ++..+++++|.+++++++++++|.++++|.+ ....+.|+...... .+ ...+.....++ T Consensus 79 ~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:94 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHH Confidence 999999999995 4567899999999999999999999999843 22223332221111 11 11222334567 Q ss_pred HHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc------cEE Q lcl|Aclame:pro 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD------TWI 577 (632) Q Consensus 511 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~------~~~ 577 (632) ++.+++.++...+.. ...|+|++..+..+ .+++|.+|+|+|++ ++|+|+||++++.+|.+ .++ T Consensus 159 ~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l--~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~ 234 (298) T protein:vir:94 159 AIENAVELLTGVDAD--VTGIAINPSFRSAL--AKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAI 234 (298) T ss_pred HHHHHHHhhhhcCCC--ccEEEEcHHHHHHH--HHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEE Confidence 888988888876653 45799999887654 57899999999964 47999999999999853 589 Q ss_pred EEehhhE-EEEEecceEEEEeccc--------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 578 FGDWSQI-VIAMWGVLDLKVDPYT--------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 578 ~gd~s~~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ||||+.. .++.++++++.++++. +|.+|++.||++.|+|+++.+|+||+++|.+= T Consensus 235 ~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 235 IGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 9999985 5888999999887653 58999999999999999999999999998877 No 96 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.2e-41 Score=244.86 Aligned_cols=350 Identities=14% Similarity=0.111 Sum_probs=217.1 Q ss_pred hhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 247 LAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGF 326 (632) Q Consensus 247 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (632) ...+....-...++.+..+.+......... ........... ........ ........ . T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~~~--~~~~~~~~---------~------ 58 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQE-IQNKAYVEMVD----AMAADIME--QAKKEARQ---------E------ 58 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHH-HHHHHHHHHHH----HHHHHHHH--HHHHHHHH---------H------ Confidence 111111111111111111111111110000 00000000000 00000000 00000000 0 Q ss_pred hhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEE Q lcl|Aclame:pro 327 EREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIP 406 (632) Q Consensus 327 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 406 (632) ...... ...........+....+++...+.++||++||.++ .+.|++.+...++++++ +++.+... ...++ T Consensus 59 ---~~~~~~---~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~-~~~I~~~l~~~s~l~~~-~~v~~~~~-~~~i~ 129 (383) T protein:vir:78 59 ---ADAYIS---ASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTV-VDEIFEDLTTEHPFLAS-IGMRTTGL-RTKFL 129 (383) T ss_pred ---HHHHHH---hcCChhhhhHHHHHHHHHHhccCCCCCccccCHHH-HHHHHHHHHhhccceee-eeeEecCC-ceEEE Confidence 000000 00000000111112223455667778888887766 45677888888888887 45666544 46899 Q ss_pred EecCCccccccccCcccc-cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc Q lcl|Aclame:pro 407 KKTSGANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA 485 (632) Q Consensus 407 ~~~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~ 485 (632) +.++.+.+.|++|+++.+ .++++|+++++.+++++++++||+|+|.|+.++++++|.+.++++++.+++.+|++|+|+ T Consensus 130 ~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~- 208 (383) T protein:vir:78 130 KSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGN- 208 (383) T ss_pred EEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCC- Confidence 999999999999998876 468999999999999999999999999999999999999999999999999999999996 Q ss_pred cccccceecccccccc--------ccccchhHHHHHHHHHHHHhhcc------------ccccceEEeehhHHH-HHHHH Q lcl|Aclame:pro 486 NDPVGLLNMTGVPALT--------YPAGGVDWASVVDMETKISTFNA------------DAGRLAYLTSVTQRG-AAKKA 544 (632) Q Consensus 486 ~~~~Gil~~a~~~~~~--------~~~~~~~~~~i~~~~~~~~~~~~------------~~~~~~~~~~~~~~~-~~~~~ 544 (632) ++|.||++..+..... .+.+.++++++..+...+...+. ...+..|+|++.+.. +.-.. T Consensus 209 ~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~ 288 (383) T protein:vir:78 209 DKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQY 288 (383) T ss_pred CCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccch Confidence 4799998744322111 11223344444444444332111 112345778876543 22222 Q ss_pred hhcccCCceeeccccccCcc--eEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecc Q lcl|Aclame:pro 545 QVFDNTGERIWQNNEVNGYR--AEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRK 622 (632) Q Consensus 545 ~~~d~~g~~~~~~~~l~G~p--v~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~ 622 (632) ...+.+|+|+ +++|+| |+.++.+|+++++||||+.|.+++++++++.++++.+|.+|++.|++..|+|++++|+ T Consensus 289 ~~~~~~G~~~----t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~ 364 (383) T protein:vir:78 289 TSLNANGVYV----TALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDD 364 (383) T ss_pred hccCCCCcee----eecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecC Confidence 3345667665 455555 6779999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEecC Q lcl|Aclame:pro 623 EAFCIAKKGA 632 (632) Q Consensus 623 ~a~~~~~~~A 632 (632) +||++++++= T Consensus 365 ~A~~vl~~~~ 374 (383) T protein:vir:78 365 KAAAVWTLNI 374 (383) T ss_pred CeEEEEEEEe Confidence 9999976654 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=4.8e-42 Score=247.10 Aligned_cols=271 Identities=13% Similarity=0.096 Sum_probs=209.0 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSP 437 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 437 (632) +.+.++++++++|+++ .+.|++.+++.+++++++ ++++......++|+.++.+.+.|++|++++|+++++|+++++.+ T Consensus 1 Mat~tt~~g~~vP~~~-~~~ii~~~~~~s~l~~~~-~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFGTGNLKNLPRNI-ADGMVKDVVQGSTVAVLS-ARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred CceecCCCceeccHHH-HHHHHHHHHhhchhhhhc-ceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEee Confidence 3344556777777665 578899999999999985 45666667789999999999999999999999999999999999 Q ss_pred eeeeeeehhhHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccceeccc--cccc--cccccchh Q lcl|Aclame:pro 438 KTIAGAVPVTRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGLLNMTG--VPAL--TYPAGGVD 508 (632) Q Consensus 438 ~t~~~~~~iSre~l~---d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gil~~a~--~~~~--~~~~~~~~ 508 (632) ++++++++||+|+|. |+..++.++|.+.+++++++++|.++++|+|++ ..+.|+.+... ...+ ........ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchh Confidence 999999999999994 677899999999999999999999999998743 34444433221 1111 12222233 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCCc------- Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPAD------- 574 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~~------- 574 (632) .+.+.+++..+...+.......|+||+..+.. +.+++|.+|+|+|++ ++|+|+||++++.+|.+ T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~--L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~ 236 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWG--LSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDD 236 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHH--HHhhhccCCCeeecCcccCCCCceecceeeEeeccccccccccccc Confidence 45666677666666555544568998887654 467899999999975 47999999999887632 Q ss_pred ---------cEEEEehhh-EEEEEecceEEEEeccc-------ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ---------TWIFGDWSQ-IVIAMWGVLDLKVDPYT-------KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ---------~~~~gd~s~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+++|||+. +.++.+.++.+..+++. .|.+|++.||+..|+||+|.||+++++.+.+| T Consensus 237 ~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 237 EDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred chhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 257899997 55778888888777653 38999999999999999999976555555555 No 98 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=5.2e-42 Score=246.93 Aligned_cols=259 Identities=17% Similarity=0.208 Sum_probs=215.3 Q ss_pred hhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeec--cCceeEEEEEec-CCccccccccCccccc-Ccc Q lcl|Aclame:pro 353 VQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKT-SGANFYWIGEDEDVQD-SDF 428 (632) Q Consensus 353 ~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~a~~v~E~~~~~~-~~~ 428 (632) ..+++..++..+||.++|.++ .+.|++.+++.++++++ ++.++ .....+.+++.. ..+.+.|++|++++++ +.+ T Consensus 1 ~l~~~~~~t~~~gg~liP~~~-~~~Ii~~~~~~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 78 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDI-RTAINTLVRQYDSLQEY-VNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDP 78 (293) T ss_pred CceeecccccCcCceEechhH-HHHHHHHHHhhhhhhhh-ceeeeccCCcceEEEEeecCCCcceeeecCCccccccccc Confidence 556777777777888887665 56688999999999887 44444 445567777765 4567999999999997 569 Q ss_pred cceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchh Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) +|+++++.+++++++++||+|+|.|+.++++++|.+++++++++++|+.|+.|++.+. ...+..+ T Consensus 79 ~~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~---------------~~~~~~~ 143 (293) T protein:vir:48 79 KLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP---------------TKPTLTK 143 (293) T ss_pred ceeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc---------------ccccccC Confidence 9999999999999999999999999999999999999999999999999998876532 1234568 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeeccc-------cccCcceEEcCC--CCC-----c Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRAEASNQ--IPA-----D 574 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~-------~l~G~pv~~~~~--~~~-----~ 574 (632) +++|.+++.++...++ .++.|+||+..+.. +.+++|.+|+|+|+++ +|+|+||++++. +|. . T Consensus 144 ~d~i~~~~~~l~~~~~--~~a~~vmn~~~~~~--L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~ 219 (293) T protein:vir:48 144 WDDIIDLEAKVDPAIK--QTSFFLTNTSGFTA--LKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVM 219 (293) T ss_pred HHHHHHHHHhhhhhhc--CCCEEEEcHHHHHH--HHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCce Confidence 8999999999988776 35789999888654 4578999999999753 799999987543 332 2 Q ss_pred cEEEEehhh-EEEEEecceEEEEeccc--ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 TWIFGDWSQ-IVIAMWGVLDLKVDPYT--KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ~~~~gd~s~-~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .++||||+. |.++.++++++..+++. +|.+|++.||++.|+|+++++|+||+++|+++ T Consensus 220 ~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 220 PLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 280 (293) T ss_pred EEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeec Confidence 489999998 67888999999998864 68999999999999999999999999999888 No 99 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.2e-39 Score=233.93 Aligned_cols=407 Identities=11% Similarity=0.010 Sum_probs=208.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Q lcl|Aclame:pro 181 RGAEMPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQ 260 (632) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 260 (632) +..+....+.................... ........... .... ..+...+........+. T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~---~~~~~~~el~~----~~~e------------~~~~~~ei~el~~~l~~ 61 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTE---SEDKTIDEVKA----GMTE------------IKEKEDEIKEIRSNIEV 61 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH----HHHH------------HHHHHHHHHHHHHHHHH Confidence 11000000000000000000000000000 00000000000 0000 00000000000000000 Q ss_pred HHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhh-hhHHHHHHHHHHH Q lcl|Aclame:pro 261 FRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGF-EREVSLAIADASG 339 (632) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 339 (632) .+.......................... ................ ..............+.... ............. T Consensus 62 ~~~~~~~~~e~~~~~~~~~~~e~~~~~~--~~e~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (437) T protein:vir:10 62 LEQASALKVEEKRDDSDLVAPELEENSA--DNEEDDPEKLKTETKS-EAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIAD 138 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHH Confidence 0000000000000000000000000000 0000000000000000 0000000000000000000 0000000000000 Q ss_pred hhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccc Q lcl|Aclame:pro 340 KEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIG 418 (632) Q Consensus 340 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~ 418 (632) ...............++....+..++|+++|.++. ..+.. +...+.++.+ +++.+.......++... ..+.+.|++ T Consensus 139 ~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~-~~i~~-~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (437) T protein:vir:10 139 KKVTAFADYLKTGEVRDVTGIALKDGKVIIPETIL-TPEKE-VHQFPRLGSL-VRTESVTTTTGKLPIFNNSTDLLTAHT 215 (437) T ss_pred hhhhhhHHHHHhhhhhhhhhcccccccccchHHHH-HHHHH-hhhhhhhhhc-ceeEeeccCceeeEEeecccccccccc Confidence 01111111112223445666667778888877654 44544 4555566654 55556555566677664 446789999 Q ss_pred cCccccc-CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccc Q lcl|Aclame:pro 419 EDEDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGV 497 (632) Q Consensus 419 E~~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~ 497 (632) |++..++ +.++|+++++.+++++++++||+|+|.|+.+++.++|.+.++++++.+++.+|++|+|++.. T Consensus 216 e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~---------- 285 (437) T protein:vir:10 216 EYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIK---------- 285 (437) T ss_pred ccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc---------- Confidence 9999996 56899999999999999999999999999999999999999999999999999999886421 Q ss_pred ccccccccchhHHHHHHHHH-HHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcC Q lcl|Aclame:pro 498 PALTYPAGGVDWASVVDMET-KISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN 569 (632) Q Consensus 498 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~ 569 (632) ...+..+++++.+++. .+...|+ .++.|+||+.++.. +.+++|.+|+|+|.+ ++|+|+||++++ T Consensus 286 ----~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~~~~~--l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~ 357 (437) T protein:vir:10 286 ----KTTSTYLLGDLKKVLNVTLKPQDS--AAASIVMSQSAYNL--FDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVD 357 (437) T ss_pred ----ccccccchhhHHHHHHhhhhhhhh--cCCEEEEcHHHHHH--HHHhhccCCCeeeccCccCCCCcccccceeEEec Confidence 1122334566777664 6666665 35789999988654 567899999999964 369999999986 Q ss_pred CC--CC-----ccEEEEehhh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 570 QI--PA-----DTWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 570 ~~--~~-----~~~~~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++ |. ..++||||+. |.++++.++.+..++. |..+.+.+++..|+||+++||+||++|+.++ T Consensus 358 ~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~--~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 358 DKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDT--YDIWYKQLGIFLRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred ccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecc--cccccceeeEEEEEccEEecccceEEEEeec Confidence 54 42 2389999997 6678899999887664 4556678999999999999999999998665 No 100 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=3.9e-39 Score=231.18 Aligned_cols=380 Identities=12% Similarity=0.047 Sum_probs=201.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhh Q lcl|Aclame:pro 197 SQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGN 276 (632) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (632) +..... .................+.......+..............+ +........+.....+......... T Consensus 1 m~~k~~---~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e---~~~~~~~~~~~l~~~i~~l~~~i~~-- 72 (397) T protein:vir:96 1 MALKQL---ILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDE---EISTVSDSADDLEKQVKDLDEKIAE-- 72 (397) T ss_pred CcHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHH---HHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 000000 00000000000011111111111111111000000000000 0000000001101110000000000 Q ss_pred hhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhh Q lcl|Aclame:pro 277 FEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQ 356 (632) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 356 (632) ..... .............. ....... ................ ................. T Consensus 73 ------~~~~~-------~~l~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 131 (397) T protein:vir:96 73 ------LQKEK-------QDLEDELAKAADPT-DQKPKDG--EKRKMKKFKVTEEELA-----EKRSAINAFVKSKGAEK 131 (397) T ss_pred ------HHHHH-------HHHHHHHHhhhhhh-hhhhHHH--HHHHHHHHhhhhHHHH-----HHHHHHHHHHHhhhhhh Confidence 00000 00000000000000 0000000 0000000000000000 00000000011111222 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEec-CCccccccccCccccc-Ccccceeee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKT-SGANFYWIGEDEDVQD-SDFDFTTLS 434 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~-~~~~~~~~~ 434 (632) ....+..+++..+|.++. ..+++.... ..+... ++..+.+.....++... +...+.|++|++..++ +.+.|++++ T Consensus 132 ~~~~~~~~~~~~vp~~~~-~~i~~~~~~-~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~ 208 (397) T protein:vir:96 132 RDGFTSVEGGALIPQELL-QPQLEPKDI-VDLSKY-VRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEID 208 (397) T ss_pred hhcccccccccchhHHHH-HHHHHhhhh-hhHHHh-hhhccccccceeEEEEeccCCcccccccccccccccccccccee Confidence 334455566677776654 456665444 444443 33444444444444433 3466889999999996 689999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) +.++++++++++|+++|.|+.+++.++|.+.++++++++++.++++|+|.+. +. +.+++++|.+ T Consensus 209 ~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~-~~---------------~~~~~d~~~~ 272 (397) T protein:vir:96 209 YSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT-AK---------------SVVGVDGLKD 272 (397) T ss_pred ecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-cc---------------cccchHHHHH Confidence 9999999999999999999999999999999999999999999999987642 21 2356788888 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCC-C-----CccEEEEeh Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQI-P-----ADTWIFGDW 581 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~-~-----~~~~~~gd~ 581 (632) ++..+...+ .++.|+||+.++..+ .+++|.+|+|+|.+ ++|+|+||++++.. + ...++|||| T Consensus 273 ~~~~~~~~~---~~a~~v~n~~~~~~l--~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~ 347 (397) T protein:vir:96 273 LINKEIKKV---YDVKLFISASMYSEL--DKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDA 347 (397) T ss_pred HHHHhhhhh---cCcEEEEcHHHHHHH--HHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeeh Confidence 887654433 357899999886555 57899999999964 37999999986543 2 224899999 Q ss_pred hh-EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 582 SQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 582 s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +. |.++++.++.+.++++..| .+.++++.|+|+++++|+||+++++++ T Consensus 348 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~r~d~~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 348 KAFASFFDRKQVSVSWVDNNIY---GQLLAGIIRYDVKATDKKAGFYVTFTI 396 (397) T ss_pred hcceEeEeecceEEEEeccccc---ceeEEEEEEEccEEecccceEEEEeec Confidence 97 6788999999998887665 456899999999999999999999888 No 101 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=2.9e-33 Score=198.94 Aligned_cols=282 Identities=12% Similarity=0.091 Sum_probs=211.1 Q ss_pred hhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCC----cccccccc Q lcl|Aclame:pro 344 GFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG----ANFYWIGE 419 (632) Q Consensus 344 ~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~a~~v~E 419 (632) ...+.......+++ +.++.+||+++|.+ . ..+++.+.+.+++++......+..+....+++.+.. +...|.+| T Consensus 1 ~~~~~~~~~~~k~i-t~~d~~gG~L~P~~-~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~ 77 (314) T protein:vir:41 1 MDFLNKPFQITPKI-DVPDLGKGILAVQR-F-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGT 77 (314) T ss_pred CchhhhHHHhhccc-ccccCCCceeChHH-H-HHHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccC Confidence 11111111223333 34555677666644 4 468899999999999854433445666777765532 23456677 Q ss_pred CcccccCcccceeeeeeeeeeeeeehhhHHHhhcChh--HHHHHHHHHHHHHHHHHHHHHHhhcCCCc-------ccccc Q lcl|Aclame:pro 420 DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSI--HVENLIREDLIEGIGVALDLAMLTGTGLA-------NDPVG 490 (632) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~--~~~~~i~~~l~~a~a~~~~~~~~~g~g~~-------~~~~G 490 (632) ..+.++++++|+++.+.++++...+.||+++|.|+.. +++++|...|++++++.++..+++|+|+. +.|.| T Consensus 78 ~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G 157 (314) T protein:vir:41 78 KVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDG 157 (314) T ss_pred CccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchh Confidence 7778889999999999999999999999999999865 89999999999999999999999999853 46889 Q ss_pred ceecccccccc--ccccchhHHHHHHHHHHHHhhccc-cccceEEeehhHHHHHHHHhhcccCCceeecc-------ccc Q lcl|Aclame:pro 491 LLNMTGVPALT--YPAGGVDWASVVDMETKISTFNAD-AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEV 560 (632) Q Consensus 491 il~~a~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l 560 (632) ++..++..... .++...+.+.+.+++.+++..|+. ..+..|+|+..+...+ .++.+..++++|++ .++ T Consensus 158 ~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~--r~~l~~~~~~l~~~~~~~~~~~~l 235 (314) T protein:vir:41 158 WMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGY--RKQLLVRETGLGDSALIGATGLQY 235 (314) T ss_pred hhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHH--HHHHhccCCcccchhhhCCCCcee Confidence 99876644332 334456778889999999999964 4578899988776544 46678888888753 468 Q ss_pred cCcceEEcCCCC-----CccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEE--EEecC Q lcl|Aclame:pro 561 NGYRAEASNQIP-----ADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCI--AKKGA 632 (632) Q Consensus 561 ~G~pv~~~~~~~-----~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~--~~~~A 632 (632) +|+||+.++.+| ++.++||||+.+.++.+..+++. ++.+..++++.|.+..|+|+.+.+.+|.|+ ++.++ T Consensus 236 ~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~--~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 236 DGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIE--PKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSS 312 (314) T ss_pred cceeeEecccccccCCCCceEEEechhheEEEeeceeEEe--ecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccC Confidence 999999998874 57899999999987777666654 455667899999999999999988865444 44444 No 102 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=3.7e-33 Score=198.39 Aligned_cols=285 Identities=13% Similarity=0.047 Sum_probs=207.8 Q ss_pred HHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCC- Q lcl|Aclame:pro 333 AIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG- 411 (632) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 411 (632) .+- -... +... -....+++. .++..||+ +.|+... .+++.+.+.+++++......++......+...+.. T Consensus 1 ~~~---~~~~--~~~~-~~~~~k~~t-~~d~~Gg~-l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~ 71 (315) T protein:vir:41 1 MLT---IEDI--RGGK-PFEIVPKID-VPDLGRGV-LSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVL 71 (315) T ss_pred Ccc---cchh--hcCC-hhhhhhhcC-CcCCCCce-echHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCc Confidence 000 0000 0001 111123333 34444555 4555554 47788888899998854444444444444443211 Q ss_pred ---ccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCc- Q lcl|Aclame:pro 412 ---ANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLA- 485 (632) Q Consensus 412 ---~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~- 485 (632) ....|.+|....++++++|+++.+.++++...+.||+++|.|+. .+++++|...+++++++.++.++++|+++. T Consensus 72 ~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~ 151 (315) T protein:vir:41 72 DVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSS 151 (315) T ss_pred ccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCc Confidence 23457788888889999999999999999999999999999985 489999999999999999999999999865 Q ss_pred ----cccccceeccccccc----cccccchhHHHHHHHHHHHHhhcccc-ccceEEeehhHHHHHHHHhhcccCCceeec Q lcl|Aclame:pro 486 ----NDPVGLLNMTGVPAL----TYPAGGVDWASVVDMETKISTFNADA-GRLAYLTSVTQRGAAKKAQVFDNTGERIWQ 556 (632) Q Consensus 486 ----~~~~Gil~~a~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~ 556 (632) +.|.|++..+..... .......+.+.+.+++++++..|+.. .++.|+|+..+...+ .++++.+|+|+|+ T Consensus 152 ~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~--rklk~~~g~~lw~ 229 (315) T protein:vir:41 152 DPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAY--RDALKGRETGLGD 229 (315) T ss_pred CccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHH--HHHhccCCCcccc Confidence 456799987654322 22334456788999999999999754 478899999877544 5789999999996 Q ss_pred c-------ccccCcceEEcCCCC-----CccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccc Q lcl|Aclame:pro 557 N-------NEVNGYRAEASNQIP-----ADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEA 624 (632) Q Consensus 557 ~-------~~l~G~pv~~~~~~~-----~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a 624 (632) + .+|+|+||+..+.|| ++.++||||+.|.++.+..+++..+.+ ..++.+.|....|+|+.+.+.++ T Consensus 230 ~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~--a~~~~~~~~~~~r~d~~~~~~~~ 307 (315) T protein:vir:41 230 QALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYD--AEMRLTKYVASLRTDNHYEDEEG 307 (315) T ss_pred chhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeec--CCCCceEEEEEEEeceeEEeccc Confidence 4 579999999999885 567999999999999998888776554 45677889999999998776654 Q ss_pred --eEEEEe Q lcl|Aclame:pro 625 --FCIAKK 630 (632) Q Consensus 625 --~~~~~~ 630 (632) +..+|+ T Consensus 308 ~a~~~~~v 315 (315) T protein:vir:41 308 AVSATITV 315 (315) T ss_pred eeEeeeeC Confidence 666777 No 103 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=100.00 E-value=1e-31 Score=190.49 Aligned_cols=545 Identities=10% Similarity=0.002 Sum_probs=283.7 Q ss_pred hhhhhccCCcEEEeeCCCCCceEEEE-Ee-e----ee--cccceEEEEEeC----CChhHHHHHHHHhc--CCcceeeee Q lcl|Aclame:pro 63 IRMGRLKNGAPLLDSHSLREQIGVVE-EV-W----LD--DDRRLRARVRFS----RSAKAEELWQDVLD--GIRRHISIG 128 (632) Q Consensus 63 ~~~~~~~~~~~~l~~H~~~~~iG~~~-~~-~----~e--~~~gl~~~~~~~----~~~~~~~~~~~v~~--G~~~~~SiG 128 (632) +.-..-+..-..+|+- ||-|. .+ . |. ++. -.+.+.+. +--.|-.+|..++. |.+..+-.| T Consensus 1 ~~a~~~~~aei~iy~~-----Ig~w~vta~~~~~~L~~l~~~-~~I~i~INSpGG~V~~G~aIyn~lk~~~~~v~~~i~G 74 (652) T protein:vir:79 1 MQAGHQSDADIYIYDE-----IGFWGVTAKQFISDLNALGDI-THINLHINSPGGDVFEGIAIFNALKTHGASITVYVDG 74 (652) T ss_pred CCCCCCCCceEEEEee-----cccccCCHHHHHHHHHhcCCC-ceEEEEEeCCCCChhHHHHHHHHHhhcCCCeEEEEee Confidence 1111111111233322 22221 00 0 11 121 14566664 34788999999974 344443334 Q ss_pred EEEeeccccc-CCCCeeEEEEEEeeeeccCccccccccccee-eeee----------------ccchh-----hhhhhhh Q lcl|Aclame:pro 129 YIIHEMVLES-SGDQGDTYRVMDWEPYEISLISVPADPTVGV-GRSI----------------DIGNI-----TIRGAEM 185 (632) Q Consensus 129 ~~~~~~~~~~-~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~v-~~~~----------------~~~~~-----~~~~~~~ 185 (632) +--.--.+.- .++......-..+-++..|-...+ ++.- +... ..... ..+..+. T Consensus 75 ~AAS~ASvIa~ag~~~~m~~~a~lMIH~p~~~~~G---~a~dl~~~a~~L~~~~~~i~~~Ya~ktG~~~e~i~~~m~~et 151 (652) T protein:vir:79 75 VAASMASVIAMVGNPVIMPENTFMMIHKPFGFTGG---DAEDMRTYADLLDKVEAVLLPAYAQKTGKTTDEIAAMLADET 151 (652) T ss_pred hhhhHHHHHHhcCCeEEcCCCceEEEEcccccccc---CHHHHHHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHhhhc Confidence 3322222221 122110000000111222211111 1100 0000 00000 0000000 Q ss_pred hhhhhh-------------------------hhhhhhhhhhh-------------hhhhhhhhhhhhh------hhhhhh Q lcl|Aclame:pro 186 PDKDKQ-------------------------TQTAGSQQTET-------------RGAETGAKNPAPA------ASGANE 221 (632) Q Consensus 186 ~~~~~~-------------------------~~~~~~~~~~~-------------~~~~~~~~~~~~~------~~~~~~ 221 (632) ...-.+ ...+..+.... ...+......... ....-. T Consensus 152 wlta~EA~e~Gf~D~i~~~~~~~a~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~air 231 (652) T protein:vir:79 152 WMSGAECLAQGFADQVTPAVKAMACIQSKRTEEFKKMPDSIRNMITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIR 231 (652) T ss_pred CCCHHHHHhcCCcccccchhhhhhhhhhhhhhhhhhhHHHHHHHhcccccccccccccccccccccccccCCcCchhHHH Confidence 000000 00000000000 0000000000000 000001 Q ss_pred hhhhhhhhhhhhhhhhhhhhhh--hhhh-hhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhh Q lcl|Aclame:pro 222 NDILSRERTRISEITAIGQQFS--QRSL-AQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLG 298 (632) Q Consensus 222 ~~~~~~~~~r~~~~~~~~~~~~--~~~~-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (632) .+..+...+|...+......++ ..++ .+...+.+...++.+..+++.+............. ...........+.. T Consensus 232 Aq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~ar~~il~~l~~~~~p~~~~~~~--~~~~~~g~~~~d~~ 309 (652) T protein:vir:79 232 AQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQAREKLLNEMGRESTPSNKNTPA--HIYAGNGNFVGDGI 309 (652) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHHHHHHHHHHHHhhcCCCCCCcce--eEeeccchhhHHHH Confidence 1122334556666666665554 2333 34455667788888888888875433221100000 00000000111111 Q ss_pred hHHHHHhhhhhhhhhhhhh-hhhhhhhhhhhHHHHHHHHHHHhhhh-hhhhhhhHHhhhhhcccccccccceechhhhhH Q lcl|Aclame:pro 299 IQHKELQQYSLMRAINAAA-TGDWSKAGFEREVSLAIADASGKEAR-GFYMPHEVLVQRQLEKKTAGKGGELVATELLSE 376 (632) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~ 376 (632) . .... .+... ............+.+..+.++.+.+. ........+..+++. +++ +..+.|......+ T Consensus 310 ~-~aL~--------~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~-hsT-sDFp~IL~~~~nk 378 (652) T protein:vir:79 310 R-QALM--------ARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFT-HST-SDFGNILLDVANK 378 (652) T ss_pred H-HHHH--------hhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHhh-cCc-chHHHHHHHHHHH Confidence 0 1111 11100 01111112233333344444433332 333344556666663 222 3345677788888 Q ss_pred HHHHHHhhhhhhhhhcceee-ccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh Q lcl|Aclame:pro 377 EFIDILRNKAIIGQMGARML-PGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS 455 (632) Q Consensus 377 ~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~ 455 (632) .+++.+...+..++.|++.. -.+|+.....+.++.+.+..|.|+|||++++++++.+++.+.|||+++.||||+|+||| T Consensus 379 ~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDD 458 (652) T protein:vir:79 379 AILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDD 458 (652) T ss_pred HHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccc Confidence 99999888876666666554 46788889999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc--cc-cee-ccccccccccccchhHHHHHHHHHHHHhhcccc----c Q lcl|Aclame:pro 456 IHVENLIREDLIEGIGVALDLAMLTGTGLANDP--VG-LLN-MTGVPALTYPAGGVDWASVVDMETKISTFNADA----G 527 (632) Q Consensus 456 ~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~--~G-il~-~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~----~ 527 (632) +++++.|+..|++++++++++.+|.-.-.++.. +| .|| |++|+|+..+ ++++.+.+..++.+|..|.... . T Consensus 459 L~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~-aa~~~~~l~~ar~aM~~Qk~g~~~l~i 537 (652) T protein:vir:79 459 LNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLES-AAMDVASLDKARQLMRVQKEGERHLNI 537 (652) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeeccccccccccc-ccCCHHHHHHHHHHHHHhccCCccccc Confidence 999999999999999999999998655444322 44 577 8999998655 5789999999999998886432 2 Q ss_pred cceEEeehhHHHHHHHHhhcccC--Ccee--eccccccCc-ceEEcCCCCCc---cEEEEehhh---EEEEEecceEE-E Q lcl|Aclame:pro 528 RLAYLTSVTQRGAAKKAQVFDNT--GERI--WQNNEVNGY-RAEASNQIPAD---TWIFGDWSQ---IVIAMWGVLDL-K 595 (632) Q Consensus 528 ~~~~~~~~~~~~~~~~~~~~d~~--g~~~--~~~~~l~G~-pv~~~~~~~~~---~~~~gd~s~---~~~~~~~~~~~-~ 595 (632) .+.|+++|..........+.... +... ...+++.|+ .|++++.+... .||+++... +.+++..|.+. . T Consensus 538 ~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G~~~P~ 617 (652) T protein:vir:79 538 RPAFVLVPTAMESVANQVIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNGVDTPY 617 (652) T ss_pred cccEEEecchhHHHHHHHhccCCCcccccccccccccccccccccccccCCCCcccEEEecCCCCCeEEEEEecCCCCCe Confidence 35577777766554444442221 1111 123556664 77778877543 366776554 44444333321 2 Q ss_pred EecccccccCcEEEEEEEEeCcEEecccceEEEEe Q lcl|Aclame:pro 596 VDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) Q Consensus 596 ~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~ 630 (632) ......|..+++.|+++++||++++|..++++.+- T Consensus 618 ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 618 IDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred eeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 23345799999999999999999999999999988 No 104 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.95 E-value=4.7e-30 Score=181.36 Aligned_cols=289 Identities=13% Similarity=0.095 Sum_probs=213.3 Q ss_pred HHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCc Q lcl|Aclame:pro 333 AIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGA 412 (632) Q Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (632) ...+.+... ..........+.++.++|+.+++++... +++.+.+.+.+++. +++++.......++..+..+ T Consensus 1 ~~~k~~~~~-------l~~~~~~~~~~~~~~~~g~~v~~~~~~~-l~~~i~e~s~~l~~-i~v~~v~~~~~~i~~~~~~~ 71 (321) T protein:vir:31 1 MASRTINND-------LSRITEKNALTVDDLDAGGTLPDPLWDE-FWTDMIEETPLLDA-IRTETVGAKKTRIPTLNIGE 71 (321) T ss_pred CchHHHHHH-------HHHHHHhccccccccCCcceeCHHHHHH-HHHHHHHhhhhhhh-ceeeeccCcceeeeeeccCC Confidence 001111110 0112223344445667788888887654 55666677777776 66777777777777776666 Q ss_pred ccccccc-C-cccccCcccceeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc Q lcl|Aclame:pro 413 NFYWIGE-D-EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLANDP 488 (632) Q Consensus 413 ~a~~v~E-~-~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~ 488 (632) ...|+++ + +..+.++++++++++..+++...+.||+++|.|+. .+++++|...++++++..++..+++|++.+..+ T Consensus 72 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~ 151 (321) T protein:vir:31 72 RHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS 151 (321) T ss_pred cccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc Confidence 6677763 3 34556789999999999999999999999999875 589999999999999999999999999876654 Q ss_pred -----ccceecccc--ccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec----- Q lcl|Aclame:pro 489 -----VGLLNMTGV--PALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ----- 556 (632) Q Consensus 489 -----~Gil~~a~~--~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~----- 556 (632) .|++..+.. ..+..++..++++.+.+++..++..|++.....|+|+........ ..+++..+ ++|. T Consensus 152 ~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~-~~l~~~~~-~~~~~~l~~ 229 (321) T protein:vir:31 152 FENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYH-YTLTDRDT-PLGDNVIMG 229 (321) T ss_pred ccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHH-HHHhcCCC-ccccchhhc Confidence 688765432 223345566788999999999999999888889999988765433 24455544 5554 Q ss_pred --cccccCcceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccc---cCcEEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 557 --NNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAA---SDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 557 --~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) +.+|+|+||++++.+|++.++|+||+.+.++.+.++++.+..+..+. .+.+.+....++|+.|.+++|++.++-- T Consensus 230 ~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 230 EADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGL 309 (321) T ss_pred cccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecC Confidence 24699999999999999999999999999989888888776654432 3444455666899999999999998854 Q ss_pred C Q lcl|Aclame:pro 632 A 632 (632) Q Consensus 632 A 632 (632) - T Consensus 310 ~ 310 (321) T protein:vir:31 310 G 310 (321) T ss_pred C Confidence 4 No 105 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=99.95 E-value=2.8e-29 Score=177.14 Aligned_cols=579 Identities=12% Similarity=0.012 Sum_probs=270.0 Q ss_pred CCCccccccchhccccceeEEEEEEEeecccCCCcEEEEEEecCcceecCCCeEEEEecchhh--hhhhccCCc-EEEee Q lcl|Aclame:pro 1 MPQPTKKTTVLRTIEGRELQRELRVLSDSIDQEARTVELAASSEYPVPRWFGREILDHSPGAI--RMGRLKNGA-PLLDS 77 (632) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~l~~ 77 (632) ||-.....+ ...+. +.-.++.. ..+. +.|.++..+ + ++| .+++..| +|..+.+.. .-++= T Consensus 14 ~p~~~~~~~--~~~~~---~~w~~i~~--~~~~--~~ei~iy~~--I---g~w---gita~~f~~~L~~~~d~~~I~v~I 76 (693) T protein:vir:95 14 LPMAAALTE--ANAPH---ESWYSIKA--AGRG--VAEVLLYDE--I---GVW---GITALQFARDLKAMGDLTKINLHI 76 (693) T ss_pred hcccccccC--CCCCC---Ccceeeee--cCCC--eeEEEEeec--c---ccc---ccCHHHHHHHHHhcCCCceeEEEE Confidence 332111111 01111 11122322 1222 344444322 1 111 1333333 232222111 11111 Q ss_pred CCCCCceEEEEEeeeecccceEEEEEeCCChhHHHHHHHHhc--CCcceeeeeEEEeecccc-cCCCCeeEEEEEEeeee Q lcl|Aclame:pro 78 HSLREQIGVVEEVWLDDDRRLRARVRFSRSAKAEELWQDVLD--GIRRHISIGYIIHEMVLE-SSGDQGDTYRVMDWEPY 154 (632) Q Consensus 78 H~~~~~iG~~~~~~~e~~~gl~~~~~~~~~~~~~~~~~~v~~--G~~~~~SiG~~~~~~~~~-~~~~~~~~~~~~~~~l~ 154 (632) |+ |=|. --.|-.+|..++. |.+.-.=.|+-..--.+. ..++......--.+-++ T Consensus 77 NS---pGGd--------------------V~~G~aIyn~Lk~~~~~Vtv~vdGlAASaASvIamagd~i~m~~~a~~MIH 133 (693) T protein:vir:95 77 HS---PGGD--------------------VFEGTAIYNLLRNHPASVDVYIDGLAASMASVIAMAGDTIYMPENAMMMVH 133 (693) T ss_pred EC---CCCc--------------------hhhHHHHHHHHhhcCCCeEEEEeehhhhHHHHHHhcCCeEEecCCCeEEEE Confidence 11 1111 2334445544443 222222122111100111 01110000000000011 Q ss_pred ccCccccc---------------------cccc-ceeeee-----------------------eccchhhhhhhhhhhhh Q lcl|Aclame:pro 155 EISLISVP---------------------ADPT-VGVGRS-----------------------IDIGNITIRGAEMPDKD 189 (632) Q Consensus 155 EvS~v~~p---------------------a~~~-a~v~~~-----------------------~~~~~~~~~~~~~~~~~ 189 (632) ..+-+..+ .|.. ++.... ....... ......... T Consensus 134 ~p~~~~~Gna~dl~~~a~~L~~~~~~i~~~Y~~ktG~~~e~i~~~m~~etwlta~EAve~Gf~Dei~e~~-~~~a~~~~~ 212 (693) T protein:vir:95 134 KPWGIQGGDADDMRRYAELLDKVEDTLVMAYANKTGKSADDIKALLKEETWMNGREAVAAGFADQLTEPL-QAAAHLSSK 212 (693) T ss_pred ccccccccCHHHHHHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHhhhcCCCHHHHHhccchhhhhhhh-HHHHhhHHH Confidence 11111111 0000 000000 0000000 000000000 Q ss_pred hhhhhhhhhhh---------hh-----hhhhhhhhhhhhh---hhhhh----hhhhhhhhhhhhhhhhhhhhhhh--hhh Q lcl|Aclame:pro 190 KQTQTAGSQQT---------ET-----RGAETGAKNPAPA---ASGAN----ENDILSRERTRISEITAIGQQFS--QRS 246 (632) Q Consensus 190 ~~~~~~~~~~~---------~~-----~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~r~~~~~~~~~~~~--~~~ 246 (632) ........+.. .. ...........+. ..... .....+....|...+......+. ..+ T Consensus 213 ~~~~~~~~p~~l~~~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~ 292 (693) T protein:vir:95 213 RMQEFAHMPEALKTLLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARILAEESGRRSAITAAFGAFSTGHAE 292 (693) T ss_pred HHHHhhchHHHHHHHHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHH Confidence 00000000000 00 0000000000000 00000 00111122223333333332222 122 Q ss_pred -hhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 247 -LAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAG 325 (632) Q Consensus 247 -~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (632) ..+...+.+-..++.+..+++............... ............+........+.. .... ...... T Consensus 293 l~a~~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~-~~~~~~~g~~~~d~~~~al~~R~g------~~~~--~~~n~~ 363 (693) T protein:vir:95 293 LLATCLNDMNITVDQAREKLLAAIGADTQPAAALSAG-AHIHAGNGNLVGDSVRASVLARIG------RGER--QADNAY 363 (693) T ss_pred HHHHHHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcC-ccccCCchhHHHHHHHHHHHHhcC------cccc--cCCccc Confidence 223334556777778877777765432221111000 000000000000110000000000 0000 001112 Q ss_pred hhhHHHHHHHHHHH-hhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceee-ccCceeE Q lcl|Aclame:pro 326 FEREVSLAIADASG-KEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARML-PGLVGDV 403 (632) Q Consensus 326 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~ 403 (632) ....+.+.++.++. ++..........+..+++. ++ ++..+.|...+..+.+++.+...+..++.|++.. -.+|+.. T Consensus 364 ~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~-ht-TSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~ 441 (693) T protein:vir:95 364 NGMTLRELARASLVDRGIGVASLNAPQMVGLAFT-HT-SSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPA 441 (693) T ss_pred cCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHh-cC-cchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCccccc Confidence 22333333333333 3333333444556666663 22 2334567778888889998888876666666544 4678888 Q ss_pred EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 404 DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 404 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) +..+.+..+.+..|.|+|||+++++++...++.+.|||+++.||||+|+|||+++++.|+..|++++++++++.+|.-.. T Consensus 442 ~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~ 521 (693) T protein:vir:95 442 RRVGLGEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLT 521 (693) T ss_pred ceeecCCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999885443 Q ss_pred Ccc-cccc-ceecccccccc-ccccchhHHHHHHHHHHHHhhcc---------ccccceEEeehhHHHHHHHHhhcccC- Q lcl|Aclame:pro 484 LAN-DPVG-LLNMTGVPALT-YPAGGVDWASVVDMETKISTFNA---------DAGRLAYLTSVTQRGAAKKAQVFDNT- 550 (632) Q Consensus 484 ~~~-~~~G-il~~a~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~d~~- 550 (632) .++ ..+| .|||++|+|+. +++++++.+.+..++.+|..+.. ....+.|+++|..........+.... T Consensus 522 ~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~ 601 (693) T protein:vir:95 522 GNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESV 601 (693) T ss_pred cCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccc Confidence 322 2233 69999999964 45678999999999999877642 12245677777665554444442211 Q ss_pred -Ccee--eccccccCc-ceEEcCCCCC---ccEEE-Eehhh--EEEEEecceEE-EEecccccccCcEEEEEEEEeCcEE Q lcl|Aclame:pro 551 -GERI--WQNNEVNGY-RAEASNQIPA---DTWIF-GDWSQ--IVIAMWGVLDL-KVDPYTKAASDGLVLRVFQDVDAGV 619 (632) Q Consensus 551 -g~~~--~~~~~l~G~-pv~~~~~~~~---~~~~~-gd~s~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~r~~~~v 619 (632) |... ...+++.|+ .|++++.+.. ..||+ +|+.. +.+++..|.+. .......|..+++.|+++++||+++ T Consensus 602 ~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~ 681 (693) T protein:vir:95 602 PGADVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAP 681 (693) T ss_pred cccccccccccchhccccccccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCce Confidence 1111 113567675 6777777743 34444 56553 44444444322 2333457999999999999999999 Q ss_pred ecccceEEEEec Q lcl|Aclame:pro 620 RRKEAFCIAKKG 631 (632) Q Consensus 620 ~~~~a~~~~~~~ 631 (632) +|..++++-.-| T Consensus 682 iD~Rg~~kn~GA 693 (693) T protein:vir:95 682 LDFRGLQKSNGA 693 (693) T ss_pred eeccccccCCCC Confidence 999999987766 No 106 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.94 E-value=6.4e-28 Score=169.69 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=200.8 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +..+.++.+.++.|+++.+.+++.+.....+..+.... ..+ .+..+++|+.+..+.+.|++||+.++.++++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 33333455567888888888888887777666654321 122 234688999888889999999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) +.+++++..+.+|++++.++..++.+.+.+.+++++++.+|..++....... ....+..+++.|.+ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~d 146 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVSK 146 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHHH Confidence 9999999999999999999999999999999999999999999986432211 11123457889999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhcccC-----Cceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNT-----GERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-----g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) ++..+...+.. ...++||+.....+....+.+.. |..... .++++|+||++++++|.+++++.+...+.+ T Consensus 147 a~~~l~~~~~~--~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:98 147 ALDIFNDEDDA--ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHHHHhccCCC--ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 99998876543 45788999887777655433321 111111 257999999999999999999999998888 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+.++++..+. +..++...+++..|+++++.+|+++++++++| T Consensus 225 ~~~~~~~ve~~r--~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:98 225 MLKRNTMVETDR--DITKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EecCCceeeecc--ccccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 888888876654 44567789999999999999999999999999 No 107 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.94 E-value=6.4e-28 Score=169.69 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=200.8 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +..+.++.+.++.|+++.+.+++.+.....+..+.... ..+ .+..+++|+.+..+.+.|++||+.++.++++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 33333455567888888888888887777666654321 122 234688999888889999999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) +.+++++..+.+|++++.++..++.+.+.+.+++++++.+|..++....... ....+..+++.|.+ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~--------------~~~~~~~t~d~i~d 146 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST--------------QTVEATATVDGVSK 146 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------cccccccCHHHHHH Confidence 9999999999999999999999999999999999999999999986432211 11123457889999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhcccC-----Cceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNT-----GERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-----g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) ++..+...+.. ...++||+.....+....+.+.. |..... .++++|+||++++++|.+++++.+...+.+ T Consensus 147 a~~~l~~~~~~--~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~ 224 (272) T protein:vir:30 147 ALDIFNDEDDA--ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRI 224 (272) T ss_pred HHHHHhccCCC--ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEE Confidence 99998876543 45788999887777655433321 111111 257999999999999999999999998888 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+.++++..+. +..++...+++..|+++++.+|+++++++++| T Consensus 225 ~~~~~~~ve~~r--~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~ 268 (272) T protein:vir:30 225 MLKRNTMVETDR--DITKAINQIVANKHYGVYLYKAEKAVKITLKD 268 (272) T ss_pred EecCCceeeecc--ccccceeEEEEEEEEEEEEEcCCceEEEEecc Confidence 888888876654 44567789999999999999999999999999 No 108 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.82 E-value=1.4e-21 Score=134.91 Aligned_cols=258 Identities=14% Similarity=0.053 Sum_probs=194.6 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +....+.-..++.|+++.+.+.+.+.....+..+...- ..+ ....+++|+....+.+.++.||..++.++++.+..+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 23334455667888888888888877766655554322 122 233688898876678889999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..++...+.+.++.++++.+|..++....++. .......++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999999998888899999999999999999999886543321 112234567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----c-cCCceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----D-NTGERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d-~~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) ++.++..... .....++|+.....+...... + ..|..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~ 225 (274) T protein:vir:93 148 AIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhhccC--CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCCeEEE Confidence 9999887643 334567777777666543211 1 11223222 357999999999999999999888888887 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +....+.++..++. ......+++..++++++++|++++++++++ T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~ 269 (274) T protein:vir:93 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred EecCCcccccccch--hhcccEEEEEEEEEEEEEcCCceEEEeeCc Confidence 77777776554433 445678999999999999999999999999 No 109 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.80 E-value=3.2e-22 Score=138.41 Aligned_cols=384 Identities=13% Similarity=0.042 Sum_probs=195.1 Q ss_pred eeEEEeecccccCCCCeeEEEEEEeeeeccCcccccccccceeeeeeccchhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 127 IGYIIHEMVLESSGDQGDTYRVMDWEPYEISLISVPADPTVGVGRSIDIGNITIRGAEMPDKDKQTQTAGSQQTETRGAE 206 (632) Q Consensus 127 iG~~~~~~~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (632) .|- -....++...+..+.+++.|+|+|++|||.+|.+..+++.+........ .+....+ T Consensus 1 ~~n------~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~---~~~e~~e------------ 59 (410) T protein:vir:83 1 MGN------ATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECR---GRMEQIK------------ 59 (410) T ss_pred CCC------cccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhcccccccc---Ccccchh------------ Confidence 110 0111122111223334456999999999999999876554221100000 0000000 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHH Q lcl|Aclame:pro 207 TGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLP 286 (632) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (632) +..+..++ .....++.+.......... ....+.... T Consensus 60 ------------------------------------n~~e~~~~---~~~~~~E~Rs~~~~i~~~~-~~~r~~p~~---- 95 (410) T protein:vir:83 60 ------------------------------------NQMEQAQE---VNRIAFETRSKGQAVDAAI-SAMRGSPVG---- 95 (410) T ss_pred ------------------------------------hhhHHHHH---HHHHHHHHHHHHHHHHhhh-ccCcCCCCC---- Confidence 00000000 0000000000000000000 000000000 Q ss_pred HHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHh--hhhhccccccc Q lcl|Aclame:pro 287 GKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLV--QRQLEKKTAGK 364 (632) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~a~~~~~~~~ 364 (632) ...+.+.+ .++.+...............++ .++....++.+ T Consensus 96 ----------~~veyRSa---------------------------GE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd 138 (410) T protein:vir:83 96 ----------TEVEYRSA---------------------------GEYMLDMWNSAQGNASAADRLEVYARAADHQKTGD 138 (410) T ss_pred ----------CCcccccH---------------------------HHHHHHHhccCCchHHHHHHHHHHHHhhccCcccc Confidence 00000000 0000000000000000001111 12333333334 Q ss_pred ccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccc-------cccccCcccccCcccceeeeeee Q lcl|Aclame:pro 365 GGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANF-------YWIGEDEDVQDSDFDFTTLSFSP 437 (632) Q Consensus 365 ~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-------~~v~E~~~~~~~~~~~~~~~~~~ 437 (632) ....+++++..+ .++++.+...+..+..+ .|....++.++..+..+.. +.-.||++.+.+++.|+..+..+ T Consensus 139 ~~~~i~~~~v~d-~i~li~q~r~i~slf~t-LP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~i 216 (410) T protein:vir:83 139 LQGVIPDPIVGP-VIDFIDSARPLVSTLGT-LPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNA 216 (410) T ss_pred cccccchhHhhh-HHHHHhhccchhhhhhh-CCCCCCeeEEeeecccccccccccccccccccccccccceeeeecccee Confidence 445677776544 45666565555554443 6766667777777665532 33569999999999999999999 Q ss_pred eeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHH---hhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 438 KTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAM---LTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 438 ~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~---~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) +|||++..+|||.|+.+.+++.+...+.|+.++++.-+.++ |...-+. .. +..+. .+..-...|.+ T Consensus 217 kTyGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-----~~---a~~~~---Tad~~~~~i~d 285 (410) T protein:vir:83 217 KTLGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-----AV---GYGNA---TADNVASAIWQ 285 (410) T ss_pred ehhcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hh---hhhhc---cHHHHHHHHHH Confidence 99999999999999999999999999999999998888765 3222111 10 11111 01111122334 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhcc-------cCC---ceee--ccccccCcceEEcCCCCCccEEEEehh Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVFD-------NTG---ERIW--QNNEVNGYRAEASNQIPADTWIFGDWS 582 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-------~~g---~~~~--~~~~l~G~pv~~~~~~~~~~~~~gd~s 582 (632) +..++..+.++.......+++........ ..++ ..| .++. ..|.+++.||++.+..++++++|.|.+ T Consensus 286 a~~~v~da~~~~~~~~i~vS~DVl~~~~~-~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~ 364 (410) T protein:vir:83 286 AAGAVYTAVKGMGRLVIAIAPDVLGDFGP-LFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA 364 (410) T ss_pred HHHHHhhhhccceeeeEEechhhhhhccc-eeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccc Confidence 44444443232222233444444221111 1122 223 0111 125678899999999999999999999 Q ss_pred hEEEEEecc--eEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 583 QIVIAMWGV--LDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 583 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .+..++.++ +++...+.+...+ .|. .||.+++..+++++.|.-. T Consensus 365 Ai~~~eS~~gp~qL~d~~i~nLt~---~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 365 AIECFEQRVGTLQVVEPSVFGLQV---AYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred eeeeeecCCceeEeeCCchhhhhh---hhe--eeeeeccccccceeeeccC Confidence 999988875 6666555444333 455 6779999999999999888 No 110 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=99.78 E-value=6.5e-21 Score=131.26 Aligned_cols=270 Identities=11% Similarity=-0.030 Sum_probs=176.2 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~ 436 (632) +..++ + ....+.+.+...+.+.+...+...+.+++.++.++...++...+..|.+..+ .|+++.++++....++. T Consensus 1 m~it~--~-~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~--~Ge~~~~~l~~~~~~i~ 75 (302) T protein:vir:10 1 MLINK--Q-SLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRW--IGAKVVKNLKAYKYVVE 75 (302) T ss_pred CcccH--H-HHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCcccc--ccceeeccccccceeEE Confidence 11111 0 0111223345667777777776666677888888888888899888887655 38899999999999999 Q ss_pred eeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCC---cccccc-ceeccccccccc---------- Q lcl|Aclame:pro 437 PKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGL---ANDPVG-LLNMTGVPALTY---------- 502 (632) Q Consensus 437 ~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~---~~~~~G-il~~a~~~~~~~---------- 502 (632) .++|++.+.||||+|+||+++++..+.+.|++++++.+++.++.-... .+..+| .|++++|.+... T Consensus 76 ~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~ 155 (302) T protein:vir:10 76 NEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPL 155 (302) T ss_pred eecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhh Confidence 999999999999999999999999999999999999999998764332 223344 688888754322 Q ss_pred --cccchhHHHHHHHHHHHHhhccccc-----cceEEeehhHHHHHHHHhhcccCCceeeccccccC-cceEEcCCCCCc Q lcl|Aclame:pro 503 --PAGGVDWASVVDMETKISTFNADAG-----RLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNG-YRAEASNQIPAD 574 (632) Q Consensus 503 --~~~~~~~~~i~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~~l~G-~pv~~~~~~~~~ 574 (632) +...++.+.+.+++.+|..+....+ .+.++++|..........+.+... .--..+++.| ..+++++.+.++ T Consensus 156 ~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~-~~g~~Np~~g~~~~vv~p~L~s~ 234 (302) T protein:vir:10 156 SNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKL-ADNTPNPYVGTAELVVDGRIESD 234 (302) T ss_pred hhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcccc-CCCCcceeccceEEEEeeccCCC Confidence 2235667788888888877644332 345667765555544444444221 1122355656 477888887654 Q ss_pred c--EEEEehhhEEEEEecceE-EEEecccccccCcEEEEEEEEeCcEEecccceE------EEEecC Q lcl|Aclame:pro 575 T--WIFGDWSQIVIAMWGVLD-LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFC------IAKKGA 632 (632) Q Consensus 575 ~--~~~gd~s~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~------~~~~~A 632 (632) + +++.|++.+......+.+ ........|..+.+.++.+.++|+.-+-.-++. .-+-.| T Consensus 235 ~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~ 301 (302) T protein:vir:10 235 TAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTG 301 (302) T ss_pred CceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccC Confidence 3 345577766444333322 233345678889999999888886222222221 111111 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.77 E-value=6.3e-20 Score=125.85 Aligned_cols=258 Identities=15% Similarity=0.073 Sum_probs=188.7 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +....+....++.|+++...+.+.+.....+..+...- ..+ ....+++|+....+.+..+.|+..++..+++.+..+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 22222344567888888888888776555554442221 111 234688888876677788999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++.++..++....... .......++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT-------------LTVEADITKLDGLQT 147 (274) T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------------CCcCcccccHHHHHH Confidence 9999999999999999888888899999999999999999998876442211 112334567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhh----ccc-CCceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQV----FDN-TGERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~-~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) +...+..... .....+||+.....+..... .+. .|..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 148 AIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGAVKL 225 (274) T ss_pred HHHHhcccCC--CceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCCCCcceEEEEeCcceee Confidence 9999876543 34456777777666654321 111 1222222 357899999999999999988877777777 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +...++.++... +.......+++.+++|+++++|+++|++++++ T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:96 226 ITKRDFFLEKDR--DASRKSTALYSDKHYVAYLYDESKVVKITKGA 269 (274) T ss_pred eecCCccccccc--chhhcccEEEEeeEEEEEEEcCccEEEEEcCc Confidence 777766655433 34456678999999999999999999999999 No 112 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.77 E-value=4.7e-20 Score=126.54 Aligned_cols=257 Identities=12% Similarity=0.041 Sum_probs=184.0 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +..+.+.-..+|.|+++.+.+.+.+.....+..+...- ..+ ....+++|.....+.+.++.||.+++.+.++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 22233444567888888888877776666666654321 122 244688998877777889999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..++.++..+.++++....+..++...+.++++.++++.+|..++....... ......++++.|.+ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~--------------~~~~~~~~~d~i~~ 146 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS--------------QTVSTKANVDGVQA 146 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------ccccccccHHHHHH Confidence 9999999999999999988888999999999999999999998875442211 11234567889999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----ccCCceeec---cccccCcceEEcCCCCCccEEEEe----hhh Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----DNTGERIWQ---NNEVNGYRAEASNQIPADTWIFGD----WSQ 583 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d~~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd----~s~ 583 (632) ++..|...... ....++|+.....+...... +..|..+.. -++++|++|++++.+|.++.++.. ... T Consensus 147 A~~~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA 224 (272) T protein:vir:36 147 ALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPA 224 (272) T ss_pred HHHHhhhcCCC--ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccc Confidence 99999877653 34566777766555432211 122222222 257999999999999988754322 223 Q ss_pred EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 584 IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 584 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.++...+++++..+ +.......+++..++++++.+|+++|+++++= T Consensus 225 ~~~~~~~~~~vE~~R--~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 225 LKLVLKRGVQVETDR--DIVTKTTVITADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred eeeeecCCccccccc--chhhcCcEEEEEEEEEEEEEcCccEEEEeecC Confidence 334445566655443 33445568999999999999999999999999 No 113 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.76 E-value=1.3e-19 Score=124.16 Aligned_cols=265 Identities=10% Similarity=0.040 Sum_probs=186.8 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTL 433 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 433 (632) +. ...+..+.++.|+++.+.+.+.+.+...+.++...- ..+ ....+++|+....+.+.++.|+..++..+++.++. T Consensus 1 Ma-~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (278) T protein:vir:80 1 MA-DLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESV 79 (278) T ss_pred CC-CcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccccccccee Confidence 22 122333566788888888888877766665553322 122 23467888887667788899999999999999999 Q ss_pred eeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHH Q lcl|Aclame:pro 434 SFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVV 513 (632) Q Consensus 434 ~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~ 513 (632) +..+++++..+.++++....+..++...+.+.++.++++.++..++........ ...+ ..........++.+. T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~-----~~~~--~~t~~~~~~~~~~~~ 152 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL-----EVKG--AINIGLIDKIENTFT 152 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----cccc--ccccchhhhHHHHHH Confidence 999999999999999999988889999999999999999999988765422110 0111 111112223457788 Q ss_pred HHHHHHHhhccccccceEEeehhHHHHHHHHhhcc-----cCCceeec---cccccCcceEEcCCCCCccEEEEehhhEE Q lcl|Aclame:pro 514 DMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD-----NTGERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIV 585 (632) Q Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-----~~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~ 585 (632) ++..++...+.+.. ...++++.....+......+ ..|..+.. -++++|++|++++.+|.++.++.....+. T Consensus 153 da~~~l~~~~~~~~-~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~ 231 (278) T protein:vir:80 153 DAPDAIEDESITTT-GVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLADGNALAVKAGALK 231 (278) T ss_pred HHHHhhcccCCCcc-cEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCCcceEEEEecccee Confidence 88888776655432 23567777665554332211 11233322 35799999999999999988877777776 Q ss_pred EEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 586 IAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 586 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++....+.++.++ +.......+++.+++++++++|+++|++++.| T Consensus 232 ~~~~~~~~vE~~R--d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a 276 (278) T protein:vir:80 232 TFLKRNLLAESGR--DMDHKLTKFNADQHYAVALVDETKAVKVVPVA 276 (278) T ss_pred eeecCCccccccc--chhhccceeeeeeEEEEEEEcCcceEEEeecc Confidence 6666676665444 33456678999999999999999999999999 No 114 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.76 E-value=1.3e-19 Score=124.14 Aligned_cols=258 Identities=12% Similarity=0.043 Sum_probs=193.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +....+.-..+|.|+++.+.+.+.+.....+.++...- +.+ ....+++|.....+.+..+.||..++..+++.++.. T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 22223344567888999988888887777776664321 222 345788888877778889999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++.++..++.-...+ ....+...++++.|.+ T Consensus 81 a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~-------------~~~~~~~~~t~d~i~~ 147 (276) T protein:vir:10 81 AKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGT-------------KLTVSADIGTLAGLEA 147 (276) T ss_pred EEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHH Confidence 999999999999999999988899999999999999999999887533211 1112334578899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhh----ccc-CCceeecc---ccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQV----FDN-TGERIWQN---NEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~-~g~~~~~~---~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) ++..|..... .....+||+.....++...+ .+. .|..+..+ +.++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~ 225 (276) T protein:vir:10 148 AIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKL 225 (276) T ss_pred HHHHhccccC--cccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCCCCcceEEEEeccceee Confidence 9999976543 23456788877766654321 111 22222222 57899999999999999988777777777 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +...++.++.++.. ......+++...+++++.+|..+++++++. T Consensus 226 ~~~~~~~vE~dRd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (276) T protein:vir:10 226 ITKRDFFLETDRDP--STKTTALYSDKHYVAYLYDESKAVKVTKGA 269 (276) T ss_pred eecCCceeecccch--hhcccEEEEeeEEEEEEEcCcceEEEecCC Confidence 77777776655544 345668999999999999999999999988 No 115 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.75 E-value=2.8e-19 Score=122.29 Aligned_cols=258 Identities=14% Similarity=0.056 Sum_probs=190.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +....+.-..+|.|+++...+.+.+.....+..+...- ..+ ....+++|.....+.+..+.|+..++..+++.+..+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 22233345567888888888887776665555543321 122 244688888776667888999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++.+|..++.-..++. ....+.+++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:97 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------ccccccccCHHHHHH Confidence 9999999999999999998878889999999999999999998875443211 112234567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----c-cCCceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----D-NTGERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d-~~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) +..++..... .....+||+.....+....+. + ..|..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:97 148 AIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9999877543 334566777776666543211 1 12333333 257899999999999999988888888877 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +...++.++.+++. ......+++..++++++++|.+++++++++ T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:97 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 77777776655544 345568999999999999999999999988 No 116 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.75 E-value=2.8e-19 Score=122.29 Aligned_cols=258 Identities=14% Similarity=0.056 Sum_probs=190.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +....+.-..+|.|+++...+.+.+.....+..+...- ..+ ....+++|.....+.+..+.|+..++..+++.+..+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccceeE Confidence 22233345567888888888887776665555543321 122 244688888776667888999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++.+|..++.-..++. ....+.+++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:94 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------ccccccccCHHHHHH Confidence 9999999999999999998878889999999999999999998875443211 112234567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----c-cCCceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----D-NTGERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d-~~g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) +..++..... .....+||+.....+....+. + ..|..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:94 148 AIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCCCcceEEEEeCcceEe Confidence 9999877543 334566777776666543211 1 12333333 257899999999999999988888888877 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +...++.++.+++. ......+++..++++++++|.+++++++++ T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:94 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEEEEEEEEEEEcCCceEEEecCc Confidence 77777776655544 345568999999999999999999999988 No 117 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.75 E-value=1.5e-19 Score=123.83 Aligned_cols=259 Identities=15% Similarity=0.063 Sum_probs=190.1 Q ss_pred hhhcccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccce Q lcl|Aclame:pro 355 RQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFT 431 (632) Q Consensus 355 ~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 431 (632) -++. ..+.-..+|.|+++...+.+.+.....+..+...- ..+ ....+++|.....+.+..+.|+..++..+++.+ T Consensus 1 ~~~~--~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALE--NMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CCCc--ccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccc Confidence 1222 22344457888998888888887776666653221 122 244688888877678888999999999999999 Q ss_pred eeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHH Q lcl|Aclame:pro 432 TLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWAS 511 (632) Q Consensus 432 ~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~ 511 (632) .....+++++..+.++++....+..+....+.+.++.++++.++..++.-.++.. .......++++. T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~-------------~~~~~~~~~~d~ 145 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT-------------LKVEADITKLAG 145 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHH Confidence 9999999999999999999888777889999999999999999998875443221 112334578999 Q ss_pred HHHHHHHHHhhccccccceEEeehhHHHHHHHHhh-----cccCCceeecc---ccccCcceEEcCCCCCccEEEEehhh Q lcl|Aclame:pro 512 VVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV-----FDNTGERIWQN---NEVNGYRAEASNQIPADTWIFGDWSQ 583 (632) Q Consensus 512 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~d~~g~~~~~~---~~l~G~pv~~~~~~~~~~~~~gd~s~ 583 (632) |.+++..+..... .....++|+.....++.... .+..|..+... ++++|++|++++.+|.++.++..... T Consensus 146 i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA 223 (275) T protein:vir:96 146 LQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGA 223 (275) T ss_pred HHHHHHHhccccC--CccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCcceEEEEeccc Confidence 9999999976543 23456677777666544321 12223333332 57899999999999999887766666 Q ss_pred EEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 584 IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 584 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.++....+.++.+++ .......+++..++++++++|+++|++++.+ T Consensus 224 ~~~~~~~~~~vE~~Rd--~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 270 (275) T protein:vir:96 224 VKLITKRDFFLETERH--ASHKSTALFSDKHYVAYLYDESKVVKITKSA 270 (275) T ss_pred eeeeecCCcccccccc--hhhcCcEEEEeEEEEEEEEcCccEEEEEecc Confidence 6666766666655443 3445668999999999999999999999999 No 118 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.70 E-value=4.1e-18 Score=115.89 Aligned_cols=258 Identities=14% Similarity=0.055 Sum_probs=185.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) .....+.-..+|.|+++...+.+.+.....+..+...- ..+ ....+++|.....+.+..+.|+..++..+++.+... T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccceee Confidence 22223344567888888888777766555544443221 112 344688888776667888999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++.+..+.++++....+..+....+.+.++.++++.++..++....++. ......+++++.|.+ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~-------------~~~~~~a~~~d~i~d 147 (274) T protein:vir:12 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------------LTVNADITKLNGLQS 147 (274) T ss_pred EEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999988888777888999999999999999998875443211 112334578999999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHh----hcccC-Cceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQ----VFDNT-GERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~-g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) ++..|..... .....+||+.....+.... +.+.. |..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:12 148 AIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccc--cccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCCcceEEEEeccceee Confidence 9999876543 2345667777766655432 12222 223332 256899999999999998876665666666 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +....+.++.+++. ......+++..++++++++|.++|+++++. T Consensus 226 ~~~~~~~vE~~Rd~--~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (274) T protein:vir:12 226 ILKRDFFLEVARDA--STKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCceeccccch--hhcccEEEeeeEEEEEEEcCCceEEEEcCC Confidence 66677776655544 345568999999999999999999999888 No 119 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.69 E-value=7.1e-18 Score=114.62 Aligned_cols=258 Identities=14% Similarity=0.057 Sum_probs=185.7 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcce--eecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--MLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) .....+.-..+|.|+++...+.+.+.....+..+... ...+ ....+++|.....+.+..+.|+..++..+++.+... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 2222234456788888888887777655555454221 1122 244788888776677788999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++..+..++.-..++. ......+++++.|.+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~~~~~~d~i~~ 147 (274) T protein:vir:95 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEADITKLTGLQT 147 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999999888777899999999999999999998875443221 112234567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----ccC-Cceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----DNT-GERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d~~-g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) +...+..... .....+||+.....+....+. +.. |..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:95 148 AIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 9999876543 234566777776666543221 122 222332 256899999999999998876655666666 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +....+.++.++ +.......+++..++++++++|+++|++++.+ T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:95 226 ITKRDFFLETDR--DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCccccccc--ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 666666655444 34456678999999999999999999999988 No 120 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.69 E-value=7.1e-18 Score=114.62 Aligned_cols=258 Identities=14% Similarity=0.057 Sum_probs=185.7 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcce--eecc-CceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--MLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) .....+.-..+|.|+++...+.+.+.....+..+... ...+ ....+++|.....+.+..+.|+..++..+++.+... T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~~ 80 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccceeE Confidence 2222234456788888888887777655555454221 1122 244788888776677788999999999999999999 Q ss_pred eeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHH Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVD 514 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~ 514 (632) ..+++++..+.++++....+..+....+.+.++.++++..+..++.-..++. ......+++++.|.+ T Consensus 81 ~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~~~~~~d~i~~ 147 (274) T protein:vir:96 81 AKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEADITKLTGLQT 147 (274) T ss_pred EEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccccccCHHHHHH Confidence 9999999999999999888777899999999999999999998875443221 112234567899999 Q ss_pred HHHHHHhhccccccceEEeehhHHHHHHHHhhc----ccC-Cceeec---cccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 515 METKISTFNADAGRLAYLTSVTQRGAAKKAQVF----DNT-GERIWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d~~-g~~~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) +...+..... .....+||+.....+....+. +.. |..+.. -++++|++|++++.+|.++.++.....+.+ T Consensus 148 A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 148 AIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccc--cccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCCCceEEEEeccceee Confidence 9999876543 234566777776666543221 122 222332 256899999999999998876655666666 Q ss_pred EEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +....+.++.++ +.......+++..++++++++|+++|++++.+ T Consensus 226 ~~~~~~~vE~~R--d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~ 269 (274) T protein:vir:96 226 ITKRDFFLETDR--DPSTKTTALYSDKHYVAYLYDESKAVKITKGS 269 (274) T ss_pred eecCCccccccc--ccccccCEEEEeEEEEEEEEcCCcEEEEEcCC Confidence 666666655444 34456678999999999999999999999988 No 121 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.56 E-value=2e-16 Score=106.63 Aligned_cols=293 Identities=12% Similarity=0.143 Sum_probs=185.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAII 388 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~ 388 (632) +-....... ..+.+..... .-.-++.+.+....+.+.+ +.....+++.+.+.+.+ T Consensus 1 ~~~~~~~~~-----------------------~~~~~~~~~~-~p~l~m~alTLaea~~l~~-d~~~~~VIE~l~~~s~i 55 (330) T protein:vir:94 1 MVRICTPPL-----------------------RGRWRTLTHQ-FPELKMPTVTLAESAKLSQ-DHLVSGLIETIVEVNPL 55 (330) T ss_pred CceecCCcc-----------------------ccceeehhcc-ccccchhhhhhhHHhhcCc-hhhHHHHHHhhhccchH Confidence 000000000 0000000000 0011122223334444443 44566778888877766 Q ss_pred hhhcceeeccCceeEEEEEecCCccccccccCcccccCc-ccceeeeeeeeeeeeeehhhHHH--hhcChhHHHHHHHHH Q lcl|Aclame:pro 389 GQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKL--RKQSSIHVENLIRED 465 (632) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~--l~d~~~~~~~~i~~~ 465 (632) +.... .....+..+.+.+....+.+.|+..++.++.+. .+|.+++..++.+++.+.|.+.+ +.++..+...+..+. T Consensus 56 L~~lp-f~~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~ 134 (330) T protein:vir:94 56 YEMMP-FTEIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVAS 134 (330) T ss_pred Hhhcc-cccccCCcceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHH Confidence 65432 222334456778888889999999999988765 57899999999999999999999 566677888899999 Q ss_pred HHHHHHHHHHHHHhhcCCCccccccceecccccc-cc--ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHH Q lcl|Aclame:pro 466 LIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPA-LT--YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK 542 (632) Q Consensus 466 l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~-~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (632) +.+++++..+..+++|+.+++.+.|++......+ +. ..++.++.+++..++..+... ++....++|+......++ T Consensus 135 ~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~--~g~~~~~l~n~a~~r~I~ 212 (330) T protein:vir:94 135 KAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDK--DGQVDYLMSSFAMRRKYF 212 (330) T ss_pred HHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCC--CCCCcEEEechhHHHHHH Confidence 9999999999999999988778889865333222 22 245778999998888887542 223445666665544333 Q ss_pred HHhhcccCCceeec------c----ccccCcceEEcCCCCCc----------cEEEEehh-----hEEEEEe----cceE Q lcl|Aclame:pro 543 KAQVFDNTGERIWQ------N----NEVNGYRAEASNQIPAD----------TWIFGDWS-----QIVIAMW----GVLD 593 (632) Q Consensus 543 ~~~~~d~~g~~~~~------~----~~l~G~pv~~~~~~~~~----------~~~~gd~s-----~~~~~~~----~~~~ 593 (632) .+....|++-.. . ..+.|.|++.++.+|.+ ++|+..|. ....+.. .|+. T Consensus 213 --a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~gls 290 (330) T protein:vir:94 213 --SLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLR 290 (330) T ss_pred --HHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcce Confidence 333333332211 1 24778999999888753 35555443 2334442 2444 Q ss_pred EEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 594 LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 594 ~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) + ++....-+++...+++..|++.++.+|+|+.+|+--. T Consensus 291 V-r~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 291 V-QNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred e-eeCCCccccceeeEEEEEeeeeEEechhheeeecccc Confidence 4 2332223556788999999999999999999999888 No 122 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.53 E-value=3e-15 Score=100.21 Aligned_cols=340 Identities=9% Similarity=0.022 Sum_probs=176.4 Q ss_pred HHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHH Q lcl|Aclame:pro 258 VDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADA 337 (632) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (632) .+.....+.+. .-.+...+.....+......+...+.. .+....... ..++.+. T Consensus 1 ~~~~~~~~~~~------------~~~~~~~~e~k~lr~~me~~et~~e~~-----~~~~~~~~~---------e~el~E~ 54 (393) T protein:vir:79 1 MENWLKQLKES------------GFTETQVQEQKSLRTRMERGETLAEAD-----ANKLALNEE---------ETQILES 54 (393) T ss_pred CchHHHHHHhc------------cCchhHHHHHHHHHHHhhhhhhhhhhh-----hhhhhcchh---------HHHHHHH Confidence 11111111000 000000000000000000000000000 000000000 0011111 Q ss_pred HHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccccc Q lcl|Aclame:pro 338 SGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWI 417 (632) Q Consensus 338 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 417 (632) +..-.. ...+......+.+. ++.+ +..++|..+...+++..++..+..++.....-..++...++-.+ .-.+..| T Consensus 55 f~Kmm~-G~~p~~eV~~~e~m--tt~~-a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g-~~Ra~~I 129 (393) T protein:vir:79 55 FAKMME-GETPTNEVNLREFM--ATPS-AQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIG-IMRAYDV 129 (393) T ss_pred HHHHhc-CCCchhheehhhhh--cCCC-cceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchh-eeeeccc Confidence 111111 11111111112222 2223 34555677788888888777766666433322233333333332 4457889 Q ss_pred ccCcccccCc---ccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc--cccce Q lcl|Aclame:pro 418 GEDEDVQDSD---FDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLAND--PVGLL 492 (632) Q Consensus 418 ~E~~~~~~~~---~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~--~~Gil 492 (632) +||+|+|..+ .++++++++.+++|..+.+|.|++.|+..++..+....+++++++..+..+|++..+..+ ..++. T Consensus 130 gEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~s 209 (393) T protein:vir:79 130 AEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYS 209 (393) T ss_pred cccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccc Confidence 9999999865 458899999999999999999999999999999999999999999999999998766554 34433 Q ss_pred eccc-----cccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHH-HhhcccCCcee-------ecccc Q lcl|Aclame:pro 493 NMTG-----VPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK-AQVFDNTGERI-------WQNNE 559 (632) Q Consensus 493 ~~a~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~d~~g~~~-------~~~~~ 559 (632) .... ..-...-++.++.++|.|+..++.... .....++|+|--+..... .++......++ +.... T Consensus 210 t~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~h--yt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~ 287 (393) T protein:vir:79 210 TNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANE--YTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSM 287 (393) T ss_pred cCccceeecCCccccccccccHHHHHHHHHHHhccc--CCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhh Confidence 2221 111123456788999999888776432 334566777665543322 22222221221 11112 Q ss_pred ccC-----------cceEEcCCCCCcc------EEEEehhhEEEE-EecceEEEEecccccccCcEEEEEEEEeCcEEec Q lcl|Aclame:pro 560 VNG-----------YRAEASNQIPADT------WIFGDWSQIVIA-MWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRR 621 (632) Q Consensus 560 l~G-----------~pv~~~~~~~~~~------~~~gd~s~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~ 621 (632) .+| +.|++++.+|-.+ ++..|-+..-+- .+.++. ++...+-..|.+.++...|+|++|.+ T Consensus 288 algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~--tdq~ddk~rdiq~iKl~ERYG~gvLn 365 (393) T protein:vir:79 288 ALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLK--TDQWDEKARGLQNIKMIERYGIGILN 365 (393) T ss_pred hhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcc--eeccccccccceeeeeeeeeceeeee Confidence 222 5789999888543 344444443221 222333 33333445688889999999999999 Q ss_pred cc-ceEE---EEecC Q lcl|Aclame:pro 622 KE-AFCI---AKKGA 632 (632) Q Consensus 622 ~~-a~~~---~~~~A 632 (632) .. |+.. +++.= T Consensus 366 ~gkaiavakNI~~~k 380 (393) T protein:vir:79 366 EGKAIAVAKNISMDK 380 (393) T ss_pred CCceEEEEecceeec Confidence 84 4433 33332 No 123 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.52 E-value=2.2e-15 Score=100.94 Aligned_cols=255 Identities=12% Similarity=0.053 Sum_probs=180.8 Q ss_pred cccccccceechhhhhHHHHHHHhhhhhhhhhcce--eecc-CceeEEEEEecCCccccccccCcccccCcccceeeeee Q lcl|Aclame:pro 360 KTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--MLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) Q Consensus 360 ~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~ 436 (632) ...+.-..+|.|+++.+-+.+.+.....+.++... .+.+ .+..+++|.....+.+..+.|+..++...++.++.... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 22223445788888888888887666666665432 1222 34578888887777888899999999999999999999 Q ss_pred eeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHH Q lcl|Aclame:pro 437 PKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDME 516 (632) Q Consensus 437 ~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~ 516 (632) ++++|+.+.++++....+.-+....+.+.++.++++.++..++.-.... . ......++++.|.+++ T Consensus 81 i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a------~--------~~~~~~~t~~~~~dA~ 146 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKS------K--------QTATVSADATGILDAI 146 (270) T ss_pred eehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccc------c--------cccccccCHHHHHHHH Confidence 9999999999999887765577889999999999999999887433211 1 0112346788999999 Q ss_pred HHHHhhccccccceEEeehhHHHHHHHHhhccc--CCceee---ccccccCcceEEcCCCC-CccEEEEehhhEEEEEec Q lcl|Aclame:pro 517 TKISTFNADAGRLAYLTSVTQRGAAKKAQVFDN--TGERIW---QNNEVNGYRAEASNQIP-ADTWIFGDWSQIVIAMWG 590 (632) Q Consensus 517 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~--~g~~~~---~~~~l~G~pv~~~~~~~-~~~~~~gd~s~~~~~~~~ 590 (632) ..+...... ....+||+.....++.....+. .|.-+. .-+.++|++|++++..+ .++.++.....+.++... T Consensus 147 ~~lgd~~~~--~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~~~~ 224 (270) T protein:vir:95 147 EVFNSENDE--DYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIVNKK 224 (270) T ss_pred HHhccccCC--CcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCCCceeEEEEeccceeeeecC Confidence 998765432 3446778777766654332211 122122 13578999998877554 455555555666677777 Q ss_pred ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 591 VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 591 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.++.+++. ......+.+..++++++.++..+|+++++- T Consensus 225 ~~~vEtdRd~--~~~~d~i~~~~~y~v~~~~~skvv~~t~~~ 264 (270) T protein:vir:95 225 KPEAYTDFDI--LKRTHLLSTNYHYSVNLKDETGVVKVTFKP 264 (270) T ss_pred Cceeeeccch--hhcccEEEeeeEEEEEEEccceEEEEEecC Confidence 7776655544 445568899999999999999999999766 No 124 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.48 E-value=3e-15 Score=100.19 Aligned_cols=219 Identities=11% Similarity=0.031 Sum_probs=160.8 Q ss_pred ceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 393 ARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGV 472 (632) Q Consensus 393 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~ 472 (632) ... ......+++|.. .+.+..+.||.+++...++.+..+..++++|..+.|+++...-..-+......+.++.++++ T Consensus 1 ~~~-~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENG-INLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred Ccc-ccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 111 122335677755 45788999999999999999999999999999999999998876667789999999999999 Q ss_pred HHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh----hcc Q lcl|Aclame:pro 473 ALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ----VFD 548 (632) Q Consensus 473 ~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d 548 (632) ++|..++.-..... . .....++++.|.+++..+..... .....+|||.....++... +++ T Consensus 78 kvD~di~~~~~~a~-------------l-~~~~~~t~d~i~~A~~~fgde~~--~~~vivv~p~~~~~Lrk~~~~~~~~~ 141 (231) T protein:vir:73 78 KVDDDLLKAAKTTS-------------Q-TVSTKANVDGVQAALDIFNDEDA--QAYVLIVNPKDAAKIRKDANAKNIGS 141 (231) T ss_pred hhhHHHHHhhcccc-------------c-cccccccHHHHHHHHHHhccccc--cceEEEEcchHHHhhhhccchhhhhh Confidence 99999885332211 0 12245789999999999987643 2334667777665554321 122 Q ss_pred cCCceeecc---ccccCcceEEcCCCCCccEEEEe----hhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEec Q lcl|Aclame:pro 549 NTGERIWQN---NEVNGYRAEASNQIPADTWIFGD----WSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRR 621 (632) Q Consensus 549 ~~g~~~~~~---~~l~G~pv~~~~~~~~~~~~~gd----~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~ 621 (632) ..|..+... +.++|++|++|+.+|.++.++.. ...+.+....+++++.++ +.......+++...+++++.+ T Consensus 142 ~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdR--d~~~k~~~i~~~~~y~v~l~~ 219 (231) T protein:vir:73 142 EVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR--DIVTKTTVITADEHYAAYLYD 219 (231) T ss_pred hhccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccc--cccccccEEEEeEEEEEEEEc Confidence 223333333 57899999999999998876433 334566677777776654 445566789999999999999 Q ss_pred ccceEEEEecC Q lcl|Aclame:pro 622 KEAFCIAKKGA 632 (632) Q Consensus 622 ~~a~~~~~~~A 632 (632) |..+|+++++- T Consensus 220 ~~~vv~~t~~g 230 (231) T protein:vir:73 220 LTKVVNITFTG 230 (231) T ss_pred CccEEEEEeec Confidence 99999999999 No 125 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.32 E-value=9.7e-14 Score=91.92 Aligned_cols=272 Identities=10% Similarity=-0.044 Sum_probs=157.4 Q ss_pred hhhhhcccccccccce------echhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecC---CccccccccCccc Q lcl|Aclame:pro 353 VQRQLEKKTAGKGGEL------VATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS---GANFYWIGEDEDV 423 (632) Q Consensus 353 ~~~a~~~~~~~~~~~~------i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~a~~v~E~~~~ 423 (632) ...-....+...++.+ -.|+++...+++++...-+...+..+.-..+...+.+..... ...+..|.|++|+ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 0011111112222211 225566677777776666666665554344444555544332 2467789999999 Q ss_pred ccCcccceeeee-eeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc Q lcl|Aclame:pro 424 QDSDFDFTTLSF-SPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY 502 (632) Q Consensus 424 ~~~~~~~~~~~~-~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~ 502 (632) |.....++...+ ..+|+|..+.||+|++.....++.......+++++.+..|..++.-.-+...|. +.+++ ++ T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~--~~~s~----~w 154 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPT--LAVPT----AW 154 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--ccCCc----CC Confidence 999888876665 568999999999999999999999999999999999999998875432221111 00110 11 Q ss_pred cccchhHHHHHHHHHHH--------------HhhccccccceEEeehhHHHHHHHHhhccc------CCceee------- Q lcl|Aclame:pro 503 PAGGVDWASVVDMETKI--------------STFNADAGRLAYLTSVTQRGAAKKAQVFDN------TGERIW------- 555 (632) Q Consensus 503 ~~~~~~~~~i~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~------~g~~~~------- 555 (632) .+......++.++...+ ...+-....-..+|||..+. .+.+-.+. ++.+++ T Consensus 155 ~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~--~l~~n~~~~~~y~~~a~~~~~~~~~tg 232 (318) T protein:vir:10 155 DNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLP--ILMDNENFMKVYERNANYVSTAPDWTG 232 (318) T ss_pred CCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHH--HHhcchhhhhhhhccchhhhhcccccc Confidence 11111111222222111 11111222233455555544 33222221 222221 Q ss_pred -ccccccCcceEEcCCCCCccEEEEehhhE-EEEEecceEEEEec-c----cccccCcEEEEEEEEeCcEEecccceEEE Q lcl|Aclame:pro 556 -QNNEVNGYRAEASNQIPADTWIFGDWSQI-VIAMWGVLDLKVDP-Y----TKAASDGLVLRVFQDVDAGVRRKEAFCIA 628 (632) Q Consensus 556 -~~~~l~G~pv~~~~~~~~~~~~~gd~s~~-~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~ 628 (632) .++.++|+.|+.++.+|.+++++.+-..+ .+.+-.++....-. + ..-.+.....++......+|.+|+|+|+| T Consensus 233 ~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~i 312 (318) T protein:vir:10 233 NFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWL 312 (318) T ss_pred cccceeeceEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEE Confidence 14568999999999999999988765443 23344444432211 1 11233445677888888999999999999 Q ss_pred EecC Q lcl|Aclame:pro 629 KKGA 632 (632) Q Consensus 629 ~~~A 632 (632) +-== T Consensus 313 tgi~ 316 (318) T protein:vir:10 313 TGIV 316 (318) T ss_pred eecc Confidence 8655 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.31 E-value=5e-13 Score=88.04 Aligned_cols=257 Identities=10% Similarity=0.005 Sum_probs=157.6 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhccee---eccCceeEEEEEecCCccccccccCcccccCcccceeee Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM---LPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLS 434 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 434 (632) +. -. .+.++++...+++.+.....+..+..+. ......++++|+.+......+..+++..+...+..+.++ T Consensus 1 MA-----~~-~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MA-----FN-NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) T ss_pred Cc-----ch-hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEE Confidence 11 11 1346677777887777776666553222 222234788888887777778888988888888889999 Q ss_pred eeeeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHH Q lcl|Aclame:pro 435 FSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVV 513 (632) Q Consensus 435 ~~~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~ 513 (632) +.+.+. +..+.|+..-...+..++.. +.+.++.++++.+|..++.-....... ............++.|. T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~--------~~~~~~~~~~~~~~~i~ 145 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA--------LTGSAPSDADDAFDLIA 145 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccchhhHHHHHH Confidence 998774 55667776444445567655 677889999999998765322111000 00011111123467889 Q ss_pred HHHHHHHhhccccccceEEeehhHHHHHHHH--hh--cccCCc--eee--ccccccCcceEEcCCCCCcc---EEEEehh Q lcl|Aclame:pro 514 DMETKISTFNADAGRLAYLTSVTQRGAAKKA--QV--FDNTGE--RIW--QNNEVNGYRAEASNQIPADT---WIFGDWS 582 (632) Q Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--~d~~g~--~~~--~~~~l~G~pv~~~~~~~~~~---~~~gd~s 582 (632) ++..+|..++.+......+++|.....+... .+ .+..|. .+. .-+.|.|++|+.++.+|.++ ++.+-.+ T Consensus 146 ~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~ 225 (273) T protein:vir:79 146 SALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) T ss_pred HHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEecc Confidence 9999998888776666677777665544321 12 122222 121 23679999999999999654 3333333 Q ss_pred hEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 583 QIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 583 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+....+. ..+.. +..-..-...+++.+.+|+++++|++++.++..+ T Consensus 226 A~~~a~~~-~~~e~--~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g 272 (273) T protein:vir:79 226 AAAYVSQI-DTVEA--LRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred ceeeeeeh-hhhhc--ccCcccceeeeeeeeeeeeEEecCceEEEEeccC Confidence 33222211 11111 1111222456889999999999999999999888 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.27 E-value=1.5e-12 Score=85.48 Aligned_cols=257 Identities=11% Similarity=0.021 Sum_probs=157.2 Q ss_pred ccccceechhhhhHHHHHHHhhhhhhhhhcceeec---cCceeEEEEEecCCccccccccCcccccCcccceeeeeeeee Q lcl|Aclame:pro 363 GKGGELVATELLSEEFIDILRNKAIIGQMGARMLP---GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKT 439 (632) Q Consensus 363 ~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t 439 (632) .+-. .+.++++...+.+.+.....+..+..+-.. ....++.+|+.+..+...+..+++......++.+.+++.+.+ T Consensus 1 MA~~-~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFN-NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred Ccch-hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 1111 123567777777777777666655322211 123468888887777677788888877777888888888866 Q ss_pred e-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHH Q lcl|Aclame:pro 440 I-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETK 518 (632) Q Consensus 440 ~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~ 518 (632) . +..+.|++.-...+..++.+ +.+.++.+++..+|..++.-....... .. ..+.....-.++.|.++..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-----~~---~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-----LT---GSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-----cc---cccccchhHHHHHHHHHHHH Confidence 4 44566776444444556655 777889999999998775322111000 00 00111122346789999999 Q ss_pred HHhhccccccceEEeehhHHHHHHHH--hhc--ccCCc-eee---ccccccCcceEEcCCCCCcc---EEEEehhhEEEE Q lcl|Aclame:pro 519 ISTFNADAGRLAYLTSVTQRGAAKKA--QVF--DNTGE-RIW---QNNEVNGYRAEASNQIPADT---WIFGDWSQIVIA 587 (632) Q Consensus 519 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--d~~g~-~~~---~~~~l~G~pv~~~~~~~~~~---~~~gd~s~~~~~ 587 (632) |..++.+......+++|.....+... .+. +..|. -.+ .-+.+.|++|+.++.+|.+. ++.+-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 99888876666777777776655431 122 22221 122 23679999999999999643 444444443332 Q ss_pred EecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 588 MWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 588 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . .+. ....+..-..-...+++.+.+|+++++|++++.++..+ T Consensus 231 ~--q~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 231 S--QID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred e--eee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccC Confidence 2 121 11111111222446889999999999999999999888 No 128 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.27 E-value=1.5e-12 Score=85.48 Aligned_cols=257 Identities=11% Similarity=0.021 Sum_probs=157.2 Q ss_pred ccccceechhhhhHHHHHHHhhhhhhhhhcceeec---cCceeEEEEEecCCccccccccCcccccCcccceeeeeeeee Q lcl|Aclame:pro 363 GKGGELVATELLSEEFIDILRNKAIIGQMGARMLP---GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKT 439 (632) Q Consensus 363 ~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t 439 (632) .+-. .+.++++...+.+.+.....+..+..+-.. ....++.+|+.+..+...+..+++......++.+.+++.+.+ T Consensus 1 MA~~-~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFN-NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred Ccch-hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 1111 123567777777777777666655322211 123468888887777677788888877777888888888866 Q ss_pred e-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHH Q lcl|Aclame:pro 440 I-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETK 518 (632) Q Consensus 440 ~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~ 518 (632) . +..+.|++.-...+..++.+ +.+.++.+++..+|..++.-....... .. ..+.....-.++.|.++..+ T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-----~~---~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-----LT---GSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-----cc---cccccchhHHHHHHHHHHHH Confidence 4 44566776444444556655 777889999999998775322111000 00 00111122346789999999 Q ss_pred HHhhccccccceEEeehhHHHHHHHH--hhc--ccCCc-eee---ccccccCcceEEcCCCCCcc---EEEEehhhEEEE Q lcl|Aclame:pro 519 ISTFNADAGRLAYLTSVTQRGAAKKA--QVF--DNTGE-RIW---QNNEVNGYRAEASNQIPADT---WIFGDWSQIVIA 587 (632) Q Consensus 519 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--d~~g~-~~~---~~~~l~G~pv~~~~~~~~~~---~~~gd~s~~~~~ 587 (632) |..++.+......+++|.....+... .+. +..|. -.+ .-+.+.|++|+.++.+|.+. ++.+-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 99888876666777777776655431 122 22221 122 23679999999999999643 444444443332 Q ss_pred EecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 588 MWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 588 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . .+. ....+..-..-...+++.+.+|+++++|++++.++..+ T Consensus 231 ~--q~~-~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g 272 (273) T protein:vir:10 231 S--QID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) T ss_pred e--eee-hhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccC Confidence 2 121 11111111222446889999999999999999999888 No 129 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.20 E-value=1.2e-11 Score=80.51 Aligned_cols=293 Identities=13% Similarity=0.101 Sum_probs=170.4 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe Q lcl|Aclame:pro 329 EVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK 408 (632) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 408 (632) +...+....... ....... .+..+.+.. +++.+.++. ...|++.....+++.+. +++++..+.+..+.+. T Consensus 1 ~~~~~~~~~~~n------~~~~~i~-k~~it~~~l-~~g~L~p~~-a~~Fl~~v~~~t~iL~~-~r~~~~~s~~~ei~ki 70 (360) T protein:vir:99 1 MSSNSTIDSVRN------QNMNSLS-QKDIGLAEL-DGFQLPVDV-TEEFLERMQKGVQILGM-ADTMTLARLEMEVPQF 70 (360) T ss_pred CcchhHHHHHhh------hHHHHHH-hhhcccccc-CceeecHHH-HHHHHHHHhhccchhhh-cceeeccccccccccc Confidence 000111000000 0001111 222222222 345555554 56788888888887776 4666777666666655 Q ss_pred cCCccc-cccccCccccc-Ccccceeeee-eeeeeeeeehhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 409 TSGANF-YWIGEDEDVQD-SDFDFTTLSF-SPKTIAGAVPVTRKLRKQS----SIHVENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 409 ~~~~~a-~~v~E~~~~~~-~~~~~~~~~~-~~~t~~~~~~iSre~l~d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) +-+... .--.|++..+. ..+....+.+ ..+++-....++.+.+++. .....+.|.+.|++++++.++...++| T Consensus 71 g~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g 150 (360) T protein:vir:99 71 GVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRA 150 (360) T ss_pred ccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhc Confidence 443221 12234433322 3444444544 3445555666777776654 224568899999999999999988888 Q ss_pred CCCcc--------c-----cccceecccccc--cc--------------------------ccc----cchhHHHHHHHH Q lcl|Aclame:pro 482 TGLAN--------D-----PVGLLNMTGVPA--LT--------------------------YPA----GGVDWASVVDME 516 (632) Q Consensus 482 ~g~~~--------~-----~~Gil~~a~~~~--~~--------------------------~~~----~~~~~~~i~~~~ 516 (632) +.... . -.|++..+.... +. +.+ ...+..-+.+++ T Consensus 151 ~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~ 230 (360) T protein:vir:99 151 GASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETI 230 (360) T ss_pred cchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHH Confidence 74421 1 236555441000 00 000 112444577899 Q ss_pred HHHHhhccccc--cceEEeehhHHHHHHHHhhcc---cCCceeec-cc--cccCcceEEcCCCCCccEEEEehhhEEEEE Q lcl|Aclame:pro 517 TKISTFNADAG--RLAYLTSVTQRGAAKKAQVFD---NTGERIWQ-NN--EVNGYRAEASNQIPADTWIFGDWSQIVIAM 588 (632) Q Consensus 517 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~d---~~g~~~~~-~~--~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~ 588 (632) ..++..|++.. +..|+|++........ .+.+ +-|.-... .+ ..+|+|++..+.+|.+.++|-+++.+.++. T Consensus 231 ~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~-~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~ 309 (360) T protein:vir:99 231 QTLDSRYRESDAYSPVLMTSPNQVQSYTM-SLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGL 309 (360) T ss_pred HhcchhhhcCcccceEEEccCchHHHHHH-HHhccCcccchhheecccccccceeeeEEcCCCCCCceEEeccCceeEEe Confidence 99999997643 4578888886544332 2222 22332222 22 467999999999999999999999999999 Q ss_pred ecceEEEEeccccc--ccC-cEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 589 WGVLDLKVDPYTKA--ASD-GLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 589 ~~~~~~~~~~~~~~--~~~-~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +..+++..+.+... .+. .+.+..+.++|+.+.+++|+|+++--= T Consensus 310 ~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~ 356 (360) T protein:vir:99 310 YEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLE 356 (360) T ss_pred eeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCC Confidence 99998865443222 221 244556778999999999999987544 No 130 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.18 E-value=1.9e-11 Score=79.39 Aligned_cols=267 Identities=12% Similarity=0.119 Sum_probs=161.5 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhh-cceeeccCceeEEEEEecCCccccccccC-----cccccCcccc Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDVDIPKKTSGANFYWIGED-----EDVQDSDFDF 430 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v~E~-----~~~~~~~~~~ 430 (632) +..-+-.....+. ++.....+++.+...+.+.+. -...+.+ ..+.+.+...-+.+.+.+-+ ...+++..+| T Consensus 1 mpaltLaea~k~~-~d~l~~~ViE~~~~~s~lL~~LpF~~veg--~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~ 77 (310) T protein:vir:97 1 MASVTLAESAKLA-QDELVAGVIENIITVNRMFDVLPFDSIEG--NSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATF 77 (310) T ss_pred CcccchHHHhhcC-cchHHHHHHHHHhccchHHHhCCcccccC--CcceeeEeeccCCcccccccccccCCCcccccccc Confidence 1111111111222 223445667777666655544 2222233 34556666555444443322 2334567788 Q ss_pred eeeeeeeeeeeeeehhhHHHh--h-cChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccc-cccc--ccc Q lcl|Aclame:pro 431 TTLSFSPKTIAGAVPVTRKLR--K-QSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGV-PALT--YPA 504 (632) Q Consensus 431 ~~~~~~~~t~~~~~~iSre~l--~-d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~-~~~~--~~~ 504 (632) .+.++.++.+++.+.|.+.+. . ++..+....-.+...++++++.+..+|+|+.+++...|++...+. ..+. ..+ T Consensus 78 ~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~g 157 (310) T protein:vir:97 78 TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATG 157 (310) T ss_pred ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCC Confidence 999999999999999998543 2 334556566677888999999999999999887776788765433 2232 245 Q ss_pred cchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHH-HHhh----------cccCCceeeccccccCcceEEcCCCCC Q lcl|Aclame:pro 505 GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK-KAQV----------FDNTGERIWQNNEVNGYRAEASNQIPA 573 (632) Q Consensus 505 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----------~d~~g~~~~~~~~l~G~pv~~~~~~~~ 573 (632) +.++.+++..++..+... ++....+++|+.....++ +.+- .+..|+++ ..+.|.|++.++.+|. T Consensus 158 g~~t~d~LDeLl~~v~~~--~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v---~~~~GiPi~~~d~ip~ 232 (310) T protein:vir:97 158 SAISFAILDELMDLVVDK--DGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEV---PAYSGTPIFRNDYIPT 232 (310) T ss_pred CCCCHHHHHHHHHHHhcC--CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEE---eeeCCeEEEEeCccCC Confidence 778889888888887532 234456888887533322 1111 22223333 3688999999998875 Q ss_pred c----------cEEEEehhh-----EEEEEe----cceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 574 D----------TWIFGDWSQ-----IVIAMW----GVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 574 ~----------~~~~gd~s~-----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) + ++|+.-+.. -.++.. .|+.+.. ...--+++...+++..|++.++.+|+|+.+|+--= T Consensus 233 ~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~-~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 233 NQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVD-VGESEDSDEHIWRVKWYCGLALFSEKGLACADGIT 309 (310) T ss_pred CccccccCCceeEEEEeeCccccccceeccccCCccceeEEe-CCcccCCcceeEEEEEeeeEEEecccceeeecccc Confidence 3 244443322 222221 2333322 11111456778999999999999999999987777 No 131 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.13 E-value=4.5e-12 Score=82.79 Aligned_cols=280 Identities=12% Similarity=0.000 Sum_probs=162.5 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeec---cCceeEEEEEecCCccccccccCccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLP---GLVGDVDIPKKTSGANFYWIGEDEDV 423 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~E~~~~ 423 (632) +....-..++..+ +.....++ |+++...+++.+.....+..+. +-.+ ..+.++++|+.+ .+.+..+.+++.+ T Consensus 1 ~~~~~~~~~~~~~--t~~v~~fi-pei~s~~i~~~l~~~~v~~~~~-~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i 75 (341) T protein:vir:94 1 MALGNTITGPSIN--TQRGQQFI-PEQWLSEVQMFRKAKMLDTSVV-KTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPV 75 (341) T ss_pred Ccchhhhcccccc--chhHHHHH-HHHHHHHHHHHHHhhcchhhcc-ccccccccCCceEEEeccC-cceeeeecCCCcc Confidence 1111011111111 11112233 6777888888887777666653 2222 223467888775 4556667788888 Q ss_pred ccCcccceeeeeeeeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCC-cccccc-ceeccccccc Q lcl|Aclame:pro 424 QDSDFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGL-ANDPVG-LLNMTGVPAL 500 (632) Q Consensus 424 ~~~~~~~~~~~~~~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~-~~~~~G-il~~a~~~~~ 500 (632) +...++...+++.+.++ ...+.|++.-...+..++...+.+.++.++++..|..++..... ...+.+ ....... .. T Consensus 76 ~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~-~~ 154 (341) T protein:vir:94 76 GVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNG-AI 154 (341) T ss_pred ccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccc-cc Confidence 88888888888888555 55678888666667788999999999999999999887643211 111111 1111111 12 Q ss_pred cccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh-h--cccCCceeecc---ccccCcceEEcCCCCCc Q lcl|Aclame:pro 501 TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-V--FDNTGERIWQN---NEVNGYRAEASNQIPAD 574 (632) Q Consensus 501 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~d~~g~~~~~~---~~l~G~pv~~~~~~~~~ 574 (632) ......++++.|.++...|..+..+......+++|.....+.... + .+..|...+.. +.++|++|+.++++|.+ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~ 234 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNN 234 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecccccc Confidence 223345678899999999999888766666667776665553221 1 11223222222 47999999999999865 Q ss_pred cEEE---------------------------EehhhE--EEEEec---ceEEE------------Eecccccc--cCcEE Q lcl|Aclame:pro 575 TWIF---------------------------GDWSQI--VIAMWG---VLDLK------------VDPYTKAA--SDGLV 608 (632) Q Consensus 575 ~~~~---------------------------gd~s~~--~~~~~~---~~~~~------------~~~~~~~~--~~~~~ 608 (632) +... +++..+ .++.+. +..+. ...+..|. +.... T Consensus 235 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (341) T protein:vir:94 235 SATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWL 314 (341) T ss_pred ccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhh Confidence 4210 011111 000000 11000 00111111 23335 Q ss_pred EEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 609 LRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 609 ~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++..=+|+++.||++.+-|+.+| T Consensus 315 i~~~~~~G~~~lrp~~~v~~~~~~ 338 (341) T protein:vir:94 315 MVGRQAYGARLYRPLHAVNIHTTG 338 (341) T ss_pred hhhhhhhcccccCcceeEEEecCc Confidence 677788999999999999999999 No 132 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.95 E-value=1.8e-10 Score=73.95 Aligned_cols=279 Identities=14% Similarity=0.032 Sum_probs=153.4 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhccee-ecc-CceeEEEEEecCCccccccccCcccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM-LPG-LVGDVDIPKKTSGANFYWIGEDEDVQ 424 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~ 424 (632) +..-......+...-..+....+.|+++...+++.+.+...+..+..+. ... ...++++|+.+ .+.+..+.+++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~g~~i~ 79 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQPQTPVN 79 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecCCCccc Confidence 0000000000111111122223446788888888887777766653321 122 23467788776 45677788899888 Q ss_pred cCcccceeeeeeeeeee-eeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc-cccc--------cceec Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIA-GAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA-NDPV--------GLLNM 494 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~-~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~-~~~~--------Gil~~ 494 (632) ...++..++++.+.++- ..+.|++.-...+..++.+.+.+.++.++++..|..++...... ..+. ++-.. T Consensus 80 ~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~ 159 (381) T protein:vir:80 80 LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDG 159 (381) T ss_pred ccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccccc Confidence 88888888888886654 34788886666777899999999999999999999886432110 0011 11111 Q ss_pred cccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh-h--cccCCceeec---cccccCcceEEc Q lcl|Aclame:pro 495 TGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-V--FDNTGERIWQ---NNEVNGYRAEAS 568 (632) Q Consensus 495 a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~d~~g~~~~~---~~~l~G~pv~~~ 568 (632) ....+........+++.|.++...|.....+......++.|.....|.... + .+..+...+. .+.|+|++|+.+ T Consensus 160 ~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~Vv~S 239 (381) T protein:vir:80 160 TVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVIVT 239 (381) T ss_pred ccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEEEee Confidence 111122233445678999999999998887766666677777665554321 1 1111211222 257999999999 Q ss_pred CCCCCccEE-----EEehhhEEEEEecceEEEEec-ccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 569 NQIPADTWI-----FGDWSQIVIAMWGVLDLKVDP-YTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 569 ~~~~~~~~~-----~gd~s~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+|.+... ++-+.... ..+. -.. .-.|......++....+|..+......+..-..| T Consensus 240 n~lp~~~~t~~~~~agap~~~~----~~~~--~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~ 303 (381) T protein:vir:80 240 TQIGINSLTGYVNGQGAPTQPT----PGVL--GSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGA 303 (381) T ss_pred cccccccccceeeecccccccc----cccc--ccccccccccceeeeeeeeeeceeeeeeeccceeeecc Confidence 999965321 11111100 0000 001 1123333445666666666664443333322212 No 133 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.95 E-value=1.1e-10 Score=75.12 Aligned_cols=285 Identities=12% Similarity=0.030 Sum_probs=147.6 Q ss_pred hhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 422 (632) ............+........+ ...+.-+.+...+...+...+.++.+.....-..+.++.+++.+ ...+.....|.+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d-~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG-~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGD-KLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLG-RTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccc-hHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeecc-ceeEeeeecCcC Confidence 0000000000011111111111 11122355566666667677777776433222234566777654 445666777777 Q ss_pred ccc--Ccccceeeeeeeeeee-eeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC--------CCccccccc Q lcl|Aclame:pro 423 VQD--SDFDFTTLSFSPKTIA-GAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT--------GLANDPVGL 491 (632) Q Consensus 423 ~~~--~~~~~~~~~~~~~t~~-~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~--------g~~~~~~Gi 491 (632) ... ..+..++.++.+.++- ..+.|.+.--....+++.+.+.+.++.++++..|..++... .....+.|. T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGL 158 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccC Confidence 654 3567777777776652 22344443333345678899999999999999998775321 111112221 Q ss_pred ee--------ccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceee-------c Q lcl|Aclame:pro 492 LN--------MTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIW-------Q 556 (632) Q Consensus 492 l~--------~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~-------~ 556 (632) .. .+.........+...++.|.++...|..++.+......++.|..+..+.. .+....+.+.. . T Consensus 159 ~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk-~~~~~~~~~~~~~~~~~G~ 237 (347) T protein:vir:94 159 GKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILA-ALMPNAANYQALIDPSTGS 237 (347) T ss_pred CcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHH-hhcccccccccccccccce Confidence 10 01111111112234467788888888888887665555555665544432 22222222211 1 Q ss_pred cccccCcceEEcCCCCCccE-------------------------EEEehhhE----------EEEEecceEEEEecccc Q lcl|Aclame:pro 557 NNEVNGYRAEASNQIPADTW-------------------------IFGDWSQI----------VIAMWGVLDLKVDPYTK 601 (632) Q Consensus 557 ~~~l~G~pv~~~~~~~~~~~-------------------------~~gd~s~~----------~~~~~~~~~~~~~~~~~ 601 (632) -+.++|++|+.++++|.... +=+||+.- ..+....+.+....+. T Consensus 238 V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~- 316 (347) T protein:vir:94 238 IRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA- 316 (347) T ss_pred eEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech- Confidence 24689999999999985320 11233321 1111222333332222 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+-...+.+..-+|.++.||++.+.++.++ T Consensus 317 -~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~ 346 (347) T protein:vir:94 317 -NFQADQIIAKYAMGHGGLRPEACGALVFKK 346 (347) T ss_pred -hhhhhhhhhhhhhcCcccccceeEEEEecC Confidence 223335788888999999999987766666 No 134 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.91 E-value=1.3e-10 Score=74.79 Aligned_cols=284 Identities=10% Similarity=-0.034 Sum_probs=146.2 Q ss_pred hhhhhhHHhhhhhcccc-ccccc--ceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCc Q lcl|Aclame:pro 345 FYMPHEVLVQRQLEKKT-AGKGG--ELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDE 421 (632) Q Consensus 345 ~~~~~~~~~~~a~~~~~-~~~~~--~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 421 (632) .....+.........+. ..++. ..+.-+.+...+...+...+.++.+.....-..+.++.+++.+. ..+.....|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~-~~~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGK-LSAGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccc-eeEeeecCCC Confidence 00000000000010001 11111 12333566667777777777776664322222355677777754 4445555555 Q ss_pred ccccC-cccceeeeeeeeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC----CCccccccceecc Q lcl|Aclame:pro 422 DVQDS-DFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT----GLANDPVGLLNMT 495 (632) Q Consensus 422 ~~~~~-~~~~~~~~~~~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~----g~~~~~~Gil~~a 495 (632) .+... .+.-++.++.+.+. ...+.|.+---.....++.+.+.+.++.++++..|..++... .......+... . T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g-~ 158 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG-G 158 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccc-c Confidence 55432 45556666666542 222333332222345578899999999999999998775321 11111121111 1 Q ss_pred cccccc---ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH---hhc-----ccCCceeec---ccccc Q lcl|Aclame:pro 496 GVPALT---YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA---QVF-----DNTGERIWQ---NNEVN 561 (632) Q Consensus 496 ~~~~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-----d~~g~~~~~---~~~l~ 561 (632) ...++. ......-++.|.++...|..++.+......++.|..+..|... ++. +.+| .+.. -+.++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~-~~~~g~~i~~i~ 237 (332) T protein:vir:78 159 FHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG-DMNSGKGLYSIA 237 (332) T ss_pred cccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccccc-ceecceeeeEEe Confidence 111111 1122334678889999999988876665566666665544321 110 1122 2222 24689 Q ss_pred CcceEEcCCCCCcc--------------EEEEehhhE--EEE--------EecceEEEEec-ccccccCcEEEEEEEEeC Q lcl|Aclame:pro 562 GYRAEASNQIPADT--------------WIFGDWSQI--VIA--------MWGVLDLKVDP-YTKAASDGLVLRVFQDVD 616 (632) Q Consensus 562 G~pv~~~~~~~~~~--------------~~~gd~s~~--~~~--------~~~~~~~~~~~-~~~~~~~~~~~~~~~r~~ 616 (632) |.+|+.++++|... .+-++|+.. .++ ...++++.... +..-.+-...+++.+-+| T Consensus 238 G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G 317 (332) T protein:vir:78 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) T ss_pred eeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhc Confidence 99999999998532 234444441 111 11222222111 111111223577778899 Q ss_pred cEEecccceEEEEec Q lcl|Aclame:pro 617 AGVRRKEAFCIAKKG 631 (632) Q Consensus 617 ~~v~~~~a~~~~~~~ 631 (632) +++++|++++.++.| T Consensus 318 ~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 318 CGSLRTSVAGSFQAA 332 (332) T ss_pred CceecccceEEEeeC Confidence 999999999999999 No 135 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.88 E-value=1e-09 Score=69.91 Aligned_cols=285 Identities=11% Similarity=-0.018 Sum_probs=146.7 Q ss_pred hhhh-HHhhhhhcccccc-----cccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccC Q lcl|Aclame:pro 347 MPHE-VLVQRQLEKKTAG-----KGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGED 420 (632) Q Consensus 347 ~~~~-~~~~~a~~~~~~~-----~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 420 (632) +... .....+...++.. .....+.-+.+...+...+...+.++.+...+.-..+.++.+++.+.. .+....-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~-t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRM-TSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeee-EEeeecCC Confidence 0000 0000111111111 111122334556666677777777776643332234556777777543 44444445 Q ss_pred cccccC---cccceeeeeeeeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC----CCc------- Q lcl|Aclame:pro 421 EDVQDS---DFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT----GLA------- 485 (632) Q Consensus 421 ~~~~~~---~~~~~~~~~~~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~----g~~------- 485 (632) .++... .++-++.++.+.++ ...+.|..---.....++.+.+.++++.++++..|..++... ... T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 444321 33333333444333 122333332223345678899999999999999999875321 110 Q ss_pred -ccccccee--ccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh----hc--ccCCceeec Q lcl|Aclame:pro 486 -NDPVGLLN--MTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ----VF--DNTGERIWQ 556 (632) Q Consensus 486 -~~~~Gil~--~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~--d~~g~~~~~ 556 (632) ..+.|..- .++............++.|.++...|..++.+......++.|..+..+...+ +. |..|.-+.. T Consensus 160 ~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~~ 239 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQS 239 (375) T ss_pred ccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccceec Confidence 01111111 1111111112234456888889899999888866666666666555443221 11 111222222 Q ss_pred c---ccccCcceEEcCCCCCccE-------------------------------------EEEeh---h----------h Q lcl|Aclame:pro 557 N---NEVNGYRAEASNQIPADTW-------------------------------------IFGDW---S----------Q 583 (632) Q Consensus 557 ~---~~l~G~pv~~~~~~~~~~~-------------------------------------~~gd~---s----------~ 583 (632) . ..++|.+|+.++.+|..+. |-+|+ + . T Consensus 240 ~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A 319 (375) T protein:vir:10 240 GNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEA 319 (375) T ss_pred cceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhh Confidence 2 3588999999999884321 22233 1 1 Q ss_pred EEEEEecceEEEEec-ccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 584 IVIAMWGVLDLKVDP-YTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 584 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .-...-.++.+.++. +..-.+....+.+.+-+|.++.||++.+.|+..| T Consensus 320 ~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~ 369 (375) T protein:vir:10 320 AGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA 369 (375) T ss_pred eeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc Confidence 111223344444431 1223345556888899999999999999999998 No 136 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.87 E-value=3.2e-10 Score=72.62 Aligned_cols=284 Identities=11% Similarity=0.026 Sum_probs=147.2 Q ss_pred hhhhhhhhHHhhhhhccccccccc-ceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGG-ELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDE 421 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~-~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 421 (632) .........+..+.-.. ..++. ..+.-+.+...+...+...+.+..+.....-..+..+.+++.+.. .+.....+. T Consensus 1 ~a~~~~~~~~~~~~g~~--~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~-~~~~~~~g~ 77 (347) T protein:vir:88 1 MANATGGQQIGANQGKG--QSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-KGYYLAPGE 77 (347) T ss_pred CCCcccchhhhccCCCC--ccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecce-eeeeecccc Confidence 00000000000000011 11111 223335556666666666677666543322234556777766544 445555666 Q ss_pred cccc--Ccccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC--------Ccccccc Q lcl|Aclame:pro 422 DVQD--SDFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG--------LANDPVG 490 (632) Q Consensus 422 ~~~~--~~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g--------~~~~~~G 490 (632) ++.. -.+..+++++.+.++-. .+.|.+.--.....++.+.+.+.++.++++..|..++.... ....+.| T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC Confidence 5543 35677777777777632 34555444444456788899999999999999998763211 0111222 Q ss_pred ceeccccccccc-------cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceee-------c Q lcl|Aclame:pro 491 LLNMTGVPALTY-------PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIW-------Q 556 (632) Q Consensus 491 il~~a~~~~~~~-------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~-------~ 556 (632) +-....+...+. ......++.|.++...|..++.+......++.|..+..|... .......+.. . T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~-~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSA-LMPNAANYAALIDPETGN 236 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcc-hhhhhhhhccccchhcce Confidence 111111111100 111223678888888999888887776666667665544321 1111222211 1 Q ss_pred cccccCcceEEcCCCCCcc---E----------------------EEEehhhEE-E---------EEecceEEEEecccc Q lcl|Aclame:pro 557 NNEVNGYRAEASNQIPADT---W----------------------IFGDWSQIV-I---------AMWGVLDLKVDPYTK 601 (632) Q Consensus 557 ~~~l~G~pv~~~~~~~~~~---~----------------------~~gd~s~~~-~---------~~~~~~~~~~~~~~~ 601 (632) .+.++|++|+.++++|.+. . +.+|++... + +....+.++..... T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~- 315 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP- 315 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech- Confidence 2568999999999998421 0 222333311 1 11112222222211 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+-...+++.+-+|.++++|++.+.++..+ T Consensus 316 -~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~ 345 (347) T protein:vir:88 316 -EFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) T ss_pred -hhHHHHhhhhhhhcCceeccceEEEEEeCC Confidence 122235888899999999999887766655 No 137 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.84 E-value=3e-10 Score=72.82 Aligned_cols=285 Identities=11% Similarity=0.014 Sum_probs=149.8 Q ss_pred hhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccc Q lcl|Aclame:pro 345 FYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQ 424 (632) Q Consensus 345 ~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 424 (632) ..........+....++ .+...+.-+.+...+...+...+.++.+...+.-..+.++.+++. +.+.+....-|+++. T Consensus 1 m~~~~~~~~t~~~~~~~--~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGA--NSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELV 77 (334) T ss_pred CCCCcCCCccccccccc--cchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCC Confidence 00000000001111111 111234435666677777777777777644433334556777766 455667777788887 Q ss_pred cCcccceeeeeeeeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhc----CCCc----ccc---ccce Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTG----TGLA----NDP---VGLL 492 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g----~g~~----~~~---~Gil 492 (632) ...+..++.++.+.++ ...+.|..---.....++.+.+.+.++.++++..|..++.. .... ..+ .|+. T Consensus 78 ~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 78 VQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred CCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcc Confidence 7777778888888773 33445555433445668999999999999999999977522 1110 011 1222 Q ss_pred ecccccccc---ccccchhHHHHHHHHHHHHhhcccc---ccceEEeehhHHHHHHHH-hhccc-----C-Cceeec--c Q lcl|Aclame:pro 493 NMTGVPALT---YPAGGVDWASVVDMETKISTFNADA---GRLAYLTSVTQRGAAKKA-QVFDN-----T-GERIWQ--N 557 (632) Q Consensus 493 ~~a~~~~~~---~~~~~~~~~~i~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~-~~~d~-----~-g~~~~~--~ 557 (632) +.......+ .+....-.+.+.++...+..+..+. .....++.|..+..|... ++.+. . +..+.. - T Consensus 158 ~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i 237 (334) T protein:vir:80 158 LPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRI 237 (334) T ss_pred eeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeE Confidence 221111111 1111122344556666677666652 234455655555544332 11111 1 111111 2 Q ss_pred ccccCcceEEcCCCCCcc-----------EEEEehhhEEE--EEecce------EEEEecccccccCcEEEEEEEEeCcE Q lcl|Aclame:pro 558 NEVNGYRAEASNQIPADT-----------WIFGDWSQIVI--AMWGVL------DLKVDPYTKAASDGLVLRVFQDVDAG 618 (632) Q Consensus 558 ~~l~G~pv~~~~~~~~~~-----------~~~gd~s~~~~--~~~~~~------~~~~~~~~~~~~~~~~~~~~~r~~~~ 618 (632) ..++|.+|+.++++|... .+-|||+.... .-.+.+ .+..+-+..-.+-...+.+.+-+|.+ T Consensus 238 ~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g 317 (334) T protein:vir:80 238 AMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIG 317 (334) T ss_pred EEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCCc Confidence 468999999999999652 45567665322 111211 11111111111111134555677999 Q ss_pred EecccceEEEEecC Q lcl|Aclame:pro 619 VRRKEAFCIAKKGA 632 (632) Q Consensus 619 v~~~~a~~~~~~~A 632 (632) ++||+|++.+++.= T Consensus 318 ~lRPeaa~vv~~~~ 331 (334) T protein:vir:80 318 QRRPDAVAVHDITV 331 (334) T ss_pred eeccceEEEEEEee Confidence 99999999999988 No 138 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.83 E-value=4.7e-10 Score=71.74 Aligned_cols=287 Identities=13% Similarity=0.027 Sum_probs=142.2 Q ss_pred hhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 422 (632) ............+.-.....++.-.+.+ +.+...+...+...+.+..+.....-....++.+++.+.. .+.....|.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~-t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRT-KAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccce-eeeeecCCCC Confidence 0000000000011111111111111223 5566667777777777777644322233456677766544 4455566666 Q ss_pred cccC--cccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC-----CCc------ccc Q lcl|Aclame:pro 423 VQDS--DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT-----GLA------NDP 488 (632) Q Consensus 423 ~~~~--~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~-----g~~------~~~ 488 (632) +... .+...+.++.+.++-. .+.|.+.--.....++.+.+.+.++.++++..|..++... ... ..+ T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGL 158 (347) T ss_pred CCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Confidence 6443 3555665565544422 2333333333345678889999999999999999886211 100 000 Q ss_pred cc-cee-----ccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCcee----ec-- Q lcl|Aclame:pro 489 VG-LLN-----MTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERI----WQ-- 556 (632) Q Consensus 489 ~G-il~-----~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~----~~-- 556 (632) .+ ..+ .++........+..-++.|.++...|..+..+......++.|..+..|.... +-.++.+. .. T Consensus 159 ~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~-~~~~~d~~~~~~~~~G 237 (347) T protein:vir:33 159 GKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAAL-MPNAANYQALLDPERG 237 (347) T ss_pred cccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccc-cccccccccccccccc Confidence 00 000 0000000001112235777888888988888766655666665554443211 11112221 11 Q ss_pred -cccccCcceEEcCCCCCccE-------EE---------------EehhhE--EE------EEecceEEEEecccccccC Q lcl|Aclame:pro 557 -NNEVNGYRAEASNQIPADTW-------IF---------------GDWSQI--VI------AMWGVLDLKVDPYTKAASD 605 (632) Q Consensus 557 -~~~l~G~pv~~~~~~~~~~~-------~~---------------gd~s~~--~~------~~~~~~~~~~~~~~~~~~~ 605 (632) -+.++|++|+.++++|...+ .. ++|+.. .+ +....+.+....+..-.+- T Consensus 238 ~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~ 317 (347) T protein:vir:33 238 TIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred eeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhh Confidence 24689999999999986432 11 111110 01 1111221112111121222 Q ss_pred cEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 606 GLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 606 ~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ...+++.+-+|.++++|++.+.++.+= T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~i~~~~ 344 (347) T protein:vir:33 318 ADQIIAKYAMGHGGLRPEAAGAIVLPK 344 (347) T ss_pred hHhhhhhhhcCCceecccceEEEecCC Confidence 335778888899999999999888777 No 139 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.81 E-value=1e-09 Score=69.89 Aligned_cols=282 Identities=13% Similarity=0.050 Sum_probs=146.4 Q ss_pred hhhhHH-hhhhhcccccc--cc-cceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcc Q lcl|Aclame:pro 347 MPHEVL-VQRQLEKKTAG--KG-GELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) Q Consensus 347 ~~~~~~-~~~a~~~~~~~--~~-~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 422 (632) +..... ......+.... ++ ...+.-+.+...+...+...+.++.+...+.-..+.++.+++. +..++.....|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecCCC Confidence 000000 00001111110 11 1123335556666777777777776643332234556777776 4456677777777 Q ss_pred cccC--cccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC--------Ccccccc- Q lcl|Aclame:pro 423 VQDS--DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG--------LANDPVG- 490 (632) Q Consensus 423 ~~~~--~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g--------~~~~~~G- 490 (632) +... .+..++.++.+.++-. .+.|..---..+.+++.+.+.++++.++++..|..++.... ....|.| T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~ 159 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGL 159 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 6553 3555664444443322 12333322223456788999999999999999998763211 1112222 Q ss_pred ---ceeccccccccc----cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceee-----c-- Q lcl|Aclame:pro 491 ---LLNMTGVPALTY----PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIW-----Q-- 556 (632) Q Consensus 491 ---il~~a~~~~~~~----~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~-----~-- 556 (632) ++.......... .....-++.|.++...|..++.+......++.|..+..|...... ....+.. . T Consensus 160 ~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~-~~~~~~~~~~~~~G~ 238 (345) T protein:vir:22 160 GTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMP-NAANYAALIDPEKGS 238 (345) T ss_pred ccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccc-cccccccccccccce Confidence 111111101111 111234678888888898988887776667777766655332111 1111211 1 Q ss_pred cccccCcceEEcCCCCCccE--------------------------------EEEehhhEEEEEecceEEEEeccccccc Q lcl|Aclame:pro 557 NNEVNGYRAEASNQIPADTW--------------------------------IFGDWSQIVIAMWGVLDLKVDPYTKAAS 604 (632) Q Consensus 557 ~~~l~G~pv~~~~~~~~~~~--------------------------------~~gd~s~~~~~~~~~~~~~~~~~~~~~~ 604 (632) -..++|.+|+.++++|.... +|.-.+.+..+....++++....... T Consensus 239 V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~-- 316 (345) T protein:vir:22 239 IRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANF-- 316 (345) T ss_pred EEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechhH-- Confidence 23689999999998874210 11111112122222233333222221 Q ss_pred CcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) -...+++.+-+|.++++|++.+.++++= T Consensus 317 ~~d~I~~~~a~G~~vlRPeaa~~i~~~~ 344 (345) T protein:vir:22 317 QADQIIAKYAMGHGGLRPEAAGAVVFKV 344 (345) T ss_pred HHHHHHHHHhcCCcccccceeEEEEEee Confidence 1225778888999999999988888777 No 140 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.80 E-value=2.5e-09 Score=67.77 Aligned_cols=262 Identities=10% Similarity=0.039 Sum_probs=154.2 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcce--------ee--ccCceeEEEEEecCC-ccccccccCccccc Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--------ML--PGLVGDVDIPKKTSG-ANFYWIGEDEDVQD 425 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~--------~~--~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~ 425 (632) +. .+.-..+|.|+++..-+.+.......+.+.+.- .. ...+..+++|....- +.+.-+.|+..++. T Consensus 1 MA---~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~ 77 (324) T protein:vir:59 1 MA---YTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVP 77 (324) T ss_pred CC---ceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccch Confidence 21 122245677777766555544444444332211 01 112335677777653 56788899999999 Q ss_pred CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccccc Q lcl|Aclame:pro 426 SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAG 505 (632) Q Consensus 426 ~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~ 505 (632) .+++-++-.-.++..++.+.++.+...-+.-+....+.++++...++..+..++.-....-...+.-++ .....+.+.. T Consensus 78 ~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~-~~dvsa~~~~ 156 (324) T protein:vir:59 78 QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDN-KLDISGTADG 156 (324) T ss_pred hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-eeeeeccccc Confidence 999988888888889999999987666566677888999999999998888776432100000000000 0011122334 Q ss_pred chhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc----cCCceeeccccccCcceEEcCCCCCc------- Q lcl|Aclame:pro 506 GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD----NTGERIWQNNEVNGYRAEASNQIPAD------- 574 (632) Q Consensus 506 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~g~~~~~~~~l~G~pv~~~~~~~~~------- 574 (632) .++.+.|.++..+|..... ....++||+.....+....+.+ .++.. .-+.++|++|++++.+|.. T Consensus 157 ~~s~~~l~~A~~~~GD~~~--~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~--~i~~~~G~~VivdD~~p~~~~~~~~~ 232 (324) T protein:vir:59 157 IYSAETFVDASYKLGDHES--LLTAIGMHSATMASAVKQDLIEFVKDSQSGI--RFPTYMNKRVIVDDSMPVETLEDGTK 232 (324) T ss_pred eecHHHHHHHHHHhCCccc--CcEEEEEchHHHHHHHHhhhhhhccccccCc--eeeeecccEEEEeCCCCccccCCCCc Confidence 5788999999999877543 4567889888887776554432 22221 2256899999999998842 Q ss_pred ---cEEEEehhhEEEEE-ecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ---TWIFGDWSQIVIAM-WGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ---~~~~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++|+. ..+.+.. ...+.++.++. ...+...+..+.++ +++|.++.+-+.+. T Consensus 233 ~y~s~l~~~-GAi~~~~~~~~v~vE~dRd--~~~g~~~l~~r~~~---~~~p~G~s~~~~~~ 288 (324) T protein:vir:59 233 VFTSYLFGA-GALGYAEGQPEVPTETARN--ALGSQDILINRKHF---VLHPRGVKFTENAM 288 (324) T ss_pred eEEEEEEec-CeEEEeecCCCcceecccC--ccccceEEEEeeEE---EeEeeeEEeccccc Confidence 244442 2222222 22233333332 23455556666554 46666666654432 No 141 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.77 E-value=9.2e-09 Score=64.65 Aligned_cols=281 Identities=10% Similarity=-0.016 Sum_probs=141.7 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +.......+.....+ .....+.-+.+..++...+...+.++.+...+.-..+.++.+++.+.. ++....-|.+.... T Consensus 1 ms~~n~~t~~~~~~~--~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~-~~~~~~~G~~ld~~ 77 (364) T protein:vir:10 1 MSNPNVLTQPAVSAS--GEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGET-ELQVLSPGKSPDAS 77 (364) T ss_pred CCCcccccccccccc--cchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeee-EEeeeccCcccCCC Confidence 111111111111111 111233434555666666666666666533333334556777777443 44555555555545 Q ss_pred cccceeeeeeeeeeeee-ehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHhhc---CC-Cccc---cccceecccc Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAGA-VPVTRKLRKQSSIH-VENLIREDLIEGIGVALDLAMLTG---TG-LAND---PVGLLNMTGV 497 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~~-~~iSre~l~d~~~~-~~~~i~~~l~~a~a~~~~~~~~~g---~g-~~~~---~~Gil~~a~~ 497 (632) .+.-++.++.+.++-.. ..|-.---..++++ +-+.+.+.+|.++++..|..++.. .+ .+-. -.++....+. T Consensus 78 ~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~ 157 (364) T protein:vir:10 78 PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGF 157 (364) T ss_pred CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcc Confidence 56667777777664321 22222111123455 678899999999999999977421 11 1000 0111111110 Q ss_pred ---ccccccccc----hhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH-hhcc------cCCceeec-cccccC Q lcl|Aclame:pro 498 ---PALTYPAGG----VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA-QVFD------NTGERIWQ-NNEVNG 562 (632) Q Consensus 498 ---~~~~~~~~~----~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d------~~g~~~~~-~~~l~G 562 (632) .....+... .-.+.|.++...+..++.+......++.|..+..+... ++-+ +.+.+... ...+.| T Consensus 158 ~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~G 237 (364) T protein:vir:10 158 SIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSWN 237 (364) T ss_pred eeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEec Confidence 011111112 22345566777788888877666666666665444322 1111 11222211 135899 Q ss_pred cceEEcCCCCCcc---------------------E--EEEehhhE----------EEEEecceEEEEecccccccCcEEE Q lcl|Aclame:pro 563 YRAEASNQIPADT---------------------W--IFGDWSQI----------VIAMWGVLDLKVDPYTKAASDGLVL 609 (632) Q Consensus 563 ~pv~~~~~~~~~~---------------------~--~~gd~s~~----------~~~~~~~~~~~~~~~~~~~~~~~~~ 609 (632) .||+.++++|... - ..+|++.. ..+...++......+. .+-...+ T Consensus 238 v~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~--~~~~~~i 315 (364) T protein:vir:10 238 TPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEK--KEKTWYI 315 (364) T ss_pred eEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecc--ceeeeee Confidence 9999999998421 0 11344331 1122233333332221 1222345 Q ss_pred EEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 610 RVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 610 ~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+.+-+|.++.||++.+.++.++ T Consensus 316 da~~a~G~g~lRPeaa~~i~~~~ 338 (364) T protein:vir:10 316 DTFLAEGAIPDRWEAVAVVTAAD 338 (364) T ss_pred eeehcccCcccCccceEEEEecC Confidence 56677999999999999999998 No 142 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.73 E-value=6.2e-10 Score=71.06 Aligned_cols=284 Identities=14% Similarity=0.083 Sum_probs=135.6 Q ss_pred hhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcc Q lcl|Aclame:pro 343 RGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDED 422 (632) Q Consensus 343 ~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~ 422 (632) .... ....+..+.-......+.-.+.+ +.+...+...+...+.++.+.....-..+..+.+++.+ ...+.....|++ T Consensus 1 m~~~-~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG-~~tv~~~t~G~~ 77 (347) T protein:vir:94 1 MANV-PGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMG-RTSGVYLAPGER 77 (347) T ss_pred CCCC-CccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccccceEEEeccc-ceeeeeecCCCC Confidence 0000 00000000000000000001222 33344445555555666655332222234567777764 445566666666 Q ss_pred cccC--cccceeeeeeeeeee-eeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC-------C-Cccccccc Q lcl|Aclame:pro 423 VQDS--DFDFTTLSFSPKTIA-GAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT-------G-LANDPVGL 491 (632) Q Consensus 423 ~~~~--~~~~~~~~~~~~t~~-~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~-------g-~~~~~~Gi 491 (632) +... .+.-.+.++.+.++- ..+.|.+.--.....++.+.+.+.++.++++..|..++... + +...+.|+ T Consensus 78 l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~ 157 (347) T protein:vir:94 78 LSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGL 157 (347) T ss_pred cCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCC Confidence 6443 345555555555442 12233222122234578888999999999999999775311 1 11111221 Q ss_pred eecc----cccccccc---ccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCce----eec---c Q lcl|Aclame:pro 492 LNMT----GVPALTYP---AGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGER----IWQ---N 557 (632) Q Consensus 492 l~~a----~~~~~~~~---~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~----~~~---~ 557 (632) -... +....+.+ ...--++.|.++...|..++.+......++.|..+..|... .......+ ... - T Consensus 158 ~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~~~G~V 236 (347) T protein:vir:94 158 GTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAA-LMPNAANYAALIDPETGNI 236 (347) T ss_pred cccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhcc-chhhhhhccccccccccce Confidence 1110 00010000 01122456677777888887776665666666665444221 11111111 111 2 Q ss_pred ccccCcceEEcCCCCCcc-----------E---------------EEEehhhE--EE--------EEecceEEEEecccc Q lcl|Aclame:pro 558 NEVNGYRAEASNQIPADT-----------W---------------IFGDWSQI--VI--------AMWGVLDLKVDPYTK 601 (632) Q Consensus 558 ~~l~G~pv~~~~~~~~~~-----------~---------------~~gd~s~~--~~--------~~~~~~~~~~~~~~~ 601 (632) +.++|.+|+.++++|... + +-+||+.- .+ +....++++..... T Consensus 237 g~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~- 315 (347) T protein:vir:94 237 RNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDV- 315 (347) T ss_pred EEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhch- Confidence 478999999999998421 1 11122211 00 11111222222211 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+-...+++.+-+|.++++|++.+.++.++ T Consensus 316 -~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~ 345 (347) T protein:vir:94 316 -DAQGDLIVGKYAMGHGGLRPEAAGALVFSP 345 (347) T ss_pred -hhHHHHhhhhhhhcCcccccceeEEEEecC Confidence 122236888999999999999999999888 No 143 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.72 E-value=1.5e-09 Score=68.96 Aligned_cols=286 Identities=11% Similarity=-0.015 Sum_probs=142.6 Q ss_pred HhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccc Q lcl|Aclame:pro 339 GKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIG 418 (632) Q Consensus 339 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 418 (632) ... ...........+.....+ .+ ...+.-+.+...+...+...+.++.+...+.-..+.++.+++.+ ...+.... T Consensus 1 ma~--~~~~~~~n~~~~~~~~~~-~~-~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG-~~~~~~~~ 75 (344) T protein:vir:10 1 MAN--MTGGQQLGTNQGKDVMAA-GD-KLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLG-RTQAAYLA 75 (344) T ss_pred Ccc--ccccccCCcccCCccCCc-cc-hhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeec-eeEEEeee Confidence 000 000000000000000000 00 11122245566666667777777766433322335567777764 44566666 Q ss_pred cCcccccC--cccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC--------Cccc Q lcl|Aclame:pro 419 EDEDVQDS--DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG--------LAND 487 (632) Q Consensus 419 E~~~~~~~--~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g--------~~~~ 487 (632) .|.++..+ .+.-++.++.+.++-. .+.|..---..+..++.+.+.+.++.++++..|..++.... .+.. T Consensus 76 ~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~ 155 (344) T protein:vir:10 76 PGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN 155 (344) T ss_pred cCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 77776653 4566666666655322 23333332333456788999999999999999988753211 1111 Q ss_pred cc----cceecccccccccccc----chhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCce-----e Q lcl|Aclame:pro 488 PV----GLLNMTGVPALTYPAG----GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGER-----I 554 (632) Q Consensus 488 ~~----Gil~~a~~~~~~~~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~-----~ 554 (632) |. |++..........+.. ..-++.|.++...|..++.+......++.|..+..|...... ....+ + T Consensus 156 ~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~-~~~~~~~~~~~ 234 (344) T protein:vir:10 156 ITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMP-NAANYAALIDP 234 (344) T ss_pred cccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccc-cccccccccce Confidence 22 2221111111111111 223566778888888888876666666666666555332111 11111 1 Q ss_pred ec--cccccCcceEEcCCCCCcc------EE---------------EEehhhE----------EEEEecceEEEEecccc Q lcl|Aclame:pro 555 WQ--NNEVNGYRAEASNQIPADT------WI---------------FGDWSQI----------VIAMWGVLDLKVDPYTK 601 (632) Q Consensus 555 ~~--~~~l~G~pv~~~~~~~~~~------~~---------------~gd~s~~----------~~~~~~~~~~~~~~~~~ 601 (632) .. -+.++|++|+.++++|.+. .. .++++.. ..+....++++...... T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 11 2458999999999998531 11 1122221 01111222332222211 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +-...+++.+-+|.+++||++.+.++++- T Consensus 315 --~~~d~i~g~~~~G~~vlRPe~a~~v~~~~ 343 (344) T protein:vir:10 315 --FQADQIIAKYAMGHGGLRPEAAGAVVFKT 343 (344) T ss_pred --HHHHHHHHHhhcccceecccceEEEEeec Confidence 11225778888999999999885555544 No 144 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=98.71 E-value=2.8e-10 Score=72.95 Aligned_cols=209 Identities=12% Similarity=0.065 Sum_probs=123.6 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccc-ccccCccccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFY-WIGEDEDVQD 425 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~v~E~~~~~~ 425 (632) +.......+++ ..-+...+.+.+...++..+..+..++.++..-++..++..|... |+ |+... T Consensus 1 M~i~~~~l~~l-------------~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewi---Ger~i 64 (305) T protein:vir:19 1 MIVTPASIKAL-------------MTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV---GKRTI 64 (305) T ss_pred CccCHHHHHHH-------------HHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhh---cceee Confidence 00000001111 112344555566665555555678888888888999999999875 56 78889 Q ss_pred CcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC---Ccccccc-ceecccccc-- Q lcl|Aclame:pro 426 SDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG---LANDPVG-LLNMTGVPA-- 499 (632) Q Consensus 426 ~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g---~~~~~~G-il~~a~~~~-- 499 (632) .++.....++.-++|...+.|.|+.|+||.+++..-+.+.||++++...|..++.-.. +....+| -+|+++|.. T Consensus 65 ~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~ 144 (305) T protein:vir:19 65 QQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYP 144 (305) T ss_pred eeccccceeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCccc Confidence 9999999999999999999999999999999999999999999999999998864321 1122333 355555532 Q ss_pred -cccc--------------------------------------------------------------------------- Q lcl|Aclame:pro 500 -LTYP--------------------------------------------------------------------------- 503 (632) Q Consensus 500 -~~~~--------------------------------------------------------------------------- 503 (632) ..++ T Consensus 145 ~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq 224 (305) T protein:vir:19 145 NVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQ 224 (305) T ss_pred CCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchh Confidence 1111 Q ss_pred -----ccchhHHHHHHHHHHHHhhcccccc-----ceEEeehhHHHHHHHHhhcc--cCCceeeccccccC-cceEEcCC Q lcl|Aclame:pro 504 -----AGGVDWASVVDMETKISTFNADAGR-----LAYLTSVTQRGAAKKAQVFD--NTGERIWQNNEVNG-YRAEASNQ 570 (632) Q Consensus 504 -----~~~~~~~~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~d--~~g~~~~~~~~l~G-~pv~~~~~ 570 (632) .++++.+.+..++.+|..+..+.+. +.++++|..........+.. ..+.-.-..|++.| ..+++++. T Consensus 225 ~a~gS~~~Ls~~nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~qll~s~~i~~g~~~~~Np~~g~~eliV~P~ 304 (305) T protein:vir:19 225 MAVAVKGDLTLDNLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQLLNRELFADGNTTVSNEMKGKLQLVVADY 304 (305) T ss_pred heecCCCCCCHHHHHHHHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHHHHhhcccCCccccccceecceEEEEeccc Confidence 1234445555555555544332221 22344444333333322221 11111111233444 24445554 Q ss_pred C Q lcl|Aclame:pro 571 I 571 (632) Q Consensus 571 ~ 571 (632) + T Consensus 305 L 305 (305) T protein:vir:19 305 L 305 (305) T ss_pred C Confidence 4 No 145 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.70 E-value=4.7e-09 Score=66.26 Aligned_cols=268 Identities=16% Similarity=0.125 Sum_probs=152.4 Q ss_pred hccc-ccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeee Q lcl|Aclame:pro 357 LEKK-TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSF 435 (632) Q Consensus 357 ~~~~-~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~ 435 (632) +..+ .++....++.+++++..+...+.+..+...+..+.--+...+++++..+.. ...-..+++.+....++..++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~-tV~dY~~~~~i~~d~ltt~~~~l 79 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTP-VVRSRPEQGDFTFDNLDTGEISI 79 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccccc-ccccccCCCCcccccCCCceEEE Confidence 2222 222334556677777777777766655555422222234556777776543 33444455565555566665555 Q ss_pred eeee--eeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhh----cCCC-c--cccccceeccccc-ccccccc Q lcl|Aclame:pro 436 SPKT--IAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT----GTGL-A--NDPVGLLNMTGVP-ALTYPAG 505 (632) Q Consensus 436 ~~~t--~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~----g~g~-~--~~~~Gil~~a~~~-~~~~~~~ 505 (632) .+.+ |-+ +.|++. ..+...++.+...++++.+++...|..+.. |... + +.|. ..+...+. ...+++. T Consensus 80 ~IDq~KYfa-f~VdDD-~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~-vin~~~~~iv~~gt~~ 156 (322) T protein:vir:31 80 ILRDEVYAG-NAISKK-LRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPN-VINGVPHRFVGTGTDQ 156 (322) T ss_pred EEehhhhhc-cccchh-HHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcc-eecCCccceeccCCCc Confidence 5544 433 457774 456778999999999999999988876521 1110 0 0111 11111111 1223344 Q ss_pred chhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHH-----HhhcccC-------Cc--eeeccccccCcceEEcCCC Q lcl|Aclame:pro 506 GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK-----AQVFDNT-------GE--RIWQNNEVNGYRAEASNQI 571 (632) Q Consensus 506 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~d~~-------g~--~~~~~~~l~G~pv~~~~~~ 571 (632) ...|+.|+++..+|..++.+......+++|.....|.. ..++|.. |- -+...+.++|+.|++|+.+ T Consensus 157 ~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~l 236 (322) T protein:vir:31 157 TMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLL 236 (322) T ss_pred hhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeeccc Confidence 56789999999999999988766555555665443311 1233311 10 0112478999999999998 Q ss_pred CCcc--EEEEeh---------hhE----------EEEEecce---EEEEecccccccCcEEEEEEEEeCcEEecccceEE Q lcl|Aclame:pro 572 PADT--WIFGDW---------SQI----------VIAMWGVL---DLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCI 627 (632) Q Consensus 572 ~~~~--~~~gd~---------s~~----------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~ 627 (632) +.+. ++.|.- +.+ .++-|..+ +-.+++ ....-.+++.+++|.++.+|+.++. T Consensus 237 ~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~----~~~~d~~~~~~~~g~g~~r~e~l~~ 312 (322) T protein:vir:31 237 ADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDD----YNDDLNTATTARWGNGLVRDENLVC 312 (322) T ss_pred cccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCc----cccccceeeeeeecceeecccceEE Confidence 7432 222211 111 11122222 111122 2233468999999999999999999 Q ss_pred EEecC Q lcl|Aclame:pro 628 AKKGA 632 (632) Q Consensus 628 ~~~~A 632 (632) |...| T Consensus 313 ~~a~~ 317 (322) T protein:vir:31 313 VLANA 317 (322) T ss_pred EEecc Confidence 88888 No 146 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.70 E-value=7.8e-09 Score=65.03 Aligned_cols=281 Identities=10% Similarity=-0.029 Sum_probs=142.6 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +..-.-..|.....+.... .+.+ +.+...+...+...+.++.+...+.-..+.++.+++. +...+....-|.++... T Consensus 1 ms~~~~~tr~~~~~s~~d~-al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~ 77 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADV-DIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERS 77 (335) T ss_pred CCCcccchhhhcccccchh-heeh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCCC Confidence 1111111111112222221 2233 5556666666666777766643333334556777777 44566777777777766 Q ss_pred cccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhh----cCCC--ccccc-----cceec Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT----GTGL--ANDPV-----GLLNM 494 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~----g~g~--~~~~~-----Gil~~ 494 (632) .+..++..+.+.++=. ...|-+.--..+.+++.+.+.+.+|.++++..|..++. +... ..... |+... T Consensus 78 ~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~ 157 (335) T protein:vir:63 78 RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEK 157 (335) T ss_pred CccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCccee Confidence 6777777777776532 22233322233456889999999999999999997752 1111 01111 21111 Q ss_pred ccc-ccccccccchhHHHHHHHHHHHHhhcccc---ccceEEeehhHHHHHHHH-hhccc-----CCc--eee-cccccc Q lcl|Aclame:pro 495 TGV-PALTYPAGGVDWASVVDMETKISTFNADA---GRLAYLTSVTQRGAAKKA-QVFDN-----TGE--RIW-QNNEVN 561 (632) Q Consensus 495 a~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~-~~~d~-----~g~--~~~-~~~~l~ 561 (632) ... +....++...-.+.+.++...|..+..+. .....++.|..+..|... ++-+. +|. +.+ ....++ T Consensus 158 ~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~ 237 (335) T protein:vir:63 158 LDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAILN 237 (335) T ss_pred eeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEee Confidence 100 00001111122244556667777766653 223455555555444322 11110 111 111 123589 Q ss_pred CcceEEcCCCCCcc-----------EEEEehhhEE--E--------EEecceEEEEecccccccCcEEEEEEEEeCcEEe Q lcl|Aclame:pro 562 GYRAEASNQIPADT-----------WIFGDWSQIV--I--------AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVR 620 (632) Q Consensus 562 G~pv~~~~~~~~~~-----------~~~gd~s~~~--~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~ 620 (632) |.||+.++++|.+. .+-+|++... + +....+......+.. +-...+.+.+-+|.++. T Consensus 238 Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~--~~~~~i~~~~a~G~g~l 315 (335) T protein:vir:63 238 GVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE--KFSWVLDTFQMYNIGAR 315 (335) T ss_pred ceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc--hhhHHhHHHHHcCCccc Confidence 99999999998543 2344554322 1 111111111111111 11223555666999999 Q ss_pred cccceEEEEecC Q lcl|Aclame:pro 621 RKEAFCIAKKGA 632 (632) Q Consensus 621 ~~~a~~~~~~~A 632 (632) ||++.+.++..- T Consensus 316 RPe~a~~i~~tg 327 (335) T protein:vir:63 316 RPDTAGAIELKG 327 (335) T ss_pred ccceEEEEEEcC Confidence 999999999876 No 147 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.67 E-value=8.9e-09 Score=64.73 Aligned_cols=280 Identities=10% Similarity=-0.030 Sum_probs=141.7 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +..-....+.....+.... .+.+ +.+...+...+...+.++.+...+.-..+.++.+++. +...+....-|.+..-. T Consensus 1 ms~~~~~t~~~~~~s~~d~-al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~~ 77 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADV-DIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELERS 77 (335) T ss_pred CCccccccccccccccchh-hhhh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCCC Confidence 1111111111112222221 2233 5556666666666777776644333334556777766 44456667677777666 Q ss_pred cccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhc----CC--Cccccc-----cceec Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTG----TG--LANDPV-----GLLNM 494 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g----~g--~~~~~~-----Gil~~ 494 (632) .+..++..+.+.++=. ...|-+.--..+.+++.+.+.+.+|.++++..|..++.. .. +..... |+... T Consensus 78 ~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~~ 157 (335) T protein:vir:78 78 RVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEK 157 (335) T ss_pred CcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCccee Confidence 6777777777776532 222333222334568899999999999999999987521 11 111111 21111 Q ss_pred ccc-ccccccccchhHHHHHHHHHHHHhhccccc---cceEEeehhHHHHHHHH-hhcc-----cCCc--eee-cccccc Q lcl|Aclame:pro 495 TGV-PALTYPAGGVDWASVVDMETKISTFNADAG---RLAYLTSVTQRGAAKKA-QVFD-----NTGE--RIW-QNNEVN 561 (632) Q Consensus 495 a~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~d-----~~g~--~~~-~~~~l~ 561 (632) ... +....+....-.+.+.++...|..+..+.. ....++.|..+..|... ++.+ .+|. +.+ ....++ T Consensus 158 ~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~ 237 (335) T protein:vir:78 158 LDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAILN 237 (335) T ss_pred eeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEee Confidence 100 011111112223444445555665555432 23456666666554432 1111 1111 111 123689 Q ss_pred CcceEEcCCCCCcc-----------EEEEehhh----------EEEEEecceEEEEecc-cccccCcEEEEEEEEeCcEE Q lcl|Aclame:pro 562 GYRAEASNQIPADT-----------WIFGDWSQ----------IVIAMWGVLDLKVDPY-TKAASDGLVLRVFQDVDAGV 619 (632) Q Consensus 562 G~pv~~~~~~~~~~-----------~~~gd~s~----------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~~v 619 (632) |.||+.++++|.+. .+-+|++. +..+....+......+ ..| ...+.+.+-+|.++ T Consensus 238 Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~---~~~i~~~~a~G~g~ 314 (335) T protein:vir:78 238 GVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQF---SWVLDTFQMYNIGA 314 (335) T ss_pred ceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchh---hHhhhHHHHcCCcc Confidence 99999999999543 22234433 1111111122111111 112 12355566689999 Q ss_pred ecccceEEEEecC Q lcl|Aclame:pro 620 RRKEAFCIAKKGA 632 (632) Q Consensus 620 ~~~~a~~~~~~~A 632 (632) .||++.+.++..- T Consensus 315 lRPe~a~~i~~tg 327 (335) T protein:vir:78 315 RRPDTAGAIELKG 327 (335) T ss_pred cCcceEEEEEecC Confidence 9999999999876 No 148 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.66 E-value=1.6e-08 Score=63.35 Aligned_cols=265 Identities=11% Similarity=0.013 Sum_probs=156.6 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcce--------eeccCceeEEEEEecCC-ccccccccCc-ccccCc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGAR--------MLPGLVGDVDIPKKTSG-ANFYWIGEDE-DVQDSD 427 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~-~~a~~v~E~~-~~~~~~ 427 (632) +....+.-..+|.|+++..-+.+.....+.+.+.+.- ...+.+..+++|....- +.+..+.|+. .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 2222233445677777666555555444444332211 11234456777777643 5677788885 588888 Q ss_pred ccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC---CCcc--ccccceeccccccccc Q lcl|Aclame:pro 428 FDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT---GLAN--DPVGLLNMTGVPALTY 502 (632) Q Consensus 428 ~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~---g~~~--~~~Gil~~a~~~~~~~ 502 (632) ++-++-...++..+..+.++.....-+.-+....+.++++....+..++.++... -... ...+.+........+. T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 8888888889999999999998877777788888999999888888877665321 1100 0111111111222233 Q ss_pred cccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc----cCCceeeccccccCcceEEcCCCCCc---- Q lcl|Aclame:pro 503 PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD----NTGERIWQNNEVNGYRAEASNQIPAD---- 574 (632) Q Consensus 503 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~g~~~~~~~~l~G~pv~~~~~~~~~---- 574 (632) +...++.+.+.++..+|..... ....++||+.....++...+.+ ..+. ..-+.++|++|++++.+|.. T Consensus 161 ~~a~~s~~~l~~A~~~~GD~~~--~~~~ivmhS~v~~~L~~~~li~~~~~s~~~--~~i~~~~G~~VivdD~~p~~~~~y 236 (330) T protein:vir:10 161 ASTGIDAGMVLDAKQLLGDSAD--QVTAIAMHSAVYTKLQKDNLIQYIQPTTAT--INIPTYLGYRVIIDDGIAPTGDIY 236 (330) T ss_pred cccccCHHHHHHHHHHhccccc--cceEEEEcHHHHHHHHHhhhhhhhcccccC--cccccccceEEEEeCCCCCCCCce Confidence 4456788999999999877543 3567889998887776654433 2222 12367899999999999843 Q ss_pred -cEEEEehhhEEEEEe---cceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEec---C Q lcl|Aclame:pro 575 -TWIFGDWSQIVIAMW---GVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG---A 632 (632) Q Consensus 575 -~~~~gd~s~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~---A 632 (632) +++|+ ...+.+.+. ..+.++.++ +...+...+..+.+ -+++|.++.+-+.. + T Consensus 237 t~yl~~-~GAi~~~~~~~~~~v~~EtdR--d~~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~ 295 (330) T protein:vir:10 237 TSYLFR-TGSIGLNTGNPSGLTTFETSR--EAAKGNDMIYTRRA---LVMHPYGVKWTGAEVDAG 295 (330) T ss_pred eEEEEe-cCceeeecccCCccccccccC--CccccceEEEEeeE---EEeeeeeeeecccccccC Confidence 23343 222222221 112223332 23345555555555 34667777775431 1 No 149 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.66 E-value=1e-08 Score=64.39 Aligned_cols=262 Identities=11% Similarity=0.009 Sum_probs=152.9 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhccee--------eccCceeEEEEEecCC-ccccccccCcccccCc Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--------LPGLVGDVDIPKKTSG-ANFYWIGEDEDVQDSD 427 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~ 427 (632) +. .+.-..+|.|+++..-+.+.......+.+.+.-. .......+++|....- +.+..+.|+..++..+ T Consensus 1 MA---~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MA---ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNN 77 (351) T ss_pred CC---ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhe Confidence 21 1223456777777665555444444443322111 1223446777776543 5778889999999999 Q ss_pred ccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC---CCcccc-ccceecccccccccc Q lcl|Aclame:pro 428 FDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT---GLANDP-VGLLNMTGVPALTYP 503 (632) Q Consensus 428 ~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~---g~~~~~-~Gil~~a~~~~~~~~ 503 (632) ++-++-...++..+..+.++.....-+.-+....|.++++...++..+..++.-. -..... .+.. .+....+.+ T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~--~d~t~~~~~ 155 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKV--YDQTKVSPS 155 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccce--ecccccccc Confidence 9888888888999999999997766666677888999999999988888776422 111111 0000 111122334 Q ss_pred ccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhh----cccCCceeeccccccCcceEEcCCCCCc----- Q lcl|Aclame:pro 504 AGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV----FDNTGERIWQNNEVNGYRAEASNQIPAD----- 574 (632) Q Consensus 504 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~g~~~~~~~~l~G~pv~~~~~~~~~----- 574 (632) ...++++.|.++..+|.....+ ....++||+.....++...+ ++.+|.. .-+.++|++|++++.+|.. T Consensus 156 ~~~is~~~l~~A~~~~GD~~~~-~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~--~i~t~~G~~VivdD~~p~~~~~~~ 232 (351) T protein:vir:15 156 EPMFGAKGFTGAIGLMGDLQDT-AFGAIAVNSATYSLMKVQGLIETIQPQNGAT--PFEAYNGLRIVLDDDIEIDLTDKT 232 (351) T ss_pred ccccCHHHHHHHHHHhcccccc-ceEEEEEChHHHHHHHhhhhhhhccccccCc--ccceecceEEEEcCCCccccCCCC Confidence 4568889999999998765432 24677888888777665443 3333321 2367999999999999842 Q ss_pred -----cEEEEehhhEEEEEec-ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEe--cC Q lcl|Aclame:pro 575 -----TWIFGDWSQIVIAMWG-VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKK--GA 632 (632) Q Consensus 575 -----~~~~gd~s~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~--~A 632 (632) +++|+.= .+.+.... .+++.+++.. ..+...+..+.+ -+++|.++.+-+. .+ T Consensus 233 ~~~ytsyl~~~G-Ai~~~~~~~~ve~~rd~~~--~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~ 292 (351) T protein:vir:15 233 KPVSTSYIFAPG-AVRYSTNMRSTETKYDPLI--NGGQDVIVQKRV---GTIHVAGTSIKASFSPS 292 (351) T ss_pred CceeEEEEEecc-eeeeecCCcCcceeecccC--CCCceEEEEeee---eeeeeeeeeeccccccc Confidence 2333321 11122211 1333333322 233333333333 3477777776432 11 No 150 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.65 E-value=5.1e-09 Score=66.05 Aligned_cols=235 Identities=13% Similarity=0.080 Sum_probs=125.4 Q ss_pred cceeeccCceeEEEEEecCCccccccccCccccc--Ccccceeeeeee--eeeeeeehhhHHHhhcChhHHHHHHHHHHH Q lcl|Aclame:pro 392 GARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQD--SDFDFTTLSFSP--KTIAGAVPVTRKLRKQSSIHVENLIREDLI 467 (632) Q Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~--~~~~~~~~~~~~--~t~~~~~~iSre~l~d~~~~~~~~i~~~l~ 467 (632) -.+.+++ +.++.+++.+ ...+....-|.++.. ..+.-++.++.+ .+|.. +.|...--..+.+++.+.+.+.++ T Consensus 1 ~vr~i~~-g~s~~~~~iG-~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~-~~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTITS-GKSAQFPVMG-RTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD-VLIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred Ceeeeec-CceEEEeeee-eeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh-hhhhhHHHHhcCccchhHHHHHHH Confidence 1333333 4456777763 445555555665533 334445544443 33333 223332222345678999999999 Q ss_pred HHHHHHHHHHHhhcC---C---C---ccccc--c---ceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEe Q lcl|Aclame:pro 468 EGIGVALDLAMLTGT---G---L---ANDPV--G---LLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLT 533 (632) Q Consensus 468 ~a~a~~~~~~~~~g~---g---~---~~~~~--G---il~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 533 (632) .++++..|..++... . + ..+.. | +.+.++...........-++.|.++...|..++.+....+.++ T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv 157 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYT 157 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEe Confidence 999999998775221 1 0 00010 1 0111111111111122336778888889998888876666667 Q ss_pred ehhHHHHHHHHhhcccCCceee----cc---ccccCcceEEcCCCCCccE-------------------------EEEeh Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGERIW----QN---NEVNGYRAEASNQIPADTW-------------------------IFGDW 581 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~~~----~~---~~l~G~pv~~~~~~~~~~~-------------------------~~gd~ 581 (632) .|.....|. ...+...+.+.. .. +.++|++|+.++++|.... +-+|+ T Consensus 158 ~P~~y~~Ll-~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~ 236 (324) T protein:vir:99 158 DPDTYSAIL-AALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGA 236 (324) T ss_pred ChHHHHHHh-hcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccccc Confidence 676665443 221222222221 11 4679999999999986421 22333 Q ss_pred hhE----------EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 582 SQI----------VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 582 s~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.. .......+.+....+. .+-...+++.+-+|.++.||++.+.++..| T Consensus 237 ~~~~gl~~~~~a~~tv~~~~~~~e~~~~~--~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~ 295 (324) T protein:vir:99 237 DNVVGLFVHRSAVATLKLKDMALERARRP--EYQADQIIAKYAMGHGGLRPEAVGAIIFED 295 (324) T ss_pred CceeEEEEehhheEEEeeecceecceech--hhHHHhhhhhhhhcCcccccceEEEEEEcc Confidence 321 1111222222222221 122335777888899999999998888777 No 151 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.65 E-value=5e-09 Score=66.08 Aligned_cols=285 Identities=11% Similarity=-0.006 Sum_probs=139.7 Q ss_pred HHHHHHhhhhhhhhhhhHHhhhhhccccccccc-ceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCc Q lcl|Aclame:pro 334 IADASGKEARGFYMPHEVLVQRQLEKKTAGKGG-ELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGA 412 (632) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 412 (632) .+ ..........+.... ..++. ..+.-+.+...+...+...+.+..+.....-....++.+++.+. . T Consensus 1 ma---------~~~~~~~~~t~~~~~--~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~-~ 68 (347) T protein:vir:15 1 MA---------NIQGGQQIGTNQGKG--QSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGR-T 68 (347) T ss_pred CC---------ccccCCccccccccC--CCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccc-e Confidence 00 000000000000000 01111 11233455666666777777766664332223345677777764 4 Q ss_pred cccccccCccccc--Ccccceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC------ Q lcl|Aclame:pro 413 NFYWIGEDEDVQD--SDFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG------ 483 (632) Q Consensus 413 ~a~~v~E~~~~~~--~~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g------ 483 (632) .+.....|.+++. ..+..++.++.+.++-. .+.|.+.--.....++.+.+.+.++.++++..|..++.... T Consensus 69 t~~~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:15 69 KAAYLKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred eeeeeccCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 5566666766644 33556666666554422 23343333333556788999999999999999998863211 Q ss_pred --Cccc---ccc--ceeccccccccccccchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh-h--ccc Q lcl|Aclame:pro 484 --LAND---PVG--LLNMTGVPALTYPAGGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-V--FDN 549 (632) Q Consensus 484 --~~~~---~~G--il~~a~~~~~~~~~~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~d~ 549 (632) +... |.+ +................. ++.+.++...|..++.+......++.|..+..|.... + .+. T Consensus 149 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~ 228 (347) T protein:vir:15 149 DASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY 228 (347) T ss_pred ccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccc Confidence 0000 000 110000000000111111 4556666677888887766555555566655443221 1 111 Q ss_pred CCceeecc---ccccCcceEEcCCCCCccE-------E---------------EEehhh----------EEEEEecceEE Q lcl|Aclame:pro 550 TGERIWQN---NEVNGYRAEASNQIPADTW-------I---------------FGDWSQ----------IVIAMWGVLDL 594 (632) Q Consensus 550 ~g~~~~~~---~~l~G~pv~~~~~~~~~~~-------~---------------~gd~s~----------~~~~~~~~~~~ 594 (632) .|.-.... +.++|++|+.++++|.... . -++|+. +-......+.+ T Consensus 229 ~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~ 308 (347) T protein:vir:15 229 QALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLAL 308 (347) T ss_pred cccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceee Confidence 11111122 4689999999999985321 0 111111 00111122233 Q ss_pred EEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 595 KVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 595 ~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +..... .+-...+++.+.+|.++++|++.+.++.+= T Consensus 309 e~~~~~--~~~~d~i~~~~~~G~~vlrP~~av~~~~~~ 344 (347) T protein:vir:15 309 ERARRA--NYQADQIIAKYAMGHGGLRPEAAGAIVLPK 344 (347) T ss_pred eecccc--hhhhhhhehhhhcCCceeccccEEEEecCC Confidence 322221 222335777788899999999988887777 No 152 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.54 E-value=4.5e-08 Score=60.86 Aligned_cols=278 Identities=10% Similarity=-0.035 Sum_probs=140.3 Q ss_pred Hhhhhhccc---ccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCcccccccc--Cc----- Q lcl|Aclame:pro 352 LVQRQLEKK---TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGE--DE----- 421 (632) Q Consensus 352 ~~~~a~~~~---~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E--~~----- 421 (632) ....+..+. -..+-......++......-+.+..+.+..- ++..+.......+-..+. ..+.-+++ .. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~t-V~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQY-CQHKNESSESHNWETLAS-MDPDAVKRKRSRQQSAD 78 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcc-cccccccccccceeeccc-ccccccccccccccccC Confidence 000111100 0001111111222222112222222222221 221111111111111111 11111111 11 Q ss_pred ---ccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC-CCc--cccccceecc Q lcl|Aclame:pro 422 ---DVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT-GLA--NDPVGLLNMT 495 (632) Q Consensus 422 ---~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~-g~~--~~~~Gil~~a 495 (632) ..|......+........+.....|.+.-......+..+...+..+.+++++.|+.++.+. |.+ ..+.+..... T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ 158 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFL 158 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccC Confidence 1222233455566666666667788887777777788899999999999999999888643 211 1111222222 Q ss_pred ccccccccccchhHHHHHHHHHHHHhhcccccc-ceEEeehhHHHHHHHH-hhc--ccC-Cceeec---cccccCcceEE Q lcl|Aclame:pro 496 GVPALTYPAGGVDWASVVDMETKISTFNADAGR-LAYLTSVTQRGAAKKA-QVF--DNT-GERIWQ---NNEVNGYRAEA 567 (632) Q Consensus 496 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~--d~~-g~~~~~---~~~l~G~pv~~ 567 (632) ....+..++.+++++.|+++...+..+..+... ...++.|..+..|... .+. |-. ...+.. .+.++|+.++. T Consensus 159 ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~ 238 (322) T protein:vir:10 159 ATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIV 238 (322) T ss_pred CCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEE Confidence 233344556688999999999999988887544 4444555554332211 111 111 122222 35799999999 Q ss_pred cCCCCCc------------------cEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEE Q lcl|Aclame:pro 568 SNQIPAD------------------TWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAK 629 (632) Q Consensus 568 ~~~~~~~------------------~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~ 629 (632) ++.+|.. +.+++.-+.+.++....+....+..... .....+++.+-+|+.+++|++++.+. T Consensus 239 s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~-~~a~~I~~~~~~Ga~ri~~~gVv~i~ 317 (322) T protein:vir:10 239 STRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA-SFAWRIYSAFTADCVRVEDEHIFKLR 317 (322) T ss_pred eccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc-chhhhhhhhhhhCceEeccCcEEEEE Confidence 9998832 1334444455555554444443322211 12345777789999999999999988 Q ss_pred ecC Q lcl|Aclame:pro 630 KGA 632 (632) Q Consensus 630 ~~A 632 (632) ..= T Consensus 318 ~~e 320 (322) T protein:vir:10 318 LKN 320 (322) T ss_pred Eec Confidence 866 No 153 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=98.46 E-value=2.4e-07 Score=56.84 Aligned_cols=377 Identities=12% Similarity=0.025 Sum_probs=156.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhh Q lcl|Aclame:pro 230 TRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSL 309 (632) Q Consensus 230 ~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (632) .|.... ........+....+....+.....+.......... +........+ -.....+........+.... T Consensus 1 ~~~s~~--~~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~--~~~~~~k~~e-----l~kT~Sel~~ei~k~e~eln 71 (400) T protein:vir:93 1 MRISKR--NMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN--AIEDLPKVQE-----LEKTLSENSIEIIKIENELN 71 (400) T ss_pred Cccccc--ccccchHHHHHHHHhhhhhhhhhhhhhhhccchhh--hhhhchhHHH-----HHHHHHHhHHHHHHHhhhhh Confidence 000000 00000001111111000011101111000000000 0000000000 00000000000000000000 Q ss_pred hhhhhhhhhhhhhhhh-hhhHHHHHHHHHHHhhhh---hhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhh Q lcl|Aclame:pro 310 MRAINAAATGDWSKAG-FEREVSLAIADASGKEAR---GFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNK 385 (632) Q Consensus 310 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~ 385 (632) .. ............+ .......+..+.+..... ........+..++.. .+ +....+| .-+-..|...+... T Consensus 72 ~~-~E~~Kgk~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt-~t--d~n~iLP-~~il~aIq~al~~~ 146 (400) T protein:vir:93 72 AQ-EEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT-IT--DTTFQLP-RKLVESINTALLNT 146 (400) T ss_pred hh-hhhcccchhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccc-cC--Cchhhcc-hHHHHHHHHhhhcc Confidence 00 0000000000000 000011111111111000 000011112222221 11 1122333 33445566777776 Q ss_pred hhhhhhcceeeccCceeEEEEEecCCccccc-cccCcccccCcccceeeeeeeeeeeeeehhhHHHh--hcChhHHHHHH Q lcl|Aclame:pro 386 AIIGQMGARMLPGLVGDVDIPKKTSGANFYW-IGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLR--KQSSIHVENLI 462 (632) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~-v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l--~d~~~~~~~~i 462 (632) .++.++. +++.... +-+.+......-+| -.-|.+++++.+++..-++.++-+..+..+..-.. .++.-.+..|| T Consensus 147 ~~~~~f~--~v~n~p~-l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYV 223 (400) T protein:vir:93 147 NPVFKVF--HVTNVGA-LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLI 223 (400) T ss_pred CCcccce--eeecCCc-eeeecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHH Confidence 6666642 1222211 11111112222233 44577788888899988888888877777733222 22334569999 Q ss_pred HHHHHHHHHHH-HHHHHhhcCCCccccccceeccccccc------cccccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 463 REDLIEGIGVA-LDLAMLTGTGLANDPVGLLNMTGVPAL------TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 463 ~~~l~~a~a~~-~~~~~~~g~g~~~~~~Gil~~a~~~~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) ..+|..++-+. .+.+++-|+|.+. ..++...+.+..+ +-.++...+.++..-...+... ....+...++.+ T Consensus 224 m~EL~q~vI~k~Ve~Aii~GdG~Ng-f~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~-~aad~~~Iv~s~ 301 (400) T protein:vir:93 224 VAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAE 301 (400) T ss_pred HHHHHHHHHHHHhhhheeecccccc-cCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhh-ccCCceeEEecc Confidence 99999999975 6999999988753 1222111111111 1112233334443322222221 223344445555 Q ss_pred hHHHHHHHHhhcccCCceeecccc-------ccCc-ceEEcCCCCCc-cEEEEehhhEEEEEecceEEEEecccccccCc Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQNNE-------VNGY-RAEASNQIPAD-TWIFGDWSQIVIAMWGVLDLKVDPYTKAASDG 606 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~~~~-------l~G~-pv~~~~~~~~~-~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~ 606 (632) .. +..+..++|.+|.+.+..+. -+|. .+++....+.. ..+..|-+.+. ++..+.......+.+++ T Consensus 302 d~--~A~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek~~i----~~~~~~t~~sf~~~tNs 375 (400) T protein:vir:93 302 DR--KALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHI----DMQDLTKVDAFEWKTNS 375 (400) T ss_pred ch--HHHHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeeeehhhhc----cccCceeccceeeeecc Confidence 54 45567889999999885432 2453 44555555543 23333544332 33334444455667788 Q ss_pred EEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 607 LVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 607 ~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) -.+.++.+.++-+.-|.+-++++++ T Consensus 376 ~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 376 NMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ceEEeeeeeccceecccceeeEeeC Confidence 8899999999999999999999999 No 154 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.33 E-value=3e-07 Score=56.33 Aligned_cols=269 Identities=10% Similarity=-0.003 Sum_probs=146.1 Q ss_pred hcccccccccceech--hhhhHHHHHHHhhhhhhhhhccee--eccCceeEEEEEecCCccccccccCc-ccccCcccce Q lcl|Aclame:pro 357 LEKKTAGKGGELVAT--ELLSEEFIDILRNKAIIGQMGARM--LPGLVGDVDIPKKTSGANFYWIGEDE-DVQDSDFDFT 431 (632) Q Consensus 357 ~~~~~~~~~~~~i~~--~~~~~~i~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~-~~~~~~~~~~ 431 (632) +...-..+++.++.. +.+.+.+++...+.-..+++.... .+.....+.+......+.+.|++..+ .+|..+.... T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 111111122222221 123344444433333333332211 12222345555666667778877654 4788888899 Q ss_pred eeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccc----c Q lcl|Aclame:pro 432 TLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP----A 504 (632) Q Consensus 432 ~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~----~ 504 (632) .....+..++..+.++.+-|+.. ..++...-....++++++.+|+.+|+|+.. ....|++|+.++...... . T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA-HGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc-ccceeEeecCCCccccccCCccC Confidence 99999999999999998777654 356778888888999999999999999754 345799998886544322 2 Q ss_pred cchhHHHHHHHHHHHHhhccccc-cceEEeehhHHHHHHHHhhcccCCceeecc-------ccccCcceEEcCCCCC-cc Q lcl|Aclame:pro 505 GGVDWASVVDMETKISTFNADAG-RLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA-DT 575 (632) Q Consensus 505 ~~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~-~~ 575 (632) ..--+++|..++..+..+..... +...++++.....+ ...-+.+|.-++.- .++-+.|......... +. T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L--~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a~~~g~~~ 237 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIM--QNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDYNGTGTSA 237 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHH--hhccCCCCccHHHHHHHhcCCceEEEeeeeccCCCCcceE Confidence 22336777777777765432222 22344544444433 33334444333221 1122222222211111 12 Q ss_pred EEEEe--hhhEEEEEecceEEEEecccccccCcEEEEEEEEeC-cEEecccceEEE---Eec Q lcl|Aclame:pro 576 WIFGD--WSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVD-AGVRRKEAFCIA---KKG 631 (632) Q Consensus 576 ~~~gd--~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-~~v~~~~a~~~~---~~~ 631 (632) +++.+ ...+.+..-. .+...+ .....-...+....+++ +-+.+|.||+.+ ++| T Consensus 238 ~v~~~~~~~~~~~~v~~--~~~~~~-~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 238 AIAYEKDPNNMAIEIPE--ATNALP-AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEEcCCceEEEEcCc--ceeeec-ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 23322 2223332222 222222 12223445677788885 788999999997 888 No 155 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.29 E-value=2.5e-07 Score=56.80 Aligned_cols=253 Identities=12% Similarity=-0.016 Sum_probs=129.3 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhh-hhhhhc-ceeecc-CceeEEEEEecCCccccccccCcccccCcccce-- Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKA-IIGQMG-ARMLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFT-- 431 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~-~~~~~~-~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~-- 431 (632) +..........+.++.. -+.+.+.-.... ++.-++ .+..|. ....+++|+..-...+.-|+||+++|.+.++.. T Consensus 1 mAe~nlt~~~dL~~~~s-idfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~ 79 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKS-IDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKD 79 (295) T ss_pred CCCcccccHhhccCcee-ehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeee Confidence 10000001111111110 011111111111 111121 233343 345789999887888899999999999998865 Q ss_pred -eeeeeeeeeeeeehhhHHHhhcCh-hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhH Q lcl|Aclame:pro 432 -TLSFSPKTIAGAVPVTRKLRKQSS-IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDW 509 (632) Q Consensus 432 -~~~~~~~t~~~~~~iSre~l~d~~-~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~ 509 (632) ..+++++++++.+ |.|++..+. -+....-.++|..+++++++..++.-..++.. .....+-...+ T Consensus 80 ~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~-----------t~tg~~lq~a~ 146 (295) T protein:vir:99 80 KDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPT-----------KVKGVGLQKAL 146 (295) T ss_pred eeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCce-----------eeehhhHHHHH Confidence 4778889998865 999985443 35778899999999999999999876643211 11111111222 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccC-----C-ceeeccccccCcc-eEEcCCCCCccEEEEehh Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNT-----G-ERIWQNNEVNGYR-AEASNQIPADTWIFGDWS 582 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-----g-~~~~~~~~l~G~p-v~~~~~~~~~~~~~gd~s 582 (632) +.+.+.+......+ .......++|.+...++...-...+ | .||- .++|.. ++.+..+|.++++.--.. T Consensus 147 a~~~~al~~f~Ee~--~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~---nfLG~q~II~S~kv~~G~~~aT~~~ 221 (295) T protein:vir:99 147 SASWAKLATFNEFE--GSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLK---NFLGMQNVIVMPSVPEGKIYSTAVE 221 (295) T ss_pred HHhhhhhhhccccc--CCceEEEEehHHHHHHHhccccccchhhhhhhhhhh---hhhccceEEEcccCCCceEEEeecc Confidence 23333333322222 2334566777776554432211111 1 1222 389997 999999999987755444 Q ss_pred hEEEEE---e-cceEEEEecccccccCcEEEEEEE-------------EeCc---EEecccceEEEEecC Q lcl|Aclame:pro 583 QIVIAM---W-GVLDLKVDPYTKAASDGLVLRVFQ-------------DVDA---GVRRKEAFCIAKKGA 632 (632) Q Consensus 583 ~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~-------------r~~~---~v~~~~a~~~~~~~A 632 (632) .+.+++ . +.+.- .+.+..|.+.+.+.. -+.+ -+-+.+++++.++.| T Consensus 222 Ni~~ay~~~~~g~l~~----~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~ 287 (295) T protein:vir:99 222 NLVFASLNVKGGDLGG----LFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEA 287 (295) T ss_pred ceEEEEecCCchhhhh----hhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEec Confidence 333211 1 11211 111111111111111 1122 234557899999887 No 156 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.27 E-value=7.2e-07 Score=54.27 Aligned_cols=269 Identities=10% Similarity=-0.008 Sum_probs=147.7 Q ss_pred ccccccccce-echhhhhHHHHHHHhhhhhhhhhcce--eeccCceeEEEEEecCCccccccccCcc-cccCcccceeee Q lcl|Aclame:pro 359 KKTAGKGGEL-VATELLSEEFIDILRNKAIIGQMGAR--MLPGLVGDVDIPKKTSGANFYWIGEDED-VQDSDFDFTTLS 434 (632) Q Consensus 359 ~~~~~~~~~~-i~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~~ 434 (632) ..+++++.++ -.-+.+.+.+++.+.+.-..+++... ..+.....+.+......+.+.+++.++. +|..+..+.... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 1222222211 11233455666666666666665322 2233333455666666677788776554 687888889999 Q ss_pred eeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccc------- Q lcl|Aclame:pro 435 FSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA------- 504 (632) Q Consensus 435 ~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~------- 504 (632) ..+..++..+.++.+-|... .+++...-.....+++++.+|+.+|+|+.. ....|++|+.+........ T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~~~ 159 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-YAIKGAFEATGIQIDVSPTTGVGNVS 159 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-ccceeeecCCCcccccccCccccccc Confidence 99999999999998777765 356778888889999999999999999764 3468999988754432111 Q ss_pred --cchh----HHHHHHHHHHHHhhccc-cccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCC--- Q lcl|Aclame:pro 505 --GGVD----WASVVDMETKISTFNAD-AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIP--- 572 (632) Q Consensus 505 --~~~~----~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~--- 572 (632) ...+ +++|..++.++..+... ..+...++++.....+......+..|..++.- ....+..|+..+.+. T Consensus 160 ~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~~g 239 (301) T protein:vir:80 160 KWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAGMG 239 (301) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceeccCC Confidence 1112 45666666666544222 12334556666555554333445555444321 111122233322221 Q ss_pred --Ccc--EEEEe-hhhEEEEEecceEEEEecccccccCcEEEEEEEEe-CcEEecccceEEEEec Q lcl|Aclame:pro 573 --ADT--WIFGD-WSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDV-DAGVRRKEAFCIAKKG 631 (632) Q Consensus 573 --~~~--~~~gd-~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~-~~~v~~~~a~~~~~~~ 631 (632) ... +++-+ ...+.+..-..+ ...+-. ...-......+.|+ |+-+.+|.||+.++== T Consensus 240 ~~g~~~~v~~~~~~d~~~~~v~~~~--~~~~~e-~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 240 TAGSDSFAVIHDSNETAELIIPMDI--TRHPEE-YSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCcccEEEEEecCCcEEEEEecCce--eeecce-ecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 111 22222 222222222222 221111 11112233345566 5688999999997766 No 157 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.25 E-value=9.8e-08 Score=59.02 Aligned_cols=282 Identities=11% Similarity=-0.005 Sum_probs=136.1 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +.......+.....+. ....+.-+.+...+...+...+.++.+...+.-..+.++.+++.+. .++....-|.+.... T Consensus 1 Ms~~n~~t~~~~~~s~--~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~-~~a~y~~~G~~ldg~ 77 (402) T protein:vir:97 1 MSTPNTLTNVAVSASG--EVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE-TELQVLAPGQSPNAT 77 (402) T ss_pred CCCccccccccccccc--chhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEee-eEEeeeccccccCCC Confidence 1111111111111111 1123343455566666666666666653333333455677777743 344555555555445 Q ss_pred cccceeeeeeeeeeeee-ehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHhh-----cC----CCccccccceecc Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAGA-VPVTRKLRKQSSIH-VENLIREDLIEGIGVALDLAMLT-----GT----GLANDPVGLLNMT 495 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~~-~~iSre~l~d~~~~-~~~~i~~~l~~a~a~~~~~~~~~-----g~----g~~~~~~Gil~~a 495 (632) .+.-++..+.+.++=.. ..|-+---..++++ +-+.+.+.+|+++++..|..++. +. .....|.+.- +. T Consensus 78 ~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~-~g 156 (402) T protein:vir:97 78 PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG-HG 156 (402) T ss_pred CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc-cc Confidence 56666766666654321 12221111123455 67899999999999999997742 11 1111121111 11 Q ss_pred ccccccc--cccchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH-hhc------ccCCceeec-ccccc Q lcl|Aclame:pro 496 GVPALTY--PAGGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA-QVF------DNTGERIWQ-NNEVN 561 (632) Q Consensus 496 ~~~~~~~--~~~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------d~~g~~~~~-~~~l~ 561 (632) ...++.. +....+ .+.|.++...+..++.+......++.|..+..+... ++- .+.|.+.+. ...++ T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 1111111 111122 355567777788888877666666666665544422 111 122223221 14689 Q ss_pred CcceEEcCCCCCcc--E---------------EEEehhhE--EEEEecceEE------EEecccccccCcEEEEEEEEeC Q lcl|Aclame:pro 562 GYRAEASNQIPADT--W---------------IFGDWSQI--VIAMWGVLDL------KVDPYTKAASDGLVLRVFQDVD 616 (632) Q Consensus 562 G~pv~~~~~~~~~~--~---------------~~gd~s~~--~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~r~~ 616 (632) |.||+.++++|... + +-+|++.- .++.+..+-. ..+-+.+-.+-...+-+++-+| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G 316 (402) T protein:vir:97 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) T ss_pred ceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhC Confidence 99999999998531 1 11444431 1222221111 1111111111111244556678 Q ss_pred cEEecccceEEEEecC Q lcl|Aclame:pro 617 AGVRRKEAFCIAKKGA 632 (632) Q Consensus 617 ~~v~~~~a~~~~~~~A 632 (632) .++.+|++...+.++= T Consensus 317 ~g~~RPeaa~vv~~~~ 332 (402) T protein:vir:97 317 AIPDRWEAVSVVTTKR 332 (402) T ss_pred CcccCccceEEEEEec Confidence 9999999877764443 No 158 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.19 E-value=4e-07 Score=55.66 Aligned_cols=259 Identities=9% Similarity=-0.030 Sum_probs=131.2 Q ss_pred hhhhccccccc-ccceechhh---hhHHHHHHHhhhhhhhhhcceeecc-CceeE---EEEEecCCccccccccCccccc Q lcl|Aclame:pro 354 QRQLEKKTAGK-GGELVATEL---LSEEFIDILRNKAIIGQMGARMLPG-LVGDV---DIPKKTSGANFYWIGEDEDVQD 425 (632) Q Consensus 354 ~~a~~~~~~~~-~~~~i~~~~---~~~~i~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~~~~~a~~v~E~~~~~~ 425 (632) ..+....+... -+....-++ +...+.+++..-.+. +..|. ....+ +++..+-...+.-|+||+.+|. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~-----r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Ipl 75 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQ-----NKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPL 75 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhh-----ccccccCCceeeeeeeeceeeccccccccCCcccch Confidence 00111111111 111111111 112222222222222 22222 12234 3444455577889999999999 Q ss_pred Ccccce---eeeeeeeeeeeeehhhHHHhhcC-hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccc Q lcl|Aclame:pro 426 SDFDFT---TLSFSPKTIAGAVPVTRKLRKQS-SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALT 501 (632) Q Consensus 426 ~~~~~~---~~~~~~~t~~~~~~iSre~l~d~-~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~ 501 (632) ++++.. ..++.+++|++.+ |.|+|..+ .-+....-.+.|..+++++++..++.-..++. ++.+ . T Consensus 76 skvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT-----~t~~-----~ 143 (303) T protein:vir:10 76 TKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAI-----ENGK-----R 143 (303) T ss_pred hhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcc-----cccc-----c Confidence 988754 5788899999966 99999543 34677889999999999999999986553321 0000 1 Q ss_pred ccccchhHHHHHHHHHHHH----hhccccccceEEeehhHHHHHHHHhhcccC----CceeeccccccCcceEEcCCCCC Q lcl|Aclame:pro 502 YPAGGVDWASVVDMETKIS----TFNADAGRLAYLTSVTQRGAAKKAQVFDNT----GERIWQNNEVNGYRAEASNQIPA 573 (632) Q Consensus 502 ~~~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~----g~~~~~~~~l~G~pv~~~~~~~~ 573 (632) +....++.+.|.+++.... ....+..+....+||.+...+......... |--+.. .++|..|+.+..+|. T Consensus 144 t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~--nfLG~~II~S~kv~~ 221 (303) T protein:vir:10 144 TNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT--PYVGVKIVEFADVPQ 221 (303) T ss_pred ccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh--hhhcceEEEeccCCC Confidence 1122345566666666443 223334455677777776554321111111 111111 389999999999999 Q ss_pred ccEEEEehhhEEEE---EecceEEEEecccccccCcEEEEEE----------EEeCcE---EecccceEEEEecC Q lcl|Aclame:pro 574 DTWIFGDWSQIVIA---MWGVLDLKVDPYTKAASDGLVLRVF----------QDVDAG---VRRKEAFCIAKKGA 632 (632) Q Consensus 574 ~~~~~gd~s~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~----------~r~~~~---v~~~~a~~~~~~~A 632 (632) ++++.--...+.++ ..+.+.....-.++ .+|.+.+--. .-+.+. +-+.+++++.++.+ T Consensus 222 G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D-~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~ 295 (303) T protein:vir:10 222 GEVWMTVAENLNVAYANPRGELSRAFAFATD-ATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKK 295 (303) T ss_pred ceEEEeeccceEEEEecCchhhhhhhhhccc-cccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEec Confidence 98775533333221 12111111100000 1111111100 011222 34557899999976 No 159 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.06 E-value=5.3e-07 Score=54.99 Aligned_cols=282 Identities=11% Similarity=0.000 Sum_probs=132.3 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +.......+.....+ .....+.-+.+...+...+...+.+..+...+.-..+.++.+++. +...+....-|.+.... T Consensus 1 Ms~~n~~t~~~~~~s--g~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~~ 77 (401) T protein:vir:70 1 MSTPNNLTNVAVSAS--GEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAAT 77 (401) T ss_pred CCCCccccccccccc--cchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCCC Confidence 110000001111111 111223334445555566666666666543333344556777777 44456666666666555 Q ss_pred cccceeeeeeeeeeee-eehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHhh-----cC----CCccccccce--- Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIH-VENLIREDLIEGIGVALDLAMLT-----GT----GLANDPVGLL--- 492 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~-~~~~i~~~l~~a~a~~~~~~~~~-----g~----g~~~~~~Gil--- 492 (632) .+.-++..+.+.++=. ...|-.---..++++ +.+.+.+.+|+++++..|..++. +. +....|.|.- T Consensus 78 ~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~ 157 (401) T protein:vir:70 78 STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGF 157 (401) T ss_pred CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCce Confidence 6666666666655422 122221111224555 67899999999999999986632 21 1112232211 Q ss_pred --eccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh--hcc------cCCceeec-ccccc Q lcl|Aclame:pro 493 --NMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ--VFD------NTGERIWQ-NNEVN 561 (632) Q Consensus 493 --~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~d------~~g~~~~~-~~~l~ 561 (632) +..+....+.....--.+.+.++...+..++.+......++ +..+..+.+.. +-+ ..|.+... ...+. T Consensus 158 ~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~-pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~va 236 (401) T protein:vir:70 158 SINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILM-PWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSY 236 (401) T ss_pred EEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEc-CHHHHHHHHhcCcccchhhccccCCccccceEEEEe Confidence 11111111111111233556677777887777754322222 22222222221 211 11222221 13589 Q ss_pred CcceEEcCCCCCcc---------------EE--EEehhhEE--EEEecce------EEEEecccccccCcEEEEEEEEeC Q lcl|Aclame:pro 562 GYRAEASNQIPADT---------------WI--FGDWSQIV--IAMWGVL------DLKVDPYTKAASDGLVLRVFQDVD 616 (632) Q Consensus 562 G~pv~~~~~~~~~~---------------~~--~gd~s~~~--~~~~~~~------~~~~~~~~~~~~~~~~~~~~~r~~ 616 (632) |.||+.++++|... .+ -||++... ++.+..+ .+..+.+.+-.+-...+-+++-+| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g 316 (401) T protein:vir:70 237 NCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEG 316 (401) T ss_pred ceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhC Confidence 99999999998632 11 14554421 1111111 111111111111122344666789 Q ss_pred cEEecccceEEEEecC Q lcl|Aclame:pro 617 AGVRRKEAFCIAKKGA 632 (632) Q Consensus 617 ~~v~~~~a~~~~~~~A 632 (632) .++.+|+|...++.+= T Consensus 317 ~g~~RPeaa~vv~~k~ 332 (401) T protein:vir:70 317 AIPDRWEAVSVVTTKR 332 (401) T ss_pred CcccchhheEEEeecC Confidence 9999999998886665 No 160 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.06 E-value=3.8e-07 Score=55.80 Aligned_cols=282 Identities=11% Similarity=0.012 Sum_probs=135.2 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccccccccCcccccC Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~ 426 (632) +.......+.....+. ....+.-+.+...+...+...+.+..+...+.-..+.++.+++. +...+....-|.++.-. T Consensus 1 Ms~~n~~t~p~~~gsg--~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~ 77 (400) T protein:vir:10 1 MSTPNNLTNVAVSASG--EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAAT 77 (400) T ss_pred CCCCcccccccccccc--chhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCC Confidence 1110000011111110 11122234445555566666666666543333344556777777 44566777777776555 Q ss_pred cccceeeeeeeeeeee-eehhhHHHhhcChhH-HHHHHHHHHHHHHHHHHHHHHhhcC-----CCc----cccccceecc Q lcl|Aclame:pro 427 DFDFTTLSFSPKTIAG-AVPVTRKLRKQSSIH-VENLIREDLIEGIGVALDLAMLTGT-----GLA----NDPVGLLNMT 495 (632) Q Consensus 427 ~~~~~~~~~~~~t~~~-~~~iSre~l~d~~~~-~~~~i~~~l~~a~a~~~~~~~~~g~-----g~~----~~~~Gil~~a 495 (632) .+.-++..+.+.++=. ...|-.---..++++ +-+.+.+.+|.++++..|..++... ... ..+.|+-... T Consensus 78 ~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~ 157 (400) T protein:vir:10 78 STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGF 157 (400) T ss_pred CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccccc Confidence 5666676666665432 122222111224566 6799999999999999998765211 101 1122221111 Q ss_pred cccccc--ccccchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH-hhcc------cCCceeec-ccccc Q lcl|Aclame:pro 496 GVPALT--YPAGGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA-QVFD------NTGERIWQ-NNEVN 561 (632) Q Consensus 496 ~~~~~~--~~~~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d------~~g~~~~~-~~~l~ 561 (632) .. .+. ......+ .+.+.++...+..++.+......++.+..+..+... ++-+ ..|.++.. ...+. T Consensus 158 s~-~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 158 SV-NVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred ce-eecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 11 111 1111112 234556666777777765433333323222222111 1111 11222211 12589 Q ss_pred CcceEEcCCCCCcc---------------E--EEEehhhEE--EEEecce------EEEEecccccccCcEEEEEEEEeC Q lcl|Aclame:pro 562 GYRAEASNQIPADT---------------W--IFGDWSQIV--IAMWGVL------DLKVDPYTKAASDGLVLRVFQDVD 616 (632) Q Consensus 562 G~pv~~~~~~~~~~---------------~--~~gd~s~~~--~~~~~~~------~~~~~~~~~~~~~~~~~~~~~r~~ 616 (632) |.||+.++++|... . +-||++... ++.+..+ .+..+.+.+-.+-...+-+++-+| T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G 316 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEG 316 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhC Confidence 99999999998532 1 225555421 1112111 111111111122223355667789 Q ss_pred cEEecccceEEEEecC Q lcl|Aclame:pro 617 AGVRRKEAFCIAKKGA 632 (632) Q Consensus 617 ~~v~~~~a~~~~~~~A 632 (632) .++.+|++...++.+= T Consensus 317 ~g~~RPeaa~vv~~~~ 332 (400) T protein:vir:10 317 AIPDRWEAVSVVTTKR 332 (400) T ss_pred CcccchhheEEEEecC Confidence 9999999999999876 No 161 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.05 E-value=1.4e-06 Score=52.75 Aligned_cols=259 Identities=12% Similarity=0.073 Sum_probs=130.0 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeecc-----CceeEEEEEecCCccccc----cccCcccccCcc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPG-----LVGDVDIPKKTSGANFYW----IGEDEDVQDSDF 428 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~a~~----v~E~~~~~~~~~ 428 (632) +. -.++.|+++.+.+++.++....+..+..+-..+ ....+++++........+ ..+++......+ T Consensus 1 Ma------~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) T protein:vir:99 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) T ss_pred Cc------cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCccccccc Confidence 11 123566778888888888777666653332211 123466665543322222 244566666777 Q ss_pred cceeeeeeee-eeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccch Q lcl|Aclame:pro 429 DFTTLSFSPK-TIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGV 507 (632) Q Consensus 429 ~~~~~~~~~~-t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~ 507 (632) ..+++++.+. ..+.-+.|+.+-...+..++...+.+..+++++..+|..++.-........+ .......... T Consensus 75 ~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~-------~~~~~~~~~~ 147 (392) T protein:vir:99 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA-------GAVHEVAPDE 147 (392) T ss_pred ccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------ccccccChhh Confidence 8888888873 3444677888777667778888899999999999999887642211111000 0111112334 Q ss_pred hHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH-hhc--ccCCc---eee---ccccccCcceEEcCCCCCccEEE Q lcl|Aclame:pro 508 DWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA-QVF--DNTGE---RIW---QNNEVNGYRAEASNQIPADTWIF 578 (632) Q Consensus 508 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--d~~g~---~~~---~~~~l~G~pv~~~~~~~~~~~~~ 578 (632) .++.|.++..+|..+..+.. ...++.+.....+... .+. +..|. ..+ .-+.+.|++|+.++.+|..+.+. T Consensus 148 ~~~~i~~a~~~L~~~~vP~~-R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a 226 (392) T protein:vir:99 148 FFKGVNGARRALNELYIPQG-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL 226 (392) T ss_pred hHHHHHHHHHHHhhcCCCCC-CEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccccccccee Confidence 67889999999998888753 4444554444333211 011 11111 112 22578999999999999877655 Q ss_pred EehhhEEEEEecc-----------------eEE--EEecccccccCcEEEEEEEEeCcEEecc---cceEE---EEecC Q lcl|Aclame:pro 579 GDWSQIVIAMWGV-----------------LDL--KVDPYTKAASDGLVLRVFQDVDAGVRRK---EAFCI---AKKGA 632 (632) Q Consensus 579 gd~s~~~~~~~~~-----------------~~~--~~~~~~~~~~~~~~~~~~~r~~~~v~~~---~a~~~---~~~~A 632 (632) +..+.+.+..... +.. ..+.......+...+ ....+...... .+|.. ++..+ T Consensus 227 ~~~~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v--~~~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 227 YHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLI--DTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred eeccccccccccccccccccceeEEecccceecceeecccceeecccccc--ceeEEEEEEeeccccceeeeeeeeeec Confidence 5443332211111 000 000000000111110 00111111110 01111 00000 No 162 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.92 E-value=6.9e-06 Score=48.88 Aligned_cols=291 Identities=9% Similarity=-0.033 Sum_probs=143.3 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceec--hhhhhHHHHHHHhhhhhhhhhccee--eccCce Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVA--TELLSEEFIDILRNKAIIGQMGARM--LPGLVG 401 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~--~~~~~~~i~~~~~~~~~~~~~~~~~--~~~~~~ 401 (632) +......+.... ......... .+......+.|.+.. -+.+.+.+++...+.-..+++.... .+.... T Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~------~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~ 71 (319) T protein:vir:10 1 MTTKKFDEADKS---NVEMYLIQA------GVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDK 71 (319) T ss_pred CCCcchhHHhhH---HHHHHHhhc------cchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceE Confidence 000000000000 000000000 000001111122211 1233444555554444444443221 222333 Q ss_pred eEEEEEecCCccccccccCcc-cccCcccceeeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 402 DVDIPKKTSGANFYWIGEDED-VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLA 477 (632) Q Consensus 402 ~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~ 477 (632) .+.+......+.+.|++.+.. +|..+.........+..++..+.++.+-|... .+++...-.....+++++.+|+. T Consensus 72 ~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i 151 (319) T protein:vir:10 72 TFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRL 151 (319) T ss_pred EEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceE Confidence 455666666677888876544 78888889999999999999999998777665 35677778888899999999999 Q ss_pred HhhcCCCccccccceeccccccccccc----cchh----HHHHHHHHHHHHhhccc-cccceEEeehhHHHHHHHHhhcc Q lcl|Aclame:pro 478 MLTGTGLANDPVGLLNMTGVPALTYPA----GGVD----WASVVDMETKISTFNAD-AGRLAYLTSVTQRGAAKKAQVFD 548 (632) Q Consensus 478 ~~~g~g~~~~~~Gil~~a~~~~~~~~~----~~~~----~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d 548 (632) +|+|+... ...|++|+.++.....+. +..+ +++|..++.++..+... ..+...++++.....+ ..... T Consensus 152 ~f~G~~~~-g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L--~~~~~ 228 (319) T protein:vir:10 152 VFKGSAPH-KIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVL--AIRMP 228 (319) T ss_pred EEeecccc-cceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhh--hcccC Confidence 99997543 467999998865543321 1112 34566666666544221 1223344555444333 33344 Q ss_pred cCCceeecc-------ccccCcceEEcCCCCC-ccEEEE--ehhhEEEEEecceEEEEecccccccCcEEEEEEEEeC-c Q lcl|Aclame:pro 549 NTGERIWQN-------NEVNGYRAEASNQIPA-DTWIFG--DWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVD-A 617 (632) Q Consensus 549 ~~g~~~~~~-------~~l~G~pv~~~~~~~~-~~~~~g--d~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~-~ 617 (632) .+|..++.- .++.+.|......... +.+++. +...+.+..-. .+...+- ....-...+....|++ + T Consensus 229 ~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~--~~~~~~~-e~~~l~~~~~~~~r~~Gv 305 (319) T protein:vir:10 229 ETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPE--AFNMLPA-QPKDLHFKVPCTSKCTGL 305 (319) T ss_pred CCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCc--ceeeeee-eecCceEEEeeeeeeEEE Confidence 455433321 1222223322211111 112222 12222222222 2222221 1111223444455654 5 Q ss_pred EEecccceEEEEec Q lcl|Aclame:pro 618 GVRRKEAFCIAKKG 631 (632) Q Consensus 618 ~v~~~~a~~~~~~~ 631 (632) -+.+|.||+.++== T Consensus 306 ~i~~P~ai~~~dGI 319 (319) T protein:vir:10 306 TIYRPMTIVLITGV 319 (319) T ss_pred EEEccceeEeeecC Confidence 68999999997766 No 163 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=97.92 E-value=2.5e-06 Score=51.28 Aligned_cols=271 Identities=13% Similarity=0.161 Sum_probs=156.6 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeec-cCceeEEEEEecCCccccccccCcccccCcccceeeee Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLP-GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSF 435 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~ 435 (632) +. .+++.-.+|..+..++.|...+...-+-..+...+.- ++..++.++.. +.+......|.+.+.+..+..+++++ T Consensus 1 ~~--~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQ--LTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITF 77 (313) T ss_pred Cc--ccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEE Confidence 11 2223345677777777777777665443333322332 33334555444 45667778899999999999999999 Q ss_pred eeeeeeee-ehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhh-cCC---Cccccccceeccccccccccccchh Q lcl|Aclame:pro 436 SPKTIAGA-VPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLT-GTG---LANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 436 ~~~t~~~~-~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~-g~g---~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) .+..|.+- ..||+.+-+|+- -++-+..+..-++++....+..++. |.. ..+.|.-+-....+...+.+..... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 99999874 589999988752 2344455556667777766666653 110 1112221111111222234455667 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH-hh---cccCCceeeccc---------cccCcceEEcCCCCC-- Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA-QV---FDNTGERIWQNN---------EVNGYRAEASNQIPA-- 573 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~d~~g~~~~~~~---------~l~G~pv~~~~~~~~-- 573 (632) ...+..+...|...+.+.....+++-|.....+.-. .+ -...|+.|...+ .+.|..+.+|+.+.. T Consensus 158 ~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN 237 (313) T protein:vir:95 158 LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVAN 237 (313) T ss_pred hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhcc Confidence 788999999999988888888888888765443321 11 123466666543 366767777664321 Q ss_pred -------ccEEEEeh----hh----EEEEEecceEE-EEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 574 -------DTWIFGDW----SQ----IVIAMWGVLDL-KVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 574 -------~~~~~gd~----s~----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.-++|++ ++ -.++-|+.+-- .-....+-..+ ...+..|.|+++.+.+-++.+-..| T Consensus 238 ~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~--~~~~~~R~G~Gi~R~~~L~~~~~~A 310 (313) T protein:vir:95 238 YNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARD--EHVVRCRYGFGIQRLDTLGLLATSA 310 (313) T ss_pred ccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccc--cceeeeeecccceeecceeEEEecc Confidence 12233321 11 12233333322 11111122222 3455679999999999888877777 No 164 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.87 E-value=6.8e-06 Score=48.90 Aligned_cols=260 Identities=11% Similarity=0.004 Sum_probs=125.3 Q ss_pred hhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhc-ceeeccC-ceeEEE-EEecCCcccccc Q lcl|Aclame:pro 341 EARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMG-ARMLPGL-VGDVDI-PKKTSGANFYWI 417 (632) Q Consensus 341 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~-~~~~~~-~~~~~~~~a~~v 417 (632) ....+..+.. .+.. ..+-+....-+ +.+.|-.-+.. ++.-++ .+..|.. ...++. +..+-...+.-| T Consensus 1 ~~~~~~~~e~-----nlt~--~~dl~~~~siD-f~~~f~~~i~~--L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dV 70 (296) T protein:vir:98 1 MVTSRTYPEE-----NLIK--STDLKYPITID-VTNKFQENISK--LLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNV 70 (296) T ss_pred CCCccccCcC-----CCcc--hhhhhhhhhhh-hHHHHhhhHHH--HHHHhhhcccccccCCCEEeeccceeeeeccccc Confidence 0000000000 0000 00001110111 11122211111 111121 2233332 335634 446667788899 Q ss_pred ccCcccccCccccee---eeeeeeeeeeeehhhHHHhhcC-hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccee Q lcl|Aclame:pro 418 GEDEDVQDSDFDFTT---LSFSPKTIAGAVPVTRKLRKQS-SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLN 493 (632) Q Consensus 418 ~E~~~~~~~~~~~~~---~~~~~~t~~~~~~iSre~l~d~-~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~ 493 (632) +||+++|.+.++-.+ .+..++++++.+ |.|++..+ .-+....-.+.|..+++++++..++.-..++.. T Consensus 71 aEGe~Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~------ 142 (296) T protein:vir:98 71 PEGEVIPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTG------ 142 (296) T ss_pred cCCcccchhhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccc------ Confidence 999999999988764 778889999985 99998544 346778899999999999999999876643311 Q ss_pred ccccccccccccchhHHHHHHHH----HHHHhhccc--cccceEEeehhHHHHHHHHhhcccCCceeecc---ccccCcc Q lcl|Aclame:pro 494 MTGVPALTYPAGGVDWASVVDME----TKISTFNAD--AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN---NEVNGYR 564 (632) Q Consensus 494 ~a~~~~~~~~~~~~~~~~i~~~~----~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~---~~l~G~p 564 (632) .... +.+.|.+++ .++.....+ .......++|.+...+.. -...+-+-.+.. ..++|.- T Consensus 143 -----t~~~-----t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg--~a~it~qt~fG~tyl~nfLG~~ 210 (296) T protein:vir:98 143 -----TQDA-----LGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAGITTQTAFGLTYLVDFTGTV 210 (296) T ss_pred -----eeee-----chhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhc--CCccchhheechhhhhhccccE Confidence 1111 112223222 222222222 234555667766543221 111111111111 1378888 Q ss_pred eEEcCCCCCccEEEEehhhEEEEEec--ceEEEEecccccccCcEEEEEEE-------------EeCcE---EecccceE Q lcl|Aclame:pro 565 AEASNQIPADTWIFGDWSQIVIAMWG--VLDLKVDPYTKAASDGLVLRVFQ-------------DVDAG---VRRKEAFC 626 (632) Q Consensus 565 v~~~~~~~~~~~~~gd~s~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-------------r~~~~---v~~~~a~~ 626 (632) ++.+..+|.++++.--...+.+++.. +-++ .....+..+.+.+.+.. -+.+. +-+.++++ T Consensus 211 II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l--~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv 288 (296) T protein:vir:98 211 IISTNDVTKGEIWATVPENIIFAYINPNNSEL--AKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIV 288 (296) T ss_pred EEEcCcCCCceEEEeeecceEEEeecccccch--hhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEE Confidence 99999999998876544443332211 1111 11111111111111111 11222 34557899 Q ss_pred EEEecC Q lcl|Aclame:pro 627 IAKKGA 632 (632) Q Consensus 627 ~~~~~A 632 (632) +.++.+ T Consensus 289 ~~tI~~ 294 (296) T protein:vir:98 289 KVTLTP 294 (296) T ss_pred EEEecC Confidence 999888 No 165 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=97.85 E-value=5e-06 Score=49.65 Aligned_cols=287 Identities=11% Similarity=0.030 Sum_probs=141.8 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceech--hhhhHHHHHHHhhhhhhhhhccee--eccCce Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVAT--ELLSEEFIDILRNKAIIGQMGARM--LPGLVG 401 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~--~~~~~~i~~~~~~~~~~~~~~~~~--~~~~~~ 401 (632) ..+.....++..... . ..+......+++.++.. +.+...+++...+.-..+++.... .+.... T Consensus 1 ~~~~~~~~~~~~~~~----------~---~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~e 67 (314) T protein:vir:10 1 MAIKFDAEQAKITTH----------L---EQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAK 67 (314) T ss_pred CccchHHHHHHHHHH----------H---HhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCcee Confidence 000000011000000 0 00111111122222222 223334444433333333332111 111223 Q ss_pred eEEEEEecCCccccccccCcc-cccCcccceeeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 402 DVDIPKKTSGANFYWIGEDED-VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLA 477 (632) Q Consensus 402 ~~~~~~~~~~~~a~~v~E~~~-~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~ 477 (632) .+.+......+.+.|++..+. +|..+..+.+....++.++..+.+|.+-|.-. .+++...-.....+++++.+|+. T Consensus 68 t~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i 147 (314) T protein:vir:10 68 YFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKL 147 (314) T ss_pred EEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceE Confidence 456666666777888877554 78888889999999999999999988777654 34677888888899999999999 Q ss_pred HhhcCCCccccccceecccccccccc----ccchhHHHHHHHHHHHHhhccccccc-eEEeehhHHHHHHHHhhcccCCc Q lcl|Aclame:pro 478 MLTGTGLANDPVGLLNMTGVPALTYP----AGGVDWASVVDMETKISTFNADAGRL-AYLTSVTQRGAAKKAQVFDNTGE 552 (632) Q Consensus 478 ~~~g~g~~~~~~Gil~~a~~~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~g~ 552 (632) +|+|+.. ....|++|+.+++..+.+ ...--+++|..++.++..+......+ ..++++.....+ ....+.+|. T Consensus 148 ~f~G~~~-~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L--~~~~~~~~~ 224 (314) T protein:vir:10 148 VWSGSAP-HGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVM--QGLVPQTNL 224 (314) T ss_pred EEeeccc-ccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhh--cccccCCCc Confidence 9999754 356799998876543322 11122566677777776543322222 344554444333 233333443 Q ss_pred eeec----c---ccccCcceEEcCCCCCcc-E-EEE-ehhhEEEEEecceEEEEecccccccCcEEEEEEEEe-CcEEec Q lcl|Aclame:pro 553 RIWQ----N---NEVNGYRAEASNQIPADT-W-IFG-DWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDV-DAGVRR 621 (632) Q Consensus 553 ~~~~----~---~~l~G~pv~~~~~~~~~~-~-~~g-d~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~-~~~v~~ 621 (632) -++. . -.+-+.|........... + ++- +...+.+.....++ ..+- ....-...+....++ |+-+.+ T Consensus 225 tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~-e~~~~~~~~~~~~r~~Gv~i~~ 301 (314) T protein:vir:10 225 SYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTN--VLPA-QPKDLHFRYPVTSKATGLIVYR 301 (314) T ss_pred cHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccce--eecc-eecCceEEEcceeeeEEEEEEC Confidence 3322 1 122223332222211111 1 111 12222222212222 1111 111122344445666 467899 Q ss_pred ccceEE---EEec Q lcl|Aclame:pro 622 KEAFCI---AKKG 631 (632) Q Consensus 622 ~~a~~~---~~~~ 631 (632) |.||++ +++| T Consensus 302 P~ai~~~dGI~~~ 314 (314) T protein:vir:10 302 PLTMAVIKGITFA 314 (314) T ss_pred cceeEeeeeeecC Confidence 999995 7788 No 166 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.78 E-value=1.4e-05 Score=47.16 Aligned_cols=270 Identities=10% Similarity=-0.033 Sum_probs=140.6 Q ss_pred hhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCC-ccccccccCcccccCcccce-e Q lcl|Aclame:pro 355 RQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSG-ANFYWIGEDEDVQDSDFDFT-T 432 (632) Q Consensus 355 ~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~-~ 432 (632) -+..+.+..+-...-..+-+.+.|...-....++..+.-. .+..+..+.+....-. +..--..||++.+.....-. . T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~-~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~ 79 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGK-GVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTM 79 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecC-ceecccEEEEEeeecCCccccccccCcccccccccCCEE Confidence 1111111111111111122233333333333344333222 2223334445443322 22233458887776543222 2 Q ss_pred eeeeeeeeeeeehhhHHHhhcChhH---HHHHHHHHHHHHHHHHHHHHHhhcCCC-----c---cccccceecccc---- Q lcl|Aclame:pro 433 LSFSPKTIAGAVPVTRKLRKQSSIH---VENLIREDLIEGIGVALDLAMLTGTGL-----A---NDPVGLLNMTGV---- 497 (632) Q Consensus 433 ~~~~~~t~~~~~~iSre~l~d~~~~---~~~~i~~~l~~a~a~~~~~~~~~g~g~-----~---~~~~Gil~~a~~---- 497 (632) +.=...-+...+.||.-+..-+..+ ...+-...-...+.+.+|..+++|... . ....|++..-.. T Consensus 80 ~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~ 159 (317) T protein:vir:88 80 LNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSL 159 (317) T ss_pred eccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCcee Confidence 3333344555666665444333333 344444455666778888999887531 1 223455432100 Q ss_pred -------------ccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeecc--c---- Q lcl|Aclame:pro 498 -------------PALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--N---- 558 (632) Q Consensus 498 -------------~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~---- 558 (632) ...+.+...++.+.|.+++.++-...... ..+++++.....+ ..+....+.++..+ . T Consensus 160 ~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~--~~i~v~a~~k~~i--~~~~~~~~~~i~~~~~~~~~g 235 (317) T protein:vir:88 160 GANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQA--NSIQTSSSIKKAI--SKNMKGRATEITLDASDNRIA 235 (317) T ss_pred ccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCC--CEEEeChHHHHHH--HHHhcCCceeEEEcccCeEEE Confidence 01122334578888999888887765432 2345666555444 23322223333211 1 Q ss_pred -------cccC-cceEEcCCCCCccEEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEe Q lcl|Aclame:pro 559 -------EVNG-YRAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) Q Consensus 559 -------~l~G-~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~ 630 (632) .-+| ..++.+..+|++.+++.|++.+.+..-..+......-+ -+..+..++..++..+.+++|..++.- T Consensus 236 ~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laKt---Gd~~k~~i~~E~tLe~~N~~a~a~i~~ 312 (317) T protein:vir:88 236 QTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAKT---GDSEKRQLLVEYTFRVNNEKSGALIRD 312 (317) T ss_pred EEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccceeeccCCC---cccceeEEEEEEEEEEcCccceeEEEE Confidence 1234 47888999999999999999988766655544332222 255678899999999999999998887 Q ss_pred cC Q lcl|Aclame:pro 631 GA 632 (632) Q Consensus 631 ~A 632 (632) -+ T Consensus 313 l~ 314 (317) T protein:vir:88 313 VV 314 (317) T ss_pred ec Confidence 76 No 167 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=97.63 E-value=2.4e-05 Score=45.87 Aligned_cols=296 Identities=7% Similarity=-0.030 Sum_probs=143.0 Q ss_pred hhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceech---hhhhHHHHHHHhhhhhhhhhcce--e Q lcl|Aclame:pro 321 WSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVAT---ELLSEEFIDILRNKAIIGQMGAR--M 395 (632) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~---~~~~~~i~~~~~~~~~~~~~~~~--~ 395 (632) .+.....+++.-... ... .... . ........+.++...... +.+.+.+++...+.-..+++... . T Consensus 1 ~~~~~~~~~~~~d~~-----~~~--~~a~--~-~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~ 70 (329) T protein:vir:79 1 MRGNIMSKEMKYDEF-----EAN--VIAN--H-MQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSE 70 (329) T ss_pred Cccchhhhhhccchh-----hhh--hHhh--h-cccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccC Confidence 000000000000000 000 0000 0 000111111121111221 23445566555554445554322 1 Q ss_pred eccCceeEEEEEecCCccccccccC-cccccCcccceeeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 396 LPGLVGDVDIPKKTSGANFYWIGED-EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIG 471 (632) Q Consensus 396 ~~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a 471 (632) .+.....+.+......+.+.|++.. ..+|..+....+....+..++..+.++.+-|... .+++...-.....++++ T Consensus 71 ~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~ 150 (329) T protein:vir:79 71 LSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHD 150 (329) T ss_pred CCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHH Confidence 2223335566666666778888764 4577778888888999999999999988777655 35677888888899999 Q ss_pred HHHHHHHhhcCCCccccccceecccccccccccc----------chhHHHHHHHHHHHHhhccccc-cceEEeehhHHHH Q lcl|Aclame:pro 472 VALDLAMLTGTGLANDPVGLLNMTGVPALTYPAG----------GVDWASVVDMETKISTFNADAG-RLAYLTSVTQRGA 540 (632) Q Consensus 472 ~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~----------~~~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 540 (632) +.+|+.+|+|+.. ....|++|+.++.....++. .--+++|.+++.++..+..... +...++++..... T Consensus 151 ~~~n~i~f~G~~~-~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~ 229 (329) T protein:vir:79 151 QLVNHLVFKGSKP-HKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKV 229 (329) T ss_pred HhhccEEEeeccc-ccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHH Confidence 9999999999754 34579999888765432211 1124566666666665433222 2344555554433 Q ss_pred HHHHhhcccCCceeecc-------ccccCcceEEcCCCCC-ccEEEE--ehhhEEEEEecceEEEEecccccccCcEEEE Q lcl|Aclame:pro 541 AKKAQVFDNTGERIWQN-------NEVNGYRAEASNQIPA-DTWIFG--DWSQIVIAMWGVLDLKVDPYTKAASDGLVLR 610 (632) Q Consensus 541 ~~~~~~~d~~g~~~~~~-------~~l~G~pv~~~~~~~~-~~~~~g--d~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 610 (632) + ......+|.-++.- -++-+.|-........ +.+++. +...+.+..- +.+...+- ....-...+. T Consensus 230 L--~~~~~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp--~~~~~l~~-q~~~~~~~v~ 304 (329) T protein:vir:79 230 L--MVRMPETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIP--EAFNMLTA-QPKDLHFKVP 304 (329) T ss_pred h--hcccCCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecC--cceeeeec-eecCceEEEc Confidence 3 23334445433321 1222222222221111 112222 2222222222 22222221 1112223444 Q ss_pred EEEEeC-cEEecccceEEEEecC Q lcl|Aclame:pro 611 VFQDVD-AGVRRKEAFCIAKKGA 632 (632) Q Consensus 611 ~~~r~~-~~v~~~~a~~~~~~~A 632 (632) ...|++ +-+.+|.||+.+.==- T Consensus 305 ~~~r~~Gv~i~~P~ai~~~dGI~ 327 (329) T protein:vir:79 305 CTSKCTGLTIYRPLTLVLIKGLV 327 (329) T ss_pred eeeeEEEEEEECcceeeeeeeee Confidence 455665 5778899888754333 No 168 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=97.62 E-value=3.6e-05 Score=44.93 Aligned_cols=258 Identities=12% Similarity=0.062 Sum_probs=129.3 Q ss_pred ccccccceechhhhhHHHHHHHhhhhhhhhhcceeec----cCceeEEEEEecCCccccccccCcccccCcccceeeeee Q lcl|Aclame:pro 361 TAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLP----GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFS 436 (632) Q Consensus 361 ~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~ 436 (632) .......++.++++.+.+++.++...++.++..+... .-...+++++... ..+.++..+....+..+++.+. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecccCCccccccccceEEEE Confidence 1111123455678888888888888877665433221 1223567766433 2234455566667788888888 Q ss_pred eeee-eeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHH Q lcl|Aclame:pro 437 PKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDM 515 (632) Q Consensus 437 ~~t~-~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~ 515 (632) +.+. ...+.|+.+-...+..++...+.+..+.+++..+|..+..-..... +..+ . .......++.+.++ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~------~~~g--t--~gt~~~~~~~i~~a 146 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAF------HSSG--T--PGVRPGAFIDFANA 146 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------cccc--c--CCcCcchHHHHHHH Confidence 7444 4467888776666677888889999999999999988753211100 0000 0 11122358899999 Q ss_pred HHHHHhhccccc-cceEEeehhHHHHHHHHhhcccC--C-ceee---ccccccCcceEEcCCCCCccE--------EEEe Q lcl|Aclame:pro 516 ETKISTFNADAG-RLAYLTSVTQRGAAKKAQVFDNT--G-ERIW---QNNEVNGYRAEASNQIPADTW--------IFGD 580 (632) Q Consensus 516 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~--g-~~~~---~~~~l~G~pv~~~~~~~~~~~--------~~gd 580 (632) ..+|...+.+.. ....++.|.....+.....+..+ + .-.+ .-+++.|+.|+.++++|..+. +.|- T Consensus 147 ~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga 226 (418) T protein:vir:10 147 GAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGT 226 (418) T ss_pred HHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecc Confidence 999998888754 35555666555444322222111 1 0111 225799999999999985331 1110 Q ss_pred h-hhEEEEEecc-----eEEEEeccccc-------------ccCcEEEEEEEEe------CcEEecccceE--------- Q lcl|Aclame:pro 581 W-SQIVIAMWGV-----LDLKVDPYTKA-------------ASDGLVLRVFQDV------DAGVRRKEAFC--------- 626 (632) Q Consensus 581 ~-s~~~~~~~~~-----~~~~~~~~~~~-------------~~~~~~~~~~~r~------~~~v~~~~a~~--------- 626 (632) . +...+...++ -.+...+..-| ......|++.... +..|.-.-++. T Consensus 227 ~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~ 306 (418) T protein:vir:10 227 VVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNE 306 (418) T ss_pred cccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecccccccccccccc Confidence 0 0001100000 00000110000 0012233333322 11121111110 Q ss_pred ---EEEecC Q lcl|Aclame:pro 627 ---IAKKGA 632 (632) Q Consensus 627 ---~~~~~A 632 (632) .+..++ T Consensus 307 ~~~~~~~~~ 315 (418) T protein:vir:10 307 NGDPVSLTA 315 (418) T ss_pred ccccccccC Confidence 011111 No 169 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.57 E-value=4.4e-05 Score=44.47 Aligned_cols=261 Identities=10% Similarity=-0.003 Sum_probs=109.6 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhcc--------eeeccCceeEEEEEecCCccccccccCcccccCcc Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGA--------RMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDF 428 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~ 428 (632) +. ++-...-.+..+++...+++.+.+....+.... ..+.+++....+....+...-.-+.-.+....+.+ T Consensus 1 ~~--~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~ki 78 (315) T protein:vir:96 1 MA--TTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKI 78 (315) T ss_pred Cc--eeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceec Confidence 11 111222234455555555555554333332211 12234443333333111111112222333333333 Q ss_pred cceeeeeeeeeeeeeehh--hHHHhh---cChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccc Q lcl|Aclame:pro 429 DFTTLSFSPKTIAGAVPV--TRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP 503 (632) Q Consensus 429 ~~~~~~~~~~t~~~~~~i--Sre~l~---d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~ 503 (632) +-++ ...++...+.-++ +...+. ++...+...|.+.+..+..+..=...+.+.... +-.++.. ....+ T Consensus 79 t~~~-dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aa-----i~~~t~~-~~~~~ 151 (315) T protein:vir:96 79 AADE-MVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGA-----IGSNAGM-NVSGE 151 (315) T ss_pred cccc-ceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-----hcccccc-ccccc Confidence 2221 1222223333333 333333 333344444555554444433322222222110 1111111 12234 Q ss_pred ccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc----cCCceeec-cccccCcceEEcCCCCCccEEE Q lcl|Aclame:pro 504 AGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD----NTGERIWQ-NNEVNGYRAEASNQIPADTWIF 578 (632) Q Consensus 504 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----~~g~~~~~-~~~l~G~pv~~~~~~~~~~~~~ 578 (632) .+.++...+.++..++..+.. .-..++||......+....|.. ..+..++. +...+|++|+|++.+|..+++. T Consensus 152 ~a~~~~~~l~dA~~klGD~~~--~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~~lGkrViVdD~~P~~~~~g 229 (315) T protein:vir:96 152 LATEGKKVLTKGLRTMGDKAS--SIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPGTLGKPVLVTDQCPATKIFG 229 (315) T ss_pred ccccCHHHHHHHHHHhccccc--CeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCcccccEEEEECCCCcceeee Confidence 456888999999999865533 3456788888877666543321 12222222 1224599999999999765433 Q ss_pred EehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCc-EEecccceEEEEecC Q lcl|Aclame:pro 579 GDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDA-GVRRKEAFCIAKKGA 632 (632) Q Consensus 579 gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~-~v~~~~a~~~~~~~A 632 (632) --...+.+.....+.... .+. .+.-.+....|..+ -+++|++|.+-+.+- T Consensus 230 l~~GAi~~~~~~~~~~~~-~~~---~g~e~l~~~~r~e~tf~l~p~G~sw~~~~~ 280 (315) T protein:vir:96 230 LVAGAVMITESQAPGMRS-YQI---DDQENLAIGFRAEGTANVEVLGYKWKTKTN 280 (315) T ss_pred eecceeeecCCCcccccc-ccC---CCcceeEEEEeeeeEeeeeeeeEEeecCCC Confidence 211222222211111110 011 12223333344444 368888888843221 No 170 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=97.01 E-value=0.00012 Score=42.18 Aligned_cols=224 Identities=14% Similarity=0.064 Sum_probs=121.7 Q ss_pred hhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC-ceeEEEEEecCCccccccccC Q lcl|Aclame:pro 342 ARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL-VGDVDIPKKTSGANFYWIGED 420 (632) Q Consensus 342 ~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~ 420 (632) .... .+...+........-++-....+++.+.+.+.+.... ...... ...+.+.+.++-|.+.|..=+ T Consensus 1 m~~~----------~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~l-pf~e~n~gt~~~~~v~~~LP~~~fR~lN 69 (328) T protein:vir:95 1 MAVK----------GLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDM-PFVEGNLPTGHRTTIRSGLPSATWRLLN 69 (328) T ss_pred CCcc----------ccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhc-ceeecccCCcceeeEeeccCCceeeecC Confidence 0000 0000000000000111223345677777766554442 223332 234677788999999999999 Q ss_pred cccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHH---HHHHHHHHHHHHHHHHHHHhhcCCCcc--ccccc---e Q lcl|Aclame:pro 421 EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVE---NLIREDLIEGIGVALDLAMLTGTGLAN--DPVGL---L 492 (632) Q Consensus 421 ~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~---~~i~~~l~~a~a~~~~~~~~~g~g~~~--~~~Gi---l 492 (632) +.++.++.++.+++-.++-+++.+.|.+.+.... -+.. ..-...+.+++.+.....+|+|+.+.+ ...|+ + T Consensus 70 ~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~-Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~ 148 (328) T protein:vir:95 70 YGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLN-GNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRY 148 (328) T ss_pred CccCcccceeEEEEEEEEEEecceeechHHHhhc-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhc Confidence 9999999999999999999999999999766443 2333 445566889999999999999864321 12222 1 Q ss_pred ecccc------------c-------------------------------------------------------------- Q lcl|Aclame:pro 493 NMTGV------------P-------------------------------------------------------------- 498 (632) Q Consensus 493 ~~a~~------------~-------------------------------------------------------------- 498 (632) +..+. + T Consensus 149 ~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~ 228 (328) T protein:vir:95 149 SSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALR 228 (328) T ss_pred CccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEc Confidence 10000 0 Q ss_pred ---------cccccc--cchhHHHH----HHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec------c Q lcl|Aclame:pro 499 ---------ALTYPA--GGVDWASV----VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ------N 557 (632) Q Consensus 499 ---------~~~~~~--~~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~------~ 557 (632) |+..+. .....+.+ ++++.+++ +....++.|.||.....++.+...+-.+-+.-.. . T Consensus 229 d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip--~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~ 306 (328) T protein:vir:95 229 DWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIP--NRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWW 306 (328) T ss_pred CcccEEEEecCcccccccccChhhHHHHHHHHHHHhc--cCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcce Confidence 000000 00011222 33333333 2334567788888888877764333322222111 1 Q ss_pred ccccCcceEEcCCCCCccEEEE Q lcl|Aclame:pro 558 NEVNGYRAEASNQIPADTWIFG 579 (632) Q Consensus 558 ~~l~G~pv~~~~~~~~~~~~~g 579 (632) ..+.|.||..++.+-.+...+. T Consensus 307 t~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 307 TSFRGVPIRETDALLETEARVV 328 (328) T ss_pred eEECCeEEEEEeeeecCccccC Confidence 3578888888877664432222 No 171 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.00 E-value=0.00015 Score=41.48 Aligned_cols=263 Identities=10% Similarity=-0.023 Sum_probs=129.6 Q ss_pred ccccceechh--hhhHHHHHHHhhhhhhhhhcc--eeeccCceeEEEEEecCCcccc--cccc-CcccccCcccceeeee Q lcl|Aclame:pro 363 GKGGELVATE--LLSEEFIDILRNKAIIGQMGA--RMLPGLVGDVDIPKKTSGANFY--WIGE-DEDVQDSDFDFTTLSF 435 (632) Q Consensus 363 ~~~~~~i~~~--~~~~~i~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~a~--~v~E-~~~~~~~~~~~~~~~~ 435 (632) .++...+..+ .+...+.+...+.-...++.. +..+.....+.+...+..+.+. |++- ...+|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 1111111111 111222221112122222211 1111122234555555556666 7654 4568888999999999 Q ss_pred eeeeeeeeehhhHHHhhcCh---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccc-------c Q lcl|Aclame:pro 436 SPKTIAGAVPVTRKLRKQSS---IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA-------G 505 (632) Q Consensus 436 ~~~t~~~~~~iSre~l~d~~---~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~-------~ 505 (632) .++.++..+.+|.+-|.-.. .++.+.-.....+++...+|+..|.|+.......|++|++++...+.++ . T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 99999999999887666543 2566666677777888888888999975434567999999887543221 1 Q ss_pred chhHHHH----HHHHHHHHhhccccc-cceEEeehhHHHHHHHHhhcccCCceeec----ccc-ccCcceEEc--C--CC Q lcl|Aclame:pro 506 GVDWASV----VDMETKISTFNADAG-RLAYLTSVTQRGAAKKAQVFDNTGERIWQ----NNE-VNGYRAEAS--N--QI 571 (632) Q Consensus 506 ~~~~~~i----~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~g~~~~~----~~~-l~G~pv~~~--~--~~ 571 (632) .-+.+.| ..++.++..+..... ....++.+.....+.... .+..+.-++. .++ ..|.|+-+- + .. T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~-~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~ 239 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQ-RANTDTTALEFLTKHLSAAAGRQVAIKALPSNYG 239 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhcc-CCCCCchHHHHHHHhcccccCCcceEEEeccccc Confidence 1233444 444444443322212 234555555555554322 2333433321 121 234443321 1 11 Q ss_pred C---C--ccEEEEehh--hEEEEEecceEEEEecccccccCc--EEEEEEEEeCc-EEecccceEEEEe Q lcl|Aclame:pro 572 P---A--DTWIFGDWS--QIVIAMWGVLDLKVDPYTKAASDG--LVLRVFQDVDA-GVRRKEAFCIAKK 630 (632) Q Consensus 572 ~---~--~~~~~gd~s--~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~r~~~-~v~~~~a~~~~~~ 630 (632) . + +.+++.+-+ .+.+ .-.+.+...+.. ..+. ..+=.+.|+|+ .+..|.+|+++.. T Consensus 240 ~~g~~g~~r~vvY~~d~~~~~~--~vP~p~~~l~~q--~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 240 TRVTDGKTRAMVYVNSKEHVIF--DVPMSPTVLDAQ--PKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccCCCCceEEEEEecChhheEE--ecCccccccchh--hcCCceEEecceeeeeeEEEEccceeeeecC Confidence 1 1 112222222 1222 111222222211 1232 23334556655 7789999999999 No 172 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=96.89 E-value=0.00027 Score=40.12 Aligned_cols=278 Identities=9% Similarity=-0.020 Sum_probs=119.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAII 388 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~ 388 (632) +. .......+. .. +...-+.. -......+...+.....+-+.+...... T Consensus 1 ~~---------------------~~~~~~~~~----~~-----~~~~~~~~-~~~~~nt~~l~~k~~~~LD~~~~~~~~s 49 (319) T protein:vir:94 1 MN---------------------KTIKNATGM----LK-----LNLQHFAN-KSVEPGQTLLKNKHVGILERVTAVNAYS 49 (319) T ss_pred CC---------------------cccccccce----eE-----eehhhhhc-cCCCcchHHHHHHHHHHHHHHHHHhhhh Confidence 00 000000000 00 00000000 0111112222233333333322222221 Q ss_pred hhh--cceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh----hHHHHHH Q lcl|Aclame:pro 389 GQM--GARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS----IHVENLI 462 (632) Q Consensus 389 ~~~--~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~----~~~~~~i 462 (632) ..+ .....-.....+++++.+..+-. -..-++-+..+.++.+.+++.+.. .+.+.+.=.-+..+. +.+...+ T Consensus 50 ~~~~~N~~~e~~gg~tVkIp~i~~~gl~-DY~R~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~ 127 (319) T protein:vir:94 50 TPALISNDAIFMEGRSFTVMKGDTTELK-DYKRNATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVV 127 (319) T ss_pred hhcccCcceEeccCcEEEEeeecccccc-cccCCCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHH Confidence 111 11122234567888988764433 333344455666666666655533 233332211111111 1222223 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHH Q lcl|Aclame:pro 463 REDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK 542 (632) Q Consensus 463 ~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (632) .+.+...++-.+|...+.....+ ++..........-.++.|.++...|..+..+. ....+++|.....+. T Consensus 128 ~~~~~~~v~PEiDay~~skla~~---------a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~-~Rvl~Vtp~~~~~L~ 197 (319) T protein:vir:94 128 ARQGAEVVAPYLDNLRFATLARN---------KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPE-NRVLFVSPTFYKGIK 197 (319) T ss_pred HHHHHHHhhhhhhHHHHHHHHhh---------cccccccccCHHHHHHHHHHHHHHHHhcCCCC-CcEEEeCHHHHHHHH Confidence 34444455555665544322111 00001111122335789999999999887763 444455565555543 Q ss_pred HHhh--cccC-Cc-eee--ccccccCcceEEcCC--CCCccEEEEehhhEEEEE-ecceEEEEecccccccCcEEEEEEE Q lcl|Aclame:pro 543 KAQV--FDNT-GE-RIW--QNNEVNGYRAEASNQ--IPADTWIFGDWSQIVIAM-WGVLDLKVDPYTKAASDGLVLRVFQ 613 (632) Q Consensus 543 ~~~~--~d~~-g~-~~~--~~~~l~G~pv~~~~~--~~~~~~~~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 613 (632) .... ++.. +. .+. ..+.|.|++|+.++. +..-.+++|..+.+.... ...+++.....- .....|+.+. T Consensus 198 ~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~---~~a~~v~gr~ 274 (319) T protein:vir:94 198 KFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPG---MFGTLAEQLL 274 (319) T ss_pred hhhhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCcc---ccceeeeeee Confidence 3211 1111 11 111 235789999987643 334457777665543322 122232221111 1246789999 Q ss_pred EeCcEEecccceEEE--EecC Q lcl|Aclame:pro 614 DVDAGVRRKEAFCIA--KKGA 632 (632) Q Consensus 614 r~~~~v~~~~a~~~~--~~~A 632 (632) ++|..|.+|++..+. ..++ T Consensus 275 y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:94 275 YTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred eeeeEEeccccceEEEeecCC Confidence 999999999854433 3333 No 173 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=96.89 E-value=0.00027 Score=40.12 Aligned_cols=278 Identities=9% Similarity=-0.020 Sum_probs=119.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAII 388 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~ 388 (632) +. .......+. .. +...-+.. -......+...+.....+-+.+...... T Consensus 1 ~~---------------------~~~~~~~~~----~~-----~~~~~~~~-~~~~~nt~~l~~k~~~~LD~~~~~~~~s 49 (319) T protein:vir:97 1 MN---------------------KTIKNATGM----LK-----LNLQHFAN-KSVEPGQTLLKNKHVGILERVTAVNAYS 49 (319) T ss_pred CC---------------------cccccccce----eE-----eehhhhhc-cCCCcchHHHHHHHHHHHHHHHHHhhhh Confidence 00 000000000 00 00000000 0111112222233333333322222221 Q ss_pred hhh--cceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh----hHHHHHH Q lcl|Aclame:pro 389 GQM--GARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS----IHVENLI 462 (632) Q Consensus 389 ~~~--~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~----~~~~~~i 462 (632) ..+ .....-.....+++++.+..+-. -..-++-+..+.++.+.+++.+.. .+.+.+.=.-+..+. +.+...+ T Consensus 50 ~~~~~N~~~e~~gg~tVkIp~i~~~gl~-DY~R~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~ 127 (319) T protein:vir:97 50 TPALISNDAIFMEGRSFTVMKGDTTELK-DYKRNATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVV 127 (319) T ss_pred hhcccCcceEeccCcEEEEeeecccccc-cccCCCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHH Confidence 111 11122234567888988764433 333344455666666666655533 233332211111111 1222223 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHH Q lcl|Aclame:pro 463 REDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK 542 (632) Q Consensus 463 ~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 542 (632) .+.+...++-.+|...+.....+ ++..........-.++.|.++...|..+..+. ....+++|.....+. T Consensus 128 ~~~~~~~v~PEiDay~~skla~~---------a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~-~Rvl~Vtp~~~~~L~ 197 (319) T protein:vir:97 128 ARQGAEVVAPYLDNLRFATLARN---------KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPE-NRVLFVSPTFYKGIK 197 (319) T ss_pred HHHHHHHhhhhhhHHHHHHHHhh---------cccccccccCHHHHHHHHHHHHHHHHhcCCCC-CcEEEeCHHHHHHHH Confidence 34444455555665544322111 00001111122335789999999999887763 444455565555543 Q ss_pred HHhh--cccC-Cc-eee--ccccccCcceEEcCC--CCCccEEEEehhhEEEEE-ecceEEEEecccccccCcEEEEEEE Q lcl|Aclame:pro 543 KAQV--FDNT-GE-RIW--QNNEVNGYRAEASNQ--IPADTWIFGDWSQIVIAM-WGVLDLKVDPYTKAASDGLVLRVFQ 613 (632) Q Consensus 543 ~~~~--~d~~-g~-~~~--~~~~l~G~pv~~~~~--~~~~~~~~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 613 (632) .... ++.. +. .+. ..+.|.|++|+.++. +..-.+++|..+.+.... ...+++.....- .....|+.+. T Consensus 198 ~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~k~~~~~~~~p~~~---~~a~~v~gr~ 274 (319) T protein:vir:97 198 KFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPG---MFGTLAEQLL 274 (319) T ss_pred hhhhhhccccccccceeeeeceeecCeEEEEecccccccceEEEEcCCeeeeeeeeeeeeccCCCcc---ccceeeeeee Confidence 3211 1111 11 111 235789999987643 334457777665543322 122232221111 1246789999 Q ss_pred EeCcEEecccceEEE--EecC Q lcl|Aclame:pro 614 DVDAGVRRKEAFCIA--KKGA 632 (632) Q Consensus 614 r~~~~v~~~~a~~~~--~~~A 632 (632) ++|..|.+|++..+. ..++ T Consensus 275 y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:97 275 YTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred eeeeEEeccccceEEEeecCC Confidence 999999999854433 3333 No 174 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=269 Identities=11% Similarity=-0.014 Sum_probs=121.7 Q ss_pred hhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhccee--------eccCceeEEEEEecCC-ccccccccCc--- Q lcl|Aclame:pro 354 QRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM--------LPGLVGDVDIPKKTSG-ANFYWIGEDE--- 421 (632) Q Consensus 354 ~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~-~~a~~v~E~~--- 421 (632) ...+..- +.-..++.++++..-+.+...+.+.+.+.+.-. .......+.+|....- +....+.+.. T Consensus 1 M~~~~~~--T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~ 78 (367) T protein:vir:80 1 MPDFNNQ--VRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) T ss_pred Ccchhhh--hhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcc Confidence 0000000 011123444444333333333333333332211 1123334566655332 2334444332 Q ss_pred ccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHh---hcCCCcc---cc-----cc Q lcl|Aclame:pro 422 DVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAML---TGTGLAN---DP-----VG 490 (632) Q Consensus 422 ~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~---~g~g~~~---~~-----~G 490 (632) +.+.++++-++-.-.+...++.+..+.=.-.-+.-+....|.++++..-.+...+.++ .|.-..+ .. .+ T Consensus 79 ~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~ 158 (367) T protein:vir:80 79 EAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRG 158 (367) T ss_pred cccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhh Confidence 2333444444444444444555544442222223355566777766655555544443 2221100 00 00 Q ss_pred cee-------cccccccc----ccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhh----cccCCceee Q lcl|Aclame:pro 491 LLN-------MTGVPALT----YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV----FDNTGERIW 555 (632) Q Consensus 491 il~-------~a~~~~~~----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~g~~~~ 555 (632) .++ ...+..++ .+...++...+.++..+|..+.. .....+||......+...++ ++++|. . T Consensus 159 ~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~--~l~~i~mHS~V~~~L~~~~li~~i~~sd~~--~ 234 (367) T protein:vir:80 159 RVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG--SIAAIAVHSMVYKRMTNNDEIEFIPDSKGQ--L 234 (367) T ss_pred ccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccc--cccEEEEchHHHHHHHhccccccccCCCCc--c Confidence 000 00111111 12345788999999888877544 34567788888777766555 445542 2 Q ss_pred ccccccCcceEEcCCCCCc---------cEEEEehhhEEEEEec---ceEEEEecccccccCcEEEEEEEEeCcEEeccc Q lcl|Aclame:pro 556 QNNEVNGYRAEASNQIPAD---------TWIFGDWSQIVIAMWG---VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKE 623 (632) Q Consensus 556 ~~~~l~G~pv~~~~~~~~~---------~~~~gd~s~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~ 623 (632) .-++++|++|++++.+|-. +++||.=+ +.++..+ ..+..+++-.. ..+.+.+....|. .+.||. T Consensus 235 ~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GA-i~~~~~~~~~~~E~~Rd~~~~-~~gG~d~L~~Rr~--~~~hP~ 310 (367) T protein:vir:80 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAA-FGYADGAPQVPVAVGRRELRG-NGSGLEYILERKE--WIVHPG 310 (367) T ss_pred ccceecceeEEEeCCCcccccCCCceEEEEEEecce-eeecccCCccceecccchhhh-cCCceEEEEeeee--EEeecc Confidence 3467899999999999942 23444222 2122211 12322222211 1244444443333 688999 Q ss_pred ceEEEEec--C Q lcl|Aclame:pro 624 AFCIAKKG--A 632 (632) Q Consensus 624 a~~~~~~~--A 632 (632) +|.+.+.. | T Consensus 311 G~s~~~~~v~~ 321 (367) T protein:vir:80 311 GFNWLDADVTI 321 (367) T ss_pred eeeeccccccc Confidence 88876432 1 No 175 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=96.70 E-value=0.0004 Score=39.22 Aligned_cols=260 Identities=11% Similarity=-0.025 Sum_probs=96.8 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhc--------ceeeccCceeEEEEEecCC--ccccccccCcccccCc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMG--------ARMLPGLVGDVDIPKKTSG--ANFYWIGEDEDVQDSD 427 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~--~~a~~v~E~~~~~~~~ 427 (632) +.-++. .+.++.....+++.+.+........ .....+++....+.....+ .....+.+.+....++ T Consensus 1 m~lsD~----~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDL----AVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhh----hhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceeccce Confidence 000000 0111222222223222221111110 1111233322322221111 1223344444444444 Q ss_pred ccceeeeeeeeeeeeee----hhhHHHh-hcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccceeccccccc Q lcl|Aclame:pro 428 FDFTTLSFSPKTIAGAV----PVTRKLR-KQSSIHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGLLNMTGVPAL 500 (632) Q Consensus 428 ~~~~~~~~~~~t~~~~~----~iSre~l-~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gil~~a~~~~~ 500 (632) ++-++ ...++...+.- .++..+- .++.-.+...|.+.+++...+.+-..++.+.... .+...+... .... T Consensus 77 itt~~-~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~di--s~~~ 153 (325) T protein:vir:95 77 LKHLV-DTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDA--TANT 153 (325) T ss_pred ecccc-ceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeee--eccc Confidence 43221 12222222221 1111111 1222222333444444433322222222221110 000111110 0011 Q ss_pred cccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcc------cCCceeeccccccCcceEEcCCCCCc Q lcl|Aclame:pro 501 TYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFD------NTGERIWQNNEVNGYRAEASNQIPAD 574 (632) Q Consensus 501 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d------~~g~~~~~~~~l~G~pv~~~~~~~~~ 574 (632) ......++...+.++..+|..+. ..-..++||......|....|-+ ..|... -++++|++|++++.+|.. T Consensus 154 ~~~~~~~s~~~l~~A~~klGD~~--~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~--i~t~~G~~VIVdD~~p~~ 229 (325) T protein:vir:95 154 DAADKLPTWNNLNNGQAKFGDQS--SQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNV--VRDPFGKLLVMTDSPNLF 229 (325) T ss_pred CcccccccHHHHHHHHHHhcccc--cceeEEEEchHHHHHHHHhhccccccccccCCccc--ccccCCcEEEEeCCCCCC Confidence 11223367889999999986643 23456788888887776544432 222222 246899999999998853 Q ss_pred c---------EEEEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 T---------WIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ~---------~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . ++|| ...+.+.+...+.....+...-.+....++.+ . --++||.++.+-+-.. T Consensus 230 ~~g~~~~ytty~lg-~GAi~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-tf~lhp~G~sw~~s~~ 292 (325) T protein:vir:95 230 AAGTPNVYHILGLV-PGGVLIGQNNDFDANEETKNGDENIIRTYQAE--W-SYNIGVKGFAWDKANG 292 (325) T ss_pred CccCceeEEEEEEe-cCeEEecCCCCccccccccCcccceeeeeeee--e-eEEeecceeeeecccc Confidence 2 2222 11122222222222111111112222223321 1 1467999999843222 No 176 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=96.50 E-value=0.00056 Score=38.41 Aligned_cols=258 Identities=9% Similarity=-0.029 Sum_probs=123.4 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC------ceeEEEEEecCCccccccc-cCcccccCcccc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL------VGDVDIPKKTSGANFYWIG-EDEDVQDSDFDF 430 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~a~~v~-E~~~~~~~~~~~ 430 (632) +..+.. ...|+++.+.+++.++...++.++..+...+. +..+++++........... .++.+....+.+ T Consensus 1 MAN~ll----T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MANNLE----SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 111110 12356677778888888777777644434332 2356666665443333322 234445566677 Q ss_pred eeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhH Q lcl|Aclame:pro 431 TTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDW 509 (632) Q Consensus 431 ~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~ 509 (632) .++.+.+.+.-. .+.++.+-...+.-++...+. .-..++++.++..++...-.+ .+ +..+ . .+.....+ T Consensus 77 ~~v~l~id~~k~~a~~v~d~e~~l~i~~~~~~l~-~a~~ala~~vd~~l~~~l~~~-a~----~~vg--t--~~t~~~~~ 146 (423) T protein:vir:35 77 AKATGKVGKYITVAVEWTQIEEALKLNQLDQILS-PIHERMVTDLETELAHFMMNN-GA----LSLG--S--PNTAIKKW 146 (423) T ss_pred ceeeEEeccceeccceeCHHHHHhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhc-cc----cccc--c--ccCCcchH Confidence 777777766555 567777655545556655444 445777888888775422110 00 0001 1 11112357 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHH--HhhcccC--Cc-eeec---cccccCcceEEcCCCCCccEEE--- Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK--AQVFDNT--GE-RIWQ---NNEVNGYRAEASNQIPADTWIF--- 578 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~d~~--g~-~~~~---~~~l~G~pv~~~~~~~~~~~~~--- 578 (632) +.+.++..+|...+.+......++.+.....+.. .++...+ +. -+-. .+.+.|+.|+.|+++|..+..- T Consensus 147 ~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~ 226 (423) T protein:vir:35 147 ADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDG 226 (423) T ss_pred HHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCcccccccccc Confidence 8999999999999988776666666665444321 1122111 11 1111 2678999999999999643211 Q ss_pred -----Eehh--hEEEEE----ecceEEEEecccccccCcEEEEEEEEeCcEEecc------------cceEEEEecC Q lcl|Aclame:pro 579 -----GDWS--QIVIAM----WGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRK------------EAFCIAKKGA 632 (632) Q Consensus 579 -----gd~s--~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~------------~a~~~~~~~A 632 (632) +-.. ...+.. ..++...+...+++.+=+- +....|+..+++ ..+.+.-.++ T Consensus 227 ~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD---~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~ 300 (423) T protein:vir:35 227 AITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGD---QLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEE 300 (423) T ss_pred ceeeccccccccccccccccceeeeeeeeeccCCcEEecc---eEEeeeeeeccccccceeecccCCceeEEEEecc Confidence 0000 000000 0111111211222111111 112223233222 2222222222 No 177 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=96.31 E-value=0.00072 Score=37.80 Aligned_cols=225 Identities=11% Similarity=0.070 Sum_probs=119.8 Q ss_pred hhhhHHhhhhhccccccccccee-chhhhhHHHHHHHhhhhhhhhhcceeeccCcee-EEEEEecCCccccccccCcccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELV-ATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSGANFYWIGEDEDVQ 424 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~ 424 (632) ++. -.....+.......+ +...+...+++.+.+.+.+.... ....+.... ....+.++-|.+.|..=+..++ T Consensus 1 m~~-----~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~l-pf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:98 1 MPT-----LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDM-TVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCc-----cccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhc-eeeeccCCccceeeEEeccCCchhhccCCccC Confidence 000 000000000000000 11122334777777766654432 223332222 3455678889999999999999 Q ss_pred cCcccceeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccc---eecc-- Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGL---LNMT-- 495 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gi---l~~a-- 495 (632) .++.++.+++-..+-+++.+.|.|.+..... -++.....+.+.+++.+.....+|+|+.+. ..+.|+ ++.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:98 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999997665422 123344666788999999999999986321 111121 1000 Q ss_pred ----------ccc------------------------------------------------------------------- Q lcl|Aclame:pro 496 ----------GVP------------------------------------------------------------------- 498 (632) Q Consensus 496 ----------~~~------------------------------------------------------------------- 498 (632) +.+ T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:98 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 000 Q ss_pred ----cccccc---cchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec-------cccc Q lcl|Aclame:pro 499 ----ALTYPA---GGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-------NNEV 560 (632) Q Consensus 499 ----~~~~~~---~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~-------~~~l 560 (632) |+..+. .+-+ .+.++++..+++. ....++.|.||.....++.+...+-.+.+.+-. ...+ T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:98 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPN--VGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAF 312 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--cCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEE Confidence 000000 0000 1122233333332 335667888999888888765433323222221 1247 Q ss_pred cCcceEEcCCCCCccEEEE Q lcl|Aclame:pro 561 NGYRAEASNQIPADTWIFG 579 (632) Q Consensus 561 ~G~pv~~~~~~~~~~~~~g 579 (632) .|.||..++.+-.+...+. T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 8888888877654432222 No 178 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=96.31 E-value=0.00072 Score=37.80 Aligned_cols=225 Identities=11% Similarity=0.070 Sum_probs=119.8 Q ss_pred hhhhHHhhhhhccccccccccee-chhhhhHHHHHHHhhhhhhhhhcceeeccCcee-EEEEEecCCccccccccCcccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELV-ATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSGANFYWIGEDEDVQ 424 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~ 424 (632) ++. -.....+.......+ +...+...+++.+.+.+.+.... ....+.... ....+.++-|.+.|..=+..++ T Consensus 1 m~~-----~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~l-pf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPT-----LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDM-TVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCc-----cccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhc-eeeeccCCccceeeEEeccCCchhhccCCccC Confidence 000 000000000000000 11122334777777766654432 223332222 3455678889999999999999 Q ss_pred cCcccceeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccc---eecc-- Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGL---LNMT-- 495 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gi---l~~a-- 495 (632) .++.++.+++-..+-+++.+.|.|.+..... -++.....+.+.+++.+.....+|+|+.+. ..+.|+ ++.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999997665422 123344666788999999999999986321 111121 1000 Q ss_pred ----------ccc------------------------------------------------------------------- Q lcl|Aclame:pro 496 ----------GVP------------------------------------------------------------------- 498 (632) Q Consensus 496 ----------~~~------------------------------------------------------------------- 498 (632) +.+ T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:10 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 000 Q ss_pred ----cccccc---cchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec-------cccc Q lcl|Aclame:pro 499 ----ALTYPA---GGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-------NNEV 560 (632) Q Consensus 499 ----~~~~~~---~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~-------~~~l 560 (632) |+..+. .+-+ .+.++++..+++. ....++.|.||.....++.+...+-.+.+.+-. ...+ T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:10 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPN--VGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAF 312 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--cCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEE Confidence 000000 0000 1122233333332 335667888999888888765433323222221 1247 Q ss_pred cCcceEEcCCCCCccEEEE Q lcl|Aclame:pro 561 NGYRAEASNQIPADTWIFG 579 (632) Q Consensus 561 ~G~pv~~~~~~~~~~~~~g 579 (632) .|.||..++.+-.+...+. T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 8888888877654432222 No 179 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=96.31 E-value=0.00072 Score=37.80 Aligned_cols=225 Identities=11% Similarity=0.070 Sum_probs=119.8 Q ss_pred hhhhHHhhhhhccccccccccee-chhhhhHHHHHHHhhhhhhhhhcceeeccCcee-EEEEEecCCccccccccCcccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELV-ATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSGANFYWIGEDEDVQ 424 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~ 424 (632) ++. -.....+.......+ +...+...+++.+.+.+.+.... ....+.... ....+.++-|.+.|..=+..++ T Consensus 1 m~~-----~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~l-pf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~ 74 (331) T protein:vir:10 1 MPT-----LSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDM-TVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQ 74 (331) T ss_pred CCc-----cccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhc-eeeeccCCccceeeEEeccCCchhhccCCccC Confidence 000 000000000000000 11122334777777766654432 223332222 3455678889999999999999 Q ss_pred cCcccceeeeeeeeeeeeeehhhHHHhhcCh--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccc---eecc-- Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGL---LNMT-- 495 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~--~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gi---l~~a-- 495 (632) .++.++.+++-..+-+++.+.|.|.+..... -++.....+.+.+++.+.....+|+|+.+. ..+.|+ ++.. T Consensus 75 ~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 75 PEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred cccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 9999999999999999999999997665422 123344666788999999999999986321 111121 1000 Q ss_pred ----------ccc------------------------------------------------------------------- Q lcl|Aclame:pro 496 ----------GVP------------------------------------------------------------------- 498 (632) Q Consensus 496 ----------~~~------------------------------------------------------------------- 498 (632) +.+ T Consensus 155 ~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v 234 (331) T protein:vir:10 155 ENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYV 234 (331) T ss_pred ccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccE Confidence 000 Q ss_pred ----cccccc---cchh----HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec-------cccc Q lcl|Aclame:pro 499 ----ALTYPA---GGVD----WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ-------NNEV 560 (632) Q Consensus 499 ----~~~~~~---~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~-------~~~l 560 (632) |+..+. .+-+ .+.++++..+++. ....++.|.||.....++.+...+-.+.+.+-. ...+ T Consensus 235 ~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~ 312 (331) T protein:vir:10 235 VRIANVDVSELTKNASAGADLIDLMTQAVELIPN--VGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAF 312 (331) T ss_pred EEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--cCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEE Confidence 000000 0000 1122233333332 335667888999888888765433323222221 1247 Q ss_pred cCcceEEcCCCCCccEEEE Q lcl|Aclame:pro 561 NGYRAEASNQIPADTWIFG 579 (632) Q Consensus 561 ~G~pv~~~~~~~~~~~~~g 579 (632) .|.||..++.+-.+...+. T Consensus 313 ~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 313 DGIPCRRTDALLLTEARVV 331 (331) T ss_pred CCeeEEEeeeeecCccccC Confidence 8888888877654432222 No 180 >protein:vir:99228 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:776 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950457;genbank:gi:119953658;genbank:GeneID:4643088 Probab=96.22 E-value=9.9e-05 Score=42.55 Aligned_cols=209 Identities=15% Similarity=0.063 Sum_probs=119.9 Q ss_pred hh-hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccc-cccccCcccc Q lcl|Aclame:pro 347 MP-HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANF-YWIGEDEDVQ 424 (632) Q Consensus 347 ~~-~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~ 424 (632) +. ......++ +..-+...+.+.+...++.....+..++..+..-.+.-++..|.. .|++ +.. T Consensus 1 M~ii~~~~L~~-------------l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~~Y~WLg~~P~mreWiG---~r~ 64 (304) T protein:vir:99 1 MAIITPALISA-------------LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIG---QRV 64 (304) T ss_pred CCccCHHHHHH-------------HHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhh---hhh Confidence 00 00000000 111133445555555555555567788888777777778888875 5774 444 Q ss_pred cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC---Ccccccc-ceecccccc- Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG---LANDPVG-LLNMTGVPA- 499 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g---~~~~~~G-il~~a~~~~- 499 (632) ...+.....++.=++|-.-+.|.|.-|+||.+++..-+.+.||++++..=|..++.-.. +....+| -+|.++|.. T Consensus 65 i~~l~~~~y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~ 144 (304) T protein:vir:99 65 IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY 144 (304) T ss_pred hhhhhhccceeeccccccccccccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccc Confidence 55666677788889999999999999999999999999999999999877776642211 0111111 111111110 Q ss_pred --------------c----------------------------------------------------------------- Q lcl|Aclame:pro 500 --------------L----------------------------------------------------------------- 500 (632) Q Consensus 500 --------------~----------------------------------------------------------------- 500 (632) . T Consensus 145 ~~~dg~g~~~~vsn~~~~~~~~g~~w~Lld~~r~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfW 224 (304) T protein:vir:99 145 PNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSMTKEDDEQVFMADEYRYGVRSRCNVGFGFW 224 (304) T ss_pred ccccccCcccccceeccCCCCCCCcEEEEeCCCCccceeeeccccceeeeccCCCchhhhhhcceeEeeeeeeccchhhh Confidence 0 Q ss_pred ---cccccchhHHHHHHHHHHHHhhcccccc-----ceEEeehhHHHHHHHHhhcccCCceeeccccccC-cceEEcCCC Q lcl|Aclame:pro 501 ---TYPAGGVDWASVVDMETKISTFNADAGR-----LAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNG-YRAEASNQI 571 (632) Q Consensus 501 ---~~~~~~~~~~~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~~l~G-~pv~~~~~~ 571 (632) .++.++++.+.+..++.+|+.+..+.+. +.++++|..........+....- .-...|++.| ..+++++.+ T Consensus 225 QlA~gS~a~Lt~~nl~aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~aA~~ll~a~~~-~~G~tNp~~g~~eliV~P~L 303 (304) T protein:vir:99 225 QLAAMSTEELNTANFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL-ANGADNPNFELVQVLDTAWL 303 (304) T ss_pred hhhhhcCCCcChHHHHHHHHHHHhhcCCCCceeccccCeEEecchHHHHHHHHHhhhcc-CCCCcceecceEEEEeeccc Confidence 0122456667777777777766543322 34556666555444443332110 0012355556 466677666 Q ss_pred C Q lcl|Aclame:pro 572 P 572 (632) Q Consensus 572 ~ 572 (632) . T Consensus 304 d 304 (304) T protein:vir:99 304 N 304 (304) T ss_pred C Confidence 6 No 181 >protein:vir:79246 Length: 304 # NCBI annotation: conserved hypothetical protein # Family: family:all:776 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469162;genbank:gi:157835004;genbank:GeneID:5648827 Probab=96.20 E-value=9.9e-05 Score=42.53 Aligned_cols=209 Identities=14% Similarity=0.060 Sum_probs=119.6 Q ss_pred hh-hhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEecCCccc-cccccCcccc Q lcl|Aclame:pro 347 MP-HEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANF-YWIGEDEDVQ 424 (632) Q Consensus 347 ~~-~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~ 424 (632) +. ......++ +..-+...+.+.+...++.....+..++..+..-++.-++..|.. .|++ +.. T Consensus 1 M~ii~~~~L~~-------------l~~~~~~~f~~~~~~a~~~~~~iA~~VpSt~~~~tY~WLg~~P~mreWiG---~r~ 64 (304) T protein:vir:79 1 MAIITPALISA-------------LKTSFQKHFQDALATAPSTYLQVATVIPSTTASNTYGWLGQFPKLREWIG---QRV 64 (304) T ss_pred CCccCHHHHHH-------------HHHHHHHHHHHHHhhcCcccceeEeEeecCccccccchhcccccchhhhh---hhh Confidence 00 00000000 111133445555555555555567788887777777778888875 5674 445 Q ss_pred cCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCC---Ccccccc-ceecccccc- Q lcl|Aclame:pro 425 DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTG---LANDPVG-LLNMTGVPA- 499 (632) Q Consensus 425 ~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g---~~~~~~G-il~~a~~~~- 499 (632) ...+.....++.=++|-.-+.|.|.-|+||.+++..-+.+.||++++..=|..++.-.. +....+| -+|.++|.. T Consensus 65 i~~l~~~~y~I~Nk~fE~Tv~V~R~dIEDD~~Giy~p~~~~~G~~aa~~Pd~lvf~lL~~Gf~t~CyDGq~FFdtDHpv~ 144 (304) T protein:vir:79 65 IKDMAAQGYQITNKLFESTVGVKRTDIEDDNLGVYGPLMQEMGRAAGAHPDELVFALLKAGNANLCYDGQNFFDTDHPVY 144 (304) T ss_pred hhhhhhccceeeccccccceeeccccccccccCchHHHHHHHHHHHhcCchhhHHHHHHhhhcccCCCcccccccCCccc Confidence 55666677788889999999999999999999999999999999999877776642211 0111111 111111110 Q ss_pred --------------c----------------------------------------------------------------- Q lcl|Aclame:pro 500 --------------L----------------------------------------------------------------- 500 (632) Q Consensus 500 --------------~----------------------------------------------------------------- 500 (632) . T Consensus 145 ~~~d~~g~~~~vsn~~~~~~~~g~~w~LlD~sr~iKP~I~Q~Rk~~~~~~~~~~~d~~Vf~~~e~~yGvd~R~n~GygfW 224 (304) T protein:vir:79 145 PNVDGTGTATTVSNLFAPAADPGAAWYLLDTSRSLKPLIYQERMKPSFTSLTKEDNEQVFMADEYVYGVRSRCNVGFGFW 224 (304) T ss_pred cccccccccccceeeccCCCCCCCeEEEEeCCCcccceeeeccccceeeecCCCCchhhhhhcceEEeeeeeeccchhhh Confidence 0 Q ss_pred ---cccccchhHHHHHHHHHHHHhhccccc-----cceEEeehhHHHHHHHHhhcccCCceeeccccccC-cceEEcCCC Q lcl|Aclame:pro 501 ---TYPAGGVDWASVVDMETKISTFNADAG-----RLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEVNG-YRAEASNQI 571 (632) Q Consensus 501 ---~~~~~~~~~~~i~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~~~l~G-~pv~~~~~~ 571 (632) .++.++++.+.+..++.+|+.+..+.+ .+.++++|..........+....- .-...|++.| ..+++++.+ T Consensus 225 QlA~gS~a~Ls~~nl~aAr~aMr~qk~d~G~pL~I~P~~LvVPp~LE~~A~~ll~a~~~-~~G~tNp~~g~~eliV~P~L 303 (304) T protein:vir:79 225 QLAAMSTEELNQVNFEKVYDAMRNQKADGGRPLDIRPNLLVVPTTLRSKAKEVVGVQRL-ANGADNPNFELVQVLDTAWL 303 (304) T ss_pred hhhhhcCCccchHHHHHHHHHHHhhcCCCCceeccccCEEEecchhHHHHHHHHhhhhc-CCCCcceecceEEEEeeccc Confidence 012245666777777777776654332 234555665554444433322110 0012355555 466666666 Q ss_pred C Q lcl|Aclame:pro 572 P 572 (632) Q Consensus 572 ~ 572 (632) . T Consensus 304 d 304 (304) T protein:vir:79 304 N 304 (304) T ss_pred C Confidence 6 No 182 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=96.17 E-value=0.00089 Score=37.31 Aligned_cols=225 Identities=11% Similarity=0.100 Sum_probs=116.4 Q ss_pred hhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCcee-EEEEEecCCccccccccCccccc Q lcl|Aclame:pro 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSGANFYWIGEDEDVQD 425 (632) Q Consensus 347 ~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~~ 425 (632) ++. -+....+.........+.-....+++.+.+.+.+.... ....+.... ....+.++-|.+.|..=+..++. T Consensus 1 m~~-----~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~l-pf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~ 74 (330) T protein:vir:10 1 MAT-----LSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDM-TAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLP 74 (330) T ss_pred CCc-----CCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhc-chhhccCCcccceeEEeecCCchhhhcCCcccc Confidence 000 00000000000000111112234667776665544331 112121111 22344567788999999999999 Q ss_pred CcccceeeeeeeeeeeeeehhhHHHhhcCh-h-HHHHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccc---eecc--- Q lcl|Aclame:pro 426 SDFDFTTLSFSPKTIAGAVPVTRKLRKQSS-I-HVENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGL---LNMT--- 495 (632) Q Consensus 426 ~~~~~~~~~~~~~t~~~~~~iSre~l~d~~-~-~~~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gi---l~~a--- 495 (632) ++.++.+++-..+-+++.+.|.|.+..-.. . ++.....+.+.+++.+.....+|+|+.+. ..+.|+ ++.. T Consensus 75 s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~ 154 (330) T protein:vir:10 75 NKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAE 154 (330) T ss_pred ccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCC Confidence 999999999999999999999997754321 1 23344667789999999999999986432 112222 1000 Q ss_pred ---------cc----------------------------------c---------------------------------- Q lcl|Aclame:pro 496 ---------GV----------------------------------P---------------------------------- 498 (632) Q Consensus 496 ---------~~----------------------------------~---------------------------------- 498 (632) +. + T Consensus 155 ~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~ 234 (330) T protein:vir:10 155 NKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRY 234 (330) T ss_pred chhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCccc Confidence 00 0 Q ss_pred -----cccccc--cchhHHHHHHHH----HHHHhhccccccceEEeehhHHHHHHHHhhcccCCceeec------ccccc Q lcl|Aclame:pro 499 -----ALTYPA--GGVDWASVVDME----TKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQ------NNEVN 561 (632) Q Consensus 499 -----~~~~~~--~~~~~~~i~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~------~~~l~ 561 (632) |+..+. ......+|++++ ..++ +....++.|.|+-....++.+....-.+-+.-+. ...+. T Consensus 235 vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip--~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~ 312 (330) T protein:vir:10 235 VARVCNIDVSDLATSANAQALIKYMIMAAERIP--QLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFD 312 (330) T ss_pred EEEEeecccccCCCCccHHHHHHHHHHHHHhcc--CCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEEC Confidence 000000 000112333332 2222 2234567788998888888765433332222221 13578 Q ss_pred CcceEEcCCCCCccEEEE Q lcl|Aclame:pro 562 GYRAEASNQIPADTWIFG 579 (632) Q Consensus 562 G~pv~~~~~~~~~~~~~g 579 (632) |.||..++++-.+.-.+. T Consensus 313 gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 313 GIPVQRTDALLNTESRVV 330 (330) T ss_pred CeEEEEEeeeecCccccC Confidence 888888877654432222 No 183 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=95.71 E-value=0.0016 Score=35.94 Aligned_cols=264 Identities=11% Similarity=0.016 Sum_probs=118.9 Q ss_pred cccccccccceechh--hhhHHHHHHHhhhhhhhhhccee--------eccCceeEEEEEecCC-ccc-ccc-ccC--cc Q lcl|Aclame:pro 358 EKKTAGKGGELVATE--LLSEEFIDILRNKAIIGQMGARM--------LPGLVGDVDIPKKTSG-ANF-YWI-GED--ED 422 (632) Q Consensus 358 ~~~~~~~~~~~i~~~--~~~~~i~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~-~~a-~~v-~E~--~~ 422 (632) +. .+.-..++.++ ++..-+.+...+.+.+.+.+.-. .......+++|....- +.. ..+ ..+ +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MA--ITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11 11222334443 34444444444444444443322 1122334555655331 221 112 222 22 Q ss_pred cccCcccceeeeeeeeeeeeeehhh---HHHhhcChhHHHHHHHHHHHHHHHHHHHHHHh---hcCCCcccccc--ceec Q lcl|Aclame:pro 423 VQDSDFDFTTLSFSPKTIAGAVPVT---RKLRKQSSIHVENLIREDLIEGIGVALDLAML---TGTGLANDPVG--LLNM 494 (632) Q Consensus 423 ~~~~~~~~~~~~~~~~t~~~~~~iS---re~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~---~g~g~~~~~~G--il~~ 494 (632) .+..+++-++-.-...-.+..+..+ ..+- .-+....|.++++....+...+.++ .|.-......+ ..+. T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~ls---G~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~ 155 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELT---SQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQ 155 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhh---CchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhc Confidence 3334444333333333334444433 3322 2244666777777666665555443 22211111000 0111 Q ss_pred cccccccccccchhHHHHHHHHHHHHhhcc---ccccceEEeehhHHHHHHHHhh----cccCCceeeccccccCcceEE Q lcl|Aclame:pro 495 TGVPALTYPAGGVDWASVVDMETKISTFNA---DAGRLAYLTSVTQRGAAKKAQV----FDNTGERIWQNNEVNGYRAEA 567 (632) Q Consensus 495 a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----~d~~g~~~~~~~~l~G~pv~~ 567 (632) .++..-..+.+..+...+.++..+|..... .......+||......+...++ ++.+|.. .-++++|++|++ T Consensus 156 ~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~--~i~ty~G~~Viv 233 (349) T protein:vir:78 156 NDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNT--MFATYQGYRVIV 233 (349) T ss_pred ccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCc--ccceecCeEEEE Confidence 122222223334677888888888766521 1223456788777776665544 3333321 226789999999 Q ss_pred cCCCCCc---------cEEEEehhhEEEEEec---ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 568 SNQIPAD---------TWIFGDWSQIVIAMWG---VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 568 ~~~~~~~---------~~~~gd~s~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.+|-. +++||. ..+.+.... .++..+++...-..++..+..+.+ .+.||.++.+-+..- T Consensus 234 DD~~Pv~~~g~~~~yttylfg~-GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~---~~~hp~G~s~~~a~v 306 (349) T protein:vir:78 234 DDSMTVVGQGAQRKFISIIFGQ-GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYRFTSAVI 306 (349) T ss_pred eCCCccccCCCCceEEEEEeec-ceEEEccCCCccceeeecccccCCcceeEEEEEeeE---EEeeeeeeeeccccc Confidence 9999842 235552 222222222 133333332221224444555444 367777777765321 No 184 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=95.66 E-value=0.0017 Score=35.82 Aligned_cols=289 Identities=9% Similarity=-0.014 Sum_probs=121.9 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhh--cce Q lcl|Aclame:pro 317 ATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM--GAR 394 (632) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~--~~~ 394 (632) ..+. -......+..+.....+.... ...-+...+ .....+...+.....+-+.+...+....+ ... T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~-~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~ 68 (329) T protein:vir:10 1 MDGI--FITGVKTMNKEIKNATGKLKL---------NLQHFANKS-VEPGDTLLKNKHVGILEKVTAANSYSAPAVISND 68 (329) T ss_pred CCce--EEechhhhhhhhhcccceeEE---------ehhhhcCCc-cCCchhHHHHHHHHHHHHHHHhhceeeeeecccc Confidence 0000 000000111111100000000 000000000 00011111222222222222222211111 111 Q ss_pred eeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh----hHHHHHHHHHHHHHH Q lcl|Aclame:pro 395 MLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS----IHVENLIREDLIEGI 470 (632) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~----~~~~~~i~~~l~~a~ 470 (632) ........+++|+.+..+-.. ..-++-+..+.++.+.+++.+.. .+.+.+.=.-+..+. +.+...+.+.+...+ T Consensus 69 ~e~~~g~tVkIp~i~~~gl~D-Y~R~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v 146 (329) T protein:vir:10 69 AIFMQGRSFTVIKGDVTELKD-YKRNATNEFDHPQIQETTYFLDQ-EKYWGRFVDALDRRDTEGNIDINYVVAKQASEVV 146 (329) T ss_pred eeeccCcEEEEeeeccccccc-ccCCCCccccccccceeEEEeec-ccceeeecchhhHhhhhhhhhHHHHHHHHHHHHh Confidence 223345678888887644333 33344456666666666665544 334433321121111 122233344455555 Q ss_pred HHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHhhc--- Q lcl|Aclame:pro 471 GVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVF--- 547 (632) Q Consensus 471 a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 547 (632) +-.+|...+.-...+. | ..........-.++.|.++...|..+..+. ....++.|.....+...... T Consensus 147 ~pEiDay~~skla~~a---~------~~~~~~~t~~nay~~i~~a~~~Lde~~vp~-~Rvl~VtP~~~~~Lk~~~~f~~~ 216 (329) T protein:vir:10 147 APYLDNLRFATLARNK---A------KHLTVGSGADAQYDAVLDVSVELDEIGAGA-SRILFVTPKFYKGIKKFVIELPQ 216 (329) T ss_pred hhHHHHHHHHHHHhhc---c------cccccccCHHHHHHHHHHHHHHHHhcCCCC-CcEEEeCHHHHHHHHhhhhhhcc Confidence 5666665443221110 0 001111122335788999999998876553 44444555555544432111 Q ss_pred -ccCCceee--ccccccCcceEEcCC--CCCccEEEEehhhEEEE-EecceEEEEecccccccCcEEEEEEEEeCcEEec Q lcl|Aclame:pro 548 -DNTGERIW--QNNEVNGYRAEASNQ--IPADTWIFGDWSQIVIA-MWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRR 621 (632) Q Consensus 548 -d~~g~~~~--~~~~l~G~pv~~~~~--~~~~~~~~gd~s~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~ 621 (632) +.....+. ..+.|.|.+|+.++. +..-.++++..+.+... ....+++.....- .....|+.+.++|+.|.+ T Consensus 217 ~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K~~~~~~~~p~~~---~~a~~v~gr~yyd~~V~~ 293 (329) T protein:vir:10 217 GDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQANEAKLNSNVPG---MFGTLAEQMLYTGAFVPE 293 (329) T ss_pred ccccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeeeeeeeeeeeeCCCCc---cchheeeeeeeeeeEEEc Confidence 11111222 235799999987643 33445677766554332 2223333322222 234689999999999999 Q ss_pred cc--ceEEEEecC Q lcl|Aclame:pro 622 KE--AFCIAKKGA 632 (632) Q Consensus 622 ~~--a~~~~~~~A 632 (632) |+ ++.....+| T Consensus 294 ~k~~~I~~~~~~a 306 (329) T protein:vir:10 294 HLQKYIFTIGGKE 306 (329) T ss_pred cccCEEEEecccC Confidence 98 444444444 No 185 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=95.55 E-value=0.0019 Score=35.56 Aligned_cols=257 Identities=11% Similarity=0.043 Sum_probs=122.6 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC------ceeEEEEEecCCcccccc-ccCcccccCcccc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL------VGDVDIPKKTSGANFYWI-GEDEDVQDSDFDF 430 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~a~~v-~E~~~~~~~~~~~ 430 (632) +..+.. ...++++.+.+++.+.+..++.++..+...+. ...+++++........+- ..+.......+.. T Consensus 1 MaN~ll----T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPNNLD----SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccchh----hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 111110 12356667777787777777666644433332 225666654433322222 2333344566777 Q ss_pred eeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC-CCccccccceeccccccccccccchh Q lcl|Aclame:pro 431 TTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT-GLANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 431 ~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~-g~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) .++.+.+.+.-. .+.++.+-+..+.-++ ..+.+.-.++++..+|..++... +......| . .....-. T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~g------t----~~t~~~a 145 (423) T protein:vir:17 77 GKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLG------S----PNTPITK 145 (423) T ss_pred ceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------c----CCccccc Confidence 776666655544 4566665554444445 44556667889999998775331 11111111 0 0111124 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH--hhcc--cCCceeec----cccccCcceEEcCCCCCccEE-EE Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA--QVFD--NTGERIWQ----NNEVNGYRAEASNQIPADTWI-FG 579 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~d--~~g~~~~~----~~~l~G~pv~~~~~~~~~~~~-~g 579 (632) ++.+.++...|...+.+......++.+.....+... ++.. ..+.--+. .+++.|+.++.++++|..+.. ++ T Consensus 146 ~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~ 225 (423) T protein:vir:17 146 WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFG 225 (423) T ss_pred HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCcccccccee Confidence 789999999999999887776666666654443211 1111 11111111 268999999999999964321 11 Q ss_pred e-----hh-hEEE-------EEecceEEEEecccccccCcEEEEEEEEeCcEEe--------------cccceEEEEec- Q lcl|Aclame:pro 580 D-----WS-QIVI-------AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVR--------------RKEAFCIAKKG- 631 (632) Q Consensus 580 d-----~s-~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~--------------~~~a~~~~~~~- 631 (632) - .. .+.. ....++...+...+++.+-+-.+ ...|+..+ ..+-|+...-+ T Consensus 226 ~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~ 302 (423) T protein:vir:17 226 GTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQV---KFTNTYWLQQQTKQALYNGATPISFTATVTADAN 302 (423) T ss_pred ceeeecccccccccccccccceeeeeeeeeeeccCceeecceE---EecceeeecccccccccccccccceEEEEEeccc Confidence 0 00 0000 01111222222233322211111 22232222 22333332100 Q ss_pred --C Q lcl|Aclame:pro 632 --A 632 (632) Q Consensus 632 --A 632 (632) | T Consensus 303 ~~a 305 (423) T protein:vir:17 303 SDS 305 (423) T ss_pred ccc Confidence 1 No 186 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=95.53 E-value=0.0012 Score=36.53 Aligned_cols=178 Identities=13% Similarity=0.002 Sum_probs=89.5 Q ss_pred eeeehhhHHHhhc-----ChhHHHHHHHHHHHHHHHHHHHHHHhhcC--C-Cccccc--cceecc-ccccccccccchhH Q lcl|Aclame:pro 441 AGAVPVTRKLRKQ-----SSIHVENLIREDLIEGIGVALDLAMLTGT--G-LANDPV--GLLNMT-GVPALTYPAGGVDW 509 (632) Q Consensus 441 ~~~~~iSre~l~d-----~~~~~~~~i~~~l~~a~a~~~~~~~~~g~--g-~~~~~~--Gil~~a-~~~~~~~~~~~~~~ 509 (632) =--.-+|+-++.| +.+++.+.+.++++.++++..|..++... + ....|. ++-... ......+.....-+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 0112344444433 34678889999999999999999875321 1 111111 110000 00000111222336 Q ss_pred HHHHHHHHHHHhhccccccceEEeehhHHHHHHHH---hhc--c--cCCceeec---cccccCcceEEcCCCCCc--cEE Q lcl|Aclame:pro 510 ASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA---QVF--D--NTGERIWQ---NNEVNGYRAEASNQIPAD--TWI 577 (632) Q Consensus 510 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~--d--~~g~~~~~---~~~l~G~pv~~~~~~~~~--~~~ 577 (632) +.|.++...|..++.+.....+++.|.....+... ++. + .++..+.. -+.+.|.+|+.|+++|.. +-+ T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 88888999999999887777777777655444321 111 1 11111222 235899999999999962 222 Q ss_pred EEehhhEEEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 578 FGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 578 ~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+...+.. .......+.-+..+. -+.+.+++|+..+|.=. T Consensus 161 ~~~ag~~~~--------~~~~~~~yr~~fs~~------~glv~~~~Avgtvkl~~ 201 (221) T protein:vir:17 161 VTDPGDATT--------SGENNGSYRPAITDR------AGLVFHKEAADTVEVLL 201 (221) T ss_pred ccCCccccc--------cccccccccccccce------EEEEEcchheeeeeeec Confidence 222221110 000000000000011 14567788877777766 No 187 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=95.40 E-value=0.0021 Score=35.22 Aligned_cols=225 Identities=10% Similarity=0.017 Sum_probs=115.6 Q ss_pred hhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCcee-EEEEEecCCccccccccC Q lcl|Aclame:pro 342 ARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSGANFYWIGED 420 (632) Q Consensus 342 ~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~ 420 (632) ....... ..+.........+.-....+++.+.+.+.+..... ...+.... ....+.++-|.+.|..=+ T Consensus 1 m~~~~~~----------a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lp-f~e~N~~tg~~~~vrt~LP~~~fR~lN 69 (335) T protein:vir:73 1 MALIGQT----------LPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAI-YVPCNDGSKHKTTIRAGIPEPVWRRYN 69 (335) T ss_pred CCcCCCC----------chhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcc-hhcccCCcccceeEEEecCCchhhhcC Confidence 0000000 00000000000011122236666666555443311 12121111 223445677889999999 Q ss_pred cccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHH---HHHHHHHHHHHHHHHHHHHHhhcCCCc--cccccc---e Q lcl|Aclame:pro 421 EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHV---ENLIREDLIEGIGVALDLAMLTGTGLA--NDPVGL---L 492 (632) Q Consensus 421 ~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~---~~~i~~~l~~a~a~~~~~~~~~g~g~~--~~~~Gi---l 492 (632) ..++.+..++.+++-..+-+++.+.|.|.+.. ..-+. .....+.+.+++.++....+|+|+.+. ....|+ + T Consensus 70 ~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~-~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~ 148 (335) T protein:vir:73 70 QGVQPTKTQTVPVTDTTGMLYDLGFVDKALAD-RSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRF 148 (335) T ss_pred CccccccceEEEEEEEEEEecchhhhhHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhh Confidence 99999999999999999999999999986543 33333 444556689999999999999986432 112232 1 Q ss_pred ---------------eccccc----------------------------------------------------------- Q lcl|Aclame:pro 493 ---------------NMTGVP----------------------------------------------------------- 498 (632) Q Consensus 493 ---------------~~a~~~----------------------------------------------------------- 498 (632) ..-+.+ T Consensus 149 ~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl 228 (335) T protein:vir:73 149 NTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGL 228 (335) T ss_pred cCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeee Confidence 000000 Q ss_pred ------------cccccc---cchhHHHHHHHH-HHH---HhhccccccceEEeehhHHHHHHHHhhcccCCcee-ec-- Q lcl|Aclame:pro 499 ------------ALTYPA---GGVDWASVVDME-TKI---STFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERI-WQ-- 556 (632) Q Consensus 499 ------------~~~~~~---~~~~~~~i~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~~~-~~-- 556 (632) |+..+. .+.+.++|.+++ .++ .-.+.....+.|.|+-....++.+... +.....+ .. T Consensus 229 ~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~-~~~n~~l~~~~~ 307 (335) T protein:vir:73 229 SVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAM-NAKNVNLTIEEY 307 (335) T ss_pred EEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHh-ccCceeeeeecc Confidence 000000 011113344433 222 112334556788999988888876543 3333222 11 Q ss_pred ----cccccCcceEEcCCCCCcc-EEEE Q lcl|Aclame:pro 557 ----NNEVNGYRAEASNQIPADT-WIFG 579 (632) Q Consensus 557 ----~~~l~G~pv~~~~~~~~~~-~~~g 579 (632) ...+.|.||..++.+-.+. .+.+ T Consensus 308 ~g~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 308 GGKKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred CCceeEEECCeEEEEEeeeecCcccccC Confidence 1246788888887765432 2222 No 188 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=95.09 E-value=0.0028 Score=34.59 Aligned_cols=282 Identities=10% Similarity=0.041 Sum_probs=125.1 Q ss_pred hhhhhHHhhhhhcccccccccceechhh-hhHHHHHHHhhhhhhhhhccee--ecc-CceeEEEEEecCCcccc-ccccC Q lcl|Aclame:pro 346 YMPHEVLVQRQLEKKTAGKGGELVATEL-LSEEFIDILRNKAIIGQMGARM--LPG-LVGDVDIPKKTSGANFY-WIGED 420 (632) Q Consensus 346 ~~~~~~~~~~a~~~~~~~~~~~~i~~~~-~~~~i~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~a~-~v~E~ 420 (632) .+..+.-. .+..+++.++-+..+..-+ ..+.+++.... ..+.++ +.. +|- .+.++++.+....+.+. ...|| T Consensus 1 ~~~~~a~~-~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~-lv~~~f-A~~~piPkn~GkTIk~r~y~pl~~~~~pl~eG 77 (401) T protein:vir:95 1 MLNYNAPT-DGQKSSIDGANSDQMQTFFWLKKAIITARKE-QYFMPL-ASVTNMPKHYGKTIKVYEYVPLLDDRNINDQG 77 (401) T ss_pred CCccCCCc-ccccccccccccceeeehhhHHHHHhhhhhh-hhhhhc-ccccccccccCCeEEEEecccccccccchhcC Confidence 00000000 0111122222233332222 34445554443 444444 333 332 23355555554444321 12222 Q ss_pred c-----cc-----------------------------ccCcccceeeeeeeeeeeeeehhhHHHh-hcChhHHHHHHHHH Q lcl|Aclame:pro 421 E-----DV-----------------------------QDSDFDFTTLSFSPKTIAGAVPVTRKLR-KQSSIHVENLIRED 465 (632) Q Consensus 421 ~-----~~-----------------------------~~~~~~~~~~~~~~~t~~~~~~iSre~l-~d~~~~~~~~i~~~ 465 (632) - +. ..-.++-..+..++++||.+..+|..++ .+.|.++...+... T Consensus 78 v~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~e 157 (401) T protein:vir:95 78 IDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRE 157 (401) T ss_pred CCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHH Confidence 2 11 1111222345667999999999999654 44566777766444 Q ss_pred HHH-HHHHHHH---HHHhhcCCCccccccceeccccccccccccchhHHHHHHHHHHHHhhcccc--------------- Q lcl|Aclame:pro 466 LIE-GIGVALD---LAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADA--------------- 526 (632) Q Consensus 466 l~~-a~a~~~~---~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--------------- 526 (632) +.. +...+++ ..+++.-++---+.+....+....-....+.++++++..+...|.....+. T Consensus 158 ll~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~ 237 (401) T protein:vir:95 158 LMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKV 237 (401) T ss_pred HhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccc Confidence 433 3333333 234433221111111112222222334556788888888887776321111 Q ss_pred ccce--EEeehhHHHHHHHHhhcccCCceeec---------------cccccCcceEEcCCCC--------C-------- Q lcl|Aclame:pro 527 GRLA--YLTSVTQRGAAKKAQVFDNTGERIWQ---------------NNEVNGYRAEASNQIP--------A-------- 573 (632) Q Consensus 527 ~~~~--~~~~~~~~~~~~~~~~~d~~g~~~~~---------------~~~l~G~pv~~~~~~~--------~-------- 573 (632) ..+. .++++.....+ ..++|..|.+-|. -+.+.+.++++++.+- + T Consensus 238 i~~s~va~~h~~L~~di--~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~ 315 (401) T protein:vir:95 238 IGATRVMYVGSELVPEL--KAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYR 315 (401) T ss_pred cccceEEEEecCchhHH--HHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccc Confidence 1122 23444333322 3345555544332 2456677888777532 1 Q ss_pred -------------ccEEEEehhhEEEEEecce-----EEEEe-cc-----cccccCcEEEEE-EEEeCcEEecccceEEE Q lcl|Aclame:pro 574 -------------DTWIFGDWSQIVIAMWGVL-----DLKVD-PY-----TKAASDGLVLRV-FQDVDAGVRRKEAFCIA 628 (632) Q Consensus 574 -------------~~~~~gd~s~~~~~~~~~~-----~~~~~-~~-----~~~~~~~~~~~~-~~r~~~~v~~~~a~~~~ 628 (632) ..+++|+-+.-.+...++- .+.+. +. .+=..|+..+.. -...++.+.+++=++.+ T Consensus 316 ~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~i 395 (401) T protein:vir:95 316 TSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALI 395 (401) T ss_pred cccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEE Confidence 0134454433333222221 22221 11 011123333222 24678899999999999 Q ss_pred EecC Q lcl|Aclame:pro 629 KKGA 632 (632) Q Consensus 629 ~~~A 632 (632) +.+| T Consensus 396 es~a 399 (401) T protein:vir:95 396 KTVA 399 (401) T ss_pred Eeec Confidence 9999 No 189 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=94.84 E-value=0.0034 Score=34.13 Aligned_cols=264 Identities=11% Similarity=0.027 Sum_probs=118.0 Q ss_pred cccccccccceechh--hhhHHHHHHHhhhhhhhhhccee--------eccCceeEEEEEecC-Ccccc-cccc-C--cc Q lcl|Aclame:pro 358 EKKTAGKGGELVATE--LLSEEFIDILRNKAIIGQMGARM--------LPGLVGDVDIPKKTS-GANFY-WIGE-D--ED 422 (632) Q Consensus 358 ~~~~~~~~~~~i~~~--~~~~~i~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~a~-~v~E-~--~~ 422 (632) +. .+.-...+.++ ++..-+.+...+.+.+.+.+.-. .......+++|.... .+... .+.. . +. T Consensus 1 Ma--~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MA--ITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CC--ceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 11 11122334443 34444444444445555544322 112233455554433 12221 1211 1 12 Q ss_pred cccCcccceeeeeeeeeeeeeeh---hhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhh---cCCCcccc--ccceec Q lcl|Aclame:pro 423 VQDSDFDFTTLSFSPKTIAGAVP---VTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLT---GTGLANDP--VGLLNM 494 (632) Q Consensus 423 ~~~~~~~~~~~~~~~~t~~~~~~---iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~---g~g~~~~~--~Gil~~ 494 (632) .+.++++-++-.-...-.+..+. ++..+-- -+....|.+.++....+...+.++. |.-..+.. ....+. T Consensus 79 ~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~ 155 (349) T protein:vir:94 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQ 155 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHHHHHhhhccccccccccccc Confidence 33333333332222222333333 3333322 2445667777776666665555442 22111100 011111 Q ss_pred cccccccccccchhHHHHHHHHHHHHhhcc---ccccceEEeehhHHHHHHHHhh----cccCCceeeccccccCcceEE Q lcl|Aclame:pro 495 TGVPALTYPAGGVDWASVVDMETKISTFNA---DAGRLAYLTSVTQRGAAKKAQV----FDNTGERIWQNNEVNGYRAEA 567 (632) Q Consensus 495 a~~~~~~~~~~~~~~~~i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~----~d~~g~~~~~~~~l~G~pv~~ 567 (632) .++.....+.+..+...+.++..+|..... .......+||......+...++ ++.+|.. .-++++|++|++ T Consensus 156 ~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~--~i~ty~G~~Viv 233 (349) T protein:vir:94 156 NDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNT--MFATYQGYRVIV 233 (349) T ss_pred CceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCc--ccceecCcEEEE Confidence 122222233445677888888888766521 1223456777777766665544 3333321 126789999999 Q ss_pred cCCCCCc---------cEEEEehhhEEEEEec---ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 568 SNQIPAD---------TWIFGDWSQIVIAMWG---VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 568 ~~~~~~~---------~~~~gd~s~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.+|-. +++|| ...+.+.... .++..+++...-..++..+..+.+ .+.||.+|.+-+..- T Consensus 234 DD~~Pv~~~g~~~~yttylfg-~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~---~~~hp~G~s~~~a~v 306 (349) T protein:vir:94 234 DDSMTVVGQDTSRKFISIIFG-QGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYSFTSAVI 306 (349) T ss_pred eCCCccccCCCCceEEEEEee-cceEEeecCCCCcceeeecccccCCcceeEEEEEeeE---EEeeeeeeeeccccc Confidence 9999841 24555 2223232222 123333332221223444444444 367888887765321 No 190 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=93.84 E-value=0.0062 Score=32.68 Aligned_cols=260 Identities=10% Similarity=0.022 Sum_probs=122.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC------ceeEEEEEecCCcccccc-ccCcccccCcccc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL------VGDVDIPKKTSGANFYWI-GEDEDVQDSDFDF 430 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~a~~v-~E~~~~~~~~~~~ 430 (632) +..+. ....++++.+.+++.+.+..++.++.-+...+. ...+++++........+. ..+......++.. T Consensus 1 MaN~l----lT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MPNNL----DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Cccch----hhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 11110 112356667777777777777666644433322 224566665544333333 2333445566777 Q ss_pred eeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcC-CCccccccceeccccccccccccchh Q lcl|Aclame:pro 431 TTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGT-GLANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 431 ~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~-g~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) +++.+.+.+.-. .+.++.+-+..+.-++ ..+.+.-.++++..+|..++... +......| ......-. T Consensus 77 ~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~g----------t~~t~~~a 145 (423) T protein:vir:10 77 GKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLG----------SPNTPITK 145 (423) T ss_pred ceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------cCCcccch Confidence 777766655544 4566654444444344 45556667899999999876421 11111110 01111124 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHH--hhcccC--Cceeec----cccccCcceEEcCCCCCccEEEEe Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKA--QVFDNT--GERIWQ----NNEVNGYRAEASNQIPADTWIFGD 580 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~d~~--g~~~~~----~~~l~G~pv~~~~~~~~~~~~~gd 580 (632) ++.+.++..+|...+.+......++.+.....+... ++...+ +.--+. .+++.|+.++.++++|..+....- T Consensus 146 ~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~ 225 (423) T protein:vir:10 146 WSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFG 225 (423) T ss_pred HHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccccc Confidence 789999999999999887776666666654443211 111111 111111 268999999999999964322110 Q ss_pred hh-----hEEE--E---Eecc----eEEEEeccccccc-C-cEEE---EEEEEeCcEEe------cccceEEEEec---C Q lcl|Aclame:pro 581 WS-----QIVI--A---MWGV----LDLKVDPYTKAAS-D-GLVL---RVFQDVDAGVR------RKEAFCIAKKG---A 632 (632) Q Consensus 581 ~s-----~~~~--~---~~~~----~~~~~~~~~~~~~-~-~~~~---~~~~r~~~~v~------~~~a~~~~~~~---A 632 (632) .+ ...+ . .... +...+-..+.+.+ | .+.| ...++....++ ...-|+...-+ + T Consensus 226 ~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~ 305 (423) T protein:vir:10 226 GTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDS 305 (423) T ss_pred cceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeecc Confidence 00 0000 0 0011 1111111122221 1 1111 11112222211 22334433211 1 No 191 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=93.64 E-value=0.0068 Score=32.45 Aligned_cols=307 Identities=12% Similarity=-0.009 Sum_probs=132.6 Q ss_pred hHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHH Q lcl|Aclame:pro 299 IQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEF 378 (632) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i 378 (632) ..+... .....+.......... ....+... ..+ ........+.+ +...+......+++...+ T Consensus 1 ~~~~~~----~~~l~~~gi~~~~~~~----~~~~~~~~--------~~~-da~d~~~~~~~-~~~~~i~~~l~~~i~p~~ 62 (336) T protein:vir:10 1 MRDAQR----IQNLARAGVILPRSVQ----NVSTPLTE--------YAM-DAADLSPHLSS-TGSSGIPNYLTTYVDPAV 62 (336) T ss_pred CchHHH----HHHHhhcCeeecchhh----hhhhhHHH--------hhh-hhhhccCcccc-CCCchhHHHHHhhcccce Confidence 000000 0000000000000000 00000000 000 00000011111 112222223333443444 Q ss_pred HHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh- Q lcl|Aclame:pro 379 IDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS- 455 (632) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~- 455 (632) ++.+.+.-....+......++ ...+.+......+.+...+-+...|..+......+..++.++..+.++.+-+.... T Consensus 63 ~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~ 142 (336) T protein:vir:10 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHH Confidence 444433333333322111111 12345555566677788888888888888888888899999999999965554432 Q ss_pred --hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc------ccc--chhHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 456 --IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY------PAG--GVDWASVVDMETKISTFNAD 525 (632) Q Consensus 456 --~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~------~~~--~~~~~~i~~~~~~~~~~~~~ 525 (632) +++.+.-...-.+++.+.+|+..+.|+.. ...-|++|+.......+ +.+ .--+++|..++..+..+... T Consensus 143 ~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G 221 (336) T protein:vir:10 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCC Confidence 35666677777777778888777777753 34568999876642111 111 12356677777777765432 Q ss_pred ----cccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCCccEEEEehhhEEEEEecc---eEEEE Q lcl|Aclame:pro 526 ----AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV---LDLKV 596 (632) Q Consensus 526 ----~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~---~~~~~ 596 (632) ......++.+.....+ . ..+..|.-++.- ..+-++.++..+...... |+...+.+....+ ..+. T Consensus 222 ~i~~~~~~tL~LP~~~~~~L--s-~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~---G~~~~l~~~~~~~~~t~~~~- 294 (336) T protein:vir:10 222 IITQEDVLRMGLPPTAMSDL--S-KTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCG- 294 (336) T ss_pred eecccCcceEEecHHHHHhc--c-CCCccCccHHHHHHHhcCccEEEEccccccCC---CceEEEEEEecCCCcceeee- Confidence 1233444544443333 2 223334323221 112223333333221110 1111111111111 1110 Q ss_pred eccc------ccccCcEEEEEEEEeCc-EEecccceEEEEec Q lcl|Aclame:pro 597 DPYT------KAASDGLVLRVFQDVDA-GVRRKEAFCIAKKG 631 (632) Q Consensus 597 ~~~~------~~~~~~~~~~~~~r~~~-~v~~~~a~~~~~~~ 631 (632) .+.. ..........+..|.+| -+.+|.||+.++== T Consensus 295 ~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1100 01112234444555555 56889998886655 No 192 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=93.56 E-value=0.0071 Score=32.36 Aligned_cols=306 Identities=11% Similarity=-0.023 Sum_probs=130.1 Q ss_pred hHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhH-HhhhhhcccccccccceechhhhhHH Q lcl|Aclame:pro 299 IQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEV-LVQRQLEKKTAGKGGELVATELLSEE 377 (632) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~~~~~~~~~~~i~~~~~~~~ 377 (632) ..+... .....+.......... ....+.. ....+. .....+ .++...+......+++... T Consensus 1 ~~~~~~----~~~l~~~gi~~~~~~~----~~~~~~~----------~~~~da~d~~~~~-~~~~~~~~~~~l~~~i~p~ 61 (336) T protein:vir:36 1 MRDAQR----IQNLARAGVILPRSVQ----NVSTPLT----------EYAMDAADLSPHL-SSTGSSGIPNYLTTYVDPS 61 (336) T ss_pred CchHHH----HHHHhhcCeeecchhh----hhhhHHH----------HhhhhhhhccCcc-ccCCCcchHHHHHHhhccc Confidence 000000 0000000000000000 0000000 000000 000001 1111222222222333233 Q ss_pred HHHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh Q lcl|Aclame:pro 378 FIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS 455 (632) Q Consensus 378 i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~ 455 (632) +++.+.+.-....+......++ ...+.+......+.+...+-+...|..+......+..++.++..+.++.+-+.... T Consensus 62 ~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa 141 (336) T protein:vir:36 62 VIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAG 141 (336) T ss_pred eEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHH Confidence 4444333333333322111111 12345555566677788888888888888888888899999999999854444322 Q ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc------cc--cchhHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 456 ---IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY------PA--GGVDWASVVDMETKISTFNA 524 (632) Q Consensus 456 ---~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~------~~--~~~~~~~i~~~~~~~~~~~~ 524 (632) +++.+.-...-.+++.+.+|+..+.|+.. ...-|++|+.......+ +. ..--+++|..++..+..+.. T Consensus 142 ~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~-~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~ 220 (336) T protein:vir:36 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQ 220 (336) T ss_pred HhCCCcHHHHHHHHHHHHHHhhCcEEEEeccc-cceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcC Confidence 35556666777777777777777777753 34568998776642111 11 11235667777777776543 Q ss_pred c----cccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCCccEEEEehhhEEEEEecc---eEEE Q lcl|Aclame:pro 525 D----AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV---LDLK 595 (632) Q Consensus 525 ~----~~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~---~~~~ 595 (632) . ......++.+.....+ . ..+..|.-++.- ..+-++.++..+.+.... |+...+.+....+ ..+. T Consensus 221 G~i~~~~~~tL~LP~~~~~~L--s-~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~---g~~~~l~~~~~~~~~t~~~~ 294 (336) T protein:vir:36 221 GIITQEDVLRMGLPPTAMSDL--S-KTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS---GRLVQLWAPRVEGKDTATCG 294 (336) T ss_pred CeeeeccccEEEechHHHHhc--c-CCCccCccHHHHHHHhcCccEEEEccccccCC---CceEEEEEEecCCCcceeee Confidence 2 1233444544443333 2 223334323221 112222333333221110 1111111111111 1110 Q ss_pred Eeccc------ccccCcEEEEEEEEeCc-EEecccceEEEEec Q lcl|Aclame:pro 596 VDPYT------KAASDGLVLRVFQDVDA-GVRRKEAFCIAKKG 631 (632) Q Consensus 596 ~~~~~------~~~~~~~~~~~~~r~~~-~v~~~~a~~~~~~~ 631 (632) .+.. ..........+..|.+| -+.+|.||+.++== T Consensus 295 -~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 295 -FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -cchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1100 01112234444555555 56889998886655 No 193 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=91.87 E-value=0.014 Score=30.78 Aligned_cols=322 Identities=10% Similarity=0.019 Sum_probs=123.7 Q ss_pred HhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 270 NPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPH 349 (632) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (632) ....+..+...... ..+.. ..........+.+++.+....... T Consensus 1 ~~~~~~~~~~~~~~----------------------------~~~~~---------~~~~~~~~~~~~l~~~gi~~~~~~ 43 (382) T protein:vir:96 1 MSHISKTHSRLAGR----------------------------HAKPF---------DLKNVTHEAVAALGRIGLVFDHAV 43 (382) T ss_pred CCCcceeeeecCCc----------------------------cccch---------hhhcccHHHHHHHhccccccCccc Confidence 00000000000000 00000 000000000011111111100000 Q ss_pred ------------hHHhhhhhc-----cccc-ccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEec Q lcl|Aclame:pro 350 ------------EVLVQRQLE-----KKTA-GKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKT 409 (632) Q Consensus 350 ------------~~~~~~a~~-----~~~~-~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 409 (632) ......++. ..+. ..+.+....++....+++.+.+.-....+......++ ...+.+.... T Consensus 44 ~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e 123 (382) T protein:vir:96 44 VQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVE 123 (382) T ss_pred chhHhhhhhhhhhhhhhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeee Confidence 000001111 1111 1111222233344445555544444444422222222 2345666666 Q ss_pred CCccccccccCcccccCcccceeeeeeeeeeeeeehhhH-HHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC-- Q lcl|Aclame:pro 410 SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTR-KLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGL-- 484 (632) Q Consensus 410 ~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSr-e~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~-- 484 (632) ..+.+.+++-+...|..+......+-.+..++..+.++. |+..-. .+++.+.-.....+++.+.+|+..|.|+.. T Consensus 124 ~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~ 203 (382) T protein:vir:96 124 PAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGL 203 (382) T ss_pred cccceEEeecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCc Confidence 677888888888888877776666666777777777753 333221 345566667777788888888888888532 Q ss_pred ccccccceecccccccc----ccccchhH----HHHHHHHHHHHhhcccc-----ccceEEeehhHHHHHHHHhhcccCC Q lcl|Aclame:pro 485 ANDPVGLLNMTGVPALT----YPAGGVDW----ASVVDMETKISTFNADA-----GRLAYLTSVTQRGAAKKAQVFDNTG 551 (632) Q Consensus 485 ~~~~~Gil~~a~~~~~~----~~~~~~~~----~~i~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~d~~g 551 (632) .+..-|++|+..++... .....-+. ++|..++..+..+.... .+...++.+.....+ .. .+..| T Consensus 204 ~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~L--s~-~n~~g 280 (382) T protein:vir:96 204 GNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYL--SV-TTPYG 280 (382) T ss_pred CcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhc--cc-cCccC Confidence 34456999988754221 11122233 45556666665544211 112234444433322 11 22223 Q ss_pred ceeec--cccccCcceEEcCCCCC-------cc---EEEEehhhE--EEEEecceEEEEecccccccCcEE-------EE Q lcl|Aclame:pro 552 ERIWQ--NNEVNGYRAEASNQIPA-------DT---WIFGDWSQI--VIAMWGVLDLKVDPYTKAASDGLV-------LR 610 (632) Q Consensus 552 ~~~~~--~~~l~G~pv~~~~~~~~-------~~---~~~gd~s~~--~~~~~~~~~~~~~~~~~~~~~~~~-------~~ 610 (632) .-++. ...+.++.++..+.+.. +. +++.+--.. .........+...--..+....+. .. T Consensus 281 ~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~ 360 (382) T protein:vir:96 281 ISVSDWIEQTYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVED 360 (382) T ss_pred ccHHHHHHHhcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEec Confidence 22221 01122223333222210 00 111110000 000000000000000001111111 11 Q ss_pred EE-EEeCcEEecccceEEEEec Q lcl|Aclame:pro 611 VF-QDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 611 ~~-~r~~~~v~~~~a~~~~~~~ 631 (632) .. ...|+-+..|.||+.++== T Consensus 361 ~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 361 FSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred cccceeeeEEEcchhhhhccCC Confidence 11 2356677889888876544 No 194 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=91.61 E-value=0.015 Score=30.58 Aligned_cols=303 Identities=10% Similarity=-0.065 Sum_probs=131.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHH----hhhhhhhhhhhHHhhhh-----hcccccccccceechhhhh Q lcl|Aclame:pro 305 QQYSLMRAINAAATGDWSKAGFEREVSLAIADASG----KEARGFYMPHEVLVQRQ-----LEKKTAGKGGELVATELLS 375 (632) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a-----~~~~~~~~~~~~i~~~~~~ 375 (632) ..-...+. ....+++..- +...............+ ...++..++-+....+++. T Consensus 1 ~~~~~~~~-----------------~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~ 63 (339) T protein:vir:94 1 MSINNDRT-----------------DIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVD 63 (339) T ss_pred CceechHH-----------------HHHHHHhhceeeccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhc Confidence 00000000 0000000000 00000000000000001 0111111222223344454 Q ss_pred HHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhc Q lcl|Aclame:pro 376 EEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ 453 (632) Q Consensus 376 ~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d 453 (632) ..+.+...+.-....+......++ ...+.+......+.+.+++.+...|..+..-....-.+..+...+.++.+-+.. T Consensus 64 ~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~ 143 (339) T protein:vir:94 64 RRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMAT 143 (339) T ss_pred hhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHH Confidence 555555555444444432222222 345777788888889999988888877755555555555555555555443332 Q ss_pred C---hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccc---cccchhH----HHHHHHHHHHHhhc Q lcl|Aclame:pro 454 S---SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY---PAGGVDW----ASVVDMETKISTFN 523 (632) Q Consensus 454 ~---~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~---~~~~~~~----~~i~~~~~~~~~~~ 523 (632) . .+++.+.-.....+++.+.+|+..|.|+.. ....|++|+..+..... ..+.-+. ++|..++.++..+. T Consensus 144 A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~-~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s 222 (339) T protein:vir:94 144 YGEAGIDYVARQEISASLVMAKFANSSYLLGVAG-IANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQS 222 (339) T ss_pred HHhhCCChHHHHHHHHHHHHHHhhceEEeeeecc-cceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhc Confidence 2 246667777778888888888888888643 34578999876643211 1112233 55556666665553 Q ss_pred ccc----ccceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCC---ccE-EEEeh----hhEEEEEe Q lcl|Aclame:pro 524 ADA----GRLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPA---DTW-IFGDW----SQIVIAMW 589 (632) Q Consensus 524 ~~~----~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~---~~~-~~gd~----s~~~~~~~ 589 (632) ... .+...++.+.....+. . .+..|.-++.- ....++.++..+.+.. +.. ++.+- ....+..- T Consensus 223 ~g~~~~~~~~~L~LP~~~~~~L~--~-~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p 299 (339) T protein:vir:94 223 GGLITGQERMVMALAPSALNNVN--R-TNNFGLSAGAKIAQTYPNIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFA 299 (339) T ss_pred CCeeeeccCcEEEecHHHHHhcc--c-CCcCCccHHHHHHHhcCCcEEEEccccccCCCceEEEEEEeccCCcceEEEcc Confidence 211 1223445555444332 2 23333323221 1122233333332211 111 11100 01111111 Q ss_pred cceEEEEecccccccCcEEEEEEEE-eCcEEecccceEEEEec Q lcl|Aclame:pro 590 GVLDLKVDPYTKAASDGLVLRVFQD-VDAGVRRKEAFCIAKKG 631 (632) Q Consensus 590 ~~~~~~~~~~~~~~~~~~~~~~~~r-~~~~v~~~~a~~~~~~~ 631 (632) . .+...+- .........-+..| .|+-++.|.||+.++== T Consensus 300 ~--~~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 300 E--KLRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred h--hhhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 0 1111110 11223344555566 45577899999886655 No 195 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=91.21 E-value=0.017 Score=30.29 Aligned_cols=307 Identities=11% Similarity=-0.010 Sum_probs=131.9 Q ss_pred hHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHH Q lcl|Aclame:pro 299 IQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEF 378 (632) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i 378 (632) ..+... .....+. ...+. .+ ............+. .......+ .+....|.......++...+ T Consensus 1 ~~~~~~----~~~l~~~-------gi~~~----~~-~~~~~~~~~~~a~d-a~d~~~~~-~t~~~~g~~~~l~~~i~p~~ 62 (336) T protein:vir:78 1 MRDAQR----IQNLARA-------GVILP----RS-VKNVSTPLAEYAMD-AADLSPHL-SSTGSSGIPNYLTTYVDPSV 62 (336) T ss_pred CchHHH----HHHHhcc-------Ceecc----hh-hhhhhHHHHHHHHh-hhhhcccc-ccCCCcchHHHHHHhcccce Confidence 000000 0000000 00000 00 00000000000000 00000011 11111222223333443344 Q ss_pred HHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh- Q lcl|Aclame:pro 379 IDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS- 455 (632) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~- 455 (632) ++.+.+.-....+......++ ...+.+......+.+..++-+...|..+......+-.++.|+..+.++.+-+.... T Consensus 63 ~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~ 142 (336) T protein:vir:78 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHH Confidence 444444333333322211222 23455666666777888888888899999999999999999999999976555432 Q ss_pred --hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccc----cch----hHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 456 --IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA----GGV----DWASVVDMETKISTFNAD 525 (632) Q Consensus 456 --~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~----~~~----~~~~i~~~~~~~~~~~~~ 525 (632) +++.+.-.....+++.+.+|...+.|+.. ....|++|+..+....+.. +.. -+++|..++..+..+... T Consensus 143 ~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g 221 (336) T protein:vir:78 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCeEEEEeccc-cceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCC Confidence 35666666677777777777777777753 4567899987764322111 112 235566666666555432 Q ss_pred c----ccceEEeehhHHHHHHHHhhcccCCceeec--cccccCcceEEcCCCCCccEEEEehhhEEEEEecc---eEEE- Q lcl|Aclame:pro 526 A----GRLAYLTSVTQRGAAKKAQVFDNTGERIWQ--NNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV---LDLK- 595 (632) Q Consensus 526 ~----~~~~~~~~~~~~~~~~~~~~~d~~g~~~~~--~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~---~~~~- 595 (632) . .....++.+.....+ .. .+..|.-++. ...+-++.++..+.+... -|+...++..+..+ .++. T Consensus 222 ~~~~~~~~tL~Lp~~~~~~L--~~-~n~~g~tv~~~lk~n~Pnl~i~t~pel~~A---gg~~~~~~~~~~~~~~t~~~~~ 295 (336) T protein:vir:78 222 IITQEAVLHMGLPPTAMSDL--SK-TNQYGLSAAAKLKEIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCGF 295 (336) T ss_pred eeeeccceEEEechHHHHhc--cC-CCccCccHHHHHHHhcCccEEEEccccccc---CcceEEEEEeeccCCcceeeec Confidence 1 122344444444333 22 2333322221 011112233333322210 01111111111111 1110 Q ss_pred -----EecccccccCcEEEEEEEEeCc-EEecccceEEEEec Q lcl|Aclame:pro 596 -----VDPYTKAASDGLVLRVFQDVDA-GVRRKEAFCIAKKG 631 (632) Q Consensus 596 -----~~~~~~~~~~~~~~~~~~r~~~-~v~~~~a~~~~~~~ 631 (632) ..+- .............|.+| -+.+|-||+.++== T Consensus 296 p~~f~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 296 TEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred chhhhccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 0000 01112233444455554 56788888876655 No 196 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=87.34 E-value=0.039 Score=28.28 Aligned_cols=289 Identities=10% Similarity=0.040 Sum_probs=125.9 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEE Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDI 405 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 405 (632) +...+....+..+........ .+......+..+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-.. T Consensus 1 m~~~m~~~tr~~~~~y~~~~A---------~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-nvv~V~e~~Ge~ 69 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLA---------KSYGVSNVAELFNVSPQL-ETKLRAAITESAEFLKMI-TVTTVDQIEGQV 69 (341) T ss_pred CcccccHHHHHHHHHHHHHHH---------HHcCcccccceEeecHHH-HHHHHHHHHhhHHhhhcC-ccccccceeeeE Confidence 111122222222211111100 111112223344555554 356777777777665542 233333222222 Q ss_pred EEe-cCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC-----hhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 406 PKK-TSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS-----SIHVENLIREDLIEGIGVALDLAML 479 (632) Q Consensus 406 ~~~-~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~-----~~~~~~~i~~~l~~a~a~~~~~~~~ 479 (632) .-. .+++-+.-.. .+..+. .++.+...|.....---..|+.+.|..- ..++...+.+.+.++++.-.=..-| T Consensus 70 v~lg~~g~iagrtd-t~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGf 147 (341) T protein:vir:27 70 VDVGVSGLYTGRKA-GGRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGW 147 (341) T ss_pred eecccccceeeccC-CCceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcc Confidence 222 2233333322 122222 2355666666666666666777666321 2455666666666666554333335 Q ss_pred hcCCCc------cccc------ccee----cccccc----ccccccchhHHHHH----HHHH-HHHhhccccccceEEee Q lcl|Aclame:pro 480 TGTGLA------NDPV------GLLN----MTGVPA----LTYPAGGVDWASVV----DMET-KISTFNADAGRLAYLTS 534 (632) Q Consensus 480 ~g~g~~------~~~~------Gil~----~a~~~~----~~~~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~~~ 534 (632) +|.-.+ .+|. |.+. ++.... ....+..-+|..|. +++. .+...+++....+.++. T Consensus 148 nGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG 227 (341) T protein:vir:27 148 NGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVG 227 (341) T ss_pred cceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEc Confidence 554311 1222 2111 111000 01112223344443 4443 24556666555555554 Q ss_pred hhHHHHHHHHhhcccCCce---ee---ccccccCcceEEcCCCCCccEEEEehhhEEEEEecceE---EEEeccc----c Q lcl|Aclame:pro 535 VTQRGAAKKAQVFDNTGER---IW---QNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVLD---LKVDPYT----K 601 (632) Q Consensus 535 ~~~~~~~~~~~~~d~~g~~---~~---~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~~---~~~~~~~----~ 601 (632) ......-.. .+-+....+ +. -..++.|+|.+..|.+|.+.+++--++...++...|-. +...++. . T Consensus 228 ~dLla~k~~-~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 306 (341) T protein:vir:27 228 SGLIGAAQA-KLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKT 306 (341) T ss_pred hhhhhhhhh-hhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccc Confidence 333222221 221211111 00 02579999999999999999999999988776554432 1222221 2 Q ss_pred cccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.+ .|.+ .++|+...-+-.=|++..+| T Consensus 307 yes---~YvV-Edyg~~~~~~~~~vkl~~~~ 333 (341) T protein:vir:27 307 HTG---AWKV-TQWVCWKRSPLTTQKKSTSA 333 (341) T ss_pred hhh---hhee-ehhhhhhhccccccccCccc Confidence 222 3433 34444333333333344444 No 197 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=87.32 E-value=0.04 Score=28.27 Aligned_cols=258 Identities=12% Similarity=0.057 Sum_probs=114.9 Q ss_pred cccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC------ceeEEEEEecCCccccccccCccc---ccCcc Q lcl|Aclame:pro 358 EKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL------VGDVDIPKKTSGANFYWIGEDEDV---QDSDF 428 (632) Q Consensus 358 ~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~a~~v~E~~~~---~~~~~ 428 (632) +..+ -..+.++++.+.+++.+++..++.++.-+-..+. ...+++++........ .-+..+ ...++ T Consensus 1 MANs----l~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d--~~~~~~t~~~~~~l 74 (423) T protein:vir:10 1 MANN----LDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSER--TMDGDITGKSKNSL 74 (423) T ss_pred Cccc----cccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeec--ccCcccCccccccc Confidence 1111 1114567778888888888887777654434332 2245555543222111 111111 12244 Q ss_pred cceeeeeeeeeeee-eehhhHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHhhcCCC-ccccccceeccccccccccccc Q lcl|Aclame:pro 429 DFTTLSFSPKTIAG-AVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGL-ANDPVGLLNMTGVPALTYPAGG 506 (632) Q Consensus 429 ~~~~~~~~~~t~~~-~~~iSre~l~d~~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~-~~~~~Gil~~a~~~~~~~~~~~ 506 (632) ..+++.+.+.+.-. .+.++.+-+..+.-++ ..+.+.-.++++..+|..+...... ..+. .+. .+... T Consensus 75 ~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~------vgt----~~t~~ 143 (423) T protein:vir:10 75 ISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALS------LGS----PNTPI 143 (423) T ss_pred ccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccc------ccc----ccccc Confidence 55666666655544 4667665444444455 4566666788999999877532211 1110 011 01111 Q ss_pred hhHHHHHHHHHHHHhhccccccceEEeehhHHHHHHH--HhhcccC--Cceee----ccccccCcceEEcCCCCCc---c Q lcl|Aclame:pro 507 VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK--AQVFDNT--GERIW----QNNEVNGYRAEASNQIPAD---T 575 (632) Q Consensus 507 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~d~~--g~~~~----~~~~l~G~pv~~~~~~~~~---~ 575 (632) ..++.+.++...|...+.+......++.+.....+.. ..+...+ +.--+ ..+++.|+.++.++++|.. + T Consensus 144 ~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~ 223 (423) T protein:vir:10 144 KKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGA 223 (423) T ss_pred ccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCccccccc Confidence 2478999999999999888777666676665544431 1122211 11111 1367899999999999842 1 Q ss_pred E-----------EEEehhh------E-EEEEec--ceEEEEecc------------ccc------ccCcEEEEEEEEe-- Q lcl|Aclame:pro 576 W-----------IFGDWSQ------I-VIAMWG--VLDLKVDPY------------TKA------ASDGLVLRVFQDV-- 615 (632) Q Consensus 576 ~-----------~~gd~s~------~-~~~~~~--~~~~~~~~~------------~~~------~~~~~~~~~~~r~-- 615 (632) . +-|+.-. . ....-. ..-+...+. ++. .-....|++.... T Consensus 224 ~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~ 303 (423) T protein:vir:10 224 FGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANA 303 (423) T ss_pred ccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccc Confidence 1 0011100 0 000000 000111110 000 0011123332211 Q ss_pred ----CcEEecccceEE-------EEecC Q lcl|Aclame:pro 616 ----DAGVRRKEAFCI-------AKKGA 632 (632) Q Consensus 616 ----~~~v~~~~a~~~-------~~~~A 632 (632) +..|.-.-++.. -.+.| T Consensus 304 ~a~~~~tv~i~p~~~~~~~~~~~~~V~a 331 (423) T protein:vir:10 304 HSSGDVTVKISGVPIFDAGYPQYNAVDR 331 (423) T ss_pred cccCceEEEeccccccccCcccccceec Confidence 111111001000 00000 No 198 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=86.85 E-value=0.043 Score=28.09 Aligned_cols=289 Identities=8% Similarity=0.014 Sum_probs=127.2 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE-e Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK-K 408 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 408 (632) +....+..+...... ...+......+..+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-...- . T Consensus 1 M~~~tr~~~~~y~~~---------~A~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-nvv~V~e~~Ge~v~lg 69 (338) T protein:vir:11 1 MRNETRKQFDAYLAQ---------LAKLNGVNSAVQTFAVEPSV-QQKLEQRIQESSEFLKQI-NVYGVDELQGEKIGIG 69 (338) T ss_pred CCHHHHHHHHHHHHH---------HHHHhCCCcccceeeeCHHH-HHHHHHHHHHHHHhhccC-ceecccceeeeEeeec Confidence 111111111111110 01111122223345555554 455667777777665542 33333322222222 2 Q ss_pred cCCccccccc--cCcccccCc-ccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 409 TSGANFYWIG--EDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTG 483 (632) Q Consensus 409 ~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g 483 (632) .+++-+.-+. .+++..... .+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+|.- T Consensus 70 ~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s 149 (338) T protein:vir:11 70 VSGTIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTS 149 (338) T ss_pred cCccccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccccee Confidence 2333333332 222222222 245555666666666667777776432 23566666666666665543333355542 Q ss_pred Cc------cccc------ccee----ccc---------cccccc-cccchhHHHHH----HHHH-HHHhhccccccceEE Q lcl|Aclame:pro 484 LA------NDPV------GLLN----MTG---------VPALTY-PAGGVDWASVV----DMET-KISTFNADAGRLAYL 532 (632) Q Consensus 484 ~~------~~~~------Gil~----~a~---------~~~~~~-~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~ 532 (632) .+ .+|. |.+. ++. ...+.. .+..-+|..|. +++. .+...+++....+.+ T Consensus 150 ~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvi 229 (338) T protein:vir:11 150 AAATTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVI 229 (338) T ss_pred eccCCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEE Confidence 11 1222 2111 110 000000 11112343333 4443 345666666555655 Q ss_pred eehhHHHHHHHHhhcccCCc--------eeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EEEEecccccc Q lcl|Aclame:pro 533 TSVTQRGAAKKAQVFDNTGE--------RIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DLKVDPYTKAA 603 (632) Q Consensus 533 ~~~~~~~~~~~~~~~d~~g~--------~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~~~~~~~~~~ 603 (632) +.......-.. .+-..... .+....++.|+|.+..|.+|.+.+++--++...++...|- +-..-+. -+ T Consensus 230 vG~dLladk~~-~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~--p~ 306 (338) T protein:vir:11 230 LGRELVHDKYF-PMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEV--PE 306 (338) T ss_pred EchhhhHHHHh-HHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEec--cc Confidence 55443322221 22222111 2223467999999999999999999999998877655443 2211111 11 Q ss_pred cCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 604 SDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 604 ~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ++.+.-+-..--|..|-++.++|.+.--. T Consensus 307 r~rie~y~s~Ne~YvVEd~~~~a~ieni~ 335 (338) T protein:vir:11 307 KNRIENYESSNDAYVVEDYGLGCLVENIE 335 (338) T ss_pred cccccchhhhccceeeeccccEEEeecce Confidence 22222222223333444555444444333 No 199 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=84.59 E-value=0.059 Score=27.31 Aligned_cols=307 Identities=11% Similarity=-0.026 Sum_probs=126.3 Q ss_pred hHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHH Q lcl|Aclame:pro 299 IQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEF 378 (632) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i 378 (632) ..+... .....+. ...+. .+ ............+. .......+ .+....|.......++...+ T Consensus 1 ~~~~~~----~~~l~~~-------gi~~~----~~-~~~~~~~~~~~a~d-a~d~~~~~-~t~~~~g~~~~l~~~i~p~~ 62 (336) T protein:vir:10 1 MRDAQR----IQNLARA-------GVILP----RS-VKNVSTPLAEYAMD-AADLSPHL-SSTGSSGIPNYLTTYVDPSV 62 (336) T ss_pred CchHHH----HHHHhcc-------Ceecc----hh-hhhhhHHHHHHHHh-hhhhcccc-ccCCCcchHHHHHhhcCcce Confidence 000000 0000000 00000 00 00000000000000 00000001 11111111222223332333 Q ss_pred HHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh- Q lcl|Aclame:pro 379 IDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS- 455 (632) Q Consensus 379 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~- 455 (632) ++.+.+......+......++ ...+.++.....+.+...+.....|..+..-....-+++.++..+.++.+-+.... T Consensus 63 ~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~ 142 (336) T protein:vir:10 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) T ss_pred eeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHH Confidence 333333322333221111222 12344445555566677777788888888888889999999999999976655432 Q ss_pred --hHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceecccccccccccc--------chhHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 456 --IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAG--------GVDWASVVDMETKISTFNAD 525 (632) Q Consensus 456 --~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~--------~~~~~~i~~~~~~~~~~~~~ 525 (632) +++.+.-.....+++.+.+|...+.|+.. ....|++|+..+....+..+ .--+++|..++..+..+... T Consensus 143 ~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g 221 (336) T protein:vir:10 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) T ss_pred hCCCcHHHHHHHHHHHHHHhhCeEEEEeecc-cceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCC Confidence 35666666667777777777767777653 34578999877653221111 11245666666666655432 Q ss_pred c---c-cceEEeehhHHHHHHHHhhcccCCceeecc--ccccCcceEEcCCCCCccEEEEehhhEEEEEecc---eEEEE Q lcl|Aclame:pro 526 A---G-RLAYLTSVTQRGAAKKAQVFDNTGERIWQN--NEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV---LDLKV 596 (632) Q Consensus 526 ~---~-~~~~~~~~~~~~~~~~~~~~d~~g~~~~~~--~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~---~~~~~ 596 (632) . . ....++.+.....+ .. .+..|.-++.- ...-++.++..+.+... -|+...++..+..+ .++. T Consensus 222 ~i~~~~~~tL~Lp~~~~~~L--~~-~n~~g~tv~~~lk~n~Pnl~i~t~pel~~A---gg~~~~~~~~~~~~~~t~~~~- 294 (336) T protein:vir:10 222 IITQEAVLHMGLPPTAMSDL--SK-TNQYGLSAAAKLKEIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCG- 294 (336) T ss_pred eeeeccceEEEechHHHHhc--cC-CCccCccHHHHHHHhCCccEEEEccccccc---CCceEEEEEecccCCcceeee- Confidence 1 1 22344444444333 22 23333322210 11112334333322210 01111111111110 1110 Q ss_pred eccc------ccccCcEEEEEEEEeCc-EEecccceEEEEec Q lcl|Aclame:pro 597 DPYT------KAASDGLVLRVFQDVDA-GVRRKEAFCIAKKG 631 (632) Q Consensus 597 ~~~~------~~~~~~~~~~~~~r~~~-~v~~~~a~~~~~~~ 631 (632) .++. .............|.+| -+.+|-||++++== T Consensus 295 ~P~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred cChhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 0100 01112233444445544 55788888876555 No 200 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=84.08 E-value=0.063 Score=27.16 Aligned_cols=261 Identities=11% Similarity=0.007 Sum_probs=109.9 Q ss_pred hcccccccccceechhhhhHHHHHHHhhhhhhhhhccee----ec-cCceeEEEEEecCCccccccccCcccccCcccce Q lcl|Aclame:pro 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM----LP-GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFT 431 (632) Q Consensus 357 ~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~ 431 (632) +. + +-..+.....+.+.+...+....++... +. .+...+++|+.+..+-..+---+.-+..+.++.+ T Consensus 1 MA--~------~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~ 72 (299) T protein:vir:79 1 MA--A------LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNA 72 (299) T ss_pred Cc--c------chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcc Confidence 10 0 0012445555666666666555544322 11 2345788888876544443322212333344444 Q ss_pred eeeeeeeeeee-eehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchh Q lcl|Aclame:pro 432 TLSFSPKTIAG-AVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 432 ~~~~~~~t~~~-~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~ 508 (632) .+++.+..=.. .+.|..--...+ .+.+...+.+.....++-.+|...+...-++....| ......+. ...-- T Consensus 73 ~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g----~~~~~~~~-T~~n~ 147 (299) T protein:vir:79 73 WEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALG----NTADTTVL-TTTNV 147 (299) T ss_pred eeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcC----Cccccccc-CHHHH Confidence 43333322111 122221001111 111212222222333334445443332211100000 00001111 11223 Q ss_pred HHHHHHHHHHHHhhccccccceEEeehhHHHHHHHHh----hcccC-Cceee--ccccccCcceEEc--CCCCCc----- Q lcl|Aclame:pro 509 WASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ----VFDNT-GERIW--QNNEVNGYRAEAS--NQIPAD----- 574 (632) Q Consensus 509 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~-g~~~~--~~~~l~G~pv~~~--~~~~~~----- 574 (632) ++.|.++..+|..+..+......+++|.....|.... ..+.. ++... ..+.|.|.+|+.. +.+... T Consensus 148 y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~ 227 (299) T protein:vir:79 148 LEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTT 227 (299) T ss_pred HHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceecc Confidence 6888888999988877766666666666666554321 11111 11111 2256899999763 334321 Q ss_pred -----------cEEEEehhhE-EEEEecceEEEEecccccccCcEEEEEEEEeCcEEeccc--ceEEEEecC Q lcl|Aclame:pro 575 -----------TWIFGDWSQI-VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKE--AFCIAKKGA 632 (632) Q Consensus 575 -----------~~~~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~--a~~~~~~~A 632 (632) .++++..+.. .+.-...+.+ ..|...-.-+ ..+.-+.++|.=|.+.+ ++..-..+| T Consensus 228 G~~~~~~ak~in~ii~~~~a~~~~~K~~~~~~-~~P~~~~~~~-~~~~~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 228 GWKVGAGAKQIFMSLVHPSAIITPVSYQFSKL-DEPTAVTEGK-YFYFEESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred CccccCcccccceEEEcCCeeeeeEeeeeEEe-ecCCCCCccc-eeeeeeeeeeeeeeccccCeEEEEeeec Confidence 2344433322 1222222332 2344332222 23445667777776664 454555555 No 201 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=83.91 E-value=0.065 Score=27.11 Aligned_cols=289 Identities=10% Similarity=0.050 Sum_probs=120.9 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE-e Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK-K 408 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 408 (632) +....+..+....... ..+......+..+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-...- . T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~v~lg 69 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAI---------AKLNGVERVDEKFSVAPSV-QQKLETKVQESSDFLKSI-NFYGVPEQEGEKIGLG 69 (339) T ss_pred CChHHHHHHHHHHHHH---------HHHhCcccccceeeecHHH-HHHHHHHHHHHHHHhccC-cccccccceeeEEeec Confidence 1111111111110000 0111112223334455554 455666666777665542 23333222222222 2 Q ss_pred cCCcccccccc-CcccccCc-ccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 409 TSGANFYWIGE-DEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGL 484 (632) Q Consensus 409 ~~~~~a~~v~E-~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~ 484 (632) .+++-+.-..- +.+..... .+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+|.-. T Consensus 70 ~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~ 149 (339) T protein:vir:79 70 VSGPVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSR 149 (339) T ss_pred cCcceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceee Confidence 23333322211 12222222 345555666666655666777766431 234555555555555544322222455321 Q ss_pred c------cccc------cc------------eecc--ccccccccccchhHHHHH----HHHH-HHHhhccccccceEEe Q lcl|Aclame:pro 485 A------NDPV------GL------------LNMT--GVPALTYPAGGVDWASVV----DMET-KISTFNADAGRLAYLT 533 (632) Q Consensus 485 ~------~~~~------Gi------------l~~a--~~~~~~~~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~~ 533 (632) + .+|. |. +... ..+.+-..+..-+|..|. +++. .+...+++....+.++ T Consensus 150 A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVviv 229 (339) T protein:vir:79 150 AATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVC 229 (339) T ss_pred ecCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 1 1222 21 1100 001111112222344443 4443 3456666665555555 Q ss_pred ehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEEEEeccccccc Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDLKVDPYTKAAS 604 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~~~~~~~~~~~ 604 (632) .......-....+ .....+ +....++.|+|.+..|.+|.+.+++--++...++...| .+-..-+.- ++ T Consensus 230 G~dLla~k~~~l~-n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p--~r 306 (339) T protein:vir:79 230 GRNLLSDKYFPLV-NRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNA--KR 306 (339) T ss_pred chhhhhhHhhhHh-hcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEecc--cc Confidence 4443332222222 222222 33346899999999999999999999999887765444 322211111 11 Q ss_pred CcEEEEEEEEeCcEEecccceEE-----EEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCI-----AKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~-----~~~~A 632 (632) +.+.-+-..--|..|-++.+++. +..+| T Consensus 307 ~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 307 DRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred ccccchhhccceeeeeccccEEEeeeeecccCC Confidence 22221111222233333333333 23334 No 202 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=80.87 E-value=0.091 Score=26.29 Aligned_cols=291 Identities=12% Similarity=0.068 Sum_probs=123.6 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEE Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDI 405 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 405 (632) +.......+.....+-+...... ....+-.+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-.. T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~-----------~~~~~~~Fsv~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~ 67 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGIS-----------VDDVSKKFTVEPSV-TQTLMNTVQASSAFLQMI-NILPVAEMKGEK 67 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCC-----------hhHccceeccCHHH-HHHHHHHHHHHHHHhhcC-ceeccccceeeE Confidence 00000111111111111100000 00112234444444 455667777777665542 333333222222 Q ss_pred EE-ecCCccccccccC--cc-cccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 406 PK-KTSGANFYWIGED--ED-VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAML 479 (632) Q Consensus 406 ~~-~~~~~~a~~v~E~--~~-~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~ 479 (632) .- ..+++-+.-+.=+ .+ .+....+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-| T Consensus 68 i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGf 147 (355) T protein:vir:18 68 IGVGVTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGF 147 (355) T ss_pred EeeccCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcc Confidence 22 2233333333211 12 22223445555666666666666777766432 2355666666666665543333335 Q ss_pred hcCCCc------cccc------ccee----ccccc----------ccc----ccccchhHHHHH----HHHH-HHHhhcc Q lcl|Aclame:pro 480 TGTGLA------NDPV------GLLN----MTGVP----------ALT----YPAGGVDWASVV----DMET-KISTFNA 524 (632) Q Consensus 480 ~g~g~~------~~~~------Gil~----~a~~~----------~~~----~~~~~~~~~~i~----~~~~-~~~~~~~ 524 (632) +|.-.+ .+|. |.+. ++... ... .-+..-+|..|. +++. .+...++ T Consensus 148 NG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~ 227 (355) T protein:vir:18 148 NGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQ 227 (355) T ss_pred cceeeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHh Confidence 554211 1232 2221 11000 000 001122344343 4443 3456666 Q ss_pred ccccceEEeehhHHHHHHHHhhcccCCc--------eeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EE- Q lcl|Aclame:pro 525 DAGRLAYLTSVTQRGAAKKAQVFDNTGE--------RIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DL- 594 (632) Q Consensus 525 ~~~~~~~~~~~~~~~~~~~~~~~d~~g~--------~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~- 594 (632) +....+.++.......-.. .+-...+. .+....++.|+|.+..|.+|.+.+++--++...++...|- +- T Consensus 228 ~d~dLVvivG~dLla~k~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~ 306 (355) T protein:vir:18 228 DDPKLVAIVGRKLLADKYF-PLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRS 306 (355) T ss_pred cCCCEEEEEchhhhHHHHh-HHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEE Confidence 6656665555443322222 22222222 2233468999999999999999999998988777654443 22 Q ss_pred -EEeccc----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 595 -KVDPYT----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 595 -~~~~~~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ...+.. .+..--..|.++-+--++.++ .+.+.+.++ T Consensus 307 ~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ie--ni~~~~~~~ 347 (355) T protein:vir:18 307 IDENPKKDRVENYESMNIDYVVEAYAAGCLLE--NITLGDFTA 347 (355) T ss_pred EEeccccccccchhhhcceeeeeccccEEEEe--eeeecCCCC Confidence 122211 122222244444333333333 333333222 No 203 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=80.17 E-value=0.097 Score=26.15 Aligned_cols=331 Identities=9% Similarity=-0.029 Sum_probs=122.9 Q ss_pred HhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhh-hhhhhhh Q lcl|Aclame:pro 270 NPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKE-ARGFYMP 348 (632) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 348 (632) ....+..+........ . ...+.. ..............+++..--. ....... T Consensus 1 ~~~~~~~~~~~~~~~~----------------------~----~~~~~~-~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~ 53 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSV----------------------R----AFDMAN-GKADYRLTDMAVRELKKFGLVFDHATVKRQ 53 (388) T ss_pred CCCccceeeecCCccc----------------------c----hhhhhc-CCcceeeechhhHhhhhcceeccCccchhh Confidence 0000000000000000 0 000000 0000000000000011100000 0000000 Q ss_pred hh--------HHhhhhhc--cccccccc-ceechhhhhHHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEecCCcccc Q lcl|Aclame:pro 349 HE--------VLVQRQLE--KKTAGKGG-ELVATELLSEEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTSGANFY 415 (632) Q Consensus 349 ~~--------~~~~~a~~--~~~~~~~~-~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~a~ 415 (632) .. ....-+.. ..+..+.| +....++....+++.+.+.-....+......++ ...+.+......+.+. T Consensus 54 ~~~~~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~ 133 (388) T protein:vir:99 54 IELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAM 133 (388) T ss_pred hhhhhhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEE Confidence 00 00000110 11111111 111222223333333333333233321111121 2245555666667778 Q ss_pred ccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC--cccccc Q lcl|Aclame:pro 416 WIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGL--ANDPVG 490 (632) Q Consensus 416 ~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~--~~~~~G 490 (632) ..+-+...|..+..-...+-.++.++..+.++.+-+... .+++.+.-...-.+++.+.+|+..|+|... ....-| T Consensus 134 ~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yG 213 (388) T protein:vir:99 134 EYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFG 213 (388) T ss_pred EeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEE Confidence 888888888887777777777788888788876544432 245667777777888888888888888532 224568 Q ss_pred ceecccccccccc--------ccchh----HHHHHHHHHHHHhhcccc---c--cceEEeehhHHHHHHHHhhcccCCce Q lcl|Aclame:pro 491 LLNMTGVPALTYP--------AGGVD----WASVVDMETKISTFNADA---G--RLAYLTSVTQRGAAKKAQVFDNTGER 553 (632) Q Consensus 491 il~~a~~~~~~~~--------~~~~~----~~~i~~~~~~~~~~~~~~---~--~~~~~~~~~~~~~~~~~~~~d~~g~~ 553 (632) ++|+..+.....+ .+.-+ +++|..++..+..+.... . ....++.+.....+ .. .+..|.- T Consensus 214 llNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~L--s~-~n~~g~T 290 (388) T protein:vir:99 214 FLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDML--SV-VTDLGIS 290 (388) T ss_pred EeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhc--cc-cCcCCcc Confidence 9988765432111 11112 345566666665554321 1 11233444333333 21 2222322 Q ss_pred eecc--ccccCcceEEcCCCC------CccE--EEEehhh-EEEE-EecceEEE-Eecccccc-------cCcEEEEEEE Q lcl|Aclame:pro 554 IWQN--NEVNGYRAEASNQIP------ADTW--IFGDWSQ-IVIA-MWGVLDLK-VDPYTKAA-------SDGLVLRVFQ 613 (632) Q Consensus 554 ~~~~--~~l~G~pv~~~~~~~------~~~~--~~gd~s~-~~~~-~~~~~~~~-~~~~~~~~-------~~~~~~~~~~ 613 (632) ++.- ....++.++..+.+. .+.. ++.+--. .... ..+..... ..+. .|. .......... T Consensus 291 vl~~lk~n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~-~~~~l~vq~~~~~~~~~~~~ 369 (388) T protein:vir:99 291 VRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS-KFVTLGVEKRVKNYVEAYSN 369 (388) T ss_pred HHHHHHHhcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEeccc-ccccccceecCceeEecccc Confidence 2210 112223333322211 1111 1111000 0000 00000000 0011 111 1112222223 Q ss_pred E-eCcEEecccceEEEEec Q lcl|Aclame:pro 614 D-VDAGVRRKEAFCIAKKG 631 (632) Q Consensus 614 r-~~~~v~~~~a~~~~~~~ 631 (632) | .|+-+.+|.||+.++== T Consensus 370 rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 370 ATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred ceeeeEEeccchhheeccC Confidence 3 35567889998886655 No 204 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=79.16 E-value=0.11 Score=25.90 Aligned_cols=295 Identities=10% Similarity=0.005 Sum_probs=122.1 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEE Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDI 405 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 405 (632) +...+....+..+.......... .... ....+-.+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-.. T Consensus 1 m~~~M~~~tr~~~~~y~~~~A~~------ngv~-~~~~~~~Fsv~p~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~ 71 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALAKA------YGID-ISKLDKQFSVTGPV-ETTLRSALLASVEFLGLI-TCLDVDQIKGQV 71 (358) T ss_pred CcccccHHHHHHHHHHHHHHHHH------hCCC-hhHccceeeeChHH-HHHHHHHHHHHHHHhhcC-cccccccceeeE Confidence 11112222222221111110000 0000 01112234455554 345667777777665542 233332222222 Q ss_pred EE-ecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcCh-----hHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 406 PK-KTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSS-----IHVENLIREDLIEGIGVALDLAML 479 (632) Q Consensus 406 ~~-~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~-----~~~~~~i~~~l~~a~a~~~~~~~~ 479 (632) .. ..+++-+.-..- -.+....+.+.-.|.....---..|+.+.|..-. .++...+...+.+.++.-.=..-| T Consensus 72 v~lg~~g~iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGf 149 (358) T protein:vir:78 72 VQVGVGQLYTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGW 149 (358) T ss_pred EeecCCcccceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecc Confidence 22 223333332221 1222334455556666665556667777664321 135555656565555543222224 Q ss_pred hcCCCc------cccc------ccee----cccc---------ccc-cccccchhHHHHH----HHH-HHHHhhcccccc Q lcl|Aclame:pro 480 TGTGLA------NDPV------GLLN----MTGV---------PAL-TYPAGGVDWASVV----DME-TKISTFNADAGR 528 (632) Q Consensus 480 ~g~g~~------~~~~------Gil~----~a~~---------~~~-~~~~~~~~~~~i~----~~~-~~~~~~~~~~~~ 528 (632) +|.-.+ .+|. |.+. ++.. ..+ -..+..-+|..|. +++ ..+...+++... T Consensus 150 NGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~d 229 (358) T protein:vir:78 150 NGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPR 229 (358) T ss_pred cceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCC Confidence 553211 1222 2111 1100 000 0111122343333 343 345566666655 Q ss_pred ceEEeehhHHHHHHHHhhcccCCce---eec---cccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEEE--Eecc Q lcl|Aclame:pro 529 LAYLTSVTQRGAAKKAQVFDNTGER---IWQ---NNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDLK--VDPY 599 (632) Q Consensus 529 ~~~~~~~~~~~~~~~~~~~d~~g~~---~~~---~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~~--~~~~ 599 (632) .+.++.......-....+ ...+.+ +.. ..++.|+|.+..+.+|.+.+++--++...++...| .+-. ..+. T Consensus 230 LVvivG~dLla~k~~~l~-n~~~~pTE~~Aa~~i~k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~ 308 (358) T protein:vir:78 230 LVVLVGTDLVAAAQAKLY-SEATKPSEQIAAQQLAKSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQD 308 (358) T ss_pred EEEEEchhhhhHHhhhHh-hcCCCcHHHHHHHHHHHHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccc Confidence 555554443332222222 222222 110 15789999999999999999999898877765443 3221 2221 Q ss_pred c----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 600 T----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 600 ~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . .+..--..|.++-+--++.++.-.|......| T Consensus 309 r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa 345 (358) T protein:vir:78 309 SKSFDNQYWRMEGYALGEHKAYGGFEEADIEIGADPA 345 (358) T ss_pred cccccchhhhcceeeeeccccEEEEeeeeeeeCCCCC Confidence 1 11111123444433333444433333332222 No 205 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=77.00 E-value=0.13 Score=25.45 Aligned_cols=316 Identities=10% Similarity=-0.004 Sum_probs=131.5 Q ss_pred HhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 270 NPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPH 349 (632) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (632) ....+..+......... ... ....... ......+. +.+....... T Consensus 1 ~~~~~~~~~~~~~~~~~----------------------------~~~-~~~~~~~--~~~~~~l~----~~gi~~~~~~ 45 (379) T protein:vir:10 1 MPQISKIHSSLNARQMT----------------------------QMV-MDSADVT--LDNLKHLE----SYGIHLNGRK 45 (379) T ss_pred CCCcceeeeecCccccc----------------------------hhh-hcccccc--HHHHHHHH----hcCccccchh Confidence 00000000000000000 000 0000000 00000000 0000000000 Q ss_pred hH---Hhhhhhc--------------ccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccC--ceeEEEEEecC Q lcl|Aclame:pro 350 EV---LVQRQLE--------------KKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGL--VGDVDIPKKTS 410 (632) Q Consensus 350 ~~---~~~~a~~--------------~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 410 (632) .. ....++. .....+|.......++ ..+++.+........+......++ ...+.++.... T Consensus 46 ~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~-p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~ 124 (379) T protein:vir:10 46 NKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWL-PGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEG 124 (379) T ss_pred hhhhhhhhhhhccccccccccccCccccccccchHHHHHhhc-chHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeee Confidence 00 0000110 0011122223333344 345555544444444322222222 13455556666 Q ss_pred CccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC---hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC-cc Q lcl|Aclame:pro 411 GANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---SIHVENLIREDLIEGIGVALDLAMLTGTGL-AN 486 (632) Q Consensus 411 ~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~-~~ 486 (632) .+.+..++-+...|..+.......-.++.|+..+.++.+-+... -+++.+.-.....+++.+.+|+..|+|... +. T Consensus 125 ~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~ 204 (379) T protein:vir:10 125 LGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSG 204 (379) T ss_pred eeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCc Confidence 67788888887888877777777777788888888776544432 346777778888888888888888999643 34 Q ss_pred ccccceeccccccccc---c------ccchh----HHHHHHHHHHHHhhcccc-----ccceEEeehhHHHHHHHHhhcc Q lcl|Aclame:pro 487 DPVGLLNMTGVPALTY---P------AGGVD----WASVVDMETKISTFNADA-----GRLAYLTSVTQRGAAKKAQVFD 548 (632) Q Consensus 487 ~~~Gil~~a~~~~~~~---~------~~~~~----~~~i~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~d 548 (632) ...|++|+..+..... . ...-+ +++|..++..+..+.... .....++.+.....+. . .+ T Consensus 205 ~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~--~-~n 281 (379) T protein:vir:10 205 RTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYIT--T-PT 281 (379) T ss_pred ceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhc--c-cc Confidence 4569999887643211 0 11112 244555555555443321 1113444444443332 1 22 Q ss_pred cCCceeec--cccccCcceEEcCCCCC----cc--EEEEehhhEEEEEecceEE-------EEeccc------ccccCcE Q lcl|Aclame:pro 549 NTGERIWQ--NNEVNGYRAEASNQIPA----DT--WIFGDWSQIVIAMWGVLDL-------KVDPYT------KAASDGL 607 (632) Q Consensus 549 ~~g~~~~~--~~~l~G~pv~~~~~~~~----~~--~~~gd~s~~~~~~~~~~~~-------~~~~~~------~~~~~~~ 607 (632) ..|.-++. ...+.++.++..+.+.. ++ +++.+- ..+... ...+.. ....... T Consensus 282 ~~g~Tvl~~lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~~-------~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~ 354 (379) T protein:vir:10 282 ELGYSVAQYMRESYPNVTFVSAPELNDANGGSSAIYYYADA-------VENNGTDDGRTWLQVVPTKMFTLGVEKKIKGY 354 (379) T ss_pred ccCccHHHHHHHhcCCcEEEEcccccccCCCccEEEEEeec-------cCCCccCCcceEEEecchhhhhccceecCcee Confidence 22332222 11122333443333221 11 122210 011100 001110 0011222 Q ss_pred EEEEEEEe-CcEEecccceEEEEec Q lcl|Aclame:pro 608 VLRVFQDV-DAGVRRKEAFCIAKKG 631 (632) Q Consensus 608 ~~~~~~r~-~~~v~~~~a~~~~~~~ 631 (632) ......|. |+-+.+|.||+.+.=+ T Consensus 355 ~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 355 AEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred EeccccceeeeeeecchhhheecCC Confidence 33334444 5567899999999888 No 206 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=76.10 E-value=0.14 Score=25.28 Aligned_cols=288 Identities=10% Similarity=0.032 Sum_probs=124.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe- Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK- 408 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 408 (632) +....+..+....... ..+......+..+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-...-. T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-nvv~V~e~~Ge~v~lg 69 (337) T protein:vir:10 1 MRKETRQAYEKYAAQI---------AKLNDTGDVSKKFAVEPTV-QQRLETKMQESSEFLKRI-NVLPVTELEGEKLGLS 69 (337) T ss_pred CChHHHHHHHHHHHHH---------HHhcChhhhcceeeecHHH-HHHHHHHHHHHHHhhccC-ceeccccceeeEEeec Confidence 1111111111110000 0011111223334455544 455667777777665542 233333222222222 Q ss_pred cCCccccccccCc-c-cccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 409 TSGANFYWIGEDE-D-VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGL 484 (632) Q Consensus 409 ~~~~~a~~v~E~~-~-~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~ 484 (632) .+++-+.-..-+. + .|..-.+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+|.-. T Consensus 70 ~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~ 149 (337) T protein:vir:10 70 VSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) T ss_pred cCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceee Confidence 2333332222111 1 12223455566666666666667777776432 235666666666666655333333555421 Q ss_pred c------cccc------ccee----cccc----------ccccccccchhHHHHH----HHHH-HHHhhccccccceEEe Q lcl|Aclame:pro 485 A------NDPV------GLLN----MTGV----------PALTYPAGGVDWASVV----DMET-KISTFNADAGRLAYLT 533 (632) Q Consensus 485 ~------~~~~------Gil~----~a~~----------~~~~~~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~~ 533 (632) + .+|. |.+. ++.. +.+.. +..-+|..|. +++. .+...+++....+.++ T Consensus 150 A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~i-G~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVviv 228 (337) T protein:vir:10 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) T ss_pred ccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceee-cCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 1 1222 2111 1110 00110 1112344433 4454 3466667665566555 Q ss_pred ehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EEEEeccccccc Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DLKVDPYTKAAS 604 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~~~~~~~~~~~ 604 (632) .......-.. .+-.....+ +....++.|+|.+..|.+|.+.+++--++...++...|- +-..-+.. ++ T Consensus 229 G~dLladk~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--~r 305 (337) T protein:vir:10 229 GRELLHDKYF-PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ER 305 (337) T ss_pred chhhhhHHhh-HHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcc--cc Confidence 5443332222 222222222 233467999999999999999999999998877655443 22111111 12 Q ss_pred CcEEEEEEEEeCcEEecccceEE---EEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCI---AKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~---~~~~A 632 (632) +.+.-+-..--|..|-++.++|. ++++. T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~ 336 (337) T protein:vir:10 306 DRIENYESSNDAYVVEDFGCGCVAENIELAA 336 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecC Confidence 22222222222334444444433 22332 No 207 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=75.12 E-value=0.15 Score=25.10 Aligned_cols=289 Identities=11% Similarity=0.056 Sum_probs=123.5 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccc--cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKT--AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK 407 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (632) +....+..+....... ..+.... ..+-.+.|.|.+ ...+...+.+.+.+.+.. .+++.+...-...- T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~i~ 69 (355) T protein:vir:98 1 MRPETRFKFNAYLTRV---------AELNNISTDDVSKKFTVEPSV-TQTLMNTVQASSAFLKTI-NILPVAEMKGEKIG 69 (355) T ss_pred CChHHHHHHHHHHHHH---------HHHhCCChhHccceeecCHHH-HHHHHHHHHHHHHHhhcC-ceeccccceeeEee Confidence 1111111111110000 0011000 112234455544 345677777777666542 33333322222222 Q ss_pred -ecCCccccccccC--ccc-ccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 408 -KTSGANFYWIGED--EDV-QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 408 -~~~~~~a~~v~E~--~~~-~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) ..+++-+.-+.=+ .+. +..-.+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+| T Consensus 70 lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG 149 (355) T protein:vir:98 70 VGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNG 149 (355) T ss_pred eccCccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccc Confidence 2233333333211 121 2222345555666666666666777766432 235566666666666554333333555 Q ss_pred CCCc------cccc------ccee----cccccc----------cc----ccccchhHHHHH----HHHHH-HHhhcccc Q lcl|Aclame:pro 482 TGLA------NDPV------GLLN----MTGVPA----------LT----YPAGGVDWASVV----DMETK-ISTFNADA 526 (632) Q Consensus 482 ~g~~------~~~~------Gil~----~a~~~~----------~~----~~~~~~~~~~i~----~~~~~-~~~~~~~~ 526 (632) .-.+ .+|. |.+. ++.... +. .-+..-+|..|. +++.. +...+++. T Consensus 150 ~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d 229 (355) T protein:vir:98 150 TTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDD 229 (355) T ss_pred eeeeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcC Confidence 4211 1232 2221 110000 00 001122343333 44443 45666666 Q ss_pred ccceEEeehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EE--E Q lcl|Aclame:pro 527 GRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DL--K 595 (632) Q Consensus 527 ~~~~~~~~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~--~ 595 (632) ...+.++.......-.. .+-.....+ +....++.|+|.+..|.+|.+.+++--++...++...|- +- . T Consensus 230 ~dLVvivG~dLla~k~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~ 308 (355) T protein:vir:98 230 PNLVAIVGRKLLADKYF-PLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSID 308 (355) T ss_pred CCEEEEEchhhhHHHhh-hHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 56665555443322222 222222222 333468999999999999999999998988777654443 21 1 Q ss_pred Eecc----cccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPY----TKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~----~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+. ..+..--..|.++-+--++.++ .+.+.+..+ T Consensus 309 d~p~r~rie~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~ 347 (355) T protein:vir:98 309 ENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTA 347 (355) T ss_pred eccccccccchhhhcceeeeeccccEEEee--ceeeeCCCC Confidence 2221 1122222244444333333333 333332222 No 208 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=74.89 E-value=0.15 Score=25.05 Aligned_cols=288 Identities=10% Similarity=0.031 Sum_probs=124.0 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe- Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK- 408 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 408 (632) +....+..+....... ..+......+-.+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-...-. T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-nvv~V~e~~Ge~v~lg 69 (337) T protein:vir:79 1 MRKETRQAYEKYAAQI---------AKLNDTGDVSKKFAVEPTV-QQRLETKMQESSEFLKRI-NVLPVTELEGEKLGLS 69 (337) T ss_pred CChHHHHHHHHHHHHH---------HHhcChhhhcceeeecHHH-HHHHHHHHHHHHHhhccC-ceeccccceeeEEeec Confidence 1111111111110000 0011111222234455544 455667777777665542 233333222222222 Q ss_pred cCCccccccccCc-c-cccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 409 TSGANFYWIGEDE-D-VQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGL 484 (632) Q Consensus 409 ~~~~~a~~v~E~~-~-~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~ 484 (632) .+++-+.-..-+. + .|..-.+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+|.-. T Consensus 70 ~~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~ 149 (337) T protein:vir:79 70 VSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) T ss_pred cCcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceee Confidence 2333333222111 1 12223455566666666666667777776432 235666666666666554333333555421 Q ss_pred c------cccc------ccee----cccc----------ccccccccchhHHHHH----HHHH-HHHhhccccccceEEe Q lcl|Aclame:pro 485 A------NDPV------GLLN----MTGV----------PALTYPAGGVDWASVV----DMET-KISTFNADAGRLAYLT 533 (632) Q Consensus 485 ~------~~~~------Gil~----~a~~----------~~~~~~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~~ 533 (632) + .+|. |.+. ++.. +.+.. +..-+|..|. +++. .+...+++....+.++ T Consensus 150 A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~i-G~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVviv 228 (337) T protein:vir:79 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAIC 228 (337) T ss_pred ccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceee-cCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 1 1222 2111 1110 00100 1112344433 4444 3466667665666555 Q ss_pred ehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EEEEeccccccc Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DLKVDPYTKAAS 604 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~~~~~~~~~~~ 604 (632) .......-.. .+-.....+ +....++.|+|.+..|.+|.+.+++--++...++...|- +-..-+.- ++ T Consensus 229 G~dLladk~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--~r 305 (337) T protein:vir:79 229 GRELLHDKYF-PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ER 305 (337) T ss_pred chhhhhHHhh-HHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEcc--cc Confidence 5443332222 222222222 233467999999999999999999999998877655443 22111111 11 Q ss_pred CcEEEEEEEEeCcEEecccceEE---EEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCI---AKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~---~~~~A 632 (632) +.+.-+-..--|..|-++.++|. ++++. T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~ 336 (337) T protein:vir:79 306 DRIENYESSNDAYVVEDFGCGCVAENIELAA 336 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecC Confidence 22222222222333444444333 22332 No 209 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=72.66 E-value=0.18 Score=24.67 Aligned_cols=288 Identities=10% Similarity=0.027 Sum_probs=120.3 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEEe- Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKK- 408 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 408 (632) +....+..+........ .+......+..+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-...-. T Consensus 1 M~~~tr~~~~~y~~~~A---------~~ngv~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~v~lg 69 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIA---------KLNDTGDVSKKFAVEPTV-QQRLETKMQESSEFLKRI-NVLPVTELEGEKLGLS 69 (337) T ss_pred CChHHHHHHHHHHHHHH---------HhcChhhhcceeecChHH-HHHHHHHHHHHHHHhccC-CccccccceeeEEecc Confidence 11111111111100000 011111222334455554 455666666767665542 233332222222222 Q ss_pred cCCccccccccC-ccc-ccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|Aclame:pro 409 TSGANFYWIGED-EDV-QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTGTGL 484 (632) Q Consensus 409 ~~~~~a~~v~E~-~~~-~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~ 484 (632) .+++-+.-..-+ .+. |....+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+|.-. T Consensus 70 ~~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~ 149 (337) T protein:vir:78 70 VSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) T ss_pred cCcceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceee Confidence 223333222211 111 1222345555666665555566777766431 234555555555555544322222455321 Q ss_pred c------cccc------ccee----cccc----------ccccccccchhHHHHH----HHHH-HHHhhccccccceEEe Q lcl|Aclame:pro 485 A------NDPV------GLLN----MTGV----------PALTYPAGGVDWASVV----DMET-KISTFNADAGRLAYLT 533 (632) Q Consensus 485 ~------~~~~------Gil~----~a~~----------~~~~~~~~~~~~~~i~----~~~~-~~~~~~~~~~~~~~~~ 533 (632) + .+|. |.+. ++.. +.+.. +..-+|..|. +++. .+...+++....+.++ T Consensus 150 A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~i-G~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVviv 228 (337) T protein:vir:78 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLI-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) T ss_pred ccCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceee-cCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 1 1222 2110 1110 00100 1122343333 4454 3466667665566555 Q ss_pred ehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEEEEeccccccc Q lcl|Aclame:pro 534 SVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDLKVDPYTKAAS 604 (632) Q Consensus 534 ~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~~~~~~~~~~~ 604 (632) .......-.. .+-...+.+ +....++.|+|.+..|.+|.+.+++--++...++...| .+-..-+.- ++ T Consensus 229 G~dLladk~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p--~r 305 (337) T protein:vir:78 229 GRELLHDKYF-PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ER 305 (337) T ss_pred chhhhHHHHH-HHHhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEecc--cc Confidence 5444332222 222222222 33346899999999999999999999999877765444 322211111 12 Q ss_pred CcEEEEEEEEeCcEEecccceEE---EEecC Q lcl|Aclame:pro 605 DGLVLRVFQDVDAGVRRKEAFCI---AKKGA 632 (632) Q Consensus 605 ~~~~~~~~~r~~~~v~~~~a~~~---~~~~A 632 (632) +.+.-+-..--|..|-++.++|. ++++. T Consensus 306 ~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~ 336 (337) T protein:vir:78 306 DRIENYESSNDAYVVEDFGCGCVAENIELAA 336 (337) T ss_pred ccccchhhccceeeeeccccEEEEeceeecC Confidence 22222222222334444444433 22332 No 210 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=70.68 E-value=0.21 Score=24.35 Aligned_cols=291 Identities=12% Similarity=0.046 Sum_probs=120.4 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccc--cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKT--AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK 407 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (632) +....+..+........ .+.... ..+-.+.|.|.+ ...+...+.+.+.+.+.. .+++.+...-...- T Consensus 1 M~~~tr~~~~~y~~~~A---------~~ngv~~~d~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~i~ 69 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVA---------ELNGIDAGDVSKKFTVEPSV-TQTLMNTMQESSDFLTRI-NIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHHHH---------HHhCCChHHhcceeecCHHH-HHHHHHHHHHHHHHhccC-CccccccceeeEEe Confidence 11111111111111000 011000 112233444444 455666666767665542 23333322222222 Q ss_pred e-cCCccccccc--cCcccccCc-ccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 408 K-TSGANFYWIG--EDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 408 ~-~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) . .+++-+.-+. -+.+..... .+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+| T Consensus 70 lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNG 149 (357) T protein:vir:60 70 IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNG 149 (357) T ss_pred cccCcccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccc Confidence 2 2333333321 111222222 345555666666655666777766431 124455555555555443222222455 Q ss_pred CCCc------cccc------cce----ecccc----------ccc----cccccchhHHHHH----HHHHH-HHhhcccc Q lcl|Aclame:pro 482 TGLA------NDPV------GLL----NMTGV----------PAL----TYPAGGVDWASVV----DMETK-ISTFNADA 526 (632) Q Consensus 482 ~g~~------~~~~------Gil----~~a~~----------~~~----~~~~~~~~~~~i~----~~~~~-~~~~~~~~ 526 (632) .-.+ .+|. |.+ .++.. +.. -.-+..-+|..|. +++.. +...+++. T Consensus 150 ts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d 229 (357) T protein:vir:60 150 VRRAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQED 229 (357) T ss_pred eeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcC Confidence 3211 1222 221 11100 000 0001112344333 44543 46666766 Q ss_pred ccceEEeehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEE--E Q lcl|Aclame:pro 527 GRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDL--K 595 (632) Q Consensus 527 ~~~~~~~~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~--~ 595 (632) ...+.++.......-....+ ...+.+ +....++.|+|.+..|.+|.+.+++--++...++...| .+- . T Consensus 230 ~dLVvivG~dLla~k~~~l~-n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:60 230 PDLVVIVGRQLLADKYFPIV-NREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred CCEEEEEchhhhhHHhhhHh-hcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 56665555444332222222 222222 23346799999999999999999999888877765443 222 1 Q ss_pred Eeccc----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPYT----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+.. .+..--..|.++-+--++.++.-.|...+..| T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa 349 (357) T protein:vir:60 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEeeeeeeccCcccc Confidence 22211 12211223444433333333322222222222 No 211 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=69.31 E-value=0.22 Score=24.14 Aligned_cols=291 Identities=12% Similarity=0.055 Sum_probs=119.6 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccc--cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKT--AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK 407 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (632) +....+..+........ .+.... ..+-.+.|.|.+ ...+...+.+.+.+.+.. .+++.+...-...- T Consensus 1 M~~~tr~~~~~y~~~~A---------~~ngv~~~d~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~i~ 69 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVA---------ELNGIDAGDVSKKFTVEPSV-TQTLMNTMQESSDFLTRI-NIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHHHH---------HHhCCChHHhcceeecCHHH-HHHHHHHHHHHHHHhccC-CccccccceeeEEe Confidence 11111111111111000 011000 112234444444 455666666767665542 33333322222222 Q ss_pred e-cCCccccccc--cCcccccCc-ccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 408 K-TSGANFYWIG--EDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 408 ~-~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) . .+++-+.-+. -+.+..... .+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+| T Consensus 70 lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNG 149 (357) T protein:vir:56 70 IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNG 149 (357) T ss_pred cccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccc Confidence 2 2333333321 111222222 345555666666655666777766431 124455555555555443222222455 Q ss_pred CCCc------cccc------cce----ecccc----------ccc----cccccchhHHHHH----HHHHH-HHhhcccc Q lcl|Aclame:pro 482 TGLA------NDPV------GLL----NMTGV----------PAL----TYPAGGVDWASVV----DMETK-ISTFNADA 526 (632) Q Consensus 482 ~g~~------~~~~------Gil----~~a~~----------~~~----~~~~~~~~~~~i~----~~~~~-~~~~~~~~ 526 (632) .-.+ .+|. |.+ .++.. +.. -.-+..-+|..|. +++.. +...+++. T Consensus 150 ts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d 229 (357) T protein:vir:56 150 VKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQED 229 (357) T ss_pred eeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcC Confidence 3211 1222 221 11100 000 0001112344433 44543 46666766 Q ss_pred ccceEEeehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEE--E Q lcl|Aclame:pro 527 GRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDL--K 595 (632) Q Consensus 527 ~~~~~~~~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~--~ 595 (632) ...+.++.......-.. .+-+..+.+ +....++.|+|.+..+.+|.+.+++--++...++...| .+- . T Consensus 230 ~dLVvivG~dLla~k~~-~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:56 230 PDLVVIVGRQLLADKYF-PIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred CCEEEEEchhhhhhhhh-hHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 55555554443332222 222222222 22346799999999999999999999888877765443 222 1 Q ss_pred Eeccc----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPYT----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+.. .+..--..|.++-+--++.++.-.|......| T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~ 349 (357) T protein:vir:56 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEeeeeeeccCCCCc Confidence 22211 12111123444433333333322222222222 No 212 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=66.10 E-value=0.27 Score=23.68 Aligned_cols=297 Identities=13% Similarity=0.043 Sum_probs=101.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhccc-ccccccceechhhhhHHHHHHHhhhhh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKK-TAGKGGELVATELLSEEFIDILRNKAI 387 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~~~~i~~~~~~~~i~~~~~~~~~ 387 (632) +..... ......+..+.+...... .....+.+..-...+ +.++....+|..+ ...|...+....+ T Consensus 1 mtn~ie------------sq~A~~eF~~vL~~N~G~-S~~k~AW~A~L~E~GVtiTD~~~~LP~~l-v~sI~~A~~n~n~ 66 (318) T protein:vir:86 1 MTNFIE------------SQNAVTEFFDVLKKNSGK-SEIKNAWNAKLAENGVTITDTTFQLPRKL-VESINTALLNTNP 66 (318) T ss_pred Ccchhh------------hhHHHHHHHHHHhccCCc-hhhhhhhhhhhhhcCceeeccchhccHHH-HHHHHHhhhccCc Confidence 000000 000000011111110000 011111111111111 1122223344333 3334444444444 Q ss_pred hhhhcceeeccCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhc---ChhHHHHHHHH Q lcl|Aclame:pro 388 IGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ---SSIHVENLIRE 464 (632) Q Consensus 388 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d---~~~~~~~~i~~ 464 (632) +.+.. .++....-.......+...+..-..|..+++....|.--++.+.-+.....+ -++..+ +...++.+|.. T Consensus 67 v~~vf--HVT~~~~~~V~~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ 143 (318) T protein:vir:86 67 VFKVF--HVTNVGALLVSRSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVA 143 (318) T ss_pred ceeee--eeccchhhhhhhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHH Confidence 44321 1222211111111112223333334445555555554444444333333333 233333 33356899999 Q ss_pred HHHHHHH-HHHHHHHhhcCCCccccccceeccccccc------ccc-ccchhHHHHHHHHHHHHhhccccccceEEeehh Q lcl|Aclame:pro 465 DLIEGIG-VALDLAMLTGTGLANDPVGLLNMTGVPAL------TYP-AGGVDWASVVDMETKISTFNADAGRLAYLTSVT 536 (632) Q Consensus 465 ~l~~a~a-~~~~~~~~~g~g~~~~~~Gil~~a~~~~~------~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 536 (632) .+.+++- +..+.+++-|+|.+. .+.+-.-+.+..+ +.+ +..+-...|..+..- .++.....|++... T Consensus 144 ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdf----vrptagrrylivka 218 (318) T protein:vir:86 144 ELTQAIVNKIVDLALVEGDGSNG-FKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDF----VRPTAGRRYLIVKA 218 (318) T ss_pred HHHHHHHHHHHHhhheeecCCCC-ccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhh----hccCCCceEEEEee Confidence 9999998 778889999998764 1111111222111 111 111222223333332 23333345555544 Q ss_pred HHHHHHHHhhcccCCce---eecccc-c---cCc-ceEEcCCC-CCccEEEEehhhEEEEEecceEEEEecccccccCcE Q lcl|Aclame:pro 537 QRGAAKKAQVFDNTGER---IWQNNE-V---NGY-RAEASNQI-PADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGL 607 (632) Q Consensus 537 ~~~~~~~~~~~d~~g~~---~~~~~~-l---~G~-pv~~~~~~-~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~ 607 (632) ......+..+...+... +-++++ + -|. .+++-... .-..-++.|-+ |.+-+. .+..-+-..+.++.- T Consensus 219 edrkalldelrqatanahvriknddteiasevgvdeiivytgskalkptvlvdqk-yhidmq---dltkvdafewktnsn 294 (318) T protein:vir:86 219 EDRKALLDELRQATANAHVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ---DLTKVDAFEWKTNSN 294 (318) T ss_pred cchHHHHHHHHhhcccceeEEeccchhhhhhcCcceeeeeeccccccceeeeccc-eecchh---hhhhhhcceeccCCc Confidence 43333344444443322 222221 1 122 12221111 11111222211 111000 000011112223333 Q ss_pred EEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 608 VLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 608 ~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) -+.++.--.+-|--..|-..+++. T Consensus 295 milvetltsghvetynagavitvs 318 (318) T protein:vir:86 295 MILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred eEEEeecccCcceeecCceeEEeC Confidence 344444444444333443444444 No 213 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=65.43 E-value=0.28 Score=23.58 Aligned_cols=288 Identities=10% Similarity=0.047 Sum_probs=121.5 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccc----cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEE Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKT----AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDI 405 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~----~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 405 (632) +....+..+....... ..+.... +.+--+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-.. T Consensus 1 M~~~tr~~~~~y~~~~---------A~~ngv~~~~~~~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~ 69 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQ---------AELNNLPFNALATGIKFTVQPSV-QQKLYEKVRESSDFLKSI-SFVFVDEQTGET 69 (342) T ss_pred CChHHHHHHHHHHHHH---------HHHhCCChhHccccceeecChHH-HHHHHHHHHHHHHHhccC-cccccccceeeE Confidence 1111111111110000 0011111 111123444444 455666676777665542 233333222222 Q ss_pred EE-ecCCccccccccC--ccccc-CcccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 406 PK-KTSGANFYWIGED--EDVQD-SDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAML 479 (632) Q Consensus 406 ~~-~~~~~~a~~v~E~--~~~~~-~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~ 479 (632) .- ..+++-+.-+.=+ ++... .-.+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-| T Consensus 70 i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGf 149 (342) T protein:vir:10 70 LGLDSAHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGF 149 (342) T ss_pred EecccCcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecc Confidence 22 2233333333211 12222 22355566666666666666777766431 2345555555555555443222224 Q ss_pred hcCCCc------cccc------ccee----ccc---------cccccccccchhHHHHH----HHHHH-HHhhccccccc Q lcl|Aclame:pro 480 TGTGLA------NDPV------GLLN----MTG---------VPALTYPAGGVDWASVV----DMETK-ISTFNADAGRL 529 (632) Q Consensus 480 ~g~g~~------~~~~------Gil~----~a~---------~~~~~~~~~~~~~~~i~----~~~~~-~~~~~~~~~~~ 529 (632) +|.-.+ .+|. |.+. ++. ...+.. +..-+|..|. +++.. +...+++.... T Consensus 150 NGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~i-G~~gdy~NLDalV~D~~~~lI~~~~~~d~dL 228 (342) T protein:vir:10 150 NGTSRAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLV-GKGQEYANLDALVMDATEELIDEWHRDDTDL 228 (342) T ss_pred cceeeccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceee-cCCCCcccHHHHHHHHHhccCChHHhcCCCE Confidence 553211 1222 2111 110 000100 1112343333 44543 46666766556 Q ss_pred eEEeehhHHHHHHHHhhcccCCc--------eeeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEEEEeccc Q lcl|Aclame:pro 530 AYLTSVTQRGAAKKAQVFDNTGE--------RIWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDLKVDPYT 600 (632) Q Consensus 530 ~~~~~~~~~~~~~~~~~~d~~g~--------~~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~~~~~~~ 600 (632) +.++.......-.. .+-..... .+....++.|+|.+..|.+|.+.+++--++...++...| .+-..-+.- T Consensus 229 VvivG~dLladk~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p 307 (342) T protein:vir:10 229 VVITGRKLLADKYF-PIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVP 307 (342) T ss_pred EEEEchhhhHHHHH-HHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 55554443332222 22121111 223346799999999999999999999888877765443 322211111 Q ss_pred ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 601 KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 601 ~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +++.+.-+-..--|..|-++.+++.+.--- T Consensus 308 --~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~ 337 (342) T protein:vir:10 308 --KKDRIETYESENIDYVVEDYGCAALIENIT 337 (342) T ss_pred --ccccccchhhhccceeeeccccEEEeecce Confidence 122222222223333444444444443222 No 214 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=65.28 E-value=0.29 Score=23.56 Aligned_cols=287 Identities=11% Similarity=0.021 Sum_probs=120.9 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEE-E Q lcl|Aclame:pro 329 EVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIP-K 407 (632) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 407 (632) +....+.....+-+....... .....+--+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-... . T Consensus 1 mtr~~~~~y~~~~A~~ngv~~---------a~~~~~~~Fsv~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~v~l 69 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPL---------DSVLRGESFALKAPE-AALLGENIQQRSDFLKQI-NMIQVAHTKGQKLFG 69 (336) T ss_pred CcHHHHHHHHHHHHHHhCCCh---------hhhccCceeecCHHH-HHHHHHHHHHHHHHhhcC-ceeecccccceEeee Confidence 000111111111111000000 000111124455554 455667777777665542 2233222211222 2 Q ss_pred ecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHH--HHHHHH--hhcCC Q lcl|Aclame:pro 408 KTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGV--ALDLAM--LTGTG 483 (632) Q Consensus 408 ~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~--~~~~~~--~~g~g 483 (632) ..+++-+.-..- +-.+ ..++.+.-.|.....---..|+.+.|..-. ..-.+..+.+...+.+ ++|... |+|.- T Consensus 70 g~~g~iagrtdt-~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA-~~~df~~~~~~~~~~r~iALD~i~IGfnG~s 146 (336) T protein:vir:37 70 ATEKGVTGRKQT-GRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFA-IFKDRLVELYSEYFQNQVALDILQIGWNGQS 146 (336) T ss_pred ccCcccccccCC-Cccc-cccCcCCcccEEEEeeeeeeecHHHHHHHh-cChhHHHHHHHHHHHHHHhhchhhhccccee Confidence 222333322221 1122 224555566666666666677777774321 1222232333333332 234433 44532 Q ss_pred ---Ccccccc------ce----ecccc----------ccc---cccccchhHHH-HHHHHHHHHhhccccccceEEeehh Q lcl|Aclame:pro 484 ---LANDPVG------LL----NMTGV----------PAL---TYPAGGVDWAS-VVDMETKISTFNADAGRLAYLTSVT 536 (632) Q Consensus 484 ---~~~~~~G------il----~~a~~----------~~~---~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~ 536 (632) +..+|.+ .+ .++.. +.+ ..++.-.+.|. +.+++..+...+++....+.++... T Consensus 147 ~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~d 226 (336) T protein:vir:37 147 VADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGAD 226 (336) T ss_pred eccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchh Confidence 1224432 11 11110 000 11111222333 3456666666677665666555443 Q ss_pred HHHHHHHHhhcccCC-ce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EEEEecccccccCc Q lcl|Aclame:pro 537 QRGAAKKAQVFDNTG-ER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DLKVDPYTKAASDG 606 (632) Q Consensus 537 ~~~~~~~~~~~d~~g-~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~~~~~~~~~~~~~ 606 (632) .. .-....+-...+ .| +....++.|+|.+..|.+|.+.+++--++...++...|- +-..-+. -+++. T Consensus 227 Ll-a~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~--p~r~r 303 (336) T protein:vir:37 227 LV-SKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRND--EDKKG 303 (336) T ss_pred hh-hhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEc--ccccc Confidence 32 222223333332 22 223467999999999999999999999998877655443 2211111 11233 Q ss_pred EEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 607 LVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 607 ~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.-+-..--|..|-++.++|.+.-.. T Consensus 304 ie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 304 LVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ccchhhhcceeeeeccccEEEeeeee Confidence 33222233344555555555544333 No 215 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=65.22 E-value=0.29 Score=23.56 Aligned_cols=291 Identities=12% Similarity=0.056 Sum_probs=118.8 Q ss_pred HHHHHHHHHHhhhhhhhhhhhHHhhhhhcccc--cccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE Q lcl|Aclame:pro 330 VSLAIADASGKEARGFYMPHEVLVQRQLEKKT--AGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK 407 (632) Q Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 407 (632) +....+..+........ .+.... ..+-.+.|.|.+ ...+...+.+.+.+.+.. .+++.+...-...- T Consensus 1 M~~~tr~~~~~y~~~~A---------~~ngv~~~d~~~~FsV~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~i~ 69 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVA---------ELNGIDAGDVSKKFTVEPSV-TQTLMNTMQESSDFLTRI-NIVPVSEMKGEKIG 69 (357) T ss_pred CChHHHHHHHHHHHHHH---------HHhCCChHHhcceeecCHHH-HHHHHHHHHHHHHHhccC-CccccccceeeEEe Confidence 11111111111111000 011000 112233444444 455666666767665542 23333322222222 Q ss_pred e-cCCccccccc--cCcccccCc-ccceeeeeeeeeeeeeehhhHHHhhcC--hhHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 408 K-TSGANFYWIG--EDEDVQDSD-FDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 408 ~-~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) . .+++-+.-+. -+.+..... .+.+.-.|.....---..|+.+.|..- ..++...+.+.+.+.++.-.=..-|+| T Consensus 70 lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNG 149 (357) T protein:vir:20 70 IGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNG 149 (357) T ss_pred cccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccc Confidence 2 2333333321 112222222 345555666666655666777766431 124455555555555443222222455 Q ss_pred CCCc------cccc------cce----ecccc----------ccc----cccccchhHHHHH----HHHHH-HHhhcccc Q lcl|Aclame:pro 482 TGLA------NDPV------GLL----NMTGV----------PAL----TYPAGGVDWASVV----DMETK-ISTFNADA 526 (632) Q Consensus 482 ~g~~------~~~~------Gil----~~a~~----------~~~----~~~~~~~~~~~i~----~~~~~-~~~~~~~~ 526 (632) .-.+ .+|. |.+ .++.. +.. -..+..-+|..|. +++.. +...+++. T Consensus 150 ts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d 229 (357) T protein:vir:20 150 VKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQED 229 (357) T ss_pred eeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcC Confidence 3211 1222 222 11100 000 0011112344333 44543 46666766 Q ss_pred ccceEEeehhHHHHHHHHhhcccCCce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEE--E Q lcl|Aclame:pro 527 GRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDL--K 595 (632) Q Consensus 527 ~~~~~~~~~~~~~~~~~~~~~d~~g~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~--~ 595 (632) ...+.++.......-.. .+-+..+.+ +....++.|+|.+..|.+|.+.+++--++...++...| .+- . T Consensus 230 ~dLVvivG~dLla~k~~-~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~ 308 (357) T protein:vir:20 230 PDLVVIVGRQLLADKYF-PIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIE 308 (357) T ss_pred CCEEEEEchhhhhhhhh-hHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEE Confidence 55555554443332222 222222222 22346799999999999999999999888877765443 222 1 Q ss_pred Eeccc----ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 596 VDPYT----KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 596 ~~~~~----~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) ..+.. .+..--..|.++-+--++.++.-.|...+..| T Consensus 309 d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~ 349 (357) T protein:vir:20 309 ENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eccccccccchhhhcceeeeeccccEEEeeeeeeccccCCc Confidence 22211 12111123444433333333321111111111 No 216 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=63.53 E-value=0.32 Score=23.33 Aligned_cols=275 Identities=11% Similarity=0.018 Sum_probs=121.3 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhh-hhhhhhhcc---eeeccCce Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRN-KAIIGQMGA---RMLPGLVG 401 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~-~~~~~~~~~---~~~~~~~~ 401 (632) ++ . ....++..-++...+ ..+.+.+.. .+++.++.. ....+... T Consensus 1 mp-----------------~-~~lsel~t~tl~~rs--------------~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~ 48 (321) T protein:vir:34 1 MP-----------------F-PNISDIITTTIESRS--------------GVIADNVTKNNAILARLAKRGKPRLVSGGY 48 (321) T ss_pred CC-----------------C-chHHHHHHHHHHhhc--------------chhhhhhhcccHHHHHHHhcCcccccCCCe Confidence 00 0 000111111111111 001111111 122222211 12223444 Q ss_pred eEEEEEecC-CccccccccCcccc-cCcccceeeeeeeeeeeeeehhhH-HHhhcChh-HHHHHHHHH---HHHHHHHHH Q lcl|Aclame:pro 402 DVDIPKKTS-GANFYWIGEDEDVQ-DSDFDFTTLSFSPKTIAGAVPVTR-KLRKQSSI-HVENLIRED---LIEGIGVAL 474 (632) Q Consensus 402 ~~~~~~~~~-~~~a~~v~E~~~~~-~~~~~~~~~~~~~~t~~~~~~iSr-e~l~d~~~-~~~~~i~~~---l~~a~a~~~ 474 (632) ++..+..-. ..++.|..=-..++ .-.-.+...+|..+.+...+.||- |++.++.- .+.+++... .-+.+.+.+ T Consensus 49 ~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l 128 (321) T protein:vir:34 49 TILEELSFSGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDI 128 (321) T ss_pred eEEEEEeeccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhh Confidence 555554433 56677754222222 234567778888888888888876 55555532 233333333 344555566 Q ss_pred HHHHhh-cCC-Cccccccc---eecc-cccc---c------------cccccchhHHHHHHHHHHHHhh-ccccccceEE Q lcl|Aclame:pro 475 DLAMLT-GTG-LANDPVGL---LNMT-GVPA---L------------TYPAGGVDWASVVDMETKISTF-NADAGRLAYL 532 (632) Q Consensus 475 ~~~~~~-g~g-~~~~~~Gi---l~~a-~~~~---~------------~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~ 532 (632) +..++. |++ .+....|+ .... +.+. + ...++..+..++..++.++-.+ .+....+.++ T Consensus 129 ~~~l~sdGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDli 208 (321) T protein:vir:34 129 SAALYGDGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLI 208 (321) T ss_pred hHhhhccccccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEE Confidence 666554 443 12333332 2111 1111 1 1111223445555555544332 3455567776 Q ss_pred eehhHHHHHHHHhh------cccC-CceeeccccccCcceEEcC----CCCCccEEEEehhhEEEEEecceEEEEecccc Q lcl|Aclame:pro 533 TSVTQRGAAKKAQV------FDNT-GERIWQNNEVNGYRAEASN----QIPADTWIFGDWSQIVIAMWGVLDLKVDPYTK 601 (632) Q Consensus 533 ~~~~~~~~~~~~~~------~d~~-g~~~~~~~~l~G~pv~~~~----~~~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~ 601 (632) +............+ .+.. ++.=++.-.+.|..|+..+ .+|+++.||-|-+.+.+.....-.+.-..... T Consensus 209 i~~~~~y~~y~~s~q~~qR~~~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r 288 (321) T protein:vir:34 209 MSGNDAWTTYSNSLQVLQRFTSAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSR 288 (321) T ss_pred EechHHHHHHHHhhheeeeecccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCccc Confidence 66655433322211 1111 1111222346677788877 68899999999999888866555544333322 Q ss_pred c-ccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 602 A-ASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 602 ~-~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) | ..|+-++....-+-+.++-.++..-..+.| T Consensus 289 ~~~~NqdA~~q~I~~~GnL~~sn~~~~~vL~~ 320 (321) T protein:vir:34 289 RAAFNQDAEAQILAWAGNLTCSGAQFQGRLIA 320 (321) T ss_pred ccccchhHHhhhhhhhheeeeecccceeEEee Confidence 2 123333333333333444333333333344 No 217 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=63.12 E-value=0.32 Score=23.28 Aligned_cols=254 Identities=8% Similarity=-0.046 Sum_probs=101.8 Q ss_pred ccccceechhhhhHHHHHHHhhhhhhhhhcce-----eeccCceeEEEEEecCCccccccccCcccccCcccceeeeeee Q lcl|Aclame:pro 363 GKGGELVATELLSEEFIDILRNKAIIGQMGAR-----MLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSP 437 (632) Q Consensus 363 ~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~ 437 (632) .+ +...+.....+.+.+........+... +...+...+++|+..+...+.-..=+.-++.+.++.+..++.+ T Consensus 1 Ma---in~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl 77 (285) T protein:vir:79 1 MT---VVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKL 77 (285) T ss_pred Cc---chhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEe Confidence 00 001112223333334333333333221 2223456788888753222221222223444444444333333 Q ss_pred eee-eeeehhhHHHhhcC--hhHHHHHHHHHHHHHHH-HHHHHHHhhcCCCccccccceeccccccccccccchhHHHHH Q lcl|Aclame:pro 438 KTI-AGAVPVTRKLRKQS--SIHVENLIREDLIEGIG-VALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVV 513 (632) Q Consensus 438 ~t~-~~~~~iSre~l~d~--~~~~~~~i~~~l~~a~a-~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~ 513 (632) ..= +..+.|- -+..+ ..-....|...+.+... =.+|...+.-.-... | +....+.+ ..--++.|. T Consensus 78 ~~DR~~~f~iD--~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a---~-----~~~~~~~T-~~nv~~~i~ 146 (285) T protein:vir:79 78 THEDWFGYDLD--QFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSA---A-----KKATDSIT-KDNALDAYD 146 (285) T ss_pred eccccceeccc--ccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhc---c-----cccccccC-HHHHHHHHH Confidence 211 1111121 11101 11112222222222211 223332222111100 0 00011111 122367788 Q ss_pred HHHHHHHhhccccccceEEeehhHHHHHHHHhh--c--ccC-----CceeeccccccC-cceEEc--CCCCCc------c Q lcl|Aclame:pro 514 DMETKISTFNADAGRLAYLTSVTQRGAAKKAQV--F--DNT-----GERIWQNNEVNG-YRAEAS--NQIPAD------T 575 (632) Q Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--d~~-----g~~~~~~~~l~G-~pv~~~--~~~~~~------~ 575 (632) +++.+|.....+. +...++.|.....+..... + +.+ |..--....|.| .|++.. +.+... . T Consensus 147 ~~~~~lde~~vp~-~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~In 225 (285) T protein:vir:79 147 TAEAYMFDNEVPG-GFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVN 225 (285) T ss_pred HHHHHHHHcCCCC-ceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhcc Confidence 8888888877663 4444556655555443211 1 111 111112356888 788764 344421 1 Q ss_pred EEEEehhhEEEEEecceE-EEEecccccccCcEEEEEEEEeCcEEeccc--ceEEEEecC Q lcl|Aclame:pro 576 WIFGDWSQIVIAMWGVLD-LKVDPYTKAASDGLVLRVFQDVDAGVRRKE--AFCIAKKGA 632 (632) Q Consensus 576 ~~~gd~s~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~--a~~~~~~~A 632 (632) +++...+. .+...+.-. ...+|...-.-+...|.-+.++|.=|.+.+ ++...+.|| T Consensus 226 fiiv~~~a-~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 226 FILTPLSA-IAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAG 284 (285) T ss_pred EEEecCce-eccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeeccc Confidence 34444333 222222211 233455544445667777788888887764 565555555 No 218 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=60.56 E-value=0.37 Score=22.95 Aligned_cols=292 Identities=11% Similarity=0.039 Sum_probs=111.8 Q ss_pred hhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCce-eEE Q lcl|Aclame:pro 326 FEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVD 404 (632) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 404 (632) +.......+.....+-+....... .....+.-+.|.|.+. ..+...+.+.+.+.+.. .+++.+-. ... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~---------~~~~~~~~FsV~P~v~-q~L~~~i~ess~FL~~I-Nvv~V~q~~g~v 69 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANP---------ALALAGKQFSIEAPKE-SVLLGAIQQRSNFLEKI-NCVFSERYQRAI 69 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCcc---------chhccCceeeecHHHH-HHHHHHHHHHHHHhhcC-ceecchhhcceE Confidence 000001111111111100000000 0011122244555554 55667777777665543 22222211 111 Q ss_pred EEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcC--hhH-HHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 405 IPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIH-VENLIREDLIEGIGVALDLAMLTG 481 (632) Q Consensus 405 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--~~~-~~~~i~~~l~~a~a~~~~~~~~~g 481 (632) .....+.+...-....+...... ..+.-.|.....---..|+.+.|..- ..+ +...+.+.+.+.++.-.=..-++| T Consensus 70 ~~~~~sg~~t~r~~t~~~~~~~~-~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNG 148 (343) T protein:vir:98 70 DLRSNRKRHYGAHDRRTPIQQRW-TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYG 148 (343) T ss_pred EEeecCccccCccccCCCccccc-cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccc Confidence 22222222121111111111110 00111244444444455666655321 122 444444444444433221122455 Q ss_pred CC---Ccccccc------ce----eccccccc---------cccccchhHHHHH----HHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 482 TG---LANDPVG------LL----NMTGVPAL---------TYPAGGVDWASVV----DMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 482 ~g---~~~~~~G------il----~~a~~~~~---------~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~~~ 535 (632) .- +..+|.| .+ .++....+ ..-+..-+|..|. ++...+...+++....+.++.. T Consensus 149 ts~A~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~ 228 (343) T protein:vir:98 149 TSVGTDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGA 228 (343) T ss_pred eeeccCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEch Confidence 32 1123432 11 11110000 0001111343333 4445556666665555555544 Q ss_pred hHHHHHHHHhhcccCCce---------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecc-eEEEEecccccccC Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGER---------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGV-LDLKVDPYTKAASD 605 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~---------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~-~~~~~~~~~~~~~~ 605 (632) .....-.. .+-...+++ +....++.|+|.+..|.+|.+.+++--++...++...| .+-..-+.- +++ T Consensus 229 dLla~~~~-~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p--~r~ 305 (343) T protein:vir:98 229 DLVAKEAS-LVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDD--DKK 305 (343) T ss_pred hhhhhhhh-hhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecc--ccc Confidence 43322222 222222321 22336799999999999999999999999877765444 322211111 122 Q ss_pred cEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 606 GLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 606 ~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+.-+-..--|..|-++.++|.+.-.. T Consensus 306 rie~y~s~Ne~YvVEd~~~~a~iE~i~ 332 (343) T protein:vir:98 306 AVRDSYYRNEAYAVEDCGKFMAVDFTK 332 (343) T ss_pred cccchhhhcceeeeeccccEEEeeeee Confidence 222222223333444444444433222 No 219 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=60.38 E-value=0.37 Score=22.93 Aligned_cols=376 Identities=13% Similarity=0.047 Sum_probs=106.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhh Q lcl|Aclame:pro 230 TRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSL 309 (632) Q Consensus 230 ~r~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (632) .|.... ........+....+....+.....+.......... +........+.. ........+...-..... T Consensus 1 mriS~~--~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evkn--aiedl~K~~EL~-----~TlS~~~iEI~~~en~LN 71 (400) T protein:vir:93 1 MRISKR--NMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN--AIEDLPKVQELE-----KTLSENSIEIIKIENELN 71 (400) T ss_pred Cccccc--ccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhh--hhhhchhHHHHH-----HhHhhcchhhhhhhhhhh Confidence 000000 00000001111111000011101111000000000 000000000000 000000000000000000 Q ss_pred hhhhhhhhhhhhhhhhh--hhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcc-cccccccceechhhhhHHHHHHHhhhh Q lcl|Aclame:pro 310 MRAINAAATGDWSKAGF--EREVSLAIADASGKEARGFYMPHEVLVQRQLEK-KTAGKGGELVATELLSEEFIDILRNKA 386 (632) Q Consensus 310 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~-~~~~~~~~~i~~~~~~~~i~~~~~~~~ 386 (632) .. .....+......+ ......+..+.+...... .....+....-... .+.++....+|..+ ...|...+.... T Consensus 72 a~--~E~~KGK~kMt~~i~sq~A~~eF~~vL~~N~G~-S~~k~AW~A~L~E~GVtiTD~~~~LP~~l-v~sI~~A~~n~n 147 (400) T protein:vir:93 72 AQ--EEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGK-SEIKNAWSAKLAENGVTITDTTFQLPRKL-VESINTALLNTN 147 (400) T ss_pred hh--hhhhhhhHHHHHHHhhHHHHHHHHHHHhccCCc-hhhhhhhhhhHhhcCcceeccchhccHHH-HHHHHHhhhccC Confidence 00 0000000000000 000000111111110000 01111111111111 11122223344333 233444444444 Q ss_pred hhhhhcceeeccCceeEEEEE-ecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhc---ChhHHHHHH Q lcl|Aclame:pro 387 IIGQMGARMLPGLVGDVDIPK-KTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ---SSIHVENLI 462 (632) Q Consensus 387 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d---~~~~~~~~i 462 (632) ++.+.. .++.... +-+.+ ..+...+..-..|..+++....|.--++.+.-+.....+ -++..+ +...++.+| T Consensus 148 ~v~~vf--HVT~~~~-~~V~~s~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i 223 (400) T protein:vir:93 148 PVFKVF--HVTNVGA-LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLI 223 (400) T ss_pred cceeee--eeccchh-hhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHH Confidence 433321 1122111 11111 111112222333444555555554444444333333333 222233 334568999 Q ss_pred HHHHHHHHH-HHHHHHHhhcCCCccccccceeccccccc------cc-cccchhHHHHHHHHHHHHhhccccccceEEee Q lcl|Aclame:pro 463 REDLIEGIG-VALDLAMLTGTGLANDPVGLLNMTGVPAL------TY-PAGGVDWASVVDMETKISTFNADAGRLAYLTS 534 (632) Q Consensus 463 ~~~l~~a~a-~~~~~~~~~g~g~~~~~~Gil~~a~~~~~------~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 534 (632) ...+++++- +..+.+++-|+|++. ...+-.-+.+..+ +. ++..+-.+.|..+..- .++.....|++. T Consensus 224 ~~ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdf----vrptagrryliv 298 (400) T protein:vir:93 224 VAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF----VRPTAGRRYLIV 298 (400) T ss_pred HHHHHHHHHHHHHHhhhheecCCCC-ccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhh----hccCCCceEEEE Confidence 999999998 788889999998764 1111111222111 11 1222223334443333 334444456665 Q ss_pred hhHHHHHHHHhhcccCCce---eeccc-cc---cCc-ceEEcCCC-CCccEEEEehhhEEEEEecceEEEEecccccccC Q lcl|Aclame:pro 535 VTQRGAAKKAQVFDNTGER---IWQNN-EV---NGY-RAEASNQI-PADTWIFGDWSQIVIAMWGVLDLKVDPYTKAASD 605 (632) Q Consensus 535 ~~~~~~~~~~~~~d~~g~~---~~~~~-~l---~G~-pv~~~~~~-~~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~ 605 (632) ........+..+...+... +-+++ .+ -|. .+++-... .-..-++.|-+ |++-+. .+..-+-..+.++ T Consensus 299 ktedrkalldelrqatanahvriknddaeiasevgvdeiivytgskalkptvlvdqk-yhidmq---dltkvdafewktn 374 (400) T protein:vir:93 299 KTEDRKALLDELRQATANAHVRIKNDDAEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ---DLTKVDAFEWKTN 374 (400) T ss_pred eccchHHHHHHHHhhccccceEeecchhhhhhhcCcceeeeeeccccccceeeeccc-cccchh---hhhhhhhheeccC Confidence 5544344444444444322 21211 11 122 12221111 11111222211 111000 0000111122233 Q ss_pred cEEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 606 GLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 606 ~~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .--+.++.--.+-|--..|-..+++. T Consensus 375 snmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 375 SNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred CceEEEeecccCcceeeccceeEeeC Confidence 33344444444444333443444444 No 220 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=59.12 E-value=0.4 Score=22.77 Aligned_cols=367 Identities=13% Similarity=0.066 Sum_probs=105.7 Q ss_pred hhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhh Q lcl|Aclame:pro 217 SGANENDILSRERTRISEI----TAIGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIH 292 (632) Q Consensus 217 ~~~~~~~~~~~~~~r~~~~----~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (632) ...+.- ...+.|.... ..+..+...-+ ...++..-...++.+..+... ..+....+.....+ T Consensus 1 mnkpdl---iekqnrlaelkennvslksqisgfe-vknaiedl~K~~ELe~TlSe~--------~iEI~k~en~LN~~-- 66 (393) T protein:vir:16 1 MNKPDL---IEKQNRLAELKENNVSLKSQISGFE-VKNAIEDLPKVQELEKTLSEN--------SIEIIKIENELNAQ-- 66 (393) T ss_pred CCCcch---hhhhhhhhhhhhcccchhhhccchh-hhhhhhhchhHHHHHHhHhhc--------chhhhhhhhhhhhh-- Confidence 000000 0000000000 00000000000 000000000000000000000 00000000000000 Q ss_pred hhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhccc-ccccccceech Q lcl|Aclame:pro 293 SARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKK-TAGKGGELVAT 371 (632) Q Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~~~~i~~ 371 (632) .+.. -....+..... ......+..+.+...... .....+....-...+ +.++....+|. T Consensus 67 --eE~~-----KGK~kMt~~ie------------sq~A~~eF~~vL~~N~G~-S~~k~AW~A~L~E~GVtiTD~~~~LP~ 126 (393) T protein:vir:16 67 --EEKP-----KGKDKMTNFIE------------SQNAVTEFFDVLKKNSGK-SEIKNAWSAKLAENGVTITDTTFQLPR 126 (393) T ss_pred --hhcc-----hhhHHHHHHHh------------hHHHHHHHHHHHhccCCc-hhhhhhhhhhHhhcCcceeccchhccH Confidence 0000 00000000000 000000011111110000 011111111111111 11222233443 Q ss_pred hhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEEE-ecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHH Q lcl|Aclame:pro 372 ELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPK-KTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKL 450 (632) Q Consensus 372 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~ 450 (632) .+ ...|...+....++.... .++.... +-+.+ ..+...+..-..|..+++....|.--++.+.-+.....+ -++ T Consensus 127 ~l-v~sI~~A~~n~n~v~~vf--HVT~~~~-~~V~~s~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~ 201 (393) T protein:vir:16 127 KL-VESINTALLNTNPVFKVF--HVTNVGA-LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AER 201 (393) T ss_pred HH-HHHHHHhhhccCcceeee--eeccchh-hhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHH Confidence 33 233444444444433321 1222111 11111 111112222333444555555554444444333333333 222 Q ss_pred hhc---ChhHHHHHHHHHHHHHHH-HHHHHHHhhcCCCccccccceeccccccc------cc-cccchhHHHHHHHHHHH Q lcl|Aclame:pro 451 RKQ---SSIHVENLIREDLIEGIG-VALDLAMLTGTGLANDPVGLLNMTGVPAL------TY-PAGGVDWASVVDMETKI 519 (632) Q Consensus 451 l~d---~~~~~~~~i~~~l~~a~a-~~~~~~~~~g~g~~~~~~Gil~~a~~~~~------~~-~~~~~~~~~i~~~~~~~ 519 (632) ..+ +...++.+|...+++++- +..+.+++-|+|++. ...+-.-+.+..+ +. ++..+-.+.|..+..-+ T Consensus 202 ~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N~-f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfv 280 (393) T protein:vir:16 202 VKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFV 280 (393) T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCCC-ccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhh Confidence 333 333568999999999998 788889999998764 1111111222111 11 12223233344443333 Q ss_pred HhhccccccceEEeehhHHHHHHHHhhcccCCc---eeecccc-c---cCc-ceEEcCCC-CCccEEEEehhhEEEEEec Q lcl|Aclame:pro 520 STFNADAGRLAYLTSVTQRGAAKKAQVFDNTGE---RIWQNNE-V---NGY-RAEASNQI-PADTWIFGDWSQIVIAMWG 590 (632) Q Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~g~---~~~~~~~-l---~G~-pv~~~~~~-~~~~~~~gd~s~~~~~~~~ 590 (632) ++.....|++.........+..+...+.. .+-++++ + -|. .+++-... .-..-++.|-+ |++-+. T Consensus 281 ----rptagrrylivktedrkalldelrqatananvriknddteiasevgvdeiivytgskalkptvlvdqk-yhidmq- 354 (393) T protein:vir:16 281 ----RPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQ- 354 (393) T ss_pred ----ccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccccccceeeeccc-cccchh- Confidence 34444456665554333334444443322 2222222 1 122 12221111 11111222211 111000 Q ss_pred ceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEec Q lcl|Aclame:pro 591 VLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) Q Consensus 591 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~ 631 (632) .+..-+-..+.++.--+.++.--.+-|--..|-..+++. T Consensus 355 --dltkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 355 --DLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred --hhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 000011112223333344444444444334444444444 No 221 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=53.45 E-value=0.53 Score=22.10 Aligned_cols=339 Identities=11% Similarity=0.036 Sum_probs=108.8 Q ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhHhhhhh-------hhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 245 RSLAQEAIQKGHTVDQFRALVLERMNPGQPG-------NFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAA 317 (632) Q Consensus 245 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (632) +. ...+.+.+.+...... ...+........+.+.... . T Consensus 1 ~~-------------~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~----------------------~ 45 (529) T protein:vir:10 1 MS-------------LKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDS----------------------K 45 (529) T ss_pred Cc-------------cchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHh----------------------h Confidence 00 0000011111110000 0000000000000000000 0 Q ss_pred hhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhh-cceee Q lcl|Aclame:pro 318 TGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARML 396 (632) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~-~~~~~ 396 (632) ... ......+.+.....+.......... ........+..++........+... .+...+.-+...+ +++++ T Consensus 46 ~~~---~~~~~~~~e~~~~~l~e~~~~~~~~----~~~~~ia~s~~t~~v~~~~P~Li~l-vRra~p~LIa~DIwGVQPM 117 (529) T protein:vir:10 46 TDP---VYRDDKLIEAFGQSLMEAEVAGDHG----YDPTNIAAGQSSGAITNIGPAVIGM-VRRAIPSLIAFDIAGVQPM 117 (529) T ss_pred ccc---ccchhhhhhhhhhccchhhcccccc----cccccccccccccccccccchhhhh-HHHHHHhHHhhhhheeccC Confidence 000 0000000011000000000000000 0000000111111111000000010 1100111111111 12222 Q ss_pred ccCceeEEEEE--------------------------------------------------------------------- Q lcl|Aclame:pro 397 PGLVGDVDIPK--------------------------------------------------------------------- 407 (632) Q Consensus 397 ~~~~~~~~~~~--------------------------------------------------------------------- 407 (632) ++..+-+.-.+ T Consensus 118 TgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~ 197 (529) T protein:vir:10 118 TGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAY 197 (529) T ss_pred CchhhhhhhheeeecCCcCCCcccccccccccccccccccccccccccccccccccccccceeeccccceeeeccccccc Confidence 11111000000 Q ss_pred ----------------------------------ecCCcccccccc---------CcccccCcccceeeeeeeeeeeeee Q lcl|Aclame:pro 408 ----------------------------------KTSGANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAV 444 (632) Q Consensus 408 ----------------------------------~~~~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~ 444 (632) ..+.+...-.+| +..+++-.+.+++++..+++-+=+. T Consensus 198 s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKA 277 (529) T protein:vir:10 198 LQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKA 277 (529) T ss_pred ccccccccccccccccCCccccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeeccceec Confidence 000000000112 1235555677888888888888888 Q ss_pred hhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc------------ccccceeccccccccccccchh Q lcl|Aclame:pro 445 PVTRKLRKQS----SIHVENLIREDLIEGIGVALDLAMLTGTGLAN------------DPVGLLNMTGVPALTYPAGGVD 508 (632) Q Consensus 445 ~iSre~l~d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~------------~~~Gil~~a~~~~~~~~~~~~~ 508 (632) ..|-|+..|- .+++++.|.+.|...+...||+.|+.-.-... ...|++........ .++-.. T Consensus 278 EYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~--~~~~~~ 355 (529) T protein:vir:10 278 QYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDPIDV--RGARWA 355 (529) T ss_pred cccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceeccccccc--cccchh Confidence 8999988772 47899999999999999999999986221111 12233322211110 011112 Q ss_pred HHHHHHHHH-------HHHhhccccccceEE-eehhHHHHHHHHhhcccCCcee----e--------ccccccC-cceEE Q lcl|Aclame:pro 509 WASVVDMET-------KISTFNADAGRLAYL-TSVTQRGAAKKAQVFDNTGERI----W--------QNNEVNG-YRAEA 567 (632) Q Consensus 509 ~~~i~~~~~-------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~d~~g~~~----~--------~~~~l~G-~pv~~ 567 (632) ...+..+.. .+..+.+ .....|+ +++.....|...-+.+..+..- | .-+.|.| |+|.+ T Consensus 356 ~e~~~~L~~~i~~~an~I~~~T~-rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~ 434 (529) T protein:vir:10 356 GESYKALLIQIDKEANEIARQTG-RGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYI 434 (529) T ss_pred HHHHHHHHHHHHHHHHHHHHhhc-cccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEe Confidence 222222222 2333333 2334444 4444444443221111111100 0 0145544 79999 Q ss_pred cCCCCCccEEEEeh--hhEE--EEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEE-------------- Q lcl|Aclame:pro 568 SNQIPADTWIFGDW--SQIV--IAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAK-------------- 629 (632) Q Consensus 568 ~~~~~~~~~~~gd~--s~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~-------------- 629 (632) +++.+.+=+++|-- +.+. +++-=++.+..-+..+-.+-+=.+-...|++..+ +| |...+ T Consensus 435 D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP--~~~~~~~~~~~r~~~g~~~ 511 (529) T protein:vir:10 435 DQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGFKTRYAIGV-NP--FAESRTQAPTSRISNGMPG 511 (529) T ss_pred cCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeeeeeeeceee-cC--ccccccccccccccCCcch Confidence 99888765555511 0110 1111111111111112222222333444554432 33 22211 Q ss_pred --------------ecC Q lcl|Aclame:pro 630 --------------KGA 632 (632) Q Consensus 630 --------------~~A 632 (632) +|= T Consensus 512 ~~~ag~n~~~r~~~Vk~ 528 (529) T protein:vir:10 512 AHSVGKNAYFRRVWVKG 528 (529) T ss_pred hhhcCccceeeEeeecc Confidence 111 No 222 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=52.88 E-value=0.54 Score=22.04 Aligned_cols=292 Identities=10% Similarity=0.042 Sum_probs=122.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhccccccc-----ccceechhhhhHHHHHHHh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGK-----GGELVATELLSEEFIDILR 383 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-----~~~~i~~~~~~~~i~~~~~ 383 (632) +... ... ............+ ...+++.++...+ ++..+-.+-+.+.+..+.. T Consensus 1 ~~~~--~~~--~~~~~~~~~~~~e-------------------~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~ 57 (463) T protein:vir:99 1 MTIE--KNL--SDVQQKYADQFQE-------------------DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTW 57 (463) T ss_pred CCcc--ccc--chHHHHHHhhhhH-------------------HHHHHhhcCCccCCccccCcchhhhhhhhhhhheeee Confidence 0000 000 0000000000000 1112222222111 1112222222222222221 Q ss_pred h---hhhhhhhcceeeccCceeE-EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhH-HHhhcChhHH Q lcl|Aclame:pro 384 N---KAIIGQMGARMLPGLVGDV-DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTR-KLRKQSSIHV 458 (632) Q Consensus 384 ~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSr-e~l~d~~~~~ 458 (632) . ...+..+..+.....-..+ .+...+..+.+.+++|++..+.+++.+......++-++....+|- .-|.|+..+. T Consensus 58 ~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~ 137 (463) T protein:vir:99 58 TNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADP 137 (463) T ss_pred cccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccH Confidence 1 1122222222222221222 222344456788999999999999999999999999888877776 3356667788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCc--------cccccceeccccc-cccccccchhHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 459 ENLIREDLIEGIGVALDLAMLTGTGLA--------NDPVGLLNMTGVP-ALTYPAGGVDWASVVDMETKISTFNADAGRL 529 (632) Q Consensus 459 ~~~i~~~l~~a~a~~~~~~~~~g~g~~--------~~~~Gil~~a~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 529 (632) .....+.-...++++++.++++|+..- -+.+|+.+.-+.. .+.+-+..++.+.|-.+-..+...+... . T Consensus 138 ~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~--T 215 (463) T protein:vir:99 138 SQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTA--T 215 (463) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCCh--h Confidence 888888889999999999999886421 1234443322222 2334456777777776666665544322 2 Q ss_pred eEEeehhHHHHHHHHhh-------cccCCceee----------------ccccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 530 AYLTSVTQRGAAKKAQV-------FDNTGERIW----------------QNNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 530 ~~~~~~~~~~~~~~~~~-------~d~~g~~~~----------------~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) .+.|+.-....+.-..+ .+..|.... .+.++++.|-+...... ..-+.|....+ T Consensus 216 D~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~---~~p~ap~~~~~ 292 (463) T protein:vir:99 216 DAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQ---PLPNAPQPAKV 292 (463) T ss_pred heecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhh---cCCCCccCcee Confidence 33344443333221111 111111100 01112222221111100 00001110000 Q ss_pred EEecceEEEEec-cc---ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDP-YT---KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~-~~---~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . .++...+ .. ..+.....|++...-+.+=-.|..++-.+.++ T Consensus 293 t----atv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:99 293 T----ATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred E----EEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeee Confidence 0 0000000 00 01223334555444444444455555555544 No 223 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=52.88 E-value=0.54 Score=22.04 Aligned_cols=292 Identities=10% Similarity=0.042 Sum_probs=122.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhccccccc-----ccceechhhhhHHHHHHHh Q lcl|Aclame:pro 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGK-----GGELVATELLSEEFIDILR 383 (632) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-----~~~~i~~~~~~~~i~~~~~ 383 (632) +... ... ............+ ...+++.++...+ ++..+-.+-+.+.+..+.. T Consensus 1 ~~~~--~~~--~~~~~~~~~~~~e-------------------~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~ 57 (463) T protein:vir:95 1 MTIE--KNL--SDVQQKYADQFQE-------------------DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTW 57 (463) T ss_pred CCcc--ccc--chHHHHHHhhhhH-------------------HHHHHhhcCCccCCccccCcchhhhhhhhhhhheeee Confidence 0000 000 0000000000000 1112222222111 1112222222222222221 Q ss_pred h---hhhhhhhcceeeccCceeE-EEEEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhH-HHhhcChhHH Q lcl|Aclame:pro 384 N---KAIIGQMGARMLPGLVGDV-DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTR-KLRKQSSIHV 458 (632) Q Consensus 384 ~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSr-e~l~d~~~~~ 458 (632) . ...+..+..+.....-..+ .+...+..+.+.+++|++..+.+++.+......++-++....+|- .-|.|+..+. T Consensus 58 ~~~~f~~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~ 137 (463) T protein:vir:95 58 TNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADP 137 (463) T ss_pred cccchhhhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccH Confidence 1 1122222222222221222 222344456788999999999999999999999999888877776 3356667788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCc--------cccccceeccccc-cccccccchhHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 459 ENLIREDLIEGIGVALDLAMLTGTGLA--------NDPVGLLNMTGVP-ALTYPAGGVDWASVVDMETKISTFNADAGRL 529 (632) Q Consensus 459 ~~~i~~~l~~a~a~~~~~~~~~g~g~~--------~~~~Gil~~a~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 529 (632) .....+.-...++++++.++++|+..- -+.+|+.+.-+.. .+.+-+..++.+.|-.+-..+...+... . T Consensus 138 ~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~--T 215 (463) T protein:vir:95 138 SQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTA--T 215 (463) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCCh--h Confidence 888888889999999999999886421 1234443322222 2334456777777776666665544322 2 Q ss_pred eEEeehhHHHHHHHHhh-------cccCCceee----------------ccccccCcceEEcCCCCCccEEEEehhhEEE Q lcl|Aclame:pro 530 AYLTSVTQRGAAKKAQV-------FDNTGERIW----------------QNNEVNGYRAEASNQIPADTWIFGDWSQIVI 586 (632) Q Consensus 530 ~~~~~~~~~~~~~~~~~-------~d~~g~~~~----------------~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~ 586 (632) .+.|+.-....+.-..+ .+..|.... .+.++++.|-+...... ..-+.|....+ T Consensus 216 D~~lp~~vka~f~~~~l~~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~---~~p~ap~~~~~ 292 (463) T protein:vir:95 216 DAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQ---PLPNAPQPAKV 292 (463) T ss_pred heecchHHHHHHHHHhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhh---cCCCCccCcee Confidence 33344443333221111 111111100 01112222221111100 00001110000 Q ss_pred EEecceEEEEec-cc---ccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 587 AMWGVLDLKVDP-YT---KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 587 ~~~~~~~~~~~~-~~---~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) . .++...+ .. ..+.....|++...-+.+=-.|..++-.+.++ T Consensus 293 t----atv~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~ 338 (463) T protein:vir:95 293 T----ATVETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) T ss_pred E----EEEeeccCCCCCCcccccceEEEEEEECCCCCcccchheeeeeee Confidence 0 0000000 00 01223334555444444444455555555544 No 224 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=49.97 E-value=0.62 Score=21.71 Aligned_cols=343 Identities=10% Similarity=0.058 Sum_probs=110.4 Q ss_pred hhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 252 IQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVS 331 (632) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (632) ...+...+.....+ +.....+.....+........+.+..... ......... T Consensus 1 ~~~~~l~~kw~p~l-~~~~~~~i~~~~~~~~~a~l~enq~~~~~---------------------------~~~~~~~~~ 52 (534) T protein:vir:10 1 MSKKSLLKKWQPLV-ESEGMPAIASMKRKDIVARIFENQDEDIA---------------------------HNEGGVYTD 52 (534) T ss_pred CchhHHHHHhHHhh-cCCccccccchhhhhhhhhhhhhHHHHHh---------------------------hhcccccch Confidence 00000000000000 00000000000000000000000000000 000000000 Q ss_pred HHHHHHHHhhhhhhhhhhhHHhh---hhhc-------ccccccccceechhhhhHHHHHHHh---hhhhhhhh-cceeec Q lcl|Aclame:pro 332 LAIADASGKEARGFYMPHEVLVQ---RQLE-------KKTAGKGGELVATELLSEEFIDILR---NKAIIGQM-GARMLP 397 (632) Q Consensus 332 ~~~~~~~~~~~~~~~~~~~~~~~---~a~~-------~~~~~~~~~~i~~~~~~~~i~~~~~---~~~~~~~~-~~~~~~ 397 (632) ....+.++.... ......+.. ...+ ..+..++..... ...++.+.| +.-+...+ ++++++ T Consensus 53 ~~~~~~~~~~~~--~~~~~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~----~P~Li~lvRra~p~LIa~DIwGVQPMT 126 (534) T protein:vir:10 53 QVVVNSMVDVKG--RIEEARLAEANIGGDHGYDATKIASGETSGSITNV----GPAVMGLVRRAIPQLIAFDICGVQPMT 126 (534) T ss_pred hhhhhhhhcccc--chhhccccccccccccccccccccccccccccccc----cchhhhHHHHHHHhhhhhhhheeccCC Confidence 000011100000 000000000 0000 000111111100 111111111 11122222 233333 Q ss_pred cCceeEEEEE--ec-C---------------------------------------------------------------- Q lcl|Aclame:pro 398 GLVGDVDIPK--KT-S---------------------------------------------------------------- 410 (632) Q Consensus 398 ~~~~~~~~~~--~~-~---------------------------------------------------------------- 410 (632) +..+-+.-.+ -. . T Consensus 127 gPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~ 206 (534) T protein:vir:10 127 SSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFI 206 (534) T ss_pred chhhhheeeeeeecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 2221111111 00 0 Q ss_pred --------------------------------Ccccccccc---------CcccccCcccceeeeeeeeeeeeeehhhHH Q lcl|Aclame:pro 411 --------------------------------GANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRK 449 (632) Q Consensus 411 --------------------------------~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSre 449 (632) .+...-.+| +.++++-.+.+++++..+++-+=+...|-| T Consensus 207 ~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiE 286 (534) T protein:vir:10 207 KDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIE 286 (534) T ss_pred cccccccccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHH Confidence 000000011 112455567778888888888888889999 Q ss_pred HhhcC----hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc------------ccccceeccccccccccccchhHHHHH Q lcl|Aclame:pro 450 LRKQS----SIHVENLIREDLIEGIGVALDLAMLTGTGLAN------------DPVGLLNMTGVPALTYPAGGVDWASVV 513 (632) Q Consensus 450 ~l~d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~------------~~~Gil~~a~~~~~~~~~~~~~~~~i~ 513 (632) +..|- .+++++.|.+.|...+...+|+.|+.-.-+.. .-.|++......+..+ +-.....+. T Consensus 287 LAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~--~~~~~e~~~ 364 (534) T protein:vir:10 287 MAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRG--ARWAGESYK 364 (534) T ss_pred HHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeeccccccc--hhHHHHHHH Confidence 88772 47899999999999999999998874322111 1123333222222111 111222333 Q ss_pred HHHHH-------HHhhccccccceEE-eehhHHHHHHHHhhcc---cCC---------ceeeccccccC-cceEEcCCCC Q lcl|Aclame:pro 514 DMETK-------ISTFNADAGRLAYL-TSVTQRGAAKKAQVFD---NTG---------ERIWQNNEVNG-YRAEASNQIP 572 (632) Q Consensus 514 ~~~~~-------~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~d---~~g---------~~~~~~~~l~G-~pv~~~~~~~ 572 (632) .+... +..+.+ ...+.|+ +++.....|...-.-+ ..| .....-+.|.| |+|.++++.+ T Consensus 365 ~L~~~i~~~an~i~~~T~-rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~ 443 (534) T protein:vir:10 365 ALVVQIDKEANEIARQTG-RGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAV 443 (534) T ss_pred HHHHHHHHHHHHHHHhhc-cccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCC Confidence 33332 222222 2334444 4444444443211110 001 11111345654 7999999888 Q ss_pred CccEEEEehh--hE--EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecc-------c------------------ Q lcl|Aclame:pro 573 ADTWIFGDWS--QI--VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRK-------E------------------ 623 (632) Q Consensus 573 ~~~~~~gd~s--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~-------~------------------ 623 (632) .+=+++|--. .+ -+++-=++.+...+..+-.+-+=.+-...|++..+ +| + T Consensus 444 ~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~g~~~~~~~ag~ 522 (534) T protein:vir:10 444 EDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNFQPVLGFKTRYGVKL-HPMADATQNKGFAKISNGMPQHTNMFGK 522 (534) T ss_pred cceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccccccccCCcchhhhccc Confidence 7655554110 00 00111111111111112222222333333444332 11 0 Q ss_pred --ceEEEEecC Q lcl|Aclame:pro 624 --AFCIAKKGA 632 (632) Q Consensus 624 --a~~~~~~~A 632 (632) =|+++.+|= T Consensus 523 n~~~~~~~Vk~ 533 (534) T protein:vir:10 523 NAFFRRVLVAG 533 (534) T ss_pred ccceeeeeeec Confidence 122222222 No 225 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=48.99 E-value=0.65 Score=21.60 Aligned_cols=341 Identities=13% Similarity=0.088 Sum_probs=110.0 Q ss_pred hhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhH Q lcl|Aclame:pro 250 EAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFERE 329 (632) Q Consensus 250 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (632) -..... +.+.+.+.. ..+.... ......+.. ...................+ T Consensus 1 ~~~~~~-------~~l~~kw~p----~l~~~~~------~~i~~~~~~-----~~a~~~enq~~~~~~~~~~~------- 51 (521) T protein:vir:10 1 MTIKTK-------AELLNKWKP----LLEGEGL------PEIANSKQA-----IIAKIFENQEKDFQTAPEYK------- 51 (521) T ss_pred CCcchh-------HHHHHhhhh----hhccCCC------Cccccchhh-----hhhhhhhhhhhhhhhccccc------- Confidence 000000 001111110 0000000 000000000 00000000000000000000 Q ss_pred HHHHHHHHHHhhhhh----hhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHh---hhhhhhhh-cceeeccCce Q lcl|Aclame:pro 330 VSLAIADASGKEARG----FYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILR---NKAIIGQM-GARMLPGLVG 401 (632) Q Consensus 330 ~~~~~~~~~~~~~~~----~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~---~~~~~~~~-~~~~~~~~~~ 401 (632) .......++..... ..........+ .+..++..... ...++.+.| +.-+...+ +++++++..+ T Consensus 52 -~~~~~~~~~~~l~e~~~~~~~~~~~~~i~----es~~t~~v~~~----~P~Li~lvRra~p~LIa~DIwGVQPMTgPTG 122 (521) T protein:vir:10 52 -DEKIAQAFGSFLTEAEIGGDHGYNATNIA----AGQTSGAVTQI----GPAVMGMVRRAIPNLIAFDICGVQPMNSPTG 122 (521) T ss_pred -hhHHHHHHhhhhhhhcccCcccccccccc----ccccccccccC----CchhhhHHHHHHhhhhhhhceeeccCCchhh Confidence 00001111110000 00000000000 01111111100 111111111 11122222 2233222211 Q ss_pred eEEEEEe---cC-------------------------------------------------------------------- Q lcl|Aclame:pro 402 DVDIPKK---TS-------------------------------------------------------------------- 410 (632) Q Consensus 402 ~~~~~~~---~~-------------------------------------------------------------------- 410 (632) -+.-.+. +. T Consensus 123 LIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~ 202 (521) T protein:vir:10 123 QVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTA 202 (521) T ss_pred hheeeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccccccCCCcc Confidence 1100000 00 Q ss_pred -----------------------Ccccccccc---------CcccccCcccceeeeeeeeeeeeeehhhHHHhhcC---- Q lcl|Aclame:pro 411 -----------------------GANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---- 454 (632) Q Consensus 411 -----------------------~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~---- 454 (632) .+..--..| +..+++-.+.+++++..+++-+=+...|-|+..|- T Consensus 203 ~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVH 282 (521) T protein:vir:10 203 DDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVH 282 (521) T ss_pred cccccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 000000011 11255566777888888888888888999988772 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCc------------cccccceeccccccccccccchhHHHHHHHHH----- Q lcl|Aclame:pro 455 SIHVENLIREDLIEGIGVALDLAMLTGTGLA------------NDPVGLLNMTGVPALTYPAGGVDWASVVDMET----- 517 (632) Q Consensus 455 ~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~------------~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~----- 517 (632) .+++++.|.+.|...+...+|+.|+.-.-.. +...|++.........+ +......+..+.. T Consensus 283 GLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~--~~~~~e~~k~L~~~i~~~ 360 (521) T protein:vir:10 283 GMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRG--ARWAGESFKALLFQIDKE 360 (521) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceeccccccccc--chHHHHHHHHHHHHHHHH Confidence 4789999999999999999999987432111 11233333222111111 1111122222222 Q ss_pred --HHHhhccccccceEE-eehhHHHHHHHHhhcc---cCC-ceeec--------cccccC-cceEEcCCCCCccEEEEeh Q lcl|Aclame:pro 518 --KISTFNADAGRLAYL-TSVTQRGAAKKAQVFD---NTG-ERIWQ--------NNEVNG-YRAEASNQIPADTWIFGDW 581 (632) Q Consensus 518 --~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~d---~~g-~~~~~--------~~~l~G-~pv~~~~~~~~~~~~~gd~ 581 (632) .+..+.+ ...+.|+ +++.....|...-..+ ..| ..-|. -+.|.| |+|.++++.+.+=+++|-- T Consensus 361 an~i~~~T~-r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~K 439 (521) T protein:vir:10 361 AVEIARQTG-RGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYK 439 (521) T ss_pred HHHHHHhcc-cccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEEEEEe Confidence 2333333 2344444 5555555444211001 000 00011 145544 7999999888765555511 Q ss_pred h--hE--EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccc---------------------------eEEEEe Q lcl|Aclame:pro 582 S--QI--VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEA---------------------------FCIAKK 630 (632) Q Consensus 582 s--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a---------------------------~~~~~~ 630 (632) . .+ -+++-=++.+...+..+-.+-+=.+-...|++..+ +|=+ |.++++ T Consensus 440 G~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v 518 (521) T protein:vir:10 440 GPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYV 518 (521) T ss_pred CCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccceeecccchhhhccccccceeeeeee Confidence 0 11 01111111111111112222222333334444432 2211 222222 Q ss_pred cC Q lcl|Aclame:pro 631 GA 632 (632) Q Consensus 631 ~A 632 (632) += T Consensus 519 ~~ 520 (521) T protein:vir:10 519 KG 520 (521) T ss_pred cC Confidence 22 No 226 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=47.71 E-value=0.69 Score=21.46 Aligned_cols=287 Identities=10% Similarity=0.022 Sum_probs=121.1 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCceeEEEE-E Q lcl|Aclame:pro 329 EVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIP-K 407 (632) Q Consensus 329 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 407 (632) +....+.....+-+....... .....+.-+.+.|.+ ...+...+.+.+.+.+.. .+++.+.-.-... . T Consensus 1 mtr~~~~~y~~~~A~~ngv~~---------a~~~~~~~Fsv~P~v-~q~L~~~i~ess~FL~~I-Nvv~V~e~~Ge~v~l 69 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPL---------DSVLRGESFALKAPE-AALLGENIQQRSDFLKGI-NMVQVAHTKGTKLFG 69 (336) T ss_pred CcHHHHHHHHHHHHHHhCCCh---------hhhcccceeecCHHH-HHHHHHHHHHHHHHhhcC-ceeecccccceEEee Confidence 000111111111110000000 000111234455554 445667777777665542 2233222211112 2 Q ss_pred ecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHHhhcChhHHHHHHHHHHHHHHHH--HHHHHH--hhcCC Q lcl|Aclame:pro 408 KTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGV--ALDLAM--LTGTG 483 (632) Q Consensus 408 ~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~~~~~~~~i~~~l~~a~a~--~~~~~~--~~g~g 483 (632) ..+++-+.-..-+... .....+.-.|.....---..|+.+.|..- -..-.+..+.+...+.+ ++|... |+|.- T Consensus 70 g~~g~iagrtdt~r~r--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~W-A~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s 146 (336) T protein:vir:37 70 ATEKGVTGRKQTGRNL--ATLDHSQNGYELSETDSGILVNWSLFDSF-AIFKDRLVELYSEYFQNQVALDILQIGWNGQS 146 (336) T ss_pred ccCcccccccCCCCCc--cccCCCCCccEEEEeeeeeeccHHHHHHH-hcChhHHHHHHHHHHHHHHhcchhhhccccee Confidence 2223333322222211 12334445555555555566777776432 11222233333333332 234433 34432 Q ss_pred ---Ccccccc------ce----eccccc----------cc---cccccchhHHH-HHHHHHHHHhhccccccceEEeehh Q lcl|Aclame:pro 484 ---LANDPVG------LL----NMTGVP----------AL---TYPAGGVDWAS-VVDMETKISTFNADAGRLAYLTSVT 536 (632) Q Consensus 484 ---~~~~~~G------il----~~a~~~----------~~---~~~~~~~~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~ 536 (632) +.++|.+ .+ .++... .+ ..++.-.+.|. +.+++..+...+++....+.++... T Consensus 147 ~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~d 226 (336) T protein:vir:37 147 VATNTTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGAD 226 (336) T ss_pred eccCCCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchh Confidence 1224433 11 111100 00 11111222333 3456666666677665666555443 Q ss_pred HHHHHHHHhhcccCC-ce--------eeccccccCcceEEcCCCCCccEEEEehhhEEEEEecce-EEEEecccccccCc Q lcl|Aclame:pro 537 QRGAAKKAQVFDNTG-ER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQIVIAMWGVL-DLKVDPYTKAASDG 606 (632) Q Consensus 537 ~~~~~~~~~~~d~~g-~~--------~~~~~~l~G~pv~~~~~~~~~~~~~gd~s~~~~~~~~~~-~~~~~~~~~~~~~~ 606 (632) .. .-....+-...+ .| +....++.|+|.+..|.+|.+.+++--++...++...|- +-..-+. -+++. T Consensus 227 Ll-a~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~--p~r~r 303 (336) T protein:vir:37 227 LV-SKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRND--EDKKG 303 (336) T ss_pred hh-hhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEc--ccccc Confidence 32 222223333322 22 223467999999999999999999999998877655443 2211111 12233 Q ss_pred EEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 607 LVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 607 ~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) +.-+-..--|..|-++.++|.+.-.. T Consensus 304 ie~y~s~Ne~YvVEd~~~~a~iE~i~ 329 (336) T protein:vir:37 304 LVTSYYRQEGYVVEDLGLMTAIDHTK 329 (336) T ss_pred ccchhhhcceeeeeccccEEEeeeee Confidence 33333333444556666666554444 No 227 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=46.63 E-value=0.73 Score=21.34 Aligned_cols=342 Identities=13% Similarity=0.059 Sum_probs=110.1 Q ss_pred hhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhh Q lcl|Aclame:pro 238 IGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAA 317 (632) Q Consensus 238 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (632) .. .........+.....+. .......... +........+.+. +... T Consensus 1 ~~----------~~~~~e~l~~kw~p~l~-~~~~~~~~~~-~~~~~a~l~enq~----------------------~~~~ 46 (522) T protein:vir:69 1 MT----------TIKTKAQLVDKWKELLE-GEGLPEIANS-KQAIIAKIFENQE----------------------KDFE 46 (522) T ss_pred CC----------ccchHHHHHHhhHHHhc-CCCCCccccc-hhhhhhhhhhhhh----------------------HHhh Confidence 00 00000000000000000 0000000000 0000000000000 0000 Q ss_pred hhhhhhhhhhhHHHHHHHHHHHhhhh----hhhhhhhHHhhhhhccccc-ccccceechhhhhHHHHHHHhhhhhhhhh- Q lcl|Aclame:pro 318 TGDWSKAGFEREVSLAIADASGKEAR----GFYMPHEVLVQRQLEKKTA-GKGGELVATELLSEEFIDILRNKAIIGQM- 391 (632) Q Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~a~~~~~~-~~~~~~i~~~~~~~~i~~~~~~~~~~~~~- 391 (632) .. .........+.++.... ...........+...++.. .+.++.++ . +.+...+.-+...+ T Consensus 47 ~~-------~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~~~P~li-----~-lvrRa~p~LIa~DIw 113 (522) T protein:vir:69 47 VS-------PEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAAGQTSGAVTQIGPAVM-----G-MVRRAIPNLIAFDIC 113 (522) T ss_pred cc-------cccchhHHHHhhhhhhhhhccccccCCCcccccccccccccccccchHH-----H-HHHHHHhhhhhhhce Confidence 00 00000000011110000 0000000000001111111 01111110 0 11111111112222 Q ss_pred cceeeccCceeEEEEE---ec----------------------------------------------------------- Q lcl|Aclame:pro 392 GARMLPGLVGDVDIPK---KT----------------------------------------------------------- 409 (632) Q Consensus 392 ~~~~~~~~~~~~~~~~---~~----------------------------------------------------------- 409 (632) +++++++..+-+.-.+ .+ T Consensus 114 GVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~ 193 (522) T protein:vir:69 114 GVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQA 193 (522) T ss_pred eeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeec Confidence 2222222211100000 00 Q ss_pred --------------------------------CCcccccccc---------CcccccCcccceeeeeeeeeeeeeehhhH Q lcl|Aclame:pro 410 --------------------------------SGANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTR 448 (632) Q Consensus 410 --------------------------------~~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSr 448 (632) +.+...-.+| +..+++-.+.+++++..+++-+=+...|- T Consensus 194 ~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTi 273 (522) T protein:vir:69 194 SAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSI 273 (522) T ss_pred ccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccH Confidence 0000000112 12356666778888888888888889999 Q ss_pred HHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCcc------------ccccceeccccccccccccchhHHHH Q lcl|Aclame:pro 449 KLRKQS----SIHVENLIREDLIEGIGVALDLAMLTGTGLAN------------DPVGLLNMTGVPALTYPAGGVDWASV 512 (632) Q Consensus 449 e~l~d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~------------~~~Gil~~a~~~~~~~~~~~~~~~~i 512 (632) |+..|- .+++++.|.+.|...+...+|+.|+.-.-... ...|++......+..++ -.....+ T Consensus 274 ELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~--rw~~e~~ 351 (522) T protein:vir:69 274 ELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGA--RWAGESF 351 (522) T ss_pred HHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccc--hhHHHHH Confidence 988772 47899999999999999999999874321111 12344433322221111 1111222 Q ss_pred HHHHH-------HHHhhccccccceE-EeehhHHHHHHHHhh--------------cccCCceeeccccccC-cceEEcC Q lcl|Aclame:pro 513 VDMET-------KISTFNADAGRLAY-LTSVTQRGAAKKAQV--------------FDNTGERIWQNNEVNG-YRAEASN 569 (632) Q Consensus 513 ~~~~~-------~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--------------~d~~g~~~~~~~~l~G-~pv~~~~ 569 (632) ..++. .+..+.+. ..+.| ++++.....|...-. .|.++. ++ -+.|.| |+|.+++ T Consensus 352 k~L~~~i~~~an~i~~~T~r-g~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~-~~-~G~l~~~~~vy~D~ 428 (522) T protein:vir:69 352 KALLFQIDKEAVEIARQTGR-GEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKS-VF-AGVLGGKYRVYIDQ 428 (522) T ss_pred HHHHHHHHHHHHHHHHhccc-ccccEEEEchhHHHHHhhcccccccccccccccccccCCCc-eE-EEEecCceEEEecC Confidence 22222 23333332 23344 455555555542110 111111 11 145544 7999999 Q ss_pred CCCCccEEEEehh--hE--EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecc----------------------- Q lcl|Aclame:pro 570 QIPADTWIFGDWS--QI--VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRK----------------------- 622 (632) Q Consensus 570 ~~~~~~~~~gd~s--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~----------------------- 622 (632) +.+.+=+++|--. .+ -+++-=++.+...+..+-.+-+=.+-...|++..+ +| T Consensus 429 y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v-NP~~~~~~~~~~~ri~~g~p~~~~~ 507 (522) T protein:vir:69 429 YAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGV-NPFAESSLQAPGARIQSGMPSILNS 507 (522) T ss_pred CCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCcccceeecccchhhcc Confidence 8887655555110 10 00111111111111112122222333334444332 22 Q ss_pred ---cc-eEEEEecC Q lcl|Aclame:pro 623 ---EA-FCIAKKGA 632 (632) Q Consensus 623 ---~a-~~~~~~~A 632 (632) .+ |.++.++= T Consensus 508 ~~~n~y~r~v~v~~ 521 (522) T protein:vir:69 508 LGKNAYFRRVYVKG 521 (522) T ss_pred cCCcceeeEEEeec Confidence 00 11222222 No 228 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=39.71 E-value=1 Score=20.57 Aligned_cols=343 Identities=10% Similarity=0.007 Sum_probs=107.1 Q ss_pred hhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Q lcl|Aclame:pro 251 AIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREV 330 (632) Q Consensus 251 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (632) ........+.....+.......+.....+........+.+..... ...... T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~-----------------------------~~~~~~ 51 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAE-----------------------------TDPVYR 51 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHh-----------------------------cCcccc Confidence 000000000000000000000000000000000000000000000 000000 Q ss_pred HHHHHHHHHhh----hhhhhhhhhHHhhhhh-cccccccccceechhhhhHHHHHHHhhhhhhhhh-cceeeccCceeE- Q lcl|Aclame:pro 331 SLAIADASGKE----ARGFYMPHEVLVQRQL-EKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDV- 403 (632) Q Consensus 331 ~~~~~~~~~~~----~~~~~~~~~~~~~~a~-~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~- 403 (632) .....+.++.. .............+.. .+.....+.+.++ .. .+...+.-+...+ +++++++..+-+ T Consensus 52 ~~~~~~~~~~~l~ea~~~~~~~~~~~~i~~s~~t~~v~~~~P~Li-----~l-vRra~p~LIa~DIwGVQPMTgPTGLIF 125 (524) T protein:vir:98 52 DEKIVESFGGFLAEAEIAGDHNYDQTNIASGKSSGAITNIGPAVI-----GM-VRRAIPNLIAFDICGVQPMTGPTGQVF 125 (524) T ss_pred chHHHHhhhccccccccccccccccccccccccccccccccchhh-----hH-HHHHHHhhhhhhhheeccCCchhhhhh Confidence 00111111100 0000000000000000 0000111111111 00 0000011111111 112211111000 Q ss_pred ----EEEEe----------------------------------------------------------------------- Q lcl|Aclame:pro 404 ----DIPKK----------------------------------------------------------------------- 408 (632) Q Consensus 404 ----~~~~~----------------------------------------------------------------------- 408 (632) .+... T Consensus 126 AmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~t 205 (524) T protein:vir:98 126 ALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVT 205 (524) T ss_pred hhheeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceeccccccCccccc Confidence 00000 Q ss_pred ----------------------cCCcccccccc---------CcccccCcccceeeeeeeeeeeeeehhhHHHhhcC--- Q lcl|Aclame:pro 409 ----------------------TSGANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--- 454 (632) Q Consensus 409 ----------------------~~~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~--- 454 (632) ...+..--.+| +..+++-.+.+++++..+++-+=+...|-|+..|- T Consensus 206 gt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAV 285 (524) T protein:vir:98 206 GADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAV 285 (524) T ss_pred ccccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHh Confidence 00000000112 22355666777888888888888888999988762 Q ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc------------cccceeccccccccccccchhHHHHHHHH----- Q lcl|Aclame:pro 455 -SIHVENLIREDLIEGIGVALDLAMLTGTGLAND------------PVGLLNMTGVPALTYPAGGVDWASVVDME----- 516 (632) Q Consensus 455 -~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~------------~~Gil~~a~~~~~~~~~~~~~~~~i~~~~----- 516 (632) .+++++.|.+.|...+...+|+.|+.-...... ..|++.......... +-.....+..+. T Consensus 286 HGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~--~r~~~e~~~~L~~~i~~ 363 (524) T protein:vir:98 286 HGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRG--ARWAGESYKALLIQIDK 363 (524) T ss_pred cCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccc--cchhHHHHHHHHHHHHH Confidence 478999999999999999999998743211111 123333222211111 111112222222 Q ss_pred --HHHHhhccccccceEE-eehhHHHHHHH--HhhcccC----------Cceeecccccc-CcceEEcCCCCCccEEEEe Q lcl|Aclame:pro 517 --TKISTFNADAGRLAYL-TSVTQRGAAKK--AQVFDNT----------GERIWQNNEVN-GYRAEASNQIPADTWIFGD 580 (632) Q Consensus 517 --~~~~~~~~~~~~~~~~-~~~~~~~~~~~--~~~~d~~----------g~~~~~~~~l~-G~pv~~~~~~~~~~~~~gd 580 (632) ..+..+.+ ...+.|+ +++.....|.. ..+.+.. ......-+.|. ||+|.++++.+.+=+++|- T Consensus 364 ~an~I~~~T~-rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~ 442 (524) T protein:vir:98 364 EANEIARQTG-RGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGF 442 (524) T ss_pred HHHHHHHhhc-cccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCCCCcceEEEEe Confidence 22333333 2234444 44544444442 1111111 11111115554 4799999988876555541 Q ss_pred hh--hE--EEEEecceEEEEecccccccCcEEEEEEEEeCcEEecccc--------------------------eEEEEe Q lcl|Aclame:pro 581 WS--QI--VIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEA--------------------------FCIAKK 630 (632) Q Consensus 581 ~s--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a--------------------------~~~~~~ 630 (632) -. .+ -+++-=++.+...+..+-.+-+=.+-...|++..+ +|=+ |.++.+ T Consensus 443 KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~V 521 (524) T protein:vir:98 443 KGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI-NPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWV 521 (524) T ss_pred eCCcccccceeeccccccccccccCCccccceeeeeeeeceee-cCcccccCCccccccccCcchHhhcCccceeeEeee Confidence 10 00 00111111111111112222222333334444332 2211 222222 Q ss_pred cC Q lcl|Aclame:pro 631 GA 632 (632) Q Consensus 631 ~A 632 (632) |= T Consensus 522 k~ 523 (524) T protein:vir:98 522 KG 523 (524) T ss_pred cc Confidence 22 No 229 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=39.61 E-value=1 Score=20.56 Aligned_cols=339 Identities=9% Similarity=-0.025 Sum_probs=108.7 Q ss_pred hhhhhHHHHHHHHHHhhhhHhhhhhhhhhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHH Q lcl|Aclame:pro 251 AIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREV 330 (632) Q Consensus 251 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (632) ....+...+..... ++.....+.....+........+.+... . ....... T Consensus 1 ~~~~~~l~~kw~p~-l~~~~~~~i~~~~~~~~~a~llenq~~~----------------------~-------~~~~~~~ 50 (528) T protein:vir:80 1 MKTTKELMEKWSPL-LENEKLPEIATASKQKLVAKILESQEAD----------------------F-------AVDPIYK 50 (528) T ss_pred CcchHHHHHhhhHh-hcCCccchhcchhhhhhhhhhhhhhhHH----------------------h-------hcccccc Confidence 00000000000000 0000000000000000000000000000 0 0000000 Q ss_pred HHHHHHHHHhhhhh----hhhhhhHHhhhhhcccccc-cccceechhhhhHHHHHHHhhhhhhhhh-cceeeccCceeEE Q lcl|Aclame:pro 331 SLAIADASGKEARG----FYMPHEVLVQRQLEKKTAG-KGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGDVD 404 (632) Q Consensus 331 ~~~~~~~~~~~~~~----~~~~~~~~~~~a~~~~~~~-~~~~~i~~~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~ 404 (632) .....+.++..... ..........+...++... .+.+.++ . +.+...+.-+...+ +++++++..+-+. T Consensus 51 ~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~~t~~v~~~~P~Li-----~-lvRra~p~LIa~DIwGVQPMTgPTGLIF 124 (528) T protein:vir:80 51 DEKVVEAFGGFIAEAEVAGDHGYDASQIAAGQTTGAITNVGPAVI-----G-MVRRAIPNLIAFDICGVQPMSTPTSQIF 124 (528) T ss_pred chHHHHhhhhhccccccccccCCccccccccccccccccCCchhh-----h-HHHHHHhhhhhhhhheeccCCchhhhhe Confidence 00011111100000 0000000000001111111 1111111 1 11111111222222 2333332211110 Q ss_pred EEE---e------------------------------------------------------------------------- Q lcl|Aclame:pro 405 IPK---K------------------------------------------------------------------------- 408 (632) Q Consensus 405 ~~~---~------------------------------------------------------------------------- 408 (632) -.+ . T Consensus 125 AMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~ 204 (528) T protein:vir:80 125 AIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAE 204 (528) T ss_pred eeeeeecCCccccccccccccccccccccccccccccccccccccccccccccccccccceecccccccccccccccccc Confidence 000 0 Q ss_pred ---------------------------cCCcccccccc---------CcccccCcccceeeeeeeeeeeeeehhhHHHhh Q lcl|Aclame:pro 409 ---------------------------TSGANFYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRK 452 (632) Q Consensus 409 ---------------------------~~~~~a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~ 452 (632) ...+...-.+| +..+++-.+.+++++..+++-+=+...|-|+.. T Consensus 205 ~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQ 284 (528) T protein:vir:80 205 QVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQ 284 (528) T ss_pred ccCccccCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHH Confidence 00000000112 122556667778888888888888889999887 Q ss_pred cC----hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc------------cccceeccccccccccccchhHHHHHHHH Q lcl|Aclame:pro 453 QS----SIHVENLIREDLIEGIGVALDLAMLTGTGLAND------------PVGLLNMTGVPALTYPAGGVDWASVVDME 516 (632) Q Consensus 453 d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~------------~~Gil~~a~~~~~~~~~~~~~~~~i~~~~ 516 (632) |- .+++++.|.+.|...+...+|+.|+.-...... ..|++.........++ --....+..+. T Consensus 285 DLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~--r~~~e~~k~L~ 362 (528) T protein:vir:80 285 DLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGA--RWAGESFKSLI 362 (528) T ss_pred HHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeecccccccccc--chhHHHHHHHH Confidence 62 478999999999999999999999643211111 1233322221111110 11112222222 Q ss_pred -------HHHHhhccccccceEEeehhHHHHHHHHhh---cccCC-ceeec--------ccccc-CcceEEcCCCCCccE Q lcl|Aclame:pro 517 -------TKISTFNADAGRLAYLTSVTQRGAAKKAQV---FDNTG-ERIWQ--------NNEVN-GYRAEASNQIPADTW 576 (632) Q Consensus 517 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~d~~g-~~~~~--------~~~l~-G~pv~~~~~~~~~~~ 576 (632) ..+..+.+.......++++.....|...-. .+..| ...+. -+.|. ||+|.++++.+.+=+ T Consensus 363 ~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 442 (528) T protein:vir:80 363 YQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYARQDYF 442 (528) T ss_pred HHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCCCCcceE Confidence 233333332222233455555444433211 00111 11111 24454 479999988887655 Q ss_pred EEEehh--h-----EEEEEe-cceEEEEecccccccCcEEEEEEEEeCcEEeccc------------------------- Q lcl|Aclame:pro 577 IFGDWS--Q-----IVIAMW-GVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKE------------------------- 623 (632) Q Consensus 577 ~~gd~s--~-----~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~------------------------- 623 (632) ++|--. . |+--+. ..+....+|.. -+=.+-...|++..+ +|= T Consensus 443 ~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~s----fqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g~~~~~~ag~n 517 (528) T protein:vir:80 443 TVGYKGDNEMDAGIYYAPYVALTPLRATDPQS----FHPVLGFKTRYGIGI-NPFADSKSQAPSARITSGMLSKDSVGKN 517 (528) T ss_pred EEEEeCCcccccceeecccccceeeEeeCCcc----ccceeeeeeeeceee-cCcccccCCcccccccccchhhhhcCcc Confidence 554110 0 100000 11111222221 222233333444332 220 Q ss_pred -ceEEEEecC Q lcl|Aclame:pro 624 -AFCIAKKGA 632 (632) Q Consensus 624 -a~~~~~~~A 632 (632) =|.++.+|= T Consensus 518 ~~~r~~~Vk~ 527 (528) T protein:vir:80 518 AYFRRVWVKG 527 (528) T ss_pred ceeEEeeecc Confidence 012222222 No 230 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=38.66 E-value=1.1 Score=20.45 Aligned_cols=284 Identities=9% Similarity=0.071 Sum_probs=108.8 Q ss_pred hhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHhhhhhhhhhcceeeccCce Q lcl|Aclame:pro 322 SKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG 401 (632) Q Consensus 322 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 401 (632) .......++.++..+.+................+.+...+-.+..+.. +..+..+.....-. T Consensus 1 ~~~~~~~~~~~a~~~al~~a~~~g~AlR~EsLd~~l~~lt~~~~~ftf------------------~~~i~k~~a~STV~ 62 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNAAGQVAESLEREDLEPEVTQLNVLDTPLTD------------------LLSKNAVKAKAYEH 62 (470) T ss_pred CChhHhhhhhHHHHHHHHHhhhcchhhhhhhhccceeEeeecCccchh------------------hhhcCCchhhhHhh Confidence 000011111222222211111111001111111222222211111111 11111111111111 Q ss_pred eEEE-EEecCCccccccccCcccccCcccceeeeeeeeeeeeeehhhHHH---hhcChhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 402 DVDI-PKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKL---RKQSSIHVENLIREDLIEGIGVALDLA 477 (632) Q Consensus 402 ~~~~-~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~---l~d~~~~~~~~i~~~l~~a~a~~~~~~ 477 (632) .+.. .-..+........|++-.+.+++.+...+..++-++....+|..+ +.|...+......+.--..++++++.+ T Consensus 63 ey~~~~~rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a 142 (470) T protein:vir:10 63 EYNVVTARHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYL 142 (470) T ss_pred hhhhhccccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhh Confidence 1111 111122222345799988889999999999999999999999775 455566888888888889999999999 Q ss_pred HhhcCCC----------ccccccceeccc----cccccccccchhHHHHHHHHHHHHhhcccccc-ceEEeehhHHHHHH Q lcl|Aclame:pro 478 MLTGTGL----------ANDPVGLLNMTG----VPALTYPAGGVDWASVVDMETKISTFNADAGR-LAYLTSVTQRGAAK 542 (632) Q Consensus 478 ~~~g~g~----------~~~~~Gil~~a~----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 542 (632) +++|+.. +-+.+|+.+.-+ -+.+.+-+..++.+.|..+-..+... .+... ....|+.-....+ T Consensus 143 ~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~-~~fGt~TD~~lp~~vka~f- 220 (470) T protein:vir:10 143 AFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVST-QAFANPTAVFISYVDKLNL- 220 (470) T ss_pred hhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhccc-ccccChhhhccchhHHHHH- Confidence 9988531 113456533221 12334555667777777766666421 11222 2233433333222 Q ss_pred HHhhcccCCceeeccc---cccCcce--EEcCC--CC-CccEEEEehhhEE-EEEec--------ceEEEEecccccccC Q lcl|Aclame:pro 543 KAQVFDNTGERIWQNN---EVNGYRA--EASNQ--IP-ADTWIFGDWSQIV-IAMWG--------VLDLKVDPYTKAASD 605 (632) Q Consensus 543 ~~~~~d~~g~~~~~~~---~l~G~pv--~~~~~--~~-~~~~~~gd~s~~~-~~~~~--------~~~~~~~~~~~~~~~ 605 (632) ..--...-|.+..++ -..|++| .++.. +. .+..+..++.... ...+. .+...++... + T Consensus 221 -~~~~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~tv~~t~----~ 295 (470) T protein:vir:10 221 -QASFYQISRVMTTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDFAAPSNSWTVSTTD----N 295 (470) T ss_pred -HHhhcCceEEEEecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCcccCceeEEeecCC----C Confidence 221222223333222 1234332 11110 00 0111111111100 00000 0111111111 1 Q ss_pred cEEEEEEEEeCcEEecccc-----eEEEEecC Q lcl|Aclame:pro 606 GLVLRVFQDVDAGVRRKEA-----FCIAKKGA 632 (632) Q Consensus 606 ~~~~~~~~r~~~~v~~~~a-----~~~~~~~A 632 (632) .+..- . .-+.+...++. ++.+...+ T Consensus 296 ~~a~~-~-~sk~g~~~~~~v~sy~y~v~~~~g 325 (470) T protein:vir:10 296 FVTLP-Y-NSGLGDPANTTVYSYAFKAANFYG 325 (470) T ss_pred ceeec-c-cCCCCcccCcceeEEEEEEEEecC Confidence 00000 0 00111111111 12222222 No 231 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=36.72 E-value=1.2 Score=20.24 Aligned_cols=256 Identities=11% Similarity=0.017 Sum_probs=106.8 Q ss_pred ccccceechhhhhHHHHHHHhhhhhhhhhcceeec-cCceeEEEEEecCCccccccccCcccccCcccceeeeeeeeeee Q lcl|Aclame:pro 363 GKGGELVATELLSEEFIDILRNKAIIGQMGARMLP-GLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIA 441 (632) Q Consensus 363 ~~~~~~i~~~~~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~ 441 (632) .+- ...+.....+.+.+........+...... .+..++++++.+..+-..+ .-++.+..++++.+..++.+ +.- T Consensus 1 Mai---n~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY-~R~~g~~~g~v~~~~et~tl-~qd 75 (290) T protein:vir:78 1 MAI---NYVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAH-TRNKGYNEGSASNTNKSYTI-DFD 75 (290) T ss_pred Cch---hHHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCccccc-ccCCCcccCccccceeeEEe-ecc Confidence 000 00112233333333333333333222121 2445788888875443322 22334555555544444432 222 Q ss_pred eeehhhHHHhhcC----hhHHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccceeccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 442 GAVPVTRKLRKQS----SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET 517 (632) Q Consensus 442 ~~~~iSre~l~d~----~~~~~~~i~~~l~~a~a~~~~~~~~~g~g~~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~ 517 (632) +.+.++=.-+.-+ .+.+...+.+.....++-.+|...+.-.-+.....+ ..... .....--++.|.++.. T Consensus 76 R~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-----~~~~~-t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 76 RDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-----NSVAE-EITKDNVFTKLKAAIR 149 (290) T ss_pred ccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-----ccccc-ccCHHHHHHHHHHHHH Confidence 3333322211111 123344455555555666666554432211100000 00011 1112234567777777 Q ss_pred HHHhhccccccceEEeehhHHHHHHHHh--hc--c----cCCceeeccccccCcceEEcCCC---C------Cc------ Q lcl|Aclame:pro 518 KISTFNADAGRLAYLTSVTQRGAAKKAQ--VF--D----NTGERIWQNNEVNGYRAEASNQI---P------AD------ 574 (632) Q Consensus 518 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--d----~~g~~~~~~~~l~G~pv~~~~~~---~------~~------ 574 (632) +|... +......++.|.....+.... .+ + ..|..--..+.|.|.+|+..+.. - ++ T Consensus 150 ~ldev--p~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ 227 (290) T protein:vir:78 150 KVKKY--GTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAG 227 (290) T ss_pred HHHhc--CCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCC Confidence 88653 344444555565555543221 11 1 11111112356899988765421 1 01 Q ss_pred ----cEEEEehhhEE-EEEecceEEEEecccccccCcEEEEEEEEeCcEEecccceEEEEecC Q lcl|Aclame:pro 575 ----TWIFGDWSQIV-IAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) Q Consensus 575 ----~~~~gd~s~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~A 632 (632) .+++...+... +.-..-+.+ ..|...-.-+...|.-+.++|.=|.+.+.=.+..-.| T Consensus 228 ak~in~ii~~~~a~i~~~K~~~~~~-~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~ 289 (290) T protein:vir:78 228 AKKLNFLLVNKGSVVGGAKHASIYL-HAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTE 289 (290) T ss_pred ccceeEEEEcCCceeeeeeeeEEEe-eCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEee Confidence 13333333221 111222222 2355544445567777888888887776433322233 No 232 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=33.70 E-value=1.3 Score=19.89 Aligned_cols=291 Identities=10% Similarity=0.016 Sum_probs=108.4 Q ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhhhhhhHHhhhhhcccccc-----cccceechhhhhHHHHHHHhh---hhhh Q lcl|Aclame:pro 317 ATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAG-----KGGELVATELLSEEFIDILRN---KAII 388 (632) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~-----~~~~~i~~~~~~~~i~~~~~~---~~~~ 388 (632) .......+.......+++. +++.++... .++..+-.+-+.+.+..+... ...+ T Consensus 1 ~~~~~n~~~~~~~~~e~~~-------------------Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~ 61 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEVI-------------------KGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFY 61 (464) T ss_pred CCcchhhHhhcCcccHHHH-------------------HHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhh Confidence 0001111111111111111 122111111 111222222222222222111 1222 Q ss_pred hhhcceeeccCceeEE-EEEecCCccccccccCcccccCcccceeeeeeeeeeeee--ehhhHHHhhcChhHHHHHHHHH Q lcl|Aclame:pro 389 GQMGARMLPGLVGDVD-IPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGA--VPVTRKLRKQSSIHVENLIRED 465 (632) Q Consensus 389 ~~~~~~~~~~~~~~~~-~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~t~~~~--~~iSre~l~d~~~~~~~~i~~~ 465 (632) ..+..+.....-..+. +...+..+...++.|++..+.+++.+...+..++-+... +.|--. |.|+..+......+. T Consensus 62 ~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~-lvn~~~d~~~~~~~d 140 (464) T protein:vir:80 62 RDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATG-LVNNIEDPMRILTDD 140 (464) T ss_pred hhcCCchhhhhhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehh-hhcchhhHHHHHHHH Confidence 3322222222222222 223344566889999999999999999988888755443 333333 345566777777778 Q ss_pred HHHHHHHHHHHHHhhcCCCcc---------ccccceecc-ccccccccccchhHHHHHHHHHHHHhhccccccceEEeeh Q lcl|Aclame:pro 466 LIEGIGVALDLAMLTGTGLAN---------DPVGLLNMT-GVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSV 535 (632) Q Consensus 466 l~~a~a~~~~~~~~~g~g~~~---------~~~Gil~~a-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 535 (632) -...++++++.+.++|+..-. +.+||...- ..+.+.+-+..++.+.|-.+-..+...+... ....|+. T Consensus 141 ai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~--TD~~lp~ 218 (464) T protein:vir:80 141 AISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTP--TDAYMPI 218 (464) T ss_pred HHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCCh--hhcccch Confidence 888889999999998864321 233433221 2233455666777777776666665444322 2223333 Q ss_pred hHHHHHHHHhhcccCCceeec-c---ccccCcce--EEcCCCC---CccEEEEehhhEEE------EEecc--eEEEEec Q lcl|Aclame:pro 536 TQRGAAKKAQVFDNTGERIWQ-N---NEVNGYRA--EASNQIP---ADTWIFGDWSQIVI------AMWGV--LDLKVDP 598 (632) Q Consensus 536 ~~~~~~~~~~~~d~~g~~~~~-~---~~l~G~pv--~~~~~~~---~~~~~~gd~s~~~~------~~~~~--~~~~~~~ 598 (632) -..... ....-+ .++.+. + +...|++| +++..-. .++.+..++..+.- ..... +....++ T Consensus 219 ~v~a~f-~n~~l~--~q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~ 295 (464) T protein:vir:80 219 GVQADF-VNQQLD--RQVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEA 295 (464) T ss_pred hHHHHH-HhhhcC--ceeEEEcCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecC Confidence 332111 111111 222211 1 11223322 1111000 01111111111000 00000 1111111 Q ss_pred ccccccCcEEEEEEEEeCcEEecccceEEE-E-ecC Q lcl|Aclame:pro 599 YTKAASDGLVLRVFQDVDAGVRRKEAFCIA-K-KGA 632 (632) Q Consensus 599 ~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~-~-~~A 632 (632) ...-.-+.-...+...+-+.+++..+=... + .-| T Consensus 296 ~~~g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ 331 (464) T protein:vir:80 296 GTKGKFRDEDLTIDTEYKVVVVSDDAESAPSDVASV 331 (464) T ss_pred CcccCCccccccceeEEEEEEECCCCccccceeeee Confidence 111000111111111222222222210000 0 000 No 233 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=31.17 E-value=1.5 Score=19.59 Aligned_cols=332 Identities=9% Similarity=0.027 Sum_probs=106.1 Q ss_pred HHHhhhhHhhhhhhh---------hhhhhhhHHHHhhhhhhhhhhhHHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Q lcl|Aclame:pro 263 ALVLERMNPGQPGNF---------EKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLA 333 (632) Q Consensus 263 ~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (632) -.+.+.+........ .+........+.+.... ......+. ..+.+. T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~---------------------~~~~~~~~----~~~~~~ 55 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDI---------------------NNDPMYRD----PQLVEA 55 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHH---------------------hcCCcccc----hhhhhh Confidence 001111110000000 00000000000000000 00000000 000000 Q ss_pred HHHHHHhhhhhhhhhhhHHhhhhhcccccccccceechhhhhHHHHHHHh---hhhhhhhh-cceeeccCceeEEEEE-- Q lcl|Aclame:pro 334 IADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILR---NKAIIGQM-GARMLPGLVGDVDIPK-- 407 (632) Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~i~~~~~~~~i~~~~~---~~~~~~~~-~~~~~~~~~~~~~~~~-- 407 (632) ....+.+......... .......+..++...- ....++.+.| +.-+...+ +++++++..+-+.-.+ T Consensus 56 ~~~~l~e~~~~~~~~~----~~~~ia~s~~t~~v~~----~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsr 127 (514) T protein:vir:56 56 FNAGLNEAVVNGDHGY----DPANIAQGVTTGAVTN----IGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSV 127 (514) T ss_pred hhcccccccccccccc----cccccccccccccccc----cchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeee Confidence 0000000000000000 0000000011111100 0111111111 11112222 2222222111100000 Q ss_pred -ecC---Ccc---------------------------------------------------------------------- Q lcl|Aclame:pro 408 -KTS---GAN---------------------------------------------------------------------- 413 (632) Q Consensus 408 -~~~---~~~---------------------------------------------------------------------- 413 (632) ... ... T Consensus 128 Y~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~ 207 (514) T protein:vir:56 128 YGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATE 207 (514) T ss_pred ecCCCcccccccccccccCcCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 000 000 Q ss_pred -----------------cccccc---------CcccccCcccceeeeeeeeeeeeeehhhHHHhhcC----hhHHHHHHH Q lcl|Aclame:pro 414 -----------------FYWIGE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS----SIHVENLIR 463 (632) Q Consensus 414 -----------------a~~v~E---------~~~~~~~~~~~~~~~~~~~t~~~~~~iSre~l~d~----~~~~~~~i~ 463 (632) ..-.+| +..+++-.+.+++++..+++-+=+...|-|+..|- .+++++.|. T Consensus 208 ~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELs 287 (514) T protein:vir:56 208 YTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELS 287 (514) T ss_pred cccccccchhhhhhhhhhhhhhhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHH Confidence 000111 12255556777788888888888888999988772 478999999 Q ss_pred HHHHHHHHHHHHHHHhhc---CCC--------ccccccceeccccccccccccchhHHHHHHHHH-------HHHhhccc Q lcl|Aclame:pro 464 EDLIEGIGVALDLAMLTG---TGL--------ANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-------KISTFNAD 525 (632) Q Consensus 464 ~~l~~a~a~~~~~~~~~g---~g~--------~~~~~Gil~~a~~~~~~~~~~~~~~~~i~~~~~-------~~~~~~~~ 525 (632) +.|...+...+|+.|+.- .-+ +....|++......... ++-.....+..+.. .+..+.+. T Consensus 288 NILSTEImlEINReii~~l~~~atv~~~~~~~~~~~~G~~d~~~~~d~~--~~~~~~e~~~~l~~~i~~~an~i~~~T~r 365 (514) T protein:vir:56 288 GILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVK--GARWAGEAYKALLIQIEKEANEIGRQTGR 365 (514) T ss_pred HHHHHHHHHHhhHHHHHHHHhheeehhcccccccccccccccccccccc--cchHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 999999999999998522 111 11223443332211111 11112222333222 22222232 Q ss_pred cccceE-EeehhHHHHHHHHhhcc---cC-----------Cceeecccccc-CcceEEcCCCCCccEEEEehh--hE--E Q lcl|Aclame:pro 526 AGRLAY-LTSVTQRGAAKKAQVFD---NT-----------GERIWQNNEVN-GYRAEASNQIPADTWIFGDWS--QI--V 585 (632) Q Consensus 526 ~~~~~~-~~~~~~~~~~~~~~~~d---~~-----------g~~~~~~~~l~-G~pv~~~~~~~~~~~~~gd~s--~~--~ 585 (632) ....| ++++.....|...-.-+ .. ...++ -+.|. ||+|.++++.+.+=+++|--. .+ - T Consensus 366 -g~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~-aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~g 443 (514) T protein:vir:56 366 -GNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVF-AGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAG 443 (514) T ss_pred -ccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceE-EEEecCceEEEecCCCCcceEEEEEecCcceecc Confidence 23344 45555555444211110 00 11111 14554 479999998886555544110 00 0 Q ss_pred EEEecceEEEEecccccccCcEEEEEEEEeCcEEeccc------------------------ceEEEEecC Q lcl|Aclame:pro 586 IAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKE------------------------AFCIAKKGA 632 (632) Q Consensus 586 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~------------------------a~~~~~~~A 632 (632) +++-=++.+......+-.+-+=.+-...|++..+ +|= -|.+++++= T Consensus 444 lfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~-NPy~~~~~~~~~~~~~~~~~a~~~~n~y~r~v~v~~ 513 (514) T protein:vir:56 444 VFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV-NPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKG 513 (514) T ss_pred eeeccccccccccccCCccccceeeeeeeeceee-CCCCCccccccccCCcchhhhcccccceeeeEEEec Confidence 0010111111000011112222233333444332 220 111222222 Done!