Query lcl|Aclame:protein:vir:97397|NCBI_annot:major capsid protein|genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Match_columns 517 No_of_seqs 263 out of 1149 Neff 10.0 Searched_HMMs 1612 Date Mon Dec 2 12:53:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_34 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_34_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97397 Length: 517 100.0 3E-130 2E-133 731.1 40.8 517 1-517 1-517 (517) 2 protein:vir:4074 Length: 480 # 100.0 9E-87 5.6E-90 492.3 30.7 464 5-517 1-479 (480) 3 protein:vir:93616 Length: 645 100.0 3.8E-82 2.3E-85 467.0 32.6 501 1-517 12-645 (645) 4 protein:vir:96762 Length: 632 100.0 1.2E-66 7.5E-70 382.0 32.3 500 1-513 28-632 (632) 5 protein:vir:485 Length: 407 # 100.0 1.7E-50 1.1E-53 293.4 26.3 376 127-517 1-407 (407) 6 protein:vir:4456 Length: 401 # 100.0 1.3E-49 8.2E-53 288.6 26.5 372 127-514 1-401 (401) 7 protein:vir:100135 Length: 418 100.0 2.3E-49 1.5E-52 287.2 27.2 396 106-517 1-418 (418) 8 protein:vir:100247 Length: 425 100.0 1.5E-48 9.1E-52 282.8 25.3 394 105-515 1-425 (425) 9 protein:vir:6242 Length: 390 # 100.0 2.1E-47 1.3E-50 276.5 25.6 369 125-515 1-390 (390) 10 protein:vir:4700 Length: 415 # 100.0 2.7E-47 1.7E-50 275.9 25.9 373 130-517 1-407 (415) 11 protein:vir:4600 Length: 415 # 100.0 2.7E-47 1.7E-50 275.9 25.9 373 130-517 1-407 (415) 12 protein:vir:4511 Length: 409 # 100.0 2.2E-47 1.4E-50 276.4 24.7 373 124-517 1-409 (409) 13 protein:vir:6212 Length: 434 # 100.0 5.1E-47 3.1E-50 274.4 25.6 376 131-517 1-434 (434) 14 protein:vir:1328 Length: 392 # 100.0 4.8E-47 3E-50 274.5 25.3 372 125-515 1-392 (392) 15 protein:vir:95376 Length: 425 100.0 1.6E-46 9.6E-50 271.7 26.9 390 112-517 1-424 (425) 16 protein:vir:4339 Length: 395 # 100.0 1.5E-46 9.5E-50 271.8 26.0 369 137-514 1-395 (395) 17 protein:vir:1025 Length: 408 # 100.0 8.8E-47 5.5E-50 273.1 24.5 371 124-517 1-396 (408) 18 protein:vir:98339 Length: 415 100.0 2.4E-46 1.5E-49 270.7 26.1 372 127-517 1-407 (415) 19 protein:vir:81100 Length: 415 100.0 2.4E-46 1.5E-49 270.7 26.1 372 127-517 1-407 (415) 20 protein:vir:79987 Length: 415 100.0 2.4E-46 1.5E-49 270.7 26.1 372 127-517 1-407 (415) 21 protein:vir:1268 Length: 397 # 100.0 5.3E-46 3.3E-49 268.8 25.9 368 122-514 1-397 (397) 22 protein:vir:7855 Length: 497 # 100.0 6.9E-46 4.3E-49 268.2 26.4 389 122-517 1-496 (497) 23 protein:vir:101650 Length: 497 100.0 6.9E-46 4.3E-49 268.2 26.4 389 122-517 1-496 (497) 24 protein:vir:4953 Length: 397 # 100.0 4.8E-46 2.9E-49 269.1 25.4 365 127-517 1-388 (397) 25 protein:vir:9410 Length: 415 # 100.0 7.4E-46 4.6E-49 268.0 26.4 372 130-517 1-407 (415) 26 protein:vir:102119 Length: 404 100.0 5E-46 3.1E-49 268.9 25.4 372 133-517 1-404 (404) 27 protein:vir:7409 Length: 408 # 100.0 4.6E-46 2.9E-49 269.1 24.9 371 124-517 1-396 (408) 28 protein:vir:191 Length: 385 # 100.0 8.8E-46 5.4E-49 267.6 25.9 372 127-515 1-385 (385) 29 protein:vir:1886 Length: 385 # 100.0 8.8E-46 5.4E-49 267.6 25.9 372 127-515 1-385 (385) 30 protein:vir:104256 Length: 458 100.0 3.3E-46 2E-49 269.9 23.0 405 96-514 1-458 (458) 31 protein:vir:10364 Length: 390 100.0 1.1E-45 6.8E-49 267.1 25.8 369 130-512 1-390 (390) 32 protein:vir:81070 Length: 390 100.0 1.1E-45 7E-49 267.0 25.6 369 130-512 1-390 (390) 33 protein:vir:97053 Length: 390 100.0 1.5E-45 9.5E-49 266.3 25.3 369 130-512 1-390 (390) 34 protein:vir:81227 Length: 413 100.0 2.5E-45 1.6E-48 265.1 26.4 384 113-517 1-413 (413) 35 protein:vir:4997 Length: 397 # 100.0 2E-45 1.2E-48 265.6 25.7 365 127-517 1-388 (397) 36 protein:vir:1433 Length: 435 # 100.0 2.1E-45 1.3E-48 265.5 24.9 371 128-516 1-435 (435) 37 protein:vir:100172 Length: 394 100.0 4E-45 2.5E-48 264.0 24.8 361 127-517 1-387 (394) 38 protein:vir:101607 Length: 379 100.0 9.4E-45 5.8E-48 262.0 26.0 366 112-514 1-379 (379) 39 protein:vir:3991 Length: 404 # 100.0 9.5E-45 5.9E-48 261.9 25.3 367 131-517 1-403 (404) 40 protein:vir:4830 Length: 397 # 100.0 2.2E-44 1.4E-47 259.9 26.6 365 127-517 1-388 (397) 41 protein:vir:80376 Length: 435 100.0 4.5E-44 2.8E-47 258.2 26.8 369 128-516 1-435 (435) 42 protein:vir:105038 Length: 428 100.0 2E-44 1.2E-47 260.2 24.5 368 127-514 1-428 (428) 43 protein:vir:105004 Length: 392 100.0 3.4E-44 2.1E-47 258.9 25.3 352 133-517 1-392 (392) 44 protein:vir:102873 Length: 392 100.0 3.4E-44 2.1E-47 258.9 25.3 352 133-517 1-392 (392) 45 protein:vir:107593 Length: 392 100.0 3.4E-44 2.1E-47 258.9 25.3 352 133-517 1-392 (392) 46 protein:vir:102082 Length: 392 100.0 3.4E-44 2.1E-47 258.9 25.3 352 133-517 1-392 (392) 47 protein:vir:9704 Length: 394 # 100.0 8.2E-44 5.1E-47 256.8 27.2 363 125-517 1-393 (394) 48 protein:vir:81160 Length: 371 100.0 2.6E-44 1.6E-47 259.6 23.5 342 133-514 1-371 (371) 49 protein:vir:100884 Length: 389 100.0 6.9E-44 4.3E-47 257.2 24.7 362 130-517 1-385 (389) 50 protein:vir:3845 Length: 395 # 100.0 9.5E-44 5.9E-47 256.4 25.4 361 124-517 1-386 (395) 51 protein:vir:3870 Length: 400 # 100.0 2.3E-43 1.4E-46 254.3 27.1 366 124-515 1-400 (400) 52 protein:vir:962 Length: 397 # 100.0 2.5E-42 1.5E-45 248.7 27.2 359 131-514 1-397 (397) 53 protein:vir:1084 Length: 437 # 100.0 1E-41 6.2E-45 245.3 29.2 387 112-517 1-430 (437) 54 protein:vir:98635 Length: 377 100.0 1.5E-43 9.6E-47 255.3 18.4 347 131-514 1-377 (377) 55 protein:vir:93881 Length: 387 100.0 1.5E-42 9.4E-46 249.9 23.7 368 127-517 1-384 (387) 56 protein:vir:94424 Length: 387 100.0 1E-42 6.5E-46 250.7 22.4 370 127-517 1-384 (387) 57 protein:vir:96978 Length: 387 100.0 1E-42 6.5E-46 250.7 22.4 370 127-517 1-384 (387) 58 protein:vir:2685 Length: 387 # 100.0 1E-42 6.5E-46 250.7 22.4 370 127-517 1-384 (387) 59 protein:vir:9361 Length: 402 # 100.0 1.4E-42 8.4E-46 250.1 21.7 368 130-517 1-399 (402) 60 protein:vir:94673 Length: 419 100.0 8.9E-42 5.5E-45 245.6 24.8 379 124-516 1-419 (419) 61 protein:vir:8102 Length: 543 # 100.0 4.2E-41 2.6E-44 242.0 25.2 420 85-515 1-543 (543) 62 protein:vir:78640 Length: 352 100.0 4.2E-42 2.6E-45 247.4 19.6 341 141-517 1-349 (352) 63 protein:vir:8420 Length: 477 # 100.0 4.9E-41 3E-44 241.6 25.0 381 130-517 1-474 (477) 64 protein:vir:4092 Length: 390 # 100.0 3.9E-41 2.4E-44 242.1 23.3 345 130-517 1-370 (390) 65 protein:vir:1383 Length: 421 # 100.0 1.1E-40 6.8E-44 239.7 24.1 367 124-517 1-386 (421) 66 protein:vir:7771 Length: 330 # 100.0 1E-41 6.3E-45 245.3 17.0 282 229-517 1-326 (330) 67 protein:vir:41 Length: 299 # N 100.0 9.5E-42 5.9E-45 245.5 16.3 275 229-515 1-299 (299) 68 protein:vir:93858 Length: 400 100.0 1.3E-40 7.9E-44 239.3 19.7 370 128-512 1-400 (400) 69 protein:vir:8187 Length: 311 # 100.0 5.5E-41 3.4E-44 241.3 17.3 273 241-516 1-311 (311) 70 protein:vir:95963 Length: 395 100.0 3.9E-40 2.4E-43 236.6 21.7 351 131-517 1-379 (395) 71 protein:vir:100632 Length: 381 100.0 2.7E-40 1.7E-43 237.5 20.3 336 131-517 1-376 (381) 72 protein:vir:80128 Length: 466 100.0 2E-39 1.2E-42 232.7 25.1 395 112-517 1-451 (466) 73 protein:vir:9574 Length: 300 # 100.0 6E-41 3.8E-44 241.1 16.5 273 237-517 1-300 (300) 74 protein:vir:97148 Length: 324 100.0 1.3E-40 7.9E-44 239.3 18.1 296 209-517 1-322 (324) 75 protein:vir:94142 Length: 304 100.0 5.6E-41 3.5E-44 241.2 15.9 273 229-513 1-304 (304) 76 protein:vir:105905 Length: 304 100.0 5.6E-41 3.5E-44 241.2 15.9 273 229-513 1-304 (304) 77 protein:vir:96392 Length: 324 100.0 1.2E-40 7.2E-44 239.5 17.4 296 191-517 1-322 (324) 78 protein:vir:78830 Length: 324 100.0 1.2E-40 7.2E-44 239.5 17.4 296 191-517 1-322 (324) 79 protein:vir:2344 Length: 397 # 100.0 2.6E-40 1.6E-43 237.6 17.9 281 227-517 1-320 (397) 80 protein:vir:5739 Length: 366 # 100.0 1.5E-40 9.5E-44 238.9 16.4 330 141-517 1-365 (366) 81 protein:vir:9643 Length: 377 # 100.0 2E-39 1.2E-42 232.7 22.0 336 131-514 1-377 (377) 82 protein:vir:2430 Length: 318 # 100.0 3.1E-40 1.9E-43 237.2 16.7 284 225-517 1-314 (318) 83 protein:vir:101291 Length: 381 100.0 1.6E-39 1E-42 233.2 20.5 338 131-517 1-376 (381) 84 protein:vir:9509 Length: 381 # 100.0 1.6E-39 1E-42 233.2 20.5 338 131-517 1-376 (381) 85 protein:vir:9759 Length: 303 # 100.0 3.5E-40 2.2E-43 236.9 16.8 272 241-514 1-303 (303) 86 protein:vir:1638 Length: 298 # 100.0 4.6E-40 2.9E-43 236.2 16.9 269 241-513 1-298 (298) 87 protein:vir:4856 Length: 293 # 100.0 4.8E-40 3E-43 236.1 17.0 262 237-517 1-284 (293) 88 protein:vir:9309 Length: 324 # 100.0 8.2E-40 5.1E-43 234.9 17.7 294 190-517 1-316 (324) 89 protein:vir:95763 Length: 297 100.0 5.9E-40 3.6E-43 235.7 16.1 274 225-515 1-297 (297) 90 protein:vir:103955 Length: 324 100.0 1.2E-39 7.2E-43 234.0 17.5 296 206-517 1-322 (324) 91 protein:vir:80684 Length: 315 100.0 1.5E-39 9.4E-43 233.4 17.3 277 237-517 1-314 (315) 92 protein:vir:99749 Length: 324 100.0 2.9E-39 1.8E-42 231.9 17.7 296 206-517 1-322 (324) 93 protein:vir:96223 Length: 324 100.0 2.6E-39 1.6E-42 232.1 17.2 294 206-517 1-316 (324) 94 protein:vir:94771 Length: 298 100.0 3.8E-39 2.3E-42 231.2 17.4 269 241-513 1-298 (298) 95 protein:vir:78350 Length: 383 100.0 8.4E-39 5.2E-42 229.3 18.9 355 131-517 1-383 (383) 96 protein:vir:104085 Length: 320 100.0 4.6E-39 2.9E-42 230.8 17.3 285 225-516 1-320 (320) 97 protein:vir:4226 Length: 326 # 100.0 1.8E-39 1.1E-42 233.0 14.4 289 209-517 1-324 (326) 98 protein:vir:2504 Length: 305 # 100.0 7.2E-39 4.4E-42 229.7 15.9 267 239-517 1-303 (305) 99 protein:vir:78223 Length: 333 100.0 3.4E-38 2.1E-41 226.0 17.5 282 219-514 1-333 (333) 100 protein:vir:78523 Length: 338 100.0 4.6E-38 2.8E-41 225.3 17.2 288 216-516 1-338 (338) 101 protein:vir:99920 Length: 311 100.0 2.2E-38 1.4E-41 227.0 14.5 272 238-516 1-311 (311) 102 protein:vir:4159 Length: 315 # 100.0 4.1E-33 2.6E-36 198.1 15.6 288 219-513 1-315 (315) 103 protein:vir:4197 Length: 314 # 100.0 1.3E-31 8.4E-35 189.8 15.2 288 222-517 1-314 (314) 104 protein:vir:3158 Length: 321 # 99.9 1.4E-24 8.4E-28 151.4 16.7 291 209-517 1-315 (321) 105 protein:vir:9820 Length: 272 # 99.8 7.6E-21 4.7E-24 130.9 15.3 260 237-517 1-272 (272) 106 protein:vir:3033 Length: 272 # 99.8 7.6E-21 4.7E-24 130.9 15.3 260 237-517 1-272 (272) 107 protein:vir:8324 Length: 410 # 99.6 3E-19 1.9E-22 122.1 7.4 378 93-517 1-409 (410) 108 protein:vir:93966 Length: 400 99.6 2.2E-17 1.4E-20 111.9 15.8 359 128-512 1-400 (400) 109 protein:vir:1663 Length: 393 # 99.5 3E-16 1.8E-19 105.7 16.2 357 124-512 1-393 (393) 110 protein:vir:94933 Length: 330 99.4 3.6E-15 2.3E-18 99.8 15.0 297 197-515 1-330 (330) 111 protein:vir:861 Length: 318 # 99.4 7.1E-15 4.4E-18 98.2 12.9 289 211-512 1-318 (318) 112 protein:vir:3613 Length: 272 # 99.3 1.1E-13 6.5E-17 91.7 14.5 257 237-514 1-272 (272) 113 protein:vir:96833 Length: 275 99.3 3.4E-13 2.1E-16 89.0 15.5 260 236-517 1-274 (275) 114 protein:vir:79928 Length: 393 99.2 2E-12 1.2E-15 84.8 15.6 342 137-517 1-384 (393) 115 protein:vir:93742 Length: 274 99.2 2.6E-12 1.6E-15 84.1 15.6 259 237-517 1-273 (274) 116 protein:vir:105334 Length: 276 99.2 2.9E-12 1.8E-15 83.8 15.7 258 237-516 1-276 (276) 117 protein:vir:80930 Length: 278 99.2 3.5E-12 2.2E-15 83.4 15.7 257 237-517 1-277 (278) 118 protein:vir:95107 Length: 270 99.1 1.1E-11 7.1E-15 80.6 15.2 258 241-517 1-268 (270) 119 protein:vir:96123 Length: 274 99.0 2.1E-11 1.3E-14 79.1 14.8 258 225-516 1-274 (274) 120 protein:vir:97255 Length: 310 99.0 4.7E-11 2.9E-14 77.2 14.7 274 225-514 1-310 (310) 121 protein:vir:94870 Length: 318 99.0 2.3E-11 1.4E-14 78.9 12.8 292 205-512 1-318 (318) 122 protein:vir:97433 Length: 274 98.9 1.7E-10 1.1E-13 74.1 15.2 259 237-517 1-273 (274) 123 protein:vir:94494 Length: 274 98.9 1.7E-10 1.1E-13 74.1 15.2 259 237-517 1-273 (274) 124 protein:vir:96262 Length: 274 98.8 7.9E-10 4.9E-13 70.5 14.6 256 237-517 1-270 (274) 125 protein:vir:95898 Length: 274 98.8 7.9E-10 4.9E-13 70.5 14.6 256 237-517 1-270 (274) 126 protein:vir:1239 Length: 274 # 98.7 1.1E-09 7.1E-13 69.6 14.8 259 225-517 1-273 (274) 127 protein:vir:739 Length: 231 # 98.7 3.8E-10 2.4E-13 72.2 10.3 219 275-514 1-231 (231) 128 protein:vir:94576 Length: 347 97.9 5.1E-06 3.2E-09 49.6 16.6 282 222-514 1-347 (347) 129 protein:vir:94622 Length: 341 97.8 3.1E-06 1.9E-09 50.8 14.4 279 231-515 1-341 (341) 130 protein:vir:108211 Length: 318 97.8 1.2E-06 7.4E-10 53.1 12.1 267 225-513 1-318 (318) 131 protein:vir:95318 Length: 328 97.7 3E-06 1.9E-09 50.9 13.0 228 225-463 1-328 (328) 132 protein:vir:103759 Length: 330 97.7 2.9E-06 1.8E-09 50.9 12.6 228 225-463 1-330 (330) 133 protein:vir:99675 Length: 324 97.7 2.8E-05 1.7E-08 45.5 17.9 238 272-517 1-308 (324) 134 protein:vir:105822 Length: 273 97.6 3E-06 1.9E-09 50.9 11.9 252 241-514 1-273 (273) 135 protein:vir:102605 Length: 273 97.6 3E-06 1.9E-09 50.9 11.9 252 241-514 1-273 (273) 136 protein:vir:7990 Length: 273 # 97.6 3.8E-06 2.4E-09 50.3 11.6 252 241-514 1-273 (273) 137 protein:vir:10450 Length: 344 97.3 9.4E-05 5.8E-08 42.7 18.2 276 222-514 1-344 (344) 138 protein:vir:8885 Length: 347 # 97.3 9.4E-05 5.9E-08 42.6 17.9 282 222-515 1-347 (347) 139 protein:vir:107388 Length: 331 97.3 3.7E-05 2.3E-08 44.9 14.0 227 225-463 1-331 (331) 140 protein:vir:98525 Length: 331 97.3 3.7E-05 2.3E-08 44.9 14.0 227 225-463 1-331 (331) 141 protein:vir:107826 Length: 331 97.3 3.7E-05 2.3E-08 44.9 14.0 227 225-463 1-331 (331) 142 protein:vir:2201 Length: 345 # 97.3 0.0001 6.4E-08 42.4 18.0 275 225-514 1-345 (345) 143 protein:vir:94711 Length: 347 97.1 0.0001 6.5E-08 42.4 14.8 279 225-515 1-347 (347) 144 protein:vir:9927 Length: 295 # 97.1 9.4E-05 5.8E-08 42.7 14.4 261 236-517 1-295 (295) 145 protein:vir:3364 Length: 347 # 97.1 0.00019 1.2E-07 41.0 18.1 281 222-517 1-347 (347) 146 protein:vir:80213 Length: 334 97.0 0.0002 1.2E-07 40.9 16.6 273 225-514 1-334 (334) 147 protein:vir:7324 Length: 335 # 97.0 4.9E-05 3E-08 44.2 12.0 229 225-464 1-335 (335) 148 protein:vir:1541 Length: 347 # 96.8 0.00033 2E-07 39.7 16.1 282 222-517 1-347 (347) 149 protein:vir:106647 Length: 303 96.7 0.00016 9.9E-08 41.4 13.0 259 235-517 1-299 (303) 150 protein:vir:99424 Length: 360 96.7 0.0003 1.9E-07 39.9 14.1 296 216-516 1-360 (360) 151 protein:vir:1583 Length: 351 # 96.3 0.00034 2.1E-07 39.6 12.3 262 241-517 1-294 (351) 152 protein:vir:9875 Length: 296 # 96.2 0.00074 4.6E-07 37.8 13.7 256 225-515 1-296 (296) 153 protein:vir:5974 Length: 324 # 96.1 0.00036 2.3E-07 39.4 11.4 256 241-517 1-290 (324) 154 protein:vir:80180 Length: 381 96.1 0.00035 2.2E-07 39.5 11.3 284 225-517 1-326 (381) 155 protein:vir:103323 Length: 364 96.0 0.0011 6.8E-07 36.8 13.5 272 225-517 1-341 (364) 156 protein:vir:103285 Length: 296 95.9 0.00012 7.2E-08 42.2 7.9 267 225-512 1-296 (296) 157 protein:vir:78935 Length: 335 95.9 0.0012 7.7E-07 36.5 19.1 268 225-517 1-331 (335) 158 protein:vir:80068 Length: 301 95.5 0.00078 4.9E-07 37.6 11.1 263 241-512 1-301 (301) 159 protein:vir:107687 Length: 319 95.5 0.00046 2.9E-07 38.9 9.8 287 205-512 1-319 (319) 160 protein:vir:97031 Length: 402 95.4 0.0021 1.3E-06 35.2 17.9 273 225-517 1-340 (402) 161 protein:vir:100057 Length: 375 95.3 0.0023 1.4E-06 35.1 16.4 283 225-517 1-372 (375) 162 protein:vir:6324 Length: 335 # 95.3 0.0023 1.4E-06 35.0 17.9 272 225-517 1-331 (335) 163 protein:vir:104342 Length: 314 94.9 0.00041 2.5E-07 39.2 7.6 283 210-514 1-314 (314) 164 protein:vir:8843 Length: 317 # 94.1 0.003 1.9E-06 34.4 10.5 271 241-515 1-317 (317) 165 protein:vir:102944 Length: 330 93.8 0.0062 3.9E-06 32.7 12.2 267 225-517 1-296 (330) 166 protein:vir:79642 Length: 329 92.7 0.005 3.1E-06 33.2 9.4 296 199-517 1-329 (329) 167 protein:vir:105645 Length: 400 91.7 0.015 9E-06 30.7 16.4 272 225-517 1-340 (400) 168 protein:vir:78739 Length: 332 90.5 0.021 1.3E-05 29.8 18.5 280 219-512 1-332 (332) 169 protein:vir:5255 Length: 304 # 89.5 0.0095 5.9E-06 31.7 7.7 262 228-511 1-304 (304) 170 protein:vir:3136 Length: 322 # 85.7 0.051 3.2E-05 27.7 10.8 272 239-517 1-321 (322) 171 protein:vir:1781 Length: 221 # 75.6 0.15 9E-05 25.2 11.8 184 318-517 1-206 (221) 172 protein:vir:7019 Length: 401 # 58.6 0.41 0.00025 22.7 15.9 272 225-517 1-341 (401) 173 protein:vir:95131 Length: 325 57.4 0.43 0.00027 22.6 13.6 268 236-517 1-293 (325) 174 protein:vir:108303 Length: 418 50.3 0.61 0.00038 21.8 13.3 248 241-517 1-326 (418) 175 protein:vir:102655 Length: 322 48.2 0.68 0.00042 21.5 17.3 272 225-515 1-322 (322) 176 protein:vir:3643 Length: 336 # 33.6 1.3 0.00084 19.9 6.5 299 180-512 1-336 (336) 177 protein:vir:101557 Length: 336 28.6 1.7 0.0011 19.3 7.7 299 180-512 1-336 (336) 178 protein:vir:94070 Length: 339 26.6 1.9 0.0012 19.0 8.3 298 178-512 1-339 (339) 179 protein:vir:99311 Length: 463 24.7 2.1 0.0013 18.8 12.2 276 204-517 1-302 (463) 180 protein:vir:95603 Length: 463 24.7 2.1 0.0013 18.8 12.2 276 204-517 1-302 (463) No 1 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2.5e-130 Score=731.06 Aligned_cols=517 Identities=100% Similarity=1.396 Sum_probs=453.6 Q ss_pred CcccccceEEEEEEEecCCCCCCCcEECcchHHHHhccCCCeEEeecCCCCCcceeEEEEeecCceEEEEEeCchHHHHH Q lcl|Aclame:pro 1 MSGTFKDGVLIGKLVDYGSIDSYNTVFEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKVIGREDGIYIEAKPNNDIAYKR 80 (517) Q Consensus 1 ~~~~~~~g~~~g~a~~~~~~d~~~d~i~~gaf~~~~~~~~~~~~l~~Hd~~~~iG~~~~~~~~~Gl~~~~~~~~~~~~~~ 80 (517) ||+++++|+|+|||++||++|+|||+|+||||+++|.+++++||||+||+++|||+|++..++|||+|+++|++++++++ T Consensus 1 ~~~~~~~~~~~g~a~~~~~~d~~~~~~~~gaf~~~~~~~~~~~~l~~Hd~~~~ig~~~~~~~~~Gl~~~~~~~~~~~~~~ 80 (517) T protein:vir:97 1 MSGTFKDGVLIGKLVDYGSIDSYNTVFEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKVIGREDGIYIEAKPNNDIAYKR 80 (517) T ss_pred CccccCceEEEEEEEecCCCCCCCceEccchHHHHHhcCCCeEEeecCCCCCceEEEEEEEecCceEEEEeeCchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999988899999999999999999 Q ss_pred HHHHHhhcCCeeEeeeeeecccCCCceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 81 MKEAIDKGAGLSVTFQPVEASEVDGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKL 160 (517) Q Consensus 81 ~~~~~~~g~~~SiGf~~~~~~~~~~~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~ 160 (517) ++++|++|.||||||+++++++.+|+|+|++++|+|||+|++|||++|+|+.+++.....+.+++.....+++..+..++ T Consensus 81 ~~~~~~~g~~~S~gf~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k 160 (517) T protein:vir:97 81 MKEAIDKGAGLSVTFQPVEASEVDGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKL 160 (517) T ss_pred HHHHHHcCCceEEEEEeecccCCCCceEEEEEeeeeeeecchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Confidence 99999999999999999998888899999999999999999999999999999998888888887777777777777788 Q ss_pred hhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhh Q lcl|Aclame:pro 161 AADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAE 240 (517) Q Consensus 161 ~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (517) ..++.+++++++++..+...+....+..++.+.++.....+...........++.........+................ T Consensus 161 ~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (517) T protein:vir:97 161 AADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAE 240 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeee Confidence 88888888888887777777777777766666555444444444433434444444333333333333333333333444 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccceeeeecccccceeeecccccccccccceeeEeeHhhhhH Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYK 320 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~ 320 (517) +...+.+++.+|..+...+.+.+...+++.+++++.++++..++..+....+.|++||+.+|+++++|+++++.++++++ T Consensus 241 ~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~ 320 (517) T protein:vir:97 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYK 320 (517) T ss_pred cccccccccccchHHHHHHHHhhhhhccceeeeeeccccceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhh Confidence 55566778999999999999999999999999999999988888888888899999999999999999999999999999 Q ss_pred hHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHH Q lcl|Aclame:pro 321 YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSV 400 (517) Q Consensus 321 ~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~ 400 (517) ++++|+++|.|+.+|+.++|++||.++|+++|+++++.+||+|||++.+.++++++++.........+...++++..+.. T Consensus 321 ~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~ 400 (517) T protein:vir:97 321 YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSV 400 (517) T ss_pred hhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999998765555555555666778888888 Q ss_pred hhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhh Q lcl|Aclame:pro 401 ATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQG 480 (517) Q Consensus 401 ~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 480 (517) +...+.+++|||||.+|.+|++|||++|||||+++.+.+.+.++||...+++.+++++.+++++++|++++++++..+++ T Consensus 401 a~~~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~ 480 (517) T protein:vir:97 401 ATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQG 480 (517) T ss_pred HhhhccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCccccccccccCceeEeeccccEEEeecceeeeee Confidence 77777899999999999999999999999999999999999999998888888999999999999999999999999999 Q ss_pred hhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 481 TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 481 ~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) |++.+|++.|+.++|+||+|++|+||+|++|+||||| T Consensus 481 fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 481 TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eecccCceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 9999999999999999999999999999999999999 No 2 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=9e-87 Score=492.30 Aligned_cols=464 Identities=23% Similarity=0.266 Sum_probs=289.4 Q ss_pred ccceEEEEEEEecCCCCCCCcEECcchHHHHhccCCCeEEeecCCCCCcceeEEEEeecCceEEEEEeCchHHHHHHHHH Q lcl|Aclame:pro 5 FKDGVLIGKLVDYGSIDSYNTVFEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKVIGREDGIYIEAKPNNDIAYKRMKEA 84 (517) Q Consensus 5 ~~~g~~~g~a~~~~~~d~~~d~i~~gaf~~~~~~~~~~~~l~~Hd~~~~iG~~~~~~~~~Gl~~~~~~~~~~~~~~~~~~ 84 (517) -+.++|+|||++||++|+|||+|+||||+ +.++||||+|| +|||+|....+++++ .++..+++++++ T Consensus 1 ~~~~~~~G~a~~~~~~d~~gd~~~~~a~~-----~~~~~~l~~H~--~~iG~~~~~~~~~~~------~~t~~~~~~~~~ 67 (480) T protein:vir:40 1 MKVKAVRGIANPLGTIDAHGTVIESIANA-----GDGVDILNRHR--EKIGSGFVHLEGDNV------ILTGYVDEEQYT 67 (480) T ss_pred CcceEEEEEEecCCCCCCcchhecccccC-----CcCceeeeeCC--ceeeEEEEeecCCCC------ccchhHHHHHHH Confidence 77889999999999999999999999996 35799999997 799999888777664 468899999999 Q ss_pred HhhcC--CeeEeeeeeeccc--CCCceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhhh Q lcl|Aclame:pro 85 IDKGA--GLSVTFQPVEASE--VDGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMT--FDQNLMQELLDAK 158 (517) Q Consensus 85 ~~~g~--~~SiGf~~~~~~~--~~~~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~--~~~~~~~~~~e~~ 158 (517) ||+|. ||||||+++++++ .++.|+|++++|+|||+|++|||++|+|+.+++........+. +..+...+..+.. T Consensus 68 ~k~g~~~~~Sigf~~~~~~~~~~~~~~~~~~~~l~EvS~v~~pa~~~a~v~~vks~~~~~e~~~~~~e~~e~~~e~~e~~ 147 (480) T protein:vir:40 68 AEKIEETGLSVGFNANGVKAREIDGVGYYKDVTITEVSLTPLPSNKGAKVTKVREENKGEQEQMGANETQEIMKQAIEAG 147 (480) T ss_pred HHcCCccceeeeeeeeecccccCCCeEEEEEEEEEEeEEeecccchhhhhhhhhhhhhhhhhhhhhHHHHHHHHhhhhhh Confidence 99995 9999999998654 3567999999999999999999999999998886443322211 1111122222222 Q ss_pred hhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHH-hhccchhhHHHH Q lcl|Aclame:pro 159 KLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYM-SASLTKDPKAAW 237 (517) Q Consensus 159 ~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 237 (517) .+..++.+++.++.+..++........... .........+.++. ......+ ..+..+...... T Consensus 148 ~~~~el~akl~el~k~~ee~k~~~~~~~~~------------~~~~~~~~~e~r~~----~~~~~~~~e~~~~~~~~~~~ 211 (480) T protein:vir:40 148 VKVRELEAKVEELNKEREELKKEREASIPS------------EKPEDAERKFMREL----GSKMAEMPEQGFLREFANGA 211 (480) T ss_pred hhhhhHHHHHHHHHhHHHHHhhhhhhhccc------------cchhhhhhHHHHHH----HHHhccchhhhhhhhhhhhc Confidence 222222233222222211111111000000 00000000000000 0000000 000001111100 Q ss_pred hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccceeeeecccccceeeecccccccccccceeeEeeH-- Q lcl|Aclame:pro 238 TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTP-- 315 (517) Q Consensus 238 ~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~-- 315 (517) .+......+. .|+.+...+........+....++.... ....+...+.++.+++..+.. ++....+.+ T Consensus 212 --~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----g~~~~~~~~e~~~~~~~~~~~--~~~~~~~~~~~ 281 (480) T protein:vir:40 212 --DLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAED-----GVDDTFISGTFKAGTDKNKSQ--TATKRSLRPQM 281 (480) T ss_pred --cccccccccc-cccchhhheeechhhhhhhhhcceeeec-----cccceeeeeeeeccccccccc--ccccchhhHHH Confidence 0111111122 2233322222222222222222211110 001111122233333333333 344555554 Q ss_pred -hhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHH Q lcl|Aclame:pro 316 -QYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQEL 394 (517) Q Consensus 316 -~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l 394 (517) ++++++.+.|++++ ||++.|++||.++|+++|+.+++.+||+|+|+|.+.++++.+++...+... +.+++ T Consensus 282 v~~l~~~~k~t~~lL-----DDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~~----~~~d~ 352 (480) T protein:vir:40 282 AEAYLQMDKATVRGV-----NDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQI----EYTDL 352 (480) T ss_pred HHHHHHhHHHHHHHh-----hhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecccccccc----hhHHH Confidence 44555555555554 455679999999999999999999999999988887777776654332222 23344 Q ss_pred HHHH-HHhhhh-hcCC-EEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCC--ceeeeecCceEE Q lcl|Aclame:pro 395 LEKL-SVATPK-AADS-TLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVD--EKTAVSLSGYVT 469 (517) Q Consensus 395 ~~~l-~~~~~~-~~~a-~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~--~~~~~~~~~~~~ 469 (517) ++.| +....+ ..++ .|||||.||++|++|||++|||||||+++.+++.++||.++++.++.++ .+.+++++.|+. T Consensus 353 id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~ 432 (480) T protein:vir:40 353 FEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVL 432 (480) T ss_pred HHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeeccccCCcceeeeCCccEE Confidence 4433 333333 3566 6999999999999999999999999999999999999998877665554 456677888999 Q ss_pred EeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 470 NGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 470 ~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ++++++++++||++.+|+++|+++.||||+|.+|+|++|....--. | T Consensus 433 ~~d~~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~-~ 479 (480) T protein:vir:40 433 IGDLNVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSL-G 479 (480) T ss_pred EEecccceecccccccchhhhhhhhhhceeeEccccEEEEEeccCc-C Confidence 9999999999999999999999999999999999999996644321 1 No 3 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=3.8e-82 Score=466.99 Aligned_cols=501 Identities=15% Similarity=0.135 Sum_probs=315.4 Q ss_pred Ccccc-cceEEEEEEEecCCCCCCCcEECcchHHHHhccCCCeEEeecCCCCCcceeEEEEeecCceEEEEEeCchH--- Q lcl|Aclame:pro 1 MSGTF-KDGVLIGKLVDYGSIDSYNTVFEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKVIGREDGIYIEAKPNNDI--- 76 (517) Q Consensus 1 ~~~~~-~~g~~~g~a~~~~~~d~~~d~i~~gaf~~~~~~~~~~~~l~~Hd~~~~iG~~~~~~~~~Gl~~~~~~~~~~--- 76 (517) ||... ++|+|+|||++ +++|+|||+|.|++|+. .+.+||||+||+++|||+|++.++++||+|++++..+. T Consensus 12 ~k~~~~~~~~~~g~as~-~~~d~~gd~i~~~~~~~----~~~~~~l~~H~~~~~iG~~~~~~~~~gl~~~~~~~~~~~~~ 86 (645) T protein:vir:93 12 VKSFSEDERVITGIAST-PSPDRDGDILEPEGAEF----GSALPFLWQHDHSRPVGQCTVRRVSEGLEITATLAKPVPDM 86 (645) T ss_pred EEeeecCceEEEEEEec-CCccccCceechhhhcc----cCCceeeeccCCCCceeEEEEEecCCceEEEEEeccccccc Confidence 77755 56899999997 77999999999999863 45789999999999999999888888999999985432 Q ss_pred ------HHHHHHHHHhhcC--CeeEeeeeeecccC-CCceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 77 ------AYKRMKEAIDKGA--GLSVTFQPVEASEV-DGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFD 147 (517) Q Consensus 77 ------~~~~~~~~~~~g~--~~SiGf~~~~~~~~-~~~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~ 147 (517) .++++|.+||+|. +|||||+++++++. ++.++|++|+|||||+|++|||++|.|+.+|+............ T Consensus 87 ~~~~~~~~~~~~~~~k~G~~~~~SiG~~~~~~~~~~~~~~~i~~~~l~EiS~V~~pAn~~a~v~~~ks~~~~~~~~~~~~ 166 (645) T protein:vir:93 87 PSQLAARLDEAWAAIKTGLVRGLSVGFRPHEYTFLDGGGLHFLRWELMEVSAVTVPANAECTIRTIKSYDRQFSAASGNR 166 (645) T ss_pred ccchHHHHHHHHHHHhcCcccceeeeeEEeeeeeecCCCeEEEEEEEEEEeeeccCCCCcchhhhhhhccchhhhhhhhh Confidence 4688999999994 99999999998754 55788999999999999999999999999886332111100000 Q ss_pred hh-----------------hhh-----hhhhhhhhhhhhhh------------------hhhhhHHHHHHHhhhhhhhHH Q lcl|Aclame:pro 148 QN-----------------LMQ-----ELLDAKKLAADLNA------------------KLKERENGGDNAALKTVSELA 187 (517) Q Consensus 148 ~~-----------------~~~-----~~~e~~~~~~e~~a------------------~l~~~~~~~~e~~~~~~~~~~ 187 (517) .. ... ...+..+...+..+ .+.+.+.+..+.....+.+++ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~ 246 (645) T protein:vir:93 167 KPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVD 246 (645) T ss_pred cchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHH Confidence 00 000 00000000000000 111111111111222222233 Q ss_pred HHHHHHhhHHHhhhhhhh------hh-------hhhhhhHHHHHH-HHHHHH-------hhc-------------cch-- Q lcl|Aclame:pro 188 ANLMKQRESEKILGVEAL------KV-------TPEATEFLKTRE-AEVAYM-------SAS-------------LTK-- 231 (517) Q Consensus 188 ~~~~~~~~~~~~~~~~~~------~~-------~~~~~~~~~~~~-~~~~~~-------~~~-------------~~~-- 231 (517) .++.+.+........... .. ....+...+..+ ..+... .+. ... T Consensus 247 ~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~ 326 (645) T protein:vir:93 247 AHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRL 326 (645) T ss_pred HHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhh Confidence 232222211111100000 00 000000000000 000000 000 000 Q ss_pred --hhHHHHhhhh--hcccccccccchhhhhhHHHhHhhhhhhhhceeee-----ccc-cceeeeecccccceeeeccccc Q lcl|Aclame:pro 232 --DPKAAWTAEL--KERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE-----NLP-TLVVGGDNALTQGTGHTTGTDK 301 (517) Q Consensus 232 --~~~~~~~~~~--~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~-----~~~-~~~~~~~~~~~~a~~~~eg~~~ 301 (517) ..+.+..... +....+++.+|+.+...|++.++..+.+.++.... .++ ...+|..+....+.|++||+.+ T Consensus 327 ~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~ 406 (645) T protein:vir:93 327 HHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTK 406 (645) T ss_pred hhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccc Confidence 0011111111 11223578899999999999999988888775431 122 3467788888899999999999 Q ss_pred ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCc---cccccccccc Q lcl|Aclame:pro 302 TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGV---SETQIYPVVG 378 (517) Q Consensus 302 ~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~---~~~gi~~~~~ 378 (517) |+++++|+++++.++++++++++|++++.|+.++ +++||.++|++++++++|.+||+|+|++. .+.|++.... T Consensus 407 ~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~----~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~ 482 (645) T protein:vir:93 407 PLTKFDFESITFSHAKVSAIAVLTEELIRFSSPA----ADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVK 482 (645) T ss_pred cccccceeEEEEeeEEEEEeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceecccc Confidence 9999999999999999999999999999988765 89999999999999999999999998753 2334433221 Q ss_pred ccccccccccccHHHHHHHHHH---hhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceecccc Q lcl|Aclame:pro 379 DAWATNVTGTTNIQELLEKLSV---ATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVA 455 (517) Q Consensus 379 ~~~~~~~~~~~~~d~l~~~l~~---~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~ 455 (517) .+...+ ....++...+.. +.....+++|||||.++.+|++|||++|+|+|+.. . ....+|+|.|.++ +.. T Consensus 483 ---~~~~~~-~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~-~-~~~~tL~G~PV~~-s~~ 555 (645) T protein:vir:93 483 ---GTASSG-NPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDM-T-LLGGSFQGLPVIV-SQY 555 (645) T ss_pred ---cccccc-chHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCC-C-CCCceeeceeeEE-ecc Confidence 111111 222334333332 33334568999999999999999999999999543 2 2335788865544 445 Q ss_pred CCc-eeeeecCceEEEeeeheee----------hhhh--------------hcccchHHHHHhhhhcceeecccceEEEE Q lcl|Aclame:pro 456 VDE-KTAVSLSGYVTNGSRGMEF----------EQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGT 510 (517) Q Consensus 456 ~~~-~~~~~~~~~~~~~~~~~~~----------~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~ 510 (517) +++ ...++++.+.++.+-++.. .++. .|++|++.++++.|+++.+++|+||++++ T Consensus 556 vp~~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt 635 (645) T protein:vir:93 556 VGDQLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVIT 635 (645) T ss_pred CCcceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEe Confidence 554 3445666666554333221 1110 16789999999999999999999998765 Q ss_pred ---eCCCCCC Q lcl|Aclame:pro 511 ---YTPPVAG 517 (517) Q Consensus 511 ---~tp~~a~ 517 (517) +-++--| T Consensus 636 ~~~~g~~~~~ 645 (645) T protein:vir:93 636 GVNYGSASGG 645 (645) T ss_pred cccCCcccCC Confidence 1222222 No 4 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.2e-66 Score=381.98 Aligned_cols=500 Identities=10% Similarity=0.055 Sum_probs=308.6 Q ss_pred CcccccceEEEEEEEecCCC-CCCC-cE--ECcchHHHHhccCCCeEEeecCCCCCcceeEEEEe-e-cCceEEEEEeCc Q lcl|Aclame:pro 1 MSGTFKDGVLIGKLVDYGSI-DSYN-TV--FEPGAFDEYVGSEQTFNLDYRHDMQDKLAKFKVIG-R-EDGIYIEAKPNN 74 (517) Q Consensus 1 ~~~~~~~g~~~g~a~~~~~~-d~~~-d~--i~~gaf~~~~~~~~~~~~l~~Hd~~~~iG~~~~~~-~-~~Gl~~~~~~~~ 74 (517) -+-..++.+|++.+++=..+ ..++ |+ +.|+|++-+-. ....||||+||+++|||+++... + ++||+++++|+. T Consensus 28 ~~~~~~~r~~~~~~~~~~~~~~~~~~e~l~~~~~~~~~~~~-~~~~~~l~~H~~~~~iG~v~~~~~~~~~~~~~~~~~~~ 106 (632) T protein:vir:96 28 DSIDQEARTVELAASSEYPVPRWFGREILDHSPGAIRMGRL-KNGAPLLDSHSLREQIGVVEEVWLDDDRRLRARVRFSR 106 (632) T ss_pred ccccccccEEEEEEecCCccccccCcccccccccccchhhc-cCCCeeeccCCCCCcceEEEEEEEeCCceEEEEEEeCC Confidence 56777888999998883223 3333 22 36888865422 23489999999999999997654 3 458999999999 Q ss_pred hHHHHHHHHHHhhcC--CeeEeeeeeecccC--CC---ceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----- Q lcl|Aclame:pro 75 DIAYKRMKEAIDKGA--GLSVTFQPVEASEV--DG---VAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEEN----- 142 (517) Q Consensus 75 ~~~~~~~~~~~~~g~--~~SiGf~~~~~~~~--~~---~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~----- 142 (517) +..+.++++.|++|. +|||||++.++++. ++ .+++++|+|+|||+|++|||+.+.|...++....... T Consensus 107 ~~~~~~~~~~~~~g~~~~~SiG~~~~~~~~~~~~~~~~~~~~~~~~~~EiS~v~~pAd~~a~v~~~~~~~~~~~~~~~~~ 186 (632) T protein:vir:96 107 SAKAEELWQDVLDGIRRHISIGYIIHEMVLESSGDQGDTYRVMDWEPYEISLISVPADPTVGVGRSIDIGNITIRGAEMP 186 (632) T ss_pred ChhHHHHHHHHhcCcccceeeeeeeeeeeeecCCCCcceEEEEEEEEEEEEEeecCCCCcceeeeecccccccccccccc Confidence 999999999999994 99999999987632 12 2568999999999999999999988543321110000 Q ss_pred ----hhh---hhhhhh-------hhhhhhhhh------hhhhhhh---hhhhHHHH------HHHhh--hhhhhHHHHHH Q lcl|Aclame:pro 143 ----KMT---FDQNLM-------QELLDAKKL------AADLNAK---LKERENGG------DNAAL--KTVSELAANLM 191 (517) Q Consensus 143 ----~~~---~~~~~~-------~~~~e~~~~------~~e~~a~---l~~~~~~~------~e~~~--~~~~~~~~~~~ 191 (517) ... ...... ......... ..++..+ +.++.+.. .+... +......++.. T Consensus 187 ~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~~l 266 (632) T protein:vir:96 187 DKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRALVL 266 (632) T ss_pred chhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHHHH Confidence 000 000000 000000000 0000000 00000000 00000 00000000000 Q ss_pred H-HhhHH---H-hhhhhhh----------hh---hhhhhhH--HHHHH-------------HHHH-HHhhccchhhH--- Q lcl|Aclame:pro 192 K-QRESE---K-ILGVEAL----------KV---TPEATEF--LKTRE-------------AEVA-YMSASLTKDPK--- 234 (517) Q Consensus 192 ~-~~~~~---~-~~~~~~~----------~~---~~~~~~~--~~~~~-------------~~~~-~~~~~~~~~~~--- 234 (517) . ..... . ....... .. ..+.+.. .+... .... ........+.+ T Consensus 267 d~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~ 346 (632) T protein:vir:96 267 ERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFY 346 (632) T ss_pred HHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhh Confidence 0 00000 0 0000000 00 0000000 00000 0000 00000000000 Q ss_pred ----HHHhh--hhhcccccccccchhh-hhhHHHhHhhhhhhhhc-eeeecc--ccceeeeecccccceeeecccccccc Q lcl|Aclame:pro 235 ----AAWTA--ELKERGISGMPAPAGI-LKRIQDAVNDEGSLLPF-IRHENL--PTLVVGGDNALTQGTGHTTGTDKTES 304 (517) Q Consensus 235 ----~~~~~--~~~~~~~~~~~vp~~i-~~~i~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~a~~~~eg~~~~~~ 304 (517) ..... .....+.+++++|+.+ ...+++.++..+.+.++ ++..+. ....+|..++...+.|++||+.++++ T Consensus 347 ~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s 426 (632) T protein:vir:96 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDS 426 (632) T ss_pred hhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCcccccc Confidence 00011 1112344578899886 46788999888888776 444433 34567888888899999999999999 Q ss_pred cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc Q lcl|Aclame:pro 305 NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN 384 (517) Q Consensus 305 ~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~ 384 (517) +++|+++++.++++++++++|++++.|+.++ +++||.++|.++++.++|.++|+|+|++..+.||++.++...... T Consensus 427 ~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~----~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~ 502 (632) T protein:vir:96 427 DFDFTTLSFSPKTIAGAVPVTRKLRKQSSIH----VENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY 502 (632) T ss_pred ccceeeEEeeeeEEEEehhhHHHHHhccchH----HHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceec Confidence 9999999999999999999999999988665 899999999999999999999999998777789988765433333 Q ss_pred ccccccHHHHHHHHHHhh---hhhcCCEEEEcHHHHHHHHH--hhcCCCCEeccCCCCCCccceecCccceeccccCCc- Q lcl|Aclame:pro 385 VTGTTNIQELLEKLSVAT---PKAADSTLVIHRNDLAAIRF--LKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE- 458 (517) Q Consensus 385 ~~~~~~~d~l~~~l~~~~---~~~~~a~~vmn~~~~~~l~~--lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~- 458 (517) ..+..+.+++.++..... ....+++|+||+.++..+.+ ++|++|+|||+++ +++|++. +.+..++. T Consensus 503 ~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~-------~l~G~pv-~~s~~ip~~ 574 (632) T protein:vir:96 503 PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN-------EVNGYRA-EASNQIPAD 574 (632) T ss_pred ccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecCC-------eecccce-EeccccccC Confidence 333344455555544433 23346789999998777765 7899999999753 5677654 44444544 Q ss_pred -eeeeecCceEEEeeeheeeh--hhhhcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 459 -KTAVSLSGYVTNGSRGMEFE--QGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 459 -~~~~~~~~~~~~~~~~~~~~--~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) ...++++.|.++...++... +..++.++.+.|++..|++++|++|++|+++...+ T Consensus 575 ~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 575 TWIFGDWSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred cEEEeecceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 44566777776665554433 33456789999999999999999999999999777 No 5 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.7e-50 Score=293.44 Aligned_cols=376 Identities=10% Similarity=0.059 Sum_probs=237.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) .+.+..+++..++..+..+..++...+. +++.+.++.++..+.+. ......+++......+...... .... T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~------~~~~e~~~~~l~~~~e~-~~~~~~~~e~~~~~~~~~~~~~--~~~~ 71 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKR------IDAIEQEKGKLAGEVET-LNGKLAELENLKSDLEAELAEV--KRPA 71 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh--hccc Confidence 1112222221111111111111111100 01111111111111110 0000111111111111000000 0000 Q ss_pred hhhhhhhHHHHHHHHHHHHhhccchhhHHHHhh--hhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ce Q lcl|Aclame:pro 207 VTPEATEFLKTREAEVAYMSASLTKDPKAAWTA--ELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LV 282 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~ 282 (517) ............+++..++..+...+....... .......+|+.+|+.+...|++.++..+++++++++.++.+ .. T Consensus 72 ~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~ 151 (407) T protein:vir:48 72 GGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYK 151 (407) T ss_pred cccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceE Confidence 000000111112233333333333332221111 12223456889999999999999999999999998877654 45 Q ss_pred eeeecccccceeeeccccccccc-ccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 283 VGGDNALTQGTGHTTGTDKTESN-ITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII 361 (517) Q Consensus 283 ~~~~~~~~~a~~~~eg~~~~~~~-~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l 361 (517) +++......+.|++|++..|++. ++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++.+|| T Consensus 152 ~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~~~~a~l 227 (407) T protein:vir:48 152 KLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFN----VEDWINSELALEFAEQEEIAFT 227 (407) T ss_pred EEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhhh Confidence 66777778899999999999764 799999999999999999999999998876 9999999999999999999999 Q ss_pred cccccCccccccccccccccc-------------ccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCC Q lcl|Aclame:pro 362 MGGVTGVSETQIYPVVGDAWA-------------TNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKN 427 (517) Q Consensus 362 ~G~G~~~~~~gi~~~~~~~~~-------------~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~ 427 (517) +|+|+++| .||++..+.... ....+..+.+++++.+..... +..+++||||+++|..|++|||++ T Consensus 228 ~G~G~~~p-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~ 306 (407) T protein:vir:48 228 SGDGSKKP-KGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDND 306 (407) T ss_pred ccCCCCcc-ceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccC Confidence 99999865 688865442211 112233456788887776544 445789999999999999999999 Q ss_pred CCEeccCCCCCCccceecCccceeccccCCce-------eeeecC-ceEEEeeeheeehhhhhcccchHHHHHhhhhcce Q lcl|Aclame:pro 428 GNYVFPVGVSNQTIATHFGFNRLVQSVAVDEK-------TAVSLS-GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGS 499 (517) Q Consensus 428 Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~-------~~~~~~-~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~ 499 (517) |||||+++.+.+.+.+++|.|++ +++.++.. ..++++ .|.+..+.++....+.++.+|++.|++..|+++. T Consensus 307 Gr~l~~~~~~~g~~~~l~G~PV~-~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~ 385 (407) T protein:vir:48 307 GNYLWRPGIELGQPSSLAGYGIV-ENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGM 385 (407) T ss_pred CceeeccCcCCCCCceecceeeE-EecCcCCccCCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEEEEEeccE Confidence 99999999999998999997654 44445432 224565 4777778888888777788999999999999999 Q ss_pred eecccceEEEEeCCCC----CC Q lcl|Aclame:pro 500 LEYKGTTAYGTYTPPV----AG 517 (517) Q Consensus 500 v~~~~a~~~~~~tp~~----a~ 517 (517) |.+|+||++++++++. |- T Consensus 386 v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 386 LVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred EecccceEEEEeeccCCCCCCC Confidence 9999999999998772 22 No 6 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.3e-49 Score=288.56 Aligned_cols=372 Identities=11% Similarity=0.064 Sum_probs=234.5 Q ss_pred hh-hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhh-hhh Q lcl|Aclame:pro 127 NA-VVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILG-VEA 204 (517) Q Consensus 127 ~A-~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 204 (517) .+ .+..+++...+..++.++.+....+.. +.++++.+.+.+..+..+ ....+++..+.+.+....... ... T Consensus 1 m~~~lk~l~~~~~el~~~~~~~k~~~~~~~---~~~e~~~~~l~~~~~~l~----~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (401) T protein:vir:44 1 MAVDIKDVEQVAQELQQKFDDFKAKNDKRV---EAIEQEKGKLAGQVETLN----GKLSELENLKSDLEKELLELKRPAR 73 (401) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhhcccc Confidence 00 001111111111111111111100000 001111111111111111 111111111111111100000 000 Q ss_pred hhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhh--hcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-- Q lcl|Aclame:pro 205 LKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAEL--KERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-- 280 (517) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-- 280 (517) ....... ...++.+..+...+...+........+ .....+++.+|+.+...|++.++..+++++++++.++.+ T Consensus 74 ~~~~~~~---~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 150 (401) T protein:vir:44 74 GAQNKVA---AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSD 150 (401) T ss_pred ccccchh---HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 0001111 111222223332222222222211112 222345889999999999999999999999999887754 Q ss_pred ceeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 281 LVVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRA 359 (517) Q Consensus 281 ~~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~ 359 (517) ..++.......+.|++||..+|++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++.+ T Consensus 151 ~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~ai~~~~~~~ 226 (401) T protein:vir:44 151 YKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFN----VEAWINSELATEFAEQEEIA 226 (401) T ss_pred eEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhh Confidence 356667777888999999998865 4799999999999999999999999998765 99999999999999999999 Q ss_pred hhcccccCccccccccccccccc-------------ccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhc Q lcl|Aclame:pro 360 IIMGGVTGVSETQIYPVVGDAWA-------------TNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKD 425 (517) Q Consensus 360 ~l~G~G~~~~~~gi~~~~~~~~~-------------~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD 425 (517) ||+|||+++| .||++..+.... .........+++++++..... +..+++||||+++|..|++||| T Consensus 227 ~l~G~G~~~p-~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd 305 (401) T protein:vir:44 227 FTTGDGTKKP-KGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKD 305 (401) T ss_pred hhccCCCCcc-ceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhc Confidence 9999999765 688865442111 112223456788887776544 4567899999999999999999 Q ss_pred CCCCEeccCCCCCCccceecCccceeccccCCce-------eeeecC-ceEEEeeeheeehhhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 426 KNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEK-------TAVSLS-GYVTNGSRGMEFEQGTILVENNKEYLFEMPIS 497 (517) Q Consensus 426 ~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~-------~~~~~~-~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvg 497 (517) ++|||||+++.+.+.+.+++|.|+++ +..++.. ..++++ .|.+..+.+++.+.+.++.++++.|++..|+| T Consensus 306 ~~G~~l~~~~~~~g~~~~l~G~PVv~-~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d 384 (401) T protein:vir:44 306 TEGNYLWRPGLELGQPSSLAGYGIAE-NEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTG 384 (401) T ss_pred cCCceeecCCcCCCCCceecceeeEE-ecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccccCCcEEEEEEEEec Confidence 99999999999999889999987654 4444431 224565 47777788888888888889999999999999 Q ss_pred ceeecccceEEEEeCCC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPP 514 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~ 514 (517) +.+.+|+||+++.+.++ T Consensus 385 ~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 385 GMLVDSQAIKLLKIAAA 401 (401) T ss_pred cEEecccceEEEEeecC Confidence 99999999999999988 No 7 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=2.3e-49 Score=287.19 Aligned_cols=396 Identities=13% Similarity=0.079 Sum_probs=233.2 Q ss_pred ceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhHHHHHHHhhhhhh Q lcl|Aclame:pro 106 VAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKK-LAADLNAKLKERENGGDNAALKTVS 184 (517) Q Consensus 106 ~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~-~~~e~~a~l~~~~~~~~e~~~~~~~ 184 (517) -|+++ -..-..-..+....+.++..+...+++.....+++..+... ..++......+. ++..+....... T Consensus 1 ~~~~~--------~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~-~~~~~~l~~~~~ 71 (418) T protein:vir:10 1 MSHMN--------EPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVET-KATVDELLIKQG 71 (418) T ss_pred CCCch--------hHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHH-HHHHHHHHHHHH Confidence 11111 11111111111111111111111111111111111100000 000000000011 111111111222 Q ss_pred hHHHHHHHHhhHHHhhhhhhhhhhhh-hhhH---HHHHHHHHHHHhhccc-h-h---hHHHHhhhhhcccccccccchhh Q lcl|Aclame:pro 185 ELAANLMKQRESEKILGVEALKVTPE-ATEF---LKTREAEVAYMSASLT-K-D---PKAAWTAELKERGISGMPAPAGI 255 (517) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~-~-~---~~~~~~~~~~~~~~~~~~vp~~i 255 (517) ++..++.+.++............... ..+. ....+....+...... . + ...............++++|+.+ T Consensus 72 ~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~ 151 (418) T protein:vir:10 72 ELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADR 151 (418) T ss_pred HHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhH Confidence 22222222222211111111100000 0000 0001111111111100 0 0 00000111112334577899999 Q ss_pred hhhHHHhHhhhhhhhhceeeecccc--ceeeeecc-cccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHh Q lcl|Aclame:pro 256 LKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSN 332 (517) Q Consensus 256 ~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~ 332 (517) ...|++.++..+++++++++.++++ ...+.... ...+.|+.||+.+|+++++|+++++.++++++++++|++++.++ T Consensus 152 ~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds 231 (418) T protein:vir:10 152 QAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA 231 (418) T ss_pred HHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHhH Confidence 9999999999999999998877754 45666555 46788999999999999999999999999999999999999876 Q ss_pred hcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccc-cccccccccHHHHHHHHHHhh-hhhcCCEE Q lcl|Aclame:pro 333 ATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAW-ATNVTGTTNIQELLEKLSVAT-PKAADSTL 410 (517) Q Consensus 333 ~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~-~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~ 410 (517) . .|++||.++|+++++++++.+||+|+|+++++.||++.++... ....+....+++++.++.... .++.+++| T Consensus 232 ~-----~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 306 (418) T protein:vir:10 232 P-----ALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGI 306 (418) T ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEE Confidence 3 4999999999999999999999999999988899998776433 333334456678887776654 44567899 Q ss_pred EEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceee--eecCc-eEEEeeeheeeh----hhhhc Q lcl|Aclame:pro 411 VIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTA--VSLSG-YVTNGSRGMEFE----QGTIL 483 (517) Q Consensus 411 vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~----~d~~~ 483 (517) ||||.+|..|++|||++|||||+ ++..+...+++|.++ +++..++...+ ++++. |.+..+.++... ...++ T Consensus 307 v~n~~~~~~L~~lkd~~G~~i~~-~~~~~~~~~l~G~pV-~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f 384 (418) T protein:vir:10 307 VLNPIDWASIELTKDSQGRYIVG-NPVNGTTPRLWNLPV-VETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDF 384 (418) T ss_pred EEcHHHHHHHHHhhcCCCceecc-ccccCCCceecceee-EEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhh Confidence 99999999999999999999996 455666778888654 55556665443 46665 555555554332 23347 Q ss_pred ccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 484 VENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 484 ~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .+|.+.|+++.|+++.+++|+||+++++++|++| T Consensus 385 ~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 385 EKNMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred hcCceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 8999999999999999999999999999999999 No 8 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=1.5e-48 Score=282.81 Aligned_cols=394 Identities=12% Similarity=0.100 Sum_probs=234.5 Q ss_pred CceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH-Hhhhhh Q lcl|Aclame:pro 105 GVAYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDN-AALKTV 183 (517) Q Consensus 105 ~~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e-~~~~~~ 183 (517) -...+. +-.|+--| +.|...-. ..+.+.++...+++++..+.+.+..+..+ ++.++++++.+..... ...+.. T Consensus 1 ~~~~~~-~~~~~~~~-~~~~~~~~--~~l~e~ra~~~~e~~~l~~~~~~~~~~~k--~~~~~~~~~~~~~~~~~e~~~~~ 74 (425) T protein:vir:10 1 MSKKLL-IAVLTAAL-TGPVGAVP--RGIISVRAEGPTEVKALIENLQKAFHDFK--AEHTKQLDAVKAGLPTSDALAKV 74 (425) T ss_pred CchhHH-HHhhHHHh-hhhhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhhccHHHHHHH Confidence 000000 00011001 11111100 00001111111111111111100000000 0001111111100000 000001 Q ss_pred hhHHHHHHHHhhHHHhhhh-----hhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhh Q lcl|Aclame:pro 184 SELAANLMKQRESEKILGV-----EALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKR 258 (517) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~ 258 (517) .++..++...+........ ..................+..+... .+.+++... .....+++++|+.+... T Consensus 75 ~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~---~e~~~al~~--~t~~~gG~lvP~~~~~~ 149 (425) T protein:vir:10 75 DKVSADLEALQAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKR---GDVQAALNK--GEDSEGGYLTPIEWDRT 149 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHhhh---hhhHHHhhc--CcCCCCceeccHhHHHH Confidence 1111111111111000000 0000000000000011111111111 122222222 23345688999999999 Q ss_pred HHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeeccccccccc-ccceeeEeeHhhhhHhHhhhHHHHHHhhcc Q lcl|Aclame:pro 259 IQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESN-ITLQTRVLTPQYVYKYIKLPKIVMNSNATD 335 (517) Q Consensus 259 i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~-~~f~~~~~~~~~~~~~~~iS~~li~d~~~d 335 (517) |++.++..+++++++++.++++ ..+|+......+.|++||+..|+++ ++|+++++.++++++++++|++++.|+.++ T Consensus 150 ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~ 229 (425) T protein:vir:10 150 ITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEID 229 (425) T ss_pred HHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhH Confidence 9999999999999998877653 4567777888899999999999876 689999999999999999999999988765 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccc-------------cccccccHHHHHHHHHHhh Q lcl|Aclame:pro 336 IAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWAT-------------NVTGTTNIQELLEKLSVAT 402 (517) Q Consensus 336 ~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~-------------~~~~~~~~d~l~~~l~~~~ 402 (517) |++||.++|+++++++++.+||+|+|+++| .||++..+..... ..++....+++++.+.... T Consensus 230 ----l~~~i~~~la~ai~~~~d~~~l~G~G~~~p-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~ 304 (425) T protein:vir:10 230 ----LESWLATEVQTEFAKQEGKAFLAGDGTNKP-NGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLP 304 (425) T ss_pred ----HHHHHHHHHHHHHHHHHHhhhhcccCCCCc-ceeeeccccccccccccccccccccccccccccHHHHHHHHhhhh Confidence 999999999999999999999999998764 6888765432111 1123345677887776644 Q ss_pred -hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc-------eeeeecCc-eEEEeee Q lcl|Aclame:pro 403 -PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE-------KTAVSLSG-YVTNGSR 473 (517) Q Consensus 403 -~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~-------~~~~~~~~-~~~~~~~ 473 (517) .+..+++|||||++|.+|++|||++|||||+++...+.+.+++|.|+++ +..++. ...++++. |.+..+. T Consensus 305 ~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~-~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~ 383 (425) T protein:vir:10 305 SAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTE-VPDMPDVAANSTPILFGDFQQTYLIIDRI 383 (425) T ss_pred hhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEE-ecCcCCccCCccEEEEEehhccEEEEEec Confidence 4456789999999999999999999999999999999889999976554 334442 22345554 6777788 Q ss_pred heeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 474 GMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 474 ~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) +++...+.++.++++.|++..|++|.|.+|+||+++.+..+= T Consensus 384 ~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 384 GVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 888888878889999999999999999999999999986666 No 9 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=2.1e-47 Score=276.52 Aligned_cols=369 Identities=14% Similarity=0.076 Sum_probs=226.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHH------ Q lcl|Aclame:pro 125 NKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEK------ 198 (517) Q Consensus 125 ~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~------ 198 (517) -+...+..++++......+++.......+ ..+.+.+.+..+........++.++++..+... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~------------~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~ 68 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAG------------KEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVT 68 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhc------------ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33344444444433333332222111100 011111111111111111222222211110000 Q ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHH----HHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhcee Q lcl|Aclame:pro 199 ILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKA----AWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIR 274 (517) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~ 274 (517) ............... ........+...+...+.+. .........+..++..|+.....|.+.++....++++++ T Consensus 69 ~~~~~~~~~~~~~~~--~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~ 146 (390) T protein:vir:62 69 SLLSGLQGSGSGAQR--SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGAT 146 (390) T ss_pred HHHhhcccccccchh--hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcce Confidence 000000000000000 00000001111111111111 001111222233344444444445566777777888887 Q ss_pred eecccc---ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 275 HENLPT---LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDM 351 (517) Q Consensus 275 ~~~~~~---~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~ 351 (517) +.+..+ ..+|.......+.|++|++..|+++++|+++++.++++++++++|+++++|+.++ |++||.++|+++ T Consensus 147 ~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~ 222 (390) T protein:vir:62 147 TFTTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLD----LVGFLVSDAGPA 222 (390) T ss_pred eeecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHH----HHHHHHHHHHHH Confidence 766533 4577777878899999999999999999999999999999999999999998876 899999999999 Q ss_pred HHHHHHhhhhcccccCccccccccccccccc---ccccccccHHHHHHHHHHhhhh-hcCCEEEEcHHHHHHHHHhhcCC Q lcl|Aclame:pro 352 VIMAVNRAIIMGGVTGVSETQIYPVVGDAWA---TNVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKDKN 427 (517) Q Consensus 352 ~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~---~~~~~~~~~d~l~~~l~~~~~~-~~~a~~vmn~~~~~~l~~lKD~~ 427 (517) ++.+++.+||+|+|. | .||++..+.... ...+...+.+++++.++..... ..+++||||+++|..|++|||++ T Consensus 223 i~~~~d~~~l~G~G~--p-~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~ 299 (390) T protein:vir:62 223 IGDAMGRHFITGTGQ--P-RGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDAN 299 (390) T ss_pred HHHHHHhhhhccCCc--c-ccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccC Confidence 999999999999983 3 688776543221 1122334567788777665444 45789999999999999999999 Q ss_pred CCEeccCCCCCCccceecCccceeccccCCcee--eeecCceEEEeeeheee--hhhhhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 428 GNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKT--AVSLSGYVTNGSRGMEF--EQGTILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 428 Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~--~~d~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) |||||+++...+.+.+++|.|+ +++..++... .++++.|.++.+.++.. ..+..+.+|++.|++..|+||.+.+| T Consensus 300 g~~l~~~~~~~g~~~~l~G~Pv-~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~ 378 (390) T protein:vir:62 300 GQYLWQSGLTVGAPSLFNGKVV-ETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDA 378 (390) T ss_pred CCeeecCCcCCCccceecccce-EEecCCCCccEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeech Confidence 9999999999998888999754 4455555544 46677888877766644 44566889999999999999999999 Q ss_pred cceEEEEeCCCC Q lcl|Aclame:pro 504 GTTAYGTYTPPV 515 (517) Q Consensus 504 ~a~~~~~~tp~~ 515 (517) +||++.+++|+- T Consensus 379 ~A~~~l~~~~~a 390 (390) T protein:vir:62 379 RGAKVLTVTPGA 390 (390) T ss_pred hheEEEEeecCC Confidence 999999998877 No 10 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.7e-47 Score=275.90 Aligned_cols=373 Identities=10% Similarity=0.095 Sum_probs=225.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHH-------HhhHHHhhh- Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMK-------QRESEKILG- 201 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~-------~~~~~~~~~- 201 (517) +...++.. ..+.++ .+.+....+ ++...+.+...+..+...+.+..+..++.+ .+....... T Consensus 1 mk~~~em~-~~l~el---~~~~~~~~~------e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:47 1 MKTKEELQ-SEISDI---KRQIDLKVK------YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSEN 70 (415) T ss_pred CchHHHHH-HHHHHH---HHHHHHHHH------HHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 11111111 111111 010000000 000000000000001111111111111111 111000000 Q ss_pred hhhhhhhhhhhhHHHH---HHHHHHHHhhccchhhHHH----------HhhhhhcccccccccchhhhhhHHHhHhhhhh Q lcl|Aclame:pro 202 VEALKVTPEATEFLKT---REAEVAYMSASLTKDPKAA----------WTAELKERGISGMPAPAGILKRIQDAVNDEGS 268 (517) Q Consensus 202 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~ 268 (517) ........+.+..... .................+. ..........+++.+|+.+...|++.+++.++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:47 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhh Confidence 0000000000000000 0000000000000000000 01111122345778999999999999999999 Q ss_pred hhhceeeeccccc--eee--eecccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHH Q lcl|Aclame:pro 269 LLPFIRHENLPTL--VVG--GDNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTY 343 (517) Q Consensus 269 ~~~~~~~~~~~~~--~~~--~~~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~ 343 (517) +++++++.++++. .++ .......+.|+.||++.|+ +.++|+.+++.++++++++++|++++.|+.++ |++| T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~ 226 (415) T protein:vir:47 151 LDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----VLQE 226 (415) T ss_pred hhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHH----HHHH Confidence 9999988776543 233 3355567889999999997 56899999999999999999999999998875 8999 Q ss_pred HHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHH Q lcl|Aclame:pro 344 VMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRF 422 (517) Q Consensus 344 i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~ 422 (517) |.++|+++++++++.+||+|+|++.+..++............++..+++++++++.... .++.+++|||||++|.+|++ T Consensus 227 i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:47 227 LKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999999988766665554445555556667788888887764 45678999999999999999 Q ss_pred hhcCCCCEeccCCCCCCccceecCccceecc-ccCCc-----eeeeecCc-eEEEeeeheeehhhhhcccchHHHHHhhh Q lcl|Aclame:pro 423 LKDKNGNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE-----KTAVSLSG-YVTNGSRGMEFEQGTILVENNKEYLFEMP 495 (517) Q Consensus 423 lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~-----~~~~~~~~-~~~~~~~~~~~~~d~~~~~n~~~~~~~~r 495 (517) |||++|||||++++.++.+.+++|.++++.+ .+.+. ...++++. |.+..+.++... ..++.++...+++..| T Consensus 307 lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~-~~~~~~~~~~~~~~~r 385 (415) T protein:vir:47 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS-WTDYMHFGECLMIAVR 385 (415) T ss_pred hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE-eeccccCceEEEEEEE Confidence 9999999999999989888899998765543 12222 23345665 445555555432 1123455667889999 Q ss_pred hcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 496 ISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 496 vgg~v~~~~a~~~~~~tp~~a~ 517 (517) +++.+.+|+||++++++++++| T Consensus 386 ~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:47 386 QDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred eccEEeccccEEEEEeeccCCC Confidence 9999999999999999999998 No 11 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.7e-47 Score=275.90 Aligned_cols=373 Identities=10% Similarity=0.095 Sum_probs=225.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHH-------HhhHHHhhh- Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMK-------QRESEKILG- 201 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~-------~~~~~~~~~- 201 (517) +...++.. ..+.++ .+.+....+ ++...+.+...+..+...+.+..+..++.+ .+....... T Consensus 1 mk~~~em~-~~l~el---~~~~~~~~~------e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:46 1 MKTKEELQ-SEISDI---KRQIDLKVK------YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSEN 70 (415) T ss_pred CchHHHHH-HHHHHH---HHHHHHHHH------HHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 11111111 111111 010000000 000000000000001111111111111111 111000000 Q ss_pred hhhhhhhhhhhhHHHH---HHHHHHHHhhccchhhHHH----------HhhhhhcccccccccchhhhhhHHHhHhhhhh Q lcl|Aclame:pro 202 VEALKVTPEATEFLKT---REAEVAYMSASLTKDPKAA----------WTAELKERGISGMPAPAGILKRIQDAVNDEGS 268 (517) Q Consensus 202 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~ 268 (517) ........+.+..... .................+. ..........+++.+|+.+...|++.+++.++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:46 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhh Confidence 0000000000000000 0000000000000000000 01111122345778999999999999999999 Q ss_pred hhhceeeeccccc--eee--eecccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHH Q lcl|Aclame:pro 269 LLPFIRHENLPTL--VVG--GDNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTY 343 (517) Q Consensus 269 ~~~~~~~~~~~~~--~~~--~~~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~ 343 (517) +++++++.++++. .++ .......+.|+.||++.|+ +.++|+.+++.++++++++++|++++.|+.++ |++| T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~ 226 (415) T protein:vir:46 151 LDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----VLQE 226 (415) T ss_pred hhhhcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHH----HHHH Confidence 9999988776543 233 3355567889999999997 56899999999999999999999999998875 8999 Q ss_pred HHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHH Q lcl|Aclame:pro 344 VMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRF 422 (517) Q Consensus 344 i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~ 422 (517) |.++|+++++++++.+||+|+|++.+..++............++..+++++++++.... .++.+++|||||++|.+|++ T Consensus 227 i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:46 227 LKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999999988766665554445555556667788888887764 45678999999999999999 Q ss_pred hhcCCCCEeccCCCCCCccceecCccceecc-ccCCc-----eeeeecCc-eEEEeeeheeehhhhhcccchHHHHHhhh Q lcl|Aclame:pro 423 LKDKNGNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE-----KTAVSLSG-YVTNGSRGMEFEQGTILVENNKEYLFEMP 495 (517) Q Consensus 423 lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~-----~~~~~~~~-~~~~~~~~~~~~~d~~~~~n~~~~~~~~r 495 (517) |||++|||||++++.++.+.+++|.++++.+ .+.+. ...++++. |.+..+.++... ..++.++...+++..| T Consensus 307 lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~-~~~~~~~~~~~~~~~r 385 (415) T protein:vir:46 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS-WTDYMHFGECLMIAVR 385 (415) T ss_pred hhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE-eeccccCceEEEEEEE Confidence 9999999999999989888899998765543 12222 23345665 445555555432 1123455667889999 Q ss_pred hcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 496 ISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 496 vgg~v~~~~a~~~~~~tp~~a~ 517 (517) +++.+.+|+||++++++++++| T Consensus 386 ~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:46 386 QDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred eccEEeccccEEEEEeeccCCC Confidence 9999999999999999999998 No 12 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=2.2e-47 Score=276.36 Aligned_cols=373 Identities=13% Similarity=0.161 Sum_probs=232.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHH------ Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESE------ 197 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~------ 197 (517) ++ +..++++.+...++++.. ..+. +.....+++..++.+++.+ +..++.++.+.+... T Consensus 1 M~----l~eL~e~r~~l~~e~~~l---~~k~-~~~~~t~e~~~~~~~~~~e--------~~~l~~~i~~~e~~~~~~~~~ 64 (409) T protein:vir:45 1 MK----LHELKQKRNTIATDMRAL---NEKI-GDNAWTEEQRTEWNKAKSE--------LEALDERIAREEELRRQDQAY 64 (409) T ss_pred CC----HHHHHHHHHHHHHHHHHH---HHHh-hcCCCCHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHH Confidence 00 111111111111111111 1100 0000111111222221111 111111111110000 Q ss_pred -Hh-hhhhhhhhhhh--hhhHHHHHHHHHHHHhh---ccchhhHHHHh----hhhhcccccccccchhhhhhHHHhHhhh Q lcl|Aclame:pro 198 -KI-LGVEALKVTPE--ATEFLKTREAEVAYMSA---SLTKDPKAAWT----AELKERGISGMPAPAGILKRIQDAVNDE 266 (517) Q Consensus 198 -~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~---~~~~~~~~~~~----~~~~~~~~~~~~vp~~i~~~i~~~~~~~ 266 (517) .. .........++ ........+.+..+... ....+.++... ........+++.+|+.+...|++.++.. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~ 144 (409) T protein:vir:45 65 IESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSY 144 (409) T ss_pred HhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhh Confidence 00 00000000000 00011111122222211 11112222211 1223334557899999999999999999 Q ss_pred hhhhhceeeeccccc--e-eeeecc-cccceeeecccccccccccceeeEeeHhhh-hHhHhhhHHHHHHhhcccHHHHH Q lcl|Aclame:pro 267 GSLLPFIRHENLPTL--V-VGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYV-YKYIKLPKIVMNSNATDIAGAIL 341 (517) Q Consensus 267 ~~~~~~~~~~~~~~~--~-~~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~-~~~~~iS~~li~d~~~d~~~~l~ 341 (517) +++++++++.++.+. . ++.... ...+.|++||+.+|+++++|..+++.++++ ++++++|++++.|+.++ |+ T Consensus 145 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~----l~ 220 (409) T protein:vir:45 145 GGIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAID----ME 220 (409) T ss_pred hhhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHH----HH Confidence 999999998877543 2 233222 345679999999999999999999999877 47899999999998765 99 Q ss_pred HHHHHHHHHHHHHHHHhhhhcccccCc--ccccccccccccccccccccccHHHHHHHHHHhhhhh-cCCE--EEEcHHH Q lcl|Aclame:pro 342 TYVMNRLPDMVIMAVNRAIIMGGVTGV--SETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKA-ADST--LVIHRND 416 (517) Q Consensus 342 ~~i~~~l~~~~~~~~e~~~l~G~G~~~--~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~-~~a~--~vmn~~~ 416 (517) +||.++|++++.++++.+||+|+|++. .++||++..+.......+...+.+++++++......+ .++. |+||+.+ T Consensus 221 ~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~ 300 (409) T protein:vir:45 221 AYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNT 300 (409) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHH Confidence 999999999999999999999999874 3578888776555555555566788888877665544 4565 4779999 Q ss_pred HHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc-----e--eeeecCceEEEeeehe--eehhhhhcccch Q lcl|Aclame:pro 417 LAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE-----K--TAVSLSGYVTNGSRGM--EFEQGTILVENN 487 (517) Q Consensus 417 ~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~-----~--~~~~~~~~~~~~~~~~--~~~~d~~~~~n~ 487 (517) |.+|++|||++|||||++++..+.+.+++|.|++ ++..++. . ..++++.|++..+.++ ....+.++.+++ T Consensus 301 ~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~-~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~ 379 (409) T protein:vir:45 301 LKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYV-IDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQ 379 (409) T ss_pred HHHHHHhhcCCCceeeccCcCCCCCceecceeeE-EecCcCCccCCccEEEEeehhhhheeeccceEEEEeecccccCCc Confidence 9999999999999999999999988899997554 4444443 1 2245667776655443 345566678899 Q ss_pred HHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 488 KEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 488 ~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +.|++..|+++.+.+|+||+++++.++++| T Consensus 380 ~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 380 TGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EEEEEEEEeccEeechhheEEEEeccCCCC Confidence 999999999999999999999999999999 No 13 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=5.1e-47 Score=274.40 Aligned_cols=376 Identities=12% Similarity=0.078 Sum_probs=223.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHH-------- Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKL----AADLNAKLKERENGGDNAALKTVSELAANLMKQRESEK-------- 198 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~----~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~-------- 198 (517) .++++..+...++.++....++...+..+. ..+..++++++..+... ..++..+++........... T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~-l~~~i~~le~~~~~~~~~~~~~~~~~~~ 79 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQT-ISEELAKLEEKEKEEDPAKKKDDDPEKK 79 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhhcchhhhh Confidence 223232222222222221111111110000 00000111111100000 00000011000000000000 Q ss_pred --hhhhhhhhh----hhhhhhHH-------------------HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccch Q lcl|Aclame:pro 199 --ILGVEALKV----TPEATEFL-------------------KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPA 253 (517) Q Consensus 199 --~~~~~~~~~----~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~ 253 (517) ......... ..+.+... ..+..+..+..+........+. ....+.+++++|+ T Consensus 80 ~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~---~~~t~~GG~lvP~ 156 (434) T protein:vir:62 80 EDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARAL---GLVTGNGSVTIPD 156 (434) T ss_pred cchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhh---cccccccceecch Confidence 000000000 00000000 0000111111111111000110 1122456889999 Q ss_pred hhhhhHHHhHhhhhhhhhceeeeccccc-eeeeecccccceee---ecccccccccccceeeEeeHhhhhHhHhhhHHHH Q lcl|Aclame:pro 254 GILKRIQDAVNDEGSLLPFIRHENLPTL-VVGGDNALTQGTGH---TTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVM 329 (517) Q Consensus 254 ~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~---~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li 329 (517) .+.+.|++.++..+++++++++.+..+. .+|.......+.|. +++...|+++++|+++++.++++++++++|++++ T Consensus 157 ~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell 236 (434) T protein:vir:62 157 FLSKEIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLL 236 (434) T ss_pred hhHHHHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHH Confidence 9999999999999999999988776543 56666655666665 4577888999999999999999999999999999 Q ss_pred HHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCC Q lcl|Aclame:pro 330 NSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADS 408 (517) Q Consensus 330 ~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a 408 (517) .|+.++ |++||.++|+++++.+++.+||+|+|++++..|++...+.. ...+.+...+++++++..... +..++ T Consensus 237 ~ds~~~----l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~--~~~~~~~~~d~l~~l~~~l~~~~~~~a 310 (434) T protein:vir:62 237 ARTGLP----IEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVE--FKTDEKNLYDALVKMKNTPVKEVRKKA 310 (434) T ss_pred hcchHH----HHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeeccccc--ccccccchhhHHHHHHhhcchhhhcCC Confidence 998776 99999999999999999999999999999887877655432 223334456778877766543 44678 Q ss_pred EEEEcHHHHHHHHHhhcCCCCEeccCC--CCCCccceecCccceeccccCCc--------eeeeecCceEEEeeeh---e Q lcl|Aclame:pro 409 TLVIHRNDLAAIRFLKDKNGNYVFPVG--VSNQTIATHFGFNRLVQSVAVDE--------KTAVSLSGYVTNGSRG---M 475 (517) Q Consensus 409 ~~vmn~~~~~~l~~lKD~~Gryl~~~~--~~~~~~~~l~g~~~v~~~~~~~~--------~~~~~~~~~~~~~~~~---~ 475 (517) +|||||.+|.+|++|||++|||||++. ...+.+.+++|.++++. ..++. ...++|+.|.++++.+ + T Consensus 311 ~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~-~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i 389 (434) T protein:vir:62 311 RWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEE-DAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEV 389 (434) T ss_pred EEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEe-cCccCccCCCceEEEEeeccceEEEEeeceeEE Confidence 999999999999999999999999874 34566778999766544 33322 2235778888777765 3 Q ss_pred eehhhhhcccchHHHHHhhhhcceee-cccceEEE--EeCCCCCC Q lcl|Aclame:pro 476 EFEQGTILVENNKEYLFEMPISGSLE-YKGTTAYG--TYTPPVAG 517 (517) Q Consensus 476 ~~~~d~~~~~n~~~~~~~~rvgg~v~-~~~a~~~~--~~tp~~a~ 517 (517) ....+.++.++++.|++..|++|.+. .|.+.++. +.++|++| T Consensus 390 ~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 390 QKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred EeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 44556667899999999999999855 58888755 45799999 No 14 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=4.8e-47 Score=274.52 Aligned_cols=372 Identities=11% Similarity=0.060 Sum_probs=227.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH----HHhh Q lcl|Aclame:pro 125 NKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES----EKIL 200 (517) Q Consensus 125 ~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~----~~~~ 200 (517) -+..++..++++.+....++......... ..+.+.+.+..+........++.++.+..+. .... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~------------~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~ 68 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAG------------KEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVT 68 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhc------------ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333444444333333222222111100 0011111111111111111122111111000 0000 Q ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHH----hhhhhcccccccccchhhhhhHHH-hHhhhhhhhhceee Q lcl|Aclame:pro 201 GVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAW----TAELKERGISGMPAPAGILKRIQD-AVNDEGSLLPFIRH 275 (517) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~vp~~i~~~i~~-~~~~~~~~~~~~~~ 275 (517) .......................+...+...+.+... ...... ...+..+|+.+...++. .+...++++.++++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~-~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~ 147 (392) T protein:vir:13 69 SLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTK-AGNPNVLSRTLYGQLIAQAVERSAIMRGGAST 147 (392) T ss_pred HHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccc-cCCCccccccchHHHHHHHHhhhhhhhhccee Confidence 0000000000000000000111111111111111110 111111 22234456666666554 45566677777776 Q ss_pred ecccc---ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 276 ENLPT---LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMV 352 (517) Q Consensus 276 ~~~~~---~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~ 352 (517) .+..+ ..+|.......+.|++||+.+|+++++|+++++.++++++++++|++++.|+.++ |++||.++|++++ T Consensus 148 ~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i 223 (392) T protein:vir:13 148 FTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLD----LVGFLVSDAGPAI 223 (392) T ss_pred eecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHH----HHHHHHHHHHHHH Confidence 65432 4566777778899999999999999999999999999999999999999998775 8999999999999 Q ss_pred HHHHHhhhhcccccCcccccccccccccccc---cccccccHHHHHHHHHHhhhh-hcCCEEEEcHHHHHHHHHhhcCCC Q lcl|Aclame:pro 353 IMAVNRAIIMGGVTGVSETQIYPVVGDAWAT---NVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKDKNG 428 (517) Q Consensus 353 ~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~---~~~~~~~~d~l~~~l~~~~~~-~~~a~~vmn~~~~~~l~~lKD~~G 428 (517) +++++.+||+|+|+++| .||++..+..... ..+...+.+++++.+...... ..+++|||||++|..|++|||++| T Consensus 224 ~~~~d~~~l~G~Gt~~p-~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G 302 (392) T protein:vir:13 224 GDAMGRHFLTGTGTGQP-RGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANG 302 (392) T ss_pred HHHHHHHHhcccCCccc-cccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCC Confidence 99999999999999876 6888776533222 222344567888877665544 457899999999999999999999 Q ss_pred CEeccCCCCCCccceecCccceeccccCCcee--eeecCceEEEeeeheee--hhhhhcccchHHHHHhhhhcceeeccc Q lcl|Aclame:pro 429 NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKT--AVSLSGYVTNGSRGMEF--EQGTILVENNKEYLFEMPISGSLEYKG 504 (517) Q Consensus 429 ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~--~~d~~~~~n~~~~~~~~rvgg~v~~~~ 504 (517) ||||+++.+.+.+.+++|+|++ ++..++... .++|+.|++..+.++.. ..+..+.+|++.|++..|+|+.+.+|+ T Consensus 303 ~~l~~~~~~~g~~~~l~G~Pv~-~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~ 381 (392) T protein:vir:13 303 QYLWQSALTVGAPDTFNGKVVE-TDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDAR 381 (392) T ss_pred ceeecCCcCCCCCceecceeeE-EcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEeccc Confidence 9999999999988899997654 445555543 46778888877766644 456678999999999999999999999 Q ss_pred ceEEEEeCCCC Q lcl|Aclame:pro 505 TTAYGTYTPPV 515 (517) Q Consensus 505 a~~~~~~tp~~ 515 (517) ||+..+++++= T Consensus 382 A~~~~~~~~aa 392 (392) T protein:vir:13 382 GAKVLTVTPAA 392 (392) T ss_pred ceEEEEeeccC Confidence 99999997765 No 15 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.6e-46 Score=271.72 Aligned_cols=390 Identities=13% Similarity=0.129 Sum_probs=227.4 Q ss_pred ehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhhhhhhhhhhhhhHHHHH---HHhhhhhhh Q lcl|Aclame:pro 112 CILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFD---QNLMQELLDAKKLAADLNAKLKERENGGD---NAALKTVSE 185 (517) Q Consensus 112 ~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~---~~~~~~~~e~~~~~~e~~a~l~~~~~~~~---e~~~~~~~~ 185 (517) ..|.++- -...+...+++..+.+++.++. ...+.+..+..+ .+++...+++.-...+ ....+...+ T Consensus 1 ~~~~~~~-------~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak-~eee~~~l~~ei~~le~e~~~l~~~~~~ 72 (425) T protein:vir:95 1 MALRQLM-------LTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQ-TEEEVSAVEEEVAKLEDERNELNEKKSK 72 (425) T ss_pred CchHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111110 0011111111111111111000 000000000000 0000000000000000 000011111 Q ss_pred HHHHHHHHhhHHHhhhhhhhhhhh-hh-----hhHHH-HHHHHHHHH------hhccchhhHHHHhhhhhcccccccccc Q lcl|Aclame:pro 186 LAANLMKQRESEKILGVEALKVTP-EA-----TEFLK-TREAEVAYM------SASLTKDPKAAWTAELKERGISGMPAP 252 (517) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~-~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~vp 252 (517) ++.+....+............... .. ..... ......... ......++...... ......+++.+| T Consensus 73 le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~~vP 151 (425) T protein:vir:95 73 LEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRN-LRAVAGGELTIP 151 (425) T ss_pred HHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHh-hcccccCceecc Confidence 111111111100000000000000 00 00000 000000000 00111111111111 122234678999 Q ss_pred hhhhhhHHHhHhhhhhhhhceeeecccc-ceeeeecccccceeeeccccccccc-ccceeeEeeHhhhhHhHhhhHHHHH Q lcl|Aclame:pro 253 AGILKRIQDAVNDEGSLLPFIRHENLPT-LVVGGDNALTQGTGHTTGTDKTESN-ITLQTRVLTPQYVYKYIKLPKIVMN 330 (517) Q Consensus 253 ~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~eg~~~~~~~-~~f~~~~~~~~~~~~~~~iS~~li~ 330 (517) ..+.+.|++.++..+++++++++.++++ ..+|+....+.+.|+.||+..|+++ ++|+++++.++++++++++|++++. T Consensus 152 ~~~~~~Ii~~l~~~~~i~~~~~~~~~~g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ 231 (425) T protein:vir:95 152 EVVVNRIMDIMGDYTTLYPLVDKIRVKGTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQ 231 (425) T ss_pred HHHHHHHHHHHHhhhhHHHhhceeecCceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHh Confidence 9999999999999999999999888754 4577778888899999999999877 6899999999999999999999999 Q ss_pred HhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc-cccccccccccccc-cccccccHHHHHHHHHHhhhh---h Q lcl|Aclame:pro 331 SNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS-ETQIYPVVGDAWAT-NVTGTTNIQELLEKLSVATPK---A 405 (517) Q Consensus 331 d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~-~~gi~~~~~~~~~~-~~~~~~~~d~l~~~l~~~~~~---~ 405 (517) |+..+ |++||.++|++++++++|.+||+|+|++++ +.||++..+..... ..+...+.+++...+...... . T Consensus 232 ds~~~----l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (425) T protein:vir:95 232 DSIIN----LDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSV 307 (425) T ss_pred ccHHH----HHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhcccc Confidence 88765 999999999999999999999999998754 46888765443322 223444566676665543322 3 Q ss_pred cCCEEEEcHHHH----HHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceee--eecCceEEEeeehee--e Q lcl|Aclame:pro 406 ADSTLVIHRNDL----AAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTA--VSLSGYVTNGSRGME--F 477 (517) Q Consensus 406 ~~a~~vmn~~~~----~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~--~~~~~~~~~~~~~~~--~ 477 (517) .+++|+||+.+| ..|+++||++|||||+.. .+...++||.+ ++.++.+++..+ ++++.|.++.+.++. . T Consensus 308 ~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~p-vv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~ 384 (425) T protein:vir:95 308 GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLR-VVFNNFLDDDTVLFGEFEQYTLVERENITIDS 384 (425) T ss_pred CceEEEEeChHHHHHHHHHHhhcCCCCceeeccC--CCCCcccccee-eEEcCcCCCccEEEEecccEEEEeecceEEEe Confidence 467899999985 357889999999999754 33445788765 456666666544 567788888776644 4 Q ss_pred hhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 478 EQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 478 ~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .++..+.++++.|++..|++|.+.+|+||+++++|+|++| T Consensus 385 ~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g 424 (425) T protein:vir:95 385 STHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQG 424 (425) T ss_pred ecccccccCceEEEEEEeeCcEeecccceEEEEecCcCCC Confidence 5566788999999999999999999999999999999999 No 16 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.5e-46 Score=271.76 Aligned_cols=369 Identities=14% Similarity=0.118 Sum_probs=227.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhh---HHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh--hhhh Q lcl|Aclame:pro 137 KKKEENKMTFDQNLMQELLDAKKLA----ADLNAKLKER---ENGGDNAALKTVSELAANLMKQRESEKILGVE--ALKV 207 (517) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~e~~~~~----~e~~a~l~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 207 (517) +.+..+++++.+..+.+..+..+.. .+....+++. .++..+.......+++.++.+.+......... .... T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEA 80 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccch Confidence 1111112222211111111111100 0000001000 00111111111122222222211111110000 0000 Q ss_pred hhhhhhHH---HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ce Q lcl|Aclame:pro 208 TPEATEFL---KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LV 282 (517) Q Consensus 208 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~ 282 (517) .+...+.. ...+....+............ ......+..+..+|+.+...|++.++..+++++++++.++++ .. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 158 (395) T protein:vir:43 81 PKTAGQMVAESLKEQGVTSSLRGSHRVSMPRS--AITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVE 158 (395) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhh--hhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceE Confidence 01111110 111111111111111111111 111223345678899999999999999999999999887765 45 Q ss_pred eeeecc-cccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 283 VGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII 361 (517) Q Consensus 283 ~~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l 361 (517) +++... ...+.|++||+.+|+++++|+++++.++++++++++|++++.|+ +.|++||.++|+++++.+++.+|| T Consensus 159 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-----~~l~~~v~~~la~a~~~~~d~~~l 233 (395) T protein:vir:43 159 YVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA-----SALQSYIDARARYGLMLVEECQLL 233 (395) T ss_pred EEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHHH Confidence 666554 35789999999999999999999999999999999999998765 348999999999999999999999 Q ss_pred cccccCccccccccccccccccc---ccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCC Q lcl|Aclame:pro 362 MGGVTGVSETQIYPVVGDAWATN---VTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVS 437 (517) Q Consensus 362 ~G~G~~~~~~gi~~~~~~~~~~~---~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~ 437 (517) +|+|++.++.||++..+...... .......+++..++..... ++.+++|||||.+|.+|+++||++|||||++ +. T Consensus 234 ~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~-~~ 312 (395) T protein:vir:43 234 YGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGS-PQ 312 (395) T ss_pred hccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccc-cc Confidence 99999999999998776433222 2223345667776665543 4467899999999999999999999999975 45 Q ss_pred CCccceecCccceeccccCCcee--eeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhcceeecccceEEEE Q lcl|Aclame:pro 438 NQTIATHFGFNRLVQSVAVDEKT--AVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPISGSLEYKGTTAYGT 510 (517) Q Consensus 438 ~~~~~~l~g~~~v~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~ 510 (517) .+...+++|.+ |+++..+++.. .++++. |.+.++.++... .+.++++|++.|+++.|+++.+++|+||++++ T Consensus 313 ~~~~~~l~G~p-Vv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~ 391 (395) T protein:vir:43 313 NGTTPTLWRLP-VVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGS 391 (395) T ss_pred cCCCceeccee-eEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEE Confidence 66677888865 55566776554 456665 445554454322 23357899999999999999999999999999 Q ss_pred eCCC Q lcl|Aclame:pro 511 YTPP 514 (517) Q Consensus 511 ~tp~ 514 (517) +|++ T Consensus 392 ~taa 395 (395) T protein:vir:43 392 LTAS 395 (395) T ss_pred eccC Confidence 9999 No 17 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=8.8e-47 Score=273.08 Aligned_cols=371 Identities=14% Similarity=0.115 Sum_probs=225.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (517) ++-.-.+..++++..+...+++...+.+.+..+...... .+++++..+..+ ..+....+..++...+......... T Consensus 1 m~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---ee~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSA---EAMSELKNKRDN-EKVRRDALREQLVEAQAEQVVNMRE 76 (408) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccc Confidence 333333444444333333333322222211111100000 011111111111 1111111111211111111000000 Q ss_pred hh--hhhhhhhh-HHHHHHHHHHHHhhccch-hhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 204 AL--KVTPEATE-FLKTREAEVAYMSASLTK-DPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 204 ~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) .. ........ .....+++..+....... ...............+++.+|+.+...|++.+++.+++++++++.+++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 156 (408) T protein:vir:10 77 EEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) T ss_pred ccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeecc Confidence 00 00000000 011111222222221110 011111112223345688999999999999999999999999987765 Q ss_pred cc--e--eeeec-ccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 TL--V--VGGDN-ALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI 353 (517) Q Consensus 280 ~~--~--~~~~~-~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~ 353 (517) +. . ++... ....+.|++||+.+|++ .++|+++++.++++++++++|+++++|+.++ |++||.++|+++++ T Consensus 157 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~ 232 (408) T protein:vir:10 157 TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAEN----ILAWLSSWIAKKVV 232 (408) T ss_pred CCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHH Confidence 42 2 22232 33567899999999975 5899999999999999999999999998776 89999999999999 Q ss_pred HHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCCEe Q lcl|Aclame:pro 354 MAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGNYV 431 (517) Q Consensus 354 ~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gryl 431 (517) .+++.+||+|+|++++..+ ..+.++++++++... .+..+++|||||++|.+|+++||++|||| T Consensus 233 ~~~~~~il~g~g~~~~~~~---------------~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i 297 (408) T protein:vir:10 233 VTRNQAIIEVMKAAPKKPT---------------IAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYL 297 (408) T ss_pred HHHHHHHhhcccccccccc---------------cccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceE Confidence 9999999999998765322 234667887775433 34567899999999999999999999999 Q ss_pred ccCCCCCCccceecCccceecc-ccCCce-------eeeecCc-eEEEeeeheeehhh----hhcccchHHHHHhhhhcc Q lcl|Aclame:pro 432 FPVGVSNQTIATHFGFNRLVQS-VAVDEK-------TAVSLSG-YVTNGSRGMEFEQG----TILVENNKEYLFEMPISG 498 (517) Q Consensus 432 ~~~~~~~~~~~~l~g~~~v~~~-~~~~~~-------~~~~~~~-~~~~~~~~~~~~~d----~~~~~n~~~~~~~~rvgg 498 (517) |++++..+.+.+++|.|+++++ .+++.. ..++++. |.+..+.++....+ ..+.+|++.|+++.|+++ T Consensus 298 ~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:10 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV 377 (408) T ss_pred eccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeecc Confidence 9999999989999998776643 234432 2345665 45555666553222 236789999999999999 Q ss_pred eeecccceEEEEeCCCCCC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 499 ~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+|++|++++++++... T Consensus 378 ~v~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 378 KATDSEALVAGSFSAIADQ 396 (408) T ss_pred EEeccccEEEEEeeccccC Confidence 9999999999999885333 No 18 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=2.4e-46 Score=270.72 Aligned_cols=372 Identities=10% Similarity=0.089 Sum_probs=225.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH-------HHh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES-------EKI 199 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~-------~~~ 199 (517) .-.++.++++..+...++..... +....+.+.+.+..+...+....++.++.+.+.. ... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~-------------e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~ 67 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVK-------------YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGT 67 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHH-------------HHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 11111111111111111111110 0111111111111111111111111111111110 000 Q ss_pred hhh-hhhh---hhhhhhhHHHHHHHHHHHHhhccchhhHHHHh----------hhhhcccccccccchhhhhhHHHhHhh Q lcl|Aclame:pro 200 LGV-EALK---VTPEATEFLKTREAEVAYMSASLTKDPKAAWT----------AELKERGISGMPAPAGILKRIQDAVND 265 (517) Q Consensus 200 ~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~vp~~i~~~i~~~~~~ 265 (517) ... .... +...........................+.+. ........+++.+|..+...|++.+++ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:98 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 000 0000 00000000000000000000000000011110 111122345788999999999999999 Q ss_pred hhhhhhceeeeccccc--e--eeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHH Q lcl|Aclame:pro 266 EGSLLPFIRHENLPTL--V--VGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAI 340 (517) Q Consensus 266 ~~~~~~~~~~~~~~~~--~--~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l 340 (517) .+++++++++.++++. . ++.......+.|+.||++.|++ .++|+++++.++++++++++|++++.|+.++ | T Consensus 148 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l 223 (415) T protein:vir:98 148 EFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----V 223 (415) T ss_pred hhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHH----H Confidence 9999999998876543 2 3444566778899999999875 5899999999999999999999999998775 8 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHH Q lcl|Aclame:pro 341 LTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAA 419 (517) Q Consensus 341 ~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~ 419 (517) ++||.++|+++++++++.+|++|+|++++..++..........+..+..+++++++++.... .++.+++|||||++|.+ T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~ 303 (415) T protein:vir:98 224 LQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHH Confidence 99999999999999999999999999988766666555555555566677888888887654 44568899999999999 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCccceeccc-cCCc-----eeeeecCc-eEEEeeeheeeh-hhhhcccchHHHH Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AVDE-----KTAVSLSG-YVTNGSRGMEFE-QGTILVENNKEYL 491 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~~~-----~~~~~~~~-~~~~~~~~~~~~-~d~~~~~n~~~~~ 491 (517) |++|||++|||||++++.++.+.+++|.+++..+. +.+. ...++|+. |++..+.++... .++ ..+...++ T Consensus 304 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--~~~~~~~~ 381 (415) T protein:vir:98 304 LDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--MHFGECLM 381 (415) T ss_pred HHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--ccCceEEE Confidence 99999999999999999998889999987655432 2222 22345565 445555555432 222 34455678 Q ss_pred HhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 492 FEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 492 ~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +..|+++.+.+|+||+++++++++.| T Consensus 382 ~~~r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:98 382 IAVRQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEeccEEeccccEEEEEEeccCCC Confidence 89999999999999999999999888 No 19 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=2.4e-46 Score=270.72 Aligned_cols=372 Identities=10% Similarity=0.089 Sum_probs=225.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH-------HHh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES-------EKI 199 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~-------~~~ 199 (517) .-.++.++++..+...++..... +....+.+.+.+..+...+....++.++.+.+.. ... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~-------------e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~ 67 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVK-------------YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGT 67 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHH-------------HHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 11111111111111111111110 0111111111111111111111111111111110 000 Q ss_pred hhh-hhhh---hhhhhhhHHHHHHHHHHHHhhccchhhHHHHh----------hhhhcccccccccchhhhhhHHHhHhh Q lcl|Aclame:pro 200 LGV-EALK---VTPEATEFLKTREAEVAYMSASLTKDPKAAWT----------AELKERGISGMPAPAGILKRIQDAVND 265 (517) Q Consensus 200 ~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~vp~~i~~~i~~~~~~ 265 (517) ... .... +...........................+.+. ........+++.+|..+...|++.+++ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:81 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 000 0000 00000000000000000000000000011110 111122345788999999999999999 Q ss_pred hhhhhhceeeeccccc--e--eeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHH Q lcl|Aclame:pro 266 EGSLLPFIRHENLPTL--V--VGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAI 340 (517) Q Consensus 266 ~~~~~~~~~~~~~~~~--~--~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l 340 (517) .+++++++++.++++. . ++.......+.|+.||++.|++ .++|+++++.++++++++++|++++.|+.++ | T Consensus 148 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l 223 (415) T protein:vir:81 148 EFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----V 223 (415) T ss_pred hhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHH----H Confidence 9999999998876543 2 3444566778899999999875 5899999999999999999999999998775 8 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHH Q lcl|Aclame:pro 341 LTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAA 419 (517) Q Consensus 341 ~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~ 419 (517) ++||.++|+++++++++.+|++|+|++++..++..........+..+..+++++++++.... .++.+++|||||++|.+ T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~ 303 (415) T protein:vir:81 224 LQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHH Confidence 99999999999999999999999999988766666555555555566677888888887654 44568899999999999 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCccceeccc-cCCc-----eeeeecCc-eEEEeeeheeeh-hhhhcccchHHHH Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AVDE-----KTAVSLSG-YVTNGSRGMEFE-QGTILVENNKEYL 491 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~~~-----~~~~~~~~-~~~~~~~~~~~~-~d~~~~~n~~~~~ 491 (517) |++|||++|||||++++.++.+.+++|.+++..+. +.+. ...++|+. |++..+.++... .++ ..+...++ T Consensus 304 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--~~~~~~~~ 381 (415) T protein:vir:81 304 LDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--MHFGECLM 381 (415) T ss_pred HHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--ccCceEEE Confidence 99999999999999999998889999987655432 2222 22345565 445555555432 222 34455678 Q ss_pred HhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 492 FEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 492 ~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +..|+++.+.+|+||+++++++++.| T Consensus 382 ~~~r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:81 382 IAVRQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEeccEEeccccEEEEEEeccCCC Confidence 89999999999999999999999888 No 20 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=2.4e-46 Score=270.72 Aligned_cols=372 Identities=10% Similarity=0.089 Sum_probs=225.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH-------HHh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES-------EKI 199 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~-------~~~ 199 (517) .-.++.++++..+...++..... +....+.+.+.+..+...+....++.++.+.+.. ... T Consensus 1 mk~~~el~~~l~el~~~~~~~~~-------------e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~ 67 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVK-------------YATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGT 67 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHH-------------HHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 11111111111111111111110 0111111111111111111111111111111110 000 Q ss_pred hhh-hhhh---hhhhhhhHHHHHHHHHHHHhhccchhhHHHHh----------hhhhcccccccccchhhhhhHHHhHhh Q lcl|Aclame:pro 200 LGV-EALK---VTPEATEFLKTREAEVAYMSASLTKDPKAAWT----------AELKERGISGMPAPAGILKRIQDAVND 265 (517) Q Consensus 200 ~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~vp~~i~~~i~~~~~~ 265 (517) ... .... +...........................+.+. ........+++.+|..+...|++.+++ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:79 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 000 0000 00000000000000000000000000011110 111122345788999999999999999 Q ss_pred hhhhhhceeeeccccc--e--eeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHH Q lcl|Aclame:pro 266 EGSLLPFIRHENLPTL--V--VGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAI 340 (517) Q Consensus 266 ~~~~~~~~~~~~~~~~--~--~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l 340 (517) .+++++++++.++++. . ++.......+.|+.||++.|++ .++|+++++.++++++++++|++++.|+.++ | T Consensus 148 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l 223 (415) T protein:vir:79 148 EFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----V 223 (415) T ss_pred hhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHH----H Confidence 9999999998876543 2 3444566778899999999875 5899999999999999999999999998775 8 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHH Q lcl|Aclame:pro 341 LTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAA 419 (517) Q Consensus 341 ~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~ 419 (517) ++||.++|+++++++++.+|++|+|++++..++..........+..+..+++++++++.... .++.+++|||||++|.+ T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~ 303 (415) T protein:vir:79 224 LQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHH Confidence 99999999999999999999999999988766666555555555566677888888887654 44568899999999999 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCccceeccc-cCCc-----eeeeecCc-eEEEeeeheeeh-hhhhcccchHHHH Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AVDE-----KTAVSLSG-YVTNGSRGMEFE-QGTILVENNKEYL 491 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~~~-----~~~~~~~~-~~~~~~~~~~~~-~d~~~~~n~~~~~ 491 (517) |++|||++|||||++++.++.+.+++|.+++..+. +.+. ...++|+. |++..+.++... .++ ..+...++ T Consensus 304 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~--~~~~~~~~ 381 (415) T protein:vir:79 304 LDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY--MHFGECLM 381 (415) T ss_pred HHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc--ccCceEEE Confidence 99999999999999999998889999987655432 2222 22345565 445555555432 222 34455678 Q ss_pred HhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 492 FEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 492 ~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +..|+++.+.+|+||+++++++++.| T Consensus 382 ~~~r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:79 382 IAVRQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEeccEEeccccEEEEEEeccCCC Confidence 89999999999999999999999888 No 21 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=5.3e-46 Score=268.79 Aligned_cols=368 Identities=13% Similarity=0.060 Sum_probs=231.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH---HhhhhhhhHHHHHHHHhhHHH Q lcl|Aclame:pro 122 NPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDN---AALKTVSELAANLMKQRESEK 198 (517) Q Consensus 122 ~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e---~~~~~~~~~~~~~~~~~~~~~ 198 (517) .|+.-..++..++++.+....+++..... ...+ +.+....+++++..+.++ ..................... T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~--~~~e---e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQE--GNTD---EARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQER 75 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhh--hhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Confidence 66666666666665554444443322110 0001 111111122221111111 110000000000000000000 Q ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhh-HHHH------hhhhhcccccccccchhhhhhHHHhHhhhhhhhh Q lcl|Aclame:pro 199 ILGVEALKVTPEATEFLKTREAEVAYMSASLTKDP-KAAW------TAELKERGISGMPAPAGILKRIQDAVNDEGSLLP 271 (517) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~------~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~ 271 (517) .....................+......+...... +... .........+++.+|+.+...|++.++..+++++ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~ 155 (397) T protein:vir:12 76 NPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQ 155 (397) T ss_pred hhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHh Confidence 00000000000000001111222222222221111 1111 0011223456788999999999999999999999 Q ss_pred ceeeecccc----ceeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHH Q lcl|Aclame:pro 272 FIRHENLPT----LVVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMN 346 (517) Q Consensus 272 ~~~~~~~~~----~~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~ 346 (517) ++++.++++ ..++.......+.|++||+.+|++ .++|+.+++.++++++++++|++++.|+.++ |++||.+ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~----l~~~i~~ 231 (397) T protein:vir:12 156 YVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQA----IMTYVAK 231 (397) T ss_pred hcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHH----HHHHHHH Confidence 998877653 245666777889999999999974 6899999999999999999999999988875 8999999 Q ss_pred HHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh--hhhhcCCEEEEcHHHHHHHHHhh Q lcl|Aclame:pro 347 RLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA--TPKAADSTLVIHRNDLAAIRFLK 424 (517) Q Consensus 347 ~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~--~~~~~~a~~vmn~~~~~~l~~lK 424 (517) +|+++++++++.+|++|+|++++. |+ ...++++.++... ..+..+++|+|||.+|.+|++|| T Consensus 232 ~l~~~~~~~~d~~il~G~g~~~~~-g~---------------~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lk 295 (397) T protein:vir:12 232 WFAKKSVVTRNNLILAAIASLKKV-DI---------------DGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLK 295 (397) T ss_pred HHHHHHHHHHHHHHHhcccccccc-cc---------------ccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhh Confidence 999999999999999999987652 22 2356677766532 34556789999999999999999 Q ss_pred cCCCCEeccCCCCCCccceecCccceeccccCCc-------eeeeecCce-EEEeeeheee----hhhhhcccchHHHHH Q lcl|Aclame:pro 425 DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE-------KTAVSLSGY-VTNGSRGMEF----EQGTILVENNKEYLF 492 (517) Q Consensus 425 D~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~-------~~~~~~~~~-~~~~~~~~~~----~~d~~~~~n~~~~~~ 492 (517) |++|||||++++.++.+.+++|.++++....++. ...++++.| ++..+.++.. ..+..+.+|++.|++ T Consensus 296 d~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~ 375 (397) T protein:vir:12 296 DGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRG 375 (397) T ss_pred ccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEE Confidence 9999999999999998899999877655433322 233466664 4555555433 223346789999999 Q ss_pred hhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 493 EMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 493 ~~rvgg~v~~~~a~~~~~~tp~ 514 (517) +.|+++.+.+|+||+++++|.- T Consensus 376 ~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 376 IEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEeeccEEecccceEEEEEeeC Confidence 9999999999999999999998 No 22 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=6.9e-46 Score=268.18 Aligned_cols=389 Identities=13% Similarity=0.074 Sum_probs=209.8 Q ss_pred hhh--hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhHHHHH-----HHhhhhhhhHHHHHHHH Q lcl|Aclame:pro 122 NPS--NKNAVVTYFREEKKKEENKMTFDQNLMQELLDAK-KLAADLNAKLKERENGGD-----NAALKTVSELAANLMKQ 193 (517) Q Consensus 122 ~pA--~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~-~~~~e~~a~l~~~~~~~~-----e~~~~~~~~~~~~~~~~ 193 (517) .|. +-.+....+.++..+...+.......+++..+.. ...+++...++...+... +.......+++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 0011111111111111100000111111111000 000111111110000000 00000001111111111 Q ss_pred hhHHHhh----hhhhhhhhhhhhh-------------HHHH-HHH----HHHHHhhc--cchhhH-HHHhhhhhcccccc Q lcl|Aclame:pro 194 RESEKIL----GVEALKVTPEATE-------------FLKT-REA----EVAYMSAS--LTKDPK-AAWTAELKERGISG 248 (517) Q Consensus 194 ~~~~~~~----~~~~~~~~~~~~~-------------~~~~-~~~----~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~ 248 (517) +...... ............. .... ... ........ ...... ..........+..+ T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 0000000 0000000000000 0000 000 00000000 000000 00011122334567 Q ss_pred cccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeeccc-ccceeeecccccccccccceeeEeeHhhhhHhHhhh Q lcl|Aclame:pro 249 MPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNAL-TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLP 325 (517) Q Consensus 249 ~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS 325 (517) +.+|+.+...|++.++..+++++++++.++++ ..+|..... ..+.|++||+.+|+++++|+++++.++++++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:78 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 88999999999999999999999998877654 355665553 578999999999999999999999999999999999 Q ss_pred HHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccc----------------- Q lcl|Aclame:pro 326 KIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGT----------------- 388 (517) Q Consensus 326 ~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~----------------- 388 (517) ++++.|+ +.|++||.++|++++++++|.+||+|+|+++ +.||++..+.......... T Consensus 241 ~ell~d~-----~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~-p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:78 241 DEGLRDA-----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred HHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHhhcCCCccc-ccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 9998875 3499999999999999999999999999875 5688775532111100000 Q ss_pred --------------------------------------ccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCC Q lcl|Aclame:pro 389 --------------------------------------TNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNG 428 (517) Q Consensus 389 --------------------------------------~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~G 428 (517) ...+.+..++.... .++.+.+|||||.+|..|++|||++| T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G 394 (497) T protein:vir:78 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCC Confidence 00111112221111 12345689999999999999999999 Q ss_pred CEeccCCCCC------CccceecCccceeccccCCce--eeeecCc-e-EEEeeeheeeh----hhhhcccchHHHHHhh Q lcl|Aclame:pro 429 NYVFPVGVSN------QTIATHFGFNRLVQSVAVDEK--TAVSLSG-Y-VTNGSRGMEFE----QGTILVENNKEYLFEM 494 (517) Q Consensus 429 ryl~~~~~~~------~~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~-~~~~~~~~~~~----~d~~~~~n~~~~~~~~ 494 (517) ||||++.... ....++||+++++ +..|+.. ..++++. | .+.++.++... ...+|.+|++.|+++. T Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~pV~~-t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~ 473 (497) T protein:vir:78 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVT-TPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) T ss_pred ceeccCcccccccccccCCceeeceeeEe-cCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEE Confidence 9999875432 2334788865544 4445443 3445553 3 34455554322 2345789999999999 Q ss_pred hhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 495 PISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 495 rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) |+++.|++|+||++++++++++| T Consensus 474 r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:78 474 RLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred eecceeeccccEEEEEecCCccC Confidence 99999999999999999999999 No 23 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=6.9e-46 Score=268.18 Aligned_cols=389 Identities=13% Similarity=0.074 Sum_probs=209.8 Q ss_pred hhh--hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhHHHHH-----HHhhhhhhhHHHHHHHH Q lcl|Aclame:pro 122 NPS--NKNAVVTYFREEKKKEENKMTFDQNLMQELLDAK-KLAADLNAKLKERENGGD-----NAALKTVSELAANLMKQ 193 (517) Q Consensus 122 ~pA--~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~-~~~~e~~a~l~~~~~~~~-----e~~~~~~~~~~~~~~~~ 193 (517) .|. +-.+....+.++..+...+.......+++..+.. ...+++...++...+... +.......+++.++.+. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 0011111111111111100000111111111000 000111111110000000 00000001111111111 Q ss_pred hhHHHhh----hhhhhhhhhhhhh-------------HHHH-HHH----HHHHHhhc--cchhhH-HHHhhhhhcccccc Q lcl|Aclame:pro 194 RESEKIL----GVEALKVTPEATE-------------FLKT-REA----EVAYMSAS--LTKDPK-AAWTAELKERGISG 248 (517) Q Consensus 194 ~~~~~~~----~~~~~~~~~~~~~-------------~~~~-~~~----~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~ 248 (517) +...... ............. .... ... ........ ...... ..........+..+ T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg 160 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFA 160 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccc Confidence 0000000 0000000000000 0000 000 00000000 000000 00011122334567 Q ss_pred cccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeeccc-ccceeeecccccccccccceeeEeeHhhhhHhHhhh Q lcl|Aclame:pro 249 MPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNAL-TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLP 325 (517) Q Consensus 249 ~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS 325 (517) +.+|+.+...|++.++..+++++++++.++++ ..+|..... ..+.|++||+.+|+++++|+++++.++++++++++| T Consensus 161 ~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS 240 (497) T protein:vir:10 161 PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTIT 240 (497) T ss_pred cccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhH Confidence 88999999999999999999999998877654 355665553 578999999999999999999999999999999999 Q ss_pred HHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccc----------------- Q lcl|Aclame:pro 326 KIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGT----------------- 388 (517) Q Consensus 326 ~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~----------------- 388 (517) ++++.|+ +.|++||.++|++++++++|.+||+|+|+++ +.||++..+.......... T Consensus 241 ~ell~d~-----~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~-p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:10 241 DEGLRDA-----PELFNFVQGRLLEGIQRKEEVQLLAGGGYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred HHHHHhH-----HHHHHHHHHHHHHHHHHHHHHHhhcCCCccc-ccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 9998875 3499999999999999999999999999875 5688775532111100000 Q ss_pred --------------------------------------ccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCC Q lcl|Aclame:pro 389 --------------------------------------TNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNG 428 (517) Q Consensus 389 --------------------------------------~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~G 428 (517) ...+.+..++.... .++.+.+|||||.+|..|++|||++| T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G 394 (497) T protein:vir:10 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCC Confidence 00111112221111 12345689999999999999999999 Q ss_pred CEeccCCCCC------CccceecCccceeccccCCce--eeeecCc-e-EEEeeeheeeh----hhhhcccchHHHHHhh Q lcl|Aclame:pro 429 NYVFPVGVSN------QTIATHFGFNRLVQSVAVDEK--TAVSLSG-Y-VTNGSRGMEFE----QGTILVENNKEYLFEM 494 (517) Q Consensus 429 ryl~~~~~~~------~~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~-~~~~~~~~~~~----~d~~~~~n~~~~~~~~ 494 (517) ||||++.... ....++||+++++ +..|+.. ..++++. | .+.++.++... ...+|.+|++.|+++. T Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~pV~~-t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~ 473 (497) T protein:vir:10 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVT-TPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEE 473 (497) T ss_pred ceeccCcccccccccccCCceeeceeeEe-cCCCCCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEE Confidence 9999875432 2334788865544 4445443 3445553 3 34455554322 2345789999999999 Q ss_pred hhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 495 PISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 495 rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) |+++.|++|+||++++++++++| T Consensus 474 r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:10 474 RLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred eecceeeccccEEEEEecCCccC Confidence 99999999999999999999999 No 24 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=4.8e-46 Score=269.06 Aligned_cols=365 Identities=12% Similarity=0.075 Sum_probs=222.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhh- Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEAL- 205 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 205 (517) ...+..+++...+...+++...+...+......... .+++++..+..+.. +....+.....+............. T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~---ee~~~~~~~i~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~ 76 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSA---EELQAIKNERDTAK-MKRDMFKEQYTEARANEVANMSEEEK 76 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCH---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhhccccccc Confidence 233333333322222222221111111100000000 01111111111100 0001111111111100000000000 Q ss_pred --hhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-- Q lcl|Aclame:pro 206 --KVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-- 281 (517) Q Consensus 206 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-- 281 (517) ....+........++...+...... ..... ........+++.+|+.+...|++.+++.+++++++++.++++. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~--~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 77 KPLTKSEEEVKAGFVKDFKNLVRGRYQ-NLLDS--KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred cccccchhHHHHHHHHHHHHHHhcchh-HHHHH--hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCcc Confidence 0000000000111112222221111 11111 1222334568899999999999999999999999988776532 Q ss_pred --eeeeec-ccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 --VVGGDN-ALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVN 357 (517) Q Consensus 282 --~~~~~~-~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e 357 (517) .++... ....+.|++||+.+|+ +.++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d 229 (397) T protein:vir:49 154 SRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAEN----ILAWLSGWIAKKVVVTRN 229 (397) T ss_pred ceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHH----HHHHHHHHHHHHHHHHHH Confidence 233333 3356899999999986 67999999999999999999999999998765 899999999999999999 Q ss_pred hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCC Q lcl|Aclame:pro 358 RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV 436 (517) Q Consensus 358 ~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~ 436 (517) .+||+|+|++++..++ ...+++++++.... .+..+++|||||++|..|++|||++|||||+++. T Consensus 230 ~ai~~G~g~~~~~~~~---------------~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~ 294 (397) T protein:vir:49 230 KAILEAIAALPTKPTL---------------TKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDV 294 (397) T ss_pred HHHHhhcccccccccc---------------ccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCc Confidence 9999999988764322 34567777776654 3456799999999999999999999999999999 Q ss_pred CCCccceecCccceecc-ccCCc-------eeeeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 437 SNQTIATHFGFNRLVQS-VAVDE-------KTAVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 437 ~~~~~~~l~g~~~v~~~-~~~~~-------~~~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) ..+...+++|.|+++++ .+++. ...++++. |.+..+.++... .+.++.+|++.|+++.|+++.+.+| T Consensus 295 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 374 (397) T protein:vir:49 295 KSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDT 374 (397) T ss_pred CCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecc Confidence 99988999998776543 22322 22345665 455555565432 2335789999999999999999999 Q ss_pred cceEEEEeCCCCCC Q lcl|Aclame:pro 504 GTTAYGTYTPPVAG 517 (517) Q Consensus 504 ~a~~~~~~tp~~a~ 517 (517) ++|++++++++.+. T Consensus 375 ~a~~~~~~~~~~~~ 388 (397) T protein:vir:49 375 EAFVPASFKAIADQ 388 (397) T ss_pred cceEEEEeecccCC Confidence 99999999887554 No 25 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=7.4e-46 Score=268.01 Aligned_cols=372 Identities=10% Similarity=0.077 Sum_probs=226.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH-------HHhhhh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES-------EKILGV 202 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~-------~~~~~~ 202 (517) +...++..+ .+ .+..+.+.+.. ++....+.+...+..+...+++..++.++.+.++. ...... T Consensus 1 mk~~~el~~-~l---~el~~~~~~~~------~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~ 70 (415) T protein:vir:94 1 MKTKEELQS-EI---SDIKRQIDLKV------KYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSEN 70 (415) T ss_pred CChHHHHHH-HH---HHHHHHHHHHH------HHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 211111111 11 11000010000 00011111111111111111111111111111110 000000 Q ss_pred h-hhhhhhhhhhHHHH--HHHHHHHHhh--ccchhh---------HHHHhhhhhcccccccccchhhhhhHHHhHhhhhh Q lcl|Aclame:pro 203 E-ALKVTPEATEFLKT--REAEVAYMSA--SLTKDP---------KAAWTAELKERGISGMPAPAGILKRIQDAVNDEGS 268 (517) Q Consensus 203 ~-~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~---------~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~ 268 (517) . .............. .......... ....+. +............+++.+|+.+...|++.++..++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:94 71 NQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred ccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhh Confidence 0 00000000000000 0000000000 000000 01111111223345788999999999999999999 Q ss_pred hhhceeeeccccc--e--eeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHH Q lcl|Aclame:pro 269 LLPFIRHENLPTL--V--VGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTY 343 (517) Q Consensus 269 ~~~~~~~~~~~~~--~--~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~ 343 (517) +++++++.++++. . ++.......+.|++||+++|+. .++|+++++.++++++++++|++++.|+.++ |++| T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~----~~~~ 226 (415) T protein:vir:94 151 LDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVN----VLQE 226 (415) T ss_pred hhhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHH----HHHH Confidence 9999998876543 2 3444566788999999999974 6799999999999999999999999998765 8999 Q ss_pred HHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHH Q lcl|Aclame:pro 344 VMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRF 422 (517) Q Consensus 344 i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~ 422 (517) |.++|+++++++++.+||+|+|++++..++............++...++++++++.... .++.+++|||||++|.+|++ T Consensus 227 i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~ 306 (415) T protein:vir:94 227 LKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999999988766666555445555556667888888887754 45578899999999999999 Q ss_pred hhcCCCCEeccCCCCCCccceecCccceeccc-cCCc-----eeeeecCc-eEEEeeeheeeh-hhhhcccchHHHHHhh Q lcl|Aclame:pro 423 LKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AVDE-----KTAVSLSG-YVTNGSRGMEFE-QGTILVENNKEYLFEM 494 (517) Q Consensus 423 lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~~~-----~~~~~~~~-~~~~~~~~~~~~-~d~~~~~n~~~~~~~~ 494 (517) +||++|||||++++.++.+.+++|.++++.+. +.+. ...++++. |++..+.++... .+ +..+...+++.. T Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~--~~~~~~~~r~~~ 384 (415) T protein:vir:94 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD--YMHFGECLMIAV 384 (415) T ss_pred hhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec--cccCceEEEEEE Confidence 99999999999999999889999987655432 2222 23345665 455555555432 22 245566788999 Q ss_pred hhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 495 PISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 495 rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) |+++.+.+|+||+++++++++.| T Consensus 385 r~d~~~~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:94 385 RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EeccEEeccccEEEEEEeccCCC Confidence 99999999999999999999888 No 26 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=5e-46 Score=268.92 Aligned_cols=372 Identities=12% Similarity=0.121 Sum_probs=226.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKK----LAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVT 208 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~----~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (517) +..+..+...+.+.....++...+..+ +++...+++.+++++.... ....+......+.. .......... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~--~~~~~~~~~~~~~~----~~~~~~~~~~ 74 (404) T protein:vir:10 1 MSKELRELLNQLDSKNKELNSLLNKDGVTAEELNKTSNEIDILQAKIEAQ--KRKENIENNFNEDN----VKSLNTGKEE 74 (404) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhhhh----ccccccccch Confidence 111111111111111111111111100 0111111111111110000 00000000000000 0000000000 Q ss_pred hh---hhhHHHHHH-HHHHHHh-h--ccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc- Q lcl|Aclame:pro 209 PE---ATEFLKTRE-AEVAYMS-A--SLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT- 280 (517) Q Consensus 209 ~~---~~~~~~~~~-~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~- 280 (517) .. .....+... ....... . .......++.. ......+++.+|..+...|++.++..+++++++++.+++. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~--~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~ 152 (404) T protein:vir:10 75 NVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAIS--ENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTR 152 (404) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhc--cccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCC Confidence 00 000000000 0000000 0 00111111111 1223455788999999999999999999999999887653 Q ss_pred ---ceeeeecccccceeeecccccccc--cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 ---LVVGGDNALTQGTGHTTGTDKTES--NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ---~~~~~~~~~~~a~~~~eg~~~~~~--~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) ..+++......+.|+.||+.++.+ +++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++ T Consensus 153 ~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~~~~ 228 (404) T protein:vir:10 153 SGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKS----LEDWIINWFVDKVRIT 228 (404) T ss_pred ccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHH----HHHHHHHHHHHHHHHH Confidence 345666777889999999998875 5889999999999999999999999987764 9999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~ 433 (517) ++.+||+|+|++++..|+++..+. ..........++++..++.... .+..+++|+|||.+|.+|++|||++|||+|+ T Consensus 229 ~~~~il~G~g~~~~~~gi~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~ 307 (404) T protein:vir:10 229 RNAEILYGAGGDEHATGIMTANKF-KKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQ 307 (404) T ss_pred HHHHHhhcCCCCCcccceeecccc-ceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeec Confidence 999999999999988888766543 3333444556777877765432 3445789999999999999999999999999 Q ss_pred CCCCCCccceecCccceeccccCCc-------eeeeecCc-eEEEeeeheeeh--h--hhhcccchHHHHHhhhhcceee Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVAVDE-------KTAVSLSG-YVTNGSRGMEFE--Q--GTILVENNKEYLFEMPISGSLE 501 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~~~~-------~~~~~~~~-~~~~~~~~~~~~--~--d~~~~~n~~~~~~~~rvgg~v~ 501 (517) ++...+.+.+++|.|+++.+..++. ...++++. |.+..+.++... + ..++.+|++.|+++.|+++.+. T Consensus 308 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~ 387 (404) T protein:vir:10 308 PDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVK 387 (404) T ss_pred cCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEe Confidence 9999998899999877655443332 23455665 455555554432 2 2346789999999999999999 Q ss_pred cccceEEEEeCCC-CCC Q lcl|Aclame:pro 502 YKGTTAYGTYTPP-VAG 517 (517) Q Consensus 502 ~~~a~~~~~~tp~-~a~ 517 (517) +|+||++++++++ .-+ T Consensus 388 ~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 388 DSEALLIAEIPVESVQA 404 (404) T ss_pred cccceEEEEeecccCCC Confidence 9999999999666 334 No 27 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=4.6e-46 Score=269.13 Aligned_cols=371 Identities=15% Similarity=0.130 Sum_probs=225.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (517) ++-...|..++++..+...+++...+.+.+..+..+... ..++++..+..+. .+....+..++...+......... T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~---e~i~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSA---EAMSELKNKRDNE-KVRRDALREQLVEAQAEQVVNMRE 76 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccH---HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccc Confidence 444445555544444444344333322222211111100 0111111111111 111111111111111111000000 Q ss_pred h--hhhhhhhhhH-HHHHHHHHHHHhhccc-hhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 204 A--LKVTPEATEF-LKTREAEVAYMSASLT-KDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 204 ~--~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) . .......... ....+.+..+...... ......-.........+++.+|..+...|++.++..+++++++++.+++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~ 156 (408) T protein:vir:74 77 EEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) T ss_pred cccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeecc Confidence 0 0000000000 0011111111111110 0011111111223445688999999999999999999999999887765 Q ss_pred cc----eeeeecc-cccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 TL----VVGGDNA-LTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI 353 (517) Q Consensus 280 ~~----~~~~~~~-~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~ 353 (517) +. .++.... ...+.|+.||+..|+ ++++|++++++++++++++++|++++.|+.++ |++||.++|+++++ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~ 232 (408) T protein:vir:74 157 TSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAEN----ILAWLSSWIAKKVV 232 (408) T ss_pred CCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHH Confidence 43 2333333 345678999999987 66999999999999999999999999988776 89999999999999 Q ss_pred HHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCCEe Q lcl|Aclame:pro 354 MAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGNYV 431 (517) Q Consensus 354 ~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gryl 431 (517) ++++.+||+|+|++++..+ ..+.++++++++... .+..+++|||||.+|.+|++|||++|||| T Consensus 233 ~~~d~~il~G~G~~~~~~~---------------~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l 297 (408) T protein:vir:74 233 VTRNQAIIAAMGTVPKKPT---------------IANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYL 297 (408) T ss_pred HHHHHHHhhcccccccccc---------------cccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceE Confidence 9999999999998876422 223567777665432 34567899999999999999999999999 Q ss_pred ccCCCCCCccceecCccceecc-ccCCc-------eeeeecCc-eEEEeeeheeehhh----hhcccchHHHHHhhhhcc Q lcl|Aclame:pro 432 FPVGVSNQTIATHFGFNRLVQS-VAVDE-------KTAVSLSG-YVTNGSRGMEFEQG----TILVENNKEYLFEMPISG 498 (517) Q Consensus 432 ~~~~~~~~~~~~l~g~~~v~~~-~~~~~-------~~~~~~~~-~~~~~~~~~~~~~d----~~~~~n~~~~~~~~rvgg 498 (517) |+++++.+.+.+++|.+++..+ .+++. ...++++. |.+..+.++....+ ..+.+|++.++++.|+++ T Consensus 298 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:74 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV 377 (408) T ss_pred eccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCc Confidence 9999999988999998766543 23332 22345565 55556666554322 236789999999999999 Q ss_pred eeecccceEEEEeCCCCCC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 499 ~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+|+||+++++++.... T Consensus 378 ~~~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:74 378 KATDSEALVAGSFTAIADQ 396 (408) T ss_pred EEecccceEEEEeecccCC Confidence 9999999999999664332 No 28 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=8.8e-46 Score=267.60 Aligned_cols=372 Identities=14% Similarity=0.092 Sum_probs=230.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) ..++..++++.+...++++...+.... ++++.....++++.+..+ ..+...+...++.+.++........... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~------e~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKA------EIESTGQVSKQLQSDLMK-VQEELTKSGTRLFDLEQKLASGAENPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccccccch Confidence 333333333333322222211111100 011111111111111100 0001111111111111111110000000 Q ss_pred hhhhhhhHHHHHHHHH-HHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--cee Q lcl|Aclame:pro 207 VTPEATEFLKTREAEV-AYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVV 283 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~ 283 (517) ................ .........+.++.... .....+..+|+.+...|++.++..+++++++++.++.+ ..+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:19 74 KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGS---DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhcc---ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 0000000000000111 11111111222222211 12233556788899999999999999999999887754 345 Q ss_pred eeecc-cccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 284 GGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM 362 (517) Q Consensus 284 ~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~ 362 (517) +.... ...+.|++||+.+|+++++|+++++.++++++++++|+++++|+ +.|++||.++|+++++.+++.+||+ T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-----~~l~~~i~~~la~a~~~~~d~~~l~ 225 (385) T protein:vir:19 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-----PMLQSYINNRLMYGLALKEEGQLLN 225 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66554 46788999999999999999999999999999999999998865 3499999999999999999999999 Q ss_pred ccccCccccccccccccccc-ccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCc Q lcl|Aclame:pro 363 GGVTGVSETQIYPVVGDAWA-TNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQT 440 (517) Q Consensus 363 G~G~~~~~~gi~~~~~~~~~-~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~ 440 (517) |+|+++++.||++.++.... ...++....+++.+++.... .++.+++|+|||.+|.+|++|||++|||||++ +..+. T Consensus 226 G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~~~ 304 (385) T protein:vir:19 226 GDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQAFT 304 (385) T ss_pred ccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-cccCC Confidence 99999999999887754332 22334455677777776654 34567899999999999999999999999975 44666 Q ss_pred cceecCccceeccccCCcee--eeecCc-eEEEeeeheee----hhhhhcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 441 IATHFGFNRLVQSVAVDEKT--AVSLSG-YVTNGSRGMEF----EQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 441 ~~~l~g~~~v~~~~~~~~~~--~~~~~~-~~~~~~~~~~~----~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) +.+++|.+ ++++..+|+.. .++++. |.+..+.++.. ....++++|++.|+++.|+|+.+.+|+||+++++++ T Consensus 305 ~~~l~G~p-V~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 305 SNIMWGLP-VVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred Cceeccee-eEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 77888865 55666777654 445664 66666555432 222347899999999999999999999999999988 Q ss_pred CC Q lcl|Aclame:pro 514 PV 515 (517) Q Consensus 514 ~~ 515 (517) +- T Consensus 384 a~ 385 (385) T protein:vir:19 384 GS 385 (385) T ss_pred CC Confidence 88 No 29 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=8.8e-46 Score=267.60 Aligned_cols=372 Identities=14% Similarity=0.092 Sum_probs=230.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) ..++..++++.+...++++...+.... ++++.....++++.+..+ ..+...+...++.+.++........... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~------e~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKA------EIESTGQVSKQLQSDLMK-VQEELTKSGTRLFDLEQKLASGAENPGE 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccccccch Confidence 333333333333322222211111100 011111111111111100 0001111111111111111110000000 Q ss_pred hhhhhhhHHHHHHHHH-HHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--cee Q lcl|Aclame:pro 207 VTPEATEFLKTREAEV-AYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVV 283 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~ 283 (517) ................ .........+.++.... .....+..+|+.+...|++.++..+++++++++.++.+ ..+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:18 74 KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGS---DADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhcc---ccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 0000000000000111 11111111222222211 12233556788899999999999999999999887754 345 Q ss_pred eeecc-cccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 284 GGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM 362 (517) Q Consensus 284 ~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~ 362 (517) +.... ...+.|++||+.+|+++++|+++++.++++++++++|+++++|+ +.|++||.++|+++++.+++.+||+ T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-----~~l~~~i~~~la~a~~~~~d~~~l~ 225 (385) T protein:vir:18 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-----PMLQSYINNRLMYGLALKEEGQLLN 225 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66554 46788999999999999999999999999999999999998865 3499999999999999999999999 Q ss_pred ccccCccccccccccccccc-ccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCc Q lcl|Aclame:pro 363 GGVTGVSETQIYPVVGDAWA-TNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQT 440 (517) Q Consensus 363 G~G~~~~~~gi~~~~~~~~~-~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~ 440 (517) |+|+++++.||++.++.... ...++....+++.+++.... .++.+++|+|||.+|.+|++|||++|||||++ +..+. T Consensus 226 G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~~~ 304 (385) T protein:vir:18 226 GDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQAFT 304 (385) T ss_pred ccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-cccCC Confidence 99999999999887754332 22334455677777776654 34567899999999999999999999999975 44666 Q ss_pred cceecCccceeccccCCcee--eeecCc-eEEEeeeheee----hhhhhcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 441 IATHFGFNRLVQSVAVDEKT--AVSLSG-YVTNGSRGMEF----EQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 441 ~~~l~g~~~v~~~~~~~~~~--~~~~~~-~~~~~~~~~~~----~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) +.+++|.+ ++++..+|+.. .++++. |.+..+.++.. ....++++|++.|+++.|+|+.+.+|+||+++++++ T Consensus 305 ~~~l~G~p-V~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 305 SNIMWGLP-VVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSS 383 (385) T ss_pred Cceeccee-eEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecc Confidence 77888865 55666777654 445664 66666555432 222347899999999999999999999999999988 Q ss_pred CC Q lcl|Aclame:pro 514 PV 515 (517) Q Consensus 514 ~~ 515 (517) +- T Consensus 384 a~ 385 (385) T protein:vir:18 384 GS 385 (385) T ss_pred CC Confidence 88 No 30 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=3.3e-46 Score=269.95 Aligned_cols=405 Identities=13% Similarity=0.100 Sum_probs=224.0 Q ss_pred eeeecccCCCceEEE-Eehhhhh--hhhhhhhhhh-hhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 96 QPVEASEVDGVAYYK-KCILAGG--ALTPNPSNKN-AVVTYFREEKKKEENKMTFDQNLMQE-LLDAKKLAADLNAKLKE 170 (517) Q Consensus 96 ~~~~~~~~~~~~~~~-~~~l~Ev--S~v~~pA~~~-A~I~~vk~~~~~~~~~~~~~~~~~~~-~~e~~~~~~e~~a~l~~ 170 (517) -++|.. .++ +..+.|. ++...-+-.. ......+. +...+.+....+...+ ..+..+..++...+++. T Consensus 1 ~~~~~~------~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~k--e~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~ 72 (458) T protein:vir:10 1 MTIDIN------KLKEELGLGDLAKSLEGLTAAQKAQEAERMRK--EQEEKELARMNDLVSKAVGEDRKRLEEALELVKS 72 (458) T ss_pred Cccchh------hhhhhhchhhHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 000 0001110 0000000000 00000000 0000000000000000 00000000111111111 Q ss_pred hHH---HHHHHhh-------hhhhhHHHHHHHH----hhHHHhhhhhh--hhhhhhhhhHHHHHHHHHHHHhh-ccchhh Q lcl|Aclame:pro 171 REN---GGDNAAL-------KTVSELAANLMKQ----RESEKILGVEA--LKVTPEATEFLKTREAEVAYMSA-SLTKDP 233 (517) Q Consensus 171 ~~~---~~~e~~~-------~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 233 (517) +.+ +..+... +...+...+.... +.......... ..............+....+... ...... T Consensus 73 l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 152 (458) T protein:vir:10 73 LDEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETE 152 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhh Confidence 000 0000000 0000000000000 00000000000 00000000000000011111111 111111 Q ss_pred HHH-Hh---hhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeecccccceeeecccccccc--- Q lcl|Aclame:pro 234 KAA-WT---AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTGHTTGTDKTES--- 304 (517) Q Consensus 234 ~~~-~~---~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~~eg~~~~~~--- 304 (517) +.. .. ........++..+|+.+...|++.++..+++++++++.++++. .+++......+.|+.|+..++++ T Consensus 153 ~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~ 232 (458) T protein:vir:10 153 HGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTG 232 (458) T ss_pred hhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeeccccccccccccc Confidence 110 00 0111223457789999999999999999999999988877653 46677777889999999888754 Q ss_pred ---cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccc Q lcl|Aclame:pro 305 ---NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAW 381 (517) Q Consensus 305 ---~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~ 381 (517) +++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++.+||+|+|+++| .||++..+... T Consensus 233 ~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~----~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p-~Gi~~~~~~~~ 307 (458) T protein:vir:10 233 EEVKGALKEIHFSTYKLAAKSFITDETEEDAIFS----LLPLLRKRLIEAHAVSIEEAFMTGDGSGKP-KGLLTLASEDS 307 (458) T ss_pred ccccccceeeEeeeeeEEeeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhcCCCCCcc-ceeeecccccc Confidence 5689999999999999999999999988765 999999999999999999999999999865 68888654322 Q ss_pred cc-------cccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCC----CCCccceecCccc Q lcl|Aclame:pro 382 AT-------NVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV----SNQTIATHFGFNR 449 (517) Q Consensus 382 ~~-------~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~----~~~~~~~l~g~~~ 449 (517) .. ......+.+++++++.... .+..+++|||||.+|.+|++|||++|||||++.. ..+.+.+++|+++ T Consensus 308 ~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv 387 (458) T protein:vir:10 308 AKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPV 387 (458) T ss_pred cceeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceee Confidence 11 1112335678888776654 4456789999999999999999999999997643 3444567888655 Q ss_pred eeccccCCce------eeeec-CceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 450 LVQSVAVDEK------TAVSL-SGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 450 v~~~~~~~~~------~~~~~-~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) +++..||.. ..++| +.|+++++.++....|.+..++++.|+++.|+|+.|++|++|+++++... T Consensus 388 -~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 388 -VVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred -EEccccccccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 445555442 33455 45777777788877777788999999999999999999999999999888 No 31 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.1e-45 Score=267.06 Aligned_cols=369 Identities=15% Similarity=0.143 Sum_probs=225.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhh---hhhhh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILG---VEALK 206 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 206 (517) +..++. +...+..+..+.++...+......+ +.+......+...+...++..++.+.++...... ..... T Consensus 1 m~e~~~---~l~~~~~~~~~~~~~~~e~~~~~~~----~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 73 (390) T protein:vir:10 1 MTDITS---KLEATLANVTDSLRAFGERAVRDGE----LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV 73 (390) T ss_pred ChHHHH---HHHHHHHHHHHHHHHHHHHHHhhcc----cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 222222 2222222222222221111111101 1111111111111222222222222111111100 00000 Q ss_pred hhhhhhhHHHHH---HHHHHHHhh---ccchhhHHHHhhh-hhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 207 VTPEATEFLKTR---EAEVAYMSA---SLTKDPKAAWTAE-LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 207 ~~~~~~~~~~~~---~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) ......+..... ......... ....+.+...... ....+..+..+|+.+...|++.++..+++++++++.+++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~ 153 (390) T protein:vir:10 74 QHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTD 153 (390) T ss_pred cccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 000001111001 111111111 1111111111111 111223355678888899999999999999999988775 Q ss_pred c--ceeeeecc-cccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 T--LVVGGDNA-LTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAV 356 (517) Q Consensus 280 ~--~~~~~~~~-~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~ 356 (517) + ..++.... ...+.|++||+.+|+++++|+++++.++++++++++|++++.|+. .|++||.++|++++++++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~ 228 (390) T protein:vir:10 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKE 228 (390) T ss_pred CCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHH Confidence 5 35565554 357899999999999999999999999999999999999988763 489999999999999999 Q ss_pred HhhhhcccccCccccccccccccccc-ccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 357 NRAIIMGGVTGVSETQIYPVVGDAWA-TNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 357 e~~~l~G~G~~~~~~gi~~~~~~~~~-~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) +.++|+|+|+++++.||++.++.... ...+.....+++++++.... .++.+++|||||++|.+|++|||++|||||++ T Consensus 229 ~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~ 308 (390) T protein:vir:10 229 DAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred HHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 99999999999988999987764433 23334445677777766654 34567899999999999999999999999988 Q ss_pred CCCCCccceecCccceeccccCCce--eeeecCc-eEEEeeeheee--hhh-hhcccchHHHHHhhhhcceeecccceEE Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQSVAVDEK--TAVSLSG-YVTNGSRGMEF--EQG-TILVENNKEYLFEMPISGSLEYKGTTAY 508 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~~~~~~~~~~~--~~d-~~~~~n~~~~~~~~rvgg~v~~~~a~~~ 508 (517) ....+ ..+++|.++ +.+..++.. ..++++. |.+..+.++.. .+. ..+++|++.|+++.|+++.|++|+||++ T Consensus 309 ~~~~~-~~~l~G~pv-~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~ 386 (390) T protein:vir:10 309 ARGTL-TPTLWGLPV-VATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALIS 386 (390) T ss_pred CcCcC-Cceecceee-EEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEE Confidence 75444 457888754 445556544 3456664 55555555432 222 3478899999999999999999999999 Q ss_pred EEeC Q lcl|Aclame:pro 509 GTYT 512 (517) Q Consensus 509 ~~~t 512 (517) +++- T Consensus 387 ~~~a 390 (390) T protein:vir:10 387 GSFA 390 (390) T ss_pred EEeC Confidence 9998 No 32 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1.1e-45 Score=266.99 Aligned_cols=369 Identities=16% Similarity=0.140 Sum_probs=228.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh---hhh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE---ALK 206 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 206 (517) +..++++ ...+..+..+.++...+..... ..+.+...+..+.......++..++.+.++........ ... T Consensus 1 m~~l~~~---l~~~~~~~~~~~~~~~e~~~~~----~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~ 73 (390) T protein:vir:81 1 MTDITSK---LEATLANVTDSLRAFGERAVRD----GELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV 73 (390) T ss_pred ChHHHHH---HHHHHHHHHHHHHHHHHHHHhh----cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 2222211 1111111111111111111110 01111111122222222233333332222211111100 000 Q ss_pred hhhhhh------hHHHHHHHHHHHHhhccchhhHHHHhhh-hhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 207 VTPEAT------EFLKTREAEVAYMSASLTKDPKAAWTAE-LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 207 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) ..+... ...+.................+...... ....+..+..+|+.+...|++.++..+++++++++.+++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~ 153 (390) T protein:vir:81 74 QHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred ccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc Confidence 000000 0001000001111111111111111111 122344567788889999999999999999999887765 Q ss_pred cc--eeeeeccc-ccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 TL--VVGGDNAL-TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAV 356 (517) Q Consensus 280 ~~--~~~~~~~~-~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~ 356 (517) +. .++..... ..+.|++||+.+|+++++|+++++.++++++++++|++++.|+. .|++||.++|++++++++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~~~~~i~~~l~~~~~~~~ 228 (390) T protein:vir:81 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKE 228 (390) T ss_pred CCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHH Confidence 53 45665543 57889999999999999999999999999999999999998763 399999999999999999 Q ss_pred HhhhhcccccCcccccccccccccccc-cccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 357 NRAIIMGGVTGVSETQIYPVVGDAWAT-NVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 357 e~~~l~G~G~~~~~~gi~~~~~~~~~~-~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) +.+||+|+|+++++.||++.++..... ..+.....+++++++.... .++.+++|||||++|..|++|||++|||||++ T Consensus 229 d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~ 308 (390) T protein:vir:81 229 DAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred HHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 999999999999889999877644332 2334456677777776654 44567899999999999999999999999987 Q ss_pred CCCCCccceecCccceeccccCCce--eeeecCc-eEEEeeeheeeh--h-hhhcccchHHHHHhhhhcceeecccceEE Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQSVAVDEK--TAVSLSG-YVTNGSRGMEFE--Q-GTILVENNKEYLFEMPISGSLEYKGTTAY 508 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~--~-d~~~~~n~~~~~~~~rvgg~v~~~~a~~~ 508 (517) .. .+...+++|.++ +.+..++.. ..++++. |.+.++.++... + ..++.+|++.|++..|+++.+.+|+||++ T Consensus 309 ~~-~~~~~~l~G~pv-~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~ 386 (390) T protein:vir:81 309 AR-GTLTPTLWGLPV-VATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALIS 386 (390) T ss_pred cc-cccCceecceee-EEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEE Confidence 54 444568888764 445566654 4456665 566665555432 2 13468899999999999999999999999 Q ss_pred EEeC Q lcl|Aclame:pro 509 GTYT 512 (517) Q Consensus 509 ~~~t 512 (517) +++- T Consensus 387 ~t~a 390 (390) T protein:vir:81 387 GSFA 390 (390) T ss_pred EEeC Confidence 9998 No 33 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.5e-45 Score=266.28 Aligned_cols=369 Identities=14% Similarity=0.132 Sum_probs=229.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhh---hhhh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGV---EALK 206 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 206 (517) +..++++..+. .++..+.++...+......+ +.+...+..+.......++..++.+.+........ .... T Consensus 1 m~~~~~~l~~~---~~~~~~~~~~~~e~~~~~~~----~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~ 73 (390) T protein:vir:97 1 MTDITAKLEAT---LANVTDSLKAFGERAVRDGE----LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV 73 (390) T ss_pred ChHHHHHHHHH---HHHHHHHHHHHHHHHHhhcC----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 22222221111 11111111111111111111 11111111122222222222222222211111100 0000 Q ss_pred hhhhhhhHH---HHHHHHHHHHhhc---cchhhHHHHhh-hhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 207 VTPEATEFL---KTREAEVAYMSAS---LTKDPKAAWTA-ELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 207 ~~~~~~~~~---~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) ......... ............. .....+..... ........++++|+.+...|++.++..+++++++++.+++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~ 153 (390) T protein:vir:97 74 QHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred ccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeecc Confidence 000000000 0011111111111 11111111111 1122345578899999999999999999999999988775 Q ss_pred cc--eeeeeccc-ccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 TL--VVGGDNAL-TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAV 356 (517) Q Consensus 280 ~~--~~~~~~~~-~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~ 356 (517) +. .++..... ..+.|++||+.+|+++++|+++++.++++++++++|++++.|+. .|++||.++|++++++++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~ 228 (390) T protein:vir:97 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKE 228 (390) T ss_pred CCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHH Confidence 53 45665553 57899999999999999999999999999999999999998763 489999999999999999 Q ss_pred HhhhhcccccCcccccccccccccccc-cccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 357 NRAIIMGGVTGVSETQIYPVVGDAWAT-NVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 357 e~~~l~G~G~~~~~~gi~~~~~~~~~~-~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) +.+||+|+|+++++.||++.++..... ..++....+++.+++.... .++.+++|||||++|.+|++|||++|||||++ T Consensus 229 d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~ 308 (390) T protein:vir:97 229 DAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred HHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 999999999999889999876544332 3334455667777766544 44567899999999999999999999999987 Q ss_pred CCCCCccceecCccceeccccCCce--eeeecCc-eEEEeeeheee--hh-hhhcccchHHHHHhhhhcceeecccceEE Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQSVAVDEK--TAVSLSG-YVTNGSRGMEF--EQ-GTILVENNKEYLFEMPISGSLEYKGTTAY 508 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~~~~~~~~~~~--~~-d~~~~~n~~~~~~~~rvgg~v~~~~a~~~ 508 (517) .. .+...+++|.+++ ++..++.. ..++++. |.+..+.++.. .+ +..+.+|++.|+++.|+|+.+++|+||++ T Consensus 309 ~~-~~~~~~l~G~pV~-~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~ 386 (390) T protein:vir:97 309 AR-GTLTPTLWGLPVV-ATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALIT 386 (390) T ss_pred cc-CCCCceecceeeE-EcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEE Confidence 54 4555678887554 45566654 4456664 66666665543 22 24578999999999999999999999999 Q ss_pred EEeC Q lcl|Aclame:pro 509 GTYT 512 (517) Q Consensus 509 ~~~t 512 (517) +++- T Consensus 387 ~~~a 390 (390) T protein:vir:97 387 GSFA 390 (390) T ss_pred EEeC Confidence 9998 No 34 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.5e-45 Score=265.10 Aligned_cols=384 Identities=11% Similarity=0.047 Sum_probs=218.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHH Q lcl|Aclame:pro 113 ILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMK 192 (517) Q Consensus 113 ~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~ 192 (517) .|-|-+= ...+... ...++.+...++........+++..+++...+...+...........+... T Consensus 1 ~~ke~~~------------~~~~~~~---~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (413) T protein:vir:81 1 MVKEAGD------------APTNAQV---AEIAEVKSMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSG 65 (413) T ss_pred ChhhHHH------------HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhh Confidence 1111110 0000000 000000000000000000000111111111000000000000000000000 Q ss_pred HhhHHHh-hhhhhhhhhhhhhhHHHHHHH--HHHHHhhc-cchhhHHHH--hhhhhcccccccccchhhhhhHHHhHhhh Q lcl|Aclame:pro 193 QRESEKI-LGVEALKVTPEATEFLKTREA--EVAYMSAS-LTKDPKAAW--TAELKERGISGMPAPAGILKRIQDAVNDE 266 (517) Q Consensus 193 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~--~~~~~~~~~~~~~vp~~i~~~i~~~~~~~ 266 (517) ....... ..................... ........ ...+.+... ..........++.+|+.+...|++.++.. T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~ 145 (413) T protein:vir:81 66 ELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREK 145 (413) T ss_pred hHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhh Confidence 0000000 000000000000000000000 00000000 000111100 11112234567889999999999999999 Q ss_pred hhhhhceeeeccccc--eeeeecc----cccceeeeccccccccc-ccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHH Q lcl|Aclame:pro 267 GSLLPFIRHENLPTL--VVGGDNA----LTQGTGHTTGTDKTESN-ITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGA 339 (517) Q Consensus 267 ~~~~~~~~~~~~~~~--~~~~~~~----~~~a~~~~eg~~~~~~~-~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~ 339 (517) +++++++++.++++. .++.... ...+.|++||+.+|+++ .+|+.+++.++++++++++|++++.|+. . T Consensus 146 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-----~ 220 (413) T protein:vir:81 146 LVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-----F 220 (413) T ss_pred hhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-----H Confidence 999999988877654 3444432 24678999999999987 5899999999999999999999998763 4 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHH Q lcl|Aclame:pro 340 ILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDL 417 (517) Q Consensus 340 l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~ 417 (517) |++||.++|++++++++|.+||+|+|+++++.||++..+.......+.....+++..++.... ..+.+.+|||||.+| T Consensus 221 l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~ 300 (413) T protein:vir:81 221 LVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDY 300 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHH Confidence 999999999999999999999999999999889988776544433344445556666554432 234556799999999 Q ss_pred HHHHHhhcCCCCEeccCCCCCC-------ccceecCccceeccccCCce--eeeecCc-eEEEeeeheee----hhhhhc Q lcl|Aclame:pro 418 AAIRFLKDKNGNYVFPVGVSNQ-------TIATHFGFNRLVQSVAVDEK--TAVSLSG-YVTNGSRGMEF----EQGTIL 483 (517) Q Consensus 418 ~~l~~lKD~~Gryl~~~~~~~~-------~~~~l~g~~~v~~~~~~~~~--~~~~~~~-~~~~~~~~~~~----~~d~~~ 483 (517) .+|++|||++|||||++..... ...++||.+ ++++..++.. ..++++. |.+..+.++.. ..+.++ T Consensus 301 ~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~p-v~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~ 379 (413) T protein:vir:81 301 QELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLR-TVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDF 379 (413) T ss_pred HHHHHhhccCCceeccccccccccccccccCceeccee-eEEcCCCCcccEEEEecccEEEEEEecceEEEEeccccchh Confidence 9999999999999997654432 234677765 4555555554 4456765 55555555432 223357 Q ss_pred ccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 484 VENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 484 ~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ++|++.|+++.|+++.+.+|+||+++++++|++= T Consensus 380 ~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 380 ENNLITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred hcCcEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 8999999999999999999999999999999888 No 35 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=2e-45 Score=265.63 Aligned_cols=365 Identities=12% Similarity=0.069 Sum_probs=221.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh--- Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE--- 203 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~--- 203 (517) ..++..+++...+...+++.....+.+.....+.. ..++++++.+..+.. +....+...+...+......... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~---~ee~~~l~~ei~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVS---AEELQAIKNERDTAK-MKRDLFKEQYTEARANEVANMSEEEK 76 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhh---HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhccccccc Confidence 33333333333333222222211111110000000 001111111111100 00011111111111000000000 Q ss_pred hhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-- Q lcl|Aclame:pro 204 ALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-- 281 (517) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-- 281 (517) ...............+....+...+. ..... .........+++.+|+.+...|++.++..+++++++++.++++. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~--~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 77 KPLTKNEEEVKANFVKDFKNLVRGRY-QNLLD--SKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTG 153 (397) T ss_pred ccccchhhHHHHHHHHHHHHHhhcch-hhHHH--hhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcc Confidence 00000000000011111222222111 11111 11222334557889999999999999999999999888776542 Q ss_pred --eeeeec-ccccceeeeccccccccc-ccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 --VVGGDN-ALTQGTGHTTGTDKTESN-ITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVN 357 (517) Q Consensus 282 --~~~~~~-~~~~a~~~~eg~~~~~~~-~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e 357 (517) .++... ....+.|+.||+..|+++ ++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d 229 (397) T protein:vir:49 154 SRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAEN----ILAWLSGWIAKKVVVTRN 229 (397) T ss_pred eEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHH----HHHHHHHHHHHHHHHHHH Confidence 233333 335688999999999875 799999999999999999999999998876 899999999999999999 Q ss_pred hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCC Q lcl|Aclame:pro 358 RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV 436 (517) Q Consensus 358 ~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~ 436 (517) .+||+|+|++++..++ .+.|++.+++.... .+..+++|||||.+|.+|++|||++|||||++++ T Consensus 230 ~ail~G~g~~~~~~~~---------------~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~ 294 (397) T protein:vir:49 230 KAILEAIGTLPNKPTL---------------AKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDV 294 (397) T ss_pred HHHHhccccccccccc---------------cCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccc Confidence 9999999988764322 24567777766654 4556899999999999999999999999999998 Q ss_pred CCCccceecCccceecc-ccCCc-------eeeeecCc-eEEEeeeheeehh----hhhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 437 SNQTIATHFGFNRLVQS-VAVDE-------KTAVSLSG-YVTNGSRGMEFEQ----GTILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 437 ~~~~~~~l~g~~~v~~~-~~~~~-------~~~~~~~~-~~~~~~~~~~~~~----d~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) ..+...+++|.+++++. .+++. ...++++. |.+..+.++.... +.++.+|++.|+++.|+++.+.+| T Consensus 295 ~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~ 374 (397) T protein:vir:49 295 KSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDT 374 (397) T ss_pred cCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecc Confidence 88888999998765542 23332 22345565 5555555554332 234689999999999999999999 Q ss_pred cceEEEEeCCCCCC Q lcl|Aclame:pro 504 GTTAYGTYTPPVAG 517 (517) Q Consensus 504 ~a~~~~~~tp~~a~ 517 (517) +||+++++++++.- T Consensus 375 ~a~~~~~~~~~~~~ 388 (397) T protein:vir:49 375 EAFVPASFKAIADQ 388 (397) T ss_pred cceEEEEecccccc Confidence 99999999877443 No 36 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.1e-45 Score=265.55 Aligned_cols=371 Identities=11% Similarity=0.067 Sum_probs=218.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhh------- Q lcl|Aclame:pro 128 AVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL------- 200 (517) Q Consensus 128 A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~------- 200 (517) =+|..++++......+++... +.. .....+.+.+.+..+.+...+.+++.++++.++..... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~---~~~--------~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~ 69 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALA---QIE--------VGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPV 69 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHH---HHH--------hccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 112222222222221111111 000 00111111111111111222222222222111100000 Q ss_pred ----------hhhhhhhhhhhhhHHHHH-HHHHHH---Hhhccch--------hhHHHH--hhhhhcccccccccchhhh Q lcl|Aclame:pro 201 ----------GVEALKVTPEATEFLKTR-EAEVAY---MSASLTK--------DPKAAW--TAELKERGISGMPAPAGIL 256 (517) Q Consensus 201 ----------~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~--------~~~~~~--~~~~~~~~~~~~~vp~~i~ 256 (517) ............+..... ...... ..+.... ...... .......+.+++.+|+.+. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~ 149 (435) T protein:vir:14 70 DPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLS 149 (435) T ss_pred cchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHH Confidence 000000000000000000 000000 0000000 000001 1112233456788999999 Q ss_pred hhHHHhHhhhhhhhhc-eeeeccc--cceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhh Q lcl|Aclame:pro 257 KRIQDAVNDEGSLLPF-IRHENLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNA 333 (517) Q Consensus 257 ~~i~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~ 333 (517) ..|++.++..++++++ ++..+.. ...+|..+....+.|+.||+.+|+++++|+++++.++++++++++|++++.|+. T Consensus 150 ~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 229 (435) T protein:vir:14 150 SEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAG 229 (435) T ss_pred HHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhc Confidence 9999999999999886 5554432 456777778888999999999999999999999999999999999999999987 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccH-------HHHHHHHHHhhhhhc Q lcl|Aclame:pro 334 TDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNI-------QELLEKLSVATPKAA 406 (517) Q Consensus 334 ~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~-------d~l~~~l~~~~~~~~ 406 (517) ++ +.|++||.++|+++++++++.+||+|+|++..+.||++..............+. ..++..+..+..++. T Consensus 230 ~~--~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 307 (435) T protein:vir:14 230 VN--PNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLT 307 (435) T ss_pred cC--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhcccccc Confidence 65 459999999999999999999999999998777898876543222222111111 222333333344567 Q ss_pred CCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc----------eeeeecCceEEEeeehee Q lcl|Aclame:pro 407 DSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE----------KTAVSLSGYVTNGSRGME 476 (517) Q Consensus 407 ~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~----------~~~~~~~~~~~~~~~~~~ 476 (517) +++|||||.+|.+|+++||++|||||+.. + ..+++|.|.++ +..+|. ...++++.|++..+.++. T Consensus 308 ~~~~v~n~~~~~~L~~lkd~~G~~l~~~~-~---~g~l~G~Pv~~-~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~ 382 (435) T protein:vir:14 308 QPGWIMAPRTFRFLEGLRDGNGNKVYPEL-A---NGMLKGYPVGK-TTQVPINLGETGKESEIYFTDFGDVFIGEEETLE 382 (435) T ss_pred CCEEEEcHHHHHHHHHhhccCCceeccCC-C---CCeeecceeEe-eccccccccCCCccceEEEeecccEEEEEecccE Confidence 88999999999999999999999999532 2 23688866544 333432 345677788877665554 Q ss_pred ehhh-------------hhcccchHHHHHhhhhcceeecccceEEEEeCCCCC Q lcl|Aclame:pro 477 FEQG-------------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 477 ~~~d-------------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a 516 (517) ...+ ..|++|++.|+++.|+++++.+|+||++++=-+-+| T Consensus 383 ~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 383 IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 3211 126789999999999999999999999888666666 No 37 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=4e-45 Score=264.00 Aligned_cols=361 Identities=13% Similarity=0.079 Sum_probs=214.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhh---- Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGV---- 202 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~---- 202 (517) ..++..++++......++ +..+.+..+...... .++++.+.+.. ........+..++...+........ T Consensus 1 M~~l~~l~~~~~~~~~e~---~~~~~~~~~~~~~~~---ee~~~~~~~~~-~~~~~~~~l~~~i~~~e~~~~~~~~~~~~ 73 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADL---NAQLNAKLQDENASV---DDFQKIKDDLT-AAKARRDAINDQIKDLEAENKANSDPDKP 73 (394) T ss_pred ChHHHHHHHHHHHHHHHH---HHHHHHHHhhhhccH---HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 222222222222111111 111111111000000 00111111111 1111111222222111111100000 Q ss_pred ----hhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc Q lcl|Aclame:pro 203 ----EALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL 278 (517) Q Consensus 203 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~ 278 (517) ................+....+...... .... .........+++.+|+.+...|++.++..+++++++++.++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~--~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) T protein:vir:10 74 VDNAQPNGTDLKKKPIDAKKKAINDFIHSHGK-VIDN--AAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPV 150 (394) T ss_pred hhhhcccccchhhhHHHHHHHHHHHHHhccch-hhhh--hhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeec Confidence 0000000000000111122222222111 1111 11223344567899999999999999999999999998877 Q ss_pred ccc--eeeeec-ccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 PTL--VVGGDN-ALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIM 354 (517) Q Consensus 279 ~~~--~~~~~~-~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~ 354 (517) ++. .++... ....+.|++|++..|+ ++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~~~ 226 (394) T protein:vir:10 151 TTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVD----LTSLVGQSINEKSVN 226 (394) T ss_pred cCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHH----HHHHHHHHHHHHHHH Confidence 643 454433 3466789999999996 67999999999999999999999999998765 999999999999999 Q ss_pred HHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 355 AVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 355 ~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) +++.+|++|+|++++. + ..+....|++++.+........+++|||||++|.+|++|||++|||||++ T Consensus 227 ~~~~~il~g~g~~~~~-~------------~~~~~~~d~l~~~~~~~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~i~~~ 293 (394) T protein:vir:10 227 TYNAMIAPVLQSFTAK-A------------TTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDKNGRYLLHD 293 (394) T ss_pred HHHHHHhhcccccccc-c------------ccccccHHHHHHHHHhhhhhhccCEEEecHHHHHHHHHhhccCCCeeeec Confidence 9999999999987552 1 12334567787777665555668999999999999999999999999988 Q ss_pred CCCC----CccceecCccceeccc-cCC----c--eeeeecCc-eEEEeeeheee--hhhhhcccchHHHHHhhhhccee Q lcl|Aclame:pro 435 GVSN----QTIATHFGFNRLVQSV-AVD----E--KTAVSLSG-YVTNGSRGMEF--EQGTILVENNKEYLFEMPISGSL 500 (517) Q Consensus 435 ~~~~----~~~~~l~g~~~v~~~~-~~~----~--~~~~~~~~-~~~~~~~~~~~--~~d~~~~~n~~~~~~~~rvgg~v 500 (517) +... +.+.+|+|.|+++.+. .++ + ...++|+. |++..+.++.. .++..+. ..+++..|+++++ T Consensus 294 ~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~---~~~~~~~r~d~~~ 370 (394) T protein:vir:10 294 ASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYG---RYLGAAFRFGVKQ 370 (394) T ss_pred cccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccc---eeEEEEEEeccEE Confidence 7654 3345789987765432 222 2 23346666 44444455443 2222223 3467889999999 Q ss_pred ecccceEEEEeCCCCCC Q lcl|Aclame:pro 501 EYKGTTAYGTYTPPVAG 517 (517) Q Consensus 501 ~~~~a~~~~~~tp~~a~ 517 (517) .+|++|++++++++.+| T Consensus 371 ~~~~ai~~~~~~~~~~~ 387 (394) T protein:vir:10 371 ADSNAGYFVTNTDAASG 387 (394) T ss_pred eccccEEEEEeecccCC Confidence 99999999999999888 No 38 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=9.4e-45 Score=261.97 Aligned_cols=366 Identities=11% Similarity=0.077 Sum_probs=220.7 Q ss_pred ehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHH Q lcl|Aclame:pro 112 CILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLM 191 (517) Q Consensus 112 ~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~ 191 (517) .++. .+++..++...++++......+ +.....+++..++.+..+...+.......++..++. T Consensus 1 m~~~----------------e~~~~~~~~~~~l~~~~~~~~~--e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~ 62 (379) T protein:vir:10 1 MEAL----------------EIKVALEAIKGQVDSKSSAQAL--EVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHAD 62 (379) T ss_pred CCHH----------------HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 1111111111111111110000 011112222222222222222222222233333333 Q ss_pred HHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhh Q lcl|Aclame:pro 192 KQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLP 271 (517) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~ 271 (517) +.+++.................. .............. .........+......+..+|+.+...|++.+...+++++ T Consensus 63 ~~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~ 139 (379) T protein:vir:10 63 KLDVKLKEKAKSEDKSDSLVKSI-TENFNDIKEVRNGK--SIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSD 139 (379) T ss_pred HHHHHHHhcccccccchhHHHHH-HHHHHhHHHHHhhh--hhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHh Confidence 33322222111111110000000 00000000000000 0000011112222333446799999999999999999999 Q ss_pred ceeeeccccc--eeeeeccc--ccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHH Q lcl|Aclame:pro 272 FIRHENLPTL--VVGGDNAL--TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNR 347 (517) Q Consensus 272 ~~~~~~~~~~--~~~~~~~~--~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~ 347 (517) ++++.++.+. .++..+.. ..+.|+.||+.+|+++++|+++++.++++++++++|++++.|+ +.|++||.++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~-----~~l~~~i~~~ 214 (379) T protein:vir:10 140 IVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNL-----PFLTSFIPNA 214 (379) T ss_pred hceeeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhH-----HHHHHHHHHH Confidence 9988776553 55655543 4566889999999999999999999999999999999998876 3499999999 Q ss_pred HHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh-hhhhcCCEEEEcHHHHHHHHHhhcC Q lcl|Aclame:pro 348 LPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA-TPKAADSTLVIHRNDLAAIRFLKDK 426 (517) Q Consensus 348 l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~-~~~~~~a~~vmn~~~~~~l~~lKD~ 426 (517) |+++++.+++.+|++|+|++.... ....+....++++++++... ..++.+++|||||.+|.+|++|||+ T Consensus 215 la~~~~~~~~~~~~~g~~~~~~~~----------~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~ 284 (379) T protein:vir:10 215 LRRDYAKAENAAFNAVLAANATAS----------TEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKS 284 (379) T ss_pred HHHHHHHHHHHHHhcccccccccc----------cccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcc Confidence 999999999999999988653311 11123344567788777654 4556788999999999999999999 Q ss_pred CCCEeccCCCC--CCccceecCccceeccccCCc--eeeeecCceEEEeeeheee--h--hhhhcccchHHHHHhhhhcc Q lcl|Aclame:pro 427 NGNYVFPVGVS--NQTIATHFGFNRLVQSVAVDE--KTAVSLSGYVTNGSRGMEF--E--QGTILVENNKEYLFEMPISG 498 (517) Q Consensus 427 ~Gryl~~~~~~--~~~~~~l~g~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~--~--~d~~~~~n~~~~~~~~rvgg 498 (517) +|||||+++.. .+.+.+++|.+++ ++..++. ...++++.|.++.+.++.. . ...++.+|++.|+++.|+|+ T Consensus 285 ~G~~l~~~~~~~~~~~~~~l~G~pvv-~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~ 363 (379) T protein:vir:10 285 VGAGYGLPGVVTQDNGVLRINGIPLF-RATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVAL 363 (379) T ss_pred CCceeccCCccCCCCCcceecceeeE-ecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEecc Confidence 99999998764 4555678886554 4455554 4556788888887765432 2 23357899999999999999 Q ss_pred eeecccceEEEEeCCC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYTPP 514 (517) Q Consensus 499 ~v~~~~a~~~~~~tp~ 514 (517) .|++|+||++++||+- T Consensus 364 ~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 364 AVEQPAALIFGDFTAV 379 (379) T ss_pred EEecCccEEEEEecCC Confidence 9999999999999887 No 39 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=9.5e-45 Score=261.93 Aligned_cols=367 Identities=14% Similarity=0.111 Sum_probs=219.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhh-hhhhhh---hhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhh--hh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKL-AADLNA---KLKERENGGDNAALKTVSELAANLMKQRESEKILGV--EA 204 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~-~~e~~a---~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 204 (517) ..++...++..++..+....+++..+.... ..+... ++++...+..+ .......+..++.+.+........ .. T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDN-EKVRRDALREQLVEAQAEQVVNMREEEK 79 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccccc Confidence 222221112111111111111111111100 001000 11111111111 111111122222221111111000 00 Q ss_pred hh-hhhhhhhHHHHHHHHHHHHhhccch-hhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc- Q lcl|Aclame:pro 205 LK-VTPEATEFLKTREAEVAYMSASLTK-DPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL- 281 (517) Q Consensus 205 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~- 281 (517) .. ............+++..+....... ...............+++.+|+.+...|++.++..+++++++++.++++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (404) T protein:vir:39 80 GPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (404) T ss_pred cccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCc Confidence 00 0000011111112222222221111 01111111223334567889999999999999999999999988776543 Q ss_pred -ee--eee-cccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 -VV--GGD-NALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAV 356 (517) Q Consensus 282 -~~--~~~-~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~ 356 (517) .+ +.. .....+.|+.||+.+|+ +.++|+++++.++++++++++|++++.|+.++ |++||.++|++++++++ T Consensus 160 ~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~ 235 (404) T protein:vir:39 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAEN----ILAWLSSWIAKKVVVTR 235 (404) T ss_pred ceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHH----HHHHHHHHHHHHHHHHH Confidence 22 222 23356789999999997 57999999999999999999999999988765 89999999999999999 Q ss_pred HhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 357 NRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 357 e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) +.+||+|+|++++..+ ....+++.++++... .+..+++|||||++|.+|++|||++|||||++ T Consensus 236 d~~il~g~g~~~~~~~---------------~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~ 300 (404) T protein:vir:39 236 NQAIIAAMGTVPKKPT---------------IAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (404) T ss_pred HHHHHhcccccccccc---------------cccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeecc Confidence 9999999998765322 223566777766433 34467899999999999999999999999999 Q ss_pred CCCCCccceecCccceecc-ccCCc-------eeeeecCc-eEEEeeeheeeh--h--hhhcccchHHHHHhhhhcceee Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQS-VAVDE-------KTAVSLSG-YVTNGSRGMEFE--Q--GTILVENNKEYLFEMPISGSLE 501 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~-~~~~~-------~~~~~~~~-~~~~~~~~~~~~--~--d~~~~~n~~~~~~~~rvgg~v~ 501 (517) ++..+.+.+++|.|++++. .+++. ...++++. |.+..+.++... + ...+.+|++.++++.|+|+.+. T Consensus 301 ~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~ 380 (404) T protein:vir:39 301 DPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTT 380 (404) T ss_pred CcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEe Confidence 9999988999998776643 23332 23345665 445555554432 1 2246789999999999999999 Q ss_pred cccceEEEEeCCCC-------CC Q lcl|Aclame:pro 502 YKGTTAYGTYTPPV-------AG 517 (517) Q Consensus 502 ~~~a~~~~~~tp~~-------a~ 517 (517) +|+||++++++++- || T Consensus 381 ~~~a~~~~~~~~~a~~~~~~~~~ 403 (404) T protein:vir:39 381 DSEALVAGSFTAIADQVGNFTAG 403 (404) T ss_pred cccceEEEEeeccccCCCCCCCC Confidence 99999999987662 22 No 40 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=2.2e-44 Score=259.89 Aligned_cols=365 Identities=11% Similarity=0.064 Sum_probs=218.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhh---hhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL---GVE 203 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~---~~~ 203 (517) ...+..+++...+...++....+.+.......+.. ..++++++.+..+. .+....+.......+...... ... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~---~ee~~~l~~ei~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVT---AEELQAIKNERDTA-KMKRDMFKEQYTEARANEVVNMSEEEK 76 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhhhhhhhhcc Confidence 22222222222111111111111100000000000 00001111111000 000011111111100000000 000 Q ss_pred hhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-- Q lcl|Aclame:pro 204 ALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-- 281 (517) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-- 281 (517) ...............+....+...... .... .........+++.+|+.+...|++.++..+++++++++.++++. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:48 77 KPLTKSEEEVKAGFVKDFKNLVRGRYQ-NLLD--SKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred ccccchhhHHHHHHHHHHHHHHhhhhh-HHHH--HhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcc Confidence 000001110001111112222211111 1111 11222334567899999999999999999999999988776543 Q ss_pred e--eeee-cccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 V--VGGD-NALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVN 357 (517) Q Consensus 282 ~--~~~~-~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e 357 (517) . ++.. .....+.|+.||+..+++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++++ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~v~~~l~~~~~~~~d 229 (397) T protein:vir:48 154 SRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAEN----ILAWLSGWIAKKVVVTRN 229 (397) T ss_pred eEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHH----HHHHHHHHHHHHHHHHHH Confidence 2 2222 334568899999999986 5899999999999999999999999998775 899999999999999999 Q ss_pred hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCC Q lcl|Aclame:pro 358 RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV 436 (517) Q Consensus 358 ~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~ 436 (517) .+||+|+|++++..+ ..+.+++++++.... .+..+++|||||.+|..|++|||++|||||+++. T Consensus 230 ~~il~G~g~~~~~~~---------------~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~ 294 (397) T protein:vir:48 230 KAILEAIATLPTKPT---------------LTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDV 294 (397) T ss_pred HHHhhcccccccccc---------------cccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCc Confidence 999999998766422 234567777665544 4456799999999999999999999999999999 Q ss_pred CCCccceecCccceecc-ccC-----Cce--eeeecCce-EEEeeeheeeh----hhhhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 437 SNQTIATHFGFNRLVQS-VAV-----DEK--TAVSLSGY-VTNGSRGMEFE----QGTILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 437 ~~~~~~~l~g~~~v~~~-~~~-----~~~--~~~~~~~~-~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) ..+.+.+++|.|++.+. .++ ++. ..++++.| .+..+.++... .+.++.+|++.|++..|+++.+.+| T Consensus 295 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 374 (397) T protein:vir:48 295 KSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDT 374 (397) T ss_pred CCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecc Confidence 99999999998776543 122 222 23456654 44555554432 2335789999999999999999999 Q ss_pred cceEEEEeCCCCCC Q lcl|Aclame:pro 504 GTTAYGTYTPPVAG 517 (517) Q Consensus 504 ~a~~~~~~tp~~a~ 517 (517) ++|++++++.+.+. T Consensus 375 ~a~~~~~~~~~~~~ 388 (397) T protein:vir:48 375 ESFVPASFKAIADQ 388 (397) T ss_pred cceEEEEecccccC Confidence 99999999877655 No 41 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=4.5e-44 Score=258.20 Aligned_cols=369 Identities=11% Similarity=0.073 Sum_probs=216.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHh-------- Q lcl|Aclame:pro 128 AVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKI-------- 199 (517) Q Consensus 128 A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~-------- 199 (517) =.+..++++......++++. .+.. .....+.+.+.+..+.+...+..++.++.+++..... T Consensus 1 M~l~eL~~~r~~~~~~~~~l---~~~~--------~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~ 69 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQAL---AQIE--------VGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPV 69 (435) T ss_pred CCHHHHHHHHHHHHHHHHHH---HHHH--------hccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 11122222222222111111 1000 0001111111111111111111222222111110000 Q ss_pred ---------hhhhhh---hhhhhhhhHHHHHH-HHHHHH------------hhccchhhHHHHhhhhhcccccccccchh Q lcl|Aclame:pro 200 ---------LGVEAL---KVTPEATEFLKTRE-AEVAYM------------SASLTKDPKAAWTAELKERGISGMPAPAG 254 (517) Q Consensus 200 ---------~~~~~~---~~~~~~~~~~~~~~-~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~vp~~ 254 (517) ...... ....+.+....... ...... ......+.. ........+.+++.+|.. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~gg~lvP~~ 147 (435) T protein:vir:80 70 DPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVA--MSLNTLSPGAGGVLVPEN 147 (435) T ss_pred cchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhh--hhhcccCCCCCccccchh Confidence 000000 00000000000000 000000 000000001 111122334567899999 Q ss_pred hhhhHHHhHhhhhhhhhc-eeeecc--ccceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHH Q lcl|Aclame:pro 255 ILKRIQDAVNDEGSLLPF-IRHENL--PTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNS 331 (517) Q Consensus 255 i~~~i~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d 331 (517) +...|++.++..++++++ +++.+. +...+|.......+.|+.||+.+|+++++|+++++.++++++++++|++++.| T Consensus 148 ~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d 227 (435) T protein:vir:80 148 LSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKY 227 (435) T ss_pred HHHHHHHHHhhhchhhhccceeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHh Confidence 999999999999999887 555443 34567778888889999999999999999999999999999999999999999 Q ss_pred hhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccH----HHHHHHHHH---hhhh Q lcl|Aclame:pro 332 NATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNI----QELLEKLSV---ATPK 404 (517) Q Consensus 332 ~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~----d~l~~~l~~---~~~~ 404 (517) +.++ +.|++||.++|+++++++++.+||+|+|++..+.||++..............+. .++..++.. +..+ T Consensus 228 s~~~--~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 305 (435) T protein:vir:80 228 AGVN--PNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADAN 305 (435) T ss_pred hccc--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccc Confidence 8764 458999999999999999999999999998778899887654333322222222 233333322 2335 Q ss_pred hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc----------eeeeecCceEEEeeeh Q lcl|Aclame:pro 405 AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE----------KTAVSLSGYVTNGSRG 474 (517) Q Consensus 405 ~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~----------~~~~~~~~~~~~~~~~ 474 (517) +.+++|||||.+|.+|++|||++|||||+.. + ..+++|.+++ ++..+|. ...++++.|+++.+.+ T Consensus 306 ~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~-~---~~~l~G~pv~-~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~ 380 (435) T protein:vir:80 306 LTQPGWIMAPRTFRFLEGLRDGNGNKVYPEL-A---NGMLKGYPVG-KTTQVPINLGEAGKESEIYFTDFGDVFIGEEET 380 (435) T ss_pred cccCEEEEcHHHHHHHHhhhccCCceeccCC-C---CCeEeeeeeE-EeccccccccCCCCcceEEEEEcccEEEEeecc Confidence 5678999999999999999999999999643 2 2368886544 4444432 3346677788776555 Q ss_pred eeeh--hh-----------hhcccchHHHHHhhhhcceeecccceEEEEeCCCCC Q lcl|Aclame:pro 475 MEFE--QG-----------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 475 ~~~~--~d-----------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a 516 (517) +.+. ++ -.|.+|++.|+++.|+++.+.+|+||++.+=-.=+| T Consensus 381 ~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 381 LEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 4432 21 126789999999999999999999999988544444 No 42 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2e-44 Score=260.21 Aligned_cols=368 Identities=11% Similarity=0.120 Sum_probs=215.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHh------- Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKI------- 199 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~------- 199 (517) .-++..+++++.....+++...... .+...+.+.+.+..+........++.++...+..+.. T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~-----------~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~ 69 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIE-----------ATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKP 69 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHH-----------hccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 3334444444333332222211100 0001111111111111111112222222111100000 Q ss_pred -hhhhhhh---hhhhhhhHHH---HH--------HHHHHHHhhccchhh-HHHHhhh-hhcccccccccchhhhhhHHHh Q lcl|Aclame:pro 200 -LGVEALK---VTPEATEFLK---TR--------EAEVAYMSASLTKDP-KAAWTAE-LKERGISGMPAPAGILKRIQDA 262 (517) Q Consensus 200 -~~~~~~~---~~~~~~~~~~---~~--------~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~vp~~i~~~i~~~ 262 (517) ....... ...+...... .. ............... ....... ....+.+++.+|+.+...|++. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~ 149 (428) T protein:vir:10 70 VKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIEL 149 (428) T ss_pred hhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHH Confidence 0000000 0000000000 00 000000000000000 0000011 1122345788999999999999 Q ss_pred Hhhhhhhhhc-eeeecc--ccceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHH Q lcl|Aclame:pro 263 VNDEGSLLPF-IRHENL--PTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGA 339 (517) Q Consensus 263 ~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~ 339 (517) ++..++++++ +++.+. ....+|+......+.|++||+.+|+++++|+++++.++++++++++|++++.|+.++ T Consensus 150 l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~---- 225 (428) T protein:vir:10 150 LRDRTIVRKLGARSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFN---- 225 (428) T ss_pred HhhhchhhhhcceeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHH---- Confidence 9999999988 555443 235778877788899999999999999999999999999999999999999988765 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc---ccccccH---HHHHHHHHH----hhhhhcCCE Q lcl|Aclame:pro 340 ILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN---VTGTTNI---QELLEKLSV----ATPKAADST 409 (517) Q Consensus 340 l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~---~~~~~~~---d~l~~~l~~----~~~~~~~a~ 409 (517) |++||.++|+++++++++.+||+|+|++..+.||++.++...... .....+. +..++.+.. ...+..+++ T Consensus 226 l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (428) T protein:vir:10 226 VEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSG 305 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCE Confidence 999999999999999999999999999877889998765432211 1112222 222333222 223445789 Q ss_pred EEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc----------eeeeecCceEEEeeeheeeh- Q lcl|Aclame:pro 410 LVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE----------KTAVSLSGYVTNGSRGMEFE- 478 (517) Q Consensus 410 ~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~- 478 (517) |+||+.+|.+|++|||++|||||++.. ..+++|.+.+ ++..+|. ...++++.|++..+.++... T Consensus 306 ~v~n~~~~~~L~~lkd~~G~~i~~~~~----~g~l~G~pv~-~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~ 380 (428) T protein:vir:10 306 WGMSNRTYMKLFGLRDGNGNKVYPEMA----QGMLKGYPIQ-RTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDF 380 (428) T ss_pred EEEcHHHHHHHHHhhccCCceeccCCC----CCeeeceeeE-EeccccccccCCCccceEEEEecceEEEEEecceEEEe Confidence 999999999999999999999997532 2368886644 4444432 23456777777665554432 Q ss_pred -hh-----------hhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 479 -QG-----------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 479 -~d-----------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) ++ ..+.+|++.++++.|+++.|.+|+||++++-.-= T Consensus 381 ~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 381 SKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 11 1368999999999999999999999998873222 No 43 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=3.4e-44 Score=258.89 Aligned_cols=352 Identities=14% Similarity=0.150 Sum_probs=215.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHH----Hhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESE----KILGVEALKVT 208 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 208 (517) +..+..+...++......++...+.. +.++ .+...++...++.+++...+.. ........... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~-~~~e------------~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED-KVAE------------AEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVET 67 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH-HHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 11111111111111111111111000 0000 0111111111221111111000 00000000000 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHH--------hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAW--------TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT 280 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~ 280 (517) .........+................... .........+++.+|+.+...|++.+++.+++++++++.++++ T Consensus 68 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 147 (392) T protein:vir:10 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) T ss_pred cCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccC Confidence 00011111111222222211111111101 1111223446788999999999999999999999999887754 Q ss_pred c----eeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 L----VVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ~----~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) . .++.......+.|++||+.++++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++ T Consensus 148 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSKVT 223 (392) T ss_pred CceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH----HHHHHHHHHHHHHHHH Confidence 2 34555666788999999999976 5899999999999999999999999988765 8999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh-h-hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA-T-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~-~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~ 433 (517) ++.+|++|+|++++. +..+++++++++... . .+..+++|||||++|.+|++|||++|||||+ T Consensus 224 ~d~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~ 287 (392) T protein:vir:10 224 RNVLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ 287 (392) T ss_pred HHHHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee Confidence 999999999976542 123456777776532 2 3446789999999999999999999999999 Q ss_pred CCCCCCccceecCccceecccc---------CCce--eeeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVA---------VDEK--TAVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPIS 497 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~---------~~~~--~~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvg 497 (517) ++.+.+.+.+++|.+++++... .++. ..++|+. |.+..+.++... .+..+++|++.|+++.|+| T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 9999999999999877764321 1121 2235555 444455555432 1234688999999999999 Q ss_pred ceeecccceEEEEeCCC-----CCC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPP-----VAG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~-----~a~ 517 (517) +.+.+|++|++++++++ .+| T Consensus 368 ~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cEEecccceEEEEecccccccCCCC Confidence 99999999999988554 456 No 44 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=3.4e-44 Score=258.89 Aligned_cols=352 Identities=14% Similarity=0.150 Sum_probs=215.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHH----Hhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESE----KILGVEALKVT 208 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 208 (517) +..+..+...++......++...+.. +.++ .+...++...++.+++...+.. ........... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~-~~~e------------~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED-KVAE------------AEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVET 67 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH-HHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 11111111111111111111111000 0000 0111111111221111111000 00000000000 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHH--------hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAW--------TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT 280 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~ 280 (517) .........+................... .........+++.+|+.+...|++.+++.+++++++++.++++ T Consensus 68 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 147 (392) T protein:vir:10 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) T ss_pred cCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccC Confidence 00011111111222222211111111101 1111223446788999999999999999999999999887754 Q ss_pred c----eeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 L----VVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ~----~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) . .++.......+.|++||+.++++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++ T Consensus 148 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSKVT 223 (392) T ss_pred CceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH----HHHHHHHHHHHHHHHH Confidence 2 34555666788999999999976 5899999999999999999999999988765 8999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh-h-hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA-T-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~-~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~ 433 (517) ++.+|++|+|++++. +..+++++++++... . .+..+++|||||++|.+|++|||++|||||+ T Consensus 224 ~d~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~ 287 (392) T protein:vir:10 224 RNVLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ 287 (392) T ss_pred HHHHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee Confidence 999999999976542 123456777776532 2 3446789999999999999999999999999 Q ss_pred CCCCCCccceecCccceecccc---------CCce--eeeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVA---------VDEK--TAVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPIS 497 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~---------~~~~--~~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvg 497 (517) ++.+.+.+.+++|.+++++... .++. ..++|+. |.+..+.++... .+..+++|++.|+++.|+| T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 9999999999999877764321 1121 2235555 444455555432 1234688999999999999 Q ss_pred ceeecccceEEEEeCCC-----CCC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPP-----VAG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~-----~a~ 517 (517) +.+.+|++|++++++++ .+| T Consensus 368 ~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cEEecccceEEEEecccccccCCCC Confidence 99999999999988554 456 No 45 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=3.4e-44 Score=258.89 Aligned_cols=352 Identities=14% Similarity=0.150 Sum_probs=215.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHH----Hhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESE----KILGVEALKVT 208 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 208 (517) +..+..+...++......++...+.. +.++ .+...++...++.+++...+.. ........... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~-~~~e------------~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED-KVAE------------AEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVET 67 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH-HHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 11111111111111111111111000 0000 0111111111221111111000 00000000000 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHH--------hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAW--------TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT 280 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~ 280 (517) .........+................... .........+++.+|+.+...|++.+++.+++++++++.++++ T Consensus 68 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 147 (392) T protein:vir:10 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) T ss_pred cCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccC Confidence 00011111111222222211111111101 1111223446788999999999999999999999999887754 Q ss_pred c----eeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 L----VVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ~----~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) . .++.......+.|++||+.++++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++ T Consensus 148 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSKVT 223 (392) T ss_pred CceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH----HHHHHHHHHHHHHHHH Confidence 2 34555666788999999999976 5899999999999999999999999988765 8999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh-h-hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA-T-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~-~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~ 433 (517) ++.+|++|+|++++. +..+++++++++... . .+..+++|||||++|.+|++|||++|||||+ T Consensus 224 ~d~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~ 287 (392) T protein:vir:10 224 RNVLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ 287 (392) T ss_pred HHHHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee Confidence 999999999976542 123456777776532 2 3446789999999999999999999999999 Q ss_pred CCCCCCccceecCccceecccc---------CCce--eeeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVA---------VDEK--TAVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPIS 497 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~---------~~~~--~~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvg 497 (517) ++.+.+.+.+++|.+++++... .++. ..++|+. |.+..+.++... .+..+++|++.|+++.|+| T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 9999999999999877764321 1121 2235555 444455555432 1234688999999999999 Q ss_pred ceeecccceEEEEeCCC-----CCC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPP-----VAG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~-----~a~ 517 (517) +.+.+|++|++++++++ .+| T Consensus 368 ~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cEEecccceEEEEecccccccCCCC Confidence 99999999999988554 456 No 46 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=3.4e-44 Score=258.89 Aligned_cols=352 Identities=14% Similarity=0.150 Sum_probs=215.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHH----Hhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESE----KILGVEALKVT 208 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 208 (517) +..+..+...++......++...+.. +.++ .+...++...++.+++...+.. ........... T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~-~~~e------------~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGED-KVAE------------AEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVET 67 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHH-HHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 11111111111111111111111000 0000 0111111111221111111000 00000000000 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHH--------hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAW--------TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT 280 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~ 280 (517) .........+................... .........+++.+|+.+...|++.+++.+++++++++.++++ T Consensus 68 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~ 147 (392) T protein:vir:10 68 RNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRT 147 (392) T ss_pred cCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccC Confidence 00011111111222222211111111101 1111223446788999999999999999999999999887754 Q ss_pred c----eeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 L----VVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ~----~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) . .++.......+.|++||+.++++ .++|+++++.++++++++++|++++.|+.++ |++||.++|+++++++ T Consensus 148 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~i~~~ 223 (392) T protein:vir:10 148 RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQN----ILKYVTKWLGKKSKVT 223 (392) T ss_pred CceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHH----HHHHHHHHHHHHHHHH Confidence 2 34555666788999999999976 5899999999999999999999999988765 8999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHh-h-hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA-T-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP 433 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~-~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~ 433 (517) ++.+|++|+|++++. +..+++++++++... . .+..+++|||||++|.+|++|||++|||||+ T Consensus 224 ~d~~~~~g~g~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~ 287 (392) T protein:vir:10 224 RNVLILGVIEKLTKQ----------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQ 287 (392) T ss_pred HHHHHhhcccccccc----------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEee Confidence 999999999976542 123456777776532 2 3446789999999999999999999999999 Q ss_pred CCCCCCccceecCccceecccc---------CCce--eeeecCc-eEEEeeeheeeh----hhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVA---------VDEK--TAVSLSG-YVTNGSRGMEFE----QGTILVENNKEYLFEMPIS 497 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~---------~~~~--~~~~~~~-~~~~~~~~~~~~----~d~~~~~n~~~~~~~~rvg 497 (517) ++.+.+.+.+++|.+++++... .++. ..++|+. |.+..+.++... .+..+++|++.|+++.|+| T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d 367 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD 367 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 9999999999999877764321 1121 2235555 444455555432 1234688999999999999 Q ss_pred ceeecccceEEEEeCCC-----CCC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPP-----VAG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~-----~a~ 517 (517) +.+.+|++|++++++++ .+| T Consensus 368 ~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 368 VQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred cEEecccceEEEEecccccccCCCC Confidence 99999999999988554 456 No 47 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=8.2e-44 Score=256.79 Aligned_cols=363 Identities=13% Similarity=0.083 Sum_probs=215.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhh Q lcl|Aclame:pro 125 NKNAVVTYFREEKKKEENKMTFDQNLMQELLD--AKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGV 202 (517) Q Consensus 125 ~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e--~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (517) --..++..++++..+...++.......+.... ..++.+++.++++++. +.+.++..++...+........ T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~--------~ei~~l~~~~~~~e~~~e~~~~ 72 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAK--------ANLVEAENDLKLYESSVEVGGA 72 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHhhhhcc Confidence 11112222222222222111111111100000 0000111111111111 1111111111111110000000 Q ss_pred h------hhhhhhhhh----hHHHHHHHHHHHHhhccch-hhHHH--------HhhhhhcccccccccchhhhhhHHHhH Q lcl|Aclame:pro 203 E------ALKVTPEAT----EFLKTREAEVAYMSASLTK-DPKAA--------WTAELKERGISGMPAPAGILKRIQDAV 263 (517) Q Consensus 203 ~------~~~~~~~~~----~~~~~~~~~~~~~~~~~~~-~~~~~--------~~~~~~~~~~~~~~vp~~i~~~i~~~~ 263 (517) . ......+.+ .+.+............... +.... ..........+++.+|+.+...|++.+ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~ 152 (394) T protein:vir:97 73 ENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREV 152 (394) T ss_pred ccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHh Confidence 0 000000000 0000000000000000000 00000 000111233467889999999999999 Q ss_pred hhhhhhhhceeeecccc--ceeeeec-ccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHH Q lcl|Aclame:pro 264 NDEGSLLPFIRHENLPT--LVVGGDN-ALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGA 339 (517) Q Consensus 264 ~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~ 339 (517) +..+++++++++.++++ ..+|... ....+.|++||+..|+ ++++|+.+++.++++++++++|++++.|+.++ T Consensus 153 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~---- 228 (394) T protein:vir:97 153 KTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVD---- 228 (394) T ss_pred hhhhhhhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHH---- Confidence 99999999999877654 3455443 4456889999999996 56999999999999999999999999998775 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHH Q lcl|Aclame:pro 340 ILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAA 419 (517) Q Consensus 340 l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~ 419 (517) |++||.++|+++++.+++.+|++|.|++++. +....+++++++.....++.+++|||||++|.+ T Consensus 229 ~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~ 292 (394) T protein:vir:97 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTK----------------TVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQT 292 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccccc----------------ccccHHHHHHHHHhhhhhhhCCEEEEcHHHHHH Confidence 8999999999999999999999998765431 223467788888776677778999999999999 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCccceec-cccCCceee--eecCc-eEEEeeeheeeh-hhhhcccchHHHHHhh Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQ-SVAVDEKTA--VSLSG-YVTNGSRGMEFE-QGTILVENNKEYLFEM 494 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~-~~~~~~~~~--~~~~~-~~~~~~~~~~~~-~d~~~~~n~~~~~~~~ 494 (517) |++|||++|||||+++++++.+.+++|.++++. ...+++..+ ++++. |.+..+.++..- .++ ..+...+++.. T Consensus 293 l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 370 (394) T protein:vir:97 293 LDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN--EIYGQYLQAVL 370 (394) T ss_pred HHHhhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc--cccceeEEEEE Confidence 999999999999999999998899999876653 344555443 55555 445545444321 222 22344678999 Q ss_pred hhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 495 PISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 495 rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) |+|+.|.+|++|+++++||+.+= T Consensus 371 r~d~~v~~~~a~~~~~~~~~~~p 393 (394) T protein:vir:97 371 RFGVSKVDDKAGYYVTFTPEPLP 393 (394) T ss_pred EEccEEecccceEEEEecccccC Confidence 99999999999999999887555 No 48 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=2.6e-44 Score=259.58 Aligned_cols=342 Identities=14% Similarity=0.137 Sum_probs=212.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHh--hhhhhhhhhhh Q lcl|Aclame:pro 133 FREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKI--LGVEALKVTPE 210 (517) Q Consensus 133 vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 210 (517) +..+..+...+.....+.++.... +...+..+....++..++.++...+..... ........... T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~-------------~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 67 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLA-------------ENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKP 67 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhh-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 111111111111111111111000 000000111111112222222211111000 00000000000 Q ss_pred hhhHH-HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc----eeee Q lcl|Aclame:pro 211 ATEFL-KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL----VVGG 285 (517) Q Consensus 211 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~----~~~~ 285 (517) ..... ...+.+..+..+ ..+++... .....+++.+|+.+...|++.++..+++++++++.++++. .++. T Consensus 68 ~~~~~~~~~~~~~~~l~~----~~~~a~~~--~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~ 141 (371) T protein:vir:81 68 TVQVKENEVEAFVNHIRT----RFRNAMSE--GSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK 141 (371) T ss_pred chhhHHHHHHHHHHHHHH----HHHHhhcc--CCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe Confidence 00000 111111111111 11222222 2234458899999999999999999999999988777542 2444 Q ss_pred ecccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|Aclame:pro 286 DNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG 364 (517) Q Consensus 286 ~~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~ 364 (517) ......+.|++||+.+|+ +.++|++++++++++++++++|++++.|+.++ |++||.++|+++++++++.+|++|+ T Consensus 142 ~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~a~~~~~~~~i~~g~ 217 (371) T protein:vir:81 142 RSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEA----IVNTLVRWIGDESRVTRNGLIINVL 217 (371) T ss_pred ecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 555678899999999986 67999999999999999999999999988765 9999999999999999999999999 Q ss_pred ccCcccccccccccccccccccccccHHHHHHHHHHh--hhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccc Q lcl|Aclame:pro 365 VTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVA--TPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIA 442 (517) Q Consensus 365 G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~--~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~ 442 (517) |++.+. + ....++++..+... ..+..+++|||||++|.+|++|||++|||||+++++.+.+. T Consensus 218 g~~~~~-~---------------~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~ 281 (371) T protein:vir:81 218 NTKAKT-A---------------IADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGR 281 (371) T ss_pred cccccc-c---------------cccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCc Confidence 987652 2 22345666655432 23456789999999999999999999999999999999999 Q ss_pred eecCccceeccccCC--------------ceeeeecCce-EEEeeeheeehh----hhhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 443 THFGFNRLVQSVAVD--------------EKTAVSLSGY-VTNGSRGMEFEQ----GTILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 443 ~l~g~~~v~~~~~~~--------------~~~~~~~~~~-~~~~~~~~~~~~----d~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) +++|.++++.. .++ ....++++.| .+..+.++.... ...+.+|++.|+++.|+++.+.+| T Consensus 282 ~l~G~pV~~~~-~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~ 360 (371) T protein:vir:81 282 QLLGLPVVIVS-NKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDD 360 (371) T ss_pred eecceeEEEec-ccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc Confidence 99997766543 222 1223455554 444455554322 123578999999999999999999 Q ss_pred cceEEEEeCCC Q lcl|Aclame:pro 504 GTTAYGTYTPP 514 (517) Q Consensus 504 ~a~~~~~~tp~ 514 (517) +||++++++.+ T Consensus 361 ~a~~~~~~~~A 371 (371) T protein:vir:81 361 EAFVFGEVQLA 371 (371) T ss_pred cceEEEEEecC Confidence 99999999999 No 49 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=6.9e-44 Score=257.22 Aligned_cols=362 Identities=12% Similarity=0.039 Sum_probs=214.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhh-------hh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL-------GV 202 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-------~~ 202 (517) ++.+++...+....+++....+.+.........+ +++++..+..+ ..+....++.++...+...... .. T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e---~~~~l~~ei~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 76 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVD---DFQKIKDDLTA-AKARRDAINDQIKALEAEKPAEPKTEPKDDG 76 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHH---HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 3333332222222222222222111111000000 01111111111 1111111111111111100000 00 Q ss_pred hhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-- Q lcl|Aclame:pro 203 EALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-- 280 (517) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-- 280 (517) .....................++.... ...+... ......+++.+|+.+...|++.++..+++++++++.++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~lr~~~--~~~~~~~--~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~ 152 (389) T protein:vir:10 77 SKKGTDLSKKPIDAKKKAINDFIHSHG--KVIDATS--KVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPK 152 (389) T ss_pred cccccccchhHHHHHHHHHHHHhhcch--hhhhhhc--ccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCe Confidence 000000000010111112222222111 1111111 1223455889999999999999999999999999887754 Q ss_pred ceeeeec-ccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 281 LVVGGDN-ALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNR 358 (517) Q Consensus 281 ~~~~~~~-~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~ 358 (517) ..++... ....+.|++|++.+++ ++++|+.+++.++++++++++|++++.|+.++ |++||.++|+++++.+++. T Consensus 153 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~~~~~~~ 228 (389) T protein:vir:10 153 GTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVD----LTALVGQSIKEKSVNTYNA 228 (389) T ss_pred eEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHH----HHHHHHHHHHHHHHHHHHH Confidence 3444433 3456678899988885 78999999999999999999999999998775 8999999999999999999 Q ss_pred hhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCC Q lcl|Aclame:pro 359 AIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSN 438 (517) Q Consensus 359 ~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~ 438 (517) +|++|+|++.+. ..++....|++.+.+........+++|||||++|.+|++|||++|||||+++... T Consensus 229 ~i~~g~~~~~~~-------------~~~~~~~~d~l~~~~~~~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~ 295 (389) T protein:vir:10 229 MIAPVLQSFTAK-------------KTTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDKNGRYLLHDASDS 295 (389) T ss_pred HHhhhhcccccc-------------cccccccHHHHHHHHHhhhhhhhCcEEEecHHHHHHHHHhhccCCCeeeecCccc Confidence 999998876432 1233445678888777655555689999999999999999999999999887654 Q ss_pred ----CccceecCccceecccc-CCc------eeeeecCc-eEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccce Q lcl|Aclame:pro 439 ----QTIATHFGFNRLVQSVA-VDE------KTAVSLSG-YVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTT 506 (517) Q Consensus 439 ----~~~~~l~g~~~v~~~~~-~~~------~~~~~~~~-~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~ 506 (517) +...+++|.++++.+.. ++. ...++++. |.+..+.++......+ ......+++..|+||.+.+|+|| T Consensus 296 ~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~r~d~~~~~~~a~ 374 (389) T protein:vir:10 296 ITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDS-KIYGKYLGAAFRFGVQKADSKAG 374 (389) T ss_pred ccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecc-ccccceEEEEEEeccEEecccce Confidence 33457899877654432 222 23456665 5666555554321111 22234567778999999999999 Q ss_pred EEEEeCCCCCC Q lcl|Aclame:pro 507 AYGTYTPPVAG 517 (517) Q Consensus 507 ~~~~~tp~~a~ 517 (517) +++++++..++ T Consensus 375 ~~~~~~~~~~~ 385 (389) T protein:vir:10 375 YFVTNTDVPGS 385 (389) T ss_pred EEEEeeccCCC Confidence 99999887666 No 50 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=9.5e-44 Score=256.44 Aligned_cols=361 Identities=11% Similarity=0.055 Sum_probs=216.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhH----HHh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRES----EKI 199 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~----~~~ 199 (517) +|- ..++++..+...+++...+.+++.... ....+.....++. +...+.+..+.......+.. ... T Consensus 1 M~~----~eL~~~~~~~~~~~~~l~e~~~~~~~~-~~~~~~~~~~ee~-----~~l~~~i~~~~~~~~~~~~~~~~~~~~ 70 (395) T protein:vir:38 1 MNI----NQLKDAFDMAGQKVQDLEDKRAQFAID-LGNDASSHSVDDI-----NKLNASLKNAKMAQELAKSAYEDARAN 70 (395) T ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HhhhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 111 111111111111111111111110000 0000000000000 00000111111100000000 000 Q ss_pred hhh-hhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc Q lcl|Aclame:pro 200 LGV-EALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL 278 (517) Q Consensus 200 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~ 278 (517) ... ............... ............+..........+.+++.+|+.+...|++.++..+++++++++.++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 146 (395) T protein:vir:38 71 LNAEPVNKKPLPVKDGKPD----AQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENV 146 (395) T ss_pred hhhccccccccchhhhhHH----HHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeec Confidence 000 000000000000000 001111111222333333344445668899999999999999999999999988766 Q ss_pred ccc--e--eeeec-ccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 PTL--V--VGGDN-ALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMV 352 (517) Q Consensus 279 ~~~--~--~~~~~-~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~ 352 (517) ++. . ++... ....+.|+.||+.+|++ .++|+.+++.++++++++++|++++.|+.++ |++||.++|++++ T Consensus 147 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~ 222 (395) T protein:vir:38 147 TTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDN----IIQWLVNWAAKKD 222 (395) T ss_pred cCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHH----HHHHHHHHHHHHH Confidence 432 2 22232 33567899999999976 5899999999999999999999999998775 8999999999999 Q ss_pred HHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCCE Q lcl|Aclame:pro 353 IMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGNY 430 (517) Q Consensus 353 ~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gry 430 (517) +++++.+|++|+|++.+..++ ...++++++++... .+..+++|+|||.+|.+|++|||++||| T Consensus 223 ~~~~~~~il~g~g~~~~~~~~---------------~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~ 287 (395) T protein:vir:38 223 VVTRNAKILEVMGKAPKKPTI---------------SQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRY 287 (395) T ss_pred HHHHHHHHhhccccccccccc---------------ccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCce Confidence 999999999999988765332 23456676665332 3456789999999999999999999999 Q ss_pred eccCCCCCCccceecCccceeccc-cC----Cce--eeeecCc-eEEEeeeheee--hh--hhhcccchHHHHHhhhhcc Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSV-AV----DEK--TAVSLSG-YVTNGSRGMEF--EQ--GTILVENNKEYLFEMPISG 498 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~-~~----~~~--~~~~~~~-~~~~~~~~~~~--~~--d~~~~~n~~~~~~~~rvgg 498 (517) ||++++..+.+.+++|.++++... ++ ++. ..++++. |.+..+.++.. .+ +..+.+|++.|+++.|+++ T Consensus 288 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~ 367 (395) T protein:vir:38 288 LMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDV 367 (395) T ss_pred eeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeecc Confidence 999999999999999987665432 22 222 2345565 55555555432 22 2347899999999999999 Q ss_pred eeecccceEEEEeCCCCCC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 499 ~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+|+||++++++++... T Consensus 368 ~~~~~~a~~~~~~~~~~~~ 386 (395) T protein:vir:38 368 QLIDDGAFAAASFKTVANQ 386 (395) T ss_pred EEecccceEEEEeecccCC Confidence 9999999999999877443 No 51 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=2.3e-43 Score=254.34 Aligned_cols=366 Identities=12% Similarity=0.075 Sum_probs=221.0 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (517) .|-+..+..++.+..+...++......+++..+..+ .++...+.++...+.. .......+++.++...+......... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~-~~~~~~~~~e~~~~~~-~l~~ei~~l~e~~~~~~~~~~~~~~~ 78 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGED-SEENLKKAEGVRAKYD-KAGKEIKDLEEKRDLYEAALKGNEQS 78 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 444444444443333333333322222222222111 1111111111111111 11112222222222111111100000 Q ss_pred hh-----hhhh----hhhhHHHHHHH---HHHH-------Hhhc--cchhhHHHHhhhhhcccccccccchhhhhhHHHh Q lcl|Aclame:pro 204 AL-----KVTP----EATEFLKTREA---EVAY-------MSAS--LTKDPKAAWTAELKERGISGMPAPAGILKRIQDA 262 (517) Q Consensus 204 ~~-----~~~~----~~~~~~~~~~~---~~~~-------~~~~--~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~ 262 (517) .. .... ........... .... .... .....+.... .......+++.+|+.+...|++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~gg~~vP~~~~~~ii~~ 157 (400) T protein:vir:38 79 SGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVN-AGVKAADAASTIPETISNTPQRE 157 (400) T ss_pred ccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHh-hcccccCCcccccHHHHHHHHHH Confidence 00 0000 00000000000 0000 0000 0011111111 11233446789999999999999 Q ss_pred Hhhhhhhhhceeeecccc--ceeeeec-ccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHH Q lcl|Aclame:pro 263 VNDEGSLLPFIRHENLPT--LVVGGDN-ALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAG 338 (517) Q Consensus 263 ~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~ 338 (517) ++..+++++++++.++++ ..+|... ..+.+.|++||+..|+ ++++|+++++.++++++++++|++++.|+.++ T Consensus 158 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~--- 234 (400) T protein:vir:38 158 LQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAID--- 234 (400) T ss_pred HHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHH--- Confidence 999999999999887754 3455543 4466889999998886 68999999999999999999999999998775 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHH Q lcl|Aclame:pro 339 AILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLA 418 (517) Q Consensus 339 ~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~ 418 (517) |++||.++|++++..+++.++++|+|++++. +....+++++.+......+.+++|||||++|. T Consensus 235 -~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~ 297 (400) T protein:vir:38 235 -LVGLIAQNGQQIKVNTTNGAVATLLKGFTAK----------------TISSVDDLKHINNVDLDPAYSRVIIASQSFYN 297 (400) T ss_pred -HHHHHHHHHHHHHHHHHHHhhhhcccccccc----------------ccccHHHHHHHHHhhhhhhhCcEEEEcHHHHH Confidence 9999999999999999999999998865431 22345677777777667777899999999999 Q ss_pred HHHHhhcCCCCEeccCCCCCCccceecCccceeccc-cC---Cce--eeeecCceEEE-eeeheee--hhhhhcccchHH Q lcl|Aclame:pro 419 AIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AV---DEK--TAVSLSGYVTN-GSRGMEF--EQGTILVENNKE 489 (517) Q Consensus 419 ~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~---~~~--~~~~~~~~~~~-~~~~~~~--~~d~~~~~n~~~ 489 (517) +|++|||++|||||++++..+.+.+++|.++++.+. +. ++. ..++++.|+++ .+.++.. .++ ..+... T Consensus 298 ~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~---~~~~~~ 374 (400) T protein:vir:38 298 FLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDD---QIYGQF 374 (400) T ss_pred HHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecc---ccccee Confidence 999999999999999999999889999987665431 11 222 23456664444 4444432 222 233456 Q ss_pred HHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 490 YLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 490 ~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) +++..|+|+.+.+|++|++++++|+= T Consensus 375 ~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 375 LQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEEEeccEEecccceEEEEeecCC Confidence 78889999999999999999997655 No 52 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=2.5e-42 Score=248.68 Aligned_cols=359 Identities=14% Similarity=0.095 Sum_probs=206.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhh----hhhhhhhhhh---------hHHHHHHHhhhhhhhHHHHHHHHhhHH Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKL----AADLNAKLKE---------RENGGDNAALKTVSELAANLMKQRESE 197 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~----~~e~~a~l~~---------~~~~~~e~~~~~~~~~~~~~~~~~~~~ 197 (517) .+.++ ....+++++....+.+..+..+. .+++...+++ .+.+. +.+...+..++.++.+.+... T Consensus 1 m~~k~--~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~-~~l~~~i~~l~~~i~~~~~~~ 77 (397) T protein:vir:96 1 MALKQ--LILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSA-DDLEKQVKDLDEKIAELQKEK 77 (397) T ss_pred CcHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Confidence 11110 00111111111111111111100 0011110000 00000 011111122222222111111 Q ss_pred Hhhhhhh----hhhhhhhhhHHHH-----HHHH--HHHHhhccchhhH--HHHhhhhhcccccccccchhhhhhHHHhHh Q lcl|Aclame:pro 198 KILGVEA----LKVTPEATEFLKT-----REAE--VAYMSASLTKDPK--AAWTAELKERGISGMPAPAGILKRIQDAVN 264 (517) Q Consensus 198 ~~~~~~~----~~~~~~~~~~~~~-----~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~ 264 (517) ....... ............. .... ............+ .............++.+|+.+...|.+ +. T Consensus 78 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~ 156 (397) T protein:vir:96 78 QDLEDELAKAADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PK 156 (397) T ss_pred HHHHHHHHhhhhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hh Confidence 0000000 0000000000000 0000 0000000000000 000111122345578899999998887 46 Q ss_pred hhhhhhhceeeecccc--ceeee-ecccccceeeeccccccc-ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHH Q lcl|Aclame:pro 265 DEGSLLPFIRHENLPT--LVVGG-DNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAI 340 (517) Q Consensus 265 ~~~~~~~~~~~~~~~~--~~~~~-~~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l 340 (517) ...++++++++.+++. ...+. ......+.|+.|++..|+ ++++|+.+++.++++++++++|++++.|+.++ | T Consensus 157 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~----l 232 (397) T protein:vir:96 157 DIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYD----V 232 (397) T ss_pred hhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHH----H Confidence 6778888888766543 33333 334567789999999986 68999999999999999999999999998765 8 Q ss_pred HHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHHH Q lcl|Aclame:pro 341 LTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAI 420 (517) Q Consensus 341 ~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~l 420 (517) ++||.++|++.++.+++.+|++|+|++++. +..+.|++++++......+.+++|||||++|..| T Consensus 233 ~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~----------------~~~~~d~~~~~~~~~~~~~~~a~~v~n~~~~~~l 296 (397) T protein:vir:96 233 TGLIADEIQDQSLNTKNADIAAVLKTATAK----------------SVVGVDGLKDLINKEIKKVYDVKLFISASMYSEL 296 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccccc----------------cccchHHHHHHHHHhhhhhcCcEEEEcHHHHHHH Confidence 999999999999999999999999876542 2234677888887777777899999999999999 Q ss_pred HHhhcCCCCEeccCCCCCCccceecCccceeccccCC-----c--eeeeecCce-EEEeeeheeehhhhhcccchHHHHH Q lcl|Aclame:pro 421 RFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVD-----E--KTAVSLSGY-VTNGSRGMEFEQGTILVENNKEYLF 492 (517) Q Consensus 421 ~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~-----~--~~~~~~~~~-~~~~~~~~~~~~d~~~~~n~~~~~~ 492 (517) ++|||++|||||+++++.+.+.+++|.|+++.+..++ + ...++|+.| .+..+.++...... .......+++ T Consensus 297 ~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 375 (397) T protein:vir:96 297 DKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVD-NNIYGQLLAG 375 (397) T ss_pred HHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEec-ccccceeEEE Confidence 9999999999999999999899999988776543222 2 233467764 44555554432111 1222345678 Q ss_pred hhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 493 EMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 493 ~~rvgg~v~~~~a~~~~~~tp~ 514 (517) ..|+||.|.+|+||+.+++|.| T Consensus 376 ~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 376 IIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEccEEecccceEEEEeecC Confidence 8999999999999999999999 No 53 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1e-41 Score=245.34 Aligned_cols=387 Identities=12% Similarity=0.029 Sum_probs=202.4 Q ss_pred ehhhhhh--hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhh---- Q lcl|Aclame:pro 112 CILAGGA--LTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSE---- 185 (517) Q Consensus 112 ~~l~EvS--~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~---- 185 (517) .+|-|+- +-..=+.-..++..+++.........++......+..+..+++.++..++++..........+.... T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~ 80 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLV 80 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111110 0000000001111111110000000000000000000001111111111111100000000000000 Q ss_pred --------HHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHH-----H---HHhhccchhhHH------HHhhhhhc Q lcl|Aclame:pro 186 --------LAANLMKQRESEKILGVEALKVTPEATEFLKTREAEV-----A---YMSASLTKDPKA------AWTAELKE 243 (517) Q Consensus 186 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~---~~~~~~~~~~~~------~~~~~~~~ 243 (517) ...+..........................+...... . .........+.. ........ T Consensus 81 ~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 160 (437) T protein:vir:10 81 APELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIA 160 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcc Confidence 0000000000000000000000000000000000000 0 000000000000 00111123 Q ss_pred ccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeee-cccccceeeeccccccc-ccccceeeEeeHhhhh Q lcl|Aclame:pro 244 RGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGD-NALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVY 319 (517) Q Consensus 244 ~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~a~~~~eg~~~~~-~~~~f~~~~~~~~~~~ 319 (517) ....++.+|..+...|. .+.....++.++++.++++ ...+.. .....+.|+.|+...++ ++++|+++++.+++++ T Consensus 161 ~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~ 239 (437) T protein:vir:10 161 LKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYT 239 (437) T ss_pred cccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhhee Confidence 34567889999887665 4577888888888776643 344444 34467889999999986 5689999999999999 Q ss_pred HhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHH Q lcl|Aclame:pro 320 KYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLS 399 (517) Q Consensus 320 ~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~ 399 (517) +++++|++++.|+.++ |++||.++|+++++.+++.+||+|+|++.+. .+++...+++.+++. T Consensus 240 ~~~~is~ell~ds~~~----~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~--------------~~~~~~~~~~~~~~~ 301 (437) T protein:vir:10 240 GGYVFSQELISDSSYD----WQAELQSRLIELRDNTDDSLIITALTDGIKK--------------TTSTYLLGDLKKVLN 301 (437) T ss_pred eehhhhHHHHhhhHHH----HHHHHHHHHHHHHHHHHHHHHhhhhcccccc--------------cccccchhhHHHHHH Confidence 9999999999998775 9999999999999999999999999987542 112233456666665 Q ss_pred Hh--hhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccc-cCC-----ce--eeeecCce-E Q lcl|Aclame:pro 400 VA--TPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSV-AVD-----EK--TAVSLSGY-V 468 (517) Q Consensus 400 ~~--~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~-~~~-----~~--~~~~~~~~-~ 468 (517) .. ..+..+++|||||++|.+|++|||++|||||+++++.+.+.+++|.|++++.. .++ +. ..++|+.| . T Consensus 302 ~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~ 381 (437) T protein:vir:10 302 VTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVI 381 (437) T ss_pred hhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEE Confidence 33 23446789999999999999999999999999999999889999987766432 222 22 23456654 4 Q ss_pred EEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 469 TNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 469 ~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +.++.++......++..+...+++..|++|.|.+|+||++.+.++|... T Consensus 382 ~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~ 430 (437) T protein:vir:10 382 NFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVT 430 (437) T ss_pred EEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeeccccc Confidence 5555565543222234455677888899999999999999886644222 No 54 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.5e-43 Score=255.30 Aligned_cols=347 Identities=12% Similarity=0.060 Sum_probs=219.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) ..++ ++++++..+.+.+..+.. ++.. ..++ ..+..+...+ .+..++.+....+............ T Consensus 1 M~i~------~k~~~~~~~~~~~l~~~~---~~~~-~~ee-~~~~~~~~~~---~~~~~~~~~~~~e~~~~~~~~~~~~- 65 (377) T protein:vir:98 1 MAIN------LKELPKYREAVAELSAKI---SAGA-TSEE-QEKLFEAAFT---TMGDEILAKNEEEMERMFDLRDKNR- 65 (377) T ss_pred CCCc------HHHHHHHHHHHHHHHHHH---Hhhh-hhHH-HHHHHHHHHH---hHHHHHHHHHHHHHHHHHHhccCCc- Confidence 1111 111111111111111110 0000 0000 0000110000 0111111000000000000000000 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHHHHhhh--hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-ceeeeec Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKAAWTAE--LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-LVVGGDN 287 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 287 (517) ....+.++..... ......+++.+|+.+.++|++.+...++++++|++.++++ ..++... T Consensus 66 -----------------~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~~~~~~ 128 (377) T protein:vir:98 66 -----------------ELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKALTAE 128 (377) T ss_pred -----------------ccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcceEEEEec Confidence 0000111111111 1223456899999999999999999999999999888754 4677778 Q ss_pred ccccceeeeccccc-ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|Aclame:pro 288 ALTQGTGHTTGTDK-TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVT 366 (517) Q Consensus 288 ~~~~a~~~~eg~~~-~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~ 366 (517) ..+.+.|+.|+++. ++++++|+++++.++++++++++|+++++|+.+| |++||.++|+++|+++++.+||+|+|+ T Consensus 129 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~----ie~~i~~~la~~~a~~~~~a~i~G~G~ 204 (377) T protein:vir:98 129 TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKW----IKQFITEQLKEAIAVALELAIVKGDGL 204 (377) T ss_pred CCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhH----HHHHHHHHHHHHHHHHHhhceEeccCC Confidence 88889999997765 4678999999999999999999999999999887 999999999999999999999999999 Q ss_pred Cccccccccccccccc--ccc----cccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCC--- Q lcl|Aclame:pro 367 GVSETQIYPVVGDAWA--TNV----TGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGV--- 436 (517) Q Consensus 367 ~~~~~gi~~~~~~~~~--~~~----~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~--- 436 (517) ++| .||++..+.... ... +.....+.+.+...... .+..+++|+||+.++..+++|||.+|+|+|..++ T Consensus 205 ~qP-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~ 283 (377) T protein:vir:98 205 LQP-VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDR 283 (377) T ss_pred Ccc-eeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccch Confidence 876 689876532211 111 11112234444444433 3445689999999999999999999999994332 Q ss_pred -----------CCCccceecCccc-eeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhhhccee Q lcl|Aclame:pro 437 -----------SNQTIATHFGFNR-LVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMPISGSL 500 (517) Q Consensus 437 -----------~~~~~~~l~g~~~-v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~rvgg~v 500 (517) .+|...+++|++. ++.+..|++ ...++++.|.++.+.+++ .+++..+.++++.|++..|++|.+ T Consensus 284 ~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~ 363 (377) T protein:vir:98 284 WALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKA 363 (377) T ss_pred hhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEE Confidence 2455567888664 445555654 455678889998877654 456677889999999999999999 Q ss_pred ecccceEEEEeCCC Q lcl|Aclame:pro 501 EYKGTTAYGTYTPP 514 (517) Q Consensus 501 ~~~~a~~~~~~tp~ 514 (517) ++|+||++.+++-= T Consensus 364 ~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 364 KDNHTAALLTLAGG 377 (377) T ss_pred eccCcEEEEEEecC Confidence 99999999998754 No 55 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.5e-42 Score=249.86 Aligned_cols=368 Identities=11% Similarity=0.087 Sum_probs=218.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhh----hh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL----GV 202 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~----~~ 202 (517) .-++..+++...+...+++.....+.+.....+...+ .+++.+++.. ...+....+..++.+.+...... .. T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e---e~~~~~~~~~-~l~~~~~~l~~~~~~~e~~~~~~~~~~~~ 76 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDME---DIKQLETEKA-GLQQRFNIVERQVKDIEEKEKAKVKDTGE 76 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 2222222222222222222222222221111111000 0111111111 01111111111111111110000 00 Q ss_pred hhhhhhhhhhhHHHHHHHHHHHH-hh-------ccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhcee Q lcl|Aclame:pro 203 EALKVTPEATEFLKTREAEVAYM-SA-------SLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIR 274 (517) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~ 274 (517) .......+.+. .+......... .. ......+++.. ......+|+.+|+.+...|++.+++.++++++++ T Consensus 77 ~~~~~~~~~~~-~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~--~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~ 153 (387) T protein:vir:93 77 AYQSLNDHEKM-VKAKAEFYRHAILPNEFEKPSMEAQRLLHALP--TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKAR 153 (387) T ss_pred cCCCcchhhHH-HHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhc--cCcCCCCceeechhHHHHHHHHHHhhchhhhhee Confidence 00000000000 00000001000 00 00111112211 2233455899999999999999999999999999 Q ss_pred eeccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 275 HENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI 353 (517) Q Consensus 275 ~~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~ 353 (517) +.++++...|.. .....+.|++||+..++++++|+++++.++++++++++|++++.|+.+| |++||.++|+++++ T Consensus 154 v~~~~~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~----l~~~i~~~la~~~~ 229 (387) T protein:vir:93 154 LTNIKGLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LVNWVENALQSGLA 229 (387) T ss_pred eeecCCceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHH----HHHHHHHHHHHHHH Confidence 998887777653 4556789999999999999999999999999999999999999998876 99999999999999 Q ss_pred HHHHh-hhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHH-HHHhhcCCCCE Q lcl|Aclame:pro 354 MAVNR-AIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAA-IRFLKDKNGNY 430 (517) Q Consensus 354 ~~~e~-~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~-l~~lKD~~Gry 430 (517) .+++. .|.+|+|++.+ .|++...+.. .+++...+|+++++++.... +..+++|+||+.+|.. +++++|++|+| T Consensus 230 ~~e~~~~~~~g~g~g~p-~g~l~~~~~~---~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~ 305 (387) T protein:vir:93 230 AKERKDALAVSPKSGLD-HMSFYNGSVK---EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNF 305 (387) T ss_pred HHHHHhHhhcCCCcccc-ceeeeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcc Confidence 99765 56778888876 4655543322 22344457888888887654 4567899999999766 46667776665 Q ss_pred eccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEE Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGT 510 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~ 510 (517) ++ +.+.+++|.|+++ ...++....++|+.|.++.+ ++....+.+..++.+.|++..|+||.|++|+||++++ T Consensus 306 ~~------~~~~~llG~PV~~-~~~~~~~~~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~ 377 (387) T protein:vir:93 306 FD------TPAEKVFGKPVVF-TDAAVKPIVGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred cc------cCCccccccceEE-ecCCCceeeeehhhhheehh-hheeeecccccCCceeEEEEeeeCceeechhheEEEE Confidence 54 2345788875554 44566677778887765543 2222223345678899999999999999999999999 Q ss_pred eCCCCCC Q lcl|Aclame:pro 511 YTPPVAG 517 (517) Q Consensus 511 ~tp~~a~ 517 (517) +++|.+- T Consensus 378 ~k~~~~~ 384 (387) T protein:vir:93 378 AKENTGS 384 (387) T ss_pred eecCCCC Confidence 9877555 No 56 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1e-42 Score=250.73 Aligned_cols=370 Identities=11% Similarity=0.069 Sum_probs=221.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) .-++..+++...+...++++....+.+.....+...+. +.+.+++..+ ..+....+..++...+............ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~ee---i~~~~~~~~~-l~~~~~~l~~~~~~~e~~~~~~~~~~~~ 76 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMED---IKQLETEKAG-LQQRFNIVERQVQDIEEKEKAKVKDKGE 76 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 33333333333333222222222222111111110010 1111111111 1111111222211111111110000000 Q ss_pred hhhhhhhHHHHHHHHHHHHhh----c-------cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceee Q lcl|Aclame:pro 207 VTPEATEFLKTREAEVAYMSA----S-------LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRH 275 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~----~-------~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~ 275 (517) .........+...+...+... . ......++.. ......+|+.+|+.+...|++.++..+++++++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~--~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:94 77 AYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALP--TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred cCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhc--cCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 000000000000000000000 0 0001111111 12233458999999999999999999999999999 Q ss_pred eccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 276 ENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIM 354 (517) Q Consensus 276 ~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~ 354 (517) .++++...|+. .....+.|++||+..++++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++. T Consensus 155 ~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~~~~~ 230 (387) T protein:vir:94 155 TNIKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LVNWVENALQSGLAA 230 (387) T ss_pred eecCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHH----HHHHHHHHHHHHHHH Confidence 88887776653 3556789999999999999999999999999999999999999998876 999999999999999 Q ss_pred HHH-hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEec Q lcl|Aclame:pro 355 AVN-RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVF 432 (517) Q Consensus 355 ~~e-~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~ 432 (517) +++ ..|.+|+|++.+ .|++...+.. ..++...+|+++++++.... +..+++|+||+.+|..+.++++..|+|+| T Consensus 231 ~e~~~~~~~g~g~g~~-~g~~~~~~~~---~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:94 231 KERKDALAVSPKSGLE-HMSFYNGSVK---EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred HHHHhHhhcCCCcccc-ceeeeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccc Confidence 965 456778888776 4555443321 22344567889988887654 45688999999999887777777788887 Q ss_pred cCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 433 PVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 433 ~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) +. .+.++||.|+++ +..++....++|+.|.++.+ ++......+..++.+.|++..|++|.|++|+||+++.++ T Consensus 307 ~~-----~~~~llG~PV~~-~~~~~~~~~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:94 307 DT-----PAEKVFGKPVVF-TDAAVKPIVGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred cc-----CCccccccceEE-ecCCCceeeechhhhhhhhh-hhhheecccccCCceEEEEEEEeCcEeechhheEEEEee Confidence 53 346789875544 45566667777877655443 222212223457888999999999999999999999998 Q ss_pred CCCCC Q lcl|Aclame:pro 513 PPVAG 517 (517) Q Consensus 513 p~~a~ 517 (517) ++.+= T Consensus 380 a~~~~ 384 (387) T protein:vir:94 380 ENTGP 384 (387) T ss_pred cCCCC Confidence 87655 No 57 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1e-42 Score=250.73 Aligned_cols=370 Identities=11% Similarity=0.069 Sum_probs=221.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) .-++..+++...+...++++....+.+.....+...+. +.+.+++..+ ..+....+..++...+............ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~ee---i~~~~~~~~~-l~~~~~~l~~~~~~~e~~~~~~~~~~~~ 76 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMED---IKQLETEKAG-LQQRFNIVERQVQDIEEKEKAKVKDKGE 76 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 33333333333333222222222222111111110010 1111111111 1111111222211111111110000000 Q ss_pred hhhhhhhHHHHHHHHHHHHhh----c-------cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceee Q lcl|Aclame:pro 207 VTPEATEFLKTREAEVAYMSA----S-------LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRH 275 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~----~-------~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~ 275 (517) .........+...+...+... . ......++.. ......+|+.+|+.+...|++.++..+++++++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~--~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:96 77 AYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALP--TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred cCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhc--cCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 000000000000000000000 0 0001111111 12233458999999999999999999999999999 Q ss_pred eccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 276 ENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIM 354 (517) Q Consensus 276 ~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~ 354 (517) .++++...|+. .....+.|++||+..++++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++. T Consensus 155 ~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~~~~~ 230 (387) T protein:vir:96 155 TNIKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LVNWVENALQSGLAA 230 (387) T ss_pred eecCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHH----HHHHHHHHHHHHHHH Confidence 88887776653 3556789999999999999999999999999999999999999998876 999999999999999 Q ss_pred HHH-hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEec Q lcl|Aclame:pro 355 AVN-RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVF 432 (517) Q Consensus 355 ~~e-~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~ 432 (517) +++ ..|.+|+|++.+ .|++...+.. ..++...+|+++++++.... +..+++|+||+.+|..+.++++..|+|+| T Consensus 231 ~e~~~~~~~g~g~g~~-~g~~~~~~~~---~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:96 231 KERKDALAVSPKSGLE-HMSFYNGSVK---EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred HHHHhHhhcCCCcccc-ceeeeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccc Confidence 965 456778888776 4555443321 22344567889988887654 45688999999999887777777788887 Q ss_pred cCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 433 PVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 433 ~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) +. .+.++||.|+++ +..++....++|+.|.++.+ ++......+..++.+.|++..|++|.|++|+||+++.++ T Consensus 307 ~~-----~~~~llG~PV~~-~~~~~~~~~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:96 307 DT-----PAEKVFGKPVVF-TDAAVKPIVGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred cc-----CCccccccceEE-ecCCCceeeechhhhhhhhh-hhhheecccccCCceEEEEEEEeCcEeechhheEEEEee Confidence 53 346789875544 45566667777877655443 222212223457888999999999999999999999998 Q ss_pred CCCCC Q lcl|Aclame:pro 513 PPVAG 517 (517) Q Consensus 513 p~~a~ 517 (517) ++.+= T Consensus 380 a~~~~ 384 (387) T protein:vir:96 380 ENTGP 384 (387) T ss_pred cCCCC Confidence 87655 No 58 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1e-42 Score=250.73 Aligned_cols=370 Identities=11% Similarity=0.069 Sum_probs=221.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhh Q lcl|Aclame:pro 127 NAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALK 206 (517) Q Consensus 127 ~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (517) .-++..+++...+...++++....+.+.....+...+. +.+.+++..+ ..+....+..++...+............ T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~ee---i~~~~~~~~~-l~~~~~~l~~~~~~~e~~~~~~~~~~~~ 76 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMED---IKQLETEKAG-LQQRFNIVERQVQDIEEKEKAKVKDKGE 76 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 33333333333333222222222222111111110010 1111111111 1111111222211111111110000000 Q ss_pred hhhhhhhHHHHHHHHHHHHhh----c-------cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceee Q lcl|Aclame:pro 207 VTPEATEFLKTREAEVAYMSA----S-------LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRH 275 (517) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~----~-------~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~ 275 (517) .........+...+...+... . ......++.. ......+|+.+|+.+...|++.++..+++++++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~--~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:26 77 AYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALP--TGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred cCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhc--cCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 000000000000000000000 0 0001111111 12233458999999999999999999999999999 Q ss_pred eccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 276 ENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIM 354 (517) Q Consensus 276 ~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~ 354 (517) .++++...|+. .....+.|++||+..++++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++. T Consensus 155 ~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~~~~~ 230 (387) T protein:vir:26 155 TNIKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LVNWVENALQSGLAA 230 (387) T ss_pred eecCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHH----HHHHHHHHHHHHHHH Confidence 88887776653 3556789999999999999999999999999999999999999998876 999999999999999 Q ss_pred HHH-hhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEec Q lcl|Aclame:pro 355 AVN-RAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVF 432 (517) Q Consensus 355 ~~e-~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~ 432 (517) +++ ..|.+|+|++.+ .|++...+.. ..++...+|+++++++.... +..+++|+||+.+|..+.++++..|+|+| T Consensus 231 ~e~~~~~~~g~g~g~~-~g~~~~~~~~---~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~ 306 (387) T protein:vir:26 231 KERKDALAVSPKSGLE-HMSFYNGSVK---EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFF 306 (387) T ss_pred HHHHhHhhcCCCcccc-ceeeeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCccc Confidence 965 456778888776 4555443321 22344567889988887654 45688999999999887777777788887 Q ss_pred cCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 433 PVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 433 ~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) +. .+.++||.|+++ +..++....++|+.|.++.+ ++......+..++.+.|++..|++|.|++|+||+++.++ T Consensus 307 ~~-----~~~~llG~PV~~-~~~~~~~~~GDf~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:26 307 DT-----PAEKVFGKPVVF-TDAAVKPIVGDFNYFGINYD-GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred cc-----CCccccccceEE-ecCCCceeeechhhhhhhhh-hhhheecccccCCceEEEEEEEeCcEeechhheEEEEee Confidence 53 346789875544 45566667777877655443 222212223457888999999999999999999999998 Q ss_pred CCCCC Q lcl|Aclame:pro 513 PPVAG 517 (517) Q Consensus 513 p~~a~ 517 (517) ++.+= T Consensus 380 a~~~~ 384 (387) T protein:vir:26 380 ENTGP 384 (387) T ss_pred cCCCC Confidence 87655 No 59 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1.4e-42 Score=250.11 Aligned_cols=368 Identities=11% Similarity=0.068 Sum_probs=215.5 Q ss_pred hhhhhhhhh----hhhhhhhhhhhhhhhhhhhhhhhhhhhh-----------hhhhhHHHHHHHhhhhhhhHHHHHHHHh Q lcl|Aclame:pro 130 VTYFREEKK----KEENKMTFDQNLMQELLDAKKLAADLNA-----------KLKERENGGDNAALKTVSELAANLMKQR 194 (517) Q Consensus 130 I~~vk~~~~----~~~~~~~~~~~~~~~~~e~~~~~~e~~a-----------~l~~~~~~~~e~~~~~~~~~~~~~~~~~ 194 (517) .-++|+..+ ..++.+.+.+....+..+..+...++.+ ++.+.+++..+ ..+....+..++...+ T Consensus 1 ~~~~~~~~~~~~g~~mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~-l~~~~~~l~~~~~~~e 79 (402) T protein:vir:93 1 MRNFKNDNELLGGNEMPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAG-LQQRFNIVERQVQDIE 79 (402) T ss_pred CcchhhhhhcCCCCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 222222222 1111111111111111111111100000 01111111110 1111111111111111 Q ss_pred hHHHhhhhhhhhh----hhhhhhHHHHHHHHHHH-Hhhc-------cchhhHHHHhhhhhcccccccccchhhhhhHHHh Q lcl|Aclame:pro 195 ESEKILGVEALKV----TPEATEFLKTREAEVAY-MSAS-------LTKDPKAAWTAELKERGISGMPAPAGILKRIQDA 262 (517) Q Consensus 195 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~~~~-------~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~ 262 (517) ............. ...... .......+.. .... ......++. .......+|+.+|+++...|++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~--~~~t~~~GG~lIP~~~~~~Ii~~ 156 (402) T protein:vir:93 80 EKEKAKVKDKGEAYQSLSDNEKM-VKAKAEFYRHAILPNEFEKPSMEAQRLLHAL--PTGNDSGGDKLLPKTLSKEIVSE 156 (402) T ss_pred HHHHhhhhhccccCCCCchhHHH-HHHHHHHHHHHHhhhhHHHHHHhHHHHHhhh--ccCCCcCCccccchhHHHHHHHh Confidence 1111100000000 000000 0000000000 0000 000111111 11223345899999999999999 Q ss_pred Hhhhhhhhhceeeeccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHH Q lcl|Aclame:pro 263 VNDEGSLLPFIRHENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAIL 341 (517) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~ 341 (517) ++..+++++++++.++++...|+. .....+.|++||+..++++++|+++++.++++++++++|++++.|+.++ |+ T Consensus 157 ~~~~~~l~~~~~v~~~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~----l~ 232 (402) T protein:vir:93 157 PFAKNQLREKARLTNIKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LV 232 (402) T ss_pred HHhhhhhhhhceeeecCCceeeeeeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHH----HH Confidence 999999999999988887777653 4556789999999999999999999999999999999999999999876 89 Q ss_pred HHHHHHHHHHHHHHHHh-hhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHH Q lcl|Aclame:pro 342 TYVMNRLPDMVIMAVNR-AIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAA 419 (517) Q Consensus 342 ~~i~~~l~~~~~~~~e~-~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~ 419 (517) +||.++|+++++.+++. .|.+|+|++.+ .|++...+.. .+++...+|+++++++.... +..+++|+||+.+|.. T Consensus 233 ~~i~~~la~~~~~~e~~~~~~~g~g~g~p-~g~~~~~~~~---~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~ 308 (402) T protein:vir:93 233 NWVENALQSGLAAKERKDALAVSPKSGLE-HMSFYNGSVK---EVEGADMYDAIINALADLHEDYRDNATIYMRYADYVK 308 (402) T ss_pred HHHHHHHHHHHHHHHHHhHhhcCCCcccc-ceeeeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEechHHHH Confidence 99999999999998754 56778888876 4655544322 22344457888888887644 4468899999999888 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEee-eheeehhhhhcccchHHHHHhhhhcc Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGS-RGMEFEQGTILVENNKEYLFEMPISG 498 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~~~n~~~~~~~~rvgg 498 (517) +.++++.+|+|+|+. .+.++||.|++ +...++....++|+.|.++.+ ..+..+.+ ..++++.|++..|+|| T Consensus 309 ~~~~~~d~~~~~~~~-----~~~~llG~PV~-~t~~~~~i~~GDf~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~r~Dg 380 (402) T protein:vir:93 309 IISVLSNGTTNFFDT-----PAEKVFGKPVV-FTDAAVKPIVGDFNYFGINYDGTTYDTDKD--VKKGEYLFVLTAWYDQ 380 (402) T ss_pred HHHHHhcCCCccccc-----CCccccccceE-EecCCCceeeechhhhhhhhhhhhhhhhhc--ccCCceEEEEEEEeCc Confidence 766666667777742 34678987554 445566666677776554433 33333333 3568899999999999 Q ss_pred eeecccceEEEEeCCCCCC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 499 ~v~~~~a~~~~~~tp~~a~ 517 (517) .|.+|+||++++++++.+= T Consensus 381 ~v~~~~A~~~l~ik~~~~~ 399 (402) T protein:vir:93 381 QRTLDSAFRIAKAKENTGP 399 (402) T ss_pred EEechhheEEEEeecCCCC Confidence 9999999999999887444 No 60 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=8.9e-42 Score=245.63 Aligned_cols=379 Identities=12% Similarity=0.088 Sum_probs=213.1 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVE 203 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (517) +...-.++.++.+........+... .+..+.......+..++++. .+........+.......+......... T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~---~~~~~~~~e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTT---EQVQEIVAEARGLADALQAE----SDRAAARAALLRTAPPAPKGPADGGTPL 73 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 2222222222221111111111111 11101111111111111111 1111111111111111111100000000 Q ss_pred h---hhhhhhhhhHHH---HHHHHHHHHhhcc-chh---hHHHHhhhhhc-----ccccccccchhhhhhHHHhHhhhhh Q lcl|Aclame:pro 204 A---LKVTPEATEFLK---TREAEVAYMSASL-TKD---PKAAWTAELKE-----RGISGMPAPAGILKRIQDAVNDEGS 268 (517) Q Consensus 204 ~---~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~---~~~~~~~~~~~-----~~~~~~~vp~~i~~~i~~~~~~~~~ 268 (517) . ............ ............. ... ........... .......+|..+...+...+..... T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~ 153 (419) T protein:vir:94 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLL 153 (419) T ss_pred cccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhh Confidence 0 000000000000 0000000000000 000 11111111110 0111233455555555666677777 Q ss_pred hhhceeeeccccc--eeeee--------cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHH Q lcl|Aclame:pro 269 LLPFIRHENLPTL--VVGGD--------NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAG 338 (517) Q Consensus 269 ~~~~~~~~~~~~~--~~~~~--------~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~ 338 (517) +++++++.+..+. .++.. ...+.+.|++||+.+|+++++|+++++.++++++++++|+++++|+. T Consensus 154 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~----- 228 (419) T protein:vir:94 154 VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS----- 228 (419) T ss_pred hhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhHH----- Confidence 8888887766543 23332 23356789999999999999999999999999999999999998653 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccc------cccccccHHHHHHHHHHhh-hhhcCCEEE Q lcl|Aclame:pro 339 AILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWAT------NVTGTTNIQELLEKLSVAT-PKAADSTLV 411 (517) Q Consensus 339 ~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~------~~~~~~~~d~l~~~l~~~~-~~~~~a~~v 411 (517) .|++||.++|+++++.+++.+||+|+|++++ .|+++..+..... ..+.....+++.+++.... .++.+++|+ T Consensus 229 ~l~~~i~~~la~a~~~~~d~aii~G~G~~~p-~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v 307 (419) T protein:vir:94 229 QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEM-QGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVV 307 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcccc-cceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEE Confidence 4999999999999999999999999999765 6888765432211 2223334667777776654 445678999 Q ss_pred EcHHHHHHHHHhhcCCCC-EeccCCCCCCccceecCccceeccccCCceee--eecCc-eEEEeeeheee----hhhhhc Q lcl|Aclame:pro 412 IHRNDLAAIRFLKDKNGN-YVFPVGVSNQTIATHFGFNRLVQSVAVDEKTA--VSLSG-YVTNGSRGMEF----EQGTIL 483 (517) Q Consensus 412 mn~~~~~~l~~lKD~~Gr-yl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~--~~~~~-~~~~~~~~~~~----~~d~~~ 483 (517) |||.+|..|+++||++|+ |+|++++..+.+.+++|.++ +++..+++..+ ++++. |.+..+.++.. ..+.++ T Consensus 308 ~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~ 386 (419) T protein:vir:94 308 VHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNV-VSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFF 386 (419) T ss_pred EcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceee-EEcCCCCCccEEEeeccceEEEEEecceEEEEeccccchh Confidence 999999999999998655 77898888888899998754 44556665543 45665 55555555443 223347 Q ss_pred ccchHHHHHhhhhcceeecccceEEEEeCCCCC Q lcl|Aclame:pro 484 VENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 484 ~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a 516 (517) ++|++.|+++.|+++.+++|+||+++++++++- T Consensus 387 ~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 387 TANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred hcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 899999999999999999999999999999988 No 61 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=4.2e-41 Score=241.96 Aligned_cols=420 Identities=11% Similarity=0.044 Sum_probs=219.7 Q ss_pred Hhh-------cC-Ce-eEeeeeeecc-cCCCceEEEEehhhhhhhhhhhhhhhhhhhhhhhhhh---------------- Q lcl|Aclame:pro 85 IDK-------GA-GL-SVTFQPVEAS-EVDGVAYYKKCILAGGALTPNPSNKNAVVTYFREEKK---------------- 138 (517) Q Consensus 85 ~~~-------g~-~~-SiGf~~~~~~-~~~~~~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~---------------- 138 (517) |.. .. +| |+|.--.... +..| .+-+..+=+-....+....+...++...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~l~~~~~~~~~e~~~~ 76 (543) T protein:vir:81 1 MNTLDTLPVHPRTGLRAIGMGKRGPIWPVMG----ASDDHKDDAPTLTYSQARNRADEVHARMEQIAELDKPTDEENEEF 76 (543) T ss_pred CCccccCcCChhHHHHHHHhhccCccchhcc----cccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 00 11 3332110000 0000 00000111111111111111111111000 Q ss_pred --------hhhhhhhhhh--hhhhhhh---hhh---h-------hhh-----------hhh-----hhhhhhHHHHHHHh Q lcl|Aclame:pro 139 --------KEENKMTFDQ--NLMQELL---DAK---K-------LAA-----------DLN-----AKLKERENGGDNAA 179 (517) Q Consensus 139 --------~~~~~~~~~~--~~~~~~~---e~~---~-------~~~-----------e~~-----a~l~~~~~~~~e~~ 179 (517) +...+.++.. +..+... ++. . ..+ ... ..+.++..+..... T Consensus 77 ~~~~~e~~el~~~~~~l~~~e~~~~~~e~~~~~~~~~~~~~~e~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~ 156 (543) T protein:vir:81 77 RALGAEFDSLVNHMSRLERAAELARVRSTHEQIGKPQSGGQRRMRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDPWNL 156 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHHHHH Confidence 0000000000 0000000 000 0 000 000 00001000000000 Q ss_pred hh------hh----hhH----HHHHHHHhhHHHhhhhhhhhh--h---hhhh--------hHHHHHHHHHHHHhhcc--- Q lcl|Aclame:pro 180 LK------TV----SEL----AANLMKQRESEKILGVEALKV--T---PEAT--------EFLKTREAEVAYMSASL--- 229 (517) Q Consensus 180 ~~------~~----~~~----~~~~~~~~~~~~~~~~~~~~~--~---~~~~--------~~~~~~~~~~~~~~~~~--- 229 (517) .+ .. .++ ...+++............... . .+.. ...........+..... T Consensus 157 ~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 236 (543) T protein:vir:81 157 SEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAI 236 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHH Confidence 00 00 000 000000000000000000000 0 0000 00000000000000000 Q ss_pred -chhhHHHHh---hhhhcccccccccchhhhhhHH-HhHhhhhhhhhceeeecccc-ceeeeecccccceeeeccccccc Q lcl|Aclame:pro 230 -TKDPKAAWT---AELKERGISGMPAPAGILKRIQ-DAVNDEGSLLPFIRHENLPT-LVVGGDNALTQGTGHTTGTDKTE 303 (517) Q Consensus 230 -~~~~~~~~~---~~~~~~~~~~~~vp~~i~~~i~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~a~~~~eg~~~~~ 303 (517) ......... ........+++++|..+...++ +.+...+++..++++.+.++ ..+++......+.|++||+.+|+ T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~~a~~v~Eg~~~~~ 316 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVWHGVSSAAVQWSWDAEFEEVSD 316 (543) T ss_pred hhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceEEEEecCCcceeecccCccccc Confidence 000011111 1112234567889999887765 66788899999988776554 45667777788999999999999 Q ss_pred ccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccc Q lcl|Aclame:pro 304 SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWAT 383 (517) Q Consensus 304 ~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~ 383 (517) ++++|+++++.++++++++++|++++.|+ . .|.+||.++|+++++++++.+||+|+|++..++||++..+..... T Consensus 317 ~~~~~~~i~~~~~k~~~~~~is~ell~d~-~----~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~ 391 (543) T protein:vir:81 317 DSPEFGQPEIPVKKAQGFVPISIEALQDE-A----NVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAE 391 (543) T ss_pred cccccceeeeeeeeeEeeehhhHHHHhcc-H----HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccc Confidence 99999999999999999999999999865 2 499999999999999999999999999987888998765432221 Q ss_pred ---cccccccHHHHHHHHHHhhhh-hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc- Q lcl|Aclame:pro 384 ---NVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE- 458 (517) Q Consensus 384 ---~~~~~~~~d~l~~~l~~~~~~-~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~- 458 (517) ....+...++++.++...... ..+++|||||.+|..|+++||++|+|||++.. .+.+.+++|.++++. ..++. T Consensus 392 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~-~g~~~~l~G~pv~~~-~~~~~~ 469 (543) T protein:vir:81 392 IAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIG-NGEPSQLLGRPVGEA-EAMDAN 469 (543) T ss_pred ccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcC-CCCCccccceeeEEe-cccccc Confidence 122334456676666655444 45688999999999999999999999998644 455678888765544 33332 Q ss_pred -----------eeeeecCceEEEeeeheeeh------hhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 459 -----------KTAVSLSGYVTNGSRGMEFE------QGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 459 -----------~~~~~~~~~~~~~~~~~~~~------~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) ...++++.|+++.+.++.+. .++++.+|++.|+++.|+|+.|.+|+||+++++.++= T Consensus 470 ~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 470 WNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred ccccccCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 22356777777766565432 2345678889999999999999999999999985444 No 62 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=4.2e-42 Score=247.42 Aligned_cols=341 Identities=10% Similarity=0.047 Sum_probs=210.6 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 141 ENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREA 220 (517) Q Consensus 141 ~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (517) ++++++.. ++.++++++...++.+.. +++.+...................+....+....+. T Consensus 1 ~eei~~l~----------~~~~~l~~~~~~l~~~~d--------~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 62 (352) T protein:vir:78 1 MEDIKQLE----------TEKAGLQQRFNIVERQVQ--------DIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRH 62 (352) T ss_pred ChhHHHHH----------HHHHHHHHHHHHHHHHHH--------HHHHHHHHHhhhccccccccchhhhHHHHHHHHHHH Confidence 11111100 001111111111110000 000000000000000000000000000000000000 Q ss_pred HH--HHHhh--ccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccceeeee-cccccceee Q lcl|Aclame:pro 221 EV--AYMSA--SLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGD-NALTQGTGH 295 (517) Q Consensus 221 ~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~~ 295 (517) .. ..... .......++. .......+++.+|.++...|++.++..+++++++++.++.+...|+. .+...+.|+ T Consensus 63 ~~~~~~~~~~~~~~~~~~~al--~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~~~p~~~~~~~~a~~v 140 (352) T protein:vir:78 63 AILPNEFEKPSMEAQRLLHAL--PTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDFI 140 (352) T ss_pred HhhhhHHHHHHhhHHHHHHHh--ccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCceEEEEecCCCccccc Confidence 00 00000 0000111111 11233456899999999999999999999999999988877776664 344678999 Q ss_pred ecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHh-hhhcccccCccccccc Q lcl|Aclame:pro 296 TTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNR-AIIMGGVTGVSETQIY 374 (517) Q Consensus 296 ~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~-~~l~G~G~~~~~~gi~ 374 (517) +||+..++++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++.+++. .|.+|+|++.+. |++ T Consensus 141 ~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~----l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~-g~l 215 (352) T protein:vir:78 141 TDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVD----LVNWVENALQSGLAAKERKDALAVSPKSGLEH-MSF 215 (352) T ss_pred ccccccccccccceeeeecceeEEeechhhHHHHhhhhHH----HHHHHHHHHHHHHHHHHHHhhhhcCCCCcccc-cce Confidence 9999999999999999999999999999999999998876 9999999999999988655 566888887764 555 Q ss_pred ccccccccccccccccHHHHHHHHHHhhhh-hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceecc Q lcl|Aclame:pro 375 PVVGDAWATNVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQS 453 (517) Q Consensus 375 ~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~ 453 (517) ...+.. .+++...+|++++++...... ..+++|+||+.+|.+|.+++|.+|+|+|+. .+.++||.|++ +. T Consensus 216 ~~~~~~---~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~-----~~~~llG~PV~-~~ 286 (352) T protein:vir:78 216 YNGSVK---EVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT-----PAEKVFGKPVV-FT 286 (352) T ss_pred eccccc---cccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc-----CCccccccceE-Ee Confidence 443322 233444578888888876544 457899999999999999999999999953 34578886544 44 Q ss_pred ccCCceeeeecCceEEEee-eheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 454 VAVDEKTAVSLSGYVTNGS-RGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ..++....++|+.|.++.+ .....+.+ ..++++.|++..|++|.|.+|+||+++++.++-+- T Consensus 287 ~~~~~~~~Gdf~~~~~~~~~~~~~~~~~--~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~ 349 (352) T protein:vir:78 287 DAAVKPIVGDFNYFGINYDGTTYDTDKD--VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGS 349 (352) T ss_pred cCCCceeEeehhhhhhhhhhheeeeecc--ccCCeeEEEEEeeeCceeechhheEEEEeecccCC Confidence 5666666777877655433 22223333 35678899999999999999999999997665333 No 63 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=4.9e-41 Score=241.58 Aligned_cols=381 Identities=11% Similarity=0.080 Sum_probs=198.0 Q ss_pred hhhhhhhhhhhhhhhhhhhh----hhhhhhhhhhh------hhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQN----LMQELLDAKKL------AADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKI 199 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~----~~~~~~e~~~~------~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 199 (517) +++..+++...+.++++... .++...+..+. .+++.++++....+..+.. +...++.+++.+++..... T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei-~~le~~~~~~~~~~~~~~~ 79 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAEL-DKVEDLDEQIRELESEIER 79 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 22222222222222211111 11111111110 0011111111000000000 0001111111111110000 Q ss_pred h---hhhh--------hhhhh------hhhhHHHH-------------HHHHHHHH-hhccchhhHHH-----Hhhhh-h Q lcl|Aclame:pro 200 L---GVEA--------LKVTP------EATEFLKT-------------REAEVAYM-SASLTKDPKAA-----WTAEL-K 242 (517) Q Consensus 200 ~---~~~~--------~~~~~------~~~~~~~~-------------~~~~~~~~-~~~~~~~~~~~-----~~~~~-~ 242 (517) . .... ..... ........ ........ ......+.+.. ....+ . T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (477) T protein:vir:84 80 SGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDR 159 (477) T ss_pred hhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccc Confidence 0 0000 00000 00000000 00000000 00000000000 00011 1 Q ss_pred cccccccccchhh-hhhHHHhHhhhhhhhhceeeecccc----ceeeeecc-cccceeeeccc-----ccccccccceee Q lcl|Aclame:pro 243 ERGISGMPAPAGI-LKRIQDAVNDEGSLLPFIRHENLPT----LVVGGDNA-LTQGTGHTTGT-----DKTESNITLQTR 311 (517) Q Consensus 243 ~~~~~~~~vp~~i-~~~i~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~a~~~~eg~-----~~~~~~~~f~~~ 311 (517) ....+++.+|+.+ .+.|++.++..++++++++..++++ ..+|...+ ...+.|++||+ .+|+++++|+.+ T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 2233467787764 5779999999999999887766543 34555433 34567888885 457889999999 Q ss_pred EeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccc---c Q lcl|Aclame:pro 312 VLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTG---T 388 (517) Q Consensus 312 ~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~---~ 388 (517) +++++++++++++|++++.|+.++ |++||.++|+++++.++|.+||+|+|++.++.||++..+......... . T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~ 315 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVS----VDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSAL 315 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchh----HHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccch Confidence 999999999999999999998876 999999999999999999999999998877789998765432221111 1 Q ss_pred ccHHHHHHHHHH----hhhhh--cCCEEEEcHHHHHHHHHhhcCCCCEeccCC-------------CCCCccceecCccc Q lcl|Aclame:pro 389 TNIQELLEKLSV----ATPKA--ADSTLVIHRNDLAAIRFLKDKNGNYVFPVG-------------VSNQTIATHFGFNR 449 (517) Q Consensus 389 ~~~d~l~~~l~~----~~~~~--~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~-------------~~~~~~~~l~g~~~ 449 (517) ...+.+...+.. ....+ .+++|+|||++|..|++|||++|||||+++ +..+...+++|.++ T Consensus 316 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pV 395 (477) T protein:vir:84 316 EKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPV 395 (477) T ss_pred hhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccce Confidence 122233333332 22222 245799999999999999999999999876 23334456777655 Q ss_pred eeccccCCc----------eeeeecCceEEEeeeheeehhhh--hcccchHHHHHhhhhc-ceeecccceEEEEeCCCCC Q lcl|Aclame:pro 450 LVQSVAVDE----------KTAVSLSGYVTNGSRGMEFEQGT--ILVENNKEYLFEMPIS-GSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 450 v~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~d~--~~~~n~~~~~~~~rvg-g~v~~~~a~~~~~~tp~~a 516 (517) + .+..+|. ...++++.|+++. .++....+. +..+....|.+..+++ ..++.|++|+..|.+...| T Consensus 396 v-~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~ 473 (477) T protein:vir:84 396 V-TDPTLPTTLGTGTDQDVIHVLRASDLALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTA 473 (477) T ss_pred E-ecCcccccccccCCcceEEEEEeceEEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecccccc Confidence 4 4444442 2334555665543 344433222 2334444455555444 4677899999999876666 Q ss_pred C Q lcl|Aclame:pro 517 G 517 (517) Q Consensus 517 ~ 517 (517) - T Consensus 474 ~ 474 (477) T protein:vir:84 474 P 474 (477) T ss_pred c Confidence 5 No 64 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3.9e-41 Score=242.15 Aligned_cols=345 Identities=14% Similarity=0.102 Sum_probs=206.3 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhh Q lcl|Aclame:pro 130 VTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTP 209 (517) Q Consensus 130 I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (517) |..++++..+..+.. +... +..+...+..+..+.+.+.... +..+.....+... .. T Consensus 1 ik~L~e~~~e~~e~~---~~~~-~~~~~~~~~~e~~~~~~~~~~~-----------~~~~~~~~~~~~~---------~~ 56 (390) T protein:vir:40 1 MNNLDKKDSETLNIS---TAFL-NAIKEGATEAEQVTAFTNMAEQ-----------IQNNIIAQARKEV---------NR 56 (390) T ss_pred CchHHHHHHHHHHHH---HHHH-HHHhhhhhHHHHHHHHHHHHHH-----------HHHHHHHHHHHHH---------HH Confidence 222222211111000 0000 0000000000000111100000 0000000000000 00 Q ss_pred hhhhHHHHHHHHHHHHhhccchhhHHHHhhhh--hcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeee Q lcl|Aclame:pro 210 EATEFLKTREAEVAYMSASLTKDPKAAWTAEL--KERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGG 285 (517) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~ 285 (517) +........ .........+.++.....+ .....+++.+|+.+.+.|++.++..+++++++++.++.+. .+|. T Consensus 57 ~~~~~~~~~----~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~ 132 (390) T protein:vir:40 57 EMNDNNVLA----SRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIIS 132 (390) T ss_pred HHHHHHHHH----hcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEE Confidence 000000000 0000001111122111111 1223568899999999999999999999999999887653 4677 Q ss_pred ecccccceeeecccccc-cccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|Aclame:pro 286 DNALTQGTGHTTGTDKT-ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG 364 (517) Q Consensus 286 ~~~~~~a~~~~eg~~~~-~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~ 364 (517) ......+.|+.|++..+ .++++|+++++.++++++++++|++++.|+.++ |++||.++|+++++.+++.+||+|+ T Consensus 133 ~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~----l~~~i~~~la~~i~~~~~~a~l~G~ 208 (390) T protein:vir:40 133 VGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSW----LDQYVRTILGEAMALGLEAGIVNGS 208 (390) T ss_pred EcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhhhccc Confidence 77778899999988766 578999999999999999999999999998876 8999999999999999999999999 Q ss_pred ccCccccccccccccccccc----ccccccHH---HHHHHHHHh-----hhhhcCCEEEEcHHHH----HHHHHhhcCCC Q lcl|Aclame:pro 365 VTGVSETQIYPVVGDAWATN----VTGTTNIQ---ELLEKLSVA-----TPKAADSTLVIHRNDL----AAIRFLKDKNG 428 (517) Q Consensus 365 G~~~~~~gi~~~~~~~~~~~----~~~~~~~d---~l~~~l~~~-----~~~~~~a~~vmn~~~~----~~l~~lKD~~G 428 (517) |+++| .||++..+...... .....+.. ++...+..+ ...+.+++|+||+.++ ..++++||++| T Consensus 209 G~~~P-~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G 287 (390) T protein:vir:40 209 GKDQP-IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQG 287 (390) T ss_pred CCCcc-ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCC Confidence 99876 58887554221110 11111111 222222221 1234678999999884 35568999999 Q ss_pred CEeccCCCCCCccceecCccceeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhhhcceeeccc Q lcl|Aclame:pro 429 NYVFPVGVSNQTIATHFGFNRLVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMPISGSLEYKG 504 (517) Q Consensus 429 ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~rvgg~v~~~~ 504 (517) +|+|+.. .+|.+ ++.+..|++ ...++++.|.++.+.++. ..++..+.++++.|++..|++|.+++|+ T Consensus 288 ~~v~~~~--------~~g~p-vv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~ 358 (390) T protein:vir:40 288 VWVTGIL--------PVPLE-IVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNS 358 (390) T ss_pred ccccccC--------CCcee-EEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEeccc Confidence 9998543 24544 444555555 445677888877766654 3344567889999999999999999999 Q ss_pred ceEEEEeCCCCCC Q lcl|Aclame:pro 505 TTAYGTYTPPVAG 517 (517) Q Consensus 505 a~~~~~~tp~~a~ 517 (517) ||+++.++++- | T Consensus 359 A~~~l~~~~~~-~ 370 (390) T protein:vir:40 359 SFLVFDITGLE-G 370 (390) T ss_pred ceEEEEeeccC-C Confidence 99999988763 3 No 65 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.1e-40 Score=239.65 Aligned_cols=367 Identities=10% Similarity=0.026 Sum_probs=213.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAK--KLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILG 201 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~--~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 201 (517) +|-.-++..++++..+...+.+...+.++...... +..+.+.+++++++.+.... .+..........+......... T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~ 79 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEII-EEEIESVMTAIDEERKNTNFTG 79 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccc Confidence 44444444444433333322222221111111100 00111111111111111100 0011111111111110000000 Q ss_pred hhhhhhhhhhhhH--HHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 202 VEALKVTPEATEF--LKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 202 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) .. .......... .....++..+..+..... ..........+++++|..+...|++.+++.+++++++++.+++ T Consensus 80 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~ 154 (421) T protein:vir:13 80 GR-VIINGDSKEEKRSLQLSAMSKTIRGIQLSE----EERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVN 154 (421) T ss_pred cc-cccccchhHHHHHHHHHHHHHhhhccchhH----HHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeecc Confidence 00 0000000000 000111111111111111 1112233445688999999999999999999999999988775 Q ss_pred c--ceeeeeccc--ccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 T--LVVGGDNAL--TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 280 ~--~~~~~~~~~--~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) + ..++..... ..+.|+.||..+++++++|+.+++.++++++++++|++++.|+..+ |++||.++|++++..+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~----l~~~i~~~la~~~~~~ 230 (421) T protein:vir:13 155 RNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEIN----FLEFVNEEFAEFAVNT 230 (421) T ss_pred CCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHH----HHHHHHHHHHHHHHHH Confidence 4 344444333 3456688999999999999999999999999999999999988765 8999999999999999 Q ss_pred HHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccC Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPV 434 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~ 434 (517) ++.++++ . +.|+++.. +..+.+++++++..... ++.+++|||||.+|.+|++|||++|||||++ T Consensus 231 ~~~~i~~-----~-~~g~~~~~---------~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~ 295 (421) T protein:vir:13 231 ENAEIVK-----Q-AKAVLAEE---------TINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE 295 (421) T ss_pred hhhhHhh-----h-hhhccccc---------cccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 9877763 2 23444322 22346788888777544 4567899999999999999999999999976 Q ss_pred CCCCCccceecCccceeccccCCc-------eeeeecCc-eEEEeeehe--eehhhhhcccchHHHHHhhhhcceeeccc Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQSVAVDE-------KTAVSLSG-YVTNGSRGM--EFEQGTILVENNKEYLFEMPISGSLEYKG 504 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~~~~~~-------~~~~~~~~-~~~~~~~~~--~~~~d~~~~~n~~~~~~~~rvgg~v~~~~ 504 (517) +..+.+.+++|.+++..+ .++. ...++++. |.+..+.++ ...++.++.+|++.+++..|+++.+.+|+ T Consensus 296 -~~~~~~~tl~G~pV~~~~-~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~ 373 (421) T protein:vir:13 296 -LSDGGDLVFKGRPVIELE-ESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDK 373 (421) T ss_pred -cCCCCCceecceeeEEec-cccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecch Confidence 566677889997765443 3322 23455665 455555554 34455567899999999999999999999 Q ss_pred ceEEEEeCCCCCC Q lcl|Aclame:pro 505 TTAYGTYTPPVAG 517 (517) Q Consensus 505 a~~~~~~tp~~a~ 517 (517) ||+.....++.|- T Consensus 374 a~~~~~~~~~~a~ 386 (421) T protein:vir:13 374 SSDAEKIRKFGVI 386 (421) T ss_pred hhheeeeccccee Confidence 9765444433221 No 66 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1e-41 Score=245.34 Aligned_cols=282 Identities=9% Similarity=0.002 Sum_probs=207.4 Q ss_pred cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccc Q lcl|Aclame:pro 229 LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNI 306 (517) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~ 306 (517) +..+..++... ......+..+|+.+.+.+++.++..+++++++++.+..+ ..+|+......+.|++||+.+|++++ T Consensus 1 m~~~~~~a~~~--~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQV--ALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKG 78 (330) T ss_pred Ccccccchhhc--cccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCccccccc Confidence 11111111111 112223456777888999999999999999999887654 45777778888999999999999999 Q ss_pred cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccc--- Q lcl|Aclame:pro 307 TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWAT--- 383 (517) Q Consensus 307 ~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~--- 383 (517) +|+++++.++++++++++|+++++|+..+ +++||.++|++++++++++++|+|+|++++..|+++........ T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~~~----~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~ 154 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNPLN----YLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADT 154 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecc Confidence 99999999999999999999999988765 99999999999999999999999999999988887655322111 Q ss_pred -ccc----ccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCC-----ccceecCccceec Q lcl|Aclame:pro 384 -NVT----GTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQ-----TIATHFGFNRLVQ 452 (517) Q Consensus 384 -~~~----~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~-----~~~~l~g~~~v~~ 452 (517) ..+ .....+++..++.... .+..+++|+|||++|..|++|||++|||||+++...+ ...+++|.+.+ . T Consensus 155 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~-~ 233 (330) T protein:vir:77 155 NLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTY-V 233 (330) T ss_pred cccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeE-E Confidence 111 1122345555554432 3455789999999999999999999999998865544 34578886554 4 Q ss_pred cccCCce--------eeeecCceEEEeeehee--ehhhh------------------hcccchHHHHHhhhhcceeeccc Q lcl|Aclame:pro 453 SVAVDEK--------TAVSLSGYVTNGSRGME--FEQGT------------------ILVENNKEYLFEMPISGSLEYKG 504 (517) Q Consensus 453 ~~~~~~~--------~~~~~~~~~~~~~~~~~--~~~d~------------------~~~~n~~~~~~~~rvgg~v~~~~ 504 (517) +..+++. ..++++.|++..+.++. .+++. .+++|++.|+++.|+++.+.+|+ T Consensus 234 ~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 313 (330) T protein:vir:77 234 ADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKD 313 (330) T ss_pred eccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEeccc Confidence 4455542 23466777766655543 33332 15788999999999999999999 Q ss_pred ceEEEEeCCCCCC Q lcl|Aclame:pro 505 TTAYGTYTPPVAG 517 (517) Q Consensus 505 a~~~~~~tp~~a~ 517 (517) ||++.+...|+|= T Consensus 314 a~~~i~~~~~~~~ 326 (330) T protein:vir:77 314 AFVKLTDQVAGTD 326 (330) T ss_pred ceEEEEeccCCcC Confidence 9998765443333 No 67 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=9.5e-42 Score=245.49 Aligned_cols=275 Identities=11% Similarity=0.026 Sum_probs=211.5 Q ss_pred cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-eeeeecccccceeeeccccccccccc Q lcl|Aclame:pro 229 LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-VVGGDNALTQGTGHTTGTDKTESNIT 307 (517) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~eg~~~~~~~~~ 307 (517) .. ............+..+|+.+...|++.++..+++++++++.++++. ...+..+...+.|+.||+.+|+++++ T Consensus 1 ~g-----~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 75 (299) T protein:vir:41 1 MG-----FNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSGVGAFWVDEAERIQTSKPT 75 (299) T ss_pred CC-----cCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcCCceeeeecCccccccccc Confidence 00 0011111222345679999999999999999999999998887653 23334455778999999999999999 Q ss_pred ceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccc Q lcl|Aclame:pro 308 LQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTG 387 (517) Q Consensus 308 f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~ 387 (517) |+++++.++++++++++|+++++|+..+ |++||.++|++++++++|.++|+|+|++++ .|++...+......... T Consensus 76 f~~v~l~~~k~~~~~~is~ell~ds~~~----~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~-~gil~~~~~~~~~~~~~ 150 (299) T protein:vir:41 76 FTKAKMRSKKMGVIIPTTKENLNYSVTN----FFSLMQAEIVEAFYKKFDQAVFTGVESPYN-WNILKSATDASNLVEET 150 (299) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCccc-ccccccccccceeeccc Confidence 9999999999999999999999988765 999999999999999999999999998876 57877665444444445 Q ss_pred cccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc------ee Q lcl|Aclame:pro 388 TTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE------KT 460 (517) Q Consensus 388 ~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~------~~ 460 (517) ....+++++++.... .++.+++|+|||.+|.+|++|||++|||||++....+. .+++|.+.++ ++.++. .. T Consensus 151 ~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~~l~G~PV~~-~~~~~~~~~~~~~~ 228 (299) T protein:vir:41 151 ANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-DDVLGLPIAY-TPKYTFGDKDISEL 228 (299) T ss_pred cccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-ceecceeeEE-ecccCCCCCceEEE Confidence 556788888877643 44567899999999999999999999999998877655 4688866544 334432 23 Q ss_pred eeecCceEEEeeehe--eehhhh--------------hcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 461 AVSLSGYVTNGSRGM--EFEQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 461 ~~~~~~~~~~~~~~~--~~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) .++++.|.++.+.++ +..++. .+++|++.++++.|+|+.+++|+||++++...+= T Consensus 229 ~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 229 VGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 456777776665443 332221 1578889999999999999999999999876555 No 68 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=100.00 E-value=1.3e-40 Score=239.29 Aligned_cols=370 Identities=18% Similarity=0.204 Sum_probs=277.2 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHH-----------HHHHHhhhhhhhHHHHHHHHhhH Q lcl|Aclame:pro 128 AVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKEREN-----------GGDNAALKTVSELAANLMKQRES 196 (517) Q Consensus 128 A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~-----------~~~e~~~~~~~~~~~~~~~~~~~ 196 (517) -++.. ..-++..+.+.++.....++....++.... ...+.+.+.+.++..++++.+. T Consensus 1 ~~~s~-----------~~~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~- 68 (400) T protein:vir:93 1 MRISK-----------RNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN- 68 (400) T ss_pred Ccccc-----------cccccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhh- Confidence 11111 111111111111111111222222211110 0112233444555555555222 Q ss_pred HHhhhhhhhhhhhhhhhHHHHHHHHHHH----HhhccchhhHHHHhhhhhccccccccc----chhhhhhHHHhHhhhhh Q lcl|Aclame:pro 197 EKILGVEALKVTPEATEFLKTREAEVAY----MSASLTKDPKAAWTAELKERGISGMPA----PAGILKRIQDAVNDEGS 268 (517) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v----p~~i~~~i~~~~~~~~~ 268 (517) +........+-.+++.+|++.+++...+ +......+.++++...+.+.++++.++ |..++..|.+.+....+ T Consensus 69 eln~~~E~~Kgk~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~ 148 (400) T protein:vir:93 69 ELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNP 148 (400) T ss_pred hhhhhhhhcccchhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCC Confidence 2224445556667788999999998888 455556678889999999999987777 99999999999999999 Q ss_pred hhhceeeeccccceee-eecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHH Q lcl|Aclame:pro 269 LLPFIRHENLPTLVVG-GDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNR 347 (517) Q Consensus 269 ~~~~~~~~~~~~~~~~-~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~ 347 (517) ++++++++++|+..+. ...+...+.+|..|++|+++.++|...++.|+.++.++++.....++.+ +.++|.+||+++ T Consensus 149 ~~~f~~v~n~p~l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~--tygaL~nYVm~E 226 (400) T protein:vir:93 149 VFKVFHVTNVGALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQM--SYSELYNLIVAE 226 (400) T ss_pred cccceeeecCCceeeecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccc--cHHHHHHHHHHH Confidence 9999999999887655 5566678888999999999999999999999999999999766555443 358899999999 Q ss_pred HHHHHHH-HHHhhhhcccccCc-----ccccccccccccccccccccccHHHHHHHHHHhhh--hhcCCEEEEcHHHHHH Q lcl|Aclame:pro 348 LPDMVIM-AVNRAIIMGGVTGV-----SETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP--KAADSTLVIHRNDLAA 419 (517) Q Consensus 348 l~~~~~~-~~e~~~l~G~G~~~-----~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~--~~~~a~~vmn~~~~~~ 419 (517) |..++.. +.+++++.|||++. ..+.|.+.++++.++..++.+...++++.+.+-.. ...+..+||+|..|+. T Consensus 227 L~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~ 306 (400) T protein:vir:93 227 LTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKAL 306 (400) T ss_pred HHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHH Confidence 9999996 57999999999764 35677788888888888888888888887655322 2345689999999999 Q ss_pred HHHhhcCCCCEeccCCCCCCccceecCcccee--ccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLV--QSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPIS 497 (517) Q Consensus 420 l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvg 497 (517) |++|||++|+|.|+.+..+-.+.+-||+..++ +..|++...+.. +.|..++..++...++|+|.+|+.+++.|+++| T Consensus 307 L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek~~i~~~~~~t~~sf~~~tNs~~ilvetlv~ 385 (400) T protein:vir:93 307 LDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLV-DQKYHIDMQDLTKVDAFEWKTNSNMILVETLTS 385 (400) T ss_pred HHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeee-ehhhhccccCceeccceeeeeccceEEeeeeec Confidence 99999999999999999999999999987764 566776555433 667777889999999999999999999999999 Q ss_pred ceeecccceEEEEeC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYT 512 (517) Q Consensus 498 g~v~~~~a~~~~~~t 512 (517) |+|+.|++.+|.++. T Consensus 386 Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 386 GHVETYNAGAVITVS 400 (400) T ss_pred cceecccceeeEeeC Confidence 999999999999988 No 69 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=5.5e-41 Score=241.32 Aligned_cols=273 Identities=12% Similarity=0.026 Sum_probs=204.5 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhh Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~ 318 (517) +.....+++.+|+.+.+.|++.++..+++++++++.++++ ..+|.......+.|++||+.+|+++++|+++++.++++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 4555566899999999999999999999999999887653 56788888889999999999999999999999999999 Q ss_pred hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cCcccccccccccccc---cccccccccHHH Q lcl|Aclame:pro 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV--TGVSETQIYPVVGDAW---ATNVTGTTNIQE 393 (517) Q Consensus 319 ~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G--~~~~~~gi~~~~~~~~---~~~~~~~~~~d~ 393 (517) ++++++|++++.++. |+...|++||.++|++++++++|.++++|++ ++.+..|+.+.+.... ..+.......+. T Consensus 81 ~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~ 159 (311) T protein:vir:81 81 QVTQRFSQEVKWADE-SRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDL 159 (311) T ss_pred EEeehhhHHHhhcCc-ccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHH Confidence 999999999986443 3344599999999999999999999999975 4445556655433221 112222223344 Q ss_pred HHHHHHHhh-h-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc------------- Q lcl|Aclame:pro 394 LLEKLSVAT-P-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE------------- 458 (517) Q Consensus 394 l~~~l~~~~-~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~------------- 458 (517) .+..+.... . .+.+++|+|||.+|.+|++|||++|||+|++....+.+.+++|.|.++. ..++. T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~-~~i~~~~~~~~~~~~~~~ 238 (311) T protein:vir:81 160 AVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVS-DTVRGGPEAVTASTGVYR 238 (311) T ss_pred HHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEec-ccccccccccccccchhc Confidence 444333332 2 2345679999999999999999999999999888888899999765543 22221 Q ss_pred -------eeeeecCceEEEeeehee--ehhhh-------hcccchHHHHHhhhhcceeecccceEEEEeCCCCC Q lcl|Aclame:pro 459 -------KTAVSLSGYVTNGSRGME--FEQGT-------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 459 -------~~~~~~~~~~~~~~~~~~--~~~d~-------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a 516 (517) ...++|+.|++..+.++. ..++- .+.+|++.++++.|+|+.|.+|+||++.+- ..-| T Consensus 239 ~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~-a~~~ 311 (311) T protein:vir:81 239 TTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRD-ADES 311 (311) T ss_pred ccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEe-eccC Confidence 124566677766554443 22221 268899999999999999999999998763 1122 No 70 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=3.9e-40 Score=236.64 Aligned_cols=351 Identities=14% Similarity=0.129 Sum_probs=206.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) .....+..+.....++....+.+..+.....++..+.+ .+....+..++.+....... .+ T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~~e~~~~~~-----------~~~~~~~~~~~~~~~~~e~~---------~~ 60 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGASDEEQSKAF-----------GAMFDALSNDLQEEITAEIN---------NR 60 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHH---------HH Confidence 00000001001111100000000000000000000000 00011111111000000000 00 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHHHHh-hhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-eeeeecc Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKAAWT-AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-VVGGDNA 288 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 288 (517) ....... ..........+.++... -.......+++.+|+.+.+.|++.+...+++++++++.++++. .++.... T Consensus 61 ~~~~~~~----~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~~i~~~~~ 136 (395) T protein:vir:95 61 VVDNGIL----AKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKTRVIKADP 136 (395) T ss_pred HHHHHHH----hhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceEEEEecC Confidence 0000000 00000001111111110 1112334568999999999999999999999999999887654 4666677 Q ss_pred cccceeeecccc-cccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC Q lcl|Aclame:pro 289 LTQGTGHTTGTD-KTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG 367 (517) Q Consensus 289 ~~~a~~~~eg~~-~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~ 367 (517) ...+.|+.|+.. +++++++|+++++.++++++++++|++|+.|+.+| |++||.++|+++++++++.+||+|+|++ T Consensus 137 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~----ie~~i~~~la~~ia~~~~~a~i~G~G~~ 212 (395) T protein:vir:95 137 AGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAW----IERFVRTQIQEAISVALESAIINGGGAA 212 (395) T ss_pred CcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhH----HHHHHHHHHHHHHHHHHhhheeeccCCC Confidence 778888877554 46789999999999999999999999999999877 9999999999999999999999999998 Q ss_pred cc-ccccccccccccccc----ccccccHHHH---HHHHHH----h--------hhhhcCCEEEEcHHHHHHHHHhhcCC Q lcl|Aclame:pro 368 VS-ETQIYPVVGDAWATN----VTGTTNIQEL---LEKLSV----A--------TPKAADSTLVIHRNDLAAIRFLKDKN 427 (517) Q Consensus 368 ~~-~~gi~~~~~~~~~~~----~~~~~~~d~l---~~~l~~----~--------~~~~~~a~~vmn~~~~~~l~~lKD~~ 427 (517) .+ +.||++......... .+.....+++ ...+.. . ..+..+..|+|||.++. |.+ T Consensus 213 ~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~------~~~ 286 (395) T protein:vir:95 213 KTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW------DVQ 286 (395) T ss_pred CcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh------hcC Confidence 53 578887643321111 1111111111 111111 1 11223578999999864 668 Q ss_pred CCEeccCCCCCCccceecCccc-eeccccCCce--eeeecCceEEEeeehe--eehhhhhcccchHHHHHhhhhcceeec Q lcl|Aclame:pro 428 GNYVFPVGVSNQTIATHFGFNR-LVQSVAVDEK--TAVSLSGYVTNGSRGM--EFEQGTILVENNKEYLFEMPISGSLEY 502 (517) Q Consensus 428 Gryl~~~~~~~~~~~~l~g~~~-v~~~~~~~~~--~~~~~~~~~~~~~~~~--~~~~d~~~~~n~~~~~~~~rvgg~v~~ 502 (517) |+|+|++. +|.+.+++|++. ++.+..|++. ..++|+.|.++.+.++ ..+++..+.++++.|++..|+||.+.+ T Consensus 287 g~~~~~~~--~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~ 364 (395) T protein:vir:95 287 ARYTYLTA--NGGFVTVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDD 364 (395) T ss_pred CcceeccC--CCcceeccCCcceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCEEec Confidence 99999874 556667776543 4556666654 4567888988877665 455666788899999999999999999 Q ss_pred ccceEEEEeCCCCCC Q lcl|Aclame:pro 503 KGTTAYGTYTPPVAG 517 (517) Q Consensus 503 ~~a~~~~~~tp~~a~ 517 (517) ++||+++++|-+.|+ T Consensus 365 ~~A~~~l~i~~~~~~ 379 (395) T protein:vir:95 365 NKASAVYDLKVASAP 379 (395) T ss_pred cccEEEEEeeccCCC Confidence 999999999855444 No 71 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=2.7e-40 Score=237.49 Aligned_cols=336 Identities=13% Similarity=0.114 Sum_probs=204.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) ..+|.. +++.+.+. +..+.+++.. .++.+.+..+...+ .+..+..+.. ..+ T Consensus 1 m~~kl~-----~~~~~~~~------~~~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~~-------------~~e 51 (381) T protein:vir:10 1 MTINLS-----ETFANAKN------EFINAVNNGE--PQERQNELYGDMIN---QLFEETKLQA-------------KAE 51 (381) T ss_pred CchhHH-----HHHHHHHH------HHHHHHHhhh--HHHHHHHHHHHHHH---hhhhhHHHHH-------------HHH Confidence 111111 00100000 0000000000 00000000000000 0000000000 000 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHH---HHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc-eeeee Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKA---AWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-VVGGD 286 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 286 (517) ..+... ..........+.++ .+.. .....+++++|+.+.+.|++.+...+++++++++.++++. .++.. T Consensus 52 ~~~~~~-----~~~~~~~l~~~e~~~~~~~~~--~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~ 124 (381) T protein:vir:10 52 AERVSS-----LPKSAQTLSANQRNFFMDINK--SVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKS 124 (381) T ss_pred HHHHHH-----hcccccccCHHHHHHHHHHhh--cCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEee Confidence 000000 00000000111111 1111 2234568999999999999999999999999999887653 56666 Q ss_pred cccccceeeeccccc-ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|Aclame:pro 287 NALTQGTGHTTGTDK-TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV 365 (517) Q Consensus 287 ~~~~~a~~~~eg~~~-~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G 365 (517) +..+.+.|+.+++.. .+++++|+++++.++++++++++|++|++|+.+| |++||.++|+++|+++++.+||+||| T Consensus 125 ~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~----le~~i~~~la~~~a~~~~~afi~GdG 200 (381) T protein:vir:10 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTG 200 (381) T ss_pred cCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHH----HHHHHHHHHHHHHHHHhhceeEeccc Confidence 777788898887654 5678999999999999999999999999999887 99999999999999999999999999 Q ss_pred cCcccccccccccccccc--------ccccc---ccHHHHHHHH---HHh---------hhhhcCCEEEEcHHHHHHHHH Q lcl|Aclame:pro 366 TGVSETQIYPVVGDAWAT--------NVTGT---TNIQELLEKL---SVA---------TPKAADSTLVIHRNDLAAIRF 422 (517) Q Consensus 366 ~~~~~~gi~~~~~~~~~~--------~~~~~---~~~d~l~~~l---~~~---------~~~~~~a~~vmn~~~~~~l~~ 422 (517) +++| .||++........ ....+ .....+...+ ... ..+..++.|+|||.++..|++ T Consensus 201 ~~qP-~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~ 279 (381) T protein:vir:10 201 KDQP-IGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA 279 (381) T ss_pred CCCc-eeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcc Confidence 9987 5887643221100 00001 0111111111 111 122346789999999999987 Q ss_pred hh---cCCCCEeccCCCCCCccceecCccceeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhh Q lcl|Aclame:pro 423 LK---DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMP 495 (517) Q Consensus 423 lK---D~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~r 495 (517) ++ |++|+|+|... +|. .++.+..|++ ...++|+.|.++.+.++. .+++..+.++++.|++..| T Consensus 280 ~~~~~~~~G~~v~~lp---------~g~-~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r 349 (381) T protein:vir:10 280 QYTHLNANGVYVTALP---------FNL-NVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQF 349 (381) T ss_pred ccccCCCCCceeecCC---------CCc-eeEEcCCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEE Confidence 55 89999998532 233 2333445554 455678889988887654 5567778999999999999 Q ss_pred hcceeecccceEEEEeC-----CCCCC Q lcl|Aclame:pro 496 ISGSLEYKGTTAYGTYT-----PPVAG 517 (517) Q Consensus 496 vgg~v~~~~a~~~~~~t-----p~~a~ 517 (517) ++|.+.+|+||++.+++ |+|-+ T Consensus 350 ~dG~~~~~~A~~v~~l~~~~~~~~~~~ 376 (381) T protein:vir:10 350 AYGKAKDNKVAAVWKLDLKGHKPALED 376 (381) T ss_pred EcCEEecCCcEEEEEEeecCCcccccc Confidence 99999999999999998 88888 No 72 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2e-39 Score=232.73 Aligned_cols=395 Identities=13% Similarity=0.106 Sum_probs=211.4 Q ss_pred ehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhhhhhhhhhhhHHHHHHH---hhhhhhhH Q lcl|Aclame:pro 112 CILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLD--AKKLAADLNAKLKERENGGDNA---ALKTVSEL 186 (517) Q Consensus 112 ~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e--~~~~~~e~~a~l~~~~~~~~e~---~~~~~~~~ 186 (517) +.|.-+=|-.-=....+.+..+.++..+..++.++....+.+... .....++...+++....+..+. ..+++..+ T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~l 80 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKEL 80 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000011111111111111111111110000000000 0000000000011000011110 00111111 Q ss_pred HHHHHHHhhHHHhhhhhhhhhhhhhhhHHH----HHHHHHHHH-------------hhccchhhHHHHhhhhhccccccc Q lcl|Aclame:pro 187 AANLMKQRESEKILGVEALKVTPEATEFLK----TREAEVAYM-------------SASLTKDPKAAWTAELKERGISGM 249 (517) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~ 249 (517) +.++.+.+...................... ......... ......+.+...... .....+++ T Consensus 81 e~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~ 159 (466) T protein:vir:80 81 ENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQK-RAVSGAEL 159 (466) T ss_pred HHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhh-hhhccccc Confidence 112111111000000000000000000000 000000000 000000111111111 11223357 Q ss_pred ccchhhhhhHHHhHhhhhhhhhceeeeccccc-eeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHH Q lcl|Aclame:pro 250 PAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-VVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIV 328 (517) Q Consensus 250 ~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~l 328 (517) .+|..+.+.|++.+++.+++++++++.++++. .++.......+.|+.||+..|+++++|+++++.++++++++++|+++ T Consensus 160 ~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el 239 (466) T protein:vir:80 160 TIPDVMLELLRDNMHRYSKLISKVRLRPLKGTARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNST 239 (466) T ss_pred cccHHHHHHHHHhhhhhhhhhhheeeeecCceeEeeeecCCcceeecccccccccccccccceeecceeeeeehhhhHHH Confidence 89999999999999999999999998887653 45556666778999999999999999999999999999999999999 Q ss_pred HHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccc-----c--cccHHHH------- Q lcl|Aclame:pro 329 MNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVT-----G--TTNIQEL------- 394 (517) Q Consensus 329 i~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~-----~--~~~~d~l------- 394 (517) +.|+.++ |++||..+|+++++.+++.+||+|+|+++| .||++..+........ . ....+.+ T Consensus 240 l~ds~~~----l~~~i~~~la~~~~~~~~~ail~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (466) T protein:vir:80 240 LEDSDLN----LADEILDAIGQAIGFALDKAILYGTGTKMP-VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTG 314 (466) T ss_pred HhcchHH----HHHHHHHHHHHHHHHHHhhheeeccCCCCc-ceeeecccccccccccccccccccccchhhhhhhhhhc Confidence 9998876 999999999999999999999999999887 4988764322111100 0 0001111 Q ss_pred ----------HHHHHHhhhhh--cCCEEEEcHHHHHHHHHhh---cCCCCEeccCCCCCCccceecCccceeccccCCc- Q lcl|Aclame:pro 395 ----------LEKLSVATPKA--ADSTLVIHRNDLAAIRFLK---DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE- 458 (517) Q Consensus 395 ----------~~~l~~~~~~~--~~a~~vmn~~~~~~l~~lK---D~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~- 458 (517) ...+......+ .+..|+||+.++..|..++ +++|.|++.+..+ .+++|.+ ++.+..+++ T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~----~~i~G~p-vv~s~~~~~~ 389 (466) T protein:vir:80 315 KSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNT----MPIVGGD-IVILDFIPDN 389 (466) T ss_pred cchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCCc----ccccccc-eeecCccCcc Confidence 11111111222 2457999999999999998 7888888865422 2477765 455555554 Q ss_pred -eeeeecCceEEEeeeheee--hhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 459 -KTAVSLSGYVTNGSRGMEF--EQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 459 -~~~~~~~~~~~~~~~~~~~--~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ...++++.|.++.+.++.. .++..+.+|++.|++..|++|.+++|+||++++++-.-.+ T Consensus 390 ~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 390 DIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred ceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 4556788898888777653 4455678899999999999999999999999987554333 No 73 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=6e-41 Score=241.08 Aligned_cols=273 Identities=10% Similarity=0.005 Sum_probs=201.3 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc--cceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +.... ...++.+|+.+...|++.++..+++++++++.+++ ...+|.......+.|+.||+.+|+++++|++++++ T Consensus 1 ma~~t---~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQ---LSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIV 77 (300) T ss_pred Ccccc---cCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEee Confidence 11111 22366789999999999999999999998887765 35677777778899999999999999999999999 Q ss_pred HhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc----cCcccccccccccccc-cccccccc Q lcl|Aclame:pro 315 PQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV----TGVSETQIYPVVGDAW-ATNVTGTT 389 (517) Q Consensus 315 ~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G----~~~~~~gi~~~~~~~~-~~~~~~~~ 389 (517) ++++++++++|++++..+. |+...|++||.++|++++++++|.++|+|++ ++....+.....+... ......+. T Consensus 78 ~~k~~~~~~iS~ell~~~~-d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASE-EAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTN 156 (300) T ss_pred eEEEEEeehhhHHHhccCC-CCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccc Confidence 9999999999999985332 3445699999999999999999999999954 3333334333333222 22223334 Q ss_pred cHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCce--------e Q lcl|Aclame:pro 390 NIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEK--------T 460 (517) Q Consensus 390 ~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~--------~ 460 (517) ..+++..++.... .++.+++|+|||+++.+|++|||++|||||++....+.+.+++|.+.++ +..++.. . T Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~-s~~v~~~~~~~~~~~~ 235 (300) T protein:vir:95 157 PDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDK-NRTVSYSQTDPKNTAI 235 (300) T ss_pred hHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEE-ecCCCCCCCCCccEEE Confidence 4556666655443 3456689999999999999999999999999888888888999976544 3333321 1 Q ss_pred eeecCceE-EEeeehe--eehhh--h------hcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 461 AVSLSGYV-TNGSRGM--EFEQG--T------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 461 ~~~~~~~~-~~~~~~~--~~~~d--~------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .++|+.++ ++.+.++ +..+. . .+.+|++.++++.|+|+.|++|+||++.+ .+|| T Consensus 236 ~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~---~~~g 300 (300) T protein:vir:95 236 VGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIV---KTGG 300 (300) T ss_pred EeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEe---cCCC Confidence 23444433 4444443 22221 1 26899999999999999999999999987 5666 No 74 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.3e-40 Score=239.30 Aligned_cols=296 Identities=9% Similarity=0.018 Sum_probs=211.0 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeee Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGD 286 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 286 (517) .+.++..+.... .+..........++ .........++.+|..+.+.|++.++..+++++++++.+.++ ..+|.. T Consensus 1 ~~~~~~~~~~~~--~f~~~~~~~~~~~a--~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~ 76 (324) T protein:vir:97 1 MEQTQKLKLNLQ--HFASNNVKPQVFNP--DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW 76 (324) T ss_pred CccchhHHHHHH--HHHHhhhhhhhhcc--ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEE Confidence 000000000000 00000000000000 111122334678999999999999999999999999888764 467787 Q ss_pred cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|Aclame:pro 287 NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVT 366 (517) Q Consensus 287 ~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~ 366 (517) .....+.|++||+.+|+++++|+.+++.++++++++++|+++++|+.++ |++||.++|++++++++|+++|+|+|+ T Consensus 77 ~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~----l~~~i~~~l~~aia~~~d~a~l~G~g~ 152 (324) T protein:vir:97 77 ADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) T ss_pred ecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 7888899999999999999999999999999999999999999988765 899999999999999999999999998 Q ss_pred CcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceec Q lcl|Aclame:pro 367 GVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHF 445 (517) Q Consensus 367 ~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~ 445 (517) +....++.+....... ...+..+.+++.+++..... ++.+++|+|||.+|..|+++||++|||+|++. ...+++ T Consensus 153 ~~~~~gi~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~----~~~tl~ 227 (324) T protein:vir:97 153 NPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR----NSDTLD 227 (324) T ss_pred CccCccccccccccce-eccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCC----CCcccc Confidence 8776666665443332 23344556777777665443 45678999999999999999999999999743 345688 Q ss_pred Cccceecc-ccCCc--eeeeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecccce Q lcl|Aclame:pro 446 GFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYKGTT 506 (517) Q Consensus 446 g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~ 506 (517) |.+.+... ..++. ...++++.++++.+.++. ..++. .+.+|++.|+++.|+++.+.+|+|| T Consensus 228 G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~ 307 (324) T protein:vir:97 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) T ss_pred ceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Confidence 87655432 22333 334567777766655543 33221 2578999999999999999999999 Q ss_pred EEEEeCCC----CCC Q lcl|Aclame:pro 507 AYGTYTPP----VAG 517 (517) Q Consensus 507 ~~~~~tp~----~a~ 517 (517) ++++..-| ++| T Consensus 308 ~~l~~~~~~~~~~~~ 322 (324) T protein:vir:97 308 AKLVPADKKTDSVPG 322 (324) T ss_pred EEEEeccCCCCCCCC Confidence 99887544 333 No 75 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=5.6e-41 Score=241.24 Aligned_cols=273 Identities=11% Similarity=0.004 Sum_probs=205.3 Q ss_pred cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccc Q lcl|Aclame:pro 229 LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNI 306 (517) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~ 306 (517) ++.+.. .+.+......+++.+|+.+.+.|++.++..+++++++++.++++ ..+|+......+.|++|++.+|++++ T Consensus 1 ma~~~~--~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPTY--TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred Cccccc--ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccc Confidence 111111 11122233445788999999999999999999999999888754 45777778888999999999999999 Q ss_pred cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccc----ccccccccccc Q lcl|Aclame:pro 307 TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSET----QIYPVVGDAWA 382 (517) Q Consensus 307 ~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~----gi~~~~~~~~~ 382 (517) +|+++++.++++++++++|++++.|+.++ |++||.++|++++++++|.++|+|+|++.+.. +++..++.... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 154 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKD----FFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN 154 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc Confidence 99999999999999999999999998766 99999999999999999999999999876542 22222222222 Q ss_pred ccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc--- Q lcl|Aclame:pro 383 TNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--- 458 (517) Q Consensus 383 ~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--- 458 (517) .........+++.+++..... +..+++|+|||++|.+|+++||++|||+|+++. .+++|.+.++ +..++. T Consensus 155 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~-----~~l~G~PV~~-~~~~~~~~~ 228 (304) T protein:vir:94 155 VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANG-----NEIMGLPLSY-TGADVYDKK 228 (304) T ss_pred ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCC-----ccccceeeEE-ecccccCCC Confidence 222334456777777666543 456789999999999999999999999997643 5688876543 333331 Q ss_pred ---eeeeecCceEEEeeehee--ehhhh----------------hcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 459 ---KTAVSLSGYVTNGSRGME--FEQGT----------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 459 ---~~~~~~~~~~~~~~~~~~--~~~d~----------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) ...++++.++++.+.++. ..++. .+.+|++.+|++.|+|+.+.+|+||++.+.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234566666666544432 22221 26889999999999999999999999999888 No 76 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=5.6e-41 Score=241.24 Aligned_cols=273 Identities=11% Similarity=0.004 Sum_probs=205.3 Q ss_pred cchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccc Q lcl|Aclame:pro 229 LTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNI 306 (517) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~ 306 (517) ++.+.. .+.+......+++.+|+.+.+.|++.++..+++++++++.++++ ..+|+......+.|++|++.+|++++ T Consensus 1 ma~~~~--~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPTY--TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred Cccccc--ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCcccccccc Confidence 111111 11122233445788999999999999999999999999888754 45777778888999999999999999 Q ss_pred cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccc----ccccccccccc Q lcl|Aclame:pro 307 TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSET----QIYPVVGDAWA 382 (517) Q Consensus 307 ~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~----gi~~~~~~~~~ 382 (517) +|+++++.++++++++++|++++.|+.++ |++||.++|++++++++|.++|+|+|++.+.. +++..++.... T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~ 154 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKD----FFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGN 154 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccc Confidence 99999999999999999999999998766 99999999999999999999999999876542 22222222222 Q ss_pred ccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc--- Q lcl|Aclame:pro 383 TNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--- 458 (517) Q Consensus 383 ~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--- 458 (517) .........+++.+++..... +..+++|+|||++|.+|+++||++|||+|+++. .+++|.+.++ +..++. T Consensus 155 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~-----~~l~G~PV~~-~~~~~~~~~ 228 (304) T protein:vir:10 155 VVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANG-----NEIMGLPLSY-TGADVYDKK 228 (304) T ss_pred ccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCC-----ccccceeeEE-ecccccCCC Confidence 222334456777777666543 456789999999999999999999999997643 5688876543 333331 Q ss_pred ---eeeeecCceEEEeeehee--ehhhh----------------hcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 459 ---KTAVSLSGYVTNGSRGME--FEQGT----------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 459 ---~~~~~~~~~~~~~~~~~~--~~~d~----------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) ...++++.++++.+.++. ..++. .+.+|++.+|++.|+|+.+.+|+||++.+.+- T Consensus 229 ~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 229 KSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 234566666666544432 22221 26889999999999999999999999999888 No 77 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.2e-40 Score=239.51 Aligned_cols=296 Identities=8% Similarity=0.020 Sum_probs=209.7 Q ss_pred HHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhh Q lcl|Aclame:pro 191 MKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLL 270 (517) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~ 270 (517) ++ +..+.......+.......+..++ .........++.+|..+...|++.+++.++++ T Consensus 1 ~~--------------------~~~~~~~~~~~~~~~~~~~~~~~a--~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~ 58 (324) T protein:vir:96 1 ME--------------------QTQKLKLNLQHFASNNVKPQVFNP--DNVMMHEKKDGTLMNEFTTPILQEVMENSKIM 58 (324) T ss_pred CC--------------------cchhhhHHHHHHHHHhhhhhhhcc--ccccccCcCccccchhHHHHHHHHHHhhchhh Confidence 00 000000000000000000011111 11122234567899999999999999999999 Q ss_pred hceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHH Q lcl|Aclame:pro 271 PFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRL 348 (517) Q Consensus 271 ~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l 348 (517) +++++.++++ ..+|+......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++| T Consensus 59 ~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~----l~~~i~~~l 134 (324) T protein:vir:96 59 QLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMI 134 (324) T ss_pred hhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHH Confidence 9999888764 4678888888899999999999999999999999999999999999999988765 999999999 Q ss_pred HHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCC Q lcl|Aclame:pro 349 PDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKN 427 (517) Q Consensus 349 ~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~ 427 (517) ++++++++|.++|+|+|++....++.......... ..++.+.+++.+++.... .++.+++|+|||++|.+|+++||++ T Consensus 135 a~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~-~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~ 213 (324) T protein:vir:96 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV-IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE 213 (324) T ss_pred HHHHHHHHHHHHhccCCCCCcCcccccccccccee-ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccC Confidence 99999999999999999887666666554433322 334456777887776544 3456789999999999999999999 Q ss_pred CCEeccCCCCCCccceecCccceecc-ccCCc--eeeeecCceEEEeeehee--ehhh--------------hhcccchH Q lcl|Aclame:pro 428 GNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQG--------------TILVENNK 488 (517) Q Consensus 428 Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d--------------~~~~~n~~ 488 (517) |||+|+. +...+++|.+.+... ..+++ ...++++.++++.+.++. ..++ ..+.+|++ T Consensus 214 G~~~~~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~ 289 (324) T protein:vir:96 214 TKERIYD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred CCeeecC----CCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcE Confidence 9999964 334578887655432 22333 334566677666554433 2222 12678999 Q ss_pred HHHHhhhhcceeecccceEEEEe----CCCCCC Q lcl|Aclame:pro 489 EYLFEMPISGSLEYKGTTAYGTY----TPPVAG 517 (517) Q Consensus 489 ~~~~~~rvgg~v~~~~a~~~~~~----tp~~a~ 517 (517) .|+++.|+++.+.+|+||++.+- +.+++| T Consensus 290 ~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:96 290 ALRATMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEEEEEEccEEecccceEEEecccccCCCCCC Confidence 99999999999999999987553 122444 No 78 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.2e-40 Score=239.51 Aligned_cols=296 Identities=8% Similarity=0.020 Sum_probs=209.7 Q ss_pred HHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhh Q lcl|Aclame:pro 191 MKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLL 270 (517) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~ 270 (517) ++ +..+.......+.......+..++ .........++.+|..+...|++.+++.++++ T Consensus 1 ~~--------------------~~~~~~~~~~~~~~~~~~~~~~~a--~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~ 58 (324) T protein:vir:78 1 ME--------------------QTQKLKLNLQHFASNNVKPQVFNP--DNVMMHEKKDGTLMNEFTTPILQEVMENSKIM 58 (324) T ss_pred CC--------------------cchhhhHHHHHHHHHhhhhhhhcc--ccccccCcCccccchhHHHHHHHHHHhhchhh Confidence 00 000000000000000000011111 11122234567899999999999999999999 Q ss_pred hceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHH Q lcl|Aclame:pro 271 PFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRL 348 (517) Q Consensus 271 ~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l 348 (517) +++++.++++ ..+|+......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++| T Consensus 59 ~l~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~----l~~~i~~~l 134 (324) T protein:vir:78 59 QLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMI 134 (324) T ss_pred hhcceeeccCCceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHH Confidence 9999888764 4678888888899999999999999999999999999999999999999988765 999999999 Q ss_pred HHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCC Q lcl|Aclame:pro 349 PDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKN 427 (517) Q Consensus 349 ~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~ 427 (517) ++++++++|.++|+|+|++....++.......... ..++.+.+++.+++.... .++.+++|+|||++|.+|+++||++ T Consensus 135 a~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~-~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~ 213 (324) T protein:vir:78 135 AEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV-IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE 213 (324) T ss_pred HHHHHHHHHHHHhccCCCCCcCcccccccccccee-ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccC Confidence 99999999999999999887666666554433322 334456777887776544 3456789999999999999999999 Q ss_pred CCEeccCCCCCCccceecCccceecc-ccCCc--eeeeecCceEEEeeehee--ehhh--------------hhcccchH Q lcl|Aclame:pro 428 GNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQG--------------TILVENNK 488 (517) Q Consensus 428 Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d--------------~~~~~n~~ 488 (517) |||+|+. +...+++|.+.+... ..+++ ...++++.++++.+.++. ..++ ..+.+|++ T Consensus 214 G~~~~~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~ 289 (324) T protein:vir:78 214 TKERIYD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred CCeeecC----CCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcE Confidence 9999964 334578887655432 22333 334566677666554433 2222 12678999 Q ss_pred HHHHhhhhcceeecccceEEEEe----CCCCCC Q lcl|Aclame:pro 489 EYLFEMPISGSLEYKGTTAYGTY----TPPVAG 517 (517) Q Consensus 489 ~~~~~~rvgg~v~~~~a~~~~~~----tp~~a~ 517 (517) .|+++.|+++.+.+|+||++.+- +.+++| T Consensus 290 ~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:78 290 ALRATMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEEEEEEccEEecccceEEEecccccCCCCCC Confidence 99999999999999999987553 122444 No 79 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.6e-40 Score=237.61 Aligned_cols=281 Identities=12% Similarity=0.043 Sum_probs=206.2 Q ss_pred hccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccc Q lcl|Aclame:pro 227 ASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTES 304 (517) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~ 304 (517) -+...+.+..... ..... +..+|+.+...|++.++..+++++++++.++++ ..+|+......+.|+.||+.+|++ T Consensus 1 ~g~~~e~~~~~~~--~t~~~-~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFSADHSQIAQT--KDTMF-TGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPIT 77 (397) T ss_pred CCcCHHHHHHhhc--cCCCC-ccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcccccc Confidence 1111121111111 11112 334566788999999999999999998887754 567888888889999999999999 Q ss_pred cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc Q lcl|Aclame:pro 305 NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN 384 (517) Q Consensus 305 ~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~ 384 (517) +++|+++++.++++++++++|+++++|+.++ |++||.++|++++++++|.++|+|+|++++..++......... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~----l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~-- 151 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPAN----YLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQS-- 151 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceee-- Confidence 9999999999999999999999999988765 9999999999999999999999999998877665554433222 Q ss_pred ccccccHHHHHHHHHHh-hhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCcc-----ceecCccceeccccCCc Q lcl|Aclame:pro 385 VTGTTNIQELLEKLSVA-TPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTI-----ATHFGFNRLVQSVAVDE 458 (517) Q Consensus 385 ~~~~~~~d~l~~~l~~~-~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~-----~~l~g~~~v~~~~~~~~ 458 (517) .......+++++++... ..++.+++|+||+++|..|+++||++|||||++....+.. .+++|.+. +.+..+++ T Consensus 152 ~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv-~~s~~~~~ 230 (397) T protein:vir:23 152 ISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPT-ILSDHVAE 230 (397) T ss_pred ecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeE-EEeCCCCC Confidence 23334445556555443 3455678999999999999999999999999987766544 36777654 44445553 Q ss_pred e--e--eeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecccceEEEEeCCC---- Q lcl|Aclame:pro 459 K--T--AVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP---- 514 (517) Q Consensus 459 ~--~--~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~---- 514 (517) . . .++++.++++.+.++. ..++. .+.+|++.|+++.|+++.+++|+||++...++. T Consensus 231 g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 231 GDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred CceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 2 2 2356666665554432 32221 167899999999999999999999998887444 Q ss_pred -------CCC Q lcl|Aclame:pro 515 -------VAG 517 (517) Q Consensus 515 -------~a~ 517 (517) .+| T Consensus 311 ~~~~~~~~~~ 320 (397) T protein:vir:23 311 ALDLDGASAG 320 (397) T ss_pred eecccccCcc Confidence 333 No 80 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.5e-40 Score=238.87 Aligned_cols=330 Identities=13% Similarity=0.090 Sum_probs=203.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHH Q lcl|Aclame:pro 141 ENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREA 220 (517) Q Consensus 141 ~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (517) +.... .....+.... .........+.... ....+....... .. ....+... T Consensus 1 ~a~~~------------a~~~~~~~~~----~~~~~~~~~~~~kg--~~~~~~~~a~a~---~~---g~~~~a~~----- 51 (366) T protein:vir:57 1 MAAAV------------AVPVKAHSVA----PGIIIKEELQQYKG--AGMTRMVMSIAA---GK---GNLADAAK----- 51 (366) T ss_pred Ccccc------------cccccccccc----cccccccccccccc--hhHHHHHHHHHh---cc---cchhHHHH----- Confidence 00000 0000000000 00000000000000 000000000000 00 00000000 Q ss_pred HHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhc-eeeeccc--cceeeeecccccceeeec Q lcl|Aclame:pro 221 EVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPF-IRHENLP--TLVVGGDNALTQGTGHTT 297 (517) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~a~~~~e 297 (517) ....... ......+.. ...+.+++++|+.+...|++.++..++++++ +++.+.. ...+|..+....+.|+.| T Consensus 52 -~a~~~~~-~~~~~~a~~---~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~~~~a~wv~E 126 (366) T protein:vir:57 52 -FAATELG-DTGLSMAIS---TAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSGGATAGYVGE 126 (366) T ss_pred -HHHHhhc-chhhhhhcc---ccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeCCcceeeecc Confidence 0000000 000011111 1223467889999999999999999999887 6665543 356788888888999999 Q ss_pred ccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccc Q lcl|Aclame:pro 298 GTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVV 377 (517) Q Consensus 298 g~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~ 377 (517) |+.+|+++++|+++++.++++++++++|++++.|+.++ +++||.++|++++++++|.+||+|+|++.++.||++.. T Consensus 127 ~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~ 202 (366) T protein:vir:57 127 GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFN----VEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVA 202 (366) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHH----HHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecc Confidence 99999999999999999999999999999999988765 99999999999999999999999999987788998766 Q ss_pred cccccccc-c----ccccHHHHHHHHHHh----hhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCcc Q lcl|Aclame:pro 378 GDAWATNV-T----GTTNIQELLEKLSVA----TPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFN 448 (517) Q Consensus 378 ~~~~~~~~-~----~~~~~d~l~~~l~~~----~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~ 448 (517) +....... + ....++.+++.+... ..+..++.|+|||.+|.+|++|||++|||+|++. ...+++|++ T Consensus 203 ~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~----~~g~l~G~P 278 (366) T protein:vir:57 203 TAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPEM----SQGILKGYP 278 (366) T ss_pred ccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCceeccCC----CCCeeccee Confidence 54322111 1 112233333333222 2344578999999999999999999999999643 224688865 Q ss_pred ceeccccCCc----------eeeeecCceEEEeeeheee--hhh-----------hhcccchHHHHHhhhhcceeecccc Q lcl|Aclame:pro 449 RLVQSVAVDE----------KTAVSLSGYVTNGSRGMEF--EQG-----------TILVENNKEYLFEMPISGSLEYKGT 505 (517) Q Consensus 449 ~v~~~~~~~~----------~~~~~~~~~~~~~~~~~~~--~~d-----------~~~~~n~~~~~~~~rvgg~v~~~~a 505 (517) . +++..+++ ...++++.|.+..+.++.. .++ ..+++|++.++++.|+++.|++|+| T Consensus 279 v-v~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a 357 (366) T protein:vir:57 279 I-QRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEG 357 (366) T ss_pred e-EEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeecccc Confidence 4 44444443 2345677777666555432 222 1257889999999999999999999 Q ss_pred eEEEEeCCCCCC Q lcl|Aclame:pro 506 TAYGTYTPPVAG 517 (517) Q Consensus 506 ~~~~~~tp~~a~ 517 (517) |++++ .+| T Consensus 358 ~~~lt----~~~ 365 (366) T protein:vir:57 358 LVLGT----GVI 365 (366) T ss_pred EEEEe----ccc Confidence 99988 334 No 81 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2e-39 Score=232.74 Aligned_cols=336 Identities=12% Similarity=0.079 Sum_probs=208.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhh---hhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKK---LAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKV 207 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~---~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (517) ..+ .++++++..+...+..+..+ ..++..+.+++. ...+..++..+...+........ . T Consensus 1 M~i------~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~-----------~~~~~~~~~~~~~~e~~~~~~~~-~ 62 (377) T protein:vir:96 1 MAI------NLKELPKYREAVAELSAKISAGATPEEQEKLFEAA-----------FTTMGDEILAKNEEEMERMFDLR-D 62 (377) T ss_pred CCc------cHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHhc-c Confidence 111 11111111111111111110 000111111100 01111111111000000000000 0 Q ss_pred hhhhhhHHHHHHHHHHHHhhccchhhHHHHhh--hhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-ceee Q lcl|Aclame:pro 208 TPEATEFLKTREAEVAYMSASLTKDPKAAWTA--ELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-LVVG 284 (517) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~ 284 (517) .. .....+.++.... .......+++.+|+.+.++|++.+...+++++++++.++++ ..++ T Consensus 63 ~~-----------------~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~ 125 (377) T protein:vir:96 63 KN-----------------RELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRLKAL 125 (377) T ss_pred CC-----------------cccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCceEEE Confidence 00 0000011111111 11223456899999999999999999999999999988765 3577 Q ss_pred eecccccceeeeccccc-ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 285 GDNALTQGTGHTTGTDK-TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG 363 (517) Q Consensus 285 ~~~~~~~a~~~~eg~~~-~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G 363 (517) .......+.|++|+++. ++++++|+++++.++++++++++|+++++|+.+| |++||.++|+++|+++++.+||+| T Consensus 126 ~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~----le~~i~~~l~~~~~~~~~~a~i~G 201 (377) T protein:vir:96 126 TAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKW----LKQFITEQLKEAIAVALELAIVKG 201 (377) T ss_pred EecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhh----HHHHHHHHHHHHHHHHHhhceEec Confidence 77788889999998765 4678999999999999999999999999999887 999999999999999999999999 Q ss_pred cccCccccccccccccccccc-----------------ccccccHHHHHHHHHHhhhh------------hcCCEEEEcH Q lcl|Aclame:pro 364 GVTGVSETQIYPVVGDAWATN-----------------VTGTTNIQELLEKLSVATPK------------AADSTLVIHR 414 (517) Q Consensus 364 ~G~~~~~~gi~~~~~~~~~~~-----------------~~~~~~~d~l~~~l~~~~~~------------~~~a~~vmn~ 414 (517) +|+++| .||++..+...... .....+.+.+++.+...... ..+++|+||| T Consensus 202 ~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~ 280 (377) T protein:vir:96 202 NGLLQP-VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNP 280 (377) T ss_pred cCCCcc-eeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEch Confidence 999976 59887543211100 00112234444443332211 1356899999 Q ss_pred HHHHHHHHhhcCCCCEeccCCCCCCccceecCccc-eeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHH Q lcl|Aclame:pro 415 NDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNR-LVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKE 489 (517) Q Consensus 415 ~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~-v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~ 489 (517) .|+..+ .|+|+|++. +|.+.+++|++. ++.+..|++ ...++|+.|.++.+.+++ .+++..+.++++. T Consensus 281 ~t~~~~------~~~~~~~~~--~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:96 281 EDRWTL------EAKFTSRNQ--FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAMEDLQL 352 (377) T ss_pred hhHHhc------cccccccCC--CCCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhcCCeE Confidence 997754 578888763 456667787654 344445554 455678899998887654 5566778899999 Q ss_pred HHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 490 YLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 490 ~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) |++..|++|.+.+++||++.+++=- T Consensus 353 f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEEEEcCEEecCCcEEEEEEecC Confidence 9999999999999999999997655 No 82 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=3.1e-40 Score=237.20 Aligned_cols=284 Identities=13% Similarity=0.034 Sum_probs=202.5 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKT 302 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~ 302 (517) ++.+........... .......+..+|+.+...|++.++..+++++++++.++++ ..+|+.+....+.|+.||+.+| T Consensus 1 ~~~~~~~~~e~~~~~-~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 79 (318) T protein:vir:24 1 MAAGTAFAVDHAQIA-QTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKP 79 (318) T ss_pred CCCCCCCCHHHHHhh-cccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCcccc Confidence 111111111111111 1112233556899999999999999999999999888754 4678888888999999999999 Q ss_pred cccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccc Q lcl|Aclame:pro 303 ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWA 382 (517) Q Consensus 303 ~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~ 382 (517) +++++|+++++.++++++++++|++++.|+..+ +++||.++|++++++++|.++|+|+|++.+ .++......... T Consensus 80 ~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~----~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~-~~~~~~~~~~~~ 154 (318) T protein:vir:24 80 ITKGNMTSQTIAPHKIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDGAAMHGTDSPFP-TYIGQTTKAISI 154 (318) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHhhcChHH----HHHHHHHHHHHHHHHHHHHhhhcccCCCCC-cccccccccccc Confidence 999999999999999999999999999988765 999999999999999999999999998766 456554433222 Q ss_pred cc-ccccccH-HHHHHHHHHh-hhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccce-----ecCccceeccc Q lcl|Aclame:pro 383 TN-VTGTTNI-QELLEKLSVA-TPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIAT-----HFGFNRLVQSV 454 (517) Q Consensus 383 ~~-~~~~~~~-d~l~~~l~~~-~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~-----l~g~~~v~~~~ 454 (517) .. ....... +++...+... ..++.+++|||||++|..|++|||++|||||+++...+...+ ++|.+. +.++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv-~~~~ 233 (318) T protein:vir:24 155 ADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPT-ILSD 233 (318) T ss_pred cccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEee-EEeC Confidence 21 1222222 3333333332 345567899999999999999999999999998877766554 444333 3333 Q ss_pred cCCce----eeeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 455 AVDEK----TAVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 455 ~~~~~----~~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) .+++. ..++++.++++.+.++. ..++. .+++|++.+++..|+++.|.+|+||+..+ +. T Consensus 234 ~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~--~~ 311 (318) T protein:vir:24 234 HVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALT--NV 311 (318) T ss_pred CCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEE--ee Confidence 44432 23456666665544432 22221 16789999999999999999999998854 55 Q ss_pred CCC Q lcl|Aclame:pro 515 VAG 517 (517) Q Consensus 515 ~a~ 517 (517) .|| T Consensus 312 ~a~ 314 (318) T protein:vir:24 312 VSG 314 (318) T ss_pred ccC Confidence 555 No 83 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=1.6e-39 Score=233.21 Aligned_cols=338 Identities=13% Similarity=0.121 Sum_probs=203.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) ..++... .+. +...+....+++..+ .+.. .+...+....+..+....... + T Consensus 1 m~ik~~~-----~~~------~~~~e~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~~~~~-------------e 51 (381) T protein:vir:10 1 MTINLSE-----TFA------NAKNEFINAVNNGEP--QERQ---NELYGDMINQLFEETKLQAKA-------------E 51 (381) T ss_pred CchhhHH-----HHH------HHHHHHHHHHhhhhh--hHHH---HHHHHHHHHhhhhhHHHHHHH-------------H Confidence 1111110 000 000000000000000 0000 000000000000000000000 0 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHHHHhh-hhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-ceeeeecc Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKAAWTA-ELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-LVVGGDNA 288 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 288 (517) .++..... ........+.++.... .......+++.+|+.+.+.|++.+...+++++++++.++++ ..+++... T Consensus 52 ~~~~~~~~-----~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~ 126 (381) T protein:vir:10 52 AERVSSLP-----KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSET 126 (381) T ss_pred HHHHHHhc-----cCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecC Confidence 00000000 0000000111111110 11223356899999999999999999999999999988765 35677777 Q ss_pred cccceeeecccccc-cccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC Q lcl|Aclame:pro 289 LTQGTGHTTGTDKT-ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG 367 (517) Q Consensus 289 ~~~a~~~~eg~~~~-~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~ 367 (517) ...+.|+.|++..+ +++++|+++++.++++++++++|++|++|+.+| |++||.++|+++++.+++.+||+|+|++ T Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~----ie~~i~~~la~~~a~~~~~a~i~G~G~~ 202 (381) T protein:vir:10 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD 202 (381) T ss_pred CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHH----HHHHHHHHHHHHHHHHhhheeEeccCCC Confidence 78899999987654 568999999999999999999999999998876 9999999999999999999999999998 Q ss_pred cccccccccccccccc--------ccccc-------ccHHHHHHHHHHhh--------hhhcCCEEEEcHHHHHHHHHhh Q lcl|Aclame:pro 368 VSETQIYPVVGDAWAT--------NVTGT-------TNIQELLEKLSVAT--------PKAADSTLVIHRNDLAAIRFLK 424 (517) Q Consensus 368 ~~~~gi~~~~~~~~~~--------~~~~~-------~~~d~l~~~l~~~~--------~~~~~a~~vmn~~~~~~l~~lK 424 (517) +| .||++..+..... ....+ ...+.+...+.... .+..++.|+|||.++..|++++ T Consensus 203 qP-~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~ 281 (381) T protein:vir:10 203 QP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred Cc-eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc Confidence 87 5888654321110 01111 11122222222221 2334678999999999998776 Q ss_pred ---cCCCCEeccCCCCCCccceecCccceeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 425 ---DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMPIS 497 (517) Q Consensus 425 ---D~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~rvg 497 (517) |.+|+|+|..+ +|. .++.+..|++ ...++|+.|.++.+.++. .++...+.++++.|++..|++ T Consensus 282 ~~~~~~G~~v~~l~---------~g~-~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d 351 (381) T protein:vir:10 282 THLNANGVYVTALP---------FNL-NVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAY 351 (381) T ss_pred ccCCCCCceeecCC---------CCc-eEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEc Confidence 77899998532 222 2344445554 555678889998887654 456677899999999999999 Q ss_pred ceeecccceEEEEeCCCC-----CC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPPV-----AG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~~-----a~ 517 (517) |.+++++||++++++-.. -+ T Consensus 352 g~~~~~~A~~v~~l~~~~~~~~~~~ 376 (381) T protein:vir:10 352 GKAKDNKVAAVWKLDLKGHKPALEG 376 (381) T ss_pred CEEecCceEEEEEEEecCCCcCccc Confidence 999999999998875442 22 No 84 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=1.6e-39 Score=233.21 Aligned_cols=338 Identities=13% Similarity=0.121 Sum_probs=203.7 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) ..++... .+. +...+....+++..+ .+.. .+...+....+..+....... + T Consensus 1 m~ik~~~-----~~~------~~~~e~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~~~~~-------------e 51 (381) T protein:vir:95 1 MTINLSE-----TFA------NAKNEFINAVNNGEP--QERQ---NELYGDMINQLFEETKLQAKA-------------E 51 (381) T ss_pred CchhhHH-----HHH------HHHHHHHHHHhhhhh--hHHH---HHHHHHHHHhhhhhHHHHHHH-------------H Confidence 1111110 000 000000000000000 0000 000000000000000000000 0 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHHHHhh-hhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-ceeeeecc Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKAAWTA-ELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-LVVGGDNA 288 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 288 (517) .++..... ........+.++.... .......+++.+|+.+.+.|++.+...+++++++++.++++ ..+++... T Consensus 52 ~~~~~~~~-----~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~~i~~~~~ 126 (381) T protein:vir:95 52 AERVSSLP-----KSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSET 126 (381) T ss_pred HHHHHHhc-----cCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcceEEEEecC Confidence 00000000 0000000111111110 11223356899999999999999999999999999988765 35677777 Q ss_pred cccceeeecccccc-cccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC Q lcl|Aclame:pro 289 LTQGTGHTTGTDKT-ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG 367 (517) Q Consensus 289 ~~~a~~~~eg~~~~-~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~ 367 (517) ...+.|+.|++..+ +++++|+++++.++++++++++|++|++|+.+| |++||.++|+++++.+++.+||+|+|++ T Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~----ie~~i~~~la~~~a~~~~~a~i~G~G~~ 202 (381) T protein:vir:95 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAW----IERFVRVQIEEAFAVALETAFLKGTGKD 202 (381) T ss_pred CcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHH----HHHHHHHHHHHHHHHHhhheeEeccCCC Confidence 78899999987654 568999999999999999999999999998876 9999999999999999999999999998 Q ss_pred cccccccccccccccc--------ccccc-------ccHHHHHHHHHHhh--------hhhcCCEEEEcHHHHHHHHHhh Q lcl|Aclame:pro 368 VSETQIYPVVGDAWAT--------NVTGT-------TNIQELLEKLSVAT--------PKAADSTLVIHRNDLAAIRFLK 424 (517) Q Consensus 368 ~~~~gi~~~~~~~~~~--------~~~~~-------~~~d~l~~~l~~~~--------~~~~~a~~vmn~~~~~~l~~lK 424 (517) +| .||++..+..... ....+ ...+.+...+.... .+..++.|+|||.++..|++++ T Consensus 203 qP-~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~ 281 (381) T protein:vir:95 203 QP-IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred Cc-eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc Confidence 87 5888654321110 01111 11122222222221 2334678999999999998776 Q ss_pred ---cCCCCEeccCCCCCCccceecCccceeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhhhc Q lcl|Aclame:pro 425 ---DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMPIS 497 (517) Q Consensus 425 ---D~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~rvg 497 (517) |.+|+|+|..+ +|. .++.+..|++ ...++|+.|.++.+.++. .++...+.++++.|++..|++ T Consensus 282 ~~~~~~G~~v~~l~---------~g~-~vv~s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~d 351 (381) T protein:vir:95 282 THLNANGVYVTALP---------FNL-NVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAY 351 (381) T ss_pred ccCCCCCceeecCC---------CCc-eEEecCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEc Confidence 77899998532 222 2344445554 555678889998887654 456677899999999999999 Q ss_pred ceeecccceEEEEeCCCC-----CC Q lcl|Aclame:pro 498 GSLEYKGTTAYGTYTPPV-----AG 517 (517) Q Consensus 498 g~v~~~~a~~~~~~tp~~-----a~ 517 (517) |.+++++||++++++-.. -+ T Consensus 352 g~~~~~~A~~v~~l~~~~~~~~~~~ 376 (381) T protein:vir:95 352 GKAKDNKVAAVWKLDLKGHKPALEG 376 (381) T ss_pred CEEecCceEEEEEEEecCCCcCccc Confidence 999999999998875442 22 No 85 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=3.5e-40 Score=236.87 Aligned_cols=272 Identities=9% Similarity=-0.029 Sum_probs=200.8 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhh Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~ 318 (517) +.....+++.+|+.+...|++.++..+++++++++.++++ ..+|+......+.|++||+++|+++++|++++++++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl 80 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKV 80 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEE Confidence 3344455789999999999999999999999999888764 56788888889999999999999999999999999999 Q ss_pred hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccc----cccc--cccccccccccccccHH Q lcl|Aclame:pro 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSET----QIYP--VVGDAWATNVTGTTNIQ 392 (517) Q Consensus 319 ~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~----gi~~--~~~~~~~~~~~~~~~~d 392 (517) ++++++|++++..+. |+...|.+||.++|++++++++|.++|+|++.++... +... ..+.............+ T Consensus 81 ~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (303) T protein:vir:97 81 EYGARLSDEFLYATE-EEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADA 159 (303) T ss_pred EEeehhhHHHhhcCc-cchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHH Confidence 999999999985332 3455699999999999999999999999976433321 1111 11111111112233456 Q ss_pred HHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCC-ccceecCccceeccccCCce----------e Q lcl|Aclame:pro 393 ELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQ-TIATHFGFNRLVQSVAVDEK----------T 460 (517) Q Consensus 393 ~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~-~~~~l~g~~~v~~~~~~~~~----------~ 460 (517) ++..++.... .++.++.|||||+++.+|++|||++|+|+|+++...+ .+.+++|.+.++ +..++.. . T Consensus 160 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~-s~~v~~~~~~~~~~~~~~ 238 (303) T protein:vir:97 160 NIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSV-NTTVGAGADEAESKDLVI 238 (303) T ss_pred HHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEE-ecccCCccccCCCccEEE Confidence 6666665543 3566789999999999999999999999999876544 456788865544 3334321 1 Q ss_pred eeec-CceEEEeeehee--ehhh--------hhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 461 AVSL-SGYVTNGSRGME--FEQG--------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 461 ~~~~-~~~~~~~~~~~~--~~~d--------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) .++| ..|.++.+.++. ..+. ..+.+|++.+|++.|+++.|++|+||++.+..+- T Consensus 239 ~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 239 IGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 2334 234455554443 2211 1268899999999999999999999999986655 No 86 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=4.6e-40 Score=236.23 Aligned_cols=269 Identities=14% Similarity=0.015 Sum_probs=198.2 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhh Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~ 318 (517) +. ..+++.+|+.+...|++.++..+++++++++.++++ ..+|+.+....+.|++||+++|+++++|+++++.++++ T Consensus 1 ma--~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:16 1 MV--LNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred Cc--ccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeE Confidence 22 234778999999999999999999999999887653 56788888889999999999999999999999999999 Q ss_pred hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cCcc--ccccccccccccc---ccccccccH Q lcl|Aclame:pro 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV--TGVS--ETQIYPVVGDAWA---TNVTGTTNI 391 (517) Q Consensus 319 ~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G--~~~~--~~gi~~~~~~~~~---~~~~~~~~~ 391 (517) ++++++|++++.++. |+...|++||.++|++++++++|.++++|.+ ++++ ..+.......... ......... T Consensus 79 a~~~~iS~ell~~s~-d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:16 79 EYGARISDEFMYASD-EEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEeehhhHHHhhcCc-ccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHH Confidence 999999999986543 3344599999999999999999999999954 3333 2222121111111 111112223 Q ss_pred HHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc--------eeee Q lcl|Aclame:pro 392 QELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE--------KTAV 462 (517) Q Consensus 392 d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~--------~~~~ 462 (517) +++.+++..... ++.+++|||||++|..|++|||++|||||++....+.+.+++|.|+++ +..++. ...+ T Consensus 158 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~-~~~v~~~~~~~~~~~~~G 236 (298) T protein:vir:16 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDV-NKTVSDMSLTQRDRAIIG 236 (298) T ss_pred HHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEE-ecccccccCCCccEEEEe Confidence 455555554433 456778999999999999999999999999998888888999976554 333432 1224 Q ss_pred ecCceE-EEeeeh--eeehhhh--------hcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 463 SLSGYV-TNGSRG--MEFEQGT--------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 463 ~~~~~~-~~~~~~--~~~~~d~--------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) +|+.++ +..+.+ ++..++- .+.+|++.++++.|+|+.|.+|+||++.+-.- T Consensus 237 Dfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 455433 333333 2332221 26789999999999999999999999987533 No 87 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=4.8e-40 Score=236.13 Aligned_cols=262 Identities=13% Similarity=0.098 Sum_probs=204.7 Q ss_pred Hhhhh--hcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc----eeeeec-ccccceeeeccccccc-ccccc Q lcl|Aclame:pro 237 WTAEL--KERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL----VVGGDN-ALTQGTGHTTGTDKTE-SNITL 308 (517) Q Consensus 237 ~~~~~--~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-~~~~a~~~~eg~~~~~-~~~~f 308 (517) +.+.+ .....+++.+|+.+...|++.++..+++++++++.++++. .++... ....+.|++||+..|+ +.++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 22211 2233457889999999999999999999999888766432 234333 3466899999999997 56899 Q ss_pred eeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccc Q lcl|Aclame:pro 309 QTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGT 388 (517) Q Consensus 309 ~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~ 388 (517) +++++.++++++++++|+++++|+.++ |++||.++|+++++++++.+|++|+|.+... ... T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~---------------~~~ 141 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAEN----ILAWLSGWIAKKVVVTRNKAILGVVDKLPTK---------------PTL 141 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHH----HHHHHHHHHHHHHHHHHHhHHhhcccccccc---------------ccc Confidence 999999999999999999999998876 9999999999999999999999998764431 223 Q ss_pred ccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceecc-ccCCc-------e Q lcl|Aclame:pro 389 TNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE-------K 459 (517) Q Consensus 389 ~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~-------~ 459 (517) ...+++++++.... .+..+++|+||+++|..|++|||++|||||++++.++.+.+++|.+.+++. ..++. . T Consensus 142 ~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~ 221 (293) T protein:vir:48 142 TKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPL 221 (293) T ss_pred cCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEE Confidence 45678888776654 445678999999999999999999999999999999999999998776543 23332 1 Q ss_pred eeeecCc-eEEEeeeheeehh----hhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 460 TAVSLSG-YVTNGSRGMEFEQ----GTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 460 ~~~~~~~-~~~~~~~~~~~~~----d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ..++++. |.+..+.++.... ..++.+|++.|+++.|+++.+.+|+||++++++.+.+- T Consensus 222 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~ 284 (293) T protein:vir:48 222 YFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQ 284 (293) T ss_pred EEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccC Confidence 2345665 4555555554321 23468999999999999999999999999998776544 No 88 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=8.2e-40 Score=234.88 Aligned_cols=294 Identities=7% Similarity=0.009 Sum_probs=209.0 Q ss_pred HHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhh Q lcl|Aclame:pro 190 LMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSL 269 (517) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~ 269 (517) .++.++.. .+.+.+.... .....++ ..........++.+|..+.+.|++.+...+++ T Consensus 1 ~~~~~~~~-----------~~~~~f~~~~---------~~~~~~~---a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l 57 (324) T protein:vir:93 1 MEQTQKLK-----------LNLQHFASNN---------VKPQVFN---PDNVMMHEKKDGTLLNDFTTPILQEVMENSKI 57 (324) T ss_pred CchhHHHH-----------HHHHHHHHhh---------hhhhhcc---cccccccCCCcceechhHHHHHHHHHHhhchh Confidence 00000000 0000000000 0000010 01111122335578999999999999999999 Q ss_pred hhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHH Q lcl|Aclame:pro 270 LPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNR 347 (517) Q Consensus 270 ~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~ 347 (517) ++++++.++++ ..+|+......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++ T Consensus 58 ~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~ 133 (324) T protein:vir:93 58 MQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPM 133 (324) T ss_pred hhhcceeeccCCceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHH----HHHHHHHH Confidence 99999888754 4677877888899999999999999999999999999999999999999988755 89999999 Q ss_pred HHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcC Q lcl|Aclame:pro 348 LPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDK 426 (517) Q Consensus 348 l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~ 426 (517) |++++++++|.++|+|+|++....+++......... ..++.+.+++.+++..... ++.+++|+|||++|..|+++||+ T Consensus 134 l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~ 212 (324) T protein:vir:93 134 IAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV-IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDP 212 (324) T ss_pred HHHHHHHHHHHHHhcCCCCCCcCcccccccccccee-ccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCC Confidence 999999999999999999887666776655433332 3344567788877766543 45678999999999999999999 Q ss_pred CCCEeccCCCCCCccceecCccceecc-ccCCc--eeeeecCceEEEeeehee--ehhhh--------------hcccch Q lcl|Aclame:pro 427 NGNYVFPVGVSNQTIATHFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQGT--------------ILVENN 487 (517) Q Consensus 427 ~Gryl~~~~~~~~~~~~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~ 487 (517) +|||+|+++ ...+++|.+.+... ..++. ...++++.+.++.+.++. ..++. .+.+|+ T Consensus 213 ~G~~~~~~~----~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~ 288 (324) T protein:vir:93 213 ETKERIYDR----NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM 288 (324) T ss_pred CCCeeecCC----CCCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCc Confidence 999999743 34568887665432 22332 334567777666554433 22221 267899 Q ss_pred HHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 488 KEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 488 ~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +.|+++.|+|+.|.+|+||++++ .+.+| T Consensus 289 ~~~r~~~r~d~~v~~~~a~~~l~--~a~~~ 316 (324) T protein:vir:93 289 VALRATMHVALHIADDKAFAKLV--PADKR 316 (324) T ss_pred EEEEEEEEeccEEecccceEEEe--ccccc Confidence 99999999999999999999765 55555 No 89 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=5.9e-40 Score=235.66 Aligned_cols=274 Identities=10% Similarity=0.047 Sum_probs=209.5 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc---ceeeeecccccceeeeccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT---LVVGGDNALTQGTGHTTGTDK 301 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~a~~~~eg~~~ 301 (517) |.. ..+ + ..+.......+..+|+.+.+.|++.+.+.+++++++++.++++ ..++.......+.|++||+.+ T Consensus 1 m~~---~~~-~--~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTV---QTF-N--PENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCc---ccc-c--cccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc Confidence 111 111 1 1111222334567999999999999999999999998887643 345666677789999999999 Q ss_pred ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccc Q lcl|Aclame:pro 302 TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAW 381 (517) Q Consensus 302 ~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~ 381 (517) |+++++|+++++.++++++++++|+++++|+..+ |++||.++|++++++++|.++|+|+|++++ .++++..+... T Consensus 75 ~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~----l~~~i~~~la~ai~~~~d~a~l~G~g~~~~-~gi~~~~~~~~ 149 (297) T protein:vir:95 75 KTDKPEVVPVTLKAHKLGIILVTSREALNYTWKK----FFEDMKPQIVEAFYKKIDEAGLLGHDTPFA-NSVAKAAKDAN 149 (297) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHHhcccCCccc-ccccccccccc Confidence 9999999999999999999999999999988765 999999999999999999999999998766 57776655433 Q ss_pred cccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceec-cccCCce Q lcl|Aclame:pro 382 ATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQ-SVAVDEK 459 (517) Q Consensus 382 ~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~-~~~~~~~ 459 (517) .. .++..+.+++++++..... ++.+++|+|||.+|.+|++|||++|||||++. ..+++|.+.+.. ...++.. T Consensus 150 ~~-~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-----~~~l~G~Pv~~~~~~~~~~~ 223 (297) T protein:vir:95 150 KV-IGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-----ANTIDGITTVDLKSARFEKG 223 (297) T ss_pred ee-cccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-----CCcccceeeEeecCCCCCCc Confidence 32 2334567788887766544 45678999999999999999999999999754 356788765543 2223333 Q ss_pred --eeeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 460 --TAVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 460 --~~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) ..++++.|+++.+.++. .+++. .+++|++.++++.|+|+.+.+|+||+.++...|| T Consensus 224 ~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 224 DLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred eEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 34667777766554433 33321 1578999999999999999999999999999999 No 90 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=1.2e-39 Score=234.05 Aligned_cols=296 Identities=7% Similarity=-0.004 Sum_probs=208.6 Q ss_pred hhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--cee Q lcl|Aclame:pro 206 KVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVV 283 (517) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~ 283 (517) .++++..+. ..+.+........... + .........+..+|..+.+.|++.+++.+++++++++.++++ ..+ T Consensus 1 ~~~~~~~~~--~~~~f~~~~~~~~~~~---a--~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~ 73 (324) T protein:vir:10 1 MEQTQKLKL--NLQHFASNNVKPQVFN---P--DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CCCchHHHH--HHHHHHHHhhccceec---c--cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 000000000 0000111111111000 0 011111223457899999999999999999999999887754 466 Q ss_pred eeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 284 GGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG 363 (517) Q Consensus 284 ~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G 363 (517) |.......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++|++++++++|.++|+| T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~ai~~~~d~a~l~G 149 (324) T protein:vir:10 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhhc Confidence 7777778899999999999999999999999999999999999999988755 999999999999999999999999 Q ss_pred cccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccc Q lcl|Aclame:pro 364 GVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIA 442 (517) Q Consensus 364 ~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~ 442 (517) +|++....++++....... ...++.+.+++.+++..... ++.+++|+|||++|..|+++||++|||+|++. ... T Consensus 150 ~g~~~~~~~i~~~~~~~~~-~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----~~~ 224 (324) T protein:vir:10 150 QGNNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR----NSD 224 (324) T ss_pred CCCCccCccccccccccce-eccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCC----CCc Confidence 9988766666655443332 23344556778877766544 45678999999999999999999999999753 345 Q ss_pred eecCccceecc-ccCCc--eeeeecCceEEEeeehe--eehhhh--------------hcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 443 THFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGM--EFEQGT--------------ILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 443 ~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~--~~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~ 503 (517) +++|.+.+... ..++. ...++++.++++.+.++ +.+++. .+.+|++.++++.|+|+.+.+| T Consensus 225 ~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:10 225 TLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred cccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 68887765432 22333 33456777776655443 333321 2578999999999999999999 Q ss_pred cceEEEEeCC----CCCC Q lcl|Aclame:pro 504 GTTAYGTYTP----PVAG 517 (517) Q Consensus 504 ~a~~~~~~tp----~~a~ 517 (517) +||++.+-.. +++| T Consensus 305 ~A~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:10 305 KAFAKLVPADKKTDSVPG 322 (324) T ss_pred cceEEEEeccCCCCCCCC Confidence 9999876522 2344 No 91 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.5e-39 Score=233.42 Aligned_cols=277 Identities=12% Similarity=0.050 Sum_probs=201.2 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +. ......+++.+|..+...|++.+++.+++++++++.+.++ ..+|+......+.|++||+.+|+++++|+++++. T Consensus 1 Ma--~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MA--DDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CC--CCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEee Confidence 11 1222345889999999999999999999999998887754 5678888888999999999999999999999999 Q ss_pred HhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC--cccccccccccccccccccccccHH Q lcl|Aclame:pro 315 PQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG--VSETQIYPVVGDAWATNVTGTTNIQ 392 (517) Q Consensus 315 ~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~--~~~~gi~~~~~~~~~~~~~~~~~~d 392 (517) ++++++++++|++++.++..+....|++||.++|++++++++|.++++|+|.+ .+..++..................+ T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATA 158 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchH Confidence 99999999999999999988877889999999999999999999999997743 3332322222111111111222334 Q ss_pred HHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCCC-----EeccCCCCCCccceecCccceeccccCCce------ Q lcl|Aclame:pro 393 ELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNGN-----YVFPVGVSNQTIATHFGFNRLVQSVAVDEK------ 459 (517) Q Consensus 393 ~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~Gr-----yl~~~~~~~~~~~~l~g~~~v~~~~~~~~~------ 459 (517) ++..++.... .+..+++|+|||.++..|++|||++|+ |+|+ ....+...+++|.++++ +..|+.. T Consensus 159 d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~-~~~~g~~~tl~G~PV~~-~~~~~~~~~~~~~ 236 (315) T protein:vir:80 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYP-AAGFAGLDNWRGLNVGA-SSTVSGAPEMSPA 236 (315) T ss_pred HHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCccccccccc-ccccCCCceecceeeEe-cCcCCcccccccc Confidence 5555443322 223457899999999999999877654 6663 44555567899976554 3444422 Q ss_pred -----eeeecCceEEEeeehe--eehhh--------hhcccchHHHHHhhhhcceeecccceEEEEeCC-C----CCC Q lcl|Aclame:pro 460 -----TAVSLSGYVTNGSRGM--EFEQG--------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTP-P----VAG 517 (517) Q Consensus 460 -----~~~~~~~~~~~~~~~~--~~~~d--------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp-~----~a~ 517 (517) ..++|+.|.++.+.++ ++.++ ..+.+|++.|+++.|+|+.|++|+||++.+... | .|| T Consensus 237 ~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~ 314 (315) T protein:vir:80 237 SGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAE 314 (315) T ss_pred cccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCC Confidence 2245666666555443 33322 126789999999999999999999999987532 2 555 No 92 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=2.9e-39 Score=231.85 Aligned_cols=296 Identities=7% Similarity=-0.013 Sum_probs=208.2 Q ss_pred hhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--cee Q lcl|Aclame:pro 206 KVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVV 283 (517) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~ 283 (517) .++++..+.. .+.+........ .+ ++ .........+..+|+.+.+.|++.+++.+++++++++.++++ ..+ T Consensus 1 ~~k~~~~~~~--~~~~~~~~~~~~--~~-~a--~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~ 73 (324) T protein:vir:99 1 MEQTQKLKLN--LQHFASNNVKPQ--VF-NP--DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKF 73 (324) T ss_pred CCCchHhhHH--HHHHHHHhhhhh--hc-cc--cceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 0000000000 000000000000 00 00 011111223457899999999999999999999999888754 467 Q ss_pred eeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 284 GGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG 363 (517) Q Consensus 284 ~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G 363 (517) |.......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++|++++++++|.++|+| T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~----l~~~i~~~l~~ai~~~~d~~~l~G 149 (324) T protein:vir:99 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhhc Confidence 7777778899999999999999999999999999999999999999988754 999999999999999999999999 Q ss_pred cccCcccccccccccccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccc Q lcl|Aclame:pro 364 GVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIA 442 (517) Q Consensus 364 ~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~ 442 (517) +|++....++++...... ....++...+++++++..... ++.+++|+|||++|..|+++||++|||+|++. ... T Consensus 150 ~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~----~~~ 224 (324) T protein:vir:99 150 QGNNPFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR----NSD 224 (324) T ss_pred CCCCccCccccccccccc-eeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCC----CCc Confidence 998876666665543332 223344556778877766544 45678999999999999999999999999643 345 Q ss_pred eecCccceecc-ccCCc--eeeeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 443 THFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 443 ~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~ 503 (517) +++|.+.+... ...+. ...++++.|+++.+.++. ..++. .+.+|++.++++.|+|+.+.+| T Consensus 225 ~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 304 (324) T protein:vir:99 225 TLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred cccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecc Confidence 68887655432 22232 334567777766555433 32221 2578999999999999999999 Q ss_pred cceEEEEeCCC----CCC Q lcl|Aclame:pro 504 GTTAYGTYTPP----VAG 517 (517) Q Consensus 504 ~a~~~~~~tp~----~a~ 517 (517) +||++.+..-+ .+| T Consensus 305 ~a~~~lt~a~~~~~~~~~ 322 (324) T protein:vir:99 305 KAFAKLVPADKKTDSVPG 322 (324) T ss_pred cceEEEEeccCCCCCCCC Confidence 99999876333 333 No 93 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=2.6e-39 Score=232.08 Aligned_cols=294 Identities=6% Similarity=-0.011 Sum_probs=206.8 Q ss_pred hhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--cee Q lcl|Aclame:pro 206 KVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVV 283 (517) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~ 283 (517) .++.+.... ..+.+......... . ++ .........+..+|+.+.+.|++.+++.+++++++++.++++ ..+ T Consensus 1 ~~~~~~~~~--~~~~f~~~~~~~~~--~-~a--~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKL--NLQHFASNNVKPQV--F-NP--DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CCcchhhhH--HHHHHHHhhhhhhh--c-cc--ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 000000000 00000000000000 0 00 011111223557899999999999999999999998888764 467 Q ss_pred eeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 284 GGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG 363 (517) Q Consensus 284 ~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G 363 (517) |.......+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ |++||.++|++++++++|.++|+| T Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~----l~~~i~~~l~~aia~~~d~~~l~G 149 (324) T protein:vir:96 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHhhhc Confidence 7777778899999999999999999999999999999999999999987654 999999999999999999999999 Q ss_pred cccCcccccccccccccccccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccc Q lcl|Aclame:pro 364 GVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIA 442 (517) Q Consensus 364 ~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~ 442 (517) +|++....++......... ...+..+.+++.+++.... .++.+++|+|||++|.+|+++||++|||+|+.+ ... T Consensus 150 ~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~----~~~ 224 (324) T protein:vir:96 150 QGNNPFGKSIAQSIKKTNK-VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR----NSD 224 (324) T ss_pred CCCCCcCccccccccccce-ecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC----CCC Confidence 9988776666655443332 2334455778887776654 345678999999999999999999999999643 345 Q ss_pred eecCccceecc-ccCCc--eeeeecCceEEEeeehee--ehhh--------------hhcccchHHHHHhhhhcceeecc Q lcl|Aclame:pro 443 THFGFNRLVQS-VAVDE--KTAVSLSGYVTNGSRGME--FEQG--------------TILVENNKEYLFEMPISGSLEYK 503 (517) Q Consensus 443 ~l~g~~~v~~~-~~~~~--~~~~~~~~~~~~~~~~~~--~~~d--------------~~~~~n~~~~~~~~rvgg~v~~~ 503 (517) +++|.+.+... ..++. ...++++.+.++.+.++. ..++ ..+.+|++.++++.|+|+.+.+| T Consensus 225 ~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~ 304 (324) T protein:vir:96 225 SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred cccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecc Confidence 78887765432 22333 334566666665544432 2222 12678999999999999999999 Q ss_pred cceEEEEeCCCCCC Q lcl|Aclame:pro 504 GTTAYGTYTPPVAG 517 (517) Q Consensus 504 ~a~~~~~~tp~~a~ 517 (517) +||++.+ ++.+| T Consensus 305 ~a~~~l~--~a~~~ 316 (324) T protein:vir:96 305 KAFAKLV--PADKR 316 (324) T ss_pred cceEEEe--ccccc Confidence 9998755 55554 No 94 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=3.8e-39 Score=231.24 Aligned_cols=269 Identities=14% Similarity=0.031 Sum_probs=198.2 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeHhhh Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~ 318 (517) +.. .+++.+|+.+...|++.++..+++++++++.++++ ..+|+......+.|+.||+.+|+++++|+++++.++++ T Consensus 1 ma~--~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:94 1 MVL--NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKV 78 (298) T ss_pred Cee--ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEE Confidence 222 33778999999999999999999999999887754 46778777888999999999999999999999999999 Q ss_pred hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc--cCccccccccc--cccc---ccccccccccH Q lcl|Aclame:pro 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV--TGVSETQIYPV--VGDA---WATNVTGTTNI 391 (517) Q Consensus 319 ~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G--~~~~~~gi~~~--~~~~---~~~~~~~~~~~ 391 (517) ++++++|++++.++. ++...|++||.++|++++++++|.++|+|.+ ++++..++... .... ........... T Consensus 79 ~~~~~iS~ell~~~~-~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (298) T protein:vir:94 79 EYGARISDEFMYASD-EEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPN 157 (298) T ss_pred EEeeehhHHHhccCC-ccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHH Confidence 999999999986443 3455699999999999999999999999943 33332222211 1111 11111122234 Q ss_pred HHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCce--------eee Q lcl|Aclame:pro 392 QELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEK--------TAV 462 (517) Q Consensus 392 d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~--------~~~ 462 (517) +++..++..... +..+++|||||++|.+|++|||++|||||++....+.+.+++|.+.+.. ..++.. ..+ T Consensus 158 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~-~~v~~~~~~~~~~~~~G 236 (298) T protein:vir:94 158 GAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVN-KTVSDMSLTQRDRAIIG 236 (298) T ss_pred HHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEe-cccccccCCCccEEEEe Confidence 456666555433 4567889999999999999999999999999988888899999765543 334321 223 Q ss_pred ecCceE-EEeeeh--eeehhh--------hhcccchHHHHHhhhhcceeecccceEEEEeCC Q lcl|Aclame:pro 463 SLSGYV-TNGSRG--MEFEQG--------TILVENNKEYLFEMPISGSLEYKGTTAYGTYTP 513 (517) Q Consensus 463 ~~~~~~-~~~~~~--~~~~~d--------~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp 513 (517) +++..+ +..+.+ ++..++ ..+.+|++.++++.|+|+.+.+|+||++.+-.- T Consensus 237 dfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 237 DFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 444322 333333 233221 136789999999999999999999999887533 No 95 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=8.4e-39 Score=229.32 Aligned_cols=355 Identities=14% Similarity=0.113 Sum_probs=203.8 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhh Q lcl|Aclame:pro 131 TYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPE 210 (517) Q Consensus 131 ~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (517) ..++ ..+ ..+...+..++..+..+..+.. +.+.+ ...+....+.....++.... ..+ T Consensus 1 M~~k--l~~---~~~~~~e~~~~l~~~~~~~~~~-----~~~~~---~~~~~~~~~~~~~~~~~~~~----------~~~ 57 (383) T protein:vir:78 1 MTIK--LKN---NLANYEEKRTAFVNAVKNEDTQ-----EIQNK---AYVEMVDAMAADIMEQAKKE----------ARQ 57 (383) T ss_pred Cchh--HHH---HHHHHHHHHHHHHHHHhccChH-----HHHHH---HHHHHHHHHHHHHHHHHHHH----------HHH Confidence 1100 001 1111111111111111100000 00000 00000000110100000000 000 Q ss_pred hhhHHHHHHHHHHHHhhccchhhHHHHh-hhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc-ceeeeecc Q lcl|Aclame:pro 211 ATEFLKTREAEVAYMSASLTKDPKAAWT-AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT-LVVGGDNA 288 (517) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 288 (517) ..+.......... ....+.++... -.......+++.+|+.+.+.|++.+...+++++++++.++++ ..++.... T Consensus 58 ~~~~~~~~~~g~~----~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~~i~~~~~ 133 (383) T protein:vir:78 58 EADAYISASRTDK----NITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRTKFLKSET 133 (383) T ss_pred HHHHHHHhcCChh----hhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCceEEEEEcC Confidence 0000000000000 00001111110 011233456899999999999999999999999999988765 46777777 Q ss_pred cccceeeeccccc-ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC Q lcl|Aclame:pro 289 LTQGTGHTTGTDK-TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG 367 (517) Q Consensus 289 ~~~a~~~~eg~~~-~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~ 367 (517) .+.+.|++|++.. .+++++|+++++.++++++++++|++|++|+.++ |++||.++|+++++.+++.+||+|+|++ T Consensus 134 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~----ie~~i~~~l~~~~a~~~~~a~i~G~G~~ 209 (383) T protein:vir:78 134 SGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAW----VKRFVVTQIEEAFAVALESAYIVGDGND 209 (383) T ss_pred CcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHH----HHHHHHHHHHHHHHHHHhhheEeccCCC Confidence 8889999997664 5678999999999999999999999999999886 9999999999999999999999999998 Q ss_pred cccccccccccccccc--------cccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHHHHHhh---cCCCCEeccCCC Q lcl|Aclame:pro 368 VSETQIYPVVGDAWAT--------NVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAIRFLK---DKNGNYVFPVGV 436 (517) Q Consensus 368 ~~~~gi~~~~~~~~~~--------~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~l~~lK---D~~Gryl~~~~~ 436 (517) +| .||++........ ........+++..........+.++.|+||..++..++++| +..+.|.|++.. T Consensus 210 qP-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~ 288 (383) T protein:vir:78 210 KP-IGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQY 288 (383) T ss_pred Cc-eeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccch Confidence 76 5887643321111 11111122233322233333445566666666666666665 222223344332 Q ss_pred ----CCCccceecCccc-eeccccCCc--eeeeecCceEEEeeehee--ehhhhhcccchHHHHHhhhhcceeecccceE Q lcl|Aclame:pro 437 ----SNQTIATHFGFNR-LVQSVAVDE--KTAVSLSGYVTNGSRGME--FEQGTILVENNKEYLFEMPISGSLEYKGTTA 507 (517) Q Consensus 437 ----~~~~~~~l~g~~~-v~~~~~~~~--~~~~~~~~~~~~~~~~~~--~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~ 507 (517) .+|...+++|++. ++.+..|++ ...++|+.|.++.+.+++ .+++..+.++++.|++..|++|.+.+|+||+ T Consensus 289 ~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~ 368 (383) T protein:vir:78 289 TSLNANGVYVTALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAA 368 (383) T ss_pred hccCCCCceeeecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEE Confidence 3344456666554 344555665 445677889988876654 5566778899999999999999999999999 Q ss_pred EEEeCCCCC-----C Q lcl|Aclame:pro 508 YGTYTPPVA-----G 517 (517) Q Consensus 508 ~~~~tp~~a-----~ 517 (517) +.+++-.-+ | T Consensus 369 vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 369 VWTLNINPAEQTPEG 383 (383) T ss_pred EEEEEecCCCCCCCC Confidence 988863322 2 No 96 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=4.6e-39 Score=230.76 Aligned_cols=285 Identities=13% Similarity=0.039 Sum_probs=200.3 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKT 302 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~ 302 (517) +..+...+........ ......+..+|+.+...+++.++..+++++++++.++++ ..+|+......+.|+.||+.+| T Consensus 1 ~~~~~~~~~~~~~~~~-t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQ-TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhc-cccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCcccc Confidence 2222211111111111 111223446899999999999999999999998887653 5677777788899999999999 Q ss_pred cccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccc--- Q lcl|Aclame:pro 303 ESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGD--- 379 (517) Q Consensus 303 ~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~--- 379 (517) +++++|+++++.++++++++++|++++.|+..+ |++||.++|++++++++|.++|+|+|++.+. ++...... T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~----l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~-~~~~~~~~~~~ 154 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDSAALNGTDSPFPT-YLAQTTKSVSL 154 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHH----HHHHHHHHHHHHHHHHHHHHhhcccCCCCCc-ccccccccccc Confidence 999999999999999999999999999988765 9999999999999999999999999987653 33322221 Q ss_pred ccccccc--ccccHH-HHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCcc-----ceecCccce Q lcl|Aclame:pro 380 AWATNVT--GTTNIQ-ELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTI-----ATHFGFNRL 450 (517) Q Consensus 380 ~~~~~~~--~~~~~d-~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~-----~~l~g~~~v 450 (517) ......+ .....+ ++.+.+.... .+..+++|||||++|.+|++|||++|||||++....+.. .+++|.++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv- 233 (320) T protein:vir:10 155 ADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT- 233 (320) T ss_pred eecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeee- Confidence 1111111 111222 2333333332 345678999999999999999999999999876655543 34555444 Q ss_pred eccccCCcee----eeecCceEEEeeehee--ehhhh--------------hcccchHHHHHhhhhcceeecccceEEEE Q lcl|Aclame:pro 451 VQSVAVDEKT----AVSLSGYVTNGSRGME--FEQGT--------------ILVENNKEYLFEMPISGSLEYKGTTAYGT 510 (517) Q Consensus 451 ~~~~~~~~~~----~~~~~~~~~~~~~~~~--~~~d~--------------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~ 510 (517) +++..+++.. .++++.|+++.+.++. ..++. .+++|++.++++.|+++.|.+|+||++++ T Consensus 234 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~ 313 (320) T protein:vir:10 234 ILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLT 313 (320) T ss_pred EecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEE Confidence 3444454432 3456666666554443 22221 25789999999999999999999999887 Q ss_pred -eCCCCC Q lcl|Aclame:pro 511 -YTPPVA 516 (517) Q Consensus 511 -~tp~~a 516 (517) .+.|=| T Consensus 314 ~~~ap~~ 320 (320) T protein:vir:10 314 NVVTPDA 320 (320) T ss_pred eccCCCC Confidence 455556 No 97 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.8e-39 Score=233.04 Aligned_cols=289 Identities=11% Similarity=0.025 Sum_probs=199.6 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeee Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGD 286 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 286 (517) .+.... +...+ ...+.+++...... ..+..+|+.+.+.|++.++..+++++++++.++++ ..+|+. T Consensus 1 ~~~~~~-----r~~~~----~~~~e~~a~~~~~~---~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~ 68 (326) T protein:vir:42 1 MAVNPD-----RTTPF----LGVNDPKVAQTGDS---MFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHW 68 (326) T ss_pred CCCCcc-----chhhh----cCcchhhheecccc---CCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEE Confidence 000000 00000 00011111111111 12335899999999999999999999998887653 567888 Q ss_pred cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|Aclame:pro 287 NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVT 366 (517) Q Consensus 287 ~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~ 366 (517) +....+.|++||+.+|+++++|+++++.++++++++++|++++.++..+ +++||.++|++++++++|.++|+|+|+ T Consensus 69 ~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~----~~~~i~~~l~~a~~~~~d~a~l~G~gs 144 (326) T protein:vir:42 69 TGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPAN----YLGTMRTKVATAFAMAFDNAAINGTDS 144 (326) T ss_pred eCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 8888999999999999999999999999999999999999999998765 999999999999999999999999998 Q ss_pred Ccccccccccccccccccc-----cccccH-HHHH-HHHHH-hhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCC Q lcl|Aclame:pro 367 GVSETQIYPVVGDAWATNV-----TGTTNI-QELL-EKLSV-ATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSN 438 (517) Q Consensus 367 ~~~~~gi~~~~~~~~~~~~-----~~~~~~-d~l~-~~l~~-~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~ 438 (517) +++ .|++...+....... ...... +..+ ..+.. ...++.+++|||||++|.+|++|||++|||||++.... T Consensus 145 ~~p-~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~ 223 (326) T protein:vir:42 145 PFP-TFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYT 223 (326) T ss_pred Ccc-ccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeecccccc Confidence 766 566655433222111 111111 2212 22222 23445678999999999999999999999999987666 Q ss_pred Cccc-----eecCccceeccccCCcee----eeecCceEEEeeehe--eehhhhh--------------cccchHHHHHh Q lcl|Aclame:pro 439 QTIA-----THFGFNRLVQSVAVDEKT----AVSLSGYVTNGSRGM--EFEQGTI--------------LVENNKEYLFE 493 (517) Q Consensus 439 ~~~~-----~l~g~~~v~~~~~~~~~~----~~~~~~~~~~~~~~~--~~~~d~~--------------~~~n~~~~~~~ 493 (517) +... +++|.+.++ +..+++.. .++++.+.++.+.++ +.+++.. +.+|++.|++. T Consensus 224 ~~~~~~~~~~l~G~pv~~-~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~ 302 (326) T protein:vir:42 224 EENSPFRLGRIVARPTIL-SDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVE 302 (326) T ss_pred CccccccCceeeeeeEEE-cCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEE Confidence 5543 466655443 34454432 235566666554443 3333211 56788999999 Q ss_pred hhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 494 MPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 494 ~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .|+++.|.+|+||+.++ ...|+ T Consensus 303 ~~~d~~v~~~~a~~~l~--~~~~~ 324 (326) T protein:vir:42 303 AEYAFHCNDKDAFVKLT--NVDAT 324 (326) T ss_pred EEeccEEecccceEEEe--ecccc Confidence 99999999999998755 44444 No 98 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=7.2e-39 Score=229.71 Aligned_cols=267 Identities=16% Similarity=0.122 Sum_probs=196.1 Q ss_pred hhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccc-----cccccccceee Q lcl|Aclame:pro 239 AELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTD-----KTESNITLQTR 311 (517) Q Consensus 239 ~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~-----~~~~~~~f~~~ 311 (517) .+.......++.+|+.+...|++.+++.+++++++++.++.+ ..+|.......+.|++||+. +|.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 122233445788999999999999999999999999888754 46777888889999999986 45578899999 Q ss_pred EeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccc--ccccccccccc--ccccc Q lcl|Aclame:pro 312 VLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSET--QIYPVVGDAWA--TNVTG 387 (517) Q Consensus 312 ~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~--gi~~~~~~~~~--~~~~~ 387 (517) ++.++++++++++|+++++|+..+ +++||.++|++++++++|.++|+|+|.+.+.. ++.+....... ..... T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~----~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVA----VLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGG 156 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHH----HHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccccccc Confidence 999999999999999999988765 89999999999999999999999999765421 22222211111 11111 Q ss_pred cccHHHHHHHHHHhhh-----hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCC----c Q lcl|Aclame:pro 388 TTNIQELLEKLSVATP-----KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVD----E 458 (517) Q Consensus 388 ~~~~d~l~~~l~~~~~-----~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~----~ 458 (517) ....+++.+.+..+.. .+..+.|+|||.+|..|++|||++|||||+++ +++|.+.++.. .++ . T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~-------~l~G~Pv~~~~-~~~~~~~~ 228 (305) T protein:vir:25 157 VANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-------SFAGFRTFFNR-NGAWDADA 228 (305) T ss_pred chhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC-------cccccceEEcC-ccCCCCCc Confidence 2222344444443322 23445799999999999999999999999864 57776655432 222 1 Q ss_pred --eeeeecCceEEEeeeheee--hhhh----------hcccchHHHHHhhhhcceeecccceEEEEeCCC--CCC Q lcl|Aclame:pro 459 --KTAVSLSGYVTNGSRGMEF--EQGT----------ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP--VAG 517 (517) Q Consensus 459 --~~~~~~~~~~~~~~~~~~~--~~d~----------~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~--~a~ 517 (517) ...++++.|.++.+.++.. .++. .++.|++.+|++.|+|+.|.+|+++++++.+|. |+. T Consensus 229 ~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~p 303 (305) T protein:vir:25 229 AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) T ss_pred cEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCC Confidence 2345677787776655432 2221 257788999999999999999999999998765 444 No 99 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=3.4e-38 Score=225.99 Aligned_cols=282 Identities=14% Similarity=0.051 Sum_probs=200.6 Q ss_pred HHHHHHHhhccchhhHHHHhhhhhcc----cccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccc Q lcl|Aclame:pro 219 EAEVAYMSASLTKDPKAAWTAELKER----GISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQG 292 (517) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a 292 (517) .+.+.. .+... ...... ...+..+|+.+.+.|++.++..+++++++++.++++ ..+|.......+ T Consensus 1 ~a~l~e--------l~~~~-~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a 71 (333) T protein:vir:78 1 MATLNE--------LLPNS-AGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEV 71 (333) T ss_pred CchhHH--------hhhhc-ccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcee Confidence 011110 00000 000000 012336899999999999999999999999888764 467777777777 Q ss_pred eeeecc--------cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|Aclame:pro 293 TGHTTG--------TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG 364 (517) Q Consensus 293 ~~~~eg--------~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~ 364 (517) .|++|| +.+|+++++|+++++.+++++.++++|++++.++..+ +++||.++|++++++++|.++|+|+ T Consensus 72 ~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~----~~~~i~~~la~ai~~~~d~~~l~G~ 147 (333) T protein:vir:78 72 GQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSG----LYTKLQGDLAYAIGRGIDLAVFHGK 147 (333) T ss_pred EeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHHhccc Confidence 776655 5677889999999999999999999999999988765 9999999999999999999999999 Q ss_pred ccCcc--cccccccccccc-----cccccccccHHHHHHHHHHhhh--hhcCCEEEEcHHHHHHHH---HhhcCCCCEec Q lcl|Aclame:pro 365 VTGVS--ETQIYPVVGDAW-----ATNVTGTTNIQELLEKLSVATP--KAADSTLVIHRNDLAAIR---FLKDKNGNYVF 432 (517) Q Consensus 365 G~~~~--~~gi~~~~~~~~-----~~~~~~~~~~d~l~~~l~~~~~--~~~~a~~vmn~~~~~~l~---~lKD~~Gryl~ 432 (517) |++++ ..|+.+...... ..........+++++++..... ++.+++|+|||.+|..|+ ++||++|+|+| T Consensus 148 g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~ 227 (333) T protein:vir:78 148 SPLTGSALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP 227 (333) T ss_pred CCCCCcccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceee Confidence 98654 334443322211 1122233345666666655433 334568999999987665 57899999999 Q ss_pred cCCCCCCccceecCccceeccccCCc-----------eeeeecCceEEEeeeheee--hhh-----------hhcccchH Q lcl|Aclame:pro 433 PVGVSNQTIATHFGFNRLVQSVAVDE-----------KTAVSLSGYVTNGSRGMEF--EQG-----------TILVENNK 488 (517) Q Consensus 433 ~~~~~~~~~~~l~g~~~v~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~--~~d-----------~~~~~n~~ 488 (517) ++....+.+.+++|.+.++. ..++. ...++++.|+++.+.++.. .++ ..+.+|++ T Consensus 228 ~~~~~~~~~~~l~G~Pv~~~-~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v 306 (333) T protein:vir:78 228 SRINLAAQTGDVLGLPAQFG-RAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQI 306 (333) T ss_pred cCccccCCCceeeceeeEEc-cccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcE Confidence 99888888899999765543 33321 2345677777776655443 221 12578899 Q ss_pred HHHHhhhhcceeecccceEEEEe-CCC Q lcl|Aclame:pro 489 EYLFEMPISGSLEYKGTTAYGTY-TPP 514 (517) Q Consensus 489 ~~~~~~rvgg~v~~~~a~~~~~~-tp~ 514 (517) .+|++.|+++.|++|+||++.+. +.| T Consensus 307 ~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 307 AILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEEEEEEccEEecccceEEEeccCCC Confidence 99999999999999999999886 555 No 100 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=4.6e-38 Score=225.29 Aligned_cols=288 Identities=15% Similarity=0.079 Sum_probs=200.2 Q ss_pred HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeeccc---- Q lcl|Aclame:pro 216 KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNAL---- 289 (517) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~---- 289 (517) -....++..+..+..... .. ....+..+|+.+.+.|++.++..+++++++++.++++ ..+|..... T Consensus 1 ~~~~~e~~~~~~~~~~~~------~~--~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~ 72 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQG------RL--AHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVG 72 (338) T ss_pred CcchHHhhhhhccccccc------ce--ecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccce Confidence 000000000000000000 00 0112447999999999999999999999999988765 445554432 Q ss_pred ----ccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|Aclame:pro 290 ----TQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV 365 (517) Q Consensus 290 ----~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G 365 (517) ..+.|++||+.+|+++++|+++++.++++++++++|++++.|+..+ +++||.++|++++++++|.++|+|+| T Consensus 73 ~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~----~~~~i~~~la~a~~~~~d~~~l~G~g 148 (338) T protein:vir:78 73 QVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSG----LYTKLQADLAYAIGRGIDLAVFHGKS 148 (338) T ss_pred eecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHH----HHHHHHHHHHHHHHHHHHHHhhcccC Confidence 4466778999999999999999999999999999999999988765 89999999999999999999999999 Q ss_pred cCcc--cccccccccccccccc-----cccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHH---HHhhcCCCCEecc Q lcl|Aclame:pro 366 TGVS--ETQIYPVVGDAWATNV-----TGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAI---RFLKDKNGNYVFP 433 (517) Q Consensus 366 ~~~~--~~gi~~~~~~~~~~~~-----~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l---~~lKD~~Gryl~~ 433 (517) ++++ ..|+.+.......... .....++++.+++.... ..+..++|+|||.++.+| +++||++|||||+ T Consensus 149 ~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~ 228 (338) T protein:vir:78 149 PLTGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPT 228 (338) T ss_pred CCccccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeec Confidence 7653 4555554433222211 11223345554443322 334567899999998776 5688999999999 Q ss_pred CCCCCCccceecCccceeccccCCc-----------eeeeecCceEEEeeeheee--hhh--------------hhcccc Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVAVDE-----------KTAVSLSGYVTNGSRGMEF--EQG--------------TILVEN 486 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~--~~d--------------~~~~~n 486 (517) +....+.+.+++|.|.++ +..+++ ...++++.|.++.+.++.. .++ ..+.+| T Consensus 229 ~~~~~~~~~~l~G~PV~~-~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (338) T protein:vir:78 229 RINLAASAGDLLGLPVQF-GKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTN 307 (338) T ss_pred ccccCCCCceeeeeeEEE-ccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcC Confidence 998888889999976554 333332 2235677777776655432 221 115789 Q ss_pred hHHHHHhhhhcceeecccceEEEEe-CCCCC Q lcl|Aclame:pro 487 NKEYLFEMPISGSLEYKGTTAYGTY-TPPVA 516 (517) Q Consensus 487 ~~~~~~~~rvgg~v~~~~a~~~~~~-tp~~a 516 (517) ++.+|++.|+|+.|.+|+||++.+- +.|-| T Consensus 308 ~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 308 QIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred cEEEEEEEEeccEeecccceEEEecccCCCC Confidence 9999999999999999999988665 44455 No 101 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=2.2e-38 Score=226.99 Aligned_cols=272 Identities=11% Similarity=0.023 Sum_probs=190.7 Q ss_pred hhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc--ceeeeecccccceeeecccccccccccceeeEeeH Q lcl|Aclame:pro 238 TAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTP 315 (517) Q Consensus 238 ~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~ 315 (517) ...++ ...++.+|+.+.+.|++.++..+++++++++.++++ ..+|.......+.|++||+.+|+++++|+++++.+ T Consensus 1 Mat~t--t~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFG--TGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTP 78 (311) T ss_pred Cceec--CCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEee Confidence 11111 234678999999999999999999999998887753 56788888889999999999999999999999999 Q ss_pred hhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCc--cccccccccc---cccccccccccc Q lcl|Aclame:pro 316 QYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGV--SETQIYPVVG---DAWATNVTGTTN 390 (517) Q Consensus 316 ~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~--~~~gi~~~~~---~~~~~~~~~~~~ 390 (517) +++++++++|++++..+. |....|.+||.++|++++++++|.++|+|+|+++ +..|+.+..+ ............ T Consensus 79 ~k~~~~~~iS~ell~~~~-d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~ 157 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADE-DYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIAN 157 (311) T ss_pred EEEEEeehhhHHHhhccc-ccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccch Confidence 999999999999985322 2334599999999999999999999999998654 2333222211 111111122222 Q ss_pred HHHHHH-HHHH---hhhhhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCce------- Q lcl|Aclame:pro 391 IQELLE-KLSV---ATPKAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEK------- 459 (517) Q Consensus 391 ~d~l~~-~l~~---~~~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~------- 459 (517) .++.+. ++.. ....+..+.|+|||.+|..|++|||++|||||++....+...+++|.+.++.. .++.. T Consensus 158 ~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~-~i~~~~~~~~~~ 236 (311) T protein:vir:99 158 PDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSD-TVNGGDEADPDD 236 (311) T ss_pred hHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeec-cccccccccccc Confidence 222222 2222 22334556799999999999999999999999999888888899997765432 22110 Q ss_pred -----------eeeecCceE-EEeeehe--eehhh--h-----hcccchHHHHHhhhhcceeecccceEEEEeCCCCC Q lcl|Aclame:pro 460 -----------TAVSLSGYV-TNGSRGM--EFEQG--T-----ILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) Q Consensus 460 -----------~~~~~~~~~-~~~~~~~--~~~~d--~-----~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a 516 (517) ..++++.++ +..+.++ +..+. . .+.+|++.+|++.|+|+.|++|+ |+ .++.++| T Consensus 237 ~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~-~v--~~~~~~A 311 (311) T protein:vir:99 237 EDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDR-FV--VIENAVA 311 (311) T ss_pred chhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChh-He--eeecccC Confidence 112333322 2223222 22211 1 16789999999999999999974 44 3566666 No 102 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=99.96 E-value=4.1e-33 Score=198.12 Aligned_cols=288 Identities=9% Similarity=0.012 Sum_probs=199.1 Q ss_pred HHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc-ccceeee------eccccc Q lcl|Aclame:pro 219 EAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL-PTLVVGG------DNALTQ 291 (517) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~------~~~~~~ 291 (517) -...+....+......+... . ....+++++|... +++++.+.+.+++++++++... .+..... ...... T Consensus 1 ~~~~~~~~~~~~~~~~k~~t--~-~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKID--V-PDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred CcccchhhcCChhhhhhhcC--C-cCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc Confidence 01111111222222222211 1 1224577777665 5688899999999999987532 2211111 111123 Q ss_pred ceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc-- Q lcl|Aclame:pro 292 GTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS-- 369 (517) Q Consensus 292 a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~-- 369 (517) ..|.++++..++++++|.++++.+++++..+++|++++.|++.. +.|++||..++++++++.++.++++|||+..+ T Consensus 77 ~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~--~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~ 154 (315) T protein:vir:41 77 RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEG--KAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPL 154 (315) T ss_pred cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhcc--ccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcc Confidence 45666777788899999999999999999999999999998753 34999999999999999999999999996432 Q ss_pred ---ccccccccccccc-ccc---cccccHHHHHHHHHHhhhhhc----CCEEEEcHHHHHHHHHhhcCCCCEeccCCCCC Q lcl|Aclame:pro 370 ---ETQIYPVVGDAWA-TNV---TGTTNIQELLEKLSVATPKAA----DSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSN 438 (517) Q Consensus 370 ---~~gi~~~~~~~~~-~~~---~~~~~~d~l~~~l~~~~~~~~----~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~ 438 (517) ..|++..++.... ... +...+.+.+++.+......|. +++|+||+.++.+++++||++|+|||++.... T Consensus 155 ~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~ 234 (315) T protein:vir:41 155 LRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTG 234 (315) T ss_pred ccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhc Confidence 2577765543221 111 112234556666666555553 56899999999999999999999999999999 Q ss_pred CccceecCccceeccccCC-----ce--eeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEe Q lcl|Aclame:pro 439 QTIATHFGFNRLVQSVAVD-----EK--TAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTY 511 (517) Q Consensus 439 ~~~~~l~g~~~v~~~~~~~-----~~--~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~ 511 (517) +.+.+++|+++...+ .|+ +. ...+++.|+.+.+.++++..+.+..++..+|++..|+|+.+..+++.+..++ T Consensus 235 g~~~tl~G~PV~~~~-~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~ 313 (315) T protein:vir:41 235 ANSILYDGRPVQYVP-ALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATI 313 (315) T ss_pred CCCceecccceEecc-cccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeee Confidence 999999997765433 232 22 2345667777777778777777777888899999999998877777666666 Q ss_pred CC Q lcl|Aclame:pro 512 TP 513 (517) Q Consensus 512 tp 513 (517) +- T Consensus 314 ~v 315 (315) T protein:vir:41 314 TV 315 (315) T ss_pred eC Confidence 55 No 103 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.95 E-value=1.3e-31 Score=189.83 Aligned_cols=288 Identities=9% Similarity=0.023 Sum_probs=206.0 Q ss_pred HHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc-cc--ceeeee----ccccccee Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL-PT--LVVGGD----NALTQGTG 294 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~-~~--~~~~~~----~~~~~a~~ 294 (517) ++.+... .+..+... . ....++|++|... .++++.+.+.+++++++++.+. .+ ..++.- .......| T Consensus 1 ~~~~~~~--~~~~k~it--~-~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~ 74 (314) T protein:vir:41 1 MDFLNKP--FQITPKID--V-PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNT 74 (314) T ss_pred CchhhhH--HHhhcccc--c-ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCccccccccc Confidence 0100000 00111111 1 1223588999876 5788999999999999987643 32 222221 11223445 Q ss_pred eecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCc------ Q lcl|Aclame:pro 295 HTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGV------ 368 (517) Q Consensus 295 ~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~------ 368 (517) .++..+.++++++|+++++.++++...++||+++++|++.. +.|++||.+++++++++.++..++||||+.. T Consensus 75 ~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~--~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~ 152 (314) T protein:vir:41 75 SGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQ--SAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELY 152 (314) T ss_pred ccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhch--hhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccch Confidence 56667778899999999999999999999999999999863 3499999999999999999999999999642 Q ss_pred -cccccccccccccc-ccccccccHHH-HHHHHHHhhhhhc----CCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCcc Q lcl|Aclame:pro 369 -SETQIYPVVGDAWA-TNVTGTTNIQE-LLEKLSVATPKAA----DSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTI 441 (517) Q Consensus 369 -~~~gi~~~~~~~~~-~~~~~~~~~d~-l~~~l~~~~~~~~----~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~ 441 (517) .+.|++..++.... .+.......++ +.+++....+.|. +.+|+||+.++.++++++|.+|+|+|++....+.+ T Consensus 153 ~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~ 232 (314) T protein:vir:41 153 RINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATG 232 (314) T ss_pred hcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCC Confidence 23577765543211 11122223334 4445555545443 45799999999999999999999999999999998 Q ss_pred ceecCccceecccc----CCcee--eeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 442 ATHFGFNRLVQSVA----VDEKT--AVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 442 ~~l~g~~~v~~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) .+++|++++.++.. ++... ..+++.|+.+....+++..+++..+++..|.+..|++..+..+++.++.++-.+= T Consensus 233 ~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 233 LQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSS 312 (314) T ss_pred ceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccC Confidence 99999877655421 22222 2456677777777788878888888999999999999999999999999999999 Q ss_pred CC Q lcl|Aclame:pro 516 AG 517 (517) Q Consensus 516 a~ 517 (517) || T Consensus 313 ~~ 314 (314) T protein:vir:41 313 GG 314 (314) T ss_pred CC Confidence 99 No 104 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=99.87 E-value=1.4e-24 Score=151.43 Aligned_cols=291 Identities=13% Similarity=0.124 Sum_probs=184.2 Q ss_pred hhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhh-cccccccccchhhhhhHHHhHhhhhhhhhceeeeccccce--eee Q lcl|Aclame:pro 209 PEATEFLKTREAEVAYMSASLTKDPKAAWTAELK-ERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV--VGG 285 (517) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~--~~~ 285 (517) .. .+....++. ..... ..+. .....++.+|+++...+++.+.+.+++++.+++.++.... ++. T Consensus 1 ~~-------~k~~~~~l~-----~~~~~--~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~ 66 (321) T protein:vir:31 1 MA-------SRTINNDLS-----RITEK--NALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPT 66 (321) T ss_pred Cc-------hHHHHHHHH-----HHHHh--ccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeee Confidence 00 000000000 00000 0111 1223467889999999999999999999999988876533 332 Q ss_pred ecccccceeee-cc-cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 286 DNALTQGTGHT-TG-TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMG 363 (517) Q Consensus 286 ~~~~~~a~~~~-eg-~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G 363 (517) -.......|.. ++ ...+.++++|+++++..+++...++||+++|.|++. .+.+++||.+.++++++..++..+++| T Consensus 67 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~--~~d~e~~i~~~ia~~~a~~~~~~~~nG 144 (321) T protein:vir:31 67 LNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPE--GEALADRILNLMTDAWSADVEDLAANG 144 (321) T ss_pred eccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhc--chhHHHHHHHHHHHHHHHHHHhheeec Confidence 22223334443 33 334567899999999999999999999999998864 245999999999999999999999999 Q ss_pred cccCccc-----cccccccccc-cccc-ccccccHHHHHHHHHHhhhhhc---CCEEEEcHHHHHHHHH-hhcCCCCEec Q lcl|Aclame:pro 364 GVTGVSE-----TQIYPVVGDA-WATN-VTGTTNIQELLEKLSVATPKAA---DSTLVIHRNDLAAIRF-LKDKNGNYVF 432 (517) Q Consensus 364 ~G~~~~~-----~gi~~~~~~~-~~~~-~~~~~~~d~l~~~l~~~~~~~~---~a~~vmn~~~~~~l~~-lKD~~Gryl~ 432 (517) +|++++. .|++..+... .... .....+.+.+.+++......|+ +.+|+||+.++.++++ |+|.++ ++| T Consensus 145 d~~~~~~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~-~~~ 223 (321) T protein:vir:31 145 DEDAEDSFENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDT-PLG 223 (321) T ss_pred cccCCCcccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCC-ccc Confidence 9987764 4666544322 1111 1223344556665555544443 4589999999988765 666554 789 Q ss_pred cCCCCCCccceecCccceeccccCCceee--eecCceEEEeeeheee--hhhhh-c--ccchHHHHHhhhhcceeecccc Q lcl|Aclame:pro 433 PVGVSNQTIATHFGFNRLVQSVAVDEKTA--VSLSGYVTNGSRGMEF--EQGTI-L--VENNKEYLFEMPISGSLEYKGT 505 (517) Q Consensus 433 ~~~~~~~~~~~l~g~~~v~~~~~~~~~~~--~~~~~~~~~~~~~~~~--~~d~~-~--~~n~~~~~~~~rvgg~v~~~~a 505 (517) ++....+...+++|.+. +.++.||+..+ ..++.++++...++.+ ..+.. . ..+...++....++..|..+++ T Consensus 224 ~~~l~~~~~~tl~G~pv-v~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a 302 (321) T protein:vir:31 224 DNVIMGEADVNPFSFPI-IGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEA 302 (321) T ss_pred cchhhccccccccceeE-EEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEecccc Confidence 88888877778777654 45556776544 4566766555444332 22211 1 1233344455668888999999 Q ss_pred eEEEEe-CCCCCC Q lcl|Aclame:pro 506 TAYGTY-TPPVAG 517 (517) Q Consensus 506 ~~~~~~-tp~~a~ 517 (517) +++.+= .-|+-= T Consensus 303 ~a~~~~i~~~~~~ 315 (321) T protein:vir:31 303 VVLAEGLGDPLEH 315 (321) T ss_pred EEEEecCCcchhc Confidence 999882 211111 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.77 E-value=7.6e-21 Score=130.89 Aligned_cols=260 Identities=13% Similarity=0.117 Sum_probs=179.5 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +. ..........+|.-+.+.+.+.+.....+.+++.... .+ ...+|.....+.+.|+.||+..+.++++++. T Consensus 1 MA--~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:98 1 MA--VGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CC--CccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccce Confidence 10 0011222456788888888887777776666655432 22 2456776667789999999999999999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) +++.+++++..+++|++++.++..| +.+++.+++++.+++++|..++.--... ....++..+ T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d----~~~~~~~~~~~~~a~~~d~~i~~~~~~a--------------~~~~~~~~t 140 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGD----PVGQAAKQIVEAIDHKVDADVLDALSKS--------------TQTVEATAT 140 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHhccc--------------ccccccccC Confidence 9999999999999999998887665 7889999999999999999998532111 011122334 Q ss_pred HHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCC---CEeccCCCCCCccceecCccceeccccCCceeeeecC- Q lcl|Aclame:pro 391 IQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNG---NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLS- 465 (517) Q Consensus 391 ~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~G---ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~- 465 (517) .+.+.+++.... .+.....|+|||.++..|++.+..+. .........++...+++|. +++.+..+++.+++.++ T Consensus 141 ~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~-~Vi~s~~~p~~t~~~~~~ 219 (272) T protein:vir:98 141 VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGV-QIVRSRKCPKGTAYMVRK 219 (272) T ss_pred HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCe-eEEEcCCCCcceEEEEcC Confidence 566776655432 23456789999999999987652221 1122233455666778886 56666778776654433 Q ss_pred -ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 466 -GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 466 -~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+..+.++....+.+..+....+....|.|..|.+|++++..++.|+.=- T Consensus 220 ~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 220 GALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 4444445555555555566667778888999999999999999999876444 No 106 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.77 E-value=7.6e-21 Score=130.89 Aligned_cols=260 Identities=13% Similarity=0.117 Sum_probs=179.5 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +. ..........+|.-+.+.+.+.+.....+.+++.... .+ ...+|.....+.+.|+.||+..+.++++++. T Consensus 1 MA--~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~ 78 (272) T protein:vir:30 1 MA--VGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKK 78 (272) T ss_pred CC--CccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccce Confidence 10 0011222456788888888887777776666655432 22 2456776667789999999999999999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) +++.+++++..+++|++++.++..| +.+++.+++++.+++++|..++.--... ....++..+ T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d----~~~~~~~~~~~~~a~~~d~~i~~~~~~a--------------~~~~~~~~t 140 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGD----PVGQAAKQIVEAIDHKVDADVLDALSKS--------------TQTVEATAT 140 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHhccc--------------ccccccccC Confidence 9999999999999999998887665 7889999999999999999998532111 011122334 Q ss_pred HHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCC---CEeccCCCCCCccceecCccceeccccCCceeeeecC- Q lcl|Aclame:pro 391 IQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNG---NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLS- 465 (517) Q Consensus 391 ~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~G---ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~- 465 (517) .+.+.+++.... .+.....|+|||.++..|++.+..+. .........++...+++|. +++.+..+++.+++.++ T Consensus 141 ~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~-~Vi~s~~~p~~t~~~~~~ 219 (272) T protein:vir:30 141 VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGV-QIVRSRKCPKGTAYMVRK 219 (272) T ss_pred HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCe-eEEEcCCCCcceEEEEcC Confidence 566776655432 23456789999999999987652221 1122233455666778886 56666778776654433 Q ss_pred -ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 466 -GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 466 -~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+..+.++....+.+..+....+....|.|..|.+|++++..++.|+.=- T Consensus 220 ~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 220 GALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred CeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 4444445555555555566667778888999999999999999999876444 No 107 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.63 E-value=3e-19 Score=122.12 Aligned_cols=378 Identities=11% Similarity=0.024 Sum_probs=168.2 Q ss_pred EeeeeeecccCCCc--eEEEEehhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 93 VTFQPVEASEVDGV--AYYKKCILAGGALTPNPSNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKE 170 (517) Q Consensus 93 iGf~~~~~~~~~~~--~~~~~~~l~EvS~v~~pA~~~A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~ 170 (517) .|- +..-.+++ |..+.+.+.|+|+|+||||.+|+|+.++++......+.+...+.++.. .+...++.+ T Consensus 1 ~~n---~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~-------~e~~~~~~~ 70 (410) T protein:vir:83 1 MGN---ATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQ-------MEQAQEVNR 70 (410) T ss_pred CCC---cccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhh-------hHHHHHHHH Confidence 110 00112222 234555677999999999999999999886432222221111111111 111111111 Q ss_pred hHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHH--HHHhhccchhhHHHHhhhhhcccccc Q lcl|Aclame:pro 171 RENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEV--AYMSASLTKDPKAAWTAELKERGISG 248 (517) Q Consensus 171 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (517) .+.+. ......+.......+ ........+.++..++++..-... +..........+.+... ...++.. T Consensus 71 ~~~E~----Rs~~~~i~~~~~~~r----~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~--~~Tgd~~ 140 (410) T protein:vir:83 71 IAFET----RSKGQAVDAAISAMR----GSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADH--QKTGDLQ 140 (410) T ss_pred HHHHH----HHHHHHHHhhhccCc----CCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhcc--Ccccccc Confidence 11111 000000111110000 000111223334444444321000 00000000111111111 1112222 Q ss_pred cccchhhhhhHHHhHhhhhhhhhceeeeccccceeeee--ccccc-cee------eecccccccccccceeeEeeHhhhh Q lcl|Aclame:pro 249 MPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGD--NALTQ-GTG------HTTGTDKTESNITLQTRVLTPQYVY 319 (517) Q Consensus 249 ~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-a~~------~~eg~~~~~~~~~f~~~~~~~~~~~ 319 (517) ...|.+++...++.+.+..++..++..-|.++..+... ..... +.. -.||...+-..++|+..+..+++++ T Consensus 141 ~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyG 220 (410) T protein:vir:83 141 GVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLG 220 (410) T ss_pred cccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhc Confidence 23344466666778888888888777666665544332 11111 111 1356666777788899999999999 Q ss_pred HhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHH Q lcl|Aclame:pro 320 KYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLS 399 (517) Q Consensus 320 ~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~ 399 (517) ++..+|||.|++|... .-+...+-|..+++.+-|++.= ..+...+.... .....+.+.+..++. T Consensus 221 Gyt~LSRQ~IERs~v~----~L~~~lraL~~AYA~atea~vr------a~L~~t~t~~~------a~~~~Tad~~~~~i~ 284 (410) T protein:vir:83 221 GYVNVSRQAIDFSSPS----ALDLVVNGLGQQYAIETEALVG------AALASTSTGAV------GYGNATADNVASAIW 284 (410) T ss_pred CcccccceeeecCChh----hHHHHHHHHHHHHHHHHHHHHH------HHHHHhhhhhh------hhhhccHHHHHHHHH Confidence 9999999999999764 3356777788888877776541 11222222111 111224556666555 Q ss_pred Hhhhhhc----C---CEEEEcHHHHHHHHHhhcCCCCEeccCCCC-------CCccceecCccceeccccCCceeeeecC Q lcl|Aclame:pro 400 VATPKAA----D---STLVIHRNDLAAIRFLKDKNGNYVFPVGVS-------NQTIATHFGFNRLVQSVAVDEKTAVSLS 465 (517) Q Consensus 400 ~~~~~~~----~---a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~-------~~~~~~l~g~~~v~~~~~~~~~~~~~~~ 465 (517) ++..... + ..+.++|..+..+..+- .+++++|....+ .+-.+.++++++ +..+..+..++.+++ T Consensus 285 da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f-~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipV-vm~~~a~AgTA~f~~ 362 (410) T protein:vir:83 285 QAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLF-APVNPTNAHSTGFEAGRFGQGVMGSISGIPV-VMSAALGSGDAYLFS 362 (410) T ss_pred HHHHHHhhhhccceeeeEEechhhhhhcccee-eccCCCCcccccccccccccchhhhhcccce-EEecCCCcCeeeEec Confidence 5433221 2 35889999976654432 223333322211 111222334433 333445555555554 Q ss_pred ceEEEe-eeheeehhhhhcccchHHHHHhhh---hcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 466 GYVTNG-SRGMEFEQGTILVENNKEYLFEMP---ISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 466 ~~~~~~-~~~~~~~~d~~~~~n~~~~~~~~r---vgg~v~~~~a~~~~~~tp~~a~ 517 (517) .-.+-. ..++ .-++++...+.-|.+.+ ++-++..|.+++ ||-| T Consensus 363 ~~Ai~~~eS~~---gp~qL~d~~i~nLt~~ySgY~a~a~~~~~gli------Pv~g 409 (410) T protein:vir:83 363 TAAIECFEQRV---GTLQVVEPSVFGLQVAYAGYFSTLVVNEDAIV------PLVG 409 (410) T ss_pred cceeeeeecCC---ceeEeeCCchhhhhhhheeeeeecccccccee------eecc Confidence 433322 1111 01233344444444444 222222232222 2344 No 108 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=99.61 E-value=2.2e-17 Score=111.87 Aligned_cols=359 Identities=19% Similarity=0.219 Sum_probs=193.9 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHH-----------HHHhhhhhhhHHHHHHHHhhH Q lcl|Aclame:pro 128 AVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGG-----------DNAALKTVSELAANLMKQRES 196 (517) Q Consensus 128 A~I~~vk~~~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~-----------~e~~~~~~~~~~~~~~~~~~~ 196 (517) -++.. ..-++..+.+.++.....++....++.....+ .+.+.+.+.+...++.+.+.. T Consensus 1 mriS~-----------~~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~ 69 (400) T protein:vir:93 1 MRISK-----------RNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENE 69 (400) T ss_pred Ccccc-----------cccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhh Confidence 11111 11111111111111111112122221110000 111122222333333322221 Q ss_pred HH-hhhhhhhhhhhhhhhHHHHHHHHHHHHh----hccchhhHHHHhhhhhcccc----cccccchhhhhhHHHhHhhhh Q lcl|Aclame:pro 197 EK-ILGVEALKVTPEATEFLKTREAEVAYMS----ASLTKDPKAAWTAELKERGI----SGMPAPAGILKRIQDAVNDEG 267 (517) Q Consensus 197 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~----~~~~vp~~i~~~i~~~~~~~~ 267 (517) .. ...... -...+.++.+...+...+.. .....+.++++...+...|+ +....|..++..|.+.+..++ T Consensus 70 LNa~~E~~K--GK~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n 147 (400) T protein:vir:93 70 LNAQEEKPK--GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTN 147 (400) T ss_pred hhhhhhhhh--hhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccC Confidence 11 111111 11223355555555444432 33344789999999998877 345679999999999999999 Q ss_pred hhhhceeeeccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHH Q lcl|Aclame:pro 268 SLLPFIRHENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMN 346 (517) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~ 346 (517) ++++.+++++.+...+.+. .+...|..|..|..+++..++|..-++.+..++....+. ++..+... .-+.|.+||++ T Consensus 148 ~v~~vfHVT~~~~~~V~~s~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~-sYsel~N~i~~ 225 (400) T protein:vir:93 148 PVFKVFHVTNVGALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQM-SYSELYNLIVA 225 (400) T ss_pred cceeeeeeccchhhhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH-HHHHHhhh-hHHHHHHHHHH Confidence 9999988888876544332 334578899999999999999999999888877766663 33444333 23568999999 Q ss_pred HHHHHHH-HHHHhhhhcccccCcc-----cccccccccccccc-cccccccHHHHHHHHHHhhhhhcCCEEEEcHHHH-H Q lcl|Aclame:pro 347 RLPDMVI-MAVNRAIIMGGVTGVS-----ETQIYPVVGDAWAT-NVTGTTNIQELLEKLSVATPKAADSTLVIHRNDL-A 418 (517) Q Consensus 347 ~l~~~~~-~~~e~~~l~G~G~~~~-----~~gi~~~~~~~~~~-~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~-~ 418 (517) +|+++|. +..|.+++-|||++.- .+.+......+... .+..+...|.+-.+..-........-+++...+. + T Consensus 226 ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrrylivktedrka 305 (400) T protein:vir:93 226 ELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKA 305 (400) T ss_pred HHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHH Confidence 9999999 8999999999997641 11222222222222 2223334444443332211111112244444443 3 Q ss_pred HHHHhhcC--CCCEeccCCCCCCccceecCccceeccccCCceeeeec----CceEEEeeeh---e---eehhhhhcccc Q lcl|Aclame:pro 419 AIRFLKDK--NGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSL----SGYVTNGSRG---M---EFEQGTILVEN 486 (517) Q Consensus 419 ~l~~lKD~--~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~----~~~~~~~~~~---~---~~~~d~~~~~n 486 (517) .|+-|+-+ +.+.-...+ .. .+...+.+++..++.. ...++.+... | ...|.|.|.+| T Consensus 306 lldelrqatanahvriknd--da---------eiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktn 374 (400) T protein:vir:93 306 LLDELRQATANAHVRIKND--DA---------EIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTN 374 (400) T ss_pred HHHHHHhhccccceEeecc--hh---------hhhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccC Confidence 34444422 222111111 11 1223344455444432 2344554432 2 23356889999 Q ss_pred hHHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 487 NKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 487 ~~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) +..+++++...|.|.-++|-+..++. T Consensus 375 snmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 375 SNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred CceEEEeecccCcceeeccceeEeeC Confidence 99999999999999999999999988 No 109 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=99.55 E-value=3e-16 Score=105.72 Aligned_cols=357 Identities=19% Similarity=0.237 Sum_probs=193.5 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhh---------hhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHh Q lcl|Aclame:pro 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQE---------LLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQR 194 (517) Q Consensus 124 A~~~A~I~~vk~~~~~~~~~~~~~~~~~~~---------~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~ 194 (517) +|.--.|+. ...+.++++....++. ..+...++.+++. .+.+...++.+.+ T Consensus 1 mnkpdliek-----qnrlaelkennvslksqisgfevknaiedl~K~~ELe~---------------TlSe~~iEI~k~e 60 (393) T protein:vir:16 1 MNKPDLIEK-----QNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEK---------------TLSENSIEIIKIE 60 (393) T ss_pred CCCcchhhh-----hhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHH---------------hHhhcchhhhhhh Confidence 111111110 1111112211111111 1111111222211 1222222222211 Q ss_pred hHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHh----hccchhhHHHHhhhhhcccc----cccccchhhhhhHHHhHhhh Q lcl|Aclame:pro 195 ESEKILGVEALKVTPEATEFLKTREAEVAYMS----ASLTKDPKAAWTAELKERGI----SGMPAPAGILKRIQDAVNDE 266 (517) Q Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~----~~~~vp~~i~~~i~~~~~~~ 266 (517) ...... ....+-...+.++.+...+...+.. .....+.++++...+...|+ +....|..++..|.+.+..+ T Consensus 61 n~LN~~-eE~~KGK~kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~ 139 (393) T protein:vir:16 61 NELNAQ-EEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNT 139 (393) T ss_pred hhhhhh-hhcchhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhcc Confidence 111000 0111111223355555555444432 33344789999999998887 34566999999999999999 Q ss_pred hhhhhceeeeccccceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHH Q lcl|Aclame:pro 267 GSLLPFIRHENLPTLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVM 345 (517) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~ 345 (517) .++++.++++..+...+.+. .+...|.+|..|..+++..++|..-++.+..++....+. ++..+... .-+.|.+||+ T Consensus 140 n~v~~vfHVT~~~~~~V~~s~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~-sYsel~N~i~ 217 (393) T protein:vir:16 140 NPVFKVFHVTNVGALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQM-SYSELYNLIV 217 (393) T ss_pred CcceeeeeeccchhhhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH-HHHHHhhh-hHHHHHHHHH Confidence 99999888888876543332 334578899999999999999999999888777766663 33444333 2356899999 Q ss_pred HHHHHHHH-HHHHhhhhcccccCcc-----cccccccccccccc-cccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHH Q lcl|Aclame:pro 346 NRLPDMVI-MAVNRAIIMGGVTGVS-----ETQIYPVVGDAWAT-NVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLA 418 (517) Q Consensus 346 ~~l~~~~~-~~~e~~~l~G~G~~~~-----~~gi~~~~~~~~~~-~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~ 418 (517) ++|+++|. +..|.+++-|||++.- .+.+......+... .+..+...|.+-.+..-....+...-+++...+.. T Consensus 218 ~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrrylivktedrk 297 (393) T protein:vir:16 218 AELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRK 297 (393) T ss_pred HHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchH Confidence 99999999 8999999999997641 12222222222222 22233344444433322111111122444444433 Q ss_pred -HHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecC----ceEEEeeeh---e---eehhhhhcccch Q lcl|Aclame:pro 419 -AIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLS----GYVTNGSRG---M---EFEQGTILVENN 487 (517) Q Consensus 419 -~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~----~~~~~~~~~---~---~~~~d~~~~~n~ 487 (517) .|+-|+-+..+ ....-...-+.+...+.+++..++..+ .-++.+... | ...|.|.|.+|+ T Consensus 298 alldelrqatan---------anvriknddteiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktns 368 (393) T protein:vir:16 298 ALLDELRQATAN---------ANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNS 368 (393) T ss_pred HHHHHHHhhhcc---------CceeeeccchhhhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccCC Confidence 33444322111 111111112233344555665554432 345554432 2 233568899999 Q ss_pred HHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 488 KEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 488 ~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) ..+++++...|.|.-++|-+..++. T Consensus 369 nmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 369 NMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred ceEEEeecccCcceeeccceeEeeC Confidence 9999999999999999999999988 No 110 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.45 E-value=3.6e-15 Score=99.75 Aligned_cols=297 Identities=9% Similarity=-0.049 Sum_probs=175.4 Q ss_pred HHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeee Q lcl|Aclame:pro 197 EKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE 276 (517) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~ 276 (517) .... -......+ -+. ....+.......+.- ...+...|......+++.+.+.+.++...++. T Consensus 1 ~~~~-~~~~~~~~---------~~~-------~~~~~p~l~m~alTL-aea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~ 62 (330) T protein:vir:94 1 MVRI-CTPPLRGR---------WRT-------LTHQFPELKMPTVTL-AESAKLSQDHLVSGLIETIVEVNPLYEMMPFT 62 (330) T ss_pred Ccee-cCCccccc---------eee-------hhccccccchhhhhh-hHHhhcCchhhHHHHHHhhhccchHHhhcccc Confidence 0000 00000000 000 000000000001110 11234457777888999999999999988876 Q ss_pred cccc--ceeeeecccccceeeecccccccc-cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 277 NLPT--LVVGGDNALTQGTGHTTGTDKTES-NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI 353 (517) Q Consensus 277 ~~~~--~~~~~~~~~~~a~~~~eg~~~~~~-~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~ 353 (517) .+.+ ....+..+...++|...+...+++ ..+|.+++...+.+.+++.+++.+.+.+. .......+-.....+++. T Consensus 63 ~ve~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g--~~~d~~~~q~~~~ieal~ 140 (330) T protein:vir:94 63 EIEGNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRS--DFMDQTSVQVASKAKSIG 140 (330) T ss_pred cccCCcceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcC--CHHHHHHHHHHHHHHHHH Confidence 5543 345566777888998877776654 45899999999999999999998865432 222355666667789999 Q ss_pred HHHHhhhhcccccCcccccccccccccccccc--c-ccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHhhcCCC Q lcl|Aclame:pro 354 MAVNRAIIMGGVTGVSETQIYPVVGDAWATNV--T-GTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFLKDKNG 428 (517) Q Consensus 354 ~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~--~-~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~lKD~~G 428 (517) ++.+.+|||||+++..+.|++......+...+ . +..+.|+ ++.+.... ....++.|+||++...+|+.++...| T Consensus 141 ~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~-LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~ 219 (330) T protein:vir:94 141 RQYQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLTFEL-LDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALG 219 (330) T ss_pred HHHHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCCHHH-HHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhcc Confidence 99999999999876666688876655554433 2 2223333 34444332 23457899999999999999999999 Q ss_pred CEeccCCCCCCccc--eecCccceeccccCCce------------eeeecC-c----eEEEe------eeheeehhhhhc Q lcl|Aclame:pro 429 NYVFPVGVSNQTIA--THFGFNRLVQSVAVDEK------------TAVSLS-G----YVTNG------SRGMEFEQGTIL 483 (517) Q Consensus 429 ryl~~~~~~~~~~~--~l~g~~~v~~~~~~~~~------------~~~~~~-~----~~~~~------~~~~~~~~d~~~ 483 (517) +|-..+...+..+. ..|++.++.+.+.++.. .++.++ + .+.+- .+.++.+... - T Consensus 220 ~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~-~ 298 (330) T protein:vir:94 220 GAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAK-E 298 (330) T ss_pred CCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCc-c Confidence 88765443332222 33444455555444321 112221 1 11111 1111222111 1 Q ss_pred ccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 484 VENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 484 ~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) ++.-..++++.++|.+|..|.|++...=--++ T Consensus 299 ~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 299 NADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 34456789999999999999999877654445 No 111 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=99.39 E-value=7.1e-15 Score=98.17 Aligned_cols=289 Identities=21% Similarity=0.267 Sum_probs=179.0 Q ss_pred hhhHHHHHHHHHHHH----hhccchhhHHHHhhhhhcccc----cccccchhhhhhHHHhHhhhhhhhhceeeeccccce Q lcl|Aclame:pro 211 ATEFLKTREAEVAYM----SASLTKDPKAAWTAELKERGI----SGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV 282 (517) Q Consensus 211 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~----~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~ 282 (517) +..+++...+...+. ......+.++++...+.++|+ .....|..++..|.+.+..++++++.+++++.+... T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~ 80 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhh Confidence 344444444443332 233344788999999988876 345679999999999999999999988888887654 Q ss_pred eeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHH-HHHHhhh Q lcl|Aclame:pro 283 VGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVI-MAVNRAI 360 (517) Q Consensus 283 ~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~-~~~e~~~ 360 (517) +.+. .+...|..|..|..+++..++|..-++.+..++....+. ++..+... .-+.|.+||+++|+++|. +..|.++ T Consensus 81 V~~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~-sYsel~N~i~~ELtQ~~vnk~Vd~Al 158 (318) T protein:vir:86 81 VSRSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQM-SYSELYNLIVAELTQAIVNKIVDLAL 158 (318) T ss_pred hhhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHH-HHHHHhhh-hHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 3322 233778899999999999999999999888777766663 33444433 235689999999999999 8999999 Q ss_pred hcccccCcc-----ccccccccccccccccccccc-HHHHHHHHHHhhhhhcCCEEEEcHHHHHH-HHHhhcC--CCCEe Q lcl|Aclame:pro 361 IMGGVTGVS-----ETQIYPVVGDAWATNVTGTTN-IQELLEKLSVATPKAADSTLVIHRNDLAA-IRFLKDK--NGNYV 431 (517) Q Consensus 361 l~G~G~~~~-----~~gi~~~~~~~~~~~~~~~~~-~d~l~~~l~~~~~~~~~a~~vmn~~~~~~-l~~lKD~--~Gryl 431 (517) +-|||.+.- .+.+......+.....+++++ ...+-.+..-........-+++...+..+ |+.|+-+ +.+.- T Consensus 159 V~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptagrrylivkaedrkalldelrqatanahvr 238 (318) T protein:vir:86 159 VEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVR 238 (318) T ss_pred eeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeE Confidence 999997541 122222222222222223322 22222222111111111124444444333 3444422 22211 Q ss_pred ccCCCCCCccceecCccceeccccCCceeeeec----CceEEEeeeh---e---eehhhhhcccchHHHHHhhhhcceee Q lcl|Aclame:pro 432 FPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSL----SGYVTNGSRG---M---EFEQGTILVENNKEYLFEMPISGSLE 501 (517) Q Consensus 432 ~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~----~~~~~~~~~~---~---~~~~d~~~~~n~~~~~~~~rvgg~v~ 501 (517) ... .-+.+...+.+++..++.. ..-++.+... | ...|.|.|.+|+..+++|+...|.|. T Consensus 239 ikn-----------ddteiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghve 307 (318) T protein:vir:86 239 IKN-----------DDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVE 307 (318) T ss_pred Eec-----------cchhhhhhcCcceeeeeeccccccceeeeccceecchhhhhhhhcceeccCCceEEEeecccCcce Confidence 111 1122333445555555433 2345554432 2 23356889999999999999999999 Q ss_pred cccceEEEEeC Q lcl|Aclame:pro 502 YKGTTAYGTYT 512 (517) Q Consensus 502 ~~~a~~~~~~t 512 (517) -+++-+..++. T Consensus 308 tynagavitvs 318 (318) T protein:vir:86 308 TYNAGAVITVS 318 (318) T ss_pred eecCceeEEeC Confidence 99999999988 No 112 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.30 E-value=1.1e-13 Score=91.73 Aligned_cols=257 Identities=16% Similarity=0.139 Sum_probs=157.5 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----c--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----P--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +... ......+.+|.-+...+.+.+.....+.+++...+. + ...+|.....+.+.++.||.+.+.+.++.++ T Consensus 1 ma~~--~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQ--KTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTT 78 (272) T ss_pred CCCc--ceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCcc Confidence 0000 011123445766666666666655555555544331 1 2446665555677889999999999999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++.++...+++.+....+..| +.+.+.+++++.+++.++..++..-... ....+.... T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~i~~~l~~~--------------~~~~~~~~~ 140 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGD----PIGESNKQLGLSLANKVDDDLLSAAKTT--------------SQTVSTKAN 140 (272) T ss_pred eeEeeehhhccccccHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHhccc--------------ccccccccc Confidence 9999999988888888766655444 6677889999999999999887432110 001122334 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCC--EeccCCCCCCccceecCccceeccccCCceeee----- Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGN--YVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAV----- 462 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gr--yl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~----- 462 (517) .|.+.+++...-. ......++|||.++..|++..+-.-. +.......++.+.+++|. .++.+..+|..... T Consensus 141 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~-~Vv~s~~~p~~~~~~~~~~ 219 (272) T protein:vir:36 141 VDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA-QIVRSKKLAEGSALMFKIV 219 (272) T ss_pred HHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCe-eEEEeCCCCCCceeEEEEE Confidence 5566665544322 22346899999999999764322111 222222334556677775 45666666654432 Q ss_pred e-cCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 463 S-LSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 463 ~-~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) + .+.+.+....++....+.+..+....++...+.|..|.+|++++.++++=- T Consensus 220 ~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 220 SNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ecccceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 1 122222223344444444555555667778889999999999999997654 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.27 E-value=3.4e-13 Score=88.97 Aligned_cols=260 Identities=12% Similarity=0.047 Sum_probs=162.3 Q ss_pred HHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----cc--ceeeeecccccceeeecccccccccccce Q lcl|Aclame:pro 236 AWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----PT--LVVGGDNALTQGTGHTTGTDKTESNITLQ 309 (517) Q Consensus 236 ~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~ 309 (517) +...+ ......+.+|.-+...+.+.+.....+.+++...+. ++ ..+|.....+.+..+.+|+..+...++.+ T Consensus 1 ~~~~~--~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALEN--MTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CCCcc--cchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccc Confidence 00000 011113446776666677777666666666654332 22 34565555567778899999999999999 Q ss_pred eeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccc Q lcl|Aclame:pro 310 TRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTT 389 (517) Q Consensus 310 ~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~ 389 (517) +.++.++..+....++.+....+..| +...+.+++++.+++.++..++.--+++.. .. .+... T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~------------~~-~~~~~ 141 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGD----PKGEAVRQHGLAIANKVDNDVLEALQGATL------------KV-EADIT 141 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cc-ccccc Confidence 99999999888888888766555444 445567789999999999988743221110 00 11223 Q ss_pred cHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEecc-----CCCCCCccceecCccceeccccCCceeeee Q lcl|Aclame:pro 390 NIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFP-----VGVSNQTIATHFGFNRLVQSVAVDEKTAVS 463 (517) Q Consensus 390 ~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~-----~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~ 463 (517) +.|.+.+++..... ......++|||..+..|+++.+ -+++-. +...++.+.+++|. .|+.+..++..+++. T Consensus 142 ~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~~t~~i 218 (275) T protein:vir:96 142 KLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASAT--DNFTRATLLGDNVIVKGAFGEALGA-IIVRSNKIKEGEAIL 218 (275) T ss_pred CHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccc--ccccccccccccceeccccceecCe-eEEEeCCCCcceEEE Confidence 45666666554322 2245689999999999987631 122211 12335566777776 455556677665544 Q ss_pred cC--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 464 LS--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 464 ~~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ++ .+.+....++....+.+..+....+....+.|..|.+|++.+..+++|++=| T Consensus 219 ~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 274 (275) T protein:vir:96 219 AKRGAVKLITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSASGLG 274 (275) T ss_pred EeccceeeeecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEecccccC Confidence 33 2223333444444455555555566777778889999999999999999999 No 114 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.18 E-value=2e-12 Score=84.76 Aligned_cols=342 Identities=8% Similarity=-0.004 Sum_probs=159.4 Q ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHH Q lcl|Aclame:pro 137 KKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLK 216 (517) Q Consensus 137 ~~~~~~~~~~~~~~~~~~~e~~~~~~e~~a~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (517) ++. .+++..+.. ..+.+.++..++..+.++.+....... .......+..+. T Consensus 1 ~~~----------~~~~~~~~~----------------~~~~~~~e~k~lr~~me~~et~~e~~~-~~~~~~~~e~el-- 51 (393) T protein:vir:79 1 MEN----------WLKQLKESG----------------FTETQVQEQKSLRTRMERGETLAEADA-NKLALNEEETQI-- 51 (393) T ss_pred Cch----------HHHHHHhcc----------------CchhHHHHHHHHHHHhhhhhhhhhhhh-hhhhcchhHHHH-- Confidence 111 111111110 000000111111111111110000000 000000000000 Q ss_pred HHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccce-e-eeeccccccee Q lcl|Aclame:pro 217 TREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV-V-GGDNALTQGTG 294 (517) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~a~~ 294 (517) -..+..+..+......-...+.+.+ +.....+|.-+...+.+..+......+++....+..+. + -...+..++.- T Consensus 52 --~E~f~Kmm~G~~p~~eV~~~e~mtt-~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~ 128 (393) T protein:vir:79 52 --LESFAKMMEGETPTNEVNLREFMAT-PSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYD 128 (393) T ss_pred --HHHHHHHhcCCCchhheehhhhhcC-CCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecc Confidence 0001111111111111111111221 22345678888777776555444444443333221111 0 11122456677 Q ss_pred eecccccccc---cccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc-- Q lcl|Aclame:pro 295 HTTGTDKTES---NITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS-- 369 (517) Q Consensus 295 ~~eg~~~~~~---~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~-- 369 (517) ++||.+.|+. ..+++.+++..++++..+.+|+|+++||.+| |-+|..+.+.+++++..+...+++.-+... T Consensus 129 IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~D----vin~~l~aA~RaMaRkKee~a~n~fk~~ghtv 204 (393) T protein:vir:79 129 VAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWD----LMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTV 204 (393) T ss_pred ccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHH----HHHHHHHHHHHHHHhhhHHHHHhhhhccccee Confidence 8888887764 3679999999999999999999999999998 778888899999999999999998776544 Q ss_pred ccccccccccccc------ccccccccHHHHHHHHHHhh-hhhcCCEEEEcHHHHHHHHHhhcCCCCEecc---CCCCCC Q lcl|Aclame:pro 370 ETQIYPVVGDAWA------TNVTGTTNIQELLEKLSVAT-PKAADSTLVIHRNDLAAIRFLKDKNGNYVFP---VGVSNQ 439 (517) Q Consensus 370 ~~gi~~~~~~~~~------~~~~~~~~~d~l~~~l~~~~-~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~---~~~~~~ 439 (517) .-+ +++.+..-. ....++-..+++++.+.+.. ..|.+++|+|||-.|+.+.|=--=.+-|.-+ .+...- T Consensus 205 fDa-~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~ 283 (393) T protein:vir:79 205 FDN-YSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGA 283 (393) T ss_pred eec-cccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCcccc Confidence 223 222221111 12345667788988877654 5678899999999999987632111212111 000000 Q ss_pred ccceecC----------ccceeccccCCceeeeecCceEEEe-----------eeheeehhhhhcccchHHHHHhhhhcc Q lcl|Aclame:pro 440 TIATHFG----------FNRLVQSVAVDEKTAVSLSGYVTNG-----------SRGMEFEQGTILVENNKEYLFEMPISG 498 (517) Q Consensus 440 ~~~~l~g----------~~~v~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~d~~~~~n~~~~~~~~rvgg 498 (517) ....-+| -..+.+++.++-.....--+|..++ ++....++|. ..+.+.+-...|-|- T Consensus 284 ~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk--~rdiq~iKl~ERYG~ 361 (393) T protein:vir:79 284 PSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK--ARGLQNIKMIERYGI 361 (393) T ss_pred chhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc--cccceeeeeeeeece Confidence 0001111 1234444444422111000222222 2222333333 233445566777777 Q ss_pred eeecccceEE----EEeCCCCCC Q lcl|Aclame:pro 499 SLEYKGTTAY----GTYTPPVAG 517 (517) Q Consensus 499 ~v~~~~a~~~----~~~tp~~a~ 517 (517) .|.+....++ .+++-.++- T Consensus 362 gvLn~gkaiavakNI~~~k~y~~ 384 (393) T protein:vir:79 362 GILNEGKAIAVAKNISMDKSYAE 384 (393) T ss_pred eeeeCCceEEEEecceeeccccc Confidence 6776554332 233333333 No 115 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.17 E-value=2.6e-12 Score=84.11 Aligned_cols=259 Identities=11% Similarity=0.027 Sum_probs=159.4 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +..... ......+|.-+...+.+.+.....+.+++...+ .+ ...+|.....+.+.++.+|+..+.+.+++++ T Consensus 1 ma~~~T--~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~ 78 (274) T protein:vir:93 1 MPQGIT--KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccce--ehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccce Confidence 111001 111345677666666666665555555554432 12 2345665545678888999999999999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++..+....++.+....+..| +.+.+.+++++.+++.++..++..-.++.. .. .+.... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~------------~~-~~~~~~ 141 (274) T protein:vir:93 79 REAKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAKL------------TV-NADITK 141 (274) T ss_pred eEEEeeeecccccccHHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------cc-cccccC Confidence 9999998887888888776666544 566788899999999999998864322211 00 111223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccC-----CCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPV-----GVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~-----~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. ......++|||.++..|++ |..-+++-.. ...++...+++|. +|+.+..+|..+++.+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:93 142 LNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGA-IIVRTNKLEAGTAILA 218 (274) T ss_pred HHHHHHHHHHhhhccCCccEEEeCHHHHHHHHh--hhhhcccccccccccceeecccceecCe-eEEEcCCCCcceEEEE Confidence 5566665544322 2245689999999999975 3322232111 1234556677776 4555666776655443 Q ss_pred C--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 465 S--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 465 ~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) + .+.+.....+....+++..+....+....+.|.++.+|++++..++.++==- T Consensus 219 ~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~ 273 (274) T protein:vir:93 219 KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) T ss_pred eCCeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCccccC Confidence 3 3333334444455555555666677788889999999999988885433111 No 116 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.17 E-value=2.9e-12 Score=83.82 Aligned_cols=258 Identities=13% Similarity=0.052 Sum_probs=160.7 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +... ......+.+|.-+...+.+.+.....+.+++...+ .+ ...+|.....+.+..+.||.+.+...+++++ T Consensus 1 Ma~~--~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~ 78 (276) T protein:vir:10 1 MAQG--TTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNR 78 (276) T ss_pred CCcc--eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccce Confidence 0000 11112345677666667777766666666665432 12 2456665555677889999999999999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.+++.+....++.+....+..| ....+.++++..+++.++..++.=- .+ .+ ... .....+ T Consensus 79 ~~a~i~~~~k~~~~tD~a~~~~~~d----p~~~~~~~~~~~~a~~~d~~~~~~l-~~---------~~--~~~-~~~~~t 141 (276) T protein:vir:10 79 REAKIHKIGKGTDITDEALLSGYGD----PQGEAVRQHGLAIANKVDNDVLEAL-RG---------TK--LTV-SADIGT 141 (276) T ss_pred eeEEeehccccccccHHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHH-hc---------cc--ccc-cccccC Confidence 9999999998889988877766555 4566778899999999998877310 00 00 000 111223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccC-----CCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPV-----GVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~-----~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. .....+++|||.++..|+++.+.+ ++-.. ...++...+++|. .|+.++.++..+.+.+ T Consensus 142 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~~t~~l~ 218 (276) T protein:vir:10 142 LAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGA-VIVRSKKLDEGEAILA 218 (276) T ss_pred HHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecce-eEEEcCCCCcceEEEE Confidence 4555555444322 234568999999999999764322 22211 1234556677775 5555666777666554 Q ss_pred CceEE--EeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeC----CCCC Q lcl|Aclame:pro 465 SGYVT--NGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT----PPVA 516 (517) Q Consensus 465 ~~~~~--~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~t----p~~a 516 (517) +...+ ....++....+.+..+....+....+.|..+.+|++++.+++- |..| T Consensus 219 ~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 219 KRGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDSGA 276 (276) T ss_pred eccceeeeecCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcCCcCCC Confidence 43332 2334445555666666666677778888899999999998862 2333 No 117 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.16 E-value=3.5e-12 Score=83.41 Aligned_cols=257 Identities=14% Similarity=0.092 Sum_probs=152.3 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----cc--ceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----PT--LVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +.... .......+|.-+...+.+.+.....+.+++..... ++ ..+|.....+.+.++.+|...+..++++++ T Consensus 1 Ma~~~--T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~ 78 (278) T protein:vir:80 1 MADLT--TKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETES 78 (278) T ss_pred CCCcc--eehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccce Confidence 10000 11123456777777777666665555555443321 12 345655545667788999999988999999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc-ccCcccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG-VTGVSETQIYPVVGDAWATNVTGTT 389 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~-G~~~~~~gi~~~~~~~~~~~~~~~~ 389 (517) .++.++..+....++.+....+..| +.+.+.+++++.+++.++..+++.- |..... + ...+ .. T Consensus 79 ~~~~i~~~~~a~~v~D~~~~~~~~d----~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~-------~--~~~t---~~ 142 (278) T protein:vir:80 79 VKHGIKKAGKGVKLTDESVLSGYGD----PVEEAQKQIRMAIASKVDNDILEEALTTTLEV-------K--GAIN---IG 142 (278) T ss_pred eeEeeehhhccccccHHHHhhcccc----HHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------c--cccc---cc Confidence 9999988888888888766666555 5677888999999999999887542 111100 0 0000 11 Q ss_pred cHHHHHHHHHHhhhh-----h-cCCEEEEcHHHHHHHHHhhcCCCCEe-----ccCCCCCCccceecCccceeccccCCc Q lcl|Aclame:pro 390 NIQELLEKLSVATPK-----A-ADSTLVIHRNDLAAIRFLKDKNGNYV-----FPVGVSNQTIATHFGFNRLVQSVAVDE 458 (517) Q Consensus 390 ~~d~l~~~l~~~~~~-----~-~~a~~vmn~~~~~~l~~lKD~~Gryl-----~~~~~~~~~~~~l~g~~~v~~~~~~~~ 458 (517) ..+...+.+.++... . ....++|||..+..|++... -+|+ -.+...++.+.+++|+ .|+.+..+|. T Consensus 143 ~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~--~~~~~~~~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~ 219 (278) T protein:vir:80 143 LIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAA--GSWTKASQLGDDLLVKGAFGELLGW-EIVRTKKLAD 219 (278) T ss_pred hhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhh--hhccccccccccceeeccceeecce-eEEEcCCCCc Confidence 112222223222211 1 23468899999999886532 2222 1222345667777775 5555666776 Q ss_pred eeeeec--CceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 459 KTAVSL--SGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 459 ~~~~~~--~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ...+.+ +.+.+..........+.+..+....+....+.|..|.+|++++..+. +|| T Consensus 220 ~t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~---~a~ 277 (278) T protein:vir:80 220 GNALAVKAGALKTFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVP---VAG 277 (278) T ss_pred ceEEEEeccceeeeecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEee---ccC Confidence 554433 23333333444444444444555566677888999999999999984 556 No 118 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.07 E-value=1.1e-11 Score=80.56 Aligned_cols=258 Identities=14% Similarity=0.128 Sum_probs=153.7 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----c--cceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----P--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +.......+.+|.-+...+.+.+.....+.+++...+. + .+.+|.....+.+..+.||++.+...+++++.... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 11111123346766666666666666666666554322 2 23466655556677788999999889999999999 Q ss_pred HhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccHHHH Q lcl|Aclame:pro 315 PQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQEL 394 (517) Q Consensus 315 ~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l 394 (517) ++..+....++.+....+.-| ....+.++++..++++++..++.= ..+ +. . ..+...+.+.+ T Consensus 81 i~~~gk~~~itD~a~~~~~~d----p~~~~~~q~a~~~a~~~d~~li~~------l~~----a~--~--~~~~~~t~~~~ 142 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNVNG----TLQEASRQLAMSLADKVEIDYIAE------LNK----SK--Q--TATVSADATGI 142 (270) T ss_pred eehhhCcceecHHHHhhhccc----hHHHHHHHHHHHHHHHHHHHHHHH------hcc----cc--c--ccccccCHHHH Confidence 999888888776654444334 244567789999999998887621 111 00 0 01122334556 Q ss_pred HHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCC-CCEeccCCCCCCccceecCccceeccccCCceeeeecC--ceEEE Q lcl|Aclame:pro 395 LEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKN-GNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLS--GYVTN 470 (517) Q Consensus 395 ~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~-Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~--~~~~~ 470 (517) .+++...-. .....+++|||.++..|++...-. .++- +....++...+++|..+++.+-..++...+.+. ...+. T Consensus 143 ~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~-~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~ 221 (270) T protein:vir:95 143 LDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQ-DRAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIV 221 (270) T ss_pred HHHHHHhccccCCCcEEEEcHHHHHHHHhhhcccccccc-cchhcccccceecceeEEEeCCCCCceeEEEEeccceeee Confidence 665544322 233568999999999998642111 1111 112334567777786655544333333333222 22222 Q ss_pred eeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 471 GSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 471 ~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ...++....+++..+....+....+.+..+.+|.+++..++.|+..= T Consensus 222 ~~~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~ 268 (270) T protein:vir:95 222 NKKKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSL 268 (270) T ss_pred ecCCceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCCCCc Confidence 23344444555556666667777888899999999999999877555 No 119 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.03 E-value=2.1e-11 Score=79.07 Aligned_cols=258 Identities=12% Similarity=0.073 Sum_probs=154.3 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----c--cceeeeecccccceeeecc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----P--TLVVGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~a~~~~eg 298 (517) |... . .....+.+|.-+...+.+.+.....+.+++...+. + ...+|.....+.+..+.+| T Consensus 1 ma~~------------~--T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g 66 (274) T protein:vir:96 1 MAQG------------T--TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG 66 (274) T ss_pred CCcc------------c--cchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC Confidence 1000 0 01113445666666666665555544555443221 1 2345554444567778889 Q ss_pred cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccc Q lcl|Aclame:pro 299 TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVG 378 (517) Q Consensus 299 ~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~ 378 (517) ...+.++++++..++.++.++....++......+..| +.+.+.+++++.+++.++..+++--..+. . T Consensus 67 ~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~~d~~i~~~l~~a~-~-------- 133 (274) T protein:vir:96 67 EKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGD----PQGEAVRQHGLAIANKVDNDVLEALKGAT-L-------- 133 (274) T ss_pred CcCchhhcccceeEEEEEeeeceeeecHHHHHhhcch----HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-C-------- Confidence 9999889999999998888777777777665555444 56677889999999999998875321111 0 Q ss_pred ccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCC-----CCCCccceecCccceec Q lcl|Aclame:pro 379 DAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVG-----VSNQTIATHFGFNRLVQ 452 (517) Q Consensus 379 ~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~-----~~~~~~~~l~g~~~v~~ 452 (517) . ......+.+.+.++....-. ......++|||..+..|+++. ..+|+-... ...+...+++|. .|+. T Consensus 134 ---~-~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~--~~~f~~~~~~g~~~~~~g~ig~~~G~-~Vi~ 206 (274) T protein:vir:96 134 ---T-VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSA--SDNFTRPTQLGDNIIVKGAFGEALGA-VIVR 206 (274) T ss_pred ---C-cCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcc--cccccccccccccceeecccceecCe-eEEE Confidence 0 01122235666665554321 234568999999999998753 123332111 224456677776 4566 Q ss_pred cccCCceeeeecC--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC--CC Q lcl|Aclame:pro 453 SVAVDEKTAVSLS--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP--VA 516 (517) Q Consensus 453 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~--~a 516 (517) +..+|...++.++ .+.+....+.....+.+..+....+....+.|..+.+|++++..+...+ |- T Consensus 207 s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 207 SNKLNKGEALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred cCCCCcceEEEEeCcceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 6677766654443 3333333444444455555556667777888999999999998875444 22 No 120 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.97 E-value=4.7e-11 Score=77.20 Aligned_cols=274 Identities=10% Similarity=0.053 Sum_probs=150.3 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccce--eeeecccccceeeec----- Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV--VGGDNALTQGTGHTT----- 297 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~a~~~~e----- 297 (517) |-. -.+... ....+..+...+++.+...+.++...++..+.+.. ..+..+...+..... T Consensus 1 mpa-----------ltLaea---~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~ 66 (310) T protein:vir:97 1 MAS-----------VTLAES---AKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFS 66 (310) T ss_pred Ccc-----------cchHHH---hhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccccccc Confidence 000 000111 12234556677888888888888888877665532 222233222222211 Q ss_pred ccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccc Q lcl|Aclame:pro 298 GTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVV 377 (517) Q Consensus 298 g~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~ 377 (517) ....+++..+|++++...+.+++.+.+.+.+..-..=+...++ .+-.+...+++.++.+..|||||.++.++.|++... T Consensus 67 ~~g~~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~-~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~ 145 (310) T protein:vir:97 67 GAGAGKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQT-AVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLC 145 (310) T ss_pred CCCccccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHH-HHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcC Confidence 2233467788999999999999999987654332211111122 222334568999999999999999877767888887 Q ss_pred cccccccc--cccccHHHHHHHHHHhh--hhhcCCEEEEcHHHHHHHHHh-hcCCCCEeccCCC-CCCccceecCcccee Q lcl|Aclame:pro 378 GDAWATNV--TGTTNIQELLEKLSVAT--PKAADSTLVIHRNDLAAIRFL-KDKNGNYVFPVGV-SNQTIATHFGFNRLV 451 (517) Q Consensus 378 ~~~~~~~~--~~~~~~d~l~~~l~~~~--~~~~~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~-~~~~~~~l~g~~~v~ 451 (517) ...+.+.. .+....-+.++.+.... ....++.|+|||++..+|+.+ +...++.++++.. ..|..-..|++.++. T Consensus 146 ~~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~ 225 (310) T protein:vir:97 146 ASGQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIF 225 (310) T ss_pred CccceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEE Confidence 66555533 22222223444444433 234578999999998887654 3445555555432 233333345545555 Q ss_pred ccccCCce------------eeeecCc----eEEEe-------eeheeehhhhhcccchHHHHHhhhhcceeecccceEE Q lcl|Aclame:pro 452 QSVAVDEK------------TAVSLSG----YVTNG-------SRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAY 508 (517) Q Consensus 452 ~~~~~~~~------------~~~~~~~----~~~~~-------~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~ 508 (517) +.+.++.. .++-++. +-+.+ -+.++.+...+ .+.-..++++.++|.+|..|.|++. T Consensus 226 ~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~-~~~v~~~~V~~Y~~~av~~~~A~a~ 304 (310) T protein:vir:97 226 RNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESE-DSDEHIWRVKWYCGLALFSEKGLAC 304 (310) T ss_pred EeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCccc-CCcceeEEEEEeeeEEEecccceee Confidence 44444321 1122221 11111 11222221111 3444567888999999999999886 Q ss_pred EEeCCC Q lcl|Aclame:pro 509 GTYTPP 514 (517) Q Consensus 509 ~~~tp~ 514 (517) ..=--- T Consensus 305 L~~V~~ 310 (310) T protein:vir:97 305 ADGITN 310 (310) T ss_pred eccccC Confidence 653222 No 121 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=98.97 E-value=2.3e-11 Score=78.87 Aligned_cols=292 Identities=20% Similarity=0.231 Sum_probs=175.3 Q ss_pred hhhhhhhhhHHHHHHHHHHHHhhccc-hhhHHHHhhhhhcccc----cccccchhhhhhHHHhHhhhhhhhhceeeeccc Q lcl|Aclame:pro 205 LKVTPEATEFLKTREAEVAYMSASLT-KDPKAAWTAELKERGI----SGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP 279 (517) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~ 279 (517) ..... +.......+++-...+.. .+.+.++...+..+++ +++..|..++..|...+....|+...+.+++++ T Consensus 1 mtnfi---esqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvg 77 (318) T protein:vir:94 1 MTNFI---ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 77 (318) T ss_pred Cccch---hhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhh Confidence 11111 112222334444444333 3567777777776654 456779999999999999999999999999998 Q ss_pred cceeeee-cccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH-HH Q lcl|Aclame:pro 280 TLVVGGD-NALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA-VN 357 (517) Q Consensus 280 ~~~~~~~-~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~-~e 357 (517) ++.+.+. .+.+.+..+..|..+++...++..-++.|-.++.+..+...+-...+ .-+.|.+.|..+|.+++..+ .+ T Consensus 78 allvsrsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqm--syselynlivaeltqaivnkivd 155 (318) T protein:vir:94 78 ALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQM--SYSELYNLIVAELTQAIVNKIVD 155 (318) T ss_pred heeeeccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHhhhhh Confidence 8776654 45677888999999999988888888999888888777665433222 23568889999999988866 57 Q ss_pred hhhhcccccCcccccccccccc-------c-ccccccccccHHHHHHHHHHhhhhhcCCEEEEcHHHHHH-HHHhhcCCC Q lcl|Aclame:pro 358 RAIIMGGVTGVSETQIYPVVGD-------A-WATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAA-IRFLKDKNG 428 (517) Q Consensus 358 ~~~l~G~G~~~~~~gi~~~~~~-------~-~~~~~~~~~~~d~l~~~l~~~~~~~~~a~~vmn~~~~~~-l~~lKD~~G 428 (517) .+++-|||++.- ..+....+ + .+..+..+...|.+-.+..-........-.++...+..+ |+-|+-+.. T Consensus 156 lalvegdgtngf--ksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqata 233 (318) T protein:vir:94 156 LALVEGDGTNGF--KSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATA 233 (318) T ss_pred eeeeecCCcchh--hhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhc Confidence 788889987531 21111111 1 111222233344443332221111111224444444333 344432211 Q ss_pred CEeccCCCCCCccceecCccceeccccCCceeeeecC----ceEEEeee---he---eehhhhhcccchHHHHHhhhhcc Q lcl|Aclame:pro 429 NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLS----GYVTNGSR---GM---EFEQGTILVENNKEYLFEMPISG 498 (517) Q Consensus 429 ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~----~~~~~~~~---~~---~~~~d~~~~~n~~~~~~~~rvgg 498 (517) + ....-...-+.+...+.+++..++..+ .-++.+.. +| ...|.|.|.+|+..+++|+...| T Consensus 234 n---------anvriknddteiasevgvdeiivytgskavkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsg 304 (318) T protein:vir:94 234 N---------ANVRIKNDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG 304 (318) T ss_pred c---------cceEEeccchhhhhhcCcceeEEeeccccccceeEeccceecchhhhhhhhceeeccCCceEEEEecccC Confidence 1 111111112223344455555554433 23444433 22 23356889999999999999999 Q ss_pred eeecccceEEEEeC Q lcl|Aclame:pro 499 SLEYKGTTAYGTYT 512 (517) Q Consensus 499 ~v~~~~a~~~~~~t 512 (517) .|.-++|-+..++. T Consensus 305 hvetynagavitvs 318 (318) T protein:vir:94 305 HVETYNAGAVITVS 318 (318) T ss_pred cceeecCceeEEeC Confidence 99999999999988 No 122 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.90 E-value=1.7e-10 Score=74.14 Aligned_cols=259 Identities=11% Similarity=0.023 Sum_probs=152.2 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----cc--ceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----PT--LVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +..... ....+.+|.-+...+.+.+.....+.+++...+. ++ ..+|.....+.+..+.+|+..+...++.++ T Consensus 1 ma~~~T--~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:97 1 MPQGLT--KTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccce--ehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccce Confidence 000000 1113456766666666555555444555544321 22 345554544667778899999888899999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++..+....++......+.-| +.+.+.+++++.+++.++..++.--.++.. .. .+...+ T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d----p~~~~~~~~a~a~a~~vd~~~~~~l~~a~~------------~~-~~~~~~ 141 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAKL------------TV-NADITK 141 (274) T ss_pred eEEEeeeecceecccHHHHHhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhccCc------------cc-cccccC Confidence 9999888876677766655544433 456678899999999999998753221110 00 111223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccC-----CCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPV-----GVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~-----~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. ......++|||..+..|++ |..-+|+-.. ...++.+.+++|. .|+.+..+|....+.+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:97 142 LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGA-IIVRTNKLEAGTAILA 218 (274) T ss_pred HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCe-eEEEcCCCCcceEEEE Confidence 5666666544322 2244679999999999875 3322333211 1234556677776 5555666776655443 Q ss_pred C--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 465 S--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 465 ~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) + .+.+....++....+.+..+....+....+.|.++.+|++++.++++.+=-- T Consensus 219 ~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:97 219 KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) T ss_pred eCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCccccc Confidence 3 2222233444444455555555666777888999999999998885443111 No 123 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.90 E-value=1.7e-10 Score=74.14 Aligned_cols=259 Identities=11% Similarity=0.023 Sum_probs=152.2 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc----cc--ceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL----PT--LVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +..... ....+.+|.-+...+.+.+.....+.+++...+. ++ ..+|.....+.+..+.+|+..+...++.++ T Consensus 1 ma~~~T--~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:94 1 MPQGLT--KTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccce--ehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCcccccccccce Confidence 000000 1113456766666666555555444555544321 22 345554544667778899999888899999 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++..+....++......+.-| +.+.+.+++++.+++.++..++.--.++.. .. .+...+ T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d----p~~~~~~~~a~a~a~~vd~~~~~~l~~a~~------------~~-~~~~~~ 141 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAKL------------TV-NADITK 141 (274) T ss_pred eEEEeeeecceecccHHHHHhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhccCc------------cc-cccccC Confidence 9999888876677766655544433 456678899999999999998753221110 00 111223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccC-----CCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPV-----GVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~-----~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. ......++|||..+..|++ |..-+|+-.. ...++.+.+++|. .|+.+..+|....+.+ T Consensus 142 ~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k--~~~~~f~~~s~~g~~~~~~G~ig~~~G~-~Vi~s~~~p~~t~~l~ 218 (274) T protein:vir:94 142 LNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRG--DASTNFTRATELGDDIIVKGAFGEALGA-IIVRTNKLEAGTAILA 218 (274) T ss_pred HHHHHHHHHHhhccCCCceEEEeCHHHHHHHHh--hhhhhccccCcccccceeccccceecCe-eEEEcCCCCcceEEEE Confidence 5666666544322 2244679999999999875 3322333211 1234556677776 5555666776655443 Q ss_pred C--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 465 S--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 465 ~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) + .+.+....++....+.+..+....+....+.|.++.+|++++.++++.+=-- T Consensus 219 ~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:94 219 KKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) T ss_pred eCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCccccc Confidence 3 2222233444444455555555666777888999999999998885443111 No 124 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=98.76 E-value=7.9e-10 Score=70.50 Aligned_cols=256 Identities=12% Similarity=0.042 Sum_probs=150.2 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +..... ....+.+|.-+...+.+.+.....+.+++...+ .+ ...+|.....+.+..+.+|...+...++..+ T Consensus 1 m~~~~T--~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:96 1 MAQGMT--KLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccee--ehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccce Confidence 111000 112344676666566655555555555544332 12 2345554544667778889888888888899 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++..+....++.+....+.-| +.+.+.+++++.+++.++..++.--.++.. ... +...+ T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~------------~~~-~~~~~ 141 (274) T protein:vir:96 79 REAKIRKIAKGTSISDEALLSGYGD----PQGEQVRQHGLAHANKVDDDVLEALKSAKL------------TVE-ADITK 141 (274) T ss_pred eEEEeeeeecceeehHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ccc-ccccC Confidence 8888888777777776544433333 556678899999999999988742221111 000 11223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEecc-----CCCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFP-----VGVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~-----~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. ......++|||..+..|++. ..-+|+-. ....++.+.+++|.. |+.+..++..+.+.+ T Consensus 142 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:96 142 LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAV-IVRSNKLEAGTAILA 218 (274) T ss_pred HHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeE-EEEeCCCCCceEEEE Confidence 5556665544321 22446799999999998753 22223221 122355566777754 555566766555433 Q ss_pred C--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 465 S--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 465 ~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) + .+.+.....+....+.+..+....+....+.|..+.+|++.++++ ..+| T Consensus 219 ~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~ 270 (274) T protein:vir:96 219 KKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT---KGSG 270 (274) T ss_pred eccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE---cCCc Confidence 3 233333344444555555555566677788899999999999988 4566 No 125 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=98.76 E-value=7.9e-10 Score=70.50 Aligned_cols=256 Identities=12% Similarity=0.042 Sum_probs=150.2 Q ss_pred HhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeeccccccccccccee Q lcl|Aclame:pro 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) Q Consensus 237 ~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~ 310 (517) +..... ....+.+|.-+...+.+.+.....+.+++...+ .+ ...+|.....+.+..+.+|...+...++..+ T Consensus 1 m~~~~T--~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~ 78 (274) T protein:vir:95 1 MAQGMT--KLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKK 78 (274) T ss_pred CCccee--ehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccce Confidence 111000 112344676666566655555555555544332 12 2345554544667778889888888888899 Q ss_pred eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccccccccc Q lcl|Aclame:pro 311 RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 311 ~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) .++.++..+....++.+....+.-| +.+.+.+++++.+++.++..++.--.++.. ... +...+ T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d----~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~------------~~~-~~~~~ 141 (274) T protein:vir:95 79 REAKIRKIAKGTSISDEALLSGYGD----PQGEQVRQHGLAHANKVDDDVLEALKSAKL------------TVE-ADITK 141 (274) T ss_pred eEEEeeeeecceeehHHHHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ccc-ccccC Confidence 8888888777777776544433333 556678899999999999988742221111 000 11223 Q ss_pred HHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEecc-----CCCCCCccceecCccceeccccCCceeeeec Q lcl|Aclame:pro 391 IQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFP-----VGVSNQTIATHFGFNRLVQSVAVDEKTAVSL 464 (517) Q Consensus 391 ~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~-----~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~ 464 (517) .+.+.+++...-. ......++|||..+..|++. ..-+|+-. ....++.+.+++|.. |+.+..++..+.+.+ T Consensus 142 ~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~~~~t~~l~ 218 (274) T protein:vir:95 142 LTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAV-IVRSNKLEAGTAILA 218 (274) T ss_pred HHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeE-EEEeCCCCCceEEEE Confidence 5556665544321 22446799999999998753 22223221 122355566777754 555566766555433 Q ss_pred C--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 465 S--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 465 ~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) + .+.+.....+....+.+..+....+....+.|..+.+|++.++++ ..+| T Consensus 219 ~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t---k~~~ 270 (274) T protein:vir:95 219 KKGAVKLITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT---KGSG 270 (274) T ss_pred eccceeeeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE---cCCc Confidence 3 233333344444555555555566677788899999999999988 4566 No 126 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=98.74 E-value=1.1e-09 Score=69.60 Aligned_cols=259 Identities=12% Similarity=0.016 Sum_probs=148.7 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----ccc--ceeeeecccccceeeecc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LPT--LVVGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~~--~~~~~~~~~~~a~~~~eg 298 (517) |. .... ....+.+|.-+...+.+.+.....+.+++.... .++ ..+|.....+.+..+.+| T Consensus 1 ma------------~~~T--~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:12 1 MA------------QGLT--KTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CC------------ccee--ehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 00 0000 111334566555555555554444445544422 122 345554444567778889 Q ss_pred cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccc Q lcl|Aclame:pro 299 TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVG 378 (517) Q Consensus 299 ~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~ 378 (517) +..+...++..+..+.++..+....++......+.-| +.+.+.+++++.+++.++..++.--.++.. T Consensus 67 ~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d----~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~--------- 133 (274) T protein:vir:12 67 EKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGD----PQGEQVRQHGLAHANKVDNDVLEALMGAKL--------- 133 (274) T ss_pred CccchhhcccceeeEEeeeecceeeecHHHHHhcccc----hHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------- Confidence 9888888899998888888777777776554444333 455677899999999999988753221111 Q ss_pred ccccccccccccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEecc-----CCCCCCccceecCccceec Q lcl|Aclame:pro 379 DAWATNVTGTTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFP-----VGVSNQTIATHFGFNRLVQ 452 (517) Q Consensus 379 ~~~~~~~~~~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~-----~~~~~~~~~~l~g~~~v~~ 452 (517) .. .....+.+.+.+++...-. ......++|||..+..|++. ..-+|+-. +...++.+.+++|. .|+. T Consensus 134 ---~~-~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~--~~~~fv~~s~~g~~~~~~G~ig~~~G~-~Vi~ 206 (274) T protein:vir:12 134 ---TV-NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGA-IIVR 206 (274) T ss_pred ---cc-cccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhh--hhhhccccccccccceecccceeecCe-eEEE Confidence 00 1112345566666554322 22446799999999998763 21222211 12234556677775 5555 Q ss_pred cccCCceeeeecCc--eEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 453 SVAVDEKTAVSLSG--YVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 453 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +..+|..+.+.++. +......++....+.+..+....+....+.|..+.+|++.+..++..+--- T Consensus 207 s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (274) T protein:vir:12 207 SNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLE 273 (274) T ss_pred eCCCCcceEEEEeccceeeeecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCcccc Confidence 66677666544432 222233444444455555555566777888889999999998884322111 No 127 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=98.67 E-value=3.8e-10 Score=72.22 Aligned_cols=219 Identities=16% Similarity=0.117 Sum_probs=139.7 Q ss_pred eeccc---cceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 275 HENLP---TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDM 351 (517) Q Consensus 275 ~~~~~---~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~ 351 (517) ...+. .+.+|.. .+.+.-+.||++.+...+++++.++.++.++...+++.+-.....-| ......++|+.. T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gD----p~~ea~~Q~~~~ 74 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGD----PIGESNKQLGLS 74 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCc----hHHHHHHHHHHH Confidence 11111 2234422 45678889999999999999999999999999888887654433323 356688899999 Q ss_pred HHHHHHhhhhcccccCcccccccccccccccccccccccHHHHHHHHHHhhhh-hcCCEEEEcHHHHHHHHHhhcCCC-- Q lcl|Aclame:pro 352 VIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-AADSTLVIHRNDLAAIRFLKDKNG-- 428 (517) Q Consensus 352 ~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-~~~a~~vmn~~~~~~l~~lKD~~G-- 428 (517) ++.+++..++.= ..+ ++ . ..+...+.+.+.+++...-.. ..+.+++|||.++..||+..+.+- T Consensus 75 iA~kvD~di~~~-~~~---------a~--l--~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~ 140 (231) T protein:vir:73 75 LANKVDDDLLKA-AKT---------TS--Q--TVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIG 140 (231) T ss_pred HHHhhhHHHHHh-hcc---------cc--c--cccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhh Confidence 999999998731 111 11 0 111223455565555443322 234579999999999998543322 Q ss_pred CEeccCCCCCCccceecCccceeccccCCceeeee------cCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeec Q lcl|Aclame:pro 429 NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVS------LSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEY 502 (517) Q Consensus 429 ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~ 502 (517) ..+-++-..+|....++|. .++.+..++...... .+...+....++....++++......+......+-.+.+ T Consensus 141 ~~~g~~i~~~G~iG~i~G~-~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~ 219 (231) T protein:vir:73 141 SEVGANALINGTYADVLGA-QIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYD 219 (231) T ss_pred hhhccceeeecccceEcce-EEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEc Confidence 1122233446667777775 555555555543321 234444455565666667777777777778888889999 Q ss_pred ccceEEEEeCCC Q lcl|Aclame:pro 503 KGTTAYGTYTPP 514 (517) Q Consensus 503 ~~a~~~~~~tp~ 514 (517) |.+++.++++=- T Consensus 220 ~~~vv~~t~~g~ 231 (231) T protein:vir:73 220 LTKVVNITFTGV 231 (231) T ss_pred CccEEEEEeecC Confidence 999999997544 No 128 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=97.89 E-value=5.1e-06 Score=49.59 Aligned_cols=282 Identities=10% Similarity=0.026 Sum_probs=124.7 Q ss_pred HHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeecccccceeeeccc Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTGHTTGT 299 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~~eg~ 299 (517) +..+..+.....+..+. ...++...+--..+...+.+.....+.+++++++..+.+. .-........+..+..|. T Consensus 1 ma~~~~~~~~~t~~g~~---~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKG---MSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGE 77 (347) T ss_pred CCccccccccccccccC---CcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCc Confidence 11111110000000000 0000000000123444555666666777777765444322 112234445566777777 Q ss_pred cccc--ccccceeeEeeHhh--hhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----cccc----C Q lcl|Aclame:pro 300 DKTE--SNITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVT----G 367 (517) Q Consensus 300 ~~~~--~~~~f~~~~~~~~~--~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~----~ 367 (517) .... .+++..+.++.+-. +..+ .|.. +++..... .+.+.+.+++.+++++..|+.++. +... . T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~y~~~-~Vdd--iD~~q~~~--D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~ 152 (347) T protein:vir:94 78 NLDDKRKDMKHTEKTINIDGLLTADV-LIYD--IEDAMNHY--DVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANN 152 (347) T ss_pred CCCCCcCCccccceEEEEcchhhhhh-hhhh--HHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 6543 35666766665433 3321 1111 22222111 267778889999999999987752 2111 1 Q ss_pred cccccccc----cccccccccccccccHHHHHHHHHHhhhh-----hc--CCEEEEcHHHHHHHHHh-hcCCCCEeccCC Q lcl|Aclame:pro 368 VSETQIYP----VVGDAWATNVTGTTNIQELLEKLSVATPK-----AA--DSTLVIHRNDLAAIRFL-KDKNGNYVFPVG 435 (517) Q Consensus 368 ~~~~gi~~----~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~~--~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~ 435 (517) .+..|... ..+...............+++.+.++... .+ +-.+|++|..|..|.+. .+..+.|-...+ T Consensus 153 ~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~ 232 (347) T protein:vir:94 153 ENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALID 232 (347) T ss_pred cccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccc Confidence 11111110 11111111111112233444444433221 12 33567789999887653 344444544344 Q ss_pred CCCCccceecCccceeccccCCceee-----e---------------ecCce---------EEEee----------ehee Q lcl|Aclame:pro 436 VSNQTIATHFGFNRLVQSVAVDEKTA-----V---------------SLSGY---------VTNGS----------RGME 476 (517) Q Consensus 436 ~~~~~~~~l~g~~~v~~~~~~~~~~~-----~---------------~~~~~---------~~~~~----------~~~~ 476 (517) ...+.+.++.|+..+. +..+|.... . ....| ++..+ .-++ T Consensus 233 ~~~G~V~~v~G~~V~~-Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e 311 (347) T protein:vir:94 233 PSTGSIRNVMGFEVIE-VPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALE 311 (347) T ss_pred cccceeEEeeceEEEE-cCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhccccee Confidence 5566777777764443 333332110 0 00112 11100 0122 Q ss_pred ehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 477 FEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 477 ~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) .+.|. ..-.-.+....-+|-.++||++.+...++.| T Consensus 312 ~~~~~--~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 312 RARRA--NFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred eeech--hhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 22221 1111233445567789999999999999999 No 129 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=97.82 E-value=3.1e-06 Score=50.77 Aligned_cols=279 Identities=11% Similarity=-0.020 Sum_probs=115.5 Q ss_pred hhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc---cc--ceeeeecccccceeeeccccccccc Q lcl|Aclame:pro 231 KDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL---PT--LVVGGDNALTQGTGHTTGTDKTESN 305 (517) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~a~~~~eg~~~~~~~ 305 (517) -++.+...............+|.-+...+++.+.....+.++++.... .+ ..+|. .....+.-+..+...+..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~-~g~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPR-ISELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEec-cCcceeeeecCCCcccccc Confidence 011111111001111122335666666677766666666665543211 11 22332 2223344455555544444 Q ss_pred ccceeeEeeHhh-hhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc Q lcl|Aclame:pro 306 ITLQTRVLTPQY-VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN 384 (517) Q Consensus 306 ~~f~~~~~~~~~-~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~ 384 (517) ++-..+++...+ .+.-..++..-......| +.+.+.++..++++++.|..++.---......+............ T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d----~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t 155 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEIQASYD----LRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAIT 155 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHHhhccc----hHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCcccccc Confidence 444444444422 122233443333333334 556667788999999999887642111111001000111011111 Q ss_pred ccccc-cHHHHHHH---HHHhhhhhcCCEEEEcHHHHHHHHHhhcCCC-CEeccCCCCCCccceecCccceeccccCCce Q lcl|Aclame:pro 385 VTGTT-NIQELLEK---LSVATPKAADSTLVIHRNDLAAIRFLKDKNG-NYVFPVGVSNQTIATHFGFNRLVQSVAVDEK 459 (517) Q Consensus 385 ~~~~~-~~d~l~~~---l~~~~~~~~~a~~vmn~~~~~~l~~lKD~~G-ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~ 459 (517) ..... ..+.++.+ |.....+..+-.+|++|..+..|.+...-.. .|.-+.....|.+..++|+..+. +..++.. T Consensus 156 ~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~-Sn~lp~~ 234 (341) T protein:vir:94 156 GNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIR-TSLIGNN 234 (341) T ss_pred CchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEE-ecccccc Confidence 11111 12223222 2222222234568899999999965321111 12222234566677777765443 3333321 Q ss_pred eee-----------------------------ecCceE-EEe-ee---heee----------------hhhhhcccchHH Q lcl|Aclame:pro 460 TAV-----------------------------SLSGYV-TNG-SR---GMEF----------------EQGTILVENNKE 489 (517) Q Consensus 460 ~~~-----------------------------~~~~~~-~~~-~~---~~~~----------------~~d~~~~~n~~~ 489 (517) ... +++... +.. +. .++. ..+|+...-... T Consensus 235 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (341) T protein:vir:94 235 SATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWL 314 (341) T ss_pred ccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhh Confidence 110 000000 000 00 0111 111211111223 Q ss_pred HHHhhhhcceeecccceEEEEeCCC-C Q lcl|Aclame:pro 490 YLFEMPISGSLEYKGTTAYGTYTPP-V 515 (517) Q Consensus 490 ~~~~~rvgg~v~~~~a~~~~~~tp~-~ 515 (517) +.....+|-.+.+|++.+..-...+ | T Consensus 315 i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 315 MVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhhhhhcccccCcceeEEEecCcCCC Confidence 4455667888999999766554333 4 No 130 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.82 E-value=1.2e-06 Score=53.07 Aligned_cols=267 Identities=13% Similarity=0.048 Sum_probs=123.7 Q ss_pred HhhccchhhHHHHhhhhhc---cc---ccc-cccchhhhhhHHHhHhhhhhhhhceeeecc-ccceeeee-----ccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKE---RG---ISG-MPAPAGILKRIQDAVNDEGSLLPFIRHENL-PTLVVGGD-----NALTQ 291 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~---~~---~~~-~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~-----~~~~~ 291 (517) +.. ...+.+ .+ +.. +.-|+-+..++.+.+...-..-.+.+.... .+..+... ..... T Consensus 1 ~~~----------~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d 70 (318) T protein:vir:10 1 MTA----------PTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDD 70 (318) T ss_pred CCC----------CCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCc Confidence 000 000000 00 000 112444444444444333332223333222 22222221 22356 Q ss_pred ceeeecccccccccccceeeEe-eHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----cccc Q lcl|Aclame:pro 292 GTGHTTGTDKTESNITLQTRVL-TPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVT 366 (517) Q Consensus 292 a~~~~eg~~~~~~~~~f~~~~~-~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~ 366 (517) +..+.||.+.|.+..+++...+ ..++++.-+.+|+|++.+...+.. +-...+|++.+.+..++..+- +. + T Consensus 71 ~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v----~r~~~~l~Nti~r~~d~~a~dal~sa~-t 145 (318) T protein:vir:10 71 VADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAV----NDQMLQLRNTFIRANDRSAKALLQSPI-V 145 (318) T ss_pred HhhccCcccccccCCCCCchhhhhhehhccceeccHHHHhhcChhHH----HHHHHHHHHHHHHHHHHHHHHHHhccc-c Confidence 6778999999998888876665 557889999999999999988733 334456777777777765442 11 0 Q ss_pred Ccccccccccccccccccc---cccccH-HHHHHHHH----------HhhhhhcCCEEEEcHHHHHHHHH------hhcC Q lcl|Aclame:pro 367 GVSETQIYPVVGDAWATNV---TGTTNI-QELLEKLS----------VATPKAADSTLVIHRNDLAAIRF------LKDK 426 (517) Q Consensus 367 ~~~~~gi~~~~~~~~~~~~---~~~~~~-d~l~~~l~----------~~~~~~~~a~~vmn~~~~~~l~~------lKD~ 426 (517) +. ++.++ +..... ...... +.+..+.. ...-.|...++||||.+|..|.+ +-.. T Consensus 146 --~~---~~~s~-~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~ 219 (318) T protein:vir:10 146 --PT---LAVPT-AWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYER 219 (318) T ss_pred --cc---ccCCc-CCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhc Confidence 00 00000 000000 000000 00010000 11223567899999999999943 3334 Q ss_pred CCCEecc-CCCCCCccceecCccceeccccCCceeeeecCceEEE---eeehe--eehh----hhhcccchH-HHHHhhh Q lcl|Aclame:pro 427 NGNYVFP-VGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTN---GSRGM--EFEQ----GTILVENNK-EYLFEMP 495 (517) Q Consensus 427 ~Gryl~~-~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~----d~~~~~n~~-~~~~~~r 495 (517) ++.+++. +..+...+..++| ..++.+...+...++.+..-.++ +...+ .-+. +.+...|+. ..++..+ T Consensus 220 ~a~~~~~~~~~tg~~~g~~lG-l~vi~s~~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~ 298 (318) T protein:vir:10 220 NANYVSTAPDWTGNFPGSVMG-LNVIRSRTFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHK 298 (318) T ss_pred cchhhhhcccccccccceeec-eEEeecCccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehhee Confidence 4555542 2233333455667 44555555554444322211111 11111 1110 112223322 1223334 Q ss_pred hcceeecccceEEEE--eCC Q lcl|Aclame:pro 496 ISGSLEYKGTTAYGT--YTP 513 (517) Q Consensus 496 vgg~v~~~~a~~~~~--~tp 513 (517) -.-.|.+|.|.+.++ .|| T Consensus 299 ~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 299 RALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeeeeCcceeEEEeeccCC Confidence 455799999999655 577 No 131 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=97.73 E-value=3e-06 Score=50.86 Aligned_cols=228 Identities=11% Similarity=0.071 Sum_probs=130.3 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc---cceeeeecccccceeeeccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGTDK 301 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~eg~~~ 301 (517) +... ....-.+.. ......|......|++.+...++++...++.... .-...++.+...+.|..-++.. T Consensus 1 m~~~------~~~~~TL~e--~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~ 72 (328) T protein:vir:95 1 MAVK------GLTALTLAD--WGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGV 72 (328) T ss_pred CCcc------ccccccHHH--HHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCcc Confidence 0000 000000000 0011224455667888899999999888876653 2345567788889999999999 Q ss_pred ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc--cc-------- Q lcl|Aclame:pro 302 TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS--ET-------- 371 (517) Q Consensus 302 ~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~--~~-------- 371 (517) +++..++.+++...+.+++...+.+.+..... ....+...-.....+++.+..+.+|++||.+..+ +. T Consensus 73 ~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~G--n~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~ 150 (328) T protein:vir:95 73 QPSKSTTVQVTDSVGMLETYAEVDKSLADLNG--NTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSS 150 (328) T ss_pred CcccceeEEEEEEEEEEecceeechHHHhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCc Confidence 99999999999999999999999998766543 2233444444557788999999999999654221 00 Q ss_pred ---------------------------------ccccccc---------------------------------------- Q lcl|Aclame:pro 372 ---------------------------------QIYPVVG---------------------------------------- 378 (517) Q Consensus 372 ---------------------------------gi~~~~~---------------------------------------- 378 (517) ||++.-. T Consensus 151 ~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~ 230 (328) T protein:vir:95 151 LSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDW 230 (328) T ss_pred cccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCc Confidence 1111000 Q ss_pred -------ccccccccccccHHHHHHHHHHhhhhh-----cCCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCccceec Q lcl|Aclame:pro 379 -------DAWATNVTGTTNIQELLEKLSVATPKA-----ADSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTIATHF 445 (517) Q Consensus 379 -------~~~~~~~~~~~~~d~l~~~l~~~~~~~-----~~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~~~l~ 445 (517) +.+....+..+...++++.|..+.... .+.+|.||++....|++. .++..-++-........+.... T Consensus 231 r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~ 310 (328) T protein:vir:95 231 RYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFR 310 (328) T ss_pred ccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEEC Confidence 000000011122345666666654433 346899999999999985 4555555544444444444444 Q ss_pred Cccceecccc-CCceeeee Q lcl|Aclame:pro 446 GFNRLVQSVA-VDEKTAVS 463 (517) Q Consensus 446 g~~~v~~~~~-~~~~~~~~ 463 (517) |++ +...+. .....++. T Consensus 311 gip-ir~~dai~~tE~~vv 328 (328) T protein:vir:95 311 GVP-IRETDALLETEARVV 328 (328) T ss_pred CeE-EEEEeeeecCccccC Confidence 443 333222 11111111 No 132 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.70 E-value=2.9e-06 Score=50.91 Aligned_cols=228 Identities=17% Similarity=0.144 Sum_probs=126.5 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc--cc-ceeeeecccccceeeeccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL--PT-LVVGGDNALTQGTGHTTGTDK 301 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~a~~~~eg~~~ 301 (517) +........ .+.. ......|......|++.+...++++...++... +. .....+++...++|..-+... T Consensus 1 m~~~~~~a~------TL~e--~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~ 72 (330) T protein:vir:10 1 MATLSTNNP------TMAD--VAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGV 72 (330) T ss_pred CCcCCCCcc------cHHH--HHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCcc Confidence 000000000 0000 001112344556688888888888887776532 11 223445667788899888888 Q ss_pred ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc--cc-------- Q lcl|Aclame:pro 302 TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS--ET-------- 371 (517) Q Consensus 302 ~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~--~~-------- 371 (517) +++..++.+++...+.++++..+.+.+..... ....+.........+++.+.....|++||-+..+ +. T Consensus 73 ~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~G--n~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~ 150 (330) T protein:vir:10 73 LPNKSSTAQVTDNCGMLEAYAEVDKALADLNG--NTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNS 150 (330) T ss_pred ccccceEEEEEEEeEEecchhhhhhHHHhhcC--CHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCC Confidence 88999999999999999999999998766544 2234444455567889999999999999643211 11 Q ss_pred ---------------------------------ccccccc-----------------c---------------------- Q lcl|Aclame:pro 372 ---------------------------------QIYPVVG-----------------D---------------------- 379 (517) Q Consensus 372 ---------------------------------gi~~~~~-----------------~---------------------- 379 (517) ||++... + T Consensus 151 ~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~ 230 (330) T protein:vir:10 151 LSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLR 230 (330) T ss_pred CCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEe Confidence 1111000 0 Q ss_pred ----------cccccccccccHHHHHHHHHHhhhhhc-----CCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCccce Q lcl|Aclame:pro 380 ----------AWATNVTGTTNIQELLEKLSVATPKAA-----DSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTIAT 443 (517) Q Consensus 380 ----------~~~~~~~~~~~~d~l~~~l~~~~~~~~-----~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~~~ 443 (517) ............+++++.+..+....+ +.+|.||++....|++. .+++.-.|=......-.+ + T Consensus 231 d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~-t 309 (330) T protein:vir:10 231 DWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERV-M 309 (330) T ss_pred CcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeee-E Confidence 000001112244567777777665433 46799999999999985 566554443223322223 3 Q ss_pred ecCccceeccccC-Cceeeee Q lcl|Aclame:pro 444 HFGFNRLVQSVAV-DEKTAVS 463 (517) Q Consensus 444 l~g~~~v~~~~~~-~~~~~~~ 463 (517) .|++-++...+.. ....++. T Consensus 310 ~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 310 TFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred EECCeEEEEEeeeecCccccC Confidence 3433333322221 1111111 No 133 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=97.69 E-value=2.8e-05 Score=45.54 Aligned_cols=238 Identities=13% Similarity=0.074 Sum_probs=101.9 Q ss_pred ceeeeccc-cceeeeecccccceeeeccccccc--ccccceeeEe--eHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHH Q lcl|Aclame:pro 272 FIRHENLP-TLVVGGDNALTQGTGHTTGTDKTE--SNITLQTRVL--TPQYVYKYIKLPKIVMNSNATDIAGAILTYVMN 346 (517) Q Consensus 272 ~~~~~~~~-~~~~~~~~~~~~a~~~~eg~~~~~--~~~~f~~~~~--~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~ 346 (517) ++|....+ +..++ +.....+..+..|+.... .+++-.+.++ .-..+..+ .+...--..+.+| +.+...+ T Consensus 1 ~vr~i~~g~s~~~~-~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~D----lr~e~s~ 74 (324) T protein:vir:99 1 MTRTITSGKSAQFP-VMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHYD----VRSEYST 74 (324) T ss_pred CeeeeecCceEEEe-eeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhh-hhhhHHHHhcCcc----chhHHHH Confidence 44432222 22222 233344555666665422 2233344333 33333221 1111111112233 7778888 Q ss_pred HHHHHHHHHHHhhhhc----cc--c---cCccc--ccccccccccccccccccccHHHHHHHHHHhhhh-----h--cCC Q lcl|Aclame:pro 347 RLPDMVIMAVNRAIIM----GG--V---TGVSE--TQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-----A--ADS 408 (517) Q Consensus 347 ~l~~~~~~~~e~~~l~----G~--G---~~~~~--~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~--~~a 408 (517) ++.+++++..|+.++. +. . ...+. .|...... ............+.+.+++..+... . .+- T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR 153 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVK-ITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDR 153 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceec-ccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCC Confidence 9999999999977641 11 0 01111 11111110 0111111122334455544433221 1 234 Q ss_pred EEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCccceecCccceeccccCCceeee------ec--------------Cce Q lcl|Aclame:pro 409 TLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAV------SL--------------SGY 467 (517) Q Consensus 409 ~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~------~~--------------~~~ 467 (517) .+||+|..|..|..- +-.++.|.-.....++.+..+.|+.. +.+..++..... +. ..| T Consensus 154 ~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V-~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky 232 (324) T protein:vir:99 154 TFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEV-VETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKM 232 (324) T ss_pred EEEeChHHHHHHhhcccccccccccccceecceEEEEeceEE-EecCCcccccccccccccccccccccccccccccccc Confidence 689999999877533 23344555444556677777777544 434444332100 00 011 Q ss_pred EE-----------------EeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeC--------CCC-CC Q lcl|Aclame:pro 468 VT-----------------NGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT--------PPV-AG 517 (517) Q Consensus 468 ~~-----------------~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~t--------p~~-a~ 517 (517) .. .-.+.+.....++...-...+.....+|-.+.||++.+..++. |.| .| T Consensus 233 ~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~~~ 308 (324) T protein:vir:99 233 TVGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVITG 308 (324) T ss_pred ccccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhhhh Confidence 10 0011111111111111122334456677789999999877752 221 11 No 134 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=97.64 E-value=3e-06 Score=50.87 Aligned_cols=252 Identities=10% Similarity=-0.030 Sum_probs=106.4 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +. ....+|.-+...+.+.+.....+.+++.... .. +..+|.-.....+....++...+..+.+...+++. T Consensus 1 MA----~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MA----FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Cc----chhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 11 1222455555666666666555555543211 11 12333322222222223333333334444444444 Q ss_pred Hhhh-hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---ccccCccccccccccccccccccccccc Q lcl|Aclame:pro 315 PQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---GGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 315 ~~~~-~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) ..+. +.-..++..--.....+ +++ +.+++.++++.++|..++. +.+... ......+.... T Consensus 77 id~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~-----------~~~~~~~~~~~ 140 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADMLVDNGTAL-----------TGSAPTDADDA 140 (273) T ss_pred EeeeeecceEeecHHHhhhhcc----HHH-HHHHHHHHHHHHHHHHHHHHHhcccccc-----------ccccccchhHH Confidence 3221 11122332111122222 566 4556888999999877652 211100 00111111122 Q ss_pred HHHHHHHHHHh---hhhhcCCEEEEcHHHHHHHHHhhcCCCC-Eec-c-CCCCCCccceecCccceeccccCCce---ee Q lcl|Aclame:pro 391 IQELLEKLSVA---TPKAADSTLVIHRNDLAAIRFLKDKNGN-YVF-P-VGVSNQTIATHFGFNRLVQSVAVDEK---TA 461 (517) Q Consensus 391 ~d~l~~~l~~~---~~~~~~a~~vmn~~~~~~l~~lKD~~Gr-yl~-~-~~~~~~~~~~l~g~~~v~~~~~~~~~---~~ 461 (517) .+.+.++.... ..+..+-.+|++|..+..|.+..+---+ +.. . ....+|.++.+.|+..+ .+..+|.. .+ T Consensus 141 ~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~-~s~~lp~~~~~~~ 219 (273) T protein:vir:10 141 FDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIV-ESNNLRDTDDEQF 219 (273) T ss_pred HHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEE-EecccccCCccEE Confidence 33333332222 2222345789999999998764321111 111 1 12235667777786444 44334321 12 Q ss_pred ee-c-CceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 462 VS-L-SGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 462 ~~-~-~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) +. + +.+....+. .......+-..-...+....+.|..|.+|++++....|-. T Consensus 220 ~~~~~~A~~~a~q~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 220 VAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEEeccceeeeeee-ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 21 1 222221111 1111111111111233445667888999998877655444 No 135 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=97.64 E-value=3e-06 Score=50.87 Aligned_cols=252 Identities=10% Similarity=-0.030 Sum_probs=106.4 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeec----cc--cceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN----LP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~----~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +. ....+|.-+...+.+.+.....+.+++.... .. +..+|.-.....+....++...+..+.+...+++. T Consensus 1 MA----~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MA----FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Cc----chhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEE Confidence 11 1222455555666666666555555543211 11 12333322222222223333333334444444444 Q ss_pred Hhhh-hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---ccccCccccccccccccccccccccccc Q lcl|Aclame:pro 315 PQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---GGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 315 ~~~~-~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) ..+. +.-..++..--.....+ +++ +.+++.++++.++|..++. +.+... ......+.... T Consensus 77 id~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~-----------~~~~~~~~~~~ 140 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADMLVDNGTAL-----------TGSAPTDADDA 140 (273) T ss_pred EeeeeecceEeecHHHhhhhcc----HHH-HHHHHHHHHHHHHHHHHHHHHhcccccc-----------ccccccchhHH Confidence 3221 11122332111122222 566 4556888999999877652 211100 00111111122 Q ss_pred HHHHHHHHHHh---hhhhcCCEEEEcHHHHHHHHHhhcCCCC-Eec-c-CCCCCCccceecCccceeccccCCce---ee Q lcl|Aclame:pro 391 IQELLEKLSVA---TPKAADSTLVIHRNDLAAIRFLKDKNGN-YVF-P-VGVSNQTIATHFGFNRLVQSVAVDEK---TA 461 (517) Q Consensus 391 ~d~l~~~l~~~---~~~~~~a~~vmn~~~~~~l~~lKD~~Gr-yl~-~-~~~~~~~~~~l~g~~~v~~~~~~~~~---~~ 461 (517) .+.+.++.... ..+..+-.+|++|..+..|.+..+---+ +.. . ....+|.++.+.|+..+ .+..+|.. .+ T Consensus 141 ~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~-~s~~lp~~~~~~~ 219 (273) T protein:vir:10 141 FDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIV-ESNNLRDTDDEQF 219 (273) T ss_pred HHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEE-EecccccCCccEE Confidence 33333332222 2222345789999999998764321111 111 1 12235667777786444 44334321 12 Q ss_pred ee-c-CceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 462 VS-L-SGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 462 ~~-~-~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) +. + +.+....+. .......+-..-...+....+.|..|.+|++++....|-. T Consensus 220 ~~~~~~A~~~a~q~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 220 VAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEEeccceeeeeee-ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 21 1 222221111 1111111111111233445667888999998877655444 No 136 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=97.57 E-value=3.8e-06 Score=50.28 Aligned_cols=252 Identities=12% Similarity=-0.007 Sum_probs=109.9 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeee----ccc--cceeeeecccccceeeecccccccccccceeeEee Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE----NLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLT 314 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~----~~~--~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~ 314 (517) +.. ...+|.-+...+.+.+.....+.+++... +.. +..+|.-.....+....+|...+..+++...+++. T Consensus 1 MA~----~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~t 76 (273) T protein:vir:79 1 MAF----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) T ss_pred Ccc----hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEE Confidence 111 12345555555666665555555554221 111 22333322222222334555444445555555555 Q ss_pred Hhhh-hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---ccccCccccccccccccccccccccccc Q lcl|Aclame:pro 315 PQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---GGVTGVSETQIYPVVGDAWATNVTGTTN 390 (517) Q Consensus 315 ~~~~-~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G~G~~~~~~gi~~~~~~~~~~~~~~~~~ 390 (517) +.+. +.-..++..-...+..| +.++ .+++.+.+++++|..++. +.++.. ......+.... T Consensus 77 id~~~~~~~~i~d~d~~~~~~~----~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~-----------~~~~~~~~~~~ 140 (273) T protein:vir:79 77 IDQEKSIDFLVDDIDRVQVAGS----LEAY-TRAGATALATDTDKFIADMLVDNGTAL-----------TGSAPSDADDA 140 (273) T ss_pred EeeecccceeeccHHHHhhccc----HHHH-HHHHHHHHHHHHHHHHHHHHhhccccc-----------ccccccchhhH Confidence 5332 22223333222222222 5664 456888899999876542 211110 00111111112 Q ss_pred HHHHHHHHHH---hhhhhcCCEEEEcHHHHHHHHHhhcC--CCCEecc-CCCCCCccceecCccceeccccCCcee---e Q lcl|Aclame:pro 391 IQELLEKLSV---ATPKAADSTLVIHRNDLAAIRFLKDK--NGNYVFP-VGVSNQTIATHFGFNRLVQSVAVDEKT---A 461 (517) Q Consensus 391 ~d~l~~~l~~---~~~~~~~a~~vmn~~~~~~l~~lKD~--~Gryl~~-~~~~~~~~~~l~g~~~v~~~~~~~~~~---~ 461 (517) .+.+.++... ...+-.+-.+|++|..+..|.+..+. +-.+.-. ....+|.++.++|+. ++.+..++... . T Consensus 141 ~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~-i~~s~~lp~~~~~~~ 219 (273) T protein:vir:79 141 FDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVESNNLRDTDDEQF 219 (273) T ss_pred HHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceE-EEecccccccCceEE Confidence 2333333222 21222345789999999998765431 1111111 123356677788864 44444444321 1 Q ss_pred e-ec-CceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 462 V-SL-SGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 462 ~-~~-~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) + .+ +.+...-+. .......+...-...+....+.|..|.+|++++....+-. T Consensus 220 ~a~~~~A~~~a~~~-~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 220 VAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred EEEeccceeeeeeh-hhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 1 11 222221111 1111111111112233445667888999998877655444 No 137 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=97.33 E-value=9.4e-05 Score=42.66 Aligned_cols=276 Identities=11% Similarity=0.029 Sum_probs=120.1 Q ss_pred HHHHhhccchhhHHHHhhhhhccccccccc-----chhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccce Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGISGMPA-----PAGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGT 293 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-----p~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~ 293 (517) +..+.++..... .......+..- -..+..++.+.....+.+++++++..+.+. .++ ......+. T Consensus 1 ma~~~~~~~~n~-------~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~-~iG~~~~~ 72 (344) T protein:vir:10 1 MANMTGGQQLGT-------NQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFP-VLGRTQAA 72 (344) T ss_pred CccccccccCCc-------ccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEE-eeceeEEE Confidence 110000000000 00000000000 123444566666777777888776555432 222 33345566 Q ss_pred eeecccccccc--cccceeeEeeHhh--hhH-hHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----cc Q lcl|Aclame:pro 294 GHTTGTDKTES--NITLQTRVLTPQY--VYK-YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GG 364 (517) Q Consensus 294 ~~~eg~~~~~~--~~~f~~~~~~~~~--~~~-~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~ 364 (517) .+..|+....+ ++.-.++++.+-+ +.. ++. . +++...+ -.+.+.+.+++.+++++..|+.++. +. T Consensus 73 ~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~Vd--D--iD~~q~~--~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a 146 (344) T protein:vir:10 73 YLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIY--D--IEDAMNH--YDVRSEYTSQLGESLAMAADGAVLAEIAGLC 146 (344) T ss_pred eeecCCCCCCCCCCcccceEEEEEcchhhhhhhhh--h--HHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 77777776532 3555565554322 222 221 1 2222211 1267778889999999999987642 22 Q ss_pred ccCcc----cc----cccccccccccccccccccHHHHHHHHHHhhhh-----h--cCCEEEEcHHHHHHHHHhhc-CCC Q lcl|Aclame:pro 365 VTGVS----ET----QIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-----A--ADSTLVIHRNDLAAIRFLKD-KNG 428 (517) Q Consensus 365 G~~~~----~~----gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~--~~a~~vmn~~~~~~l~~lKD-~~G 428 (517) ....+ .. ++....+.............+.+.+++..+... . .+-.+|++|..|..|..-+. .+. T Consensus 147 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~ 226 (344) T protein:vir:10 147 NVESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAA 226 (344) T ss_pred ccccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhccccccc Confidence 11111 11 111111111111111122223444444333221 1 23457889999998854322 123 Q ss_pred CEeccCCCCCCccceecCccceeccccCCcee------eeecCceEEEe---------------------------ee-- Q lcl|Aclame:pro 429 NYVFPVGVSNQTIATHFGFNRLVQSVAVDEKT------AVSLSGYVTNG---------------------------SR-- 473 (517) Q Consensus 429 ryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~------~~~~~~~~~~~---------------------------~~-- 473 (517) .|.-......|.+..+.|+.++. +..++... +.....|.... .+ T Consensus 227 ~~~~~~~~~~G~V~~v~G~~V~~-Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~ 305 (344) T protein:vir:10 227 NYAALIDPEKGSIRNVMGFEVVE-VPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDL 305 (344) T ss_pred ccccccceeeeEEEEEeceEEEe-ccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccc Confidence 33322233455666677754433 33333210 11111111100 00 Q ss_pred heeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 474 GMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 474 ~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) -++.+.+. ..-...+......|-.+.+|++.+...+++- T Consensus 306 ~~e~~r~~--~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 306 ALERARRA--NFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred eeecccch--hHHHHHHHHHhhcccceecccceEEEEeecC Confidence 11111111 1111123445567778999999999999888 No 138 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=97.32 E-value=9.4e-05 Score=42.65 Aligned_cols=282 Identities=11% Similarity=-0.002 Sum_probs=118.4 Q ss_pred HHHHhhccchhhHHHHhhhhhcccc-cccccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeecccccceeeecc Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGI-SGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTGHTTG 298 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~~eg 298 (517) +..+.++.....+.-. ....++ ..+.+ ..+...+.......+.+.++++...+.+. .-..+.....+..+..| T Consensus 1 ~a~~~~~~~~~~~~g~---~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g 76 (347) T protein:vir:88 1 MANATGGQQIGANQGK---GQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPG 76 (347) T ss_pred CCCcccchhhhccCCC---CccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccc Confidence 0000000000000000 000000 00011 23334455556666667777766444322 11223333445556666 Q ss_pred ccccc--ccccceeeEeeHhh--hhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----ccccC--- Q lcl|Aclame:pro 299 TDKTE--SNITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVTG--- 367 (517) Q Consensus 299 ~~~~~--~~~~f~~~~~~~~~--~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~~--- 367 (517) ..... .++...++++.+-+ +.. ..|...---...+| +.+-+.+++.+++++..|+.++. +.... T Consensus 77 ~~l~~~~~~~~~~~~~i~ID~~~y~~-~~Vdd~D~~q~~~D----~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:88 77 ENLDDKRKDIKHSEKVIQIDGLLTSD-VLIYDIEDAMNHYD----VRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) T ss_pred cCCCCCCCCCccceEEEEEechhhhh-hhhhhHHHHhhcCC----chHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 65433 34556666654433 222 11222111122223 56667788999999999887752 21110 Q ss_pred -ccccccccccccc---cccccccccc----HHHHHHHHHHhh---hhhcCCEEEEcHHHHHHHHHhh-cCCCCEeccCC Q lcl|Aclame:pro 368 -VSETQIYPVVGDA---WATNVTGTTN----IQELLEKLSVAT---PKAADSTLVIHRNDLAAIRFLK-DKNGNYVFPVG 435 (517) Q Consensus 368 -~~~~gi~~~~~~~---~~~~~~~~~~----~d~l~~~l~~~~---~~~~~a~~vmn~~~~~~l~~lK-D~~Gryl~~~~ 435 (517) ....|+-...... .......... .+.++++..... .+..+-.+|++|..|..|.+-. .....|.-... T Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~ 231 (347) T protein:vir:88 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) T ss_pred ccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccc Confidence 1111111110000 0000011111 233333322211 1222457899999998875432 33344443334 Q ss_pred CCCCccceecCccceeccccCCceee----------eec----------CceE--------EEe---------ee--hee Q lcl|Aclame:pro 436 VSNQTIATHFGFNRLVQSVAVDEKTA----------VSL----------SGYV--------TNG---------SR--GME 476 (517) Q Consensus 436 ~~~~~~~~l~g~~~v~~~~~~~~~~~----------~~~----------~~~~--------~~~---------~~--~~~ 476 (517) ...+.+..+.|+..+. +..+|.... ... ..|. ++. -+ -++ T Consensus 232 ~~~G~vg~i~G~~V~~-s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e 310 (347) T protein:vir:88 232 PETGNIRNVMGFEVIE-VPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) T ss_pred hhcceeeeeccceEEE-eecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceee Confidence 4556666777754433 333331100 000 0000 000 00 011 Q ss_pred ehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 477 FEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 477 ~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) .+.+. ..-...+......|-.+.+|++.+...++++- T Consensus 311 ~~r~~--~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 311 RARRP--EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred eeech--hhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 11111 11112345567788899999999999988877 No 139 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=97.30 E-value=3.7e-05 Score=44.90 Aligned_cols=227 Identities=14% Similarity=0.070 Sum_probs=123.3 Q ss_pred Hhh-ccc-hhhHHHHhhhhhcccccccccch-hhhhhHHHhHhhhhhhhhceeeeccc--c-ceeeeecccccceeeecc Q lcl|Aclame:pro 225 MSA-SLT-KDPKAAWTAELKERGISGMPAPA-GILKRIQDAVNDEGSLLPFIRHENLP--T-LVVGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~-~~~-~~~~~~~~~~~~~~~~~~~~vp~-~i~~~i~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~a~~~~eg 298 (517) |.. +.. ..... +.. ..-|. .+...|++.+.+.++++...++.... . -....+++...+.|..-+ T Consensus 1 m~~~~~~~~TL~e-~Ak---------~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN 70 (331) T protein:vir:10 1 MPTLSTTNPTLAD-VAA---------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) T ss_pred CCccccCcccHHH-HHH---------hcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccC Confidence 000 000 00000 000 00021 23345778888888988887776332 1 233456677889999989 Q ss_pred cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC----------- Q lcl|Aclame:pro 299 TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG----------- 367 (517) Q Consensus 299 ~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~----------- 367 (517) +..+++..++.+++...+.+++...+.+.+..... ....+...-.....+++.+..+.+|++||-+. T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G--n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR 148 (331) T protein:vir:10 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG--NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechHHHhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhh Confidence 99999999999999999999999999998766554 23344444555577889999999999997321 Q ss_pred -------------------cccc-------------ccccccc-------c----------------------------- Q lcl|Aclame:pro 368 -------------------VSET-------------QIYPVVG-------D----------------------------- 379 (517) Q Consensus 368 -------------------~~~~-------------gi~~~~~-------~----------------------------- 379 (517) .+.+ ||++.-. + T Consensus 149 ~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i 228 (331) T protein:vir:10 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) T ss_pred ccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEE Confidence 1111 1111000 0 Q ss_pred -----------ccccccc-ccccHHHHHHHHHHhhhhh-----cCCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCcc Q lcl|Aclame:pro 380 -----------AWATNVT-GTTNIQELLEKLSVATPKA-----ADSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTI 441 (517) Q Consensus 380 -----------~~~~~~~-~~~~~d~l~~~l~~~~~~~-----~~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~ 441 (517) ....... .+..-.++++++..+.... .+.+|.||++....|++. .++..-+.+...-..+.. T Consensus 229 ~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~ 308 (331) T protein:vir:10 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) T ss_pred cCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcc Confidence 0000000 0111134666666665433 346899999999999986 455343433333333333 Q ss_pred ceecCccceeccccC-Cceeeee Q lcl|Aclame:pro 442 ATHFGFNRLVQSVAV-DEKTAVS 463 (517) Q Consensus 442 ~~l~g~~~v~~~~~~-~~~~~~~ 463 (517) .+.|++-++...+.. ....++. T Consensus 309 ~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eeEECCeeEEEeeeeecCccccC Confidence 344443334332221 1111111 No 140 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=97.30 E-value=3.7e-05 Score=44.90 Aligned_cols=227 Identities=14% Similarity=0.070 Sum_probs=123.3 Q ss_pred Hhh-ccc-hhhHHHHhhhhhcccccccccch-hhhhhHHHhHhhhhhhhhceeeeccc--c-ceeeeecccccceeeecc Q lcl|Aclame:pro 225 MSA-SLT-KDPKAAWTAELKERGISGMPAPA-GILKRIQDAVNDEGSLLPFIRHENLP--T-LVVGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~-~~~-~~~~~~~~~~~~~~~~~~~~vp~-~i~~~i~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~a~~~~eg 298 (517) |.. +.. ..... +.. ..-|. .+...|++.+.+.++++...++.... . -....+++...+.|..-+ T Consensus 1 m~~~~~~~~TL~e-~Ak---------~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN 70 (331) T protein:vir:98 1 MPTLSTTNPTLAD-VAA---------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) T ss_pred CCccccCcccHHH-HHH---------hcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccC Confidence 000 000 00000 000 00021 23345778888888988887776332 1 233456677889999989 Q ss_pred cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC----------- Q lcl|Aclame:pro 299 TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG----------- 367 (517) Q Consensus 299 ~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~----------- 367 (517) +..+++..++.+++...+.+++...+.+.+..... ....+...-.....+++.+..+.+|++||-+. T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G--n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR 148 (331) T protein:vir:98 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG--NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechHHHhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhh Confidence 99999999999999999999999999998766554 23344444555577889999999999997321 Q ss_pred -------------------cccc-------------ccccccc-------c----------------------------- Q lcl|Aclame:pro 368 -------------------VSET-------------QIYPVVG-------D----------------------------- 379 (517) Q Consensus 368 -------------------~~~~-------------gi~~~~~-------~----------------------------- 379 (517) .+.+ ||++.-. + T Consensus 149 ~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i 228 (331) T protein:vir:98 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) T ss_pred ccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEE Confidence 1111 1111000 0 Q ss_pred -----------ccccccc-ccccHHHHHHHHHHhhhhh-----cCCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCcc Q lcl|Aclame:pro 380 -----------AWATNVT-GTTNIQELLEKLSVATPKA-----ADSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTI 441 (517) Q Consensus 380 -----------~~~~~~~-~~~~~d~l~~~l~~~~~~~-----~~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~ 441 (517) ....... .+..-.++++++..+.... .+.+|.||++....|++. .++..-+.+...-..+.. T Consensus 229 ~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~ 308 (331) T protein:vir:98 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) T ss_pred cCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcc Confidence 0000000 0111134666666665433 346899999999999986 455343433333333333 Q ss_pred ceecCccceeccccC-Cceeeee Q lcl|Aclame:pro 442 ATHFGFNRLVQSVAV-DEKTAVS 463 (517) Q Consensus 442 ~~l~g~~~v~~~~~~-~~~~~~~ 463 (517) .+.|++-++...+.. ....++. T Consensus 309 ~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:98 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eeEECCeeEEEeeeeecCccccC Confidence 344443334332221 1111111 No 141 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=97.30 E-value=3.7e-05 Score=44.90 Aligned_cols=227 Identities=14% Similarity=0.070 Sum_probs=123.3 Q ss_pred Hhh-ccc-hhhHHHHhhhhhcccccccccch-hhhhhHHHhHhhhhhhhhceeeeccc--c-ceeeeecccccceeeecc Q lcl|Aclame:pro 225 MSA-SLT-KDPKAAWTAELKERGISGMPAPA-GILKRIQDAVNDEGSLLPFIRHENLP--T-LVVGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~-~~~-~~~~~~~~~~~~~~~~~~~~vp~-~i~~~i~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~a~~~~eg 298 (517) |.. +.. ..... +.. ..-|. .+...|++.+.+.++++...++.... . -....+++...+.|..-+ T Consensus 1 m~~~~~~~~TL~e-~Ak---------~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN 70 (331) T protein:vir:10 1 MPTLSTTNPTLAD-VAA---------RMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLN 70 (331) T ss_pred CCccccCcccHHH-HHH---------hcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccC Confidence 000 000 00000 000 00021 23345778888888988887776332 1 233456677889999989 Q ss_pred cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC----------- Q lcl|Aclame:pro 299 TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG----------- 367 (517) Q Consensus 299 ~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~----------- 367 (517) +..+++..++.+++...+.+++...+.+.+..... ....+...-.....+++.+..+.+|++||-+. T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~G--n~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR 148 (331) T protein:vir:10 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNG--NSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPR 148 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechHHHhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhh Confidence 99999999999999999999999999998766554 23344444555577889999999999997321 Q ss_pred -------------------cccc-------------ccccccc-------c----------------------------- Q lcl|Aclame:pro 368 -------------------VSET-------------QIYPVVG-------D----------------------------- 379 (517) Q Consensus 368 -------------------~~~~-------------gi~~~~~-------~----------------------------- 379 (517) .+.+ ||++.-. + T Consensus 149 ~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i 228 (331) T protein:vir:10 149 FNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTL 228 (331) T ss_pred ccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEE Confidence 1111 1111000 0 Q ss_pred -----------ccccccc-ccccHHHHHHHHHHhhhhh-----cCCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCCcc Q lcl|Aclame:pro 380 -----------AWATNVT-GTTNIQELLEKLSVATPKA-----ADSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQTI 441 (517) Q Consensus 380 -----------~~~~~~~-~~~~~d~l~~~l~~~~~~~-----~~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~~~ 441 (517) ....... .+..-.++++++..+.... .+.+|.||++....|++. .++..-+.+...-..+.. T Consensus 229 ~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~ 308 (331) T protein:vir:10 229 RDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKK 308 (331) T ss_pred cCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcc Confidence 0000000 0111134666666665433 346899999999999986 455343433333333333 Q ss_pred ceecCccceeccccC-Cceeeee Q lcl|Aclame:pro 442 ATHFGFNRLVQSVAV-DEKTAVS 463 (517) Q Consensus 442 ~~l~g~~~v~~~~~~-~~~~~~~ 463 (517) .+.|++-++...+.. ....++. T Consensus 309 ~t~~~gipir~~dai~~tE~~Vv 331 (331) T protein:vir:10 309 VVAFDGIPCRRTDALLLTEARVV 331 (331) T ss_pred eeEECCeeEEEeeeeecCccccC Confidence 344443334332221 1111111 No 142 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=97.29 E-value=0.0001 Score=42.45 Aligned_cols=275 Identities=12% Similarity=0.040 Sum_probs=122.3 Q ss_pred HhhccchhhHHHHhhhhh-ccccccccc-----chhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELK-ERGISGMPA-----PAGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGH 295 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~-~~~~~~~~v-----p~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~ 295 (517) |......+ ..+.. ..+..+... -..+..++.+.....+.++++++...+.+. .++ +.....+..+ T Consensus 1 ~~~~~~~~-----~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~-~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQ-----QMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFP-VLGRTQAAYL 74 (345) T ss_pred Ccccccch-----hcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEe-eecceEEEee Confidence 11000000 00000 000001101 123444566667777777888776555432 222 3344566777 Q ss_pred ecccccccc--cccceeeEee--HhhhhH-hHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----cccc Q lcl|Aclame:pro 296 TTGTDKTES--NITLQTRVLT--PQYVYK-YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVT 366 (517) Q Consensus 296 ~eg~~~~~~--~~~f~~~~~~--~~~~~~-~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~ 366 (517) ..|+....+ ++...+.++. -..+.. ++. -+++.... -.+.+.+.+++.+++++..|+.++. +... T Consensus 75 ~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~Vd----diD~~q~~--~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~ 148 (345) T protein:vir:22 75 APGENLDDKRKDIKHTEKVITIDGLLTADVLIY----DIEDAMNH--YDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (345) T ss_pred ecCCCCCCCCCCcccceEEEEecchhhhhhhHh----hHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 777765332 4556664444 222322 221 12222211 1277788889999999999987762 1111 Q ss_pred Ccc----c----ccccccccccccccccccccHHHHHHHHHHhhhh-------hcCCEEEEcHHHHHHHHHhhcC-CCCE Q lcl|Aclame:pro 367 GVS----E----TQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-------AADSTLVIHRNDLAAIRFLKDK-NGNY 430 (517) Q Consensus 367 ~~~----~----~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-------~~~a~~vmn~~~~~~l~~lKD~-~Gry 430 (517) ..+ . .++....+.............+.+.+++..+... ..+-.+|++|..|..|..-+.- +..| T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~ 228 (345) T protein:vir:22 149 ESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANY 228 (345) T ss_pred cccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccc Confidence 111 1 1111111111111111112233444444433221 1234689999999988543322 3345 Q ss_pred eccCCCCCCccceecCccceeccccCCceee---------------------------------eecCceEEEe-eeh-- Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSVAVDEKTA---------------------------------VSLSGYVTNG-SRG-- 474 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~---------------------------------~~~~~~~~~~-~~~-- 474 (517) .-......|.+..+.|+.++. +..++...+ +++...+... .+. T Consensus 229 ~~~~~~~~G~V~~i~G~~V~~-sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~ 307 (345) T protein:vir:22 229 AALIDPEKGSIRNVMGFEVVE-VPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLA 307 (345) T ss_pred ccccccccceEEEEeceEEEe-cccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecce Confidence 433334456666777764443 222221100 0011111001 111 Q ss_pred eeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCC Q lcl|Aclame:pro 475 MEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) Q Consensus 475 ~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~ 514 (517) ++.+.+. ..-.-.+......|-.+.+|++.+..++.-- T Consensus 308 ~e~~r~~--~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 308 LERARRA--NFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred eeeeech--hHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 2222221 1111233445667788999999998887655 No 143 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=97.12 E-value=0.0001 Score=42.41 Aligned_cols=279 Identities=11% Similarity=0.037 Sum_probs=115.6 Q ss_pred HhhccchhhHHHHhhhhhccccc----ccccchhhhhhHHHhHhhhhhhhhceeeeccccce--eeeecccccceeeecc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGIS----GMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV--VGGDNALTQGTGHTTG 298 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~----~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~a~~~~eg 298 (517) |....... ....-...+.. .+.+ ..+...+.......+.+.++++...+.+.. -........+..++.| T Consensus 1 m~~~~~~~----~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G 75 (347) T protein:vir:94 1 MANVPGQK----IGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPG 75 (347) T ss_pred CCCCCccc----cccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCC Confidence 00000000 00000000000 0000 223334444455555666666655543321 1222334455566666 Q ss_pred cccccc--cccceeeEeeH--hhhhH-hHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc-----ccccC- Q lcl|Aclame:pro 299 TDKTES--NITLQTRVLTP--QYVYK-YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM-----GGVTG- 367 (517) Q Consensus 299 ~~~~~~--~~~f~~~~~~~--~~~~~-~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~-----G~G~~- 367 (517) +..+.+ +.+-.++++.+ .++.. ++. . +++.... -.+.+-+.+++.+++++..|+.++. .+-.+ T Consensus 76 ~~l~~~~~~~~~~e~~itID~~~~~~~~Vd--d--iD~~q~~--~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~ 149 (347) T protein:vir:94 76 ERLSDKRKGIKHTEKVITIDGLLTADVMIF--D--IEDAMNH--YDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAA 149 (347) T ss_pred CCcCCCCCCCCcceEEEEecchhhhhHHhh--h--HHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 654322 23344433432 22222 221 1 1222211 1266677888999999999987752 11111 Q ss_pred --ccccccccccccc-ccc--cccccccHHHHHHHHHHhhhh-------hcCCEEEEcHHHHHHHHHhhc-CCCCEeccC Q lcl|Aclame:pro 368 --VSETQIYPVVGDA-WAT--NVTGTTNIQELLEKLSVATPK-------AADSTLVIHRNDLAAIRFLKD-KNGNYVFPV 434 (517) Q Consensus 368 --~~~~gi~~~~~~~-~~~--~~~~~~~~d~l~~~l~~~~~~-------~~~a~~vmn~~~~~~l~~lKD-~~Gryl~~~ 434 (517) ....|+....... ... ..+.....+.+.+++..+... ..+-..|++|..|..|..-++ .+..|.-+. T Consensus 150 ~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 229 (347) T protein:vir:94 150 SNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALI 229 (347) T ss_pred cccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccc Confidence 1111111100000 000 001111223344444332211 123468999999987743332 222344333 Q ss_pred CCCCCccceecCccceeccccCCceeee---ecCceEEE--------e---------------------------eehee Q lcl|Aclame:pro 435 GVSNQTIATHFGFNRLVQSVAVDEKTAV---SLSGYVTN--------G---------------------------SRGME 476 (517) Q Consensus 435 ~~~~~~~~~l~g~~~v~~~~~~~~~~~~---~~~~~~~~--------~---------------------------~~~~~ 476 (517) ...+|.+..++|++.+. +..+|..... ...+|.+. . .+.++ T Consensus 230 ~~~~G~Vg~i~G~~V~~-Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~ 308 (347) T protein:vir:94 230 DPETGNIRNVMGFVVVE-VPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLA 308 (347) T ss_pred cccccceEEEeceEEEe-cCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccccc Confidence 45567777888865444 4444421110 00011000 0 00111 Q ss_pred ehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 477 FEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 477 ~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) ....++...-...+......|..+.+|++.+..++++|- T Consensus 309 ~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 309 LERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred ccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 111111111112345566788899999999999988777 No 144 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=97.10 E-value=9.4e-05 Score=42.66 Aligned_cols=261 Identities=13% Similarity=0.057 Sum_probs=119.0 Q ss_pred HHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc---cceeeeecccccceeeecccccccccccce--- Q lcl|Aclame:pro 236 AWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGTDKTESNITLQ--- 309 (517) Q Consensus 236 ~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~--- 309 (517) +..+++....+...+.--++.++..+-+.....++...|..+.. .+.+|...-...+.-++||+..|-+.++.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~ 80 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDK 80 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeeee Confidence 11111111111111111122233222233333334333444443 355666665677888999999998888765 Q ss_pred eeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccc-c Q lcl|Aclame:pro 310 TRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTG-T 388 (517) Q Consensus 310 ~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~-~ 388 (517) ..+++.++++.-+ |-+-+..+.+.+. ...-.++|...++.++++.++.=-.+++- +....+ . T Consensus 81 t~t~kikK~rK~t--TdEAIqlsGygdp---vgead~qL~~~ia~kId~D~~~~lktat~------------t~tg~~lq 143 (295) T protein:vir:99 81 DYTVKWFKKRRAT--TAEAIARHGAARA---ITEADKRIMRELQNGIKDAFFTFLKTKPT------------KVKGVGLQ 143 (295) T ss_pred eeEEEeeeecccc--cHHHHHhcCCCch---hHHHHHHHHHHHHHhhhHHHHHHhccCce------------eeehhhHH Confidence 3667777777744 7788877888763 34467789999999999998853222110 000000 0 Q ss_pred ccHHHHHHHHHHhhhhh-cCCEEEEcHHHHHHHHHhhcCCCCEeccCC--CCCCccceecCccceeccccCCceeeee-- Q lcl|Aclame:pro 389 TNIQELLEKLSVATPKA-ADSTLVIHRNDLAAIRFLKDKNGNYVFPVG--VSNQTIATHFGFNRLVQSVAVDEKTAVS-- 463 (517) Q Consensus 389 ~~~d~l~~~l~~~~~~~-~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~--~~~~~~~~l~g~~~v~~~~~~~~~~~~~-- 463 (517) .....++.++....... .+.+.++||.+...+++-..- -|+.. .+..-...++|.-.++.+.-+++..++. T Consensus 144 ~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~----~~~~a~~fG~~~L~nfLG~q~II~S~kv~~G~~~aT~ 219 (295) T protein:vir:99 144 KALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKV----GADASNVFGMTLLKNFLGMQNVIVMPSVPEGKIYSTA 219 (295) T ss_pred HHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhcccc----ccchhhhhhhhhhhhhhccceEEEcccCCCceEEEee Confidence 01111222222222222 356899999999988754311 13221 1112222466655455554455433321 Q ss_pred cCceEEEe-e-eheeehhhhhcccc----------h--HHHHHhhh-hcc---eeecccceEEEEeC----CCCCC Q lcl|Aclame:pro 464 LSGYVTNG-S-RGMEFEQGTILVEN----------N--KEYLFEMP-ISG---SLEYKGTTAYGTYT----PPVAG 517 (517) Q Consensus 464 ~~~~~~~~-~-~~~~~~~d~~~~~n----------~--~~~~~~~r-vgg---~v~~~~a~~~~~~t----p~~a~ 517 (517) .+.-.+-+ + .+...-+-|.+..+ . ...-.|+. ++| --.+++..+.++.+ |.+-| T Consensus 220 ~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 220 VENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred ccceEEEEecCCchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCCCCC Confidence 11111100 0 00000011111110 0 00001111 111 23456778888883 33444 No 145 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=97.05 E-value=0.00019 Score=40.97 Aligned_cols=281 Identities=12% Similarity=0.068 Sum_probs=119.6 Q ss_pred HHHHhhccchhhHHHHhhhhhcccccc----cccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeecccccceee Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGISG----MPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTGH 295 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~ 295 (517) +.....+. .........+..+ ..+ ..+...+.......+.+.++++...+.+. .-........+..+ T Consensus 1 ~~~~~~~~------~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~ 73 (347) T protein:vir:33 1 MANIQGGQ------QIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYL 73 (347) T ss_pred CCCCccCc------ccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeee Confidence 00000000 0000000000000 011 23344455556666667777665443322 11223333445556 Q ss_pred eccccccc--ccccceeeEeeH--hhhh-HhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh-----cccc Q lcl|Aclame:pro 296 TTGTDKTE--SNITLQTRVLTP--QYVY-KYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII-----MGGV 365 (517) Q Consensus 296 ~eg~~~~~--~~~~f~~~~~~~--~~~~-~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l-----~G~G 365 (517) ..|+..+. .+.+..+.++.+ .++. .++. ..--..+..| +.+-+.++..+++++..|+.++ .++. T Consensus 74 ~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~Vd--diD~~q~~~D----~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~ 147 (347) T protein:vir:33 74 KPGENLDDKRKDIKHTEKVIHIDGLLTADVLIY--DIEDAMNHYD----VRAEYTAQLGESLAMAADGAVLAELAGLVNL 147 (347) T ss_pred cCCCCCCCCCCCCccceEEEEechhhhhhHHHh--hHHHHhcCCc----hhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 66665433 234555655542 2222 2222 1111122223 6667788999999999998886 2222 Q ss_pred cCcc-----ccccc--c-cccccccccccccccHHHHHHHHHHhhhh-----h--cCCEEEEcHHHHHHHHHhh-cCCCC Q lcl|Aclame:pro 366 TGVS-----ETQIY--P-VVGDAWATNVTGTTNIQELLEKLSVATPK-----A--ADSTLVIHRNDLAAIRFLK-DKNGN 429 (517) Q Consensus 366 ~~~~-----~~gi~--~-~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~--~~a~~vmn~~~~~~l~~lK-D~~Gr 429 (517) ...+ .++.- . ....+...........+.+.+++..+... . .+-.+|++|..|..|.+-+ -.+.. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d 227 (347) T protein:vir:33 148 PDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAAN 227 (347) T ss_pred hcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccc Confidence 1111 00000 0 00000001111112234444444333221 1 2346899999999886533 23444 Q ss_pred EeccCCCCCCccceecCccceeccccCCceee---------eecCce-----------------EE--------Eeeehe Q lcl|Aclame:pro 430 YVFPVGVSNQTIATHFGFNRLVQSVAVDEKTA---------VSLSGY-----------------VT--------NGSRGM 475 (517) Q Consensus 430 yl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~---------~~~~~~-----------------~~--------~~~~~~ 475 (517) |.-......+.+..++|++.+ .+..+|...+ +.+..| ++ .-.++. T Consensus 228 ~~~~~~~~~G~V~~i~G~~V~-~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~ 306 (347) T protein:vir:33 228 YQALLDPERGTIRNVMGFEVV-EVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDL 306 (347) T ss_pred cccccccccceeEEEeceeEE-EecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeece Confidence 543334556667777786544 3444443221 000000 00 001111 Q ss_pred eehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 476 EFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 476 ~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) +....++...-...++.....|..+.+|++.+...+ |-|+- T Consensus 307 ~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i~~-~~~~~ 347 (347) T protein:vir:33 307 ALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVL-PKVSE 347 (347) T ss_pred eeeeccchhhhhHhhhhhhhcCCceecccceEEEec-CCCCC Confidence 222222222222334555667888999998777653 44444 No 146 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=97.04 E-value=0.0002 Score=40.90 Aligned_cols=273 Identities=10% Similarity=0.045 Sum_probs=121.6 Q ss_pred Hhhccchh-hHHHHhhhhhcccccccccc-hhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceeeeccc Q lcl|Aclame:pro 225 MSASLTKD-PKAAWTAELKERGISGMPAP-AGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGHTTGT 299 (517) Q Consensus 225 ~~~~~~~~-~~~~~~~~~~~~~~~~~~vp-~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~eg~ 299 (517) |....... .+..+ .. .......+ ..+...+.+.....+.+++++++..+.+. .++ ......+..+..|+ T Consensus 1 m~~~~~~~~t~~~~----~~-~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~-~iG~~~~~~~~~g~ 74 (334) T protein:vir:80 1 MTYPAANTHTRPGW----GG-ANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVD-RVGASTIAGRKAGE 74 (334) T ss_pred CCCCcCCCcccccc----cc-ccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEe-eecceeeeeecCCC Confidence 11110000 00000 00 00011222 33445566666666777777776655432 222 33445566677777 Q ss_pred ccccccccceeeEeeHhh---hhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh----cccccCccc-- Q lcl|Aclame:pro 300 DKTESNITLQTRVLTPQY---VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII----MGGVTGVSE-- 370 (517) Q Consensus 300 ~~~~~~~~f~~~~~~~~~---~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l----~G~G~~~~~-- 370 (517) ....+.++.++.++.+-. ...++. . +++.... -.+.+.+.+++.++++++.|++++ .|.....+. T Consensus 75 ~l~~~~~~~~~~~l~ID~~l~~~~~Vd--d--iD~~q~~--~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~ 148 (334) T protein:vir:80 75 ELVVQKNVSDKLNLTVDTVLYARHFFD--K--FDEWTSN--LDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHL 148 (334) T ss_pred CCCCCCcccCceEEEEeeeeehhhhHh--h--HHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 666555555666655433 122221 1 2222211 127888889999999999998764 232221111 Q ss_pred -----ccccccccccccccccccccHHHHHHHHHHhh-----hhhc-----CCEEEEcHHHHHHHHHhhcCCC-CEeccC Q lcl|Aclame:pro 371 -----TQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-----PKAA-----DSTLVIHRNDLAAIRFLKDKNG-NYVFPV 434 (517) Q Consensus 371 -----~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-----~~~~-----~a~~vmn~~~~~~l~~lKD~~G-ryl~~~ 434 (517) .|+........ .+.......+.+..++..+. .+.+ .-..||+|..|..|..-+.--. .|.-.. T Consensus 149 ~~~~~~G~~~~~~~~g-~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~ 227 (334) T protein:vir:80 149 KPAFHDGILLPSTISG-LAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKE 227 (334) T ss_pred cccccCCcceeecccc-cccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccc Confidence 12222221111 11122233444444433221 1112 2468999999999876432111 121111 Q ss_pred ---CCCCCccceecCccceeccccCCceeee------ecCce--------EEEee-----------eheeehhhhhcccc Q lcl|Aclame:pro 435 ---GVSNQTIATHFGFNRLVQSVAVDEKTAV------SLSGY--------VTNGS-----------RGMEFEQGTILVEN 486 (517) Q Consensus 435 ---~~~~~~~~~l~g~~~v~~~~~~~~~~~~------~~~~~--------~~~~~-----------~~~~~~~d~~~~~n 486 (517) +...+.+..+.|+ .|+.+..+|....- .++.| .+... +..+.+.+. .. T Consensus 228 ~~~~~~~g~i~~v~G~-~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~---~~ 303 (334) T protein:vir:80 228 GGNSFVGGRIAMLNGV-RVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEK---KD 303 (334) T ss_pred ccccccceeEEEEece-EEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeech---hh Confidence 1234445566665 44445555543210 11111 01111 111222221 12 Q ss_pred hHHHH-HhhhhcceeecccceEEEEe--CCC Q lcl|Aclame:pro 487 NKEYL-FEMPISGSLEYKGTTAYGTY--TPP 514 (517) Q Consensus 487 ~~~~~-~~~rvgg~v~~~~a~~~~~~--tp~ 514 (517) +..+. .....|-.+.+|++.+...+ |-| T Consensus 304 ~~d~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 304 FGHYLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HHHHHHHHHHcCCceeccceEEEEEEeeecC Confidence 22222 23445779999998886555 666 No 147 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.00 E-value=4.9e-05 Score=44.21 Aligned_cols=229 Identities=13% Similarity=0.024 Sum_probs=121.2 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc--cc-ceeeeecccccceeeeccccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL--PT-LVVGGDNALTQGTGHTTGTDK 301 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~a~~~~eg~~~ 301 (517) +........ .+.. ......+......|++.+...++++...++... +. .....+++...++|..-+... T Consensus 1 m~~~~~~a~------TL~E--~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~ 72 (335) T protein:vir:73 1 MALIGQTLP------SLLD--IYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGV 72 (335) T ss_pred CCcCCCCch------hHHH--HHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCcc Confidence 000000000 0000 000111334445677888888888887776532 11 223445667788899888888 Q ss_pred ccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc--ccc------- Q lcl|Aclame:pro 302 TESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS--ETQ------- 372 (517) Q Consensus 302 ~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~--~~g------- 372 (517) +++..++.+++...+.+++...+.+.+..... + ...+.........+++.+....+|++||-...+ +.| T Consensus 73 ~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~G-n-~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~ 150 (335) T protein:vir:73 73 QPTKTQTVPVTDTTGMLYDLGFVDKALADRSN-N-AAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNT 150 (335) T ss_pred ccccceEEEEEEEEEEecchhhhhHHHHhhcC-C-HHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcC Confidence 89999999999999999999999987655443 2 334555555667889999999999999643221 111 Q ss_pred -------------------------------------cccccc------------------------------------- Q lcl|Aclame:pro 373 -------------------------------------IYPVVG------------------------------------- 378 (517) Q Consensus 373 -------------------------------------i~~~~~------------------------------------- 378 (517) |++... T Consensus 151 ~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i 230 (335) T protein:vir:73 151 LSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSV 230 (335) T ss_pred ccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEE Confidence 111000 Q ss_pred ----------cccccccc-ccccHHHHHHHHHHhhh-h-hc-----CCEEEEcHHHHHHHHHh-hcCCCCEeccCCCCCC Q lcl|Aclame:pro 379 ----------DAWATNVT-GTTNIQELLEKLSVATP-K-AA-----DSTLVIHRNDLAAIRFL-KDKNGNYVFPVGVSNQ 439 (517) Q Consensus 379 ----------~~~~~~~~-~~~~~d~l~~~l~~~~~-~-~~-----~a~~vmn~~~~~~l~~l-KD~~Gryl~~~~~~~~ 439 (517) +.+..... ......+|++.+..+.. + .+ ..+|.||++....|++. +++....+=... ..+ T Consensus 231 ~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~-~~g 309 (335) T protein:vir:73 231 RDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEE-YGG 309 (335) T ss_pred eCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeec-cCC Confidence 00000000 01111345555555532 1 12 36899999999999985 455443332222 333 Q ss_pred ccceecCccceecccc-CCceeeeec Q lcl|Aclame:pro 440 TIATHFGFNRLVQSVA-VDEKTAVSL 464 (517) Q Consensus 440 ~~~~l~g~~~v~~~~~-~~~~~~~~~ 464 (517) ..-+.|++-++...+. .....++.. T Consensus 310 ~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 310 KKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred ceeEEECCeEEEEEeeeecCcccccC Confidence 3333343333333222 221111111 No 148 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=96.81 E-value=0.00033 Score=39.70 Aligned_cols=282 Identities=12% Similarity=0.053 Sum_probs=116.9 Q ss_pred HHHHhhccchhhHHHHhhhhhcccccc--c-ccchhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeecccccceeee Q lcl|Aclame:pro 222 VAYMSASLTKDPKAAWTAELKERGISG--M-PAPAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTGHT 296 (517) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~~ 296 (517) +.....+... .......+..+ . .--..+...+.......+.+.++++...+... .-........+..+. T Consensus 1 ma~~~~~~~~------~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQI------GTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCcc------ccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeec Confidence 0000000000 00000000000 0 00122344555666666777777765444322 112223334455566 Q ss_pred ccccccc--ccccceeeEee--Hhhh-hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---cc---- Q lcl|Aclame:pro 297 TGTDKTE--SNITLQTRVLT--PQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---GG---- 364 (517) Q Consensus 297 eg~~~~~--~~~~f~~~~~~--~~~~-~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G~---- 364 (517) .|...+. .+.+..++++. -.++ ..++. ..--..+..| +.+-+.++..+++++..|+.++. +- T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~Vd--dlD~~q~~~D----~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIY--DIEDAMNHYD----VRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhh--hHHHHhcCCc----chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 6665533 33455565553 2223 32331 1111122223 66777788999999999988762 10 Q ss_pred -ccCccc--cccccc---ccccccccccccccHHHHHHHHHHhhh-----hh--cCCEEEEcHHHHHHHHHhhcC-CCCE Q lcl|Aclame:pro 365 -VTGVSE--TQIYPV---VGDAWATNVTGTTNIQELLEKLSVATP-----KA--ADSTLVIHRNDLAAIRFLKDK-NGNY 430 (517) Q Consensus 365 -G~~~~~--~gi~~~---~~~~~~~~~~~~~~~d~l~~~l~~~~~-----~~--~~a~~vmn~~~~~~l~~lKD~-~Gry 430 (517) .+..+. +|.... ...............+.+++++..+.. .. .+-.+|++|..|..|.+-.+- +..| T Consensus 149 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~ 228 (347) T protein:vir:15 149 DASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY 228 (347) T ss_pred ccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccc Confidence 011110 110000 000000001111123444554443221 11 234578899999998654322 2223 Q ss_pred eccCCCCCCccceecCccceeccccCCceee-------eecCceEEEe---------------------------eehee Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSVAVDEKTA-------VSLSGYVTNG---------------------------SRGME 476 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~-------~~~~~~~~~~---------------------------~~~~~ 476 (517) .=.....+|.+..++|++.+ .+..+|.... .....|.... .++.. T Consensus 229 ~~~~~~~~G~Vg~i~G~~V~-~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~ 307 (347) T protein:vir:15 229 QALIDHERGTIRNVMGFEVV-EVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLA 307 (347) T ss_pred cccccccceEEEEEeceEEE-ecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeecee Confidence 22223446667777776444 3444442111 1111111110 11112 Q ss_pred ehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 477 FEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 477 ~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ....++...-...++.....|..+.+|++.+... -|-|+- T Consensus 308 ~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~-~~~~~~ 347 (347) T protein:vir:15 308 LERARRANYQADQIIAKYAMGHGGLRPEAAGAIV-LPKVSE 347 (347) T ss_pred eeecccchhhhhhhehhhhcCCceeccccEEEEe-cCCCCC Confidence 2222222222223344556688899998877664 344444 No 149 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=96.74 E-value=0.00016 Score=41.40 Aligned_cols=259 Identities=12% Similarity=0.030 Sum_probs=119.2 Q ss_pred HHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc---ce---eeeecccccceeeecccccccccccc Q lcl|Aclame:pro 235 AAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT---LV---VGGDNALTQGTGHTTGTDKTESNITL 308 (517) Q Consensus 235 ~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~---~~---~~~~~~~~~a~~~~eg~~~~~~~~~f 308 (517) -....++....+-+...--++.++....+....-++...|..|... +. ++...-...+.-++||+..|-+.++. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 0011111111111111222333333333333333333334444332 22 22233335677889999999888775 Q ss_pred e---eeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccc Q lcl|Aclame:pro 309 Q---TRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNV 385 (517) Q Consensus 309 ~---~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~ 385 (517) . ..+++.++++.-+ |-+-|..+.+++. ...-.++|...++.++++.|+.=-.+ ++.....+. T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~a---Vgetd~qL~~~Iq~kIdnd~~~~lkt----------aT~t~~~t~ 145 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLA---INQTDNEMIKYVQKKFRAKFFETLKS----------AIENGKRTN 145 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCch---hHHHHHHHHHHHHhhhhHHHHHHHhh----------ccccccccc Confidence 4 4677788888754 7888877887653 34466788888999999888742111 110011111 Q ss_pred cccccHHHHHHHHHHhh-------hhhcCCEEEEcHHHHHHHHHhhcCCCCEeccC-CCCCCccceecCccceeccccCC Q lcl|Aclame:pro 386 TGTTNIQELLEKLSVAT-------PKAADSTLVIHRNDLAAIRFLKDKNGNYVFPV-GVSNQTIATHFGFNRLVQSVAVD 457 (517) Q Consensus 386 ~~~~~~d~l~~~l~~~~-------~~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~-~~~~~~~~~l~g~~~v~~~~~~~ 457 (517) ......+.+..++.... ....+.+++|||.+...+++ +++=. .+. ..+..-....+|.. ++.+.-++ T Consensus 146 ~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~--~A~i~--~~~t~fG~n~L~nfLG~~-II~S~kv~ 220 (303) T protein:vir:10 146 KTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLA--NGFIN--STGAQFGVNLLTPYVGVK-IVEFADVP 220 (303) T ss_pred ceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhh--cCCcc--hhhhhhhhhhhhhhhcce-EEEeccCC Confidence 12223444555544322 12234689999999998864 22110 000 00111122356653 44454444 Q ss_pred ceeeee--cCc----eEEE-eeeheeehhhhhcccc----------h--HHHHHhhh-hcc---eeecccceEEEEeCCC Q lcl|Aclame:pro 458 EKTAVS--LSG----YVTN-GSRGMEFEQGTILVEN----------N--KEYLFEMP-ISG---SLEYKGTTAYGTYTPP 514 (517) Q Consensus 458 ~~~~~~--~~~----~~~~-~~~~~~~~~d~~~~~n----------~--~~~~~~~r-vgg---~v~~~~a~~~~~~tp~ 514 (517) +..++. .+. |+-. ++++ .-|.++.+ . ...-.|+. ++| --.+++..+..++++. T Consensus 221 ~G~~~~T~~~Ni~~ay~~~~g~l~----~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 221 QGEVWMTVAENLNVAYANPRGELS----RAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred CceEEEeeccceEEEEecCchhhh----hhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEecc Confidence 433221 111 1100 1111 11111100 0 00001111 111 2346778889999888 Q ss_pred CCC Q lcl|Aclame:pro 515 VAG 517 (517) Q Consensus 515 ~a~ 517 (517) -++ T Consensus 297 e~~ 299 (303) T protein:vir:10 297 EAG 299 (303) T ss_pred ccC Confidence 777 No 150 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=96.69 E-value=0.0003 Score=39.89 Aligned_cols=296 Identities=10% Similarity=0.086 Sum_probs=122.3 Q ss_pred HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccceeeeecc-ccc--c Q lcl|Aclame:pro 216 KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDNA-LTQ--G 292 (517) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--a 292 (517) -.+++..+...........+. ........+...++......++.+.+.+++++.+++....+........ .+. - T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~---~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~ 77 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQK---DIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSG 77 (360) T ss_pred CcchhHHHHHhhhHHHHHHhh---hccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeec Confidence 000011111111111111111 0111112234456667778888899999999988877644322111000 000 0 Q ss_pred eeeeccccccc-ccccceeeEe-eHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcc- Q lcl|Aclame:pro 293 TGHTTGTDKTE-SNITLQTRVL-TPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVS- 369 (517) Q Consensus 293 ~~~~eg~~~~~-~~~~f~~~~~-~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~- 369 (517) ....|+...++ .+.+...+.. ..+.+.....+..+.+.+....-..+.++.|.+.|+++++.-++.-.++|+.+..+ T Consensus 78 r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~ 157 (360) T protein:vir:99 78 HTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNL 157 (360) T ss_pred cccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhccc Confidence 01112111111 1121212211 11111111223333333332222234678999999999999999999999865321 Q ss_pred --------c----ccccccccccc----------ccccc-----------------c----cccHHHHHHHH-HHhhhhh Q lcl|Aclame:pro 370 --------E----TQIYPVVGDAW----------ATNVT-----------------G----TTNIQELLEKL-SVATPKA 405 (517) Q Consensus 370 --------~----~gi~~~~~~~~----------~~~~~-----------------~----~~~~d~l~~~l-~~~~~~~ 405 (517) . -|.+..+.... .+... + ......++..+ ......| T Consensus 158 ~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~ky 237 (360) T protein:vir:99 158 QSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRY 237 (360) T ss_pred ccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhh Confidence 1 23333321100 00000 0 00122334333 3334445 Q ss_pred cC-----CEEEEcHHHHHH-HHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeee--cCceEEEeeeheee Q lcl|Aclame:pro 406 AD-----STLVIHRNDLAA-IRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVS--LSGYVTNGSRGMEF 477 (517) Q Consensus 406 ~~-----a~~vmn~~~~~~-l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 477 (517) .+ ..|+||+.+... .+.|.+-+. .|-..-...+.....+|++.+.+ +.+++..++. +...+.+--..+++ T Consensus 238 r~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LGd~~l~g~~~~~~~Gipi~~v-~~~pd~~~mlT~p~NLi~g~~~~iri 315 (360) T protein:vir:99 238 RESDAYSPVLMTSPNQVQSYTMSLTERED-PLGSAVIFGDSDITPFSYDLVGV-NGFPDEYMMFTDPNNLAFGLYEEMEL 315 (360) T ss_pred hcCcccceEEEccCchHHHHHHHHhccCc-ccchhheecccccccceeeeEEc-CCCCCCceEEeccCceeEEeeeeeEE Confidence 43 379999988544 444554442 12111112223344667654443 3455443332 33333322222222 Q ss_pred --hhhhhcccchH---HHHHhhhhcceeecccceEEEEe-CCCCC Q lcl|Aclame:pro 478 --EQGTILVENNK---EYLFEMPISGSLEYKGTTAYGTY-TPPVA 516 (517) Q Consensus 478 --~~d~~~~~n~~---~~~~~~rvgg~v~~~~a~~~~~~-tp~~a 516 (517) ..+.++..... .+.++..+.-.+..++|.++.+= .-|.| T Consensus 316 ~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 316 DQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 11111111111 11123345555677888887775 44566 No 151 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=96.33 E-value=0.00034 Score=39.63 Aligned_cols=262 Identities=9% Similarity=0.053 Sum_probs=101.2 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhh------hhceee-----eccccceeeeecc-cccceeeecccccccccccc Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSL------LPFIRH-----ENLPTLVVGGDNA-LTQGTGHTTGTDKTESNITL 308 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~------~~~~~~-----~~~~~~~~~~~~~-~~~a~~~~eg~~~~~~~~~f 308 (517) +...-...+.+|.-+..-+.+.......+ .+.... .+.....+|.... .+.+..+.++...+...++- T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 11000111222322222221111111111 111000 0112233444332 24555566666666555555 Q ss_pred eeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---c---cccCccccccccccccccc Q lcl|Aclame:pro 309 QTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---G---GVTGVSETQIYPVVGDAWA 382 (517) Q Consensus 309 ~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G---~G~~~~~~gi~~~~~~~~~ 382 (517) .+.....+..+.-..++.....-+.-| ....|.++|+..+.+..+..+|. | +.+-.+ ...+.. +.. T Consensus 81 ~~~~a~i~~~~kg~~~tD~a~~~sg~d----p~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~-~~~~d~---t~~ 152 (351) T protein:vir:15 81 GKQQGIKFYQTKAYGYTDLGTMISGAP----VQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIAN-SKVYDQ---TKV 152 (351) T ss_pred cceeEEEEeeccceehhhhhHhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcc-cceecc---ccc Confidence 544444444443333322211111112 23336677777777766666553 2 111000 001111 111 Q ss_pred ccccccccHHHHHHHHHHhhhhhcC--CEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCcee Q lcl|Aclame:pro 383 TNVTGTTNIQELLEKLSVATPKAAD--STLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKT 460 (517) Q Consensus 383 ~~~~~~~~~d~l~~~l~~~~~~~~~--a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~ 460 (517) .........+.+.+++...-....+ ++|+||+.++..|++.+-- .|+ ++......+.+++|..+ ++++.+|-.. T Consensus 153 ~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li--~~~-~~s~~~~~i~t~~G~~V-ivdD~~p~~~ 228 (351) T protein:vir:15 153 SPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLI--ETI-QPQNGATPFEAYNGLRI-VLDDDIEIDL 228 (351) T ss_pred cccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhh--hhc-cccccCcccceecceEE-EEcCCCcccc Confidence 1112223345666666654333322 6899999999999875410 111 12222344667777544 4444343211 Q ss_pred e-eec---CceEEE------eee--heeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 461 A-VSL---SGYVTN------GSR--GMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 461 ~-~~~---~~~~~~------~~~--~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) . ... ..|++. ... .++..++.....+...+..+.+ .+.+|..+.+-.-+.+.+| T Consensus 229 ~~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~ 294 (351) T protein:vir:15 229 TDKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRV---GTIHVAGTSIKASFSPSKA 294 (351) T ss_pred CCCCCceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeee---eeeeeeeeeecccccccCc Confidence 0 001 112221 111 1222233322222222333333 3566666666432222233 No 152 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=96.24 E-value=0.00074 Score=37.75 Aligned_cols=256 Identities=13% Similarity=0.050 Sum_probs=117.4 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecccc---ceeee-ecccccceeeecccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPT---LVVGG-DNALTQGTGHTTGTD 300 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~~a~~~~eg~~ 300 (517) +..... --.+++....+.+...--++.++....+....-++...|..+... +..++ ..-...+.-+.||+. T Consensus 1 ~~~~~~-----~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~ 75 (296) T protein:vir:98 1 MVTSRT-----YPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEV 75 (296) T ss_pred CCCccc-----cCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcc Confidence 000000 000011111111111122334443333333333444445555433 22223 344566778899999 Q ss_pred ccccccccee---eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccc Q lcl|Aclame:pro 301 KTESNITLQT---RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVV 377 (517) Q Consensus 301 ~~~~~~~f~~---~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~ 377 (517) .|-+.++... .+++.++++.-+ |-+-|..+.+.+. ...-.++|...++.++++.++.=-.+++ T Consensus 76 Iplskvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~a---Vgetd~qL~~~iq~kId~d~~t~LktaT--------- 141 (296) T protein:vir:98 76 IPLSKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEA---VTNTDNALVRQLQKKIRTDFVTALKTGT--------- 141 (296) T ss_pred cchhhheeeecceEEEEeecccccc--CHHHHHhhcCCch---hHHHHHHHHHHHHHhhhHHHHHHHhccc--------- Confidence 9988887654 667778877764 7888878887653 3446778999999999999875322211 Q ss_pred cccccccccccccHHHHHHHHHHh-------hhhh--cCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCcc Q lcl|Aclame:pro 378 GDAWATNVTGTTNIQELLEKLSVA-------TPKA--ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFN 448 (517) Q Consensus 378 ~~~~~~~~~~~~~~d~l~~~l~~~-------~~~~--~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~ 448 (517) .+.. ...+.+..++... ..+. .+.+.+|||.+...++ +|++ ---+.-.+..-....+|. T Consensus 142 ---~t~~----~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~yl--g~a~--it~qt~fG~tyl~nfLG~- 209 (296) T protein:vir:98 142 ---GTQD----ALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYI--AKAG--ITTQTAFGLTYLVDFTGT- 209 (296) T ss_pred ---ceee----echhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHh--cCCc--cchhheechhhhhhcccc- Confidence 0100 1123333333211 1111 3468999999988754 2321 000000001111124553 Q ss_pred ceeccccCCceeeee--cC----ceEE--Eeeeheeehhhhhccc----------ch--HHHHHhhh-hc---ceeeccc Q lcl|Aclame:pro 449 RLVQSVAVDEKTAVS--LS----GYVT--NGSRGMEFEQGTILVE----------NN--KEYLFEMP-IS---GSLEYKG 504 (517) Q Consensus 449 ~v~~~~~~~~~~~~~--~~----~~~~--~~~~~~~~~~d~~~~~----------n~--~~~~~~~r-vg---g~v~~~~ 504 (517) .++.+.-+++..++. .+ .|+- +++++ ..|.+.. +. ...-.|+. ++ .--.+++ T Consensus 210 ~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~----~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~d 285 (296) T protein:vir:98 210 VIISTNDVTKGEIWATVPENIIFAYINPNNSELA----KEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERID 285 (296) T ss_pred EEEEcCcCCCceEEEeeecceEEEeecccccchh----hhhccccccccceEEEeccccceeeehhHhHhHHHhcccccc Confidence 455554454433221 11 1111 01111 1111100 00 00001111 11 1234678 Q ss_pred ceEEEEeCCCC Q lcl|Aclame:pro 505 TTAYGTYTPPV 515 (517) Q Consensus 505 a~~~~~~tp~~ 515 (517) ..+.++.+|+| T Consensus 286 giv~~tI~~~~ 296 (296) T protein:vir:98 286 GIVKVTLTPGV 296 (296) T ss_pred eEEEEEecCCC Confidence 89999999999 No 153 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=96.10 E-value=0.00036 Score=39.43 Aligned_cols=256 Identities=9% Similarity=0.026 Sum_probs=102.6 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhh------h------hceee-eccccceeeeeccc-ccceeeecccccccccc Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSL------L------PFIRH-ENLPTLVVGGDNAL-TQGTGHTTGTDKTESNI 306 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~------~------~~~~~-~~~~~~~~~~~~~~-~~a~~~~eg~~~~~~~~ 306 (517) +...-...+.+|.-+..-+.+.......+ . .+... .+-....+|..... +.+.-+.++.+.+...+ T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 11000111222322222222111111111 1 11110 11112334443332 45666677777666666 Q ss_pred cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc---c----cccCcccccccccccc Q lcl|Aclame:pro 307 TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM---G----GVTGVSETQIYPVVGD 379 (517) Q Consensus 307 ~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~---G----~G~~~~~~gi~~~~~~ 379 (517) +..+.....+..+.-..++.....-+.-| -...+.++++..+.+..+..+|. | ++.+.+ .+..++ T Consensus 81 ~t~~~~a~i~~~~k~~~~tD~a~~~sg~d----p~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~---~~dvsa- 152 (324) T protein:vir:59 81 NAGQDKAVLILRGNAWSSHDLAATLSGSD----PMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDN---KLDISG- 152 (324) T ss_pred ccceeeEEEEeecCceeehhhhhhhccch----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc---eeeeec- Confidence 66555555554444333332221112212 22336777777777777766653 2 111110 011111 Q ss_pred cccccccccccHHHHHHHHHHhhhhh-cCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCc Q lcl|Aclame:pro 380 AWATNVTGTTNIQELLEKLSVATPKA-ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDE 458 (517) Q Consensus 380 ~~~~~~~~~~~~d~l~~~l~~~~~~~-~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~ 458 (517) ......+.+.+.+++...-... .-.+|+||+.++..|++++-- .|+. +........+++|..++ +++.+|- T Consensus 153 ----~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~-~s~~~~~i~~~~G~~Vi-vdD~~p~ 224 (324) T protein:vir:59 153 ----TADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLI--EFVK-DSQSGIRFPTYMNKRVI-VDDSMPV 224 (324) T ss_pred ----cccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhh--hhcc-ccccCceeeeecccEEE-EeCCCCc Confidence 1111123345555554422221 236899999999999976421 2332 22234456677775444 4443331 Q ss_pred e-eeeecC---ceEEE-------e-eeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 459 K-TAVSLS---GYVTN-------G-SRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 459 ~-~~~~~~---~~~~~-------~-~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) . .....+ .|.+. . ...+....+.+.......+..+.+ .+.+|..+.+-. ++++| T Consensus 225 ~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~---~~~~p~G~s~~~--~~~~~ 290 (324) T protein:vir:59 225 ETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKH---FVLHPRGVKFTE--NAMAG 290 (324) T ss_pred cccCCCCceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeE---EEeEeeeEEecc--cccCC Confidence 1 001111 12211 1 112222233333333334444444 344555554432 33455 No 154 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=96.09 E-value=0.00035 Score=39.51 Aligned_cols=284 Identities=7% Similarity=-0.022 Sum_probs=110.9 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeecc---c--cceeeeecccccceeeeccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENL---P--TLVVGGDNALTQGTGHTTGT 299 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~---~--~~~~~~~~~~~~a~~~~eg~ 299 (517) |...... ............. ...+|.-+...+.+.+.....+.+++..... . +..+|. .....+..+.+|. T Consensus 1 ~~~~~~~--~~~~~~~~~~t~~-~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~-~g~~~a~d~~~g~ 76 (381) T protein:vir:80 1 MATIQGT--GGYKGSAVDLSNV-QVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPN-ISRAAVYDKQPQT 76 (381) T ss_pred Cceeccc--ccccCcccchhhH-HhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeec-cCcceeeeecCCC Confidence 1111100 0001111111111 2235766666777777666666555433211 1 122332 2233455566666 Q ss_pred ccccccccceeeEeeHhhh-hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc----ccCcc--ccc Q lcl|Aclame:pro 300 DKTESNITLQTRVLTPQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG----VTGVS--ETQ 372 (517) Q Consensus 300 ~~~~~~~~f~~~~~~~~~~-~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~----G~~~~--~~g 372 (517) ..+..+.+...+++...+. +.-..++..-...+..| +.+.+.+++.++++++.|+.++.-- ....+ .++ T Consensus 77 ~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D----~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~ 152 (381) T protein:vir:80 77 PVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYT----LRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSY 152 (381) T ss_pred cccccccCCceEEEEEeeeeecceeechHHHHhhccC----hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 5555555555555554222 12234444333344444 6677778899999999988876321 11111 000 Q ss_pred c--cccccccccccc-cccccHHHHHHHHHH---hhhhhcCCEEEEcHHHHHHHHHhhc-CCCCEeccCCCCCCccceec Q lcl|Aclame:pro 373 I--YPVVGDAWATNV-TGTTNIQELLEKLSV---ATPKAADSTLVIHRNDLAAIRFLKD-KNGNYVFPVGVSNQTIATHF 445 (517) Q Consensus 373 i--~~~~~~~~~~~~-~~~~~~d~l~~~l~~---~~~~~~~a~~vmn~~~~~~l~~lKD-~~Gryl~~~~~~~~~~~~l~ 445 (517) . +.........+. ......+.++++... ...+..+-.+|++|..+..|.+... .+-.|.-.....++.+..++ T Consensus 153 ~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~ 232 (381) T protein:vir:80 153 DTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTIL 232 (381) T ss_pred cccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEc Confidence 0 000000000000 111122333333222 1112223478999999999875431 12234444445677777888 Q ss_pred CccceeccccCCceeeee----cC--ceE--------EEeee-----heeehhhhhcccchHHHHHhhhhc-ceeecccc Q lcl|Aclame:pro 446 GFNRLVQSVAVDEKTAVS----LS--GYV--------TNGSR-----GMEFEQGTILVENNKEYLFEMPIS-GSLEYKGT 505 (517) Q Consensus 446 g~~~v~~~~~~~~~~~~~----~~--~~~--------~~~~~-----~~~~~~d~~~~~n~~~~~~~~rvg-g~v~~~~a 505 (517) |+..+. +..+|...+.. .. ... ...+. .+.....++...-.....+....| +...+... T Consensus 233 G~~Vv~-Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~ 311 (381) T protein:vir:80 233 GMEVIV-TTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGG 311 (381) T ss_pred ceEEEe-ecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCC Confidence 865544 33444321100 00 000 00000 000111111111111111111110 12222222 Q ss_pred eEEEEeCCC---CCC Q lcl|Aclame:pro 506 TAYGTYTPP---VAG 517 (517) Q Consensus 506 ~~~~~~tp~---~a~ 517 (517) -..++++-. ++| T Consensus 312 ~~~~~~~~~~~~~~~ 326 (381) T protein:vir:80 312 QTLGSFGGANRWATA 326 (381) T ss_pred ceeeeehhhhhhhhh Confidence 333443222 122 No 155 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=95.98 E-value=0.0011 Score=36.81 Aligned_cols=272 Identities=10% Similarity=0.044 Sum_probs=111.9 Q ss_pred HhhccchhhHHHHhhhhhccccc----ccccc-hhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceeee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGIS----GMPAP-AGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGHT 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~----~~~vp-~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~ 296 (517) |.... .+...+.. ..... ..+...+.+.....+.++++.++..+.+. .++ ......+..+. T Consensus 1 ms~~n----------~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~-~iG~~~~~~~~ 69 (364) T protein:vir:10 1 MSNPN----------VLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNK-YIGETELQVLS 69 (364) T ss_pred CCCcc----------cccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEee-eeeeeEEeeec Confidence 11000 00000000 11111 22334455556666667776665554332 222 22334445555 Q ss_pred cccccccccccceeeEeeHhhh---hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh----cccccC-- Q lcl|Aclame:pro 297 TGTDKTESNITLQTRVLTPQYV---YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII----MGGVTG-- 367 (517) Q Consensus 297 eg~~~~~~~~~f~~~~~~~~~~---~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l----~G~G~~-- 367 (517) .|+....+.+..++.++.+-+. ..++. -|.+..-+ .-.+.+.+..++.+++++..|+.++ .+.=+. T Consensus 70 ~G~~ld~~~~~~~k~~itID~ll~a~~~V~----diDe~q~~-~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~ 144 (364) T protein:vir:10 70 PGKSPDASPTEFDKNRLVVDTTVIARNTVA----HFHDVQND-IDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTE 144 (364) T ss_pred cCcccCCCCcccCcEEEEecceeeechhhh----hHHHHhcC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 5655444445555555544321 12211 12222211 1124566677888888888888764 110000 Q ss_pred -ccccccccccccc---ccccccccccHHHHHHHHHHhhhh-------hcCCEEEEcHHHHHHHHHhhcC-CCCEecc-- Q lcl|Aclame:pro 368 -VSETQIYPVVGDA---WATNVTGTTNIQELLEKLSVATPK-------AADSTLVIHRNDLAAIRFLKDK-NGNYVFP-- 433 (517) Q Consensus 368 -~~~~gi~~~~~~~---~~~~~~~~~~~d~l~~~l~~~~~~-------~~~a~~vmn~~~~~~l~~lKD~-~Gryl~~-- 433 (517) .+..++....+.. ...........+.|.+++..+... ...-.+||+|..|..|.+-++= |-.|... T Consensus 145 ~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~ 224 (364) T protein:vir:10 145 AIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAAS 224 (364) T ss_pred ccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCC Confidence 0111111111110 111112233344555555443322 1234789999999888763211 1112111 Q ss_pred CCCCCCccceecCccceeccccCCcee----------------eeecCceEEEee-------------------eh--ee Q lcl|Aclame:pro 434 VGVSNQTIATHFGFNRLVQSVAVDEKT----------------AVSLSGYVTNGS-------------------RG--ME 476 (517) Q Consensus 434 ~~~~~~~~~~l~g~~~v~~~~~~~~~~----------------~~~~~~~~~~~~-------------------~~--~~ 476 (517) .+...+.+..+.|++ |+.+..+|... .++.+.|-...+ .+ .+ T Consensus 225 ~~~~~G~v~~v~Gv~-Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e 303 (364) T protein:vir:10 225 DNTVDGFVLKSWNTP-IVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGD 303 (364) T ss_pred CccccceeEEEeceE-EEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceee Confidence 112344455666654 34444443210 011222211111 11 11 Q ss_pred ehhhhhcccchHH-HHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 477 FEQGTILVENNKE-YLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 477 ~~~d~~~~~n~~~-~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) .+.+. ..+.. +-...-.|-.++||++++..+ +++.+| T Consensus 304 ~~~~~---~~~~~~ida~~a~G~g~lRPeaa~~i~-~~~~~~ 341 (364) T protein:vir:10 304 IFYEK---KEKTWYIDTFLAEGAIPDRWEAVAVVT-AADTAE 341 (364) T ss_pred eeecc---ceeeeeeeeehcccCcccCccceEEEE-ecCCCC Confidence 22111 11111 112344677899998887765 555555 No 156 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=95.93 E-value=0.00012 Score=42.17 Aligned_cols=267 Identities=7% Similarity=-0.084 Sum_probs=116.9 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeec-ccc----ceeeeecccccceeeeccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN-LPT----LVVGGDNALTQGTGHTTGT 299 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~a~~~~eg~ 299 (517) +.-..+.+....... .-+.+...+.+.....-..++++++.. .+. .........+.+.|+..++ T Consensus 1 ~~~~~a~~~~~f~~~-----------ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~ 69 (296) T protein:vir:10 1 MGVDKADAAGIWTVK-----------QLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYT 69 (296) T ss_pred CcccchhhhHHHHHH-----------HHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCc Confidence 000000000000000 001122223332222223333433322 111 1222233445666776554 Q ss_pred -ccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccc Q lcl|Aclame:pro 300 -DKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVG 378 (517) Q Consensus 300 -~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~ 378 (517) ..|..+..++.....++.++..+.++.+-++.+...-. .|..--....++++.+.++..+++|+.. ....|+++..+ T Consensus 70 ~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~ka~aA~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~ 147 (296) T protein:vir:10 70 DDLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQ-SLSTRKQSLAFEAHDKLLDKLVWSGSTA-HGIPSVFDYPN 147 (296) T ss_pred cccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhceEEEeeccc-ccceeEeecCC Confidence 35666777777777777777766666555544422111 1333344467788999999999999864 34567776544 Q ss_pred ccccccc----cccccHHHHHHHHHHhhh---h-hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccce Q lcl|Aclame:pro 379 DAWATNV----TGTTNIQELLEKLSVATP---K-AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRL 450 (517) Q Consensus 379 ~~~~~~~----~~~~~~d~l~~~l~~~~~---~-~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v 450 (517) ....... ..+...+++..++..... + ..+..++|+|..+..|...-+..|.-++.-. ...+....+ T Consensus 148 v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~i------k~~~~~l~i 221 (296) T protein:vir:10 148 INNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFF------RQNNSGVTV 221 (296) T ss_pred CccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHH------HHhcCCceE Confidence 3221111 112234555555544322 1 2456899999999998765444443332111 000111111 Q ss_pred eccccCCceeeeecCceEEEee---------eheeehhhhhcc-cch-HHHHHhhhhc-ceeecccceEEE---EeC Q lcl|Aclame:pro 451 VQSVAVDEKTAVSLSGYVTNGS---------RGMEFEQGTILV-ENN-KEYLFEMPIS-GSLEYKGTTAYG---TYT 512 (517) Q Consensus 451 ~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~d~~~~-~n~-~~~~~~~rvg-g~v~~~~a~~~~---~~t 512 (517) ...++..... ....+-.+.+. +.+.+ ...... .+. ..+....|+| ..|++|.++++. ||- T Consensus 222 ~~~~~l~~a~-~~g~~~~v~~~~~~~~~~~~v~~~~-~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 222 EFVQYLNDYN-GTGTSAAIAYEKDPNNMAIEIPEAT-NALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEeeeeccCC-CCcceEEEEEEcCCceEEEEcCcce-eeecccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 2111111110 00011111111 11110 000111 111 1222345564 689999999988 443 No 157 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=95.93 E-value=0.0012 Score=36.52 Aligned_cols=268 Identities=8% Similarity=0.012 Sum_probs=121.7 Q ss_pred HhhccchhhHHHHhhhhhcccccccc----c-chhhhhhHHHhHhhhhhhhhceeeecccc---ceeeeecccccceeee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMP----A-PAGILKRIQDAVNDEGSLLPFIRHENLPT---LVVGGDNALTQGTGHT 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~----v-p~~i~~~i~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~a~~~~ 296 (517) |..- ..+...+..+.. + -..+...+.+.....+.+++++++..+.+ ...+ ......+..+. T Consensus 1 ms~~----------~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~-~iG~~~~~~~~ 69 (335) T protein:vir:78 1 MSFL----------NDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLD-RLGNVEAKGRR 69 (335) T ss_pred CCcc----------ccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEe-eeeeeeecccc Confidence 1000 000001110000 0 12233445566666677777766555433 2233 33444556666 Q ss_pred cccccccccccceeeEeeHhh---hhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh----cccccCcc Q lcl|Aclame:pro 297 TGTDKTESNITLQTRVLTPQY---VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII----MGGVTGVS 369 (517) Q Consensus 297 eg~~~~~~~~~f~~~~~~~~~---~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l----~G~G~~~~ 369 (517) .|+....+.+..++..+.+-+ ...++. -+++..-+ -.+.+.+.+++.+++++..|+.++ .+.....+ T Consensus 70 pG~~l~~~~~~~~k~~itID~ll~a~~~Vd----dlDe~~~~--yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~ 143 (335) T protein:vir:78 70 AGEELERSRVVNDKWNLTVDTLLYLRHQFD----HQDEWTQS--FDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAP 143 (335) T ss_pred cCcccCCCCcccCCeEEEecceeechhhHh----hHHHhhcC--chhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 666554444555555554433 122211 12222211 127777888899999998888654 44333221 Q ss_pred c-------ccccccccccccccccccccHHHHHHHHHHhhh-----hhc-----CCEEEEcHHHHHHHHHhhcCCC-CEe Q lcl|Aclame:pro 370 E-------TQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-----KAA-----DSTLVIHRNDLAAIRFLKDKNG-NYV 431 (517) Q Consensus 370 ~-------~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-----~~~-----~a~~vmn~~~~~~l~~lKD~~G-ryl 431 (517) . .|+......+ ..+.....+.+.+++..+.. +.+ .-+.||+|..|..|..-+.--. .|. T Consensus 144 ~~~~~~~~~G~~~~~~~t---g~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~ 220 (335) T protein:vir:78 144 VDLEDAFSPGVLEKLDLT---GLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQ 220 (335) T ss_pred cccCCCcCCCcceeeeec---cccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccc Confidence 1 1222211111 11222334455554444321 122 2468999999999986432212 122 Q ss_pred cc---CCCCCCccceecCccceeccccCCceee--------eecCce------EEEee-----------eheeehhhhhc Q lcl|Aclame:pro 432 FP---VGVSNQTIATHFGFNRLVQSVAVDEKTA--------VSLSGY------VTNGS-----------RGMEFEQGTIL 483 (517) Q Consensus 432 ~~---~~~~~~~~~~l~g~~~v~~~~~~~~~~~--------~~~~~~------~~~~~-----------~~~~~~~d~~~ 483 (517) .- .+...+.+..+.|++ |+.+.++|.... ++...| .+... +..+.+. T Consensus 221 ~s~~~~~~~~g~v~~v~Gv~-V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~---- 295 (335) T protein:vir:78 221 ATGATNDYVKSRVAILNGVK-VLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWE---- 295 (335) T ss_pred ccccccccccceeEEeeceE-EEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceee---- Confidence 11 122345566677765 455556654321 111111 11111 1112221 Q ss_pred ccchHHHH--HhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 484 VENNKEYL--FEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 484 ~~n~~~~~--~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ......++ ...-.|-.++||++.+..++|=..|= T Consensus 296 ~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:78 296 DHDQFSWVLDTFQMYNIGARRPDTAGAIELKGIEAF 331 (335) T ss_pred ccchhhHhhhHHHHcCCcccCcceEEEEEecCCCcc Confidence 12222222 33446778999999999988655433 No 158 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=95.55 E-value=0.00078 Score=37.61 Aligned_cols=263 Identities=9% Similarity=0.009 Sum_probs=114.9 Q ss_pred hhccccccccc--chhhhhhHHHhHhhhhhhhhceeeec-cccc----eeeeecccccceeeecccc-cccccccceeeE Q lcl|Aclame:pro 241 LKERGISGMPA--PAGILKRIQDAVNDEGSLLPFIRHEN-LPTL----VVGGDNALTQGTGHTTGTD-KTESNITLQTRV 312 (517) Q Consensus 241 ~~~~~~~~~~v--p~~i~~~i~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~a~~~~eg~~-~~~~~~~f~~~~ 312 (517) +.......++. -..+...+.+.+......+.++.+.. .+.. ........+.+.++..++. .|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 11111111111 11222334444444444455544422 2211 1122233455666665543 466666666666 Q ss_pred eeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc-------- Q lcl|Aclame:pro 313 LTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN-------- 384 (517) Q Consensus 313 ~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~-------- 384 (517) .+...++.-+.++.+-++.+..--. .|..--....++++.+.+++.+++|+.. ....|+++..+...... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~~ 158 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGT-TVDAAKATTVRRAIAEKENSIAFRGEKK-YAIKGAFEATGIQIDVSPTTGVGNV 158 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhceEEeeeccc-ccceeeecCCCcccccccCcccccc Confidence 6666666655555544443321100 1333334567888999999999999874 34567776654211110 Q ss_pred -cccccc----HHHHHHHHHHhhh---h-hcCCEEEEcHHHHHHHHHhh--cCCCCEeccCCCCCCccceecCccceecc Q lcl|Aclame:pro 385 -VTGTTN----IQELLEKLSVATP---K-AADSTLVIHRNDLAAIRFLK--DKNGNYVFPVGVSNQTIATHFGFNRLVQS 453 (517) Q Consensus 385 -~~~~~~----~d~l~~~l~~~~~---~-~~~a~~vmn~~~~~~l~~lK--D~~Gryl~~~~~~~~~~~~l~g~~~v~~~ 453 (517) ...+.+ .+++..++..... + ..+..++|+|..|..|.... +..|.-+++-...+. ....++.. T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~------~~~~I~~~ 232 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNA------WFSAIVRV 232 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHc------CcceEEEc Confidence 011122 3344444433221 1 13467999999999997543 555544442211110 11112221 Q ss_pred ccCCceeeeecCceEEEe--------eeheeehhhhhcccchHHHH--Hhhhh-cceeecccceEEEEeC Q lcl|Aclame:pro 454 VAVDEKTAVSLSGYVTNG--------SRGMEFEQGTILVENNKEYL--FEMPI-SGSLEYKGTTAYGTYT 512 (517) Q Consensus 454 ~~~~~~~~~~~~~~~~~~--------~~~~~~~~d~~~~~n~~~~~--~~~rv-gg~v~~~~a~~~~~~t 512 (517) ++.........+.+++.. .+.+.+..-..-.++. .++ ...|+ |.-+++|.++++.+=- T Consensus 233 p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~-~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 233 PDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFP-RTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCc-eeEeeeeeeeEEEEEEccceEEEEecC Confidence 111111000001111110 1111110000011121 111 23455 4589999999988722 No 159 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=95.55 E-value=0.00046 Score=38.86 Aligned_cols=287 Identities=9% Similarity=-0.055 Sum_probs=116.6 Q ss_pred hhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccc---cchhhhhhHHHhHhhhhhhhhceeeec-ccc Q lcl|Aclame:pro 205 LKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMP---APAGILKRIQDAVNDEGSLLPFIRHEN-LPT 280 (517) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---vp~~i~~~i~~~~~~~~~~~~~~~~~~-~~~ 280 (517) .+ + ..+................ .+... ....++. .-..+...+.+.....-..+.++.+.. .+. T Consensus 1 ~~---~-~~~~~~~~~~~~~~~~~~~--~~~da------~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~ 68 (319) T protein:vir:10 1 MT---T-KKFDEADKSNVEMYLIQAG--VKQDA------AATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSP 68 (319) T ss_pred CC---C-cchhHHhhHHHHHHHhhcc--chhhh------hhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCC Confidence 00 0 0000000000000000000 00000 0000110 011122233333333333444444332 111 Q ss_pred ----ceeeeecccccceeeecccc-cccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 ----LVVGGDNALTQGTGHTTGTD-KTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMA 355 (517) Q Consensus 281 ----~~~~~~~~~~~a~~~~eg~~-~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~ 355 (517) .........+.+.|+..++. .|..+..++....++..++..+.++.+-+..+..--. .|..--....++++.+. T Consensus 69 ~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aA~~~~~~~ 147 (319) T protein:vir:10 69 TDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGR-PLSTRKASACQLAHDQL 147 (319) T ss_pred ceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHh Confidence 11222344466777766543 5666666777777777776666666554444421111 13333344677889999 Q ss_pred HHhhhhcccccCccccccccccccccccccc----cccc----HHHHHHHHHHhhh----hhcCCEEEEcHHHHHHHHHh Q lcl|Aclame:pro 356 VNRAIIMGGVTGVSETQIYPVVGDAWATNVT----GTTN----IQELLEKLSVATP----KAADSTLVIHRNDLAAIRFL 423 (517) Q Consensus 356 ~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~----~~~~----~d~l~~~l~~~~~----~~~~a~~vmn~~~~~~l~~l 423 (517) +++-+++|+... ...|+++..+........ .+.+ .+++..++..... ...+..++|+|..|..|... T Consensus 148 ~n~i~f~G~~~~-g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~ 226 (319) T protein:vir:10 148 VNRLVFKGSAPH-KIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIR 226 (319) T ss_pred hceEEEeecccc-cceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcc Confidence 999999998643 446777655432211111 1112 2334333333221 12456899999999999765 Q ss_pred hcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEe--------eeheeehhhhhcccchHHHH--Hh Q lcl|Aclame:pro 424 KDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNG--------SRGMEFEQGTILVENNKEYL--FE 493 (517) Q Consensus 424 KD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~d~~~~~n~~~~~--~~ 493 (517) ....|.-++.-.-.+.. .-.+...++.........+.+++.. .+.+. +.-.......-.+. .. T Consensus 227 ~~~~~~t~l~~lk~~~~------~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~-~~~~~~e~~~l~~~~~~~ 299 (319) T protein:vir:10 227 MPETTMSYLDYFKSQNS------GIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEA-FNMLPAQPKDLHFKVPCT 299 (319) T ss_pred cCCCCeeHHHHHHHhcC------CceEEEeeeecccCCCcceEEEEEecCCceEEEecCcc-eeeeeeeecCceEEEeee Confidence 55555444322111110 1111111111111000001111110 01111 00001111111111 23 Q ss_pred hhhc-ceeecccceEEEEeC Q lcl|Aclame:pro 494 MPIS-GSLEYKGTTAYGTYT 512 (517) Q Consensus 494 ~rvg-g~v~~~~a~~~~~~t 512 (517) .|+| .-|++|.++++.+=- T Consensus 300 ~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 300 SKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeEEEEEEccceeEeeecC Confidence 4444 578899999987722 No 160 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=95.40 E-value=0.0021 Score=35.22 Aligned_cols=273 Identities=11% Similarity=0.029 Sum_probs=114.0 Q ss_pred HhhccchhhHHHHhhhhhccccc----ccccc-hhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceeee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGIS----GMPAP-AGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGHT 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~----~~~vp-~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~ 296 (517) |.... .+...+.. ..... ..+...+.+.....+.+++++++..+.+. .++ +.....+..+. T Consensus 1 Ms~~n----------~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~-~iG~~~a~y~~ 69 (402) T protein:vir:97 1 MSTPN----------TLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNK-YLGETELQVLA 69 (402) T ss_pred CCCcc----------cccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEE-EEeeeEEeeec Confidence 11000 00000000 11111 22334455556566667776665554332 222 22334455566 Q ss_pred cccccccccccceeeEeeHhhh---hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc-----ccccCc Q lcl|Aclame:pro 297 TGTDKTESNITLQTRVLTPQYV---YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM-----GGVTGV 368 (517) Q Consensus 297 eg~~~~~~~~~f~~~~~~~~~~---~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~-----G~G~~~ 368 (517) .|+..-.+.+..++..+.+-+. ..++. -|.+..-+. -.+.+.+.+++.+++++..|+.++. |-.... T Consensus 70 ~G~~ldg~~~~~~k~~ItID~lL~a~~~V~----diDeaq~~y-D~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~ 144 (402) T protein:vir:97 70 PGQSPNATPTQADKNQLVIDTTVIARNTVA----HIHDVQGDI-DSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTK 144 (402) T ss_pred cccccCCCCcccccEEEEeCceeechhhhh----hHHHHHhcc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 6655444445555555544321 12221 112221111 1145667778888888888886641 110000 Q ss_pred cc---ccccccccc--cccccccccccHHHHHHHHHHhhh-------hhcCCEEEEcHHHHHHHHHhhcC-CCCEeccCC Q lcl|Aclame:pro 369 SE---TQIYPVVGD--AWATNVTGTTNIQELLEKLSVATP-------KAADSTLVIHRNDLAAIRFLKDK-NGNYVFPVG 435 (517) Q Consensus 369 ~~---~gi~~~~~~--~~~~~~~~~~~~d~l~~~l~~~~~-------~~~~a~~vmn~~~~~~l~~lKD~-~Gryl~~~~ 435 (517) +. .+.....+. ..........+.+.+.+++..+.. +...-+++|+|..|..|.+-++= |-.|....+ T Consensus 145 ~~~~~~~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~ 224 (402) T protein:vir:97 145 AERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS 224 (402) T ss_pred cccccCcccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccC Confidence 00 011110010 011111112344455544444321 12235789999999998864221 112221111 Q ss_pred --CCCCccceecCccceeccccCCcee----------eeecCceEE-----------Eee----------eheeehhhhh Q lcl|Aclame:pro 436 --VSNQTIATHFGFNRLVQSVAVDEKT----------AVSLSGYVT-----------NGS----------RGMEFEQGTI 482 (517) Q Consensus 436 --~~~~~~~~l~g~~~v~~~~~~~~~~----------~~~~~~~~~-----------~~~----------~~~~~~~d~~ 482 (517) ...+.+..+.|+.. +.+..+|... +.+...|-. .-+ +-.+.+.| T Consensus 225 g~~~~G~v~~v~Gv~V-v~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d-- 301 (402) T protein:vir:97 225 GATINGFVLSSYNCPV-IPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE-- 301 (402) T ss_pred CccccceeEEEeceEE-EecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhc-- Confidence 23444556666543 3333443211 111111211 000 01112222 Q ss_pred cccchHHHH-HhhhhcceeecccceEEEEe----CCCCCC Q lcl|Aclame:pro 483 LVENNKEYL-FEMPISGSLEYKGTTAYGTY----TPPVAG 517 (517) Q Consensus 483 ~~~n~~~~~-~~~rvgg~v~~~~a~~~~~~----tp~~a~ 517 (517) .+.+..+. ...-.|-.++||++.+..++ ||++|| T Consensus 302 -~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~ 340 (402) T protein:vir:97 302 -KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) T ss_pred -hhHHHHHHHHHHHhCCcccCccceEEEEEecccccccCC Confidence 12222222 33456778999999988765 778887 No 161 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=95.32 E-value=0.0023 Score=35.06 Aligned_cols=283 Identities=11% Similarity=0.020 Sum_probs=114.0 Q ss_pred HhhccchhhHHHHhhhhhcccccccccc-----hhhhhhHHHhHhhhhhhhhceeeeccccce--eeeecccccceeeec Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAP-----AGILKRIQDAVNDEGSLLPFIRHENLPTLV--VGGDNALTQGTGHTT 297 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp-----~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~a~~~~e 297 (517) +.......... .......+..+..-+ ..+..++.+.....+.++++++...+.+.. ...+.....+..+.- T Consensus 1 ~~~~~~~~~~~--~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~ 78 (375) T protein:vir:10 1 MANANQVALGR--SNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTP 78 (375) T ss_pred CccccccccCc--cccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecC Confidence 11000000000 000011111111111 223344556666667777777765554321 112233344555665 Q ss_pred ccccc---ccccccee--eEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----ccccCc Q lcl|Aclame:pro 298 GTDKT---ESNITLQT--RVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVTGV 368 (517) Q Consensus 298 g~~~~---~~~~~f~~--~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~~~ 368 (517) |+... ..+++..+ +++.-.++..+ .|. -+++.... ..|.+.+.+++.+++++..|+.++. |..... T Consensus 79 G~~i~~~~~~d~~~te~~l~ID~~~y~~~-~Vd--DiD~aqa~--~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~ 153 (375) T protein:vir:10 79 GTPILGNADKAPPVAEKTIVMDDLLISSA-FVY--DLDETLAH--YELRGEISKKIGYALAEKYDRLIFRSITRGARSAS 153 (375) T ss_pred CcCcCCccccCCCCCceEEEecchhhhhh-hHh--hHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 65432 22333333 33333333321 111 12222221 1277788889999999999987752 222211 Q ss_pred c-------cccccccccccccccccccccHHHHHHHHHHhhhh-----h--cCCEEEEcHHHHHHHHHhhcCCC----CE Q lcl|Aclame:pro 369 S-------ETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-----A--ADSTLVIHRNDLAAIRFLKDKNG----NY 430 (517) Q Consensus 369 ~-------~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~--~~a~~vmn~~~~~~l~~lKD~~G----ry 430 (517) + ..|+..... ..........+.+.+.+++..+... . .+-.+|++|..|..|.+-||.+. .| T Consensus 154 p~~~~~~~~~Gg~~i~~-~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~ 232 (375) T protein:vir:10 154 PVSATNFVEPGGTQIRV-GSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDV 232 (375) T ss_pred ccccccccccCcceeee-ccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecc Confidence 1 112111111 1111111122334444444433221 1 23468999999999877665431 12 Q ss_pred eccCCCCCCccceecCccceeccccCCcee--------------------------------eeecCceEEE-------- Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSVAVDEKT--------------------------------AVSLSGYVTN-------- 470 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~~~~~~~--------------------------------~~~~~~~~~~-------- 470 (517) .=......+....+.|++ ++.+..+|..+ ++.+++|... T Consensus 233 ~~~~~~~~g~v~~i~Gv~-V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~ 311 (375) T protein:vir:10 233 QGSALQSGNGVIEIAGIH-IYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSC 311 (375) T ss_pred cccceeccceEEEEeceE-EEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceE Confidence 111111222233455543 33333333211 1111222100 Q ss_pred ---ee---------eh--eeeh-hhhhcccchHHHHHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 471 ---GS---------RG--MEFE-QGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 471 ---~~---------~~--~~~~-~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) -. ++ ++.+ .+++.....-.+....-+|-.+.+|++.+.+... +.|= T Consensus 312 ~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~-~~~~ 372 (375) T protein:vir:10 312 GLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIG-ATAP 372 (375) T ss_pred EEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecC-cCcc Confidence 00 11 1111 1122221122234455577788999998777643 3222 No 162 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=95.31 E-value=0.0023 Score=35.03 Aligned_cols=272 Identities=8% Similarity=-0.033 Sum_probs=121.9 Q ss_pred Hhhcc--chhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccccce--eeeecccccceeeecccc Q lcl|Aclame:pro 225 MSASL--TKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLPTLV--VGGDNALTQGTGHTTGTD 300 (517) Q Consensus 225 ~~~~~--~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~a~~~~eg~~ 300 (517) |..-. .+.... ....+..+. -..+...+.+.....+.++++.++..+.+.- --+......+..+..|+. T Consensus 1 ms~~~~~tr~~~~------~s~~d~al~-le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~ 73 (335) T protein:vir:63 1 MSFLNDLTRPNYA------GKNADVDIH-LEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEE 73 (335) T ss_pred CCCcccchhhhcc------cccchhhee-hhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcC Confidence 11000 000000 000000011 1234445566666667777776665554321 112234455666666666 Q ss_pred cccccccceeeEeeHhhh---hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhh----hcccccCccc--- Q lcl|Aclame:pro 301 KTESNITLQTRVLTPQYV---YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAI----IMGGVTGVSE--- 370 (517) Q Consensus 301 ~~~~~~~f~~~~~~~~~~---~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~----l~G~G~~~~~--- 370 (517) ...+.+..++..+.+-++ ..++. -+++..-. -.+.+.+.+++.+++++..|+++ +.+.....+. T Consensus 74 l~~~~~~~~k~~itVD~ll~a~~~I~----dlDe~~~~--yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~ 147 (335) T protein:vir:63 74 LERSRVVNDKWNLTVDTLLYLRHQFD----HQDEWTQS--FDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLE 147 (335) T ss_pred cCCCCccccceEEEecceeechhhhh----hHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccC Confidence 554445555555554432 22211 12222211 12778888899999999999865 3443332211 Q ss_pred ----ccccccccccccccccccccHHHHHHHHHHhhh-----hhc-----CCEEEEcHHHHHHHHHhhcCCCC-Eecc-- Q lcl|Aclame:pro 371 ----TQIYPVVGDAWATNVTGTTNIQELLEKLSVATP-----KAA-----DSTLVIHRNDLAAIRFLKDKNGN-YVFP-- 433 (517) Q Consensus 371 ----~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~-----~~~-----~a~~vmn~~~~~~l~~lKD~~Gr-yl~~-- 433 (517) .|+......+ ..+.....+.+..++..+.. +.+ +-..+|+|..|..|..-+.--.+ |..- T Consensus 148 ~~~~~G~~~~~~~t---g~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~ 224 (335) T protein:vir:63 148 DAFSPGVLEKLDLT---GLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGA 224 (335) T ss_pred CCcCCCcceeeeec---cCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccc Confidence 1222211111 11222234444444443221 122 24689999999998874322222 2211 Q ss_pred -CCCCCCccceecCccceeccccCCceeee------ecCce--------EEEee-----------eheeehhhhhcccch Q lcl|Aclame:pro 434 -VGVSNQTIATHFGFNRLVQSVAVDEKTAV------SLSGY--------VTNGS-----------RGMEFEQGTILVENN 487 (517) Q Consensus 434 -~~~~~~~~~~l~g~~~v~~~~~~~~~~~~------~~~~~--------~~~~~-----------~~~~~~~d~~~~~n~ 487 (517) .+...+.+..+.|++ |+.+..+|..... .++.| .+... +..+.+.+ ... T Consensus 225 ~~~~~~g~v~~v~Gv~-V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~----~~~ 299 (335) T protein:vir:63 225 TNDYVKSRVAILNGVK-VLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWED----NEK 299 (335) T ss_pred cccccCceeEEeeceE-EEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeec----cch Confidence 112345566677765 4444555532210 01111 11111 11112211 122 Q ss_pred HHH--HHhhhhcceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 488 KEY--LFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 488 ~~~--~~~~rvgg~v~~~~a~~~~~~tp~~a~ 517 (517) ..+ -...-.|-.++||++++..++|=.+|= T Consensus 300 ~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~~~ 331 (335) T protein:vir:63 300 FSWVLDTFQMYNIGARRPDTAGAIELKGIGAF 331 (335) T ss_pred hhHHhHHHHHcCCcccccceEEEEEEcCCCce Confidence 222 233446778999999999998544333 No 163 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=94.92 E-value=0.00041 Score=39.17 Aligned_cols=283 Identities=7% Similarity=-0.073 Sum_probs=114.2 Q ss_pred hhhhHH---HHHHHHHHHHhhccchhhHHHHhhhhhcccccccccc--hhhhhhHHHhHhhhhhhhhceeeec-cc---- Q lcl|Aclame:pro 210 EATEFL---KTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAP--AGILKRIQDAVNDEGSLLPFIRHEN-LP---- 279 (517) Q Consensus 210 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp--~~i~~~i~~~~~~~~~~~~~~~~~~-~~---- 279 (517) ...++. .........+......+.. -+.+. +.+-..+.+.....-..++++++.. ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~-------------~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~e 67 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAG-------------IWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAK 67 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhH-------------HHHHHHHHHHHHHHhhhhccccccceeeccccCCCCcee Confidence 000000 0000000000000000000 01110 1112222222222222333333321 11 Q ss_pred cceeeeecccccceeeecccc-cccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 280 TLVVGGDNALTQGTGHTTGTD-KTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNR 358 (517) Q Consensus 280 ~~~~~~~~~~~~a~~~~eg~~-~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~ 358 (517) +......+..+.+.|++.++. .|..+..++....+++.++..+.++.+-+..+..--. .|..--....+..+.+.++. T Consensus 68 t~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aA~~~~~~~~n~ 146 (314) T protein:vir:10 68 YFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQ-SLSARKQALAFEAHDNLLDK 146 (314) T ss_pred EEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhce Confidence 112223344566677766543 5666777777777777777766666554444422111 13333344567788889999 Q ss_pred hhhcccccCcccccccccccccccccc----cccccHHHHHHHHHHhhh---h-hcCCEEEEcHHHHHHHHHhhcCCCCE Q lcl|Aclame:pro 359 AIIMGGVTGVSETQIYPVVGDAWATNV----TGTTNIQELLEKLSVATP---K-AADSTLVIHRNDLAAIRFLKDKNGNY 430 (517) Q Consensus 359 ~~l~G~G~~~~~~gi~~~~~~~~~~~~----~~~~~~d~l~~~l~~~~~---~-~~~a~~vmn~~~~~~l~~lKD~~Gry 430 (517) .+++|+... ...|+++.......... +.....+++..++..... + ..+..++|+|.-+..|...-+..|.- T Consensus 147 i~f~G~~~~-g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~t 225 (314) T protein:vir:10 147 LVWSGSAPH-GIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLS 225 (314) T ss_pred EEEeecccc-cceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCcc Confidence 999998643 45677765443222111 111123444444444322 1 23467999999998886544444433 Q ss_pred eccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeeh-----hhhh---cccchHHH--HHhhhh-cce Q lcl|Aclame:pro 431 VFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFE-----QGTI---LVENNKEY--LFEMPI-SGS 499 (517) Q Consensus 431 l~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~d~~---~~~n~~~~--~~~~rv-gg~ 499 (517) ++.-...+...- .+...++..... ...+.-.+.+..+-+.+ .++. .....-.+ -...|+ |.- T Consensus 226 vl~~l~~n~~~l------~I~~~~el~~ag-~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~Gv~ 298 (314) T protein:vir:10 226 YGELFTRNNPGL------TIRFLQFLDNYD-GAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPVTSKATGLI 298 (314) T ss_pred HHHHHHHhCCCc------EEEEcccccccC-CCcceEEEEEecCCcEEEEecCccceeecceecCceEEEcceeeeEEEE Confidence 321111111001 111111111111 11111111111111000 0110 11111111 123454 457 Q ss_pred eecccceEEEE-eCCC Q lcl|Aclame:pro 500 LEYKGTTAYGT-YTPP 514 (517) Q Consensus 500 v~~~~a~~~~~-~tp~ 514 (517) +++|.++++.+ +|=+ T Consensus 299 i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 299 VYRPLTMAVIKGITFA 314 (314) T ss_pred EECcceeEeeeeeecC Confidence 89999999765 1323 No 164 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=94.07 E-value=0.003 Score=34.41 Aligned_cols=271 Identities=8% Similarity=0.008 Sum_probs=110.6 Q ss_pred hhcc--ccccc---ccchhhhhhHHHhHhhhhhhhhceeeeccccceeeeeccc---ccceeeecccccccccccce-ee Q lcl|Aclame:pro 241 LKER--GISGM---PAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDNAL---TQGTGHTTGTDKTESNITLQ-TR 311 (517) Q Consensus 241 ~~~~--~~~~~---~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~a~~~~eg~~~~~~~~~f~-~~ 311 (517) +... ...++ ....++.+.|...-....|+..++......+.....++.. .......||.+.+....... .+ T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 1100 00011 1122344444444444445555443332222222222111 11122346665443221110 00 Q ss_pred EeeHhhhhHhHhhhHHH--HHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccc-----cCc---ccccccccccc-- Q lcl|Aclame:pro 312 VLTPQYVYKYIKLPKIV--MNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGV-----TGV---SETQIYPVVGD-- 379 (517) Q Consensus 312 ~~~~~~~~~~~~iS~~l--i~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G-----~~~---~~~gi~~~~~~-- 379 (517) .-..+-+...+.+|... +...... ..+. |-..+=...+.+-+|.++|+|.- ..+ ...|++.-... T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~--~ela-~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~ 157 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRK--NELA-YQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNG 157 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCcc--chhH-HHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCc Confidence 00001111112222211 1111111 1122 22223344578889999999852 111 12333321100 Q ss_pred ------------ccccc--ccc-cccHHHHHHHHHHhhh-hhcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCC------ Q lcl|Aclame:pro 380 ------------AWATN--VTG-TTNIQELLEKLSVATP-KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVS------ 437 (517) Q Consensus 380 ------------~~~~~--~~~-~~~~d~l~~~l~~~~~-~~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~------ 437 (517) ..... .+. ..+-+++.+++...-. ...+..+++|+..-..|.++-..++.|+..+... T Consensus 158 ~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~ 237 (317) T protein:vir:88 158 SLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQT 237 (317) T ss_pred eeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEE Confidence 00000 111 1233455555555432 2344578899999999998855566666432211 Q ss_pred CCccceecCccceeccccCCceeeeecC--ceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEe-CCC Q lcl|Aclame:pro 438 NQTIATHFGFNRLVQSVAVDEKTAVSLS--GYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTY-TPP 514 (517) Q Consensus 438 ~~~~~~l~g~~~v~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~-tp~ 514 (517) -....+-||...+++..+++...++.++ .+.+..=+.. ...+.--+-+..+..+..-+++.+++|.|.+.++- +.+ T Consensus 238 v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~-~~e~laKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l~~~ 316 (317) T protein:vir:88 238 VDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPF-FQHELAKTGDSEKRQLLVEYTFRVNNEKSGALIRDVVAQ 316 (317) T ss_pred EEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccc-eeeccCCCcccceeEEEEEEEEEEcCccceeEEEEeccc Confidence 1112233555555666666643333222 1111110110 00111112334566777888899999998886654 333 Q ss_pred C Q lcl|Aclame:pro 515 V 515 (517) Q Consensus 515 ~ 515 (517) + T Consensus 317 ~ 317 (317) T protein:vir:88 317 L 317 (317) T ss_pred C Confidence 4 No 165 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=93.83 E-value=0.0062 Score=32.68 Aligned_cols=267 Identities=12% Similarity=0.056 Sum_probs=94.8 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhh------hhceee-----eccccceeeeeccc-ccc Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSL------LPFIRH-----ENLPTLVVGGDNAL-TQG 292 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~------~~~~~~-----~~~~~~~~~~~~~~-~~a 292 (517) |. ... .-...+.+|.-+..-+.+.+...+.+ .+.... .+.....+|..... +.+ T Consensus 1 Ma------------~~~--T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~ 66 (330) T protein:vir:10 1 MA------------NEL--TKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDS 66 (330) T ss_pred CC------------CCc--eEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcc Confidence 00 000 00011223333222222222111111 111000 12222345544322 445 Q ss_pred eeeeccc-ccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc------ccc Q lcl|Aclame:pro 293 TGHTTGT-DKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM------GGV 365 (517) Q Consensus 293 ~~~~eg~-~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~------G~G 365 (517) .-+.+|. ..+...++-.+.....+..+.-..++.....-+.-| -...+.++++....+..+..+|. ++. T Consensus 67 ~~~~dg~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~d----p~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~ 142 (330) T protein:vir:10 67 EVLGNGDKALETGKITAGADIACVLYRGRGWAANELTGVVAGSD----PVRAILNRIGAYWLREDQKALIATLNGIFATG 142 (330) T ss_pred cccCCCccccchhhcccceeEEEEEeecceeeehhhhhhhcchh----HHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhh Confidence 5556664 355444554444444444443333322111111111 22336666666655555544442 111 Q ss_pred cCcccccccccccccccccccccccHHHHHHHHHHhhhhh-cCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCcccee Q lcl|Aclame:pro 366 TGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKA-ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATH 444 (517) Q Consensus 366 ~~~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~-~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l 444 (517) ..... +-+.................+.+.++....-... .-.+|+||+.++..|++.+=- .|+ ++......+.++ T Consensus 143 ~~~~~-~~~~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~-~~s~~~~~i~~~ 218 (330) T protein:vir:10 143 TAGEK-GALEETHVSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLI--QYI-QPTTATINIPTY 218 (330) T ss_pred hcccc-hhhhhhheecccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhh--hhh-cccccCcccccc Confidence 11110 0000000001111111222344555433321111 236899999999999875311 122 222334456677 Q ss_pred cCccceeccccCCce----eeeecC--ceEEEe-eeh--eeehhhhhcccchHHHHHhhhhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 445 FGFNRLVQSVAVDEK----TAVSLS--GYVTNG-SRG--MEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 445 ~g~~~v~~~~~~~~~----~~~~~~--~~~~~~-~~~--~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~tp~~ 515 (517) +|.. |++++.++.. +.+.+. .+.... ... +...-+.+.......+..+.+ .+.+|..+.+-.-..+. T Consensus 219 ~G~~-VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~ 294 (330) T protein:vir:10 219 LGYR-VIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRA---LVMHPYGVKWTGAEVDA 294 (330) T ss_pred cceE-EEEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeE---EEeeeeeeeeccccccc Confidence 7754 4444444311 111111 111111 000 111112222333333333333 34556555554322222 Q ss_pred CC Q lcl|Aclame:pro 516 AG 517 (517) Q Consensus 516 a~ 517 (517) +| T Consensus 295 ~~ 296 (330) T protein:vir:10 295 GN 296 (330) T ss_pred Cc Confidence 33 No 166 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=92.66 E-value=0.005 Score=33.19 Aligned_cols=296 Identities=7% Similarity=-0.087 Sum_probs=112.8 Q ss_pred hhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccc--hhhhhhHHHhHhhhhhhhhceeee Q lcl|Aclame:pro 199 ILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAP--AGILKRIQDAVNDEGSLLPFIRHE 276 (517) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp--~~i~~~i~~~~~~~~~~~~~~~~~ 276 (517) .++ ....++++... ...+.......... ...+..+...+.+. ..+...+.+.....-..+.++.+. T Consensus 1 ~~~---~~~~~~~~~d~-~~~~~~a~~~~~~~--------~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~ 68 (329) T protein:vir:79 1 MRG---NIMSKEMKYDE-FEANVIANHMQLRG--------AKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVT 68 (329) T ss_pred Ccc---chhhhhhccch-hhhhhHhhhccccc--------ceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccc Confidence 000 00011111000 00000000000000 00000000001110 112223333333333344444433 Q ss_pred c-ccc----ceeeeecccccceeeecc-cccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHH Q lcl|Aclame:pro 277 N-LPT----LVVGGDNALTQGTGHTTG-TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPD 350 (517) Q Consensus 277 ~-~~~----~~~~~~~~~~~a~~~~eg-~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~ 350 (517) . .+. ......+..+.+.|++.+ ...|..+..+.....+++.++..+.++.+-+..+..--. .|..--....++ T Consensus 69 ~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aA~~ 147 (329) T protein:vir:79 69 SELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGK-SLSTRKANAAQN 147 (329) T ss_pred cCCCCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCC-ChHHHHHHHHHH Confidence 2 111 122223344566676654 345655555666555555555544554443333321100 133334445678 Q ss_pred HHHHHHHhhhhcccccCcccccccccccccccccc------ccccc----HHHHHHHHHHhhh---h-hcCCEEEEcHHH Q lcl|Aclame:pro 351 MVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNV------TGTTN----IQELLEKLSVATP---K-AADSTLVIHRND 416 (517) Q Consensus 351 ~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~------~~~~~----~d~l~~~l~~~~~---~-~~~a~~vmn~~~ 416 (517) .+.+.++.-+++|++. ....|+++.......... ....+ .+++..++..... + ..+..++|+|.- T Consensus 148 ~~~~~~n~i~f~G~~~-~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~ 226 (329) T protein:vir:79 148 AHDQLVNHLVFKGSKP-HKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSM 226 (329) T ss_pred HHHHhhccEEEeeccc-ccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHH Confidence 8889999999999864 334577765443211111 11122 3344444443322 1 235689999999 Q ss_pred HHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheee--------hhhhhcccchH Q lcl|Aclame:pro 417 LAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEF--------EQGTILVENNK 488 (517) Q Consensus 417 ~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~d~~~~~n~~ 488 (517) +..|.....+.|.-++.-.-.+.. ...+...++....... ..+-.+.+..+-+. +...-.+...- T Consensus 227 ~~~L~~~~~~~~~tvl~~lk~~~~------~l~I~~~~el~~ag~~-g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~ 299 (329) T protein:vir:79 227 RKVLMVRMPETTMSYLDYFKQQNG------GITIESISELEDIDGA-GTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDL 299 (329) T ss_pred HHHhhcccCCCCccHHHHHHHhCC------CcEEEEcccccccCCC-CceEEEEEecCCceEEEecCcceeeeeceecCc Confidence 999876555556443321111110 0112222222111100 01111111111000 00000111111 Q ss_pred HHH--Hhhhhc-ceeecccceEEEEeCCCCCC Q lcl|Aclame:pro 489 EYL--FEMPIS-GSLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 489 ~~~--~~~rvg-g~v~~~~a~~~~~~tp~~a~ 517 (517) .+. ...|+| .-+++|.++++.+=-- -| T Consensus 300 ~~~v~~~~r~~Gv~i~~P~ai~~~dGI~--~~ 329 (329) T protein:vir:79 300 HFKVPCTSKCTGLTIYRPLTLVLIKGLV--VG 329 (329) T ss_pred eEEEceeeeEEEEEEECcceeeeeeeee--eC Confidence 111 234554 4788999999766111 11 No 167 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=91.72 E-value=0.015 Score=30.66 Aligned_cols=272 Identities=10% Similarity=0.071 Sum_probs=109.3 Q ss_pred HhhccchhhHHHHhhhhhccccccccc-----chhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceeee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPA-----PAGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGHT 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~v-----p~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~ 296 (517) |.... .+...+..+..- -..+...+.+.....+.++++.++..+.+. ..+ +.....+..+. T Consensus 1 Ms~~n----------~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~-~lG~s~a~y~~ 69 (400) T protein:vir:10 1 MSTPN----------NLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNK-YLGETELQVLA 69 (400) T ss_pred CCCCc----------cccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEE-EeeeeEEeeec Confidence 11000 000000000000 112333445556666677777666555432 222 23445666777 Q ss_pred cccccccccccceeeEeeHhhh---hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh----ccc-c-cC Q lcl|Aclame:pro 297 TGTDKTESNITLQTRVLTPQYV---YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII----MGG-V-TG 367 (517) Q Consensus 297 eg~~~~~~~~~f~~~~~~~~~~---~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l----~G~-G-~~ 367 (517) .|+..-.+.+..++..+.+-+. .+.+. .+.+..-+. -.+.+-+.+++.+++++..|+.++ .+. - +. T Consensus 70 pG~~ldg~~~~~dk~~ItIDtLL~a~~~V~----dlDd~q~~y-D~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~ 144 (400) T protein:vir:10 70 PGQSPAATSTQADKNQLVIDATVIARNTVA----HLHDVQGDI-DSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQ 144 (400) T ss_pred CCCCcCCCCcccCcEEEEeCceeeecchhh----hHHHHhhcc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 7766544444455554443321 11111 011211111 013455666777788888777654 221 0 00 Q ss_pred cc--cccccccccc--cccccccccccHHHHHHHHHHhhhhh-------cCCEEEEcHHHHHHHHHhhcC--CCCEeccC Q lcl|Aclame:pro 368 VS--ETQIYPVVGD--AWATNVTGTTNIQELLEKLSVATPKA-------ADSTLVIHRNDLAAIRFLKDK--NGNYVFPV 434 (517) Q Consensus 368 ~~--~~gi~~~~~~--~~~~~~~~~~~~d~l~~~l~~~~~~~-------~~a~~vmn~~~~~~l~~lKD~--~Gryl~~~ 434 (517) .+ ..++...... ..........+.+.+..++..+...+ ..-+++|.|..|..|..- |. |-.|.... T Consensus 145 ~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~-dkLvnrdf~~s~ 223 (400) T protein:vir:10 145 AKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDA-DRIVDKSYTISQ 223 (400) T ss_pred cccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhC-CcccchhccccC Confidence 00 1111111000 01111112223344544444332211 123455666666666422 20 11122111 Q ss_pred --CCCCCccceecCccceeccccCCcee----------eeecCceEE------------------Ee---eeheeehhhh Q lcl|Aclame:pro 435 --GVSNQTIATHFGFNRLVQSVAVDEKT----------AVSLSGYVT------------------NG---SRGMEFEQGT 481 (517) Q Consensus 435 --~~~~~~~~~l~g~~~v~~~~~~~~~~----------~~~~~~~~~------------------~~---~~~~~~~~d~ 481 (517) +...+.+..+.|++.+ .+..+|... +...+.|-. .. ++..+.++| T Consensus 224 ~g~~~~g~v~~v~Gv~Iv-~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d- 301 (400) T protein:vir:10 224 SGATIQGFVLSSYNCPVI-PSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYE- 301 (400) T ss_pred CCccccceEEEEeceEEE-eeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccc- Confidence 1223334455665443 333333211 111112211 01 011122222 Q ss_pred hcccchHHHH-HhhhhcceeecccceEEEEe----CCCCCC Q lcl|Aclame:pro 482 ILVENNKEYL-FEMPISGSLEYKGTTAYGTY----TPPVAG 517 (517) Q Consensus 482 ~~~~n~~~~~-~~~rvgg~v~~~~a~~~~~~----tp~~a~ 517 (517) .+.+..++ ...-.|-.+++|++.+..+. ||+||| T Consensus 302 --~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~ 340 (400) T protein:vir:10 302 --KKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDS 340 (400) T ss_pred --hhhHHHHHHHHHHhCCcccchhheEEEEecCCccccccc Confidence 22223332 34556778999999998877 888997 No 168 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=90.50 E-value=0.021 Score=29.84 Aligned_cols=280 Identities=9% Similarity=-0.010 Sum_probs=110.5 Q ss_pred HHHHHHHhh-ccchhhHHHHhhhhhccccccccc-chhhhhhHHHhHhhhhhhhhceeeeccccc--eeeeeccccccee Q lcl|Aclame:pro 219 EAEVAYMSA-SLTKDPKAAWTAELKERGISGMPA-PAGILKRIQDAVNDEGSLLPFIRHENLPTL--VVGGDNALTQGTG 294 (517) Q Consensus 219 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v-p~~i~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~ 294 (517) ...+..+.. +..+.. .....++..+.. -..+...+.+.....+.++++++...+.+. ..........+.. T Consensus 1 ~~~~~~~~~~~~~~~~------~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~ 74 (332) T protein:vir:78 1 MTTLSNFSLPNQANGG------ARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGY 74 (332) T ss_pred CcccccccCCccccCC------ccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEee Confidence 000000000 000000 000011111001 122334455566666667777665444322 1112233344555 Q ss_pred eecccccc-cccccceeeEeeH--hhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc----ccccC Q lcl|Aclame:pro 295 HTTGTDKT-ESNITLQTRVLTP--QYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVTG 367 (517) Q Consensus 295 ~~eg~~~~-~~~~~f~~~~~~~--~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~~ 367 (517) +..|.... ..+++-+++++.+ .++..+. +.. +++...+ ..|.+-+.++..+++++..|+.++. +...+ T Consensus 75 ~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~~-Vdd--iD~~q~~--~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~ 149 (332) T protein:vir:78 75 HTPGTPIVGDAGIKANEKTLVMDDLLVSSQF-VYS--LDEIFSQ--YSTRAEVSKQIGEALATHYDERIARVLAKASAEA 149 (332) T ss_pred ecCCCCCCCCCCCCCceEEEEEehhhhhHHH-HHh--HHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 66665542 2234444444433 2333321 111 2222221 1277778889999999999977652 22222 Q ss_pred cccccccccccccccccccccccHHHHHHHHHHhhhh-----hc--CCEEEEcHHHHHHHHHhhcCC--CC-Eec-cCCC Q lcl|Aclame:pro 368 VSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK-----AA--DSTLVIHRNDLAAIRFLKDKN--GN-YVF-PVGV 436 (517) Q Consensus 368 ~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~-----~~--~a~~vmn~~~~~~l~~lKD~~--Gr-yl~-~~~~ 436 (517) .+..+... +.....+.+...+.+.+.+++..+... .+ +-.+|++|..|..|.+.+|.. .+ +.- +... T Consensus 150 ~~~~~~~g--~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~ 227 (332) T protein:vir:78 150 SPVTGEPG--GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDM 227 (332) T ss_pred Cccccccc--ccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccce Confidence 22211100 000111112223334444444433222 22 234677999999987654321 00 000 0111 Q ss_pred CCCc-cceecCccceeccccCCceee---------eecCceEE--------E-ee--------eh--eeeh-hhhhcccc Q lcl|Aclame:pro 437 SNQT-IATHFGFNRLVQSVAVDEKTA---------VSLSGYVT--------N-GS--------RG--MEFE-QGTILVEN 486 (517) Q Consensus 437 ~~~~-~~~l~g~~~v~~~~~~~~~~~---------~~~~~~~~--------~-~~--------~~--~~~~-~d~~~~~n 486 (517) -++. +..+.|++ |+.+..+|..+. +.++.|.. + -+ ++ ++.- .+++...- T Consensus 228 ~~g~~i~~i~G~~-V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~ 306 (332) T protein:vir:78 228 NSGKGLYSIAGIR-ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ 306 (332) T ss_pred ecceeeeEEeeeE-EEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh Confidence 1222 44556654 444444443211 11112210 0 00 11 1110 11111111 Q ss_pred hHHHHHhhhhcceeecccceEEEEeC Q lcl|Aclame:pro 487 NKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) Q Consensus 487 ~~~~~~~~rvgg~v~~~~a~~~~~~t 512 (517) ...++.....|..+.+|++.+...-- T Consensus 307 ~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred HhhhhhhhhhcCceecccceEEEeeC Confidence 12334445677889999988776633 No 169 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=89.49 E-value=0.0095 Score=31.68 Aligned_cols=262 Identities=11% Similarity=0.013 Sum_probs=103.7 Q ss_pred ccchhhHHHHhhhhhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc-----cceeeeecccccce--eeecc-c Q lcl|Aclame:pro 228 SLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP-----TLVVGGDNALTQGT--GHTTG-T 299 (517) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~a~--~~~eg-~ 299 (517) ..+-.+-....+ .+-.++.+.....-...+++++.... ..........+.+. |...+ . T Consensus 1 ~~~lafl~~qL~--------------~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~ 66 (304) T protein:vir:52 1 MSLLAYVKNGLT--------------AVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTS 66 (304) T ss_pred CchHHHHHHHHH--------------HHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCC Confidence 000000000000 01111111111111122232222211 11122223334454 44433 4 Q ss_pred ccccccccceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccc Q lcl|Aclame:pro 300 DKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGD 379 (517) Q Consensus 300 ~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~ 379 (517) ..|..+..+++...+++.++.-+..+.+-+..+..--- .|..-=..-+.+++.+.+++..+.|+-......|+++.... T Consensus 67 dip~vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~-~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v 145 (304) T protein:vir:52 67 TLDQVEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGL-ALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSV 145 (304) T ss_pred ccceeecccceeEEEEEEEeeeeeecHHHHHHHHHhCC-CcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCc Confidence 45666666666666555555444444333332221000 02222223355677788888899997433445677765443 Q ss_pred ccccc-------ccccccHHHHHHHHHHhhhh--------hcCCEEEEcHHHHHHHHHhh-cCCCCEec----cCCCCCC Q lcl|Aclame:pro 380 AWATN-------VTGTTNIQELLEKLSVATPK--------AADSTLVIHRNDLAAIRFLK-DKNGNYVF----PVGVSNQ 439 (517) Q Consensus 380 ~~~~~-------~~~~~~~d~l~~~l~~~~~~--------~~~a~~vmn~~~~~~l~~lK-D~~Gryl~----~~~~~~~ 439 (517) ..... ...+.+.+.++..++.+... ..+..++|.|+.+..|.... +..|.-+| +..+. . T Consensus 146 ~~~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~-~ 224 (304) T protein:vir:52 146 EVYAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSA-A 224 (304) T ss_pred ceeeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhccc-c Confidence 22111 11122344444444333211 13568999999999996543 22222222 11110 0 Q ss_pred ccceecCcc-ceeccccCCceeeee-cCceEEEeeehe---eeh--hhhh---c-ccchHHH--HHhhhhcc-eeecccc Q lcl|Aclame:pro 440 TIATHFGFN-RLVQSVAVDEKTAVS-LSGYVTNGSRGM---EFE--QGTI---L-VENNKEY--LFEMPISG-SLEYKGT 505 (517) Q Consensus 440 ~~~~l~g~~-~v~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~--~d~~---~-~~n~~~~--~~~~rvgg-~v~~~~a 505 (517) . |.+ .+....+--. .++. ..+-.++++..- .+. ..+. . .++...| -...|+|| .++.|.+ T Consensus 225 ~-----g~~l~I~~v~~~~~-~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a 298 (304) T protein:vir:52 225 A-----GRQVAIKALPSNYG-TRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDS 298 (304) T ss_pred c-----CCcceEEEeccccc-ccCCCCceEEEEEecChhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccce Confidence 0 111 1111111111 1111 112222332211 110 0111 1 1222222 24566665 8899999 Q ss_pred eEEEEe Q lcl|Aclame:pro 506 TAYGTY 511 (517) Q Consensus 506 ~~~~~~ 511 (517) ++|.++ T Consensus 299 ~~y~D~ 304 (304) T protein:vir:52 299 ALYVDY 304 (304) T ss_pred eeeecC Confidence 999999 No 170 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=272 Identities=10% Similarity=0.010 Sum_probs=105.4 Q ss_pred hhhhcc-cccc-cccchhhhhhHHHhHhhhhhhhhceeeeccccc-ee-eeecccccceeeecccccccccccce--eeE Q lcl|Aclame:pro 239 AELKER-GISG-MPAPAGILKRIQDAVNDEGSLLPFIRHENLPTL-VV-GGDNALTQGTGHTTGTDKTESNITLQ--TRV 312 (517) Q Consensus 239 ~~~~~~-~~~~-~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~a~~~~eg~~~~~~~~~f~--~~~ 312 (517) ..+.++ ..+. +.+|+-...+|+.-++.......+.+....+.+ .+ ..........-+..++..+-.+++-. .+. T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~~i~~d~ltt~~~~l~ 80 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQGDFTFDNLDTGEISII 80 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCCCcccccCCCceEEEE Confidence 111111 1111 223554545555333333222222222111111 00 00111111122223333222222222 333 Q ss_pred eeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhc--ccccC----cccccccccccc-cccccc Q lcl|Aclame:pro 313 LTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM--GGVTG----VSETQIYPVVGD-AWATNV 385 (517) Q Consensus 313 ~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~--G~G~~----~~~~gi~~~~~~-~~~~~~ 385 (517) +...++.++. ++.. .......|.+...++++++++...|..+.. =+|.. .+.+..++.... ...... T Consensus 81 IDq~KYfaf~-VdDD-----~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt 154 (322) T protein:vir:31 81 LRDEVYAGNA-ISKK-----LRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGT 154 (322) T ss_pred Eehhhhhccc-cchh-----HHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCC Confidence 4555555543 3331 123445588889999999999888877632 11110 000000110000 000111 Q ss_pred cccccHHHHHHH---HHHhhhhhcCCEEEEcHHHHHHHHHhh-----cCCCCE--eccCCCCCCc--cceecCccceecc Q lcl|Aclame:pro 386 TGTTNIQELLEK---LSVATPKAADSTLVIHRNDLAAIRFLK-----DKNGNY--VFPVGVSNQT--IATHFGFNRLVQS 453 (517) Q Consensus 386 ~~~~~~d~l~~~---l~~~~~~~~~a~~vmn~~~~~~l~~lK-----D~~Gry--l~~~~~~~~~--~~~l~g~~~v~~~ 453 (517) ......+.++++ |..+..+..+-.+|++|..+..|..++ -.++|+ +...+...+. +.+++|+ .|+.+ T Consensus 155 ~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF-~V~~S 233 (322) T protein:vir:31 155 DQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGI-DLFVS 233 (322) T ss_pred CchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhce-eeeee Confidence 112223334333 333333333445788999887774421 234454 2233333332 4556664 34433 Q ss_pred ccCCce-------------eeeecCceEEEeeehee-------eh---hhhhcccc-hHHHHHhhhhcceeecccceEEE Q lcl|Aclame:pro 454 VAVDEK-------------TAVSLSGYVTNGSRGME-------FE---QGTILVEN-NKEYLFEMPISGSLEYKGTTAYG 509 (517) Q Consensus 454 ~~~~~~-------------~~~~~~~~~~~~~~~~~-------~~---~d~~~~~n-~~~~~~~~rvgg~v~~~~a~~~~ 509 (517) -.+++. .++-.+.|..+.+.+.. .+ +.|.-..+ .-.++...|.|-.|.+||.+++. T Consensus 234 N~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~ 313 (322) T protein:vir:31 234 NLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCV 313 (322) T ss_pred ccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceEEE Confidence 333211 11112333333332221 01 01100111 11234567778888999998876 Q ss_pred EeCCCCCC Q lcl|Aclame:pro 510 TYTPPVAG 517 (517) Q Consensus 510 ~~tp~~a~ 517 (517) .-+.+--- T Consensus 314 ~a~~~~~~ 321 (322) T protein:vir:31 314 LANADKVT 321 (322) T ss_pred Eecccccc Confidence 64433111 No 171 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=75.57 E-value=0.15 Score=25.18 Aligned_cols=184 Identities=10% Similarity=0.036 Sum_probs=75.2 Q ss_pred hhHhHhhhHHHHHHhhc-ccHHHHHHHHHHHHHHHHHHHHHhhhhc----ccccCcccccccccccccccccccccccHH Q lcl|Aclame:pro 318 VYKYIKLPKIVMNSNAT-DIAGAILTYVMNRLPDMVIMAVNRAIIM----GGVTGVSETQIYPVVGDAWATNVTGTTNIQ 392 (517) Q Consensus 318 ~~~~~~iS~~li~d~~~-d~~~~l~~~i~~~l~~~~~~~~e~~~l~----G~G~~~~~~gi~~~~~~~~~~~~~~~~~~d 392 (517) +-++. +|+.++.|-.- -....+.+...+++.+++++..|+.++. +.....+..+... +.........+...+ T Consensus 1 iD~lL-~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~--g~~~~~~a~~t~~~~ 77 (221) T protein:vir:17 1 MDDLL-VASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDG--GFSVNIGAGNTNNAQ 77 (221) T ss_pred CCcch-hHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCccccccc--CcceeccccccCCHH Confidence 11111 12222221100 0012377788889999999998888752 3322222221111 001111112223334 Q ss_pred HHHHHHHHhhhhh-----c--CCEEEEcHHHHHHHHHhhc-CCCCEeccCC---CCCC-ccceecCccceeccccCCcee Q lcl|Aclame:pro 393 ELLEKLSVATPKA-----A--DSTLVIHRNDLAAIRFLKD-KNGNYVFPVG---VSNQ-TIATHFGFNRLVQSVAVDEKT 460 (517) Q Consensus 393 ~l~~~l~~~~~~~-----~--~a~~vmn~~~~~~l~~lKD-~~Gryl~~~~---~~~~-~~~~l~g~~~v~~~~~~~~~~ 460 (517) .+.+++..+...+ + +-.+|++|..|..|-+-.| .--++.+..+ ..++ .+..+.|+ .|+.+..+|... T Consensus 78 ~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~-~V~~SnnlP~~~ 156 (221) T protein:vir:17 78 AIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGI-RIYKSNVLASLY 156 (221) T ss_pred HHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCc-EEEEeccCCccc Confidence 4444444332221 2 2356779998877764322 1122222211 1122 23445554 455555666532 Q ss_pred e----eecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceeecccceEEEEe-CCCCCC Q lcl|Aclame:pro 461 A----VSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTY-TPPVAG 517 (517) Q Consensus 461 ~----~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~~~~a~~~~~~-tp~~a~ 517 (517) . .+.+.|+.... ....+ .--+...+ |-|.+|+|.....+ -||--- T Consensus 157 gt~~~~~ag~~~~~~~----~~~~y-------r~~fs~~~-glv~~~~Avgtvkl~~~~~~~ 206 (221) T protein:vir:17 157 GTNLVTDPGDATTSGE----NNGSY-------RPAITDRA-GLVFHKEAADTVEVLLPPSRP 206 (221) T ss_pred ccccccCCcccccccc----ccccc-------cccccceE-EEEEcchheeeeeeecCCCCC Confidence 1 12222221111 00011 00122333 45788888877666 333211 No 172 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=58.64 E-value=0.41 Score=22.71 Aligned_cols=272 Identities=10% Similarity=0.038 Sum_probs=102.0 Q ss_pred HhhccchhhHHHHhhhhhccccccccc-----chhhhhhHHHhHhhhhhhhhceeeeccccc---eeeeecccccceeee Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPA-----PAGILKRIQDAVNDEGSLLPFIRHENLPTL---VVGGDNALTQGTGHT 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~v-----p~~i~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~a~~~~ 296 (517) |.... .+...+..+..- -..+...+.+.....+.++++.++..+.+. ..+ +.....+..+. T Consensus 1 Ms~~n----------~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~-~~G~s~~~~~~ 69 (401) T protein:vir:70 1 MSTPN----------NLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNK-YLGETELQVLA 69 (401) T ss_pred CCCCc----------cccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEE-EeeeeEeeeec Confidence 11000 000000000000 112233344555666667777666555432 222 23344566666 Q ss_pred cccccccccccceeeEeeHhhh---hHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhh-----cccc--- Q lcl|Aclame:pro 297 TGTDKTESNITLQTRVLTPQYV---YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII-----MGGV--- 365 (517) Q Consensus 297 eg~~~~~~~~~f~~~~~~~~~~---~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l-----~G~G--- 365 (517) .|+....+.+..++..+.+-+. ...+. .|++..-+. -.+.+-+.+++.+++++..|+.++ .|-. T Consensus 70 pG~~ld~~~~~~dK~~ItID~lL~a~~~V~----dlDe~q~~y-D~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~ 144 (401) T protein:vir:70 70 PGQSPAATSTQADKNQLVIDATVIARNTVA----HLHDVQGDI-DSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQ 144 (401) T ss_pred CCCCcCCCCcccccEEEEeCceeehhhhhh----hHHHHHhcc-cccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 6665544445555544443221 11111 122222111 013455666777888887776552 2211 Q ss_pred -cCcccccc-cccccccccccccccccHHHHHHHHHHhhhhh-----c-CCE-EEEcHHHHHHHHHhhcC--CCCEeccC Q lcl|Aclame:pro 366 -TGVSETQI-YPVVGDAWATNVTGTTNIQELLEKLSVATPKA-----A-DST-LVIHRNDLAAIRFLKDK--NGNYVFPV 434 (517) Q Consensus 366 -~~~~~~gi-~~~~~~~~~~~~~~~~~~d~l~~~l~~~~~~~-----~-~a~-~vmn~~~~~~l~~lKD~--~Gryl~~~ 434 (517) ...+..+. -..+-...........+.+.|..++..+...+ + ... ++|.|..|..|.. +|. |-.|-... T Consensus 145 ~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~-~d~L~nrd~~~s~ 223 (401) T protein:vir:70 145 AKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRD-ADRIVDKTYTISQ 223 (401) T ss_pred ccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHh-cCcccchhhcccc Confidence 01111110 00000001111112223344554444432221 1 123 4445555555543 221 11121111 Q ss_pred --CCCCCccceecCccceeccccCCcee----------eeecCceEE------------------Ee---eeheeehhhh Q lcl|Aclame:pro 435 --GVSNQTIATHFGFNRLVQSVAVDEKT----------AVSLSGYVT------------------NG---SRGMEFEQGT 481 (517) Q Consensus 435 --~~~~~~~~~l~g~~~v~~~~~~~~~~----------~~~~~~~~~------------------~~---~~~~~~~~d~ 481 (517) ....+.+..+.|++. +.+..+|... +.+...|-. .. ++..+.++| T Consensus 224 ~g~~~~G~v~~vaGv~V-v~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d- 301 (401) T protein:vir:70 224 SGATIQGFTLSSYNCPV-IPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYE- 301 (401) T ss_pred CCccccceEEEEeceEE-EeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhh- Confidence 122334445566543 3333343211 111122211 00 011122222 Q ss_pred hcccchHHH-HHhhhhcceeecccceEEEE-----eCCCCCC Q lcl|Aclame:pro 482 ILVENNKEY-LFEMPISGSLEYKGTTAYGT-----YTPPVAG 517 (517) Q Consensus 482 ~~~~n~~~~-~~~~rvgg~v~~~~a~~~~~-----~tp~~a~ 517 (517) .+.+..+ -...-.|-.++||++.+..+ +|+++-| T Consensus 302 --~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~ 341 (401) T protein:vir:70 302 --KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEG 341 (401) T ss_pred --hhhhHHHHHHHHHhCCcccchhheEEEeecCccccccccc Confidence 1122222 23455677899999987652 2444433 No 173 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=57.42 E-value=0.43 Score=22.57 Aligned_cols=268 Identities=10% Similarity=-0.021 Sum_probs=71.8 Q ss_pred HHhhhhhcccccccccchhhhhhHHH---hHhhhhhh-hhceeeecccc--ceeeeecccccc----eeeeccccccccc Q lcl|Aclame:pro 236 AWTAELKERGISGMPAPAGILKRIQD---AVNDEGSL-LPFIRHENLPT--LVVGGDNALTQG----TGHTTGTDKTESN 305 (517) Q Consensus 236 ~~~~~~~~~~~~~~~vp~~i~~~i~~---~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~a----~~~~eg~~~~~~~ 305 (517) +....+.. ...+..+. ...++.+ .....+.- ..+.+ .+..+ ..+|........ .-+.+....+... T Consensus 1 m~lsD~~v--fN~~~~~a-~~e~~~q~~~~fn~as~gai~l~~-~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~k 76 (325) T protein:vir:95 1 MALSDLAV--YSEYAYSA-FSETLRQQVDLFNTATGGAIMLQS-AAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKV 76 (325) T ss_pred Cchhhhhh--hhhhhhhh-hhhhhhhhHhhhhhcccceeEecc-ccccCceeeccccccccccccccccCCCCceeccce Confidence 00000000 00000000 0011111 00100000 00000 00000 001111100000 0000000111111 Q ss_pred c-cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccC-cccccccccccccccc Q lcl|Aclame:pro 306 I-TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTG-VSETQIYPVVGDAWAT 383 (517) Q Consensus 306 ~-~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~-~~~~gi~~~~~~~~~~ 383 (517) + +..++......-.+|.......+ .+..+....+.+.|.+.+++...+.+-+.++.+-... ........... +... T Consensus 77 itt~~~~av~~~r~~g~~~~d~~~~-~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis-~~~~ 154 (325) T protein:vir:95 77 LKHLVDTSVKVAAGTPPVRLDPGQF-RWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDAT-ANTD 154 (325) T ss_pred eccccceeeEEecccCcccccHHHH-hhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeee-cccC Confidence 1 12222222211111111110000 1111222222222333333322222222222111100 00011111110 0000 Q ss_pred cccccccHHHHHHHHHHhhhhhcC--CEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceee Q lcl|Aclame:pro 384 NVTGTTNIQELLEKLSVATPKAAD--STLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTA 461 (517) Q Consensus 384 ~~~~~~~~d~l~~~l~~~~~~~~~--a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~ 461 (517) ......+...+.++.... -+... +.|+||..++..|.+++-.+...++...... ...+.+|..+++ .+.+|-... T Consensus 155 ~~~~~~s~~~l~~A~~kl-GD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~-~i~t~~G~~VIV-dD~~p~~~~ 231 (325) T protein:vir:95 155 AADKLPTWNNLNNGQAKF-GDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN-VVRDPFGKLLVM-TDSPNLFAA 231 (325) T ss_pred cccccccHHHHHHHHHHh-cccccceeEEEEchHHHHHHHHhhccccccccccCCcc-cccccCCcEEEE-eCCCCCCCc Confidence 011111233444444442 12111 5799999999999987766655554433222 334667754444 444443222 Q ss_pred eecCceEE---E-eeeheeehhhhhcccchHHHHHhhhhcc-------eeecccceEEEEeCCCCCC Q lcl|Aclame:pro 462 VSLSGYVT---N-GSRGMEFEQGTILVENNKEYLFEMPISG-------SLEYKGTTAYGTYTPPVAG 517 (517) Q Consensus 462 ~~~~~~~~---~-~~~~~~~~~d~~~~~n~~~~~~~~rvgg-------~v~~~~a~~~~~~tp~~a~ 517 (517) +.-..|.+ + +-+++....++.+.... ..+ +.+++. -+.+|..+.+-. +..| T Consensus 232 g~~~~ytty~lg~GAi~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~tf~lhp~G~sw~~---s~~g 293 (325) T protein:vir:95 232 GTPNVYHILGLVPGGVLIGQNNDFDANEET-KNG-DENIIRTYQAEWSYNIGVKGFAWDK---ANGG 293 (325) T ss_pred cCceeEEEEEEecCeEEecCCCCccccccc-cCc-ccceeeeeeeeeeEEeecceeeeec---cccc Confidence 22222321 1 11222222222111000 000 112221 234677777621 2333 No 174 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=50.34 E-value=0.61 Score=21.75 Aligned_cols=248 Identities=9% Similarity=0.018 Sum_probs=86.8 Q ss_pred hhcccccccccchhhhhhHHHhHhhhhhhhhceeeeccc-------cceeeeecccccceeeecccccccccccceeeEe Q lcl|Aclame:pro 241 LKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHENLP-------TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVL 313 (517) Q Consensus 241 ~~~~~~~~~~vp~~i~~~i~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~ 313 (517) +.. ....++.|.-+...+++.++....+.+++....-+ ...++.-. ......+...+..+++-..+++ T Consensus 1 m~~-~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~----~~~v~dg~~~~~~~~te~~v~l 75 (418) T protein:vir:10 1 MAV-QDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPY----RVKSASGRTLVKQPMVDQTIPF 75 (418) T ss_pred CCc-cccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCC----ceeecccCCccccccccceEEE Confidence 111 01123446666677777777776666655432211 11122110 1111122222222333233333 Q ss_pred e--HhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCcccccccccccccccccccccccH Q lcl|Aclame:pro 314 T--PQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNI 391 (517) Q Consensus 314 ~--~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~~~~~~~~ 391 (517) . -+++. -+.++..-. .++. ..+..-+.++..++++..+|..++.- -.+ +........+..... T Consensus 76 ~id~~k~~-~~~itD~e~---a~~~-~d~~~~~l~~A~~aLA~~vD~~ia~l-~~~---------a~~~~gt~gt~~~~~ 140 (418) T protein:vir:10 76 KIAYQEHV-GLEYTVKDK---TLDI-MQFSERYLKSGMVQIANQIDRSLALT-LKK---------AFHSSGTPGVRPGAF 140 (418) T ss_pred EEeccccc-ceeechHHH---hhhh-hHHHHHHHHHHHHHHHHHHHHHHHHH-Hhh---------cccccccCCcCcchH Confidence 2 22221 223332211 1111 12444455567888999999887531 000 100000001111224 Q ss_pred HHHHHHHH---HhhhhhcC-CEEEEcHHHHHHHHHhhcCCCCEeccCC-----CCCCccceecCccceeccccCCceeee Q lcl|Aclame:pro 392 QELLEKLS---VATPKAAD-STLVIHRNDLAAIRFLKDKNGNYVFPVG-----VSNQTIATHFGFNRLVQSVAVDEKTAV 462 (517) Q Consensus 392 d~l~~~l~---~~~~~~~~-a~~vmn~~~~~~l~~lKD~~Gryl~~~~-----~~~~~~~~l~g~~~v~~~~~~~~~~~~ 462 (517) ++++++-. ....+-.. -..|++|..+..|. +|.. .++... .-++.+..+.|+ .++.+..++..+++ T Consensus 141 ~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~--~~~~--~~~~~~~~~~~lr~G~IG~i~GF-~V~~S~nip~~tag 215 (418) T protein:vir:10 141 IDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLS--DEVT--KLFKESMVEQAYKMGYRGNVAAY-EVYESQNLPKHTVG 215 (418) T ss_pred HHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHh--hhcc--ccccccccchhhheeeeeeeece-EEEEecCCCccccc Confidence 44544322 22222221 34589998877664 3332 233322 235566677775 45555555543332 Q ss_pred ecCc--eEEEe-----ee----------heeehhhhhcccc---------------hHHHHHhhhhcceeecccceEEEE Q lcl|Aclame:pro 463 SLSG--YVTNG-----SR----------GMEFEQGTILVEN---------------NKEYLFEMPISGSLEYKGTTAYGT 510 (517) Q Consensus 463 ~~~~--~~~~~-----~~----------~~~~~~d~~~~~n---------------~~~~~~~~rvgg~v~~~~a~~~~~ 510 (517) .+.+ ++.+. .. +.-...| .++.. .++|.+..-+ ...-.+-.-.+ T Consensus 216 ~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd-~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~---~~~~~~~~tv~ 291 (418) T protein:vir:10 216 DHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGD-VITFGGVFGVNPQNYETTGLLQEFVVLEDV---DTDAGGAGSIK 291 (418) T ss_pred ccccceeeecccccceeEEEeecceeeccceeecc-EEEECceeecccccccccccceEEEEEeec---cccccCcceeE Confidence 2211 11000 00 0000000 00111 1111110000 00000011122 Q ss_pred eCCCC----------------------------CC Q lcl|Aclame:pro 511 YTPPV----------------------------AG 517 (517) Q Consensus 511 ~tp~~----------------------------a~ 517 (517) +.||. +| T Consensus 292 i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~ 326 (418) T protein:vir:10 292 ISPSLNDGTATINNENGDPVSLTAYQNVTALPADN 326 (418) T ss_pred eccccccccccccccccccccccCCCcccccccCc Confidence 22221 00 No 175 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=48.17 E-value=0.68 Score=21.51 Aligned_cols=272 Identities=10% Similarity=0.029 Sum_probs=108.8 Q ss_pred HhhccchhhHHHHhhhhhcccccccccchhhhhh----HHHhH-hhhhhhhhceeeeccccce-eeeecccccceeee-- Q lcl|Aclame:pro 225 MSASLTKDPKAAWTAELKERGISGMPAPAGILKR----IQDAV-NDEGSLLPFIRHENLPTLV-VGGDNALTQGTGHT-- 296 (517) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~~~----i~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~-- 296 (517) ++-+...+. -..+...++..++.+ ..-.. +..+.+.+.++..+..... ..-.........++ T Consensus 1 ~~~~~~~~~----------~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (322) T protein:vir:10 1 MKLNAIMSM----------LPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRK 70 (322) T ss_pred Ccccceeee----------eeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccc Confidence 100000000 001111223333222 11111 1223344444422211110 00000000000011 Q ss_pred -------ccc-ccccccc--cceeeEeeHhhhhHhHhhhHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHhhhhccc-c Q lcl|Aclame:pro 297 -------TGT-DKTESNI--TLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG-V 365 (517) Q Consensus 297 -------eg~-~~~~~~~--~f~~~~~~~~~~~~~~~iS~~li~d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~-G 365 (517) .+. ..|.... ....+.+..+..+ ..|...-......| ..+...++.+++++++.+..++.+- | T Consensus 71 ~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~--~~VDd~D~~k~~~D----~~~~~~~~~a~AL~R~~D~~I~~a~~g 144 (322) T protein:vir:10 71 RSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTG--HVVEQEDISQMLLD----PNSALITSQAYAMARKTDDLIIAGAWK 144 (322) T ss_pred cccccccCcccCCCccccccceEEEeecccccc--eecchHHHHHhhcC----chHHHHHHHHHHhhhHHHHHHHhhhhc Confidence 111 1122222 2233334444333 34445444445555 4556667899999999999887632 2 Q ss_pred cCccccccccc-cccccccccccc-ccHHHHHHHHHHhh-hhhcC---CEEEEcHHHHHHHHHhhc-CCCCEeccCCC-C Q lcl|Aclame:pro 366 TGVSETQIYPV-VGDAWATNVTGT-TNIQELLEKLSVAT-PKAAD---STLVIHRNDLAAIRFLKD-KNGNYVFPVGV-S 437 (517) Q Consensus 366 ~~~~~~gi~~~-~~~~~~~~~~~~-~~~d~l~~~l~~~~-~~~~~---a~~vmn~~~~~~l~~lKD-~~Gryl~~~~~-~ 437 (517) ........-+. ....+.....+. .+.+.++.+..... .+.+. -.+|++|..|..|..... ++-.|.=.... . T Consensus 145 ~a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~ 224 (322) T protein:vir:10 145 PASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQS 224 (322) T ss_pred cccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhh Confidence 11110000111 111222222222 22334444332221 22332 247889999988754331 23445432222 3 Q ss_pred CCccceecCccceeccccCCce--e------------------eeecC--ceEEEeeeheeehhhhhcccc-hHHHHHhh Q lcl|Aclame:pro 438 NQTIATHFGFNRLVQSVAVDEK--T------------------AVSLS--GYVTNGSRGMEFEQGTILVEN-NKEYLFEM 494 (517) Q Consensus 438 ~~~~~~l~g~~~v~~~~~~~~~--~------------------~~~~~--~~~~~~~~~~~~~~d~~~~~n-~~~~~~~~ 494 (517) +|.+.+.+|+..+.. -.+|.. + ++..+ .|+..-++..+...+. .+. ...+..-. T Consensus 225 ~G~ig~~lGf~~i~s-~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~--~~~~a~~I~~~~ 301 (322) T protein:vir:10 225 KGIITNWMGYTWIVS-TRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDP--SASFAWRIYSAF 301 (322) T ss_pred cCeeeeeeeEEEEEe-ccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccC--Ccchhhhhhhhh Confidence 566777788654432 222210 0 01111 1221112222222122 222 24455557 Q ss_pred hhcceeecccceEEEEeCCCC Q lcl|Aclame:pro 495 PISGSLEYKGTTAYGTYTPPV 515 (517) Q Consensus 495 rvgg~v~~~~a~~~~~~tp~~ 515 (517) ..|..+.+|+.++-..+.-.. T Consensus 302 ~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 302 TADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred hhCceEeccCcEEEEEEeccC Confidence 778888899999999998888 No 176 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=33.58 E-value=1.3 Score=19.87 Aligned_cols=299 Identities=10% Similarity=0.001 Sum_probs=107.1 Q ss_pred hhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchhhh--- Q lcl|Aclame:pro 180 LKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGIL--- 256 (517) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~i~--- 256 (517) .+..+. +.+++ ..+.. ....+..+.......... +.... ..+......+ +|.-+. T Consensus 1 ~~~~~~----~~~l~----~~gi~----~~~~~~~~~~~~~~~~~d----a~d~~----~~~~~~~~~~--~~~~l~~~i 58 (336) T protein:vir:36 1 MRDAQR----IQNLA----RAGVI----LPRSVQNVSTPLTEYAMD----AADLS----PHLSSTGSSG--IPNYLTTYV 58 (336) T ss_pred CchHHH----HHHHh----hcCee----ecchhhhhhhHHHHhhhh----hhhcc----CccccCCCcc--hHHHHHHhh Confidence 000000 00000 00000 000000000000000000 00000 0000000001 111111 Q ss_pred -hhHHHhHhhhhhhhhceeeecccc-----ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHH Q lcl|Aclame:pro 257 -KRIQDAVNDEGSLLPFIRHENLPT-----LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMN 330 (517) Q Consensus 257 -~~i~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~ 330 (517) .++.+.+........++.+...+. ......+..+.+..++.++.-|..+......+..++.++..+.++.+-+. T Consensus 59 ~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~ 138 (336) T protein:vir:36 59 DPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELE 138 (336) T ss_pred ccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHH Confidence 122233333333344444444322 12233445567778888888888776666666666676666666633333 Q ss_pred HhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc----ccccc----cHHHHHHHHHHhh Q lcl|Aclame:pro 331 SNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN----VTGTT----NIQELLEKLSVAT 402 (517) Q Consensus 331 d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~----~~~~~----~~d~l~~~l~~~~ 402 (517) .+..-- -.|.+--...-++++.+.++.-.+.|++.. ...|+++........+ ....+ .++++..++.... T Consensus 139 ~Aa~~~-~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~-~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~ 216 (336) T protein:vir:36 139 MAGAGR-VDLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQ 216 (336) T ss_pred HHHHhC-CCcHHHHHHHHHHHHHHhhCcEEEEecccc-ceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHH Confidence 222110 013332333445566666776677787643 3356666432211111 11122 2344444444332 Q ss_pred hh-------hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCc-eEEEeeeh Q lcl|Aclame:pro 403 PK-------AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSG-YVTNGSRG 474 (517) Q Consensus 403 ~~-------~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~-~~~~~~~~ 474 (517) .. -.+..++|.+.-+..|.+- +..|.-+++-. ..-+..-.++..+.... + ..+. +.+..... T Consensus 217 ~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~l------k~n~Pnl~i~t~pEl~~--a-~g~~~~l~~~~~~ 286 (336) T protein:vir:36 217 TQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKL------KDIFPKLEFVTIPEYDT--A-SGRLVQLWAPRVE 286 (336) T ss_pred HhcCCeeeeccccEEEechHHHHhccCC-CccCccHHHHH------HHhcCccEEEEcccccc--C-CCceEEEEEEecC Confidence 21 1245799999988887532 33343222100 00000001111111111 0 0111 11111110 Q ss_pred ------eeehhhhh---cccchH--HHHHhhhhcc-eeecccceEEEEeC Q lcl|Aclame:pro 475 ------MEFEQGTI---LVENNK--EYLFEMPISG-SLEYKGTTAYGTYT 512 (517) Q Consensus 475 ------~~~~~d~~---~~~n~~--~~~~~~rvgg-~v~~~~a~~~~~~t 512 (517) +.+-..|. .+...- ..-...|.|| -|++|.++++.+=- T Consensus 287 ~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 287 GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CCcceeeecchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 00000110 000000 0112344444 67788888876622 No 177 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=28.57 E-value=1.7 Score=19.27 Aligned_cols=299 Identities=10% Similarity=0.003 Sum_probs=108.1 Q ss_pred hhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchh----h Q lcl|Aclame:pro 180 LKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAG----I 255 (517) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~----i 255 (517) .+..+. +.+++ ..+.. ....+..+..........+ . ... ..+......+ +|.- + T Consensus 1 ~~~~~~----~~~l~----~~gi~----~~~~~~~~~~~~~~~~~da-~---d~~----~~~~~~~~~~--i~~~l~~~i 58 (336) T protein:vir:10 1 MRDAQR----IQNLA----RAGVI----LPRSVQNVSTPLTEYAMDA-A---DLS----PHLSSTGSSG--IPNYLTTYV 58 (336) T ss_pred CchHHH----HHHHh----hcCee----ecchhhhhhhhHHHhhhhh-h---hcc----CccccCCCch--hHHHHHhhc Confidence 000000 00000 00000 0000000000000000000 0 000 0011110001 1111 1 Q ss_pred hhhHHHhHhhhhhhhhceeeecccc-----ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHHHH Q lcl|Aclame:pro 256 LKRIQDAVNDEGSLLPFIRHENLPT-----LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMN 330 (517) Q Consensus 256 ~~~i~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~li~ 330 (517) ...+.+.+........++.+...+. ......+..+.+..++.++.-|..+......+..++.++..+.++.+-+. T Consensus 59 ~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~ 138 (336) T protein:vir:10 59 DPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELE 138 (336) T ss_pred ccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHH Confidence 1222333333333344444444322 12333445567778888888888776666666666777666666643333 Q ss_pred HhhcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccccc----ccccc----cHHHHHHHHHHhh Q lcl|Aclame:pro 331 SNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATN----VTGTT----NIQELLEKLSVAT 402 (517) Q Consensus 331 d~~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~~~----~~~~~----~~d~l~~~l~~~~ 402 (517) .+..--. .|.+--...-++++.+.++.-.+.|++.. ...|+++........+ ....+ .++++..++.... T Consensus 139 ~A~~~g~-~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~-~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~ 216 (336) T protein:vir:10 139 MAGAGRV-DLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQ 216 (336) T ss_pred HHHHhCC-CcHHHHHHHHHHHHHHhhCcEEEEecccc-ceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHH Confidence 3322100 13332333445566666776677787643 3346665432211111 11122 2344444444332 Q ss_pred hh-------hcCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCc-eEEEeeeh Q lcl|Aclame:pro 403 PK-------AADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSG-YVTNGSRG 474 (517) Q Consensus 403 ~~-------~~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~-~~~~~~~~ 474 (517) .. -.+..++|.+.-+..|.+- +..|.-+++-. ..-+..-.++..+.... + ..+. +.+..... T Consensus 217 ~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~~l------k~n~Pnl~i~t~pEl~~--a-~G~~~~l~~~~~~ 286 (336) T protein:vir:10 217 TQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKL------KDIFPKLEFVTIPEYDT--A-SGRLVQLWAPRVE 286 (336) T ss_pred HhcCCeecccCcceEEecHHHHHhccCC-CccCccHHHHH------HHhcCccEEEEcccccc--C-CCceEEEEEEecC Confidence 21 1246799999988887532 33343222100 00000001111111110 0 0111 11111110 Q ss_pred ------eeehhhhh---cccchH--HHHHhhhhcc-eeecccceEEEEeC Q lcl|Aclame:pro 475 ------MEFEQGTI---LVENNK--EYLFEMPISG-SLEYKGTTAYGTYT 512 (517) Q Consensus 475 ------~~~~~d~~---~~~n~~--~~~~~~rvgg-~v~~~~a~~~~~~t 512 (517) +.+-..|. .+...- ..-...|.|| -|++|.++++.+=- T Consensus 287 ~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 287 GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CCcceeeecchhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 00000110 000000 0112344444 67788888876622 No 178 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=26.63 E-value=1.9 Score=19.02 Aligned_cols=298 Identities=9% Similarity=-0.013 Sum_probs=106.1 Q ss_pred HhhhhhhhHHHHHHHHhhHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhhhcccccccccchh--- Q lcl|Aclame:pro 178 AALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAG--- 254 (517) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~--- 254 (517) ...+..+.-..++++ .+.-..... ............+. +.... ..+......+ +|.. T Consensus 1 ~~~~~~~~~~~~l~~-------~g~~~~~~~---~~~~~~~~~~~a~d----~~~~~----~~~~~~~~~~--i~a~~~~ 60 (339) T protein:vir:94 1 MSINNDRTDIKQLEK-------VGIIFDGYS---PKSISSEVSAYAMD----AVNLT----PTLQTTANAG--IPAWMTT 60 (339) T ss_pred CceechHHHHHHHHh-------hceeeccch---hhhcchhhHhhhcc----ccccc----cccccccccc--hhhhhhh Confidence 000000000011110 000000000 00000000000000 00000 0001111111 2222 Q ss_pred -hhhhHHHhHhhhhhhhhceeeecccc-----ceeeeecccccceeeeccccccccc--ccceeeEeeHhhhhHhHhhhH Q lcl|Aclame:pro 255 -ILKRIQDAVNDEGSLLPFIRHENLPT-----LVVGGDNALTQGTGHTTGTDKTESN--ITLQTRVLTPQYVYKYIKLPK 326 (517) Q Consensus 255 -i~~~i~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~a~~~~eg~~~~~~~--~~f~~~~~~~~~~~~~~~iS~ 326 (517) +...+.+..........++++.+.+. .+....+..+.+.+++.+++-|..+ ..+...+.....++ +.++. T Consensus 61 ~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g--~~y~~ 138 (339) T protein:vir:94 61 FVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTW--TEYGD 138 (339) T ss_pred hhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEE--EeecH Confidence 22333344444444555555554432 2334456667788888887777655 34444443333222 22222 Q ss_pred HHHHHh---hcccHHHHHHHHHHHHHHHHHHHHHhhhhcccccCccccccccccccccc---ccccccccHHHHH----H Q lcl|Aclame:pro 327 IVMNSN---ATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWA---TNVTGTTNIQELL----E 396 (517) Q Consensus 327 ~li~d~---~~d~~~~l~~~i~~~l~~~~~~~~e~~~l~G~G~~~~~~gi~~~~~~~~~---~~~~~~~~~d~l~----~ 396 (517) +-+... .++ |.+--....++++.+.++.-.++|+-. ....|+++....... .+...+.+.+.++ . T Consensus 139 ~E~~~A~~~g~~----l~~~Ka~aA~~al~~~~N~i~~~Gd~~-~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~ 213 (339) T protein:vir:94 139 LEMATYGEAGID----YVARQEISASLVMAKFANSSYLLGVAG-IANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVA 213 (339) T ss_pred HHHHHHHhhCCC----hHHHHHHHHHHHHHHhhceEEeeeecc-cceEEEEeCCCccccccCCCCcccCCHHHHHHHHHH Confidence 111111 222 222233345666777777778888743 334566654322111 1111223333333 3 Q ss_pred HHHHhhhh-----h--cCCEEEEcHHHHHHHHHhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEE Q lcl|Aclame:pro 397 KLSVATPK-----A--ADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVT 469 (517) Q Consensus 397 ~l~~~~~~-----~--~~a~~vmn~~~~~~l~~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~ 469 (517) ++...... . .+..++|.|+-+..|... +..|.-++.-.-.+ +..-.++..++.... +.+...+ T Consensus 214 ~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-n~~~~Tvl~~lk~n------~pnl~i~~~~el~~a---~g~~~~~ 283 (339) T protein:vir:94 214 MVGRLISQSGGLITGQERMVMALAPSALNNVNRT-NNFGLSAGAKIAQT------YPNIQFVAVPEFDTA---SGRLVQL 283 (339) T ss_pred HHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-CcCCccHHHHHHHh------cCCcEEEEccccccC---CCceEEE Confidence 33332111 1 134799999999988643 44443332111000 111122222222211 1111111 Q ss_pred Ee-ee-h-----eeehhhhh---cccchH--HHHHhhhhc-ceeecccceEEEEeC Q lcl|Aclame:pro 470 NG-SR-G-----MEFEQGTI---LVENNK--EYLFEMPIS-GSLEYKGTTAYGTYT 512 (517) Q Consensus 470 ~~-~~-~-----~~~~~d~~---~~~n~~--~~~~~~rvg-g~v~~~~a~~~~~~t 512 (517) .. .. + +.+-..+. .+...- ..-...|.| .-|++|.++++.+=- T Consensus 284 ~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 284 WVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred EEEeccCCcceEEEcchhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 11 10 0 00000000 000000 111234444 477889998877622 No 179 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=24.73 E-value=2.1 Score=18.77 Aligned_cols=276 Identities=9% Similarity=0.003 Sum_probs=100.0 Q ss_pred hhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhh----hcccccccccchhhhhhHHHhHhhhh--hhhhceeeec Q lcl|Aclame:pro 204 ALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAEL----KERGISGMPAPAGILKRIQDAVNDEG--SLLPFIRHEN 277 (517) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~vp~~i~~~i~~~~~~~~--~~~~~~~~~~ 277 (517) ...+.+.. +. .... .....+.+.+.+..+- +.-...+.++...+...|..+..... .+++-+...+ T Consensus 1 ~~~~~~~~-~~---~~~~----~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~ 72 (463) T protein:vir:99 1 MTIEKNLS-DV---QQKY----ADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRP 72 (463) T ss_pred CCcccccc-hH---HHHH----HhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCch Confidence 00000000 00 0000 0000111111111100 00011122222222222221111111 1111111111 Q ss_pred ccc-----ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHH-HHHhhcccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 278 LPT-----LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIV-MNSNATDIAGAILTYVMNRLPDM 351 (517) Q Consensus 278 ~~~-----~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~l-i~d~~~d~~~~l~~~i~~~l~~~ 351 (517) ..+ ......-..+.+..+.|+...+.+++.+..+...+|-+.+...+|..+ +.+...| ......+.-.-. T Consensus 73 a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d----~~~~~~~dai~~ 148 (463) T protein:vir:99 73 AQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD----PSQILTEDAIAV 148 (463) T ss_pred hhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhccccc----HHHHHHHHHHHH Confidence 111 111111222445667889888899999999999999888877777643 2222222 223333445556 Q ss_pred HHHHHHhhhhcccccC--------cccccccccccccccccccccccHHHHHHHHHHhh-hhh-cCCEEEEcHHHHHHHH Q lcl|Aclame:pro 352 VIMAVNRAIIMGGVTG--------VSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKA-ADSTLVIHRNDLAAIR 421 (517) Q Consensus 352 ~~~~~e~~~l~G~G~~--------~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~-~~a~~vmn~~~~~~l~ 421 (517) ++..+|.+.+.||-.= .++-||.+....-+...+-+......++....... ..+ .+.-+.|+..+.+.+. T Consensus 149 ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~ 228 (463) T protein:vir:99 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFV 228 (463) T ss_pred HHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHH Confidence 8889999999887432 22334433322122222222222222222111111 122 2344778888888877 Q ss_pred HhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceee Q lcl|Aclame:pro 422 FLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLE 501 (517) Q Consensus 422 ~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~ 501 (517) .--=..-|.+.+++.... ..|.+.-. .+..-.-+.+.++. .++ ...++-+.+. . T Consensus 229 ~~~l~~qrv~~~~N~~~~----~~G~~v~~---f~s~~G~I~L~~s~--------~m~-------~~~il~~~~~----~ 282 (463) T protein:vir:99 229 NSILGRQMQLMQDNSGNV----NTGYSVNG---FYSSRGFIKLHGST--------VME-------NELILDESLQ----P 282 (463) T ss_pred HHhcCceEEEEcCCCCce----eeeeeccc---eeeeeeeeeeCCce--------ecC-------Ccccccchhh----c Confidence 322222223333322211 22211000 00000001111111 111 1112333331 3 Q ss_pred cccceEEE----EeCCCCCC Q lcl|Aclame:pro 502 YKGTTAYG----TYTPPVAG 517 (517) Q Consensus 502 ~~~a~~~~----~~tp~~a~ 517 (517) .|.|++-. +++|.-.| T Consensus 283 ~p~ap~~~~~tatv~~~~~~ 302 (463) T protein:vir:99 283 LPNAPQPAKVTATVETKQKG 302 (463) T ss_pred CCCCccCceeEEEEeeccCC Confidence 44444332 33332222 No 180 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=24.73 E-value=2.1 Score=18.77 Aligned_cols=276 Identities=9% Similarity=0.003 Sum_probs=100.0 Q ss_pred hhhhhhhhhhHHHHHHHHHHHHhhccchhhHHHHhhhh----hcccccccccchhhhhhHHHhHhhhh--hhhhceeeec Q lcl|Aclame:pro 204 ALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAEL----KERGISGMPAPAGILKRIQDAVNDEG--SLLPFIRHEN 277 (517) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~vp~~i~~~i~~~~~~~~--~~~~~~~~~~ 277 (517) ...+.+.. +. .... .....+.+.+.+..+- +.-...+.++...+...|..+..... .+++-+...+ T Consensus 1 ~~~~~~~~-~~---~~~~----~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~ 72 (463) T protein:vir:95 1 MTIEKNLS-DV---QQKY----ADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRP 72 (463) T ss_pred CCcccccc-hH---HHHH----HhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCch Confidence 00000000 00 0000 0000111111111100 00011122222222222221111111 1111111111 Q ss_pred ccc-----ceeeeecccccceeeecccccccccccceeeEeeHhhhhHhHhhhHHH-HHHhhcccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 278 LPT-----LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIV-MNSNATDIAGAILTYVMNRLPDM 351 (517) Q Consensus 278 ~~~-----~~~~~~~~~~~a~~~~eg~~~~~~~~~f~~~~~~~~~~~~~~~iS~~l-i~d~~~d~~~~l~~~i~~~l~~~ 351 (517) ..+ ......-..+.+..+.|+...+.+++.+..+...+|-+.+...+|..+ +.+...| ......+.-.-. T Consensus 73 a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d----~~~~~~~dai~~ 148 (463) T protein:vir:95 73 AQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD----PSQILTEDAIAV 148 (463) T ss_pred hhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhccccc----HHHHHHHHHHHH Confidence 111 111111222445667889888899999999999999888877777643 2222222 223333445556 Q ss_pred HHHHHHhhhhcccccC--------cccccccccccccccccccccccHHHHHHHHHHhh-hhh-cCCEEEEcHHHHHHHH Q lcl|Aclame:pro 352 VIMAVNRAIIMGGVTG--------VSETQIYPVVGDAWATNVTGTTNIQELLEKLSVAT-PKA-ADSTLVIHRNDLAAIR 421 (517) Q Consensus 352 ~~~~~e~~~l~G~G~~--------~~~~gi~~~~~~~~~~~~~~~~~~d~l~~~l~~~~-~~~-~~a~~vmn~~~~~~l~ 421 (517) ++..+|.+.+.||-.= .++-||.+....-+...+-+......++....... ..+ .+.-+.|+..+.+.+. T Consensus 149 ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~ 228 (463) T protein:vir:95 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFV 228 (463) T ss_pred HHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHH Confidence 8889999999887432 22334433322122222222222222222111111 122 2344778888888877 Q ss_pred HhhcCCCCEeccCCCCCCccceecCccceeccccCCceeeeecCceEEEeeeheeehhhhhcccchHHHHHhhhhcceee Q lcl|Aclame:pro 422 FLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLE 501 (517) Q Consensus 422 ~lKD~~Gryl~~~~~~~~~~~~l~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~n~~~~~~~~rvgg~v~ 501 (517) .--=..-|.+.+++.... ..|.+.-. .+..-.-+.+.++. .++ ...++-+.+. . T Consensus 229 ~~~l~~qrv~~~~N~~~~----~~G~~v~~---f~s~~G~I~L~~s~--------~m~-------~~~il~~~~~----~ 282 (463) T protein:vir:95 229 NSILGRQMQLMQDNSGNV----NTGYSVNG---FYSSRGFIKLHGST--------VME-------NELILDESLQ----P 282 (463) T ss_pred HHhcCceEEEEcCCCCce----eeeeeccc---eeeeeeeeeeCCce--------ecC-------Ccccccchhh----c Confidence 322222223333322211 22211000 00000001111111 111 1112333331 3 Q ss_pred cccceEEE----EeCCCCCC Q lcl|Aclame:pro 502 YKGTTAYG----TYTPPVAG 517 (517) Q Consensus 502 ~~~a~~~~----~~tp~~a~ 517 (517) .|.|++-. +++|.-.| T Consensus 283 ~p~ap~~~~~tatv~~~~~~ 302 (463) T protein:vir:95 283 LPNAPQPAKVTATVETKQKG 302 (463) T ss_pred CCCCccCceeEEEEeeccCC Confidence 44444332 33332222 Done!