Query lcl|Aclame:protein:vir:99075|NCBI_annot:gp30|genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Match_columns 392 No_of_seqs 265 out of 1666 Neff 10.3 Searched_HMMs 1612 Date Sun Dec 1 20:07:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_147 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_147_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99075 Length: 392 100.0 5E-93 3.1E-96 526.6 37.1 392 1-392 1-392 (392) 2 protein:vir:108303 Length: 418 100.0 4.8E-52 3E-55 301.9 26.8 358 1-392 1-398 (418) 3 protein:vir:174 Length: 423 # 100.0 1.1E-51 7E-55 299.9 27.6 375 1-392 1-405 (423) 4 protein:vir:105374 Length: 423 100.0 8.7E-52 5.4E-55 300.5 26.8 375 1-392 1-405 (423) 5 protein:vir:3525 Length: 423 # 100.0 1.3E-50 8.2E-54 294.0 26.8 374 1-392 1-405 (423) 6 protein:vir:105522 Length: 423 100.0 1.3E-49 7.8E-53 288.7 26.9 378 1-392 1-405 (423) 7 protein:vir:102605 Length: 273 100.0 7.5E-47 4.7E-50 273.5 22.9 268 1-299 1-273 (273) 8 protein:vir:105822 Length: 273 100.0 7.5E-47 4.7E-50 273.5 22.9 268 1-299 1-273 (273) 9 protein:vir:7990 Length: 273 # 100.0 9.1E-47 5.6E-50 273.0 22.4 268 1-299 1-273 (273) 10 protein:vir:94622 Length: 341 100.0 2.8E-42 1.8E-45 248.4 20.5 289 1-318 3-341 (341) 11 protein:vir:80180 Length: 381 100.0 6.6E-39 4.1E-42 229.9 21.6 333 1-361 15-381 (381) 12 protein:vir:3136 Length: 322 # 100.0 2.4E-38 1.5E-41 226.8 14.5 287 1-313 1-322 (322) 13 protein:vir:80930 Length: 278 100.0 1.1E-35 7E-39 212.2 21.2 268 1-314 1-278 (278) 14 protein:vir:9265 Length: 430 # 100.0 9.9E-35 6.1E-38 207.0 20.9 371 1-392 1-410 (430) 15 protein:vir:100939 Length: 430 100.0 9.9E-35 6.1E-38 207.0 20.9 371 1-392 1-410 (430) 16 protein:vir:1239 Length: 274 # 100.0 5.7E-34 3.6E-37 202.8 21.6 264 1-320 1-274 (274) 17 protein:vir:95898 Length: 274 100.0 5.8E-34 3.6E-37 202.8 21.6 264 1-322 1-274 (274) 18 protein:vir:96262 Length: 274 100.0 5.8E-34 3.6E-37 202.8 21.6 264 1-322 1-274 (274) 19 protein:vir:94494 Length: 274 100.0 1.2E-33 7.2E-37 201.1 21.3 264 1-320 1-274 (274) 20 protein:vir:97433 Length: 274 100.0 1.2E-33 7.2E-37 201.1 21.3 264 1-320 1-274 (274) 21 protein:vir:96123 Length: 274 100.0 1.3E-33 8.2E-37 200.8 21.5 264 1-307 1-274 (274) 22 protein:vir:93742 Length: 274 100.0 1.4E-33 8.9E-37 200.6 20.8 264 1-320 1-274 (274) 23 protein:vir:96833 Length: 275 100.0 1.9E-33 1.2E-36 200.0 20.3 265 1-313 3-275 (275) 24 protein:vir:2106 Length: 430 # 100.0 7.9E-33 4.9E-36 196.6 21.8 372 1-392 1-410 (430) 25 protein:vir:78739 Length: 332 100.0 3.8E-33 2.4E-36 198.3 19.5 280 1-296 7-332 (332) 26 protein:vir:10450 Length: 344 100.0 5.4E-32 3.4E-35 192.0 19.7 283 1-311 1-344 (344) 27 protein:vir:3613 Length: 272 # 100.0 7.5E-32 4.7E-35 191.2 20.5 261 1-311 1-272 (272) 28 protein:vir:3364 Length: 347 # 100.0 5E-32 3.1E-35 192.2 17.0 285 1-315 1-347 (347) 29 protein:vir:105334 Length: 276 100.0 3.7E-31 2.3E-34 187.4 20.5 267 1-316 1-276 (276) 30 protein:vir:1541 Length: 347 # 100.0 5.1E-31 3.1E-34 186.7 19.1 285 1-315 1-347 (347) 31 protein:vir:94576 Length: 347 99.9 5.7E-30 3.5E-33 180.9 18.3 284 1-316 1-347 (347) 32 protein:vir:8885 Length: 347 # 99.9 8.2E-30 5.1E-33 180.0 18.3 286 1-317 1-347 (347) 33 protein:vir:2201 Length: 345 # 99.9 1.1E-29 7.1E-33 179.2 18.9 299 1-333 1-345 (345) 34 protein:vir:94711 Length: 347 99.9 3.2E-30 2E-33 182.3 15.8 287 1-317 1-347 (347) 35 protein:vir:100057 Length: 375 99.9 1.8E-28 1.1E-31 172.7 21.8 314 1-346 9-375 (375) 36 protein:vir:9820 Length: 272 # 99.9 1.3E-28 8.4E-32 173.4 20.8 263 1-317 1-272 (272) 37 protein:vir:3033 Length: 272 # 99.9 1.3E-28 8.4E-32 173.4 20.8 263 1-317 1-272 (272) 38 protein:vir:80213 Length: 334 99.9 8E-29 5E-32 174.6 18.6 283 1-313 1-334 (334) 39 protein:vir:79008 Length: 299 99.9 3.9E-27 2.4E-30 165.4 19.4 275 1-291 1-299 (299) 40 protein:vir:99675 Length: 324 99.9 1E-26 6.2E-30 163.1 16.2 282 28-334 1-324 (324) 41 protein:vir:107120 Length: 329 99.9 7.4E-26 4.6E-29 158.4 19.7 288 1-330 36-329 (329) 42 protein:vir:6324 Length: 335 # 99.9 2.2E-25 1.4E-28 155.8 19.8 286 1-320 1-335 (335) 43 protein:vir:78935 Length: 335 99.9 1E-25 6.5E-29 157.5 17.9 288 1-320 1-335 (335) 44 protein:vir:103323 Length: 364 99.9 4.1E-25 2.6E-28 154.3 20.2 316 1-357 1-364 (364) 45 protein:vir:94800 Length: 319 99.9 3.6E-25 2.2E-28 154.6 19.2 290 1-322 25-319 (319) 46 protein:vir:97331 Length: 319 99.9 3.6E-25 2.2E-28 154.6 19.2 290 1-322 25-319 (319) 47 protein:vir:95107 Length: 270 99.9 3.8E-25 2.4E-28 154.5 18.0 264 1-322 1-270 (270) 48 protein:vir:78920 Length: 290 99.9 6.9E-25 4.3E-28 153.1 19.2 273 1-299 1-290 (290) 49 protein:vir:97031 Length: 402 99.9 2.2E-23 1.3E-26 144.8 18.7 348 1-369 1-402 (402) 50 protein:vir:7019 Length: 401 # 99.8 2.1E-23 1.3E-26 145.0 15.2 349 1-370 1-401 (401) 51 protein:vir:102655 Length: 322 99.8 6.1E-22 3.8E-25 136.9 18.1 284 1-318 13-322 (322) 52 protein:vir:739 Length: 231 # 99.8 6.5E-22 4E-25 136.7 16.5 227 34-311 1-231 (231) 53 protein:vir:118 Length: 449 # 99.8 8.6E-20 5.3E-23 125.1 25.4 345 1-392 52-431 (449) 54 protein:vir:1781 Length: 221 # 99.8 1.8E-21 1.1E-24 134.3 12.9 204 83-326 1-221 (221) 55 protein:vir:105464 Length: 346 99.8 3.9E-20 2.4E-23 127.0 16.8 316 1-357 1-346 (346) 56 protein:vir:102335 Length: 312 99.7 2.3E-19 1.4E-22 122.8 20.2 290 1-312 1-312 (312) 57 protein:vir:79712 Length: 285 99.7 8.4E-19 5.2E-22 119.7 18.3 270 1-290 1-285 (285) 58 protein:vir:105645 Length: 400 99.7 1E-17 6.3E-21 113.8 18.0 329 1-342 1-400 (400) 59 protein:vir:99523 Length: 311 99.5 1E-15 6.4E-19 102.7 18.7 277 1-300 1-311 (311) 60 protein:vir:1583 Length: 351 # 99.5 5.4E-15 3.4E-18 98.8 16.8 324 1-347 1-351 (351) 61 protein:vir:78090 Length: 302 99.5 1.9E-14 1.2E-17 95.8 19.3 279 1-302 1-302 (302) 62 protein:vir:5974 Length: 324 # 99.4 2.1E-14 1.3E-17 95.5 18.4 298 1-336 1-324 (324) 63 protein:vir:102944 Length: 330 99.4 1.9E-14 1.2E-17 95.8 16.8 297 1-324 1-330 (330) 64 protein:vir:5202 Length: 448 # 99.3 3.2E-13 2E-16 89.1 18.6 340 1-392 52-430 (448) 65 protein:vir:95451 Length: 313 99.3 2.7E-14 1.7E-17 95.0 11.3 287 1-340 1-313 (313) 66 protein:vir:9927 Length: 295 # 98.8 1.6E-10 1E-13 74.3 13.4 277 1-326 1-295 (295) 67 protein:vir:9875 Length: 296 # 98.8 3E-10 1.9E-13 72.8 14.7 270 1-324 22-296 (296) 68 protein:vir:106647 Length: 303 98.7 2.9E-09 1.8E-12 67.4 15.1 277 1-337 1-303 (303) 69 protein:vir:80446 Length: 367 98.5 2.6E-08 1.6E-11 62.2 16.9 309 1-360 1-367 (367) 70 protein:vir:41 Length: 299 # N 98.5 3.2E-08 2E-11 61.6 17.1 277 1-315 1-299 (299) 71 protein:vir:108211 Length: 318 98.5 7.5E-09 4.7E-12 65.1 13.5 278 1-314 1-318 (318) 72 protein:vir:80684 Length: 315 98.5 2.9E-08 1.8E-11 61.9 15.6 291 1-316 1-315 (315) 73 protein:vir:2344 Length: 397 # 98.4 4.6E-07 2.9E-10 55.3 20.2 353 1-392 10-394 (397) 74 protein:vir:78223 Length: 333 98.4 1.7E-07 1E-10 57.8 17.5 281 1-305 20-333 (333) 75 protein:vir:105905 Length: 304 98.4 1.4E-07 8.8E-11 58.1 17.0 266 1-312 1-304 (304) 76 protein:vir:94142 Length: 304 98.4 1.4E-07 8.8E-11 58.1 17.0 266 1-312 1-304 (304) 77 protein:vir:4339 Length: 395 # 98.4 1.6E-07 1E-10 57.8 17.2 262 1-311 117-395 (395) 78 protein:vir:6242 Length: 390 # 98.3 1.9E-07 1.2E-10 57.4 15.9 261 1-315 116-390 (390) 79 protein:vir:7771 Length: 330 # 98.3 4.9E-07 3E-10 55.2 17.9 286 1-339 1-330 (330) 80 protein:vir:104256 Length: 458 98.2 5.3E-07 3.3E-10 55.0 17.1 270 1-316 165-458 (458) 81 protein:vir:9309 Length: 324 # 98.2 4.2E-07 2.6E-10 55.6 16.5 282 1-313 30-324 (324) 82 protein:vir:96223 Length: 324 98.2 5.4E-07 3.3E-10 54.9 16.4 283 1-313 30-324 (324) 83 protein:vir:97148 Length: 324 98.2 5.3E-07 3.3E-10 55.0 16.4 282 1-313 31-324 (324) 84 protein:vir:9759 Length: 303 # 98.2 4.2E-07 2.6E-10 55.5 15.7 278 1-313 1-303 (303) 85 protein:vir:78387 Length: 349 98.2 4.8E-07 3E-10 55.2 16.0 309 1-352 1-349 (349) 86 protein:vir:100135 Length: 418 98.2 5E-07 3.1E-10 55.1 16.1 265 1-310 136-418 (418) 87 protein:vir:95763 Length: 297 98.2 6.6E-07 4.1E-10 54.5 16.7 271 1-311 9-297 (297) 88 protein:vir:78830 Length: 324 98.2 8E-07 4.9E-10 54.0 17.0 282 1-313 30-324 (324) 89 protein:vir:96392 Length: 324 98.2 8E-07 4.9E-10 54.0 17.0 282 1-313 30-324 (324) 90 protein:vir:9410 Length: 415 # 98.2 7.4E-07 4.6E-10 54.2 16.6 280 1-332 127-415 (415) 91 protein:vir:1886 Length: 385 # 98.2 1.1E-06 7E-10 53.2 17.6 263 1-312 105-385 (385) 92 protein:vir:191 Length: 385 # 98.2 1.1E-06 7E-10 53.2 17.6 263 1-312 105-385 (385) 93 protein:vir:1328 Length: 392 # 98.1 6.6E-07 4.1E-10 54.5 16.0 265 1-315 114-392 (392) 94 protein:vir:81100 Length: 415 98.1 1.1E-06 6.8E-10 53.3 17.1 279 1-332 127-415 (415) 95 protein:vir:79987 Length: 415 98.1 1.1E-06 6.8E-10 53.3 17.1 279 1-332 127-415 (415) 96 protein:vir:98339 Length: 415 98.1 1.1E-06 6.8E-10 53.3 17.1 279 1-332 127-415 (415) 97 protein:vir:78523 Length: 338 98.1 1.3E-06 8.2E-10 52.8 17.5 283 1-315 1-338 (338) 98 protein:vir:99749 Length: 324 98.1 8.5E-07 5.3E-10 53.9 16.4 283 1-313 30-324 (324) 99 protein:vir:1383 Length: 421 # 98.1 1.7E-06 1E-09 52.3 17.5 301 1-351 117-421 (421) 100 protein:vir:4600 Length: 415 # 98.1 1.5E-06 9.2E-10 52.6 16.5 279 1-332 127-415 (415) 101 protein:vir:4700 Length: 415 # 98.1 1.5E-06 9.2E-10 52.6 16.5 279 1-332 127-415 (415) 102 protein:vir:97053 Length: 390 98.1 2.2E-06 1.3E-09 51.6 17.3 259 1-314 113-390 (390) 103 protein:vir:4830 Length: 397 # 98.0 1.6E-06 1E-09 52.3 16.1 275 1-311 111-397 (397) 104 protein:vir:103955 Length: 324 98.0 2.5E-06 1.5E-09 51.3 16.6 283 1-313 30-324 (324) 105 protein:vir:93616 Length: 645 98.0 2.7E-06 1.7E-09 51.1 16.8 293 1-319 344-645 (645) 106 protein:vir:104085 Length: 320 98.0 4.2E-06 2.6E-09 50.1 17.7 281 1-304 14-320 (320) 107 protein:vir:81070 Length: 390 98.0 3.7E-06 2.3E-09 50.4 17.4 261 1-314 113-390 (390) 108 protein:vir:94989 Length: 349 98.0 3.1E-06 1.9E-09 50.8 16.6 309 1-352 1-349 (349) 109 protein:vir:8420 Length: 477 # 97.9 2E-06 1.3E-09 51.8 15.1 289 1-319 163-477 (477) 110 protein:vir:94673 Length: 419 97.9 4.1E-06 2.5E-09 50.1 16.6 267 1-309 130-419 (419) 111 protein:vir:9574 Length: 300 # 97.9 4.4E-06 2.7E-09 50.0 16.7 274 1-323 1-300 (300) 112 protein:vir:2430 Length: 318 # 97.9 8.6E-06 5.3E-09 48.4 18.0 282 1-318 14-318 (318) 113 protein:vir:2504 Length: 305 # 97.9 6.5E-06 4E-09 49.0 17.3 281 1-336 1-305 (305) 114 protein:vir:99920 Length: 311 97.9 1.2E-05 7.5E-09 47.5 18.7 283 1-313 1-311 (311) 115 protein:vir:94771 Length: 298 97.9 5.6E-06 3.5E-09 49.4 16.6 268 1-312 1-298 (298) 116 protein:vir:8187 Length: 311 # 97.9 1.3E-05 7.9E-09 47.4 18.3 288 1-314 1-311 (311) 117 protein:vir:101607 Length: 379 97.8 8.7E-06 5.4E-09 48.3 16.6 262 1-311 109-379 (379) 118 protein:vir:10364 Length: 390 97.8 1.7E-05 1E-08 46.7 18.0 260 1-314 114-390 (390) 119 protein:vir:4953 Length: 397 # 97.8 1.5E-05 9.3E-09 47.0 17.4 276 1-311 109-397 (397) 120 protein:vir:6212 Length: 434 # 97.8 3.8E-06 2.3E-09 50.3 14.1 276 1-316 141-434 (434) 121 protein:vir:4511 Length: 409 # 97.7 1.6E-05 9.8E-09 46.9 17.1 273 1-324 117-409 (409) 122 protein:vir:1638 Length: 298 # 97.7 2.3E-05 1.4E-08 46.0 17.8 268 1-312 1-298 (298) 123 protein:vir:95875 Length: 401 97.7 1.2E-05 7.6E-09 47.5 16.2 306 1-337 19-401 (401) 124 protein:vir:80376 Length: 435 97.7 2.6E-05 1.6E-08 45.8 17.7 281 1-315 135-435 (435) 125 protein:vir:102119 Length: 404 97.6 6.5E-06 4.1E-09 49.0 13.8 276 1-311 110-404 (404) 126 protein:vir:81227 Length: 413 97.6 2.5E-05 1.6E-08 45.8 17.0 272 1-310 118-413 (413) 127 protein:vir:95376 Length: 425 97.6 8E-06 5E-09 48.5 14.3 269 1-311 144-425 (425) 128 protein:vir:4997 Length: 397 # 97.6 2.1E-05 1.3E-08 46.2 16.5 276 1-324 109-397 (397) 129 protein:vir:4856 Length: 293 # 97.6 3.5E-05 2.2E-08 45.0 18.0 271 1-333 5-293 (293) 130 protein:vir:81160 Length: 371 97.6 2.5E-05 1.6E-08 45.8 16.7 271 1-314 91-371 (371) 131 protein:vir:96762 Length: 632 97.6 1.5E-05 9E-09 47.1 15.2 260 1-306 347-632 (632) 132 protein:vir:4226 Length: 326 # 97.6 3.1E-05 1.9E-08 45.3 16.9 279 1-316 22-326 (326) 133 protein:vir:7409 Length: 408 # 97.6 3.4E-05 2.1E-08 45.1 17.1 278 1-344 116-408 (408) 134 protein:vir:1433 Length: 435 # 97.6 4.7E-05 2.9E-08 44.3 18.2 281 1-315 130-435 (435) 135 protein:vir:100172 Length: 394 97.5 2.7E-05 1.7E-08 45.6 15.8 271 1-326 111-394 (394) 136 protein:vir:1025 Length: 408 # 97.5 4.6E-05 2.9E-08 44.4 16.6 278 1-320 121-408 (408) 137 protein:vir:8102 Length: 543 # 97.5 5.9E-05 3.6E-08 43.8 18.5 273 1-315 249-543 (543) 138 protein:vir:105038 Length: 428 97.4 6.5E-05 4E-08 43.5 18.6 280 1-313 125-428 (428) 139 protein:vir:1268 Length: 397 # 97.4 7.3E-05 4.5E-08 43.3 16.4 262 1-309 123-397 (397) 140 protein:vir:3991 Length: 404 # 97.4 8.3E-05 5.2E-08 42.9 17.1 274 1-324 116-404 (404) 141 protein:vir:485 Length: 407 # 97.3 0.00011 6.7E-08 42.3 16.3 272 1-318 106-407 (407) 142 protein:vir:5739 Length: 366 # 97.2 0.00012 7.5E-08 42.1 17.8 279 1-313 64-366 (366) 143 protein:vir:3845 Length: 395 # 97.2 0.00014 8.4E-08 41.8 16.3 277 1-326 105-395 (395) 144 protein:vir:4456 Length: 401 # 97.2 0.0001 6.5E-08 42.4 15.2 267 1-314 107-401 (401) 145 protein:vir:100247 Length: 425 97.2 0.00015 9.3E-08 41.6 16.4 267 1-315 130-425 (425) 146 protein:vir:3870 Length: 400 # 96.8 0.00034 2.1E-07 39.6 15.5 257 1-312 140-400 (400) 147 protein:vir:3158 Length: 321 # 96.7 0.00028 1.7E-07 40.1 14.2 275 1-316 24-321 (321) 148 protein:vir:100884 Length: 389 96.7 0.00042 2.6E-07 39.1 16.0 267 1-324 109-389 (389) 149 protein:vir:79928 Length: 393 96.6 0.00017 1E-07 41.3 12.1 282 1-324 74-393 (393) 150 protein:vir:95131 Length: 325 96.6 0.00051 3.2E-07 38.6 16.3 300 1-349 1-325 (325) 151 protein:vir:78640 Length: 352 96.3 0.00082 5.1E-07 37.5 15.2 260 1-320 83-352 (352) 152 protein:vir:4092 Length: 390 # 96.1 0.001 6.3E-07 37.0 15.4 281 1-324 84-390 (390) 153 protein:vir:9704 Length: 394 # 96.0 0.0011 7.1E-07 36.7 15.7 259 1-311 133-394 (394) 154 protein:vir:93696 Length: 364 95.5 0.0019 1.2E-06 35.5 18.2 292 1-310 1-364 (364) 155 protein:vir:80128 Length: 466 95.4 0.0021 1.3E-06 35.3 14.3 282 1-311 154-466 (466) 156 protein:vir:8324 Length: 410 # 95.2 0.0017 1E-06 35.8 11.7 265 1-311 136-410 (410) 157 protein:vir:2685 Length: 387 # 95.1 0.0027 1.7E-06 34.6 14.6 260 1-314 118-387 (387) 158 protein:vir:94424 Length: 387 95.1 0.0027 1.7E-06 34.6 14.6 260 1-314 118-387 (387) 159 protein:vir:96978 Length: 387 95.1 0.0027 1.7E-06 34.6 14.6 260 1-314 118-387 (387) 160 protein:vir:102873 Length: 392 95.1 0.0028 1.7E-06 34.6 16.5 273 1-312 106-392 (392) 161 protein:vir:102082 Length: 392 95.1 0.0028 1.7E-06 34.6 16.5 273 1-312 106-392 (392) 162 protein:vir:107593 Length: 392 95.1 0.0028 1.7E-06 34.6 16.5 273 1-312 106-392 (392) 163 protein:vir:105004 Length: 392 95.1 0.0028 1.7E-06 34.6 16.5 273 1-312 106-392 (392) 164 protein:vir:93881 Length: 387 95.1 0.0028 1.7E-06 34.6 14.4 259 1-314 118-387 (387) 165 protein:vir:9643 Length: 377 # 95.1 0.0028 1.7E-06 34.6 14.3 256 1-311 82-377 (377) 166 protein:vir:95963 Length: 395 95.1 0.0028 1.7E-06 34.6 14.2 272 1-330 91-395 (395) 167 protein:vir:4197 Length: 314 # 94.7 0.0036 2.3E-06 34.0 17.4 272 1-317 19-314 (314) 168 protein:vir:7855 Length: 497 # 94.4 0.0046 2.9E-06 33.4 16.8 270 1-317 151-497 (497) 169 protein:vir:101650 Length: 497 94.4 0.0046 2.9E-06 33.4 16.8 270 1-317 151-497 (497) 170 protein:vir:9361 Length: 402 # 94.3 0.0048 3E-06 33.3 15.0 259 1-314 133-402 (402) 171 protein:vir:2770 Length: 318 # 94.1 0.0054 3.3E-06 33.0 15.2 228 1-239 22-318 (318) 172 protein:vir:98635 Length: 377 93.7 0.0068 4.2E-06 32.5 12.3 256 1-311 79-377 (377) 173 protein:vir:101291 Length: 381 93.3 0.0081 5E-06 32.0 13.3 269 1-328 76-381 (381) 174 protein:vir:9509 Length: 381 # 93.3 0.0081 5E-06 32.0 13.3 269 1-328 76-381 (381) 175 protein:vir:962 Length: 397 # 93.1 0.0087 5.4E-06 31.9 13.4 258 1-321 138-397 (397) 176 protein:vir:96792 Length: 315 91.0 0.018 1.1E-05 30.2 16.7 295 1-334 1-315 (315) 177 protein:vir:1084 Length: 437 # 89.6 0.026 1.6E-05 29.3 16.2 268 1-314 156-437 (437) 178 protein:vir:105610 Length: 430 89.1 0.028 1.8E-05 29.1 14.5 304 1-315 1-430 (430) 179 protein:vir:10123 Length: 404 88.4 0.032 2E-05 28.7 16.7 300 1-308 22-404 (404) 180 protein:vir:104439 Length: 404 88.4 0.032 2E-05 28.7 16.7 300 1-308 22-404 (404) 181 protein:vir:819 Length: 404 # 88.4 0.032 2E-05 28.7 16.7 300 1-308 22-404 (404) 182 protein:vir:3298 Length: 404 # 88.4 0.032 2E-05 28.7 16.7 300 1-308 22-404 (404) 183 protein:vir:78350 Length: 383 88.2 0.034 2.1E-05 28.7 12.7 263 1-317 83-383 (383) 184 protein:vir:100632 Length: 381 82.6 0.075 4.7E-05 26.7 14.7 271 1-328 80-381 (381) 185 protein:vir:4159 Length: 315 # 80.2 0.098 6.1E-05 26.1 16.0 270 1-308 19-315 (315) No 1 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=5e-93 Score=526.59 Aligned_cols=392 Identities=100% Similarity=1.401 Sum_probs=357.2 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) |||++|+||+|+++++++|+++|+|+++|||||++||.+++||||+||+|+.+.+++|+..+.+.+.++.+|++.+++++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEE Confidence 99999999999999999999999999999999999999999999999999999999999888888889999999999999 Q ss_pred EEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhh Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 160 (392) ++|||++|++|+|+|+|+.+.+.|+++++++|++++||+++|.++++++..+++..........+...|+.|++++++|+ T Consensus 81 ~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~ 160 (392) T protein:vir:99 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) T ss_pred EEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999988877777777888899999999999999 Q ss_pred hccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhh Q lcl|Aclame:pro 161 ELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPA 240 (392) Q Consensus 161 ~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~ 240 (392) ++++|+||+++++|++++.|+++++|.+..+.|+....++++|.+|+++||+|++++++|..+...++++++..+...+. T Consensus 161 ~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v 240 (392) T protein:vir:99 161 ELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPA 240 (392) T ss_pred hcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccccceeeecccccccccccc Confidence 99999999999999999999999999999999988888899999999999999999999999999999999888888777 Q ss_pred ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccccceeee Q lcl|Aclame:pro 241 PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITA 320 (392) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~ 320 (392) .+.+........+.......++..++.....+....+.+.+..........+......+......+.+.++.++...+++ T Consensus 241 ~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~ 320 (392) T protein:vir:99 241 PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITA 320 (392) T ss_pred ccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecccceeEe Confidence 77776666666676666777777777777777777777777666655555555555566666667778888889989999 Q ss_pred eeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCCceEEEEecceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 321 AAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAGGLVTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 321 ~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~G~VTa~~~GtatITat~~~~~g~~tat~~VtVv 392 (392) ..+.+.++++++.+.+.++.++.++|+||||+|||||++|+|||+++|+++|||++.+.+|+++++|+|||| T Consensus 321 ~~~~~~~~~~t~~~~~~~~~~~~vtw~Ssn~~vAtV~~~G~Vt~v~~G~atITa~~~~~~~~~t~t~~vtV~ 392 (392) T protein:vir:99 321 AAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAGGLVTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) T ss_pred eeccceeEEEEEEecCCccccceEEEEEcCCeeEEEcCCceEEEEecceEEEEEEEEcCCCcEEEEEEEEeC Confidence 999999999999999999988999999999999999999999999999999999999999999999999999 No 2 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=4.8e-52 Score=301.94 Aligned_cols=358 Identities=17% Similarity=0.173 Sum_probs=223.1 Q ss_pred Cc---cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MA---NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Ma---n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) || |+||+||+|++++|+.|+++|+|+++|||||++||. +.||||+||+|+.+.++|+. ++.++++.++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~-~~GDTV~I~vp~~~~v~dg~--------~~~~~~~te~ 71 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFG-KVGDTIRLKLPYRVKSASGR--------TLVKQPMVDQ 71 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHh-hCCCEEEEeeCCceeecccC--------Cccccccccc Confidence 98 899999999999999999999999999999999996 57999999999999998852 3668899999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) .++++|||+||++|+|+|+|+.+.+.+|+++++++++++||+++|.+++.+++.+++..+... .....|++|+++++ T Consensus 72 ~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~g---t~~~~~~~i~~a~~ 148 (418) T protein:vir:10 72 TIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPG---VRPGAFIDFANAGA 148 (418) T ss_pred eEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCC---cCcchHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999998876544322 23346999999999 Q ss_pred HhhhccCCC-C-CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccccccc Q lcl|Aclame:pro 158 ALNELYIPQ-G-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMA 235 (392) Q Consensus 158 ~l~~~~vp~-~-r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a 235 (392) +|++++||+ | |++|++|+.++.|+++..+.. ...+. ..++|+|.+|+++||+|++++++|..+...++.+.+..+ T Consensus 149 ~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~-~~~~~--~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~g 225 (418) T protein:vir:10 149 KQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLF-KESMV--EQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNG 225 (418) T ss_pred HHHhcCCCCCCceEEEeCHHHHHHHhhhccccc-ccccc--chhhheeeeeeeeceEEEEecCCCcccccccccceeeec Confidence 999999994 5 999999999999988877643 33333 357999999999999999999999877665554443332 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeeccccc--------------ceeeeEEEeeccccceeeeeccc Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT--------------YFGLKVVEDPNGVGFVRARKIHL 301 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~ 301 (392) ........... +. +..........+...... .....+........ .....+.. T Consensus 226 a~~~~~~~~~~------~~------t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~-~~~~tv~i 292 (418) T protein:vir:10 226 TVVNGDTVGFD------GG------TASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDA-GGAGSIKI 292 (418) T ss_pred ccccceeEEEe------ec------ceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccc-cCcceeEe Confidence 22111111000 00 000000011111111111 11111111100000 00001111 Q ss_pred eeeee----eecccccc------cceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCCceEEEEecceEE Q lcl|Aclame:pro 302 IPGSI----EVAPEAGA------NATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAGGLVTGVAAGTST 371 (392) Q Consensus 302 ~~~~v----~v~~~~~~------~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~G~VTa~~~Gtat 371 (392) .+... ........ -..++.... ....+++.........+++.|+-+-...++..= .-..+.+... T Consensus 293 ~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a--~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l---~~p~g~~~~~ 367 (418) T protein:vir:10 293 SPSLNDGTATINNENGDPVSLTAYQNVTALPA--DNAPITVLGAANTTYEQNYLFHRDAIALAMIDL---ELPQSAVIKS 367 (418) T ss_pred ccccccccccccccccccccccCCCccccccc--CcceeeeecccccceeeeeeeecceEEEEEeec---cCCCCCCcce Confidence 11100 00000000 000011111 111233322223333456777777777777652 2222233333 Q ss_pred EEEEE-----------ecCCCcEEEEEEEEeC Q lcl|Aclame:pro 372 VTATL-----------VTPSGDREDTIVITVV 392 (392) Q Consensus 372 ITat~-----------~~~~g~~tat~~VtVv 392 (392) ++++. .+.. ...-.|.+.++ T Consensus 368 ~~~~~~~G~s~r~~~~~d~~-~~~~~~r~d~l 398 (418) T protein:vir:10 368 RAADPETGLSLTLTGAYDIN-EQSEIHRIDAV 398 (418) T ss_pred EEEeccCCeEEEEEEccccc-ccceEEEEEee Confidence 33331 1111 11223333333 No 3 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=1.1e-51 Score=299.90 Aligned_cols=375 Identities=10% Similarity=0.072 Sum_probs=224.5 Q ss_pred Cccccc--cHHHHHHHHHHHHHHhhcccceeeeccccccc-CCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFS--KPTAVVDTAIQMLQNELILTNLVWLNGIGDFA-HKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~--~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~-~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||+|+ +||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+.+.+.+|..... ..+.++++.++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~---~~~~~~~l~e~ 77 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDI---SGQNKNNLISG 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCccc---CCcccCccccc Confidence 999996 59999999999999999999999999999996 5799999999999999999865332 33678999999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++++|||+||++|+|+|+|+.+++.+| ++++++|+++||++||.++++++......... +.+ .+...|++|+++++ T Consensus 78 ~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~g-t~~-t~~~a~~~i~~a~~ 154 (423) T protein:vir:17 78 KATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLG-SPN-TPITKWSDVAQTAS 154 (423) T ss_pred eeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cCC-cccccHHHHHHHHH Confidence 9999999999999999999999999998 79999999999999999999997664433222 222 22246999999999 Q ss_pred HhhhccCCC-CCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeee-eeEeeeEEEEecceeecccceeecccccc Q lcl|Aclame:pro 158 ALNELYIPQ-GRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARL-GRIYGYEIVESTLIPHGDAYLYHPTAFIM 234 (392) Q Consensus 158 ~l~~~~vp~-~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~i-g~~~g~~v~~s~~v~~~~~~~~~~~a~~~ 234 (392) +|+++++|+ ||++|++|+.+..|++++. |......++ +++++|.+ |+++||+||+|+++|.++...++.+.... T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~---~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~ 231 (423) T protein:vir:17 155 FLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVR---TAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVK 231 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhccccceecccccch---HHHhhccceeeecceEEEEeCCCccccccceeceeeec Confidence 999999995 7999999999999998765 444333333 67899987 89999999999999988888776654322 Q ss_pred cchhhhccccccccceeecccceeeeeeeccccceeeeecccccceee-------------------eEEEeecccccee Q lcl|Aclame:pro 235 ATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGL-------------------KVVEDPNGVGFVR 295 (392) Q Consensus 235 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~ 295 (392) . ............ .....+....+...+......+..+....... .+..+... .... T Consensus 232 ~--~~~v~~~a~~~~-~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~-~a~~ 307 (423) T protein:vir:17 232 T--QPTVTYNAVKDS-YQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANS-DSSG 307 (423) T ss_pred c--cccccccccccc-cceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccc-cccC Confidence 1 111111111011 00111122222222222222222221111110 00000000 0000 Q ss_pred eeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCC--ceEE---EEecceE Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAG--GLVT---GVAAGTS 370 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~--G~VT---a~~~Gta 370 (392) ...+...+..+.. .....-..++.....+ ..+++..........++.|+-+-...+++.-. |..- +--.|-. T Consensus 308 ~~tv~i~p~~i~~-~~~~~~~~v~a~~a~~--~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s 384 (423) T protein:vir:17 308 DVTVTLSGVPIYD-TTNPQYNSVSRQVAAG--DAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFS 384 (423) T ss_pred ceEEEecCccccc-cCCcccccceecccCC--ceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeecccCCcE Confidence 0011111110000 0000000111111111 12222222222334567777777666666421 1100 0001211 Q ss_pred EEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 371 TVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 371 tITat~~~~~g~~tat~~VtVv 392 (392) ..-.++.+.. ...-.|.+.|+ T Consensus 385 ~r~~~~~d~~-~~~~~~r~d~l 405 (423) T protein:vir:17 385 IRVHKYADGD-ANVQKMRFDLL 405 (423) T ss_pred EEEEEecccc-cceeEEEEEee Confidence 1111111111 12223444444 No 4 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=8.7e-52 Score=300.51 Aligned_cols=375 Identities=10% Similarity=0.061 Sum_probs=225.1 Q ss_pred Cccccc--cHHHHHHHHHHHHHHhhcccceeeeccccccc-CCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFS--KPTAVVDTAIQMLQNELILTNLVWLNGIGDFA-HKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~--~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~-~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||+|+ +||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+.+++.+|.... ...+.++++.++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~---~~~~~~~dl~e~ 77 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGD---ISGQNKNNLISG 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCcc---ccccccCccccc Confidence 999996 59999999999999999999999999999995 679999999999999999997532 234678999999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++++|||+||++|+|+|+|+.+++.+| ++++++|+++||++||.++++++....+..... .+. +...|++++++++ T Consensus 78 ~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt-~~t-~~~a~~~i~~a~~ 154 (423) T protein:vir:10 78 KATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGS-PNT-PITKWSDVAQTAS 154 (423) T ss_pred eeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-CCc-ccchHHHHHHHHH Confidence 9999999999999999999999999998 899999999999999999999887765543322 222 2346999999999 Q ss_pred HhhhccCCC-CCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeee-eeEeeeEEEEecceeecccceeecccccc Q lcl|Aclame:pro 158 ALNELYIPQ-GRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARL-GRIYGYEIVESTLIPHGDAYLYHPTAFIM 234 (392) Q Consensus 158 ~l~~~~vp~-~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~i-g~~~g~~v~~s~~v~~~~~~~~~~~a~~~ 234 (392) +|+++++|. ||++|++|+.+..|++++. |......++ +++++|.+ |+++||++|+|+++|.++...++.+.... T Consensus 155 ~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~---~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~ 231 (423) T protein:vir:10 155 FLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVR---TAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVK 231 (423) T ss_pred HHHhccCCcCCCEEEeChHHHHHHhccccceecccccch---hhhhhccceeeecceEEEEeCCCccccccccccceeee Confidence 999999994 7999999999999997665 444443333 67899987 99999999999999998888777654331 Q ss_pred cchhhhccccccccceeecccceeeeeeeccccceeeeeccccccee-------------------eeEEEeecccccee Q lcl|Aclame:pro 235 ATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFG-------------------LKVVEDPNGVGFVR 295 (392) Q Consensus 235 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~ 295 (392) . . .......... ......+....+...+......+......... ..+..+....+.. T Consensus 232 ~-~-~~v~~~a~~~-a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g- 307 (423) T protein:vir:10 232 T-Q-PTVTYNAVKD-SYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGG- 307 (423) T ss_pred e-c-ceeccccccc-cceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCC- Confidence 1 1 1111110000 00001111111111111111111111111110 0011000000000 Q ss_pred eeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCC--ceE---EEEecceE Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAG--GLV---TGVAAGTS 370 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~--G~V---Ta~~~Gta 370 (392) ...+...+..+.. .....-..++.....+ ..+++..........++.|+-+-...++..-. |.. ++--.|.. T Consensus 308 ~~tv~i~p~~i~~-~~~~~~~~v~a~~a~~--~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s 384 (423) T protein:vir:10 308 DVTVTLSGVPIYD-TTNPQYNSVSRQVEAG--DAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFS 384 (423) T ss_pred ceeeeccCccccc-cCCcccccccccccCC--ceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeeccccCce Confidence 0001111110000 0000000111111111 12222222222334567777777767666421 110 00011222 Q ss_pred EEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 371 TVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 371 tITat~~~~~g~~tat~~VtVv 392 (392) ..-.++.+.. ...-.|.+.|+ T Consensus 385 ~r~~~~~d~~-~~~~~~r~d~l 405 (423) T protein:vir:10 385 IRVHKYADGD-ANVQKMRFDLL 405 (423) T ss_pred EEEEEeeecc-ccceEEEEEee Confidence 2222222221 12234444444 No 5 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=1.3e-50 Score=294.03 Aligned_cols=374 Identities=9% Similarity=0.061 Sum_probs=224.4 Q ss_pred Ccccccc--HHHHHHHHHHHHHHhhcccceeeeccccccc-CCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFSK--PTAVVDTAIQMLQNELILTNLVWLNGIGDFA-HKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~~--~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~-~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||+|++ ||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+.++++||... ....+.++++.++ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~---~~~~~~~~~~~e~ 77 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETG---DITGKDKNGLFSA 77 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCc---CCCCccccccccc Confidence 9999966 9999999999999999999999999999996 57899999999999999998532 2345778999999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh-ccccccccccccccchhhHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAPYEAAGAVHEVAPDEFFKGVNGAR 156 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~a~ 156 (392) +++++|||+||++|+|+|+|+.+++.+| ++++++++++||+++|.+++..+. .+++..+. ...+...|++|++++ T Consensus 78 ~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~vgt---~~t~~~~~~~i~~a~ 153 (423) T protein:vir:35 78 KATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSLGS---PNTAIKKWADVAQTA 153 (423) T ss_pred eeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---ccCCcchHHHHHHHH Confidence 9999999999999999999999999999 689999999999999999998654 44443322 223335699999999 Q ss_pred HHhhhccCCC-CCEEEEchHHHHHhhccc-ceeeeeccccceeeeEeeeee-eeEeeeEEEEecceeecccceeeccccc Q lcl|Aclame:pro 157 RALNELYIPQ-GRVLVVGTAVTEQILNDD-RFIKYESQGQSAVSALQEARL-GRIYGYEIVESTLIPHGDAYLYHPTAFI 233 (392) Q Consensus 157 ~~l~~~~vp~-~r~~vv~~~~~~~l~~~~-~~~~~~~~G~~~~~a~~~g~i-g~~~g~~v~~s~~v~~~~~~~~~~~a~~ 233 (392) ++|++.++|+ +|++|++|+.+..|++++ +|......++ +++++|.+ |+++||+||+|+++|.++...++..... T Consensus 154 ~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~---~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v 230 (423) T protein:vir:35 154 SFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVR---TAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITV 230 (423) T ss_pred HHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchh---HHHhhccceeeecceEEEEcCCCccccccccccceee Confidence 9999999995 799999999999998755 4555554443 57899976 9999999999999998887766554322 Q ss_pred ccchhhhccccccccceeecccceeeeeeeccccceeeeeccccccee-------------------eeEEEeeccccce Q lcl|Aclame:pro 234 MATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFG-------------------LKVVEDPNGVGFV 294 (392) Q Consensus 234 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~ 294 (392) .. ............ .....+....+...++.....+..+...... ..+...... ... T Consensus 231 ~~--a~~v~~~a~~~~-~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~-~a~ 306 (423) T protein:vir:35 231 KT--APNVDYLSVKDS-YQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNS-TAS 306 (423) T ss_pred cc--cccccccccccc-ccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEeccccc-ccc Confidence 11 111111111110 0011111112222222222222221111100 000000000 000 Q ss_pred eeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCC--ceEE---EEecce Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAG--GLVT---GVAAGT 369 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~--G~VT---a~~~Gt 369 (392) ....+...+..+.. .-...-..++.....+ ..+++..........++.|+-+-...|++.-. |..- +--.|. T Consensus 307 g~~~v~i~p~~~~~-~~~~~~~~v~a~~a~~--~~vt~~~~a~~~~~~nl~~~~~a~~l~~~~l~~~~~~~~~~~~~~g~ 383 (423) T protein:vir:35 307 GDVTVKLSGVPIYD-EKNSQYNAVDAKVKAG--DAVSIIGTAKQQMKPNLFYNKFFCGLGTIPLPKLHSLDSAVATYEGF 383 (423) T ss_pred CceeEEcccccccc-CCCcccccccccccCC--ceeeeeecCCCceeEEEeecCceeEEEEEccccCCccceeeccccCc Confidence 00011111110000 0000000111111111 12222222233334567777777777766431 1110 011122 Q ss_pred EEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 370 STVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 370 atITat~~~~~g~~tat~~VtVv 392 (392) ...-.+..+.. ...-.|.+.|+ T Consensus 384 s~r~~~~~d~~-~~~~~~r~d~l 405 (423) T protein:vir:35 384 SIRVHKYADGD-ANKQMMRFDLL 405 (423) T ss_pred eEEEEEeeccc-cCceEEEEEee Confidence 22222222222 12334555555 No 6 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=1.3e-49 Score=288.67 Aligned_cols=378 Identities=12% Similarity=0.064 Sum_probs=217.8 Q ss_pred Ccccc--ccHHHHHHHHHHHHHHhhcccceeeeccccccc-CCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAF--SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFA-HKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~--~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~-~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||+| |+||+|++++|+.|+++|+|+++|||+|++||. ++.||||+||+|+.+.+++...... .....+++.++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~---t~~~~~~l~e~ 77 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDI---TGKSKNSLISA 77 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCccc---Ccccccccccc Confidence 99999 999999999999999999999999999999996 6789999999999999988533222 22346788899 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++++||++||++|+|+|+|+.+++.+| ++++++++++||++||++++..+......... ..+. +...|++++++++ T Consensus 78 ~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vg-t~~t-~~~a~~~~a~a~~ 154 (423) T protein:vir:10 78 KATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLG-SPNT-PIKKWSDVAQTAS 154 (423) T ss_pred eEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccccc-cccc-ccccHHHHHHHHH Confidence 9999999999999999999999999999 79999999999999999998655443333222 2222 2246899999999 Q ss_pred HhhhccCCC-CCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeee-eeEeeeEEEEecceeecccc----eeecc Q lcl|Aclame:pro 158 ALNELYIPQ-GRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARL-GRIYGYEIVESTLIPHGDAY----LYHPT 230 (392) Q Consensus 158 ~l~~~~vp~-~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~i-g~~~g~~v~~s~~v~~~~~~----~~~~~ 230 (392) +|+++++|+ +|++|++|+.++.|++++. +......++ +++++|.+ |+++||++++|+++|..+.. ..+.+ T Consensus 155 ~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~---~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~ 231 (423) T protein:vir:10 155 FLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVR---TAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVK 231 (423) T ss_pred HHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccch---HHHHhcccceeecceEEEEecCCcccccccccceeeee Confidence 999999995 7999999999999987554 444444444 57889976 99999999999999965432 23333 Q ss_pred cccccchhhhccccccccceeecc----cceeeeeeeccccceeeeecc--------cccceeeeEEEeeccccceeeee Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGD----QRIAMRWLVDYDSTITSNRSL--------IDTYFGLKVVEDPNGVGFVRARK 298 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~ 298 (392) +.....+................+ ........+........+... .+......+..+....... ... T Consensus 232 ~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~-~~t 310 (423) T protein:vir:10 232 GTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSG-DVT 310 (423) T ss_pred eeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccC-ceE Confidence 332222222111111111111100 011111111111100000000 0011111111110000000 000 Q ss_pred ccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECCC--ceE---EEEecceEEEE Q lcl|Aclame:pro 299 IHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAAG--GLV---TGVAAGTSTVT 373 (392) Q Consensus 299 ~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~~--G~V---Ta~~~GtatIT 373 (392) +...+..+. ......-..++.....+ ..+|+..........++.|+-+-...+++.-. |.. ++--.|....- T Consensus 311 v~i~p~~~~-~~~~~~~~~V~a~~a~~--~~vT~~~~~~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~ 387 (423) T protein:vir:10 311 VKISGVPIF-DAGYPQYNAVDRLLAEG--DTVSVIGTSKQAMKPNLFYNKLFCGLGTIPLPKLHSIDSAVATYEGFSIRV 387 (423) T ss_pred EEecccccc-ccCcccccceeccccCC--ceeEEeeccCCceeEEEEecCcceEEEEEcccCCCccceeecccccceEEE Confidence 111111100 00000011111111111 22333333333344567777776666666421 110 00011222222 Q ss_pred EEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 374 ATLVTPSGDREDTIVITVV 392 (392) Q Consensus 374 at~~~~~g~~tat~~VtVv 392 (392) .++.+.. ...-.|.+.|+ T Consensus 388 ~~~~d~~-~~~~~~r~d~l 405 (423) T protein:vir:10 388 HKYADGD-ANKQMMRFDLL 405 (423) T ss_pred EEeeecc-ccceEEEEEee Confidence 2222221 12234445544 No 7 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=7.5e-47 Score=273.45 Aligned_cols=268 Identities=17% Similarity=0.174 Sum_probs=200.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) |||++|+||+|++++++.|+++++|.+++||||+.++ +.||||+||+++...+.||.. .+.++.++++.++.++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc--ccCceEEEeeccccccccccc----CCCccCccccccceEE Confidence 9999999999999999999999999999999997764 679999999999999998853 3445778899999999 Q ss_pred EEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhh Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 160 (392) ++||+++++++.|+|+|+.+.++++. .+++|++++||+++|+++++++..+..... .....++...++.|++++++|+ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~-~~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALT-GSAPTDADDAFDLIAKALKELT 152 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccc-cccccchhHHHHHHHHHHHHhh Confidence 99999999999999999999999985 599999999999999999999887654432 2334455678999999999999 Q ss_pred hccCC-CCCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc---cceeeccccccc Q lcl|Aclame:pro 161 ELYIP-QGRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD---AYLYHPTAFIMA 235 (392) Q Consensus 161 ~~~vp-~~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~---~~~~~~~a~~~a 235 (392) +++|| ++|+++++|+.++.|+++++ +...+..|+. ..+++|.+|+++||+|++++++|... ...+|++++..+ T Consensus 153 ~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 99999 58999999999999999876 4456666654 46889999999999999999998654 345666666655 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeec Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKI 299 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (392) .+..................+. ..+|..+.......... .... T Consensus 231 ~q~~~~e~~r~~~~~~~~v~~~--------------------~~yg~~v~~~~~~~~l~-~~g~ 273 (273) T protein:vir:10 231 SQIDTVEALRDQDSFSDRIRAL--------------------HVYGGKVVRPTGVVVFN-KTGS 273 (273) T ss_pred eeeehhhcccCCCcceeeeeee--------------------eeeeeeEeccceEEEEe-ccCC Confidence 4433322222211111111110 11111111110000000 0000 No 8 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=7.5e-47 Score=273.45 Aligned_cols=268 Identities=17% Similarity=0.174 Sum_probs=200.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) |||++|+||+|++++++.|+++++|.+++||||+.++ +.||||+||+++...+.||.. .+.++.++++.++.++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc--ccCceEEEeeccccccccccc----CCCccCccccccceEE Confidence 9999999999999999999999999999999997764 679999999999999998853 3445778899999999 Q ss_pred EEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhh Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 160 (392) ++||+++++++.|+|+|+.+.++++. .+++|++++||+++|+++++++..+..... .....++...++.|++++++|+ T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~~-~~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALT-GSAPTDADDAFDLIAKALKELT 152 (273) T ss_pred EEEeeeeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhccccccc-cccccchhHHHHHHHHHHHHhh Confidence 99999999999999999999999985 599999999999999999999887654432 2334455678999999999999 Q ss_pred hccCC-CCCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc---cceeeccccccc Q lcl|Aclame:pro 161 ELYIP-QGRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD---AYLYHPTAFIMA 235 (392) Q Consensus 161 ~~~vp-~~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~---~~~~~~~a~~~a 235 (392) +++|| ++|+++++|+.++.|+++++ +...+..|+. ..+++|.+|+++||+|++++++|... ...+|++++..+ T Consensus 153 ~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccc--cceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 99999 58999999999999999876 4456666654 46889999999999999999998654 345666666655 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeec Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKI 299 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (392) .+..................+. ..+|..+.......... .... T Consensus 231 ~q~~~~e~~r~~~~~~~~v~~~--------------------~~yg~~v~~~~~~~~l~-~~g~ 273 (273) T protein:vir:10 231 SQIDTVEALRDQDSFSDRIRAL--------------------HVYGGKVVRPTGVVVFN-KTGS 273 (273) T ss_pred eeeehhhcccCCCcceeeeeee--------------------eeeeeeEeccceEEEEe-ccCC Confidence 4433322222211111111110 11111111110000000 0000 No 9 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=9.1e-47 Score=273.00 Aligned_cols=268 Identities=17% Similarity=0.186 Sum_probs=201.5 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) |||++|+||+|++++++.|+++++|.+++||||+. .+++||||+||+++...+.||.. .+.++.++++.++.++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~--~~~~GdTv~ip~~~~~~~~d~~~----~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEG--IASKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccc--cccCCcEEEEeecCccccccccc----CCCccCccccccceEE Confidence 99999999999999999999999999999999965 46789999999999999998853 3456778899999999 Q ss_pred EEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhh Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 160 (392) ++|+|++++++.|+|+|+.+.+++++ ++++|++++||+++|+++++++..+...... ....++...++.|++++.+|| T Consensus 75 ~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~-~~~~~~~~~~~~i~~a~~~ld 152 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTALTG-SAPSDADDAFDLIASALKELT 152 (273) T ss_pred EEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccc-ccccchhhHHHHHHHHHHHhh Confidence 99999999999999999999999985 6999999999999999999999876543322 233455678999999999999 Q ss_pred hccCC-CCCEEEEchHHHHHhhcccc-eeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccc---ceeeccccccc Q lcl|Aclame:pro 161 ELYIP-QGRVLVVGTAVTEQILNDDR-FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA---YLYHPTAFIMA 235 (392) Q Consensus 161 ~~~vp-~~r~~vv~~~~~~~l~~~~~-~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~---~~~~~~a~~~a 235 (392) +++|| ++|+++++|+++..|+++++ |...+..|+. ..+++|.+|+++||+|++++++|.... ..+|++++.++ T Consensus 153 ~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~--~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a 230 (273) T protein:vir:79 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccc--cceeeeEeeEEeceEEEecccccccCceEEEEEeccceeee Confidence 99999 58999999999999998875 6667776654 468999999999999999999997653 45577776665 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeec Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKI 299 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (392) .+..................+. ..+|..+.......... .... T Consensus 231 ~~~~~~e~~r~~~~~~~~v~~~--------------------~~yg~~v~~p~~vv~~~-~~g~ 273 (273) T protein:vir:79 231 SQIDTVEALRDQDSFSDRIRAL--------------------HVYGGKVVRPTGVVVFN-KTGS 273 (273) T ss_pred eehhhhhcccCcccceeeeeee--------------------eeeeeEEecCceEEEEe-ccCC Confidence 5443333222221111111110 11111111110000000 0000 No 10 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=2.8e-42 Score=248.35 Aligned_cols=289 Identities=13% Similarity=0.119 Sum_probs=194.1 Q ss_pred Ccccc------------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCc Q lcl|Aclame:pro 1 MANAF------------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERN 68 (392) Q Consensus 1 Man~~------------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~ 68 (392) |+|+| |+||+|++++++.|+++++|.+++ |||+.++ +.||||+||+++...+.||. ++.+ T Consensus 3 ~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~--~~Gdtv~ip~~g~~~~~d~~-----~~~~ 74 (341) T protein:vir:94 3 LGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQV--KKGDTFHVPRISELGVEDKA-----TDVP 74 (341) T ss_pred chhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccc--cCCceEEEeccCcceeeeec-----CCCc Confidence 55655 789999999999999999999987 7987775 45999999999999999984 3556 Q ss_pred cccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--------c Q lcl|Aclame:pro 69 LTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA--------V 140 (392) Q Consensus 69 ~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~--------~ 140 (392) +.++++.++.++++||+++++++.|+|+|+.+.++|++++++++++++||+++|++++++++.+....... . T Consensus 75 i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~ 154 (341) T protein:vir:94 75 VGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAI 154 (341) T ss_pred cccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccc Confidence 88899999999999999999999999999999999999999999999999999999999887654322111 1 Q ss_pred ccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI 219 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v 219 (392) .+......|+.|+++++.|++++|| ++|+++++|++++.|+++++|.+.+..|+. .+++|.+|+++||+|++++++ T Consensus 155 t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~---~l~~G~ig~i~G~~V~~Sn~l 231 (341) T protein:vir:94 155 TGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNA---PIAQGQIGSLMGVRVIRTSLI 231 (341) T ss_pred cCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccc---hhheeeeeeEeceEEEEeccc Confidence 1112234689999999999999999 589999999999999999999999888763 578999999999999999999 Q ss_pred eecccceeecccccccchh-hhccccccccceeecccceeeeeeeccccce----------------------------e Q lcl|Aclame:pro 220 PHGDAYLYHPTAFIMATRA-PAPPMGAVRSTAISGDQRIAMRWLVDYDSTI----------------------------T 270 (392) Q Consensus 220 ~~~~~~~~~~~a~~~a~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------------~ 270 (392) |..+...++.......... .....+........+................ . T Consensus 232 p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (341) T protein:vir:94 232 GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQ 311 (341) T ss_pred cccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhh Confidence 9877665544332211111 0000111000000110000000000000000 0 Q ss_pred eeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccccee Q lcl|Aclame:pro 271 SNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATI 318 (392) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~ 318 (392) .+.......+|..+... ...+.+ .....++ T Consensus 312 ~~~i~~~~~~G~~~lrp---------------~~~v~~---~~~~~~~ 341 (341) T protein:vir:94 312 VWLMVGRQAYGARLYRP---------------LHAVNI---HTTGDTV 341 (341) T ss_pred hhhhhhhhhhcccccCc---------------ceeEEE---ecCcCCC Confidence 00000000011111100 000000 0000000 No 11 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=6.6e-39 Score=229.91 Aligned_cols=333 Identities=12% Similarity=0.060 Sum_probs=215.7 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+. +.|+||+|++++++.|+++++|.+++++ .++.++.||||+||+++...+.++. ++.++.++++.+. T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~---~~~~~~~GdTV~ip~~g~~~a~d~~-----~g~~i~~~~~~~~ 86 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK---IPFEGKKGDLIHIPNISRAAVYDKQ-----PQTPVNLQARTDS 86 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc---ccceeecCceEEeeccCcceeeeec-----CCCcccccccCCc Confidence 332 3477999999999999999999998865 2445667999999999999888885 3567888999999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----------------cccc Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA----------------GAVH 141 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~----------------~~~~ 141 (392) +++++||+++++++.|+|.|+.+.+.|+++++.++++++||+++|++++.++........ .... T Consensus 87 ~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t 166 (381) T protein:vir:80 87 EFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLT 166 (381) T ss_pred eEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999987754332110 0011 Q ss_pred cccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEeccee Q lcl|Aclame:pro 142 EVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP 220 (392) Q Consensus 142 ~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~ 220 (392) ......+++.|++++++|++++|| ++|+++++|+++..|+++++|.+++..++ ..+++|.+|+++||+|++++++| T Consensus 167 ~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~---~~l~~G~Ig~i~G~~Vv~Sn~lp 243 (381) T protein:vir:80 167 GTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQV---KPVTSGVVGTILGMEVIVTTQIG 243 (381) T ss_pred cchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccc---hhhhceeeeEEcceEEEeecccc Confidence 223456789999999999999999 58999999999999999999998876544 46899999999999999999999 Q ss_pred ecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeecc Q lcl|Aclame:pro 221 HGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIH 300 (392) Q Consensus 221 ~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (392) ......++..+.......... .+..... ..........+...++.....+...+..+.+................. T Consensus 244 ~~~~t~~~~~agap~~~~~~~-~~~~~~g-~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-- 319 (381) T protein:vir:80 244 INSLTGYVNGQGAPTQPTPGV-LGSPYLP-DQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGG-- 319 (381) T ss_pred cccccceeeeccccccccccc-ccccccc-ccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeeeehh-- Confidence 876655443332211111111 1111111 111234567788888888888888877766654443332222211100 Q ss_pred ceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEE-------------EcCCceEE-ECCCce Q lcl|Aclame:pro 301 LIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFE-------------SSATDKAT-VAAGGL 361 (392) Q Consensus 301 ~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~-------------Ssn~~VAt-Vd~~G~ 361 (392) ....+ ..+..-.. -..++.+ ++.+ +. .+..+.|. .=+|.=|- .-.+|. T Consensus 320 -~~~~~--~~~~~~~~--~~~~~~~--~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 320 -ANRWA--TAVVCHPD--WLAVGVQ--QNVK----SE--SSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred -hhhhh--hhcccccc--cccccce--eEee----cc--cchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 00000 00000000 0000000 0000 00 00011111 01111000 000111 No 12 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=2.4e-38 Score=226.81 Aligned_cols=287 Identities=14% Similarity=0.117 Sum_probs=179.7 Q ss_pred Cc--c------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MA--N------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Ma--n------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) |+ | .||+||+|++++++.|++.|++.++.++.. | +.||||||+.++..++.||.. ..++.+| T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d---~--g~GDtV~InsIg~~tV~dY~~-----~~~i~~d 70 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVD---F--PDGDKLTIPSVGTPVVRSRPE-----QGDFTFD 70 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccc---c--CCCCeEEeccccccccccccC-----CCCcccc Confidence 77 2 457799999999999999999999877543 3 359999999999999999953 4568999 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------------c Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA--------------G 138 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~--------------~ 138 (392) +++++.++++|||.||++|.++| |+.|...+++..+.++++++|++.+|+.+..+++..+.... . T Consensus 71 ~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~i 149 (322) T protein:vir:31 71 NLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRF 149 (322) T ss_pred cCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccce Confidence 99999999999999999999999 99999999999999999999999999999987764332100 0 Q ss_pred ccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHH---------HhhcccceeeeeccccceeeeEeeeeeeeE Q lcl|Aclame:pro 139 AVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTE---------QILNDDRFIKYESQGQSAVSALQEARLGRI 208 (392) Q Consensus 139 ~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~---------~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~ 208 (392) ...+..+...|+.|++++.+||+++|| .|||+||+|+++. .|++|++|....++|.. ..++ .+|++ T Consensus 150 v~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a--~g~~--~Vg~~ 225 (322) T protein:vir:31 150 VGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIA--PDMQ--FVRSV 225 (322) T ss_pred eccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccch--hhHH--HHHHH Confidence 123445567899999999999999999 5899999999876 45779999998888863 2222 48999 Q ss_pred eeeEEEEecceeeccccee--ecccccccchhhhcccc-ccccceeecccceeeeeeeccccceeeeecccccceeeeEE Q lcl|Aclame:pro 209 YGYEIVESTLIPHGDAYLY--HPTAFIMATRAPAPPMG-AVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVV 285 (392) Q Consensus 209 ~g~~v~~s~~v~~~~~~~~--~~~a~~~a~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (392) +||+|+.|+.++...-... .....+.+...+.-... ........+.+..-...-...+...-.+....-..+|.... T Consensus 226 ~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~ 305 (322) T protein:vir:31 226 YGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLV 305 (322) T ss_pred hceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceee Confidence 9999999999864221111 11111111111000000 00000000000000000000000000000000011111111 Q ss_pred Eeeccccceeeeeccceeeeeeeccccc Q lcl|Aclame:pro 286 EDPNGVGFVRARKIHLIPGSIEVAPEAG 313 (392) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~v~v~~~~~ 313 (392) ......... ....++.. T Consensus 306 r~e~l~~~~-----------a~~~~~~~ 322 (322) T protein:vir:31 306 RDENLVCVL-----------ANADKVTF 322 (322) T ss_pred cccceEEEE-----------eccccccC Confidence 000000000 00000000 No 13 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.1e-35 Score=212.19 Aligned_cols=268 Identities=15% Similarity=0.174 Sum_probs=196.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce-eeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (392) ||| ++|+||+|++++++.|++.++|.+++.+++ ++.+++||+|+||++... .+.++ ..+..+.+++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~--~l~g~~G~tv~ip~~~~~g~a~~~-----~~g~~i~~~~ 73 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDN--SLEGQPGSEITVPKYKYIGDAQDV-----AEGAAIDYSA 73 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecc--cccCCCCCEEEEeeeccCCcceee-----cCCCcCcccc Confidence 998 559999999999999999999999998875 566889999999997643 23444 2345688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++. +++|.++|++..++..|+++++.+++++++++++|.++++.+.++.............+..++.+. T Consensus 74 lt~~~~~~~i~~~-~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~ 152 (278) T protein:vir:80 74 LETESVKHGIKKA-GKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFT 152 (278) T ss_pred cccceeeEeeehh-hccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHH Confidence 9999999999775 579999999999999999999999999999999999999999887665544444444556789999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) ++..+|+++++|..++++++|++++.|+++. +|......|+ ..+++|.+|++.||+|++++++|..+.+.+++++ T Consensus 153 da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 229 (278) T protein:vir:80 153 DAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGD---DLLVKGAFGELLGWEIVRTKKLADGNALAVKAGA 229 (278) T ss_pred HHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccc---cceeeccceeecceeEEEcCCCCcceEEEEeccc Confidence 9999999999999889999999999998875 6777766665 3578999999999999999999999999988887 Q ss_pred ccccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+... .+..+... ...+.......++..+......... +... T Consensus 230 i~~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~yg~~v~~~~~~v~i------t~~a------- 276 (278) T protein:vir:80 230 LKTFLKRNLLAESGRDMD--------------------HKLTKFNADQHYAVALVDETKAVKV------VPVA------- 276 (278) T ss_pred eeeeecCCcccccccchh--------------------hccceeeeeeEEEEEEEcCcceEEE------eecc------- Confidence 765433321 11111111 0011111111111111111000000 0000 Q ss_pred cccc Q lcl|Aclame:pro 311 EAGA 314 (392) Q Consensus 311 ~~~~ 314 (392) -. T Consensus 277 --~~ 278 (278) T protein:vir:80 277 --GN 278 (278) T ss_pred --CC Confidence 00 No 14 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=100.00 E-value=9.9e-35 Score=207.01 Aligned_cols=371 Identities=13% Similarity=0.100 Sum_probs=207.2 Q ss_pred CccccccH-HHHHHHHHHHHHHhhcccce--eeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFSKP-TAVVDTAIQMLQNELILTNL--VWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~~~-~~~~~~~~~~l~~~l~~~~~--v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||++.+- +++++|+++.|+++|+|+++ ++|+|+.+|. +.||||+||.|......+. .......+++.|. T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~-r~Gdti~~p~~~~~~~~~G------~~~t~~~~~i~e~ 73 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQ-RSSNTIWMPVEQESPTQEG------WDLTDKATGLLEL 73 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhh-cccceEEeccccccccccC------cccCCCCCccccc Confidence 99999885 99999999999999999986 7799988875 7899999999988777662 2222334678899 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccchhhHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA--VHEVAPDEFFKGVNGA 155 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a 155 (392) +++++|++++.++|+|+++|+ ...++.++++++++++||++||.++++++......+... .........++++.++ T Consensus 74 ~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:92 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999999999995 466667899999999999999999999987654433211 1112223357999999 Q ss_pred HHHhhhccCCC--CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeee-EeeeE-EEEecceeecccceeeccc Q lcl|Aclame:pro 156 RRALNELYIPQ--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYE-IVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 156 ~~~l~~~~vp~--~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~-~~g~~-v~~s~~v~~~~~~~~~~~a 231 (392) ++.|++.++|. +|.++++|+.+..+... +.+....+....+++++|.+|+ ++||+ ++.++.+|..+........ T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~~--l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t 229 (430) T protein:vir:92 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHhh--hccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCce Confidence 99999999995 59999999999998642 4444455555557899999997 88996 5778888776543322111 Q ss_pred ccccchhhhcccc-ccccceeeccccee-eeeeeccccceeeeecccccc--------------eeeeEEEeecccccee Q lcl|Aclame:pro 232 FIMATRAPAPPMG-AVRSTAISGDQRIA-MRWLVDYDSTITSNRSLIDTY--------------FGLKVVEDPNGVGFVR 295 (392) Q Consensus 232 ~~~a~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~ 295 (392) ...+......... ...+.....+.... ...... ......+..++..+ ..+.+..... T Consensus 230 v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~t-g~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~------ 302 (430) T protein:vir:92 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD------ 302 (430) T ss_pred eccccccccccceecccccccccccccceeeeecc-cceecccEEEecceeeeccccccccCCccEEEEEEecC------ Confidence 1111100000000 00000000000000 000000 00111111111111 1111111110 Q ss_pred eeeccceeeeeeecccccc---cceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECC---CceEEEEec-- Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGA---NATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAA---GGLVTGVAA-- 367 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~---~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~---~G~VTa~~~-- 367 (392) ...+.+.++.++....... ...-.++...-....+++.. ..+...++.|+-+--..|++.= .|.-.++.. T Consensus 303 atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~--~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~ 380 (430) T protein:vir:92 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILN--VKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTS 380 (430) T ss_pred CceeEEeccccccccccccccccccceeccccccCceeEEec--cCCcccceeEcccceEEEEecccCCCCHHHhhhhhe Confidence 1111221111111000000 00000000111111122211 1222467888888777887752 221111111 Q ss_pred ------ceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 368 ------GTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 368 ------GtatITat~~~~~g~~tat~~VtVv 392 (392) |-..+-.+..+.. +..-.|.+.|+ T Consensus 381 ~~~~~~Glsirv~~~yd~~-~~~~~~r~DvL 410 (430) T protein:vir:92 381 FSIPDVGLNGIFATQGDIS-TLSGLCRIALW 410 (430) T ss_pred eccccceEEEEEEEecccc-cCceEEEEeee Confidence 2222222222222 22446667666 No 15 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=100.00 E-value=9.9e-35 Score=207.01 Aligned_cols=371 Identities=13% Similarity=0.100 Sum_probs=207.2 Q ss_pred CccccccH-HHHHHHHHHHHHHhhcccce--eeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFSKP-TAVVDTAIQMLQNELILTNL--VWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~~~-~~~~~~~~~~l~~~l~~~~~--v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||++.+- +++++|+++.|+++|+|+++ ++|+|+.+|. +.||||+||.|......+. .......+++.|. T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~-r~Gdti~~p~~~~~~~~~G------~~~t~~~~~i~e~ 73 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQ-RSSNTIWMPVEQESPTQEG------WDLTDKATGLLEL 73 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhh-cccceEEeccccccccccC------cccCCCCCccccc Confidence 99999885 99999999999999999986 7799988875 7899999999988777662 2222334678899 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccchhhHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA--VHEVAPDEFFKGVNGA 155 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a 155 (392) +++++|++++.++|+|+++|+ ...++.++++++++++||++||.++++++......+... .........++++.++ T Consensus 74 ~v~~~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:10 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEEEEEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999999999995 466667899999999999999999999987654433211 1112223357999999 Q ss_pred HHHhhhccCCC--CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeee-EeeeE-EEEecceeecccceeeccc Q lcl|Aclame:pro 156 RRALNELYIPQ--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYE-IVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 156 ~~~l~~~~vp~--~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~-~~g~~-v~~s~~v~~~~~~~~~~~a 231 (392) ++.|++.++|. +|.++++|+.+..+... +.+....+....+++++|.+|+ ++||+ ++.++.+|..+........ T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~~--l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~t 229 (430) T protein:vir:10 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHhh--hccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCce Confidence 99999999995 59999999999998642 4444455555557899999997 88996 5778888776543322111 Q ss_pred ccccchhhhcccc-ccccceeeccccee-eeeeeccccceeeeecccccc--------------eeeeEEEeecccccee Q lcl|Aclame:pro 232 FIMATRAPAPPMG-AVRSTAISGDQRIA-MRWLVDYDSTITSNRSLIDTY--------------FGLKVVEDPNGVGFVR 295 (392) Q Consensus 232 ~~~a~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~ 295 (392) ...+......... ...+.....+.... ...... ......+..++..+ ..+.+..... T Consensus 230 v~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~t-g~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~------ 302 (430) T protein:vir:10 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSAT-TGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVD------ 302 (430) T ss_pred eccccccccccceecccccccccccccceeeeecc-cceecccEEEecceeeeccccccccCCccEEEEEEecC------ Confidence 1111100000000 00000000000000 000000 00111111111111 1111111110 Q ss_pred eeeccceeeeeeecccccc---cceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECC---CceEEEEec-- Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGA---NATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAA---GGLVTGVAA-- 367 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~---~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~---~G~VTa~~~-- 367 (392) ...+.+.++.++....... ...-.++...-....+++.. ..+...++.|+-+--..|++.= .|.-.++.. T Consensus 303 atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~--~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~ 380 (430) T protein:vir:10 303 GTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILN--VKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTS 380 (430) T ss_pred CceeEEeccccccccccccccccccceeccccccCceeEEec--cCCcccceeEcccceEEEEecccCCCCHHHhhhhhe Confidence 1111221111111000000 00000000111111122211 1222467888888777887752 221111111 Q ss_pred ------ceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 368 ------GTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 368 ------GtatITat~~~~~g~~tat~~VtVv 392 (392) |-..+-.+..+.. +..-.|.+.|+ T Consensus 381 ~~~~~~Glsirv~~~yd~~-~~~~~~r~DvL 410 (430) T protein:vir:10 381 FSIPDVGLNGIFATQGDIS-TLSGLCRIALW 410 (430) T ss_pred eccccceEEEEEEEecccc-cCceEEEEeee Confidence 2222222222222 22446667666 No 16 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=5.7e-34 Score=202.83 Aligned_cols=264 Identities=15% Similarity=0.178 Sum_probs=193.1 Q ss_pred Ccccc------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce-eeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANAF------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~~------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (392) |||.. ++||+|++++++.|++.++|.+++.+|+ ++.+++||||+||.+... .+.++ ..+..+.+++ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~--~l~g~~G~tv~iP~~~~ig~a~~~-----~~g~~i~~~~ 73 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVV-----AEGEKIPTDI 73 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecc--cccCCCCCEEEEeeecCCCccccc-----cCCCccchhh Confidence 99955 8999999999999999999999999985 677889999999987642 34444 2345688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .+++|.++|++..+...|++.++++|++++||+++|++++..+..+..... .....++.|. T Consensus 74 lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~------~~a~~~d~i~ 146 (274) T protein:vir:12 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN------ADITKLNGLQ 146 (274) T ss_pred cccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHH Confidence 999999999966 689999999999999999999999999999999999999999887654322 2235689999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. .+|+++|+|+.++.|++++ +|+.....|. ..+++|.+|++.||.|+.++.+|..+++.++..+ T Consensus 147 dA~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 222 (274) T protein:vir:12 147 SAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRSNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhccccc-cccEEEeCHHHHHHHHhhhhhhccccccccc---cceecccceeecCeeEEEeCCCCcceEEEEeccc Confidence 99999988765 7899999999999999985 6787766664 4678999999999999999999999998888877 Q ss_pred ccccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+... .+..+. .....+.......++.......... .++.... T Consensus 223 ~~~~~~~~~~vE~~Rd--------------------~~~~~d~i~~~~~y~~~~~~~~~vv------~~t~~~~------ 270 (274) T protein:vir:12 223 VKLILKRDFFLEVARD--------------------ASTKTTALYSDKHYVAYLYDESKAV------KITKGSG------ 270 (274) T ss_pred eeeeecCCceeccccc--------------------hhhcccEEEeeeEEEEEEEcCCceE------EEEcCCc------ Confidence 765443321 111111 1111111111122222221111000 0000000 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) ++.+ T Consensus 271 ------~~~~ 274 (274) T protein:vir:12 271 ------SLEM 274 (274) T ss_pred ------cccC Confidence 0000 No 17 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=5.8e-34 Score=202.79 Aligned_cols=264 Identities=14% Similarity=0.156 Sum_probs=191.2 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce-eeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (392) ||| ++++||+|++++++.|++.++|.+++..+ +++.+++||||+||++... .+.++ ..+..+.+++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig~a~~~-----~~g~~i~~~~ 73 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSGDAKVV-----AEGEKIPTDI 73 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceec--ccccCCCCCEEEeeeecCCCccccc-----cCCCccchhh Confidence 998 56899999999999999999999997666 4567788999999997642 34443 2345688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .+++|.++|++..+...|++.++++|++++||+++|++++..+..+..... .....++.|. T Consensus 74 lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~------~~~~~~d~i~ 146 (274) T protein:vir:95 74 LETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE------ADITKLTGLQ 146 (274) T ss_pred cccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHH Confidence 999999999976 689999999999999999999999999999999999999999887654321 2234589999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. .+|+++|+|+.++.|++++ +|+.....|. ..+++|.+|++.||.|+.++.+|..+.+.++..+ T Consensus 147 ~A~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA 222 (274) T protein:vir:95 147 TAIDKFNDEDL-EPMVLFISPLDAGKLRGDATTNFTRATELGD---DVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhccccc-cccEEEeCHHHHHHHHhhccccccccccccc---cceeccccceecCeEEEEeCCCCCceEEEEeccc Confidence 99999988765 7899999999999999985 6777766664 4678999999999999999999999998888887 Q ss_pred ccccchhh-hccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAP-APPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+.. ..+..+... ...+.......++......... +.++ T Consensus 223 ~~~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~y~~~~~~~~~~---------------v~~t- 266 (274) T protein:vir:95 223 VKLITKRDFFLETDRDPS--------------------TKTTALYSDKHYVAYLYDESKA---------------VKIT- 266 (274) T ss_pred eeeeecCCcccccccccc--------------------cccCEEEEeEEEEEEEEcCCcE---------------EEEE- Confidence 76644432 111111111 1111111111122211111100 0000 Q ss_pred cccccceeeeee Q lcl|Aclame:pro 311 EAGANATITAAA 322 (392) Q Consensus 311 ~~~~~~~~~~~~ 322 (392) ....++. + T Consensus 267 --k~~~~~~--~ 274 (274) T protein:vir:95 267 --KGSGSLE--M 274 (274) T ss_pred --cCCcccc--C Confidence 0000000 0 No 18 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=5.8e-34 Score=202.79 Aligned_cols=264 Identities=14% Similarity=0.156 Sum_probs=191.2 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce-eeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~-~~~~~~~~~~~~~~~~~~~~ 73 (392) ||| ++++||+|++++++.|++.++|.+++..+ +++.+++||||+||++... .+.++ ..+..+.+++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~g~~G~tv~iP~~~~ig~a~~~-----~~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEID--NTLVGQPGDTLTFPAFIYSGDAKVV-----AEGEKIPTDI 73 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceec--ccccCCCCCEEEeeeecCCCccccc-----cCCCccchhh Confidence 998 56899999999999999999999997666 4567788999999997642 34443 2345688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .+++|.++|++..+...|++.++++|++++||+++|++++..+..+..... .....++.|. T Consensus 74 lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~------~~~~~~d~i~ 146 (274) T protein:vir:96 74 LETKKREAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVE------ADITKLTGLQ 146 (274) T ss_pred cccceeEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHH Confidence 999999999976 689999999999999999999999999999999999999999887654321 2234589999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. .+|+++|+|+.++.|++++ +|+.....|. ..+++|.+|++.||.|+.++.+|..+.+.++..+ T Consensus 147 ~A~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~~~~t~~l~~~gA 222 (274) T protein:vir:96 147 TAIDKFNDEDL-EPMVLFISPLDAGKLRGDATTNFTRATELGD---DVIVKGAFGEALGAVIVRSNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhccccc-cccEEEeCHHHHHHHHhhccccccccccccc---cceeccccceecCeEEEEeCCCCCceEEEEeccc Confidence 99999988765 7899999999999999985 6777766664 4678999999999999999999999998888887 Q ss_pred ccccchhh-hccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAP-APPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+.. ..+..+... ...+.......++......... +.++ T Consensus 223 ~~~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~y~~~~~~~~~~---------------v~~t- 266 (274) T protein:vir:96 223 VKLITKRDFFLETDRDPS--------------------TKTTALYSDKHYVAYLYDESKA---------------VKIT- 266 (274) T ss_pred eeeeecCCcccccccccc--------------------cccCEEEEeEEEEEEEEcCCcE---------------EEEE- Confidence 76644432 111111111 1111111111122211111100 0000 Q ss_pred cccccceeeeee Q lcl|Aclame:pro 311 EAGANATITAAA 322 (392) Q Consensus 311 ~~~~~~~~~~~~ 322 (392) ....++. + T Consensus 267 --k~~~~~~--~ 274 (274) T protein:vir:96 267 --KGSGSLE--M 274 (274) T ss_pred --cCCcccc--C Confidence 0000000 0 No 19 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=264 Identities=14% Similarity=0.169 Sum_probs=192.0 Q ss_pred Ccccc------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANAF------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~~------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |||.. ++||+|++++++.|++.++|.+++.+|+ ++.+++|++|+||++.. ..+.++ ..+..+.+++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~--~l~g~~G~tv~iP~~~~~g~a~~~-----~~g~~i~~~~ 73 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVV-----AEGEKIPTDI 73 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecc--cccCCCCCEEEEeeecCCCccccc-----cCCCcccccc Confidence 99955 9999999999999999999999999985 56688899999998764 234444 2355688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .++.|.++|++..+...|++.++.++++++||+++|++++..+..++.... .....++.|+ T Consensus 74 lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~------~~~~~~d~i~ 146 (274) T protein:vir:94 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN------ADITKLNGLQ 146 (274) T ss_pred cccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc------ccccCHHHHH Confidence 999999999966 568999999999999999999999999999999999999999887654321 2234589999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|+..|++++. ..|+++|+|+.+..|+++. +|.+....|+ ..+++|.+|++.||+|+.++.+|..+.+.+++.+ T Consensus 147 dA~~~l~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 222 (274) T protein:vir:94 147 SAIDKFNDEDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhhccCC-CceEEEeCHHHHHHHHhhhhhhccccCcccc---cceeccccceecCeeEEEcCCCCcceEEEEeCcc Confidence 99999998776 6799999999999999985 7777776665 3578999999999999999999999988888877 Q ss_pred ccccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+... .+..+... ...+.......++.......... .++ T Consensus 223 ~~~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~y~~~~~~~~~vv---------------~~t- 266 (274) T protein:vir:94 223 VKLILKRDFFLEVARDAS--------------------TKTTALYSDKHYVAYLYDESKAV---------------KIT- 266 (274) T ss_pred eEeeecCCceeccccchh--------------------hcccEEEEEEEEEEEEEcCCceE---------------EEe- Confidence 665444321 11111111 11111111112222111111000 000 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) ....++.+ T Consensus 267 --~~~~~~~~ 274 (274) T protein:vir:94 267 --KGSGSLEM 274 (274) T ss_pred --cCcccccC Confidence 00000000 No 20 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.2e-33 Score=201.13 Aligned_cols=264 Identities=14% Similarity=0.169 Sum_probs=192.0 Q ss_pred Ccccc------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANAF------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~~------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |||.. ++||+|++++++.|++.++|.+++.+|+ ++.+++|++|+||++.. ..+.++ ..+..+.+++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~--~l~g~~G~tv~iP~~~~~g~a~~~-----~~g~~i~~~~ 73 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVV-----AEGEKIPTDI 73 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecc--cccCCCCCEEEEeeecCCCccccc-----cCCCcccccc Confidence 99955 9999999999999999999999999985 56688899999998764 234444 2355688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .++.|.++|++..+...|++.++.++++++||+++|++++..+..++.... .....++.|+ T Consensus 74 lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~------~~~~~~d~i~ 146 (274) T protein:vir:97 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN------ADITKLNGLQ 146 (274) T ss_pred cccceeEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc------ccccCHHHHH Confidence 999999999966 568999999999999999999999999999999999999999887654321 2234589999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|+..|++++. ..|+++|+|+.+..|+++. +|.+....|+ ..+++|.+|++.||+|+.++.+|..+.+.+++.+ T Consensus 147 dA~~~l~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 222 (274) T protein:vir:97 147 SAIDKFNDEDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhhccCC-CceEEEeCHHHHHHHHhhhhhhccccCcccc---cceeccccceecCeeEEEcCCCCcceEEEEeCcc Confidence 99999998776 6799999999999999985 7777776665 3578999999999999999999999988888877 Q ss_pred ccccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+... .+..+... ...+.......++.......... .++ T Consensus 223 ~~~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~y~~~~~~~~~vv---------------~~t- 266 (274) T protein:vir:97 223 VKLILKRDFFLEVARDAS--------------------TKTTALYSDKHYVAYLYDESKAV---------------KIT- 266 (274) T ss_pred eEeeecCCceeccccchh--------------------hcccEEEEEEEEEEEEEcCCceE---------------EEe- Confidence 665444321 11111111 11111111112222111111000 000 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) ....++.+ T Consensus 267 --~~~~~~~~ 274 (274) T protein:vir:97 267 --KGSGSLEM 274 (274) T ss_pred --cCcccccC Confidence 00000000 No 21 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=1.3e-33 Score=200.82 Aligned_cols=264 Identities=15% Similarity=0.152 Sum_probs=192.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||| ++++||+|++++++.|++.++|.+++.+++ ++.+++||+|+||++.. ..+.+|. .+..+.+++ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~tv~ip~~~~~g~~~~~~-----~g~~i~~~~ 73 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDS--TLVGQPGDTLTFPAFTYSGDAQVIA-----EGEKIPVDQ 73 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccc--cccCCCCCEEEEEeeccCCCccccC-----CCCcCchhh Confidence 997 558999999999999999999999998884 66788999999998763 3445442 345688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .++.|.++|++..+...|++.++.+++++++|+++|.+++..+..++... ..+...++.|+ T Consensus 74 it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~------~~~~~~~d~i~ 146 (274) T protein:vir:96 74 IGTSKREAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV------EADITKLDGLQ 146 (274) T ss_pred cccceeEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc------CcccccHHHHH Confidence 999999999976 58999999999999999999999999999999999999999987754322 22334689999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|+..|+++++ .+|+++++|+.++.|+++. +|......|+ ..+++|.+|++.||+|++++++|..+++.++..+ T Consensus 147 dA~~~l~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~---~~~~~g~ig~~~G~~Vi~s~~~p~~t~~l~~~gA 222 (274) T protein:vir:96 147 TAIDKFNDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGD---NIIVKGAFGEALGAVIVRSNKLNKGEALLAKKGA 222 (274) T ss_pred HHHHHhcccCC-CceEEEeCHHHHHHHHhcccccccccccccc---cceeecccceecCeeEEEcCCCCcceEEEEeCcc Confidence 99999998876 6799999999999999875 6777666654 4678999999999999999999999999888888 Q ss_pred ccccchhhhc-cccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 232 FIMATRAPAP-PMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 232 ~~~a~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) +.+..+.... +..+. .....+.......++.................- .+- T Consensus 223 ~~~~~~~~~~vE~~Rd--------------------~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~-----~~~ 274 (274) T protein:vir:96 223 VKLITKRDFFLEKDRD--------------------ASRKSTALYSDKHYVAYLYDESKVVKITKGAGD-----EVM 274 (274) T ss_pred eeeeecCCcccccccc--------------------hhhcccEEEEeeEEEEEEEcCccEEEEEcCccc-----ccC Confidence 7765543311 11111 111111111111222222211111100000000 000 No 22 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=1.4e-33 Score=200.65 Aligned_cols=264 Identities=14% Similarity=0.166 Sum_probs=192.4 Q ss_pred Ccccc------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANAF------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~~------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |||+. ++||+|++++++.|++.++|.+++.+++ ++.+++|++|+||++.. ..+.++ .++..+.+++ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~tv~ip~~~~~g~~~~~-----~eg~~i~~~~ 73 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDS--TLQGQPGDTLTFPAFVYSGDAQVV-----AEGEKIPTDI 73 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccc--cccCCCCCEEEEEeeccCCCcccc-----cCCCcccccc Confidence 99965 8999999999999999999999999985 56788899999999764 344444 2355688999 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.++.+++|++ .++.|.++|++..+...|++.++.+++++++++++|++++..+..+.... ......++.|+ T Consensus 74 it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~------~~~~~~~d~i~ 146 (274) T protein:vir:93 74 LETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------NADITKLNGLQ 146 (274) T ss_pred cccceeEEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cccccCHHHHH Confidence 999999999966 57899999999999999999999999999999999999999987765332 12334689999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. .+|+++++|+.++.|+++. +|......|+ ..+++|.+|++.||+|++++.+|..+.+.+++.+ T Consensus 147 dA~~~l~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~ga 222 (274) T protein:vir:93 147 SAIDKFNDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGD---DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGA 222 (274) T ss_pred HHHHHhhhccC-CccEEEeCHHHHHHHHhhhhhcccccccccc---cceeecccceecCeeEEEcCCCCcceEEEEeCCe Confidence 99999998876 6799999999999999885 6777766665 3578999999999999999999999999888888 Q ss_pred ccccchhhhc-cccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAP-PMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.+..+.... +..+. .....+.......++.......... .++. T Consensus 223 i~~~~~~~~~vE~~Rd--------------------~~~~~d~i~~~~~y~~~~~~~~~~v------~~t~--------- 267 (274) T protein:vir:93 223 VKLILKRDFFLEVARD--------------------ASTKTTALYSDKHYVAYLYDESKAV------KITK--------- 267 (274) T ss_pred EEEEecCCcccccccc--------------------hhhcccEEEEEEEEEEEEEcCCceE------EEee--------- Confidence 7765443211 11111 1111111111112222111111000 0000 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) ...++.+ T Consensus 268 ---~~~s~~~ 274 (274) T protein:vir:93 268 ---GSGSLEM 274 (274) T ss_pred ---CccccCC Confidence 0000000 No 23 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.9e-33 Score=199.98 Aligned_cols=265 Identities=17% Similarity=0.158 Sum_probs=190.2 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MAN-----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Man-----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) |+| ++++||+|++++++.|++.++|.+++..+ .++.+++|++|+||++... .+.... ..+..+.+++++ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~--~~l~g~~G~tv~iP~~~~i--g~a~~~--~~g~~i~~~~lt 76 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADID--NTLVGQPGNTITFPAFVYS--GDAKVV--PEGEEIPIDLIE 76 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceec--ccccCCCCCEEEeeeeccC--Cccccc--cCCCCcchhhcc Confidence 555 56899999999999999999999998777 4677889999999987653 232222 335568899999 Q ss_pred CceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHH Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGA 155 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a 155 (392) .++.+++|.+ ++++|.++|++..+...|++.++++|++++||+++|++++..+..+.... ..+...++.|.+| T Consensus 77 ~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~------~~~~~~~d~i~dA 149 (275) T protein:vir:96 77 TKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKV------EADITKLAGLQTA 149 (275) T ss_pred cceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cccccCHHHHHHH Confidence 9999999955 69999999999999999999999999999999999999999888754332 2234568999999 Q ss_pred HHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccccc Q lcl|Aclame:pro 156 RRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFI 233 (392) Q Consensus 156 ~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~ 233 (392) ...|++++. ..|+++++|+.+..|+++. +|......|+ ..+++|.+|++.|+.|++++.+|..+++.+++.++. T Consensus 150 ~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~---~~~~~G~ig~~~G~~Vi~s~~~p~~t~~i~~~gA~~ 225 (275) T protein:vir:96 150 IDKFNDEDL-EPMVLFVNPLDAGKLRASATDNFTRATLLGD---NVIVKGAFGEALGAIIVRSNKIKEGEAILAKRGAVK 225 (275) T ss_pred HHHhccccC-CccEEEeCHHHHHHHHhcccccccccccccc---cceeccccceecCeeEEEeCCCCcceEEEEecccee Confidence 999987765 6799999999999998874 6887777765 357899999999999999999999999888887776 Q ss_pred ccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 234 MATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 234 ~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~ 312 (392) +..+... .+..+... ...+.......++......... ..++..+. +.. T Consensus 226 ~~~~~~~~vE~~Rd~~--------------------~~~d~i~~~~~y~~~~~~~~~v------v~~t~~~~-----~~~ 274 (275) T protein:vir:96 226 LITKRDFFLETERHAS--------------------HKSTALFSDKHYVAYLYDESKV------VKITKSAS-----GLG 274 (275) T ss_pred eeecCCcccccccchh--------------------hcCcEEEEeEEEEEEEEcCccE------EEEEeccc-----ccC Confidence 6544321 11111111 1111111111122111111000 00000000 000 Q ss_pred c Q lcl|Aclame:pro 313 G 313 (392) Q Consensus 313 ~ 313 (392) + T Consensus 275 ~ 275 (275) T protein:vir:96 275 V 275 (275) T ss_pred C Confidence 0 No 24 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=100.00 E-value=7.9e-33 Score=196.59 Aligned_cols=372 Identities=13% Similarity=0.091 Sum_probs=203.4 Q ss_pred Ccccccc-HHHHHHHHHHHHHHhhcccce--eeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANAFSK-PTAVVDTAIQMLQNELILTNL--VWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~~-~~~~~~~~~~~l~~~l~~~~~--v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |||++-+ -++.-+|+|+.|+++|+|.++ ++|+|+.+|. +.||||+||.|......+.. ......+++.|+ T Consensus 1 Ma~~~~~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~-r~Gdti~ip~p~~~~~~~G~------~~t~~~~~~~e~ 73 (430) T protein:vir:21 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQ-RSSNTIWMPVEQESPTQEGW------DLTDKATGLLEL 73 (430) T ss_pred CccccchhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhh-cccceEEeeccccccccccc------cccCCCccceee Confidence 9998722 234449999999999999996 7899988875 78999999999887766532 223345689999 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--cccccchhhHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA--VHEVAPDEFFKGVNGA 155 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~i~~a 155 (392) +++++|++++.+.|+|+++|+ ...++.++++++++++||++||.+|++++......+... .........++++.++ T Consensus 74 ~v~~~~~~~~~V~~~~~~kEl--~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:21 74 NVAVNMGEPDNDFFQLRADDL--RDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred eEeEEEeeeccceEEeehhHh--cChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHH Confidence 999999999999999999985 588888999999999999999999999987654433211 1111222358999999 Q ss_pred HHHhhhccCCC--CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeee-EeeeE-EEEecceeecccceeeccc Q lcl|Aclame:pro 156 RRALNELYIPQ--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGR-IYGYE-IVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 156 ~~~l~~~~vp~--~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~-~~g~~-v~~s~~v~~~~~~~~~~~a 231 (392) ++.|++.++|. +|.++++|+.+..+... +.+....+....+++++|.+|+ ++||+ ++.++.+|..+........ T Consensus 152 ~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~--l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~t 229 (430) T protein:vir:21 152 EEIMFSRELNRDMGTSYFFNPQDYKKAGYD--LTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGIT 229 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHhhh--hccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCce Confidence 99999999995 59999999999988653 3344444444457899999997 88997 5778888876544322111 Q ss_pred ccccchhhhc----c-cccc------ccc-eeecccceeeeeeecccccee---eeecccccceeeeEEEeeccccceee Q lcl|Aclame:pro 232 FIMATRAPAP----P-MGAV------RST-AISGDQRIAMRWLVDYDSTIT---SNRSLIDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 232 ~~~a~~~~~~----~-~~~~------~~~-~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (392) ...+...... . .+.. ..+ ..+...+......+....... ......+....+.+..... . T Consensus 230 v~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~------~ 303 (430) T protein:vir:21 230 VSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVD------G 303 (430) T ss_pred eccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecC------C Confidence 1111100000 0 0000 000 000000111111111100000 0000011111111111110 0 Q ss_pred eeccceeeeeeeccccc---ccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEECC---Cce----EEEE- Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAG---ANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVAA---GGL----VTGV- 365 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~---~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd~---~G~----VTa~- 365 (392) +.+.+.+..++...... ....-+++...-....+++.. ..+...++.|+-+--..|++.= .|. .+.- T Consensus 304 ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~--~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~ 381 (430) T protein:vir:21 304 THVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILN--VKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSF 381 (430) T ss_pred ceeEEeecccccccccccccccccceeccccccCceeEEec--cCCcccceeEccceeEEEEecccCCCChhHhhheeee Confidence 11111111111100000 000000111111111222222 1122357888888777777751 121 1110 Q ss_pred ---ecceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 366 ---AAGTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 366 ---~~GtatITat~~~~~g~~tat~~VtVv 392 (392) ..|-..+-.+..+.. ...-.|.+.++ T Consensus 382 ~~~~~Glsirv~~~yd~~-~~~~~~r~Dil 410 (430) T protein:vir:21 382 SIPDVGLNGIFATQGDIS-TLSGLCRIALW 410 (430) T ss_pred eccccceEEEEEEccccc-cCceEEEEEee Confidence 012222222211111 23445666666 No 25 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=3.8e-33 Score=198.32 Aligned_cols=280 Identities=14% Similarity=0.111 Sum_probs=182.2 Q ss_pred Cc----------------c-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccc Q lcl|Aclame:pro 1 MA----------------N-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGA 63 (392) Q Consensus 1 Ma----------------n-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~ 63 (392) |+ + ++|. |+|+.+++..|.+..+|.+++++. ++. -|++|+|++.+..++.+|.+ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r---~i~--~G~tv~i~~ig~~~~~~~~~--- 77 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLR--GGKSKQFMFTGKLSAGYHTP--- 77 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc---ccc--ccceEEEEeccceeEeeecC--- Confidence 22 1 3666 999999999999999999998742 443 49999999999999999864 Q ss_pred cCCCccccc-cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----- Q lcl|Aclame:pro 64 GAERNLTVS-DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA----- 137 (392) Q Consensus 64 ~~~~~~~~~-~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~----- 137 (392) +..+.++ ++++++++|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+.+|+.++.++..+..... T Consensus 78 --g~~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~ 155 (332) T protein:vir:78 78 --GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) T ss_pred --CCCCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccc Confidence 3345554 58889999999999999999999999999999999999999999999999999988865432211 Q ss_pred --------cccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhc--ccceeeeeccccceeeeEeeee-e Q lcl|Aclame:pro 138 --------GAVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILN--DDRFIKYESQGQSAVSALQEAR-L 205 (392) Q Consensus 138 --------~~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~--~~~~~~~~~~G~~~~~a~~~g~-i 205 (392) ..+..+++...|+.|+++++.|++++|| .|||++++|++|..|++ +++|.+.+..++.. .+++|. + T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~--~~~~g~~i 233 (332) T protein:vir:78 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQG--DMNSGKGL 233 (332) T ss_pred ccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccccc--ceecceee Confidence 1223445677899999999999999999 58999999999999997 78888887776653 466665 8 Q ss_pred eeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeee-----------ccccceeeeec Q lcl|Aclame:pro 206 GRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLV-----------DYDSTITSNRS 274 (392) Q Consensus 206 g~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~ 274 (392) ++++||+|++++++|..........+.+.... .............-...+...+. ..+.....+.. T Consensus 234 ~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n---~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i 310 (332) T protein:vir:78 234 YSIAGIRILKSNNLAGLYGQDLSSAAVTGENN---DYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLI 310 (332) T ss_pred eEEeeeEEEecCccccCccccccccccccccc---ccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhh Confidence 99999999999999966543332222110000 00000000000000000000000 00000000101 Q ss_pred ccccceeeeEEEeeccccceee Q lcl|Aclame:pro 275 LIDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~ 296 (392) ..-..+|..+...........+ T Consensus 311 ~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 311 VGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhhhcCceecccceEEEeeC Confidence 0001111111111111000000 No 26 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.96 E-value=5.4e-32 Score=192.00 Aligned_cols=283 Identities=13% Similarity=0.092 Sum_probs=177.6 Q ss_pred Cccc----------------------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecc Q lcl|Aclame:pro 1 MANA----------------------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTR 58 (392) Q Consensus 1 Man~----------------------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~ 58 (392) |||. +|. |+|+.|++..|.+..+|.+++++. ++.+ |++++|++.+..++..+ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r---~i~~--g~s~~~~~iG~~~~~~~ 74 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVR---SISS--GKSAQFPVLGRTQAAYL 74 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceee---eecc--cceEEEEeeceeEEEee Confidence 7764 243 899999999999999999998752 5654 89999999999999877 Q ss_pred ccccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc--- Q lcl|Aclame:pro 59 KLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE--- 135 (392) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~--- 135 (392) .+.. .-....+++..++++|+||+.+|+.+.|+|.|+.+.++|++.++.++++++||+.+|+.++..+..+... T Consensus 75 ~~G~---~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~ 151 (344) T protein:vir:10 75 APGE---NLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQ 151 (344) T ss_pred ecCC---CCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 6422 2223346788899999999999999999999999999999999999999999999999998766432110 Q ss_pred --c-------c------ccc-----ccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeecccc Q lcl|Aclame:pro 136 --A-------A------GAV-----HEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQ 194 (392) Q Consensus 136 --~-------~------~~~-----~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~ 194 (392) . . ... ........++.|.++++.|++++|| .+||++++|++|..|+.++.|......|. T Consensus 152 ~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~ 231 (344) T protein:vir:10 152 YNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAAL 231 (344) T ss_pred cccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccc Confidence 0 0 000 0011234688899999999999999 58999999999999999998887776544 Q ss_pred ceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeeccc--ceee------------- Q lcl|Aclame:pro 195 SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQ--RIAM------------- 259 (392) Q Consensus 195 ~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~--~~~~------------- 259 (392) ..+++|.+++++||+|++++++|..... .+..+.+.... ..+.+........... +..+ T Consensus 232 ---~~~~~G~V~~v~G~~V~~Sn~lp~~~~~-~~~~~~tg~~~--~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~ 305 (344) T protein:vir:10 232 ---IDPEKGSIRNVMGFEVVEVPHLTAGGAG-TSREGTTGQKH--AFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDL 305 (344) T ss_pred ---cceeeeEEEEEeceEEEeccccccccCC-cccccccCccc--cccCCcccceeeecceeEEEeechhhhhhhhhccc Confidence 3478999999999999999999864322 12222111110 0000000000000000 0000 Q ss_pred eeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 260 RWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) ..-..++.....+....-..+|..+....... .++.... T Consensus 306 ~~e~~r~~~~~~d~i~g~~~~G~~vlRPe~a~-------------~v~~~~~ 344 (344) T protein:vir:10 306 ALERARRANFQADQIIAKYAMGHGGLRPEAAG-------------AVVFKTK 344 (344) T ss_pred eeecccchhHHHHHHHHHhhcccceecccceE-------------EEEeecC Confidence 00000011111111100011111111100000 0000000 No 27 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.96 E-value=7.5e-32 Score=191.21 Aligned_cols=261 Identities=14% Similarity=0.122 Sum_probs=182.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) ||| ++++||+|++++++.|.+.++|.+++.+++ ++.+++|+||+||++... .+.+.. .++..+.++++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~--~l~g~~G~ti~iP~~~~~--gda~~~--~eg~~i~~~~l 74 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDT--TLQGQPGNTLKFPAFTYI--GDAADV--AEGGEISLDKI 74 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhcccccccc--ccccCCCCEEEEeeeccC--cccccc--CCCCccChhhc Confidence 997 558899999999999999999999998874 567889999999987643 333322 34566889999 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNG 154 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 154 (392) +.+..+++|++ .++.|.++|++..+...|++.++.+++++++|+++|++++..+.++... .+....++.|.+ T Consensus 75 t~~~~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~-------~~~~~~~d~i~~ 146 (272) T protein:vir:36 75 GTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQT-------VSTKANVDGVQA 146 (272) T ss_pred CCcceeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------ccccccHHHHHH Confidence 99999999966 5789999999999999999999999999999999999999988765432 233456899999 Q ss_pred HHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccc----eeecc Q lcl|Aclame:pro 155 ARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAY----LYHPT 230 (392) Q Consensus 155 a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~----~~~~~ 230 (392) |+..|++++.+ .|+++++|..++.|+++..|......+. ...+++|.+|++.|++|++++.+|.++.. .+.+. T Consensus 147 A~~~lgd~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~--~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~g 223 (272) T protein:vir:36 147 ALDIFNDEDAQ-AYVLIVNPKDAAKIRKDANAKNIGSEVG--ANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSP 223 (272) T ss_pred HHHHhhhcCCC-ceEEEEcHHHHHHHhccccccccccccc--ccceeeeccceecCeeEEEeCCCCCCceeEEEEEeccc Confidence 99999998885 5899999999999999988766643322 24688999999999999999999977653 22233 Q ss_pred cccccchhh-hccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeec Q lcl|Aclame:pro 231 AFIMATRAP-APPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA 309 (392) Q Consensus 231 a~~~a~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 309 (392) ++.+..+.. ..+..+ +.....+.......++.......... .++.. T Consensus 224 A~~~~~~~~~~vE~~R--------------------~~~~~~d~i~~~~~y~~~v~~~~~vv-------------~~t~~ 270 (272) T protein:vir:36 224 ALKLVLKRGVQVETDR--------------------DIVTKTTVITADEHYAAYLYDLTKVV-------------NITFT 270 (272) T ss_pred ceeeeecCCccccccc--------------------chhhcCcEEEEEEEEEEEEEcCccEE-------------EEeec Confidence 332221111 111111 11111111111111221111111000 00011 Q ss_pred cc Q lcl|Aclame:pro 310 PE 311 (392) Q Consensus 310 ~~ 311 (392) ++ T Consensus 271 g~ 272 (272) T protein:vir:36 271 GV 272 (272) T ss_pred CC Confidence 11 No 28 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.96 E-value=5e-32 Score=192.20 Aligned_cols=285 Identities=11% Similarity=0.046 Sum_probs=180.6 Q ss_pred Ccc---------------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccc Q lcl|Aclame:pro 1 MAN---------------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) Q Consensus 1 Man---------------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~ 59 (392) ||| .+|+ |+|+.+++..|+++.+|.++++.. ++ +.|++++|++.+..++.+|. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r---~~--~~G~sv~i~~iG~~t~~~~~ 74 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SI--ASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc---cc--cccceeEeeeccceeeeeec Confidence 654 2455 999999999999999999998742 34 34999999999999999986 Q ss_pred cccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc----- Q lcl|Aclame:pro 60 LRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY----- 134 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~----- 134 (392) +.. +-+...+++.+.++.|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+++|+.++..+..+.. T Consensus 75 ~g~---~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~ 151 (347) T protein:vir:33 75 PGE---NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGS 151 (347) T ss_pred CCC---CCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 522 222234667889999999999999999999999999999999999999999999999999865532110 Q ss_pred --------cc-----ccccccc------cchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeecccc Q lcl|Aclame:pro 135 --------EA-----AGAVHEV------APDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQ 194 (392) Q Consensus 135 --------~~-----~~~~~~~------~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~ 194 (392) .. ...+++. .....|+.|++++..|++++|| .+||++++|++|..|+++++|...++.|+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~ 231 (347) T protein:vir:33 152 NENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL 231 (347) T ss_pred ccccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccc Confidence 00 0001111 1245689999999999999999 58999999999999999999998887654 Q ss_pred ceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhcccccccc-ceeecc--ccee-----e------- Q lcl|Aclame:pro 195 SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRS-TAISGD--QRIA-----M------- 259 (392) Q Consensus 195 ~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~-~~~~~~--~~~~-----~------- 259 (392) ..+++|.+++++||+|++|+++|...... +..+... +........... ...... .+.. . T Consensus 232 ---~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~-~~~~~~a--g~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~ 305 (347) T protein:vir:33 232 ---LDPERGTIRNVMGFEVVEVPHLTAGGAGD-TREDAPA--DQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKD 305 (347) T ss_pred ---cccccceeEEEeceeEEEecccccCcccc-ccccccc--cccccccCCcccceeccccceeeeeecchhheeeeeec Confidence 35788999999999999999998754322 1111100 000000000000 000000 0000 0 Q ss_pred -eeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 260 -RWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 260 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) .....++.....+....-...|..+........+ ....+ +. T Consensus 306 ~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i-------------~~~~~--~~ 347 (347) T protein:vir:33 306 LALERARRANYQADQIIAKYAMGHGGLRPEAAGAI-------------VLPKV--SE 347 (347) T ss_pred eeeeeccchhhhhHhhhhhhhcCCceecccceEEE-------------ecCCC--CC Confidence 0111111111111111112222222221111000 00000 00 No 29 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.96 E-value=3.7e-31 Score=187.44 Aligned_cols=267 Identities=15% Similarity=0.170 Sum_probs=191.1 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |||. +++||+|++++++.|++.++|.+++.++ .++.+++|++|+||.+... .+.+.. .++..+.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~--~~l~g~~G~ti~iP~~~~i--gda~~~--~eg~~i~~~~l 74 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADID--STLVGQPGDTLTFPAFVYS--GDATVV--PEGQKIPVDKI 74 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceec--ccccCCCCCEEEeeeecCC--Cccccc--cCCCccCcccc Confidence 9974 4899999999999999999999999887 5778889999999887553 343332 34567889999 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNG 154 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 154 (392) +.++..++|. ++++.|.++|++..+...|++.+++++++++||+++|++++..+....... ......++.|.+ T Consensus 75 t~~~~~a~i~-~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~------~~~~~t~d~i~~ 147 (276) T protein:vir:10 75 ETNRREAKIH-KIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTV------SADIGTLAGLEA 147 (276) T ss_pred ccceeeEEee-hccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cccccCHHHHHH Confidence 9999999995 468999999999999999999999999999999999999999888754332 122345899999 Q ss_pred HHHHhhhccCCCCCEEEEchHHHHHhhcc--cceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccc Q lcl|Aclame:pro 155 ARRALNELYIPQGRVLVVGTAVTEQILND--DRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAF 232 (392) Q Consensus 155 a~~~l~~~~vp~~r~~vv~~~~~~~l~~~--~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~ 232 (392) |...|++++. ..++++++|+.++.|+++ .+|......|.. .+++|.+|.+.|+.|++++.+|..+.+.+++.++ T Consensus 148 A~~~lgd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi 223 (276) T protein:vir:10 148 AIDTFDDEDL-EPMVLFINPKDAGKLRSSASDNFTRATELGDN---IIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAV 223 (276) T ss_pred HHHHhccccC-cccEEEEcHHHHHHHHHhcccccccccccccc---ceeccccceecceeEEEcCCCCcceEEEEeccce Confidence 9999988776 678999999999999764 578888777653 5789999999999999999999999998888877 Q ss_pred cccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 233 IMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 233 ~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) .+..+... .+..+.... ..+.......++................ +. T Consensus 224 ~~~~~~~~~vE~dRd~~~--------------------~~d~i~~~~~y~~~~~~~~~vv~~t~~~------------~~ 271 (276) T protein:vir:10 224 KLITKRDFFLETDRDPST--------------------KTTALYSDKHYVAYLYDESKAVKVTKGA------------GT 271 (276) T ss_pred eeeecCCceeecccchhh--------------------cccEEEEeeEEEEEEEcCcceEEEecCC------------cC Confidence 65544321 111111111 1111111111111111110000000000 00 Q ss_pred ccccc Q lcl|Aclame:pro 312 AGANA 316 (392) Q Consensus 312 ~~~~~ 316 (392) .++.. T Consensus 272 ~~~~~ 276 (276) T protein:vir:10 272 TDSGA 276 (276) T ss_pred CcCCC Confidence 00000 No 30 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.95 E-value=5.1e-31 Score=186.67 Aligned_cols=285 Identities=11% Similarity=0.032 Sum_probs=176.1 Q ss_pred Ccccc---------------------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccc Q lcl|Aclame:pro 1 MANAF---------------------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) Q Consensus 1 Man~~---------------------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~ 59 (392) |||+. +. |+|+.+++..|++..+|.+++++. ++ +.|++++|++.+..++.+|. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~i-e~f~g~V~~~f~~~s~~~~~~~~~---~~--~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SI--ASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHH-HHHHHHHHHHHHHhhhhhhccccc---cc--cccceeEeeeccceeeeeec Confidence 55422 22 788999999999999999998753 44 34999999999999999986 Q ss_pred cccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--- Q lcl|Aclame:pro 60 LRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA--- 136 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~--- 136 (392) +.. +-+...+++...++.|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+.+|+.++..+..+.... T Consensus 75 ~g~---~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~ 151 (347) T protein:vir:15 75 PGE---NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDAS 151 (347) T ss_pred cCC---CCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 532 22234466888999999999999999999999999999999999999999999999999997765331110 Q ss_pred -----------------cccccccc----chhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeecccc Q lcl|Aclame:pro 137 -----------------AGAVHEVA----PDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQ 194 (392) Q Consensus 137 -----------------~~~~~~~~----~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~ 194 (392) ..+..... ....++.+.+|++.|++++|| ++||++|+|++|..|+++++|...++.|. T Consensus 152 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~ 231 (347) T protein:vir:15 152 NENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL 231 (347) T ss_pred cccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccc Confidence 00000111 123477888899999999999 68999999999999999999998887665 Q ss_pred ceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccc-eee--cccceeee-----eee--- Q lcl|Aclame:pro 195 SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRST-AIS--GDQRIAMR-----WLV--- 263 (392) Q Consensus 195 ~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~-~~~--~~~~~~~~-----~~~--- 263 (392) . .+++|.+++++||+|++++++|..........+.. +............ ... ...+..+. .+. T Consensus 232 ~---~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~---g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~ 305 (347) T protein:vir:15 232 I---DHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPA---DQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKD 305 (347) T ss_pred c---cccceEEEEEeceEEEecccccccccccccccccc---cccccccccccceeeeccccceeeeeccceeeeeEeec Confidence 3 47899999999999999999986543221111110 0000000000000 000 00000000 000 Q ss_pred -----ccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 264 -----DYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 264 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) ..+.....+.......+|..+... ...+.+.-..++. T Consensus 306 ~~~e~~~~~~~~~d~i~~~~~~G~~vlrP---------------~~av~~~~~~~~~ 347 (347) T protein:vir:15 306 LALERARRANYQADQIIAKYAMGHGGLRP---------------EAAGAIVLPKVSE 347 (347) T ss_pred eeeeecccchhhhhhhehhhhcCCceecc---------------ccEEEEecCCCCC Confidence 001111111111001111111110 0000000000000 No 31 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.95 E-value=5.7e-30 Score=180.92 Aligned_cols=284 Identities=12% Similarity=0.053 Sum_probs=173.8 Q ss_pred Ccc---------------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccc Q lcl|Aclame:pro 1 MAN---------------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) Q Consensus 1 Man---------------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~ 59 (392) ||| .+|. |+|+.|++..|.+..+|.+++++ .++. .|++++||+.+..++.++. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~---rti~--~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNKHLV---RSIQ--SGKSAQFPVLGRTKAAYLQ 74 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhhhhh---eecc--ccceEEeeeccceeEeeee Confidence 554 2455 99999999999999999999875 2454 4999999999999998876 Q ss_pred cccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-- Q lcl|Aclame:pro 60 LRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA-- 137 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~-- 137 (392) +.. +.....+++...+++|+||+.+|+.+.|+|.|+.+.++|++.++.++++++||+.+|+.++..+..+..... T Consensus 75 ~G~---~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:94 75 PGE---NLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTAN 151 (347) T ss_pred cCc---CCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 432 222344678889999999999999999999999999999999999999999999999999865543211100 Q ss_pred ---------------------cccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccc Q lcl|Aclame:pro 138 ---------------------GAVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQS 195 (392) Q Consensus 138 ---------------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~ 195 (392) .......+...|+.|.+++..|++++|| .+|+++++|++|..|++...+...+..+ T Consensus 152 ~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~-- 229 (347) T protein:vir:94 152 NENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQA-- 229 (347) T ss_pred ccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccc-- Confidence 0001123445688999999999999999 5899999999999999865544444332 Q ss_pred eeeeEeeeeeeeEeeeEEEEecceeecccceeeccc-ccccchhhhccccccccceeecccceeeeeeec---------- Q lcl|Aclame:pro 196 AVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVD---------- 264 (392) Q Consensus 196 ~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 264 (392) ...+++|.++.++||+|++++++|..........+ ...+.............+....+. ....+.. T Consensus 230 -~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~--~~~l~~~~~A~~tv~~~ 306 (347) T protein:vir:94 230 -LIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDN--VVGLFNHRSAVGTVKLK 306 (347) T ss_pred -ccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccc--eEEEEechhhhhhhhhc Confidence 24577899999999999999999875432211111 111111000000000001000000 1111111 Q ss_pred -------cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccccc Q lcl|Aclame:pro 265 -------YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 265 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~ 316 (392) ++.....+........|.... .+...+. +.+... T Consensus 307 ~~~~e~~~~~~~~~~~i~~~~a~G~g~~---------------rPe~a~~---i~~~~a 347 (347) T protein:vir:94 307 DMALERARRANFQADQIIAKYAMGHGGL---------------RPEACGA---LVFKKA 347 (347) T ss_pred ccceeeeechhhhhhhhhhhhhhcCccc---------------ccceeEE---EEecCC Confidence 011110110000000000000 0000000 000000 No 32 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.94 E-value=8.2e-30 Score=180.03 Aligned_cols=286 Identities=12% Similarity=0.074 Sum_probs=174.5 Q ss_pred Ccc---------------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccc Q lcl|Aclame:pro 1 MAN---------------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) Q Consensus 1 Man---------------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~ 59 (392) ||| .+|+ |+|+.+++..|.+..+|.++++.. ++ +.|++++||+.+..++..+. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r---~i--~~G~sv~~~~iG~~~~~~~~ 74 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TI--QNGKSASFPVMGRTKGYYLA 74 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccc---cc--cCcceEEEeeecceeeeeec Confidence 653 3455 999999999999999999998652 44 45999999999999988765 Q ss_pred cccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--- Q lcl|Aclame:pro 60 LRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA--- 136 (392) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~--- 136 (392) +.. +......++.+++++|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+.+|+.++..+..+.... T Consensus 75 ~g~---~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:88 75 PGE---NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) T ss_pred ccc---CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 422 22233457888999999999999999999999999999999999999999999999999987664332110 Q ss_pred ---------------ccccc----cccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccce Q lcl|Aclame:pro 137 ---------------AGAVH----EVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSA 196 (392) Q Consensus 137 ---------------~~~~~----~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~ 196 (392) +.+.. .......++.|.+++++|++++|| ++|+++++|++|..|+++..+...+..+.. T Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~- 230 (347) T protein:vir:88 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI- 230 (347) T ss_pred ccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhcccc- Confidence 00000 011123488899999999999999 589999999999999998888777665432 Q ss_pred eeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhh--ccccccccceeecccceee--------------- Q lcl|Aclame:pro 197 VSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPA--PPMGAVRSTAISGDQRIAM--------------- 259 (392) Q Consensus 197 ~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~--~~~~~~~~~~~~~~~~~~~--------------- 259 (392) .+++|.+++++||+|++++++|...... ++.+......... ........+....+..... T Consensus 231 --~~~~G~vg~i~G~~V~~s~nlp~~~~~~-~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~ 307 (347) T protein:vir:88 231 --DPETGNIRNVMGFEVIEVPHLTVGGAGD-NNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) T ss_pred --chhcceeeeeccceEEEeeccccccccc-ccccccccccccccccccccccccccccCcEEEEEechhhhhheecccc Confidence 4678999999999999999998543321 1111111000000 0000000000000000000 Q ss_pred eeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccccce Q lcl|Aclame:pro 260 RWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~ 317 (392) ..-..++.....+.......+|..+......... .++..- T Consensus 308 ~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~------------------~~~~a~ 347 (347) T protein:vir:88 308 ALERARRPEFQADQIIGKYAMGHGGLRPEAAGAL------------------VFTPAA 347 (347) T ss_pred eeeeeechhhHHHHhhhhhhhcCceeccceEEEE------------------EeCCCC Confidence 0000011111111111111111111110000000 000000 No 33 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.94 E-value=1.1e-29 Score=179.24 Aligned_cols=299 Identities=13% Similarity=0.085 Sum_probs=174.5 Q ss_pred Ccc----------------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecc Q lcl|Aclame:pro 1 MAN----------------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTR 58 (392) Q Consensus 1 Man----------------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~ 58 (392) |++ .+|. |+|+.|++..|.+..+|.++++. .++.+ |++++|++.+..++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~~~~---r~i~~--gks~~~~~iG~~~~~~~ 74 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMV---RSISS--GKSAQFPVLGRTQAAYL 74 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhccccee---eeccc--cceEEEeeecceEEEee Confidence 322 3454 89999999999999999999864 25554 89999999999999888 Q ss_pred ccccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc- Q lcl|Aclame:pro 59 KLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA- 137 (392) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~- 137 (392) .+... -.....++...+..|+||+.+|+.+.|+|.|+.+.++|++.++.++++++||+.+|+.++..+..+..... T Consensus 75 ~~G~~---l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~ 151 (345) T protein:vir:22 75 APGEN---LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESK 151 (345) T ss_pred ecCCC---CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 65322 11223356668888999999999999999999999999999999999999999999999876643211100 Q ss_pred -----------------c-c----cccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeecccc Q lcl|Aclame:pro 138 -----------------G-A----VHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQ 194 (392) Q Consensus 138 -----------------~-~----~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~ 194 (392) . + .....+...|+.|.++++.|++++|| .+||++++|++|..|+.++.|....+.|. T Consensus 152 ~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~ 231 (345) T protein:vir:22 152 YNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAAL 231 (345) T ss_pred ccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccc Confidence 0 0 01112345699999999999999999 58999999999999999999988776654 Q ss_pred ceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeec Q lcl|Aclame:pro 195 SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS 274 (392) Q Consensus 195 ~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 274 (392) . ..++|.+++++||+|++++++|...... ...+. .......+.............. ................. T Consensus 232 ~---~~~~G~V~~i~G~~V~~sn~lp~~~~~~-~~~~~--~~~~~~~~~~~g~~~~~~~~~~-~~~l~~h~~A~~~v~~~ 304 (345) T protein:vir:22 232 I---DPEKGSIRNVMGFEVVEVPHLTAGGAGT-AREGT--TGQKHVFPANKGEGNVKVAKDN-VIGLFMHRSAVGTVKLR 304 (345) T ss_pred c---ccccceEEEEeceEEEecccccccccCc-cccCc--ccccccccccccceeeeeccCc-eEEEEEehhheeeeeee Confidence 3 4578999999999999999998532211 01110 1111111111111110000000 00000000000000000 Q ss_pred ccccceeeeEEEeeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEe Q lcl|Aclame:pro 275 LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVT 333 (392) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~ 333 (392) +. ......... ....... ...+-+..+-....-.. +.+.+. T Consensus 305 --~~--~~e~~r~~~--~~~d~I~------~~~a~G~~vlRPeaa~~------i~~~~~ 345 (345) T protein:vir:22 305 --DL--ALERARRAN--FQADQII------AKYAMGHGGLRPEAAGA------VVFKVE 345 (345) T ss_pred --cc--eeeeeechh--HHHHHHH------HHHhcCCcccccceeEE------EEEeeC Confidence 00 000000000 0000000 00000000000000000 000000 No 34 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.94 E-value=3.2e-30 Score=182.25 Aligned_cols=287 Identities=11% Similarity=0.018 Sum_probs=169.1 Q ss_pred Cccc--------------------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccc Q lcl|Aclame:pro 1 MANA--------------------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKL 60 (392) Q Consensus 1 Man~--------------------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~ 60 (392) |||. ++. |.|..|+...|.+..+|.+++.+. ++ +.|++++||+.+..++.++.+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r---~i--~~G~sv~i~~iG~~tv~~~t~ 74 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR---TI--QNGKSAQFPVMGRTSGVYLAP 74 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cc--cccceEEEecccceeeeeecC Confidence 4431 222 455555556677777888887543 44 459999999999999999865 Q ss_pred ccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--- Q lcl|Aclame:pro 61 RGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA--- 137 (392) Q Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~--- 137 (392) .. .-....+++.+.++.|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+.+|+.++.++........ T Consensus 75 G~---~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~ 151 (347) T protein:vir:94 75 GE---RLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASN 151 (347) T ss_pred CC---CcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 22 222245678899999999999999999999999999999999999999999999999999876643111000 Q ss_pred ---c------------ccc----cccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeecccccee Q lcl|Aclame:pro 138 ---G------------AVH----EVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAV 197 (392) Q Consensus 138 ---~------------~~~----~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~ 197 (392) . ... ...+...++.|.+++..|++++|| .+||++++|++|..|+.+..|......+. T Consensus 152 ~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~--- 228 (347) T protein:vir:94 152 ENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAAL--- 228 (347) T ss_pred cccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhcccc--- Confidence 0 000 011244678899999999999999 58999999999999999888877766554 Q ss_pred eeEeeeeeeeEeeeEEEEecceeecccceeecc-cccccchhhhccccc-cccceeeccc--ceee-------------e Q lcl|Aclame:pro 198 SALQEARLGRIYGYEIVESTLIPHGDAYLYHPT-AFIMATRAPAPPMGA-VRSTAISGDQ--RIAM-------------R 260 (392) Q Consensus 198 ~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~-a~~~a~~~~~~~~~~-~~~~~~~~~~--~~~~-------------~ 260 (392) ..+++|.+++++||+|++|+++|.......... .+....+........ ........+. +..+ . T Consensus 229 ~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~ 308 (347) T protein:vir:94 229 IDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLA 308 (347) T ss_pred ccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhccccc Confidence 246789999999999999999996543221110 001000000000000 0000000000 0000 0 Q ss_pred eeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccccce Q lcl|Aclame:pro 261 WLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~ 317 (392) .-..++.....+....-..+|..+...... +. +...... T Consensus 309 ~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a---------------~~---~~~~~A~ 347 (347) T protein:vir:94 309 LERDRDVDAQGDLIVGKYAMGHGGLRPEAA---------------GA---LVFSPAE 347 (347) T ss_pred ccchhchhhHHHHhhhhhhhcCccccccee---------------EE---EEecCCC Confidence 000011111111111111111111100000 00 0000000 No 35 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.94 E-value=1.8e-28 Score=172.67 Aligned_cols=314 Identities=14% Similarity=0.120 Sum_probs=177.5 Q ss_pred Cc--c--------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MA--N--------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Ma--n--------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |+ | .++. |+|+.+++..|.+..+|.++++. .++.+ |++++|++.+..++.++.+...- T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~~~~---rti~~--Gksv~f~~iG~~t~~~~t~G~~i 82 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDLVTK---RTLKN--GKSLQFIYTGRMTSSFHTPGTPI 82 (375) T ss_pred cCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhccccc---ccccc--CceEEEEeeeeeEEeeecCCcCc Confidence 22 1 3444 89999999999999999998864 25544 89999999999999998753321 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA-------- 136 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~-------- 136 (392) .++ ...++..++++|+||+.+|+.|.|+|.|+.+..+|++.++.++++++||+.+|+.++..+..+.... T Consensus 83 ~~~--~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~ 160 (375) T protein:vir:10 83 LGN--ADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATNF 160 (375) T ss_pred CCc--cccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 111 1236667888899999999999999999999999999999999999999999999998775432110 Q ss_pred -------------ccccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcc---cceeeeeccccceeee Q lcl|Aclame:pro 137 -------------AGAVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILND---DRFIKYESQGQSAVSA 199 (392) Q Consensus 137 -------------~~~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~---~~~~~~~~~G~~~~~a 199 (392) .......++...|+.|.++++.|++++|| .+||++++|++|..|+.+ +.|...+..|+ .. T Consensus 161 ~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~---~~ 237 (375) T protein:vir:10 161 VEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGS---AL 237 (375) T ss_pred cccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccc---ce Confidence 00011234567899999999999999999 589999999999999865 56777666444 24 Q ss_pred EeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeee-eeccccceeeeeccccc Q lcl|Aclame:pro 200 LQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRW-LVDYDSTITSNRSLIDT 278 (392) Q Consensus 200 ~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 278 (392) ..+|.+++++||+|++++++|..+...+...+.... ........ .............. ...|..........++. T Consensus 238 ~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~-~a~~~~~~---~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~ 313 (375) T protein:vir:10 238 QSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGE-TSPGNLGS---HIGPTPENANATGGVNNDYGTNAELGAKSCGL 313 (375) T ss_pred eccceEEEEeceEEEEeccccccccccccccccccc-cchhhhhc---cccccCCcceeeccccccccccccccCceEEE Confidence 567889999999999999999765433221111100 00000000 00000000000000 00000000000001111 Q ss_pred ceeeeEEEeeccccceeeeeccceeeeeeecccccc--------cceeeeeec---cCeeEEEEEeecCcccccceEEE Q lcl|Aclame:pro 279 YFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGA--------NATITAAAG---EDHTVQLKVTDANGDDVTALCDF 346 (392) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~--------~~~~~~~~~---~~~t~~~t~~~~~~~~~~~~vtw 346 (392) +. ...+.. ..... ...+.++.-... .....++.+ ...-..+.. ..++. ..| T Consensus 314 ~~----~~~A~g--~v~~~-----~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~-~~~~~-----~~~ 375 (375) T protein:vir:10 314 IF----QKEAAG--VVEAI-----GPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYI-GATAP-----SAF 375 (375) T ss_pred EE----chhhee--eeeee-----ccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEec-CcCcc-----ccC Confidence 00 000000 00000 000000000000 000000000 000001110 01111 111 No 36 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.94 E-value=1.3e-28 Score=173.38 Aligned_cols=263 Identities=17% Similarity=0.183 Sum_probs=183.8 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||++ +|+||+|++++++.|++.++|.+++.+++ ++.+++|++|+||+... ..+.++ +++..+..++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~--~~~g~~G~tv~iP~~~~~~~a~~v-----~eg~~i~~~~ 73 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDT--TLEGQPGTTLTVPKWDYIGDAEDV-----AEGEAIPMTQ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccc--cccCCCCCEEEEEEecCCCCcccc-----cCCCcccccc Confidence 9974 59999999999999999999999998874 56778899999998653 344443 3455678889 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.+++++++++. ++.|.++|++..++..|++.++.+++++++++++|.+++..+.++... .+....++.|+ T Consensus 74 ~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------~~~~~t~d~i~ 145 (272) T protein:vir:98 74 LGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------VEATATVDGVS 145 (272) T ss_pred cccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------cccccCHHHHH Confidence 9999999999774 678999999999999999999999999999999999999988765432 22334689999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. ..|+++++|+.+..|+++. +|......|. ..+++|.+|++.|++|+.++.+|.++.+.++..+ T Consensus 146 da~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a 221 (272) T protein:vir:98 146 KALDIFNDEDD-AETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGA 221 (272) T ss_pred HHHHHHhccCC-CccEEEEcHHHHHHHHHhccccccccccccc---cccccccchhhcCeeEEEcCCCCcceEEEEcCCe Confidence 99999987764 4689999999999998764 4555544443 3578899999999999999999998888888777 Q ss_pred ccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) +.+..+........ .+.....+.......++..+...... ..++. . T Consensus 222 ~~~~~~~~~~ve~~-------------------r~~~~~~~~i~~~~~~~~~v~~~~~v------v~~t~-----~---- 267 (272) T protein:vir:98 222 LRIMLKRNTMVETD-------------------RDITKAINQIVANKHYGVYLYKAEKA------VKITL-----K---- 267 (272) T ss_pred EEEEecCCceeeec-------------------cccccceeEEEEEEEEEEEEEcCCce------EEEEe-----c---- Confidence 66554332111100 00000011111111111111110000 00000 0 Q ss_pred ccccce Q lcl|Aclame:pro 312 AGANAT 317 (392) Q Consensus 312 ~~~~~~ 317 (392) .+... T Consensus 268 -~a~~~ 272 (272) T protein:vir:98 268 -DAAKK 272 (272) T ss_pred -ccccC Confidence 00000 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.94 E-value=1.3e-28 Score=173.38 Aligned_cols=263 Identities=17% Similarity=0.183 Sum_probs=183.8 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||++ +|+||+|++++++.|++.++|.+++.+++ ++.+++|++|+||+... ..+.++ +++..+..++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~--~~~g~~G~tv~iP~~~~~~~a~~v-----~eg~~i~~~~ 73 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDT--TLEGQPGTTLTVPKWDYIGDAEDV-----AEGEAIPMTQ 73 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccc--cccCCCCCEEEEEEecCCCCcccc-----cCCCcccccc Confidence 9974 59999999999999999999999998874 56778899999998653 344443 3455678889 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) ++.+++++++++. ++.|.++|++..++..|++.++.+++++++++++|.+++..+.++... .+....++.|+ T Consensus 74 ~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------~~~~~t~d~i~ 145 (272) T protein:vir:30 74 LGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------VEATATVDGVS 145 (272) T ss_pred cccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------cccccCHHHHH Confidence 9999999999774 678999999999999999999999999999999999999988765432 22334689999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhccc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +|...|++++. ..|+++++|+.+..|+++. +|......|. ..+++|.+|++.|++|+.++.+|.++.+.++..+ T Consensus 146 da~~~l~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a 221 (272) T protein:vir:30 146 KALDIFNDEDD-AETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGA 221 (272) T ss_pred HHHHHHhccCC-CccEEEEcHHHHHHHHHhccccccccccccc---cccccccchhhcCeeEEEcCCCCcceEEEEcCCe Confidence 99999987764 4689999999999998764 4555544443 3578899999999999999999998888888777 Q ss_pred ccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) +.+..+........ .+.....+.......++..+...... ..++. . T Consensus 222 ~~~~~~~~~~ve~~-------------------r~~~~~~~~i~~~~~~~~~v~~~~~v------v~~t~-----~---- 267 (272) T protein:vir:30 222 LRIMLKRNTMVETD-------------------RDITKAINQIVANKHYGVYLYKAEKA------VKITL-----K---- 267 (272) T ss_pred EEEEecCCceeeec-------------------cccccceeEEEEEEEEEEEEEcCCce------EEEEe-----c---- Confidence 66554332111100 00000011111111111111110000 00000 0 Q ss_pred ccccce Q lcl|Aclame:pro 312 AGANAT 317 (392) Q Consensus 312 ~~~~~~ 317 (392) .+... T Consensus 268 -~a~~~ 272 (272) T protein:vir:30 268 -DAAKK 272 (272) T ss_pred -ccccC Confidence 00000 No 38 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.94 E-value=8e-29 Score=174.61 Aligned_cols=283 Identities=11% Similarity=0.032 Sum_probs=180.8 Q ss_pred Cccc------------------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccc Q lcl|Aclame:pro 1 MANA------------------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRG 62 (392) Q Consensus 1 Man~------------------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~ 62 (392) |+|- +|. |+|+.+++..|.++.+|.+++.+. ++ +.|++++|++.+..++..+.+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~~~~~~~r---~i--~~G~s~~~~~iG~~~~~~~~~-- 72 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKFASWMNVR---SL--RGTNQLRVDRVGASTIAGRKA-- 72 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhhhccceee---ec--cccceEEEeeecceeeeeecC-- Confidence 6642 232 999999999999999999988653 55 459999999999999988754 Q ss_pred ccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---- Q lcl|Aclame:pro 63 AGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAG---- 138 (392) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~---- 138 (392) +.++..+++.+.+++|+||+.+|+.+.|+|.|+.+..+|++.++.++++++||+..|+.++..+..+...... T Consensus 73 ---g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 73 ---GEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred ---CCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 5567888999999999999999999999999999999999999999999999999999988665433221100 Q ss_pred -----------------ccccccchhhHHHHHHHHHHhhhccCC----CCCEEEEchHHHHHhhcccceeeeecccccee Q lcl|Aclame:pro 139 -----------------AVHEVAPDEFFKGVNGARRALNELYIP----QGRVLVVGTAVTEQILNDDRFIKYESQGQSAV 197 (392) Q Consensus 139 -----------------~~~~~~~~~~~~~i~~a~~~l~~~~vp----~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~ 197 (392) ......++..+..+.+|++.|+|+++| .+|+++|+|++|..|+.+++|.+.++.+.... T Consensus 150 ~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~ 229 (334) T protein:vir:80 150 PAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGG 229 (334) T ss_pred ccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccc Confidence 001122344567888999999999999 36999999999999999999998877655444 Q ss_pred eeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceee--------eeeeccccce Q lcl|Aclame:pro 198 SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAM--------RWLVDYDSTI 269 (392) Q Consensus 198 ~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~ 269 (392) ..+..|.+++++||+|++++++|..... .+..+ ...+ ...+.............+. ....+++... T Consensus 230 ~~~~~g~i~~v~G~~V~~Sn~~P~~~~t-~~~~g----~~~~-~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~ 303 (334) T protein:vir:80 230 NSFVGGRIAMLNGVRVVETPRFPQSAIT-ANALG----ADFN-VTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKD 303 (334) T ss_pred ccccceeEEEEeceEEEeecCCCCcccc-ccccc----cccc-cccccccceEEEEEeCceEEEEEEeecceeeeechhh Confidence 5678999999999999999999865322 11110 0000 0000000000000000000 0001111111 Q ss_pred eeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccc Q lcl|Aclame:pro 270 TSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAG 313 (392) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~ 313 (392) ..+....-..+|....... . ...+.++.+.+ T Consensus 304 ~~d~i~~~~a~G~g~lRPe-----------a--a~vv~~~~~~~ 334 (334) T protein:vir:80 304 FGHYLDTFQSYNIGQRRPD-----------A--VAVHDITVTNP 334 (334) T ss_pred HHHHHHHHHHcCCceeccc-----------e--EEEEEEeeecC Confidence 1111100011111111000 0 00011111111 No 39 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=99.92 E-value=3.9e-27 Score=165.39 Aligned_cols=275 Identities=13% Similarity=0.101 Sum_probs=168.3 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) ||.-.+ +++|++++++.|++.+++..|.++.+.+++...-|++|+||+.......||+....+. ...+++....+ T Consensus 1 MA~~n~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~----~~g~~~~~~~t 75 (299) T protein:vir:79 1 MAALNY-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAV----AQRNYDNAWEP 75 (299) T ss_pred Cccchh-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcc----cccccCcceeE Confidence 995334 5999999999999999999998888777765444799999999999999997643222 12356778899 Q ss_pred EEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--cccccccchhhHHHHHHHH Q lcl|Aclame:pro 81 VTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA--GAVHEVAPDEFFKGVNGAR 156 (392) Q Consensus 81 ~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~a~ 156 (392) ++|+|++++.|.|+ |.+++...........+.+.+.++.++|.+.++.+.......+ ......++.+.|+.|.++. T Consensus 76 ~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~ 155 (299) T protein:vir:79 76 KVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLM 155 (299) T ss_pred EEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHH Confidence 99999999999999 5555433333333334445567899999998876654332221 2233456788999999999 Q ss_pred HHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEE--ecceeec----c------ Q lcl|Aclame:pro 157 RALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVE--STLIPHG----D------ 223 (392) Q Consensus 157 ~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~--s~~v~~~----~------ 223 (392) ..|++++|| ++|+++++|+.+..|.++++|.+....+.. ...++|.+|++.||+|++ ++.+... . T Consensus 156 ~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~--~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~ 233 (299) T protein:vir:79 156 EKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDA--GTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGA 233 (299) T ss_pred HHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccc--cceeeeeeeeecceEEEEechhhcCccceeccCccccC Confidence 999999999 589999999999999999999887766543 246799999999999987 2222210 0 Q ss_pred ------cceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccc-cceeeeEEEeeccc Q lcl|Aclame:pro 224 ------AYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLID-TYFGLKVVEDPNGV 291 (392) Q Consensus 224 ------~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 291 (392) -...|+++.....+....-. ..+.....+-++.....| .+..+.. ...+..+....... T Consensus 234 ~ak~in~ii~~~~a~~~~~K~~~~~~----~~P~~~~~~~~~~~~r~y-----~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 234 GAKQIFMSLVHPSAIITPVSYQFSKL----DEPTAVTEGKYFYFEESF-----EDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cccccceEEEcCCeeeeeEeeeeEEe----ecCCCCCccceeeeeeee-----eeeeeeccccCeEEEEeeecCC Confidence 01111111111110000000 000000000000000000 0000000 00111111111000 No 40 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.91 E-value=1e-26 Score=163.13 Aligned_cols=282 Identities=10% Similarity=0.016 Sum_probs=154.3 Q ss_pred eeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHH Q lcl|Aclame:pro 28 LVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFAT 107 (392) Q Consensus 28 ~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~ 107 (392) ++ ..+.+ |++++|++.+..++..+.+... -...++++......|+||+.+|+.+.|+|.|+.+.++|++. T Consensus 1 ~v-----r~i~~--g~s~~~~~iG~~~~~~~~~G~~---l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~ 70 (324) T protein:vir:99 1 MT-----RTITS--GKSAQFPVMGRTKARYLKQGQS---LDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRS 70 (324) T ss_pred Ce-----eeeec--CceEEEeeeeeeEeccccCCCC---cCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchh Confidence 33 34554 8999999999999999865332 11235778889999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccc---------------------cccccccchhhHHHHHHHHHHhhhccCC- Q lcl|Aclame:pro 108 QILPRQVRGVADILEEGVRDMIVGAPYEAA---------------------GAVHEVAPDEFFKGVNGARRALNELYIP- 165 (392) Q Consensus 108 ~~~~~~~~ala~~vd~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp- 165 (392) ++.++++++||+.+|+.++..+........ .......+...++.|.++++.|++++|| T Consensus 71 e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~ 150 (324) T protein:vir:99 71 EYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPA 150 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCC Confidence 999999999999999999877643211000 0001112335689999999999999999 Q ss_pred CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccccc-------ccchh Q lcl|Aclame:pro 166 QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFI-------MATRA 238 (392) Q Consensus 166 ~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~-------~a~~~ 238 (392) .+||++++|++|..|+.+..+......+. ..+++|.+++++||+|++|+++|........ .+.. ..... T Consensus 151 ~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~---~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~-~a~~~~~~~~~~~~~~ 226 (324) T protein:vir:99 151 GDRTFYTDPDTYSAILAALMPNAANYAAL---IDPETGNIRNVMGFEVVETPHMTAQMVTNPT-DAFDGTGHIFPATGDS 226 (324) T ss_pred CCCEEEeChHHHHHHhhcccccccccccc---cceecceEEEEeceEEEecCCcccccccccc-cccccccccccccccc Confidence 68999999999998876666655555433 3578999999999999999999975433211 1110 00000 Q ss_pred hhccccccccceeecccceee-------------eeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeee Q lcl|Aclame:pro 239 PAPPMGAVRSTAISGDQRIAM-------------RWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGS 305 (392) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (392) +.......... ...+..+ .....++.....+....-..+|.............-....+ T Consensus 227 ~~~~ky~~d~~---~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~----- 298 (324) T protein:vir:99 227 TTTGKMTVGAD---NVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGET----- 298 (324) T ss_pred ccccccccccC---ceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCcc----- Confidence 00000000000 0000000 00001111111111111111111111110000000000000 Q ss_pred eeecccccccceeeeeeccCeeEEEEEee Q lcl|Aclame:pro 306 IEVAPEAGANATITAAAGEDHTVQLKVTD 334 (392) Q Consensus 306 v~v~~~~~~~~~~~~~~~~~~t~~~t~~~ 334 (392) +.+.+........-...-+..++.+- T Consensus 299 ---~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (324) T protein:vir:99 299 ---PAVAPDVITGVASFAAPASTRAKSSA 324 (324) T ss_pred ---ccccchhhhhhccccCcccceeeecC Confidence 00111111100000000000111100 No 41 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=99.90 E-value=7.4e-26 Score=158.36 Aligned_cols=288 Identities=11% Similarity=-0.012 Sum_probs=179.7 Q ss_pred CccccccHHHHHHHHHHHHHHh-hcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNE-LILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~-l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) -+|++.-.+++...+-+.|... +..+.++|++|+. . .|++|+||+.....+.||+... ...+++++.... T Consensus 36 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~--~--~g~tVkIp~i~~~gl~DY~R~~-----g~~~g~vt~~~~ 106 (329) T protein:vir:10 36 EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIF--M--QGRSFTVIKGDVTELKDYKRNA-----TNEFDHPQIQET 106 (329) T ss_pred CCchhHHHHHHHHHHHHHHHhhceeeeeecccceee--c--cCcEEEEeeecccccccccCCC-----Ccccccccccee Confidence 5677777789998888888765 5567788999853 3 4899999999999999997532 356678889999 Q ss_pred EEEEEeeeecceEeeHHHHhhhccCh--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESF--ATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~--~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++|+|++++.|.|++.|..+....+ .....+++...++.++|.+.++.+..... .......++.+.|+.|.+++. T Consensus 107 t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~--~~~~~~~t~~nay~~i~~a~~ 184 (329) T protein:vir:10 107 TYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKA--KHLTVGSGADAQYDAVLDVSV 184 (329) T ss_pred EEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcc--cccccccCHHHHHHHHHHHHH Confidence 99999999999999999988876654 34455677889999999999988865432 233445677889999999999 Q ss_pred HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce--eecccceeeccccccc Q lcl|Aclame:pro 158 ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI--PHGDAYLYHPTAFIMA 235 (392) Q Consensus 158 ~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v--~~~~~~~~~~~a~~~a 235 (392) .|+++++|++|+++++|+++..|.++++|........ ..+++|.+|++.||+|+..+.. ........|+++.... T Consensus 185 ~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~ 261 (329) T protein:vir:10 185 ELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQ---QVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEVMASP 261 (329) T ss_pred HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeeeeecCeEEEEecCCcccceeEEEEcCCceeee Confidence 9999999999999999999999999988876543332 4568999999999999976432 2223355566655544 Q ss_pred chhhhcccccc-ccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccc Q lcl|Aclame:pro 236 TRAPAPPMGAV-RSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGA 314 (392) Q Consensus 236 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~ 314 (392) .+......... ... .+..+... .+.+..+..... .++........ +...-... T Consensus 262 ~K~~~~~~~~p~~~~-----~a~~v~gr---------------~yyd~~V~~~k~-~~I~~~~~~a~-----~~~~~~~~ 315 (329) T protein:vir:10 262 IQANEAKLNSNVPGM-----FGTLAEQM---------------LYTGAFVPEHLQ-KYIFTIGGKEV-----ETNRDGVD 315 (329) T ss_pred eeeeeeeeeCCCCcc-----chheeeee---------------eeeeeEEEcccc-CEEEEecccCc-----ccCCCCCC Confidence 44332211110 000 00010000 011111111100 00000000000 00000000 Q ss_pred cceeeeeeccCeeEEE Q lcl|Aclame:pro 315 NATITAAAGEDHTVQL 330 (392) Q Consensus 315 ~~~~~~~~~~~~t~~~ 330 (392) ..+.. .+...+..+ T Consensus 316 ~~~~~--~~~~~~~~~ 329 (329) T protein:vir:10 316 AHADE--TNASADTGA 329 (329) T ss_pred ccccc--cccccccCC Confidence 00000 000000000 No 42 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.90 E-value=2.2e-25 Score=155.79 Aligned_cols=286 Identities=12% Similarity=0.074 Sum_probs=177.3 Q ss_pred Ccc----------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Man----------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |+| ++|. |+|+.|++..|.+..+|.+++... ++ +.|++++||+.+..++..+.+ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r---ti--~~g~s~~~~~iG~~~~~~~~p---- 70 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR---DL--RGSNVVRLDRLGNVEAKGRRA---- 70 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhcccccee---ee--ccceeEEEeeeeeeeeecccC---- Confidence 664 3444 999999999999999999887653 45 459999999999999988754 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------- Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA------- 137 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~------- 137 (392) +.++..+++...+..|+||+.++..+.|+|.|+.+.++|++.++.++++++||+..|+.++..+..+..... T Consensus 71 -G~~l~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~ 149 (335) T protein:vir:63 71 -GEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred -CcCcCCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCC Confidence 445666777888999999999999999999999999999999999999999999999999865544332111 Q ss_pred ------------cccccccchhhHHHHHHHHHHhhhccCCC----CCEEEEchHHHHHhhcccceeeeeccccceeeeEe Q lcl|Aclame:pro 138 ------------GAVHEVAPDEFFKGVNGARRALNELYIPQ----GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) Q Consensus 138 ------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~----~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~ 201 (392) .......++..+..+.++..+|++++||+ +|+++++|++|..|+.+++|.+.++...+....+. T Consensus 150 ~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~ 229 (335) T protein:vir:63 150 FSPGVLEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYV 229 (335) T ss_pred cCCCcceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccccccc Confidence 00011133445677889999999999994 49999999999999999999887665444434567 Q ss_pred eeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceee--------eeeeccccceeeee Q lcl|Aclame:pro 202 EARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAM--------RWLVDYDSTITSNR 273 (392) Q Consensus 202 ~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~ 273 (392) +|.+++++||.|++++++|..... .|.-+.. .....+.............+. ..-.+++.....+. T Consensus 230 ~g~v~~v~Gv~V~~sn~lP~~~~t-~~~lg~a-----~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~ 303 (335) T protein:vir:63 230 KSRVAILNGVKVLETPRFATKAIA-AHPLGRH-----FNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWV 303 (335) T ss_pred CceeEEeeceEEEeeccCCCCCcc-ccccccc-----CCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHH Confidence 899999999999999999864322 1111000 000000000000000000000 00000111110000 Q ss_pred cccccceeeeEEEeeccccceeeeeccceee--eeeecccccccceeee Q lcl|Aclame:pro 274 SLIDTYFGLKVVEDPNGVGFVRARKIHLIPG--SIEVAPEAGANATITA 320 (392) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~v~v~~~~~~~~~~~~ 320 (392) .......|.. ..-+.. .+..+++.. ...+. T Consensus 304 i~~~~a~G~g---------------~lRPe~a~~i~~tg~~~--~~~~~ 335 (335) T protein:vir:63 304 LDTFQMYNIG---------------ARRPDTAGAIELKGIGA--FDITA 335 (335) T ss_pred hHHHHHcCCc---------------ccccceEEEEEEcCCCc--eeecC Confidence 0000000000 000000 000111100 00000 No 43 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.90 E-value=1e-25 Score=157.55 Aligned_cols=288 Identities=11% Similarity=0.059 Sum_probs=176.7 Q ss_pred Ccc----------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Man----------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |+| .+|. |+|+.|++..|.+..+|.+++.+. ++ +.|++++||+.+...+....+ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r---ti--~~g~s~~~~~iG~~~~~~~~p---- 70 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR---DL--RGSNVVRLDRLGNVEAKGRRA---- 70 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhcccccee---ee--ccceeEEEeeeeeeeeccccc---- Confidence 664 3454 999999999999999999988653 55 459999999999988876643 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------- Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA------- 137 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~------- 137 (392) +..+..+.+......|+||+.++..+.|+|.|+.+.++|++.++.++++++||+..|+.++..+..+..... T Consensus 71 -G~~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~ 149 (335) T protein:vir:78 71 -GEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred -CcccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCC Confidence 555667788889999999999999999999999999999999999999999999999998865544332111 Q ss_pred ------------cccccccchhhHHHHHHHHHHhhhccCCC----CCEEEEchHHHHHhhcccceeeeeccccceeeeEe Q lcl|Aclame:pro 138 ------------GAVHEVAPDEFFKGVNGARRALNELYIPQ----GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) Q Consensus 138 ------------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~----~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~ 201 (392) .......+....+.+.++...|++.++|+ +|+++++|++|..|+.+++|.+.++........+. T Consensus 150 ~~~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~ 229 (335) T protein:vir:78 150 FSPGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYV 229 (335) T ss_pred cCCCcceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccc Confidence 00011123345677888899999999994 59999999999999999999887665444334578 Q ss_pred eeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceee--------eeeeccccceeeee Q lcl|Aclame:pro 202 EARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAM--------RWLVDYDSTITSNR 273 (392) Q Consensus 202 ~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~ 273 (392) +|.+++++||.|+.++++|..... .|.-... . ....+.........-...+. ..-.+++.....+. T Consensus 230 ~g~v~~v~Gv~V~~Sn~lP~~~~t-~~~lg~a----~-n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~ 303 (335) T protein:vir:78 230 KSRVAILNGVKVLETPRFATKAIS-AHPLGRH----F-NVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWV 303 (335) T ss_pred cceeEEeeceEEEeeccCCCCCCc-ccccccc----C-CcccccccceEEEEEecceEEEEEEEecccceeeccchhhHh Confidence 899999999999999999865321 1110000 0 00000000000000000000 00000111000000 Q ss_pred cccccceeeeEEEeeccccceeeeeccceeeeeeecccccccceeee Q lcl|Aclame:pro 274 SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITA 320 (392) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~ 320 (392) .......|.. ..-+...+.+.-..+.....+. T Consensus 304 i~~~~a~G~g---------------~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 304 LDTFQMYNIG---------------ARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred hhHHHHcCCc---------------ccCcceEEEEEecCCCcccccC Confidence 0000000000 0000000000000000000000 No 44 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.89 E-value=4.1e-25 Score=154.27 Aligned_cols=316 Identities=9% Similarity=-0.009 Sum_probs=176.6 Q ss_pred Ccc----------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Man----------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |++ .++ -|++..|++..|.+..+|..++.. .++. -|++++||+.+..++..+.+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~-le~f~geV~taf~~~s~~~~~~~~---rti~--~gkS~q~~~iG~~~~~~~~~---- 70 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLL-IEKFNNRVHEQYLKGENLLQWFDV---QEVV--GTNSVSNKYIGETELQVLSP---- 70 (364) T ss_pred CCCcccccccccccccchhhhh-hhhhhhhHHHHHHHHHhhcCccee---eeec--ccceEEeeeeeeeEEeeecc---- Confidence 653 122 389999999999988888887754 3554 58999999999999988764 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccC-hHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQILPRQVRGVADILEEGVRDMIVGAPYEAA------ 137 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~-~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~------ 137 (392) +..+..+++...+..|+||+.+|+.+.|+|.|+.+..+| ++.++.++++++||+..|+.++.++..+..... T Consensus 71 -G~~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~ 149 (364) T protein:vir:10 71 -GKSPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKN 149 (364) T ss_pred -CcccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccC Confidence 334566788889999999999999999999999999999 899999999999999999999877753321000 Q ss_pred ----c-----------ccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEe Q lcl|Aclame:pro 138 ----G-----------AVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) Q Consensus 138 ----~-----------~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~ 201 (392) . ....+.+...++.|.++.+.|+|++|| .+|+++++|++|..|+++++|...++..+. ...+. T Consensus 150 ~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~-~~~~~ 228 (364) T protein:vir:10 150 PRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAA-SDNTV 228 (364) T ss_pred CcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccC-CCccc Confidence 0 000112234567788999999999999 589999999999999999999877653222 23467 Q ss_pred eeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeec--cc--- Q lcl|Aclame:pro 202 EARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LI--- 276 (392) Q Consensus 202 ~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--- 276 (392) +|.++.++||.|++|+++|............+ .......+....+...++........+..+........ .. T Consensus 229 ~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t---~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~ 305 (364) T protein:vir:10 229 DGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTK---HHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIF 305 (364) T ss_pred cceeEEEeceEEEecccccccccccccccccc---ccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeee Confidence 99999999999999999986433211000000 00000000000000001000000111110000000000 00 Q ss_pred -ccceeeeEEEeeccccceeeeeccceeeeeeec---ccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCc Q lcl|Aclame:pro 277 -DTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA---PEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATD 352 (392) Q Consensus 277 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~---~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~ 352 (392) +.....+.+.....-+. ...-+...+.+. ...+......+-... ..++++.-+ T Consensus 306 ~~~~~~~~~ida~~a~G~----g~lRPeaa~~i~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~--- 362 (364) T protein:vir:10 306 YEKKEKTWYIDTFLAEGA----IPDRWEAVAVVTAADTAELATDHNAILARA----------------NRKVTLTKS--- 362 (364) T ss_pred eccceeeeeeeeehcccC----cccCccceEEEEecCCCCCccchhhhhhhc----------------cccEEEEEe--- Confidence 00000111110000000 000000000000 000000000000000 112222211 Q ss_pred eEEEC Q lcl|Aclame:pro 353 KATVA 357 (392) Q Consensus 353 VAtVd 357 (392) |+ T Consensus 363 ---~~ 364 (364) T protein:vir:10 363 ---VN 364 (364) T ss_pred ---cC Confidence 11 No 45 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=99.89 E-value=3.6e-25 Score=154.63 Aligned_cols=290 Identities=12% Similarity=-0.008 Sum_probs=177.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhccc-ceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILT-NLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~-~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) =+|++.-.|.|+..+.+.+...+.-. .++|++|+.. .|++|+||+.......||+.. ....+++++.+.. T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~----gg~tVkIp~i~~~gl~DY~R~-----~g~~~g~vt~~~~ 95 (319) T protein:vir:94 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM----EGRSFTVMKGDTTELKDYKRN-----ATNEFDHPKIEET 95 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEec----cCcEEEEeeecccccccccCC-----CCcccCCccccee Confidence 44566567789887555555444332 4578888553 389999999999999999753 2356678899999 Q ss_pred EEEEEeeeecceEeeHHHHhhhccCh--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESF--ATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~--~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++|+|++++.|.|++.|..+....+ .....+++...++..+|.+.++.+..... .....+.++.+.|+.|.++.. T Consensus 96 t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~--~~~~~~~t~~n~y~~i~~a~~ 173 (319) T protein:vir:94 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA--KHLTVGTGSDAQYDAVLDVSV 173 (319) T ss_pred EEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc--cccccccCHHHHHHHHHHHHH Confidence 99999999999999999988876655 34456677888999999999987765432 233445677889999999999 Q ss_pred HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce--eecccceeeccccccc Q lcl|Aclame:pro 158 ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI--PHGDAYLYHPTAFIMA 235 (392) Q Consensus 158 ~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v--~~~~~~~~~~~a~~~a 235 (392) .|++++||++|+++++|+.+..|.++++|.+....++ ..+++|.+|++.||+|+..+.. ........|+.+.... T Consensus 174 ~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~ 250 (319) T protein:vir:94 174 ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP 250 (319) T ss_pred HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEecccccccceEEEEcCCeeeee Confidence 9999999999999999999999999999988776654 4568999999999999875332 2233455566665544 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) .+.......... ....+..+....-++...... ...+..+................ .-.... T Consensus 251 ~k~~~~~~~~p~----~~~~a~~v~gr~y~d~~V~~~-----k~~~Iy~~~~~~~~~~~~~~~~~---------~~~~~~ 312 (319) T protein:vir:94 251 IQADLAKTNSNI----PGMFGTLAEQLLYTGAFVPEH-----LQKYIFTIGGTEVATKRDGVDAH---------ADNVAK 312 (319) T ss_pred eeeeeeeccCCC----ccccceeeeeeeeeeeEEecc-----ccceEEEeecCCcccCCCccccc---------cccccC Confidence 443222211100 000011111000000000000 00011111111000000000000 000000 Q ss_pred ceeeeee Q lcl|Aclame:pro 316 ATITAAA 322 (392) Q Consensus 316 ~~~~~~~ 322 (392) .+..+.+ T Consensus 313 ~~~~~~~ 319 (319) T protein:vir:94 313 PSGSLEM 319 (319) T ss_pred CcccccC Confidence 0111111 No 46 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=99.89 E-value=3.6e-25 Score=154.63 Aligned_cols=290 Identities=12% Similarity=-0.008 Sum_probs=177.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhccc-ceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILT-NLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~-~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) =+|++.-.|.|+..+.+.+...+.-. .++|++|+.. .|++|+||+.......||+.. ....+++++.+.. T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~----gg~tVkIp~i~~~gl~DY~R~-----~g~~~g~vt~~~~ 95 (319) T protein:vir:97 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM----EGRSFTVMKGDTTELKDYKRN-----ATNEFDHPKIEET 95 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEec----cCcEEEEeeecccccccccCC-----CCcccCCccccee Confidence 44566567789887555555444332 4578888553 389999999999999999753 2356678899999 Q ss_pred EEEEEeeeecceEeeHHHHhhhccCh--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESF--ATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~--~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) +++|+|++++.|.|++.|..+....+ .....+++...++..+|.+.++.+..... .....+.++.+.|+.|.++.. T Consensus 96 t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~--~~~~~~~t~~n~y~~i~~a~~ 173 (319) T protein:vir:97 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA--KHLTVGTGSDAQYDAVLDVSV 173 (319) T ss_pred EEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcc--cccccccCHHHHHHHHHHHHH Confidence 99999999999999999988876655 34456677888999999999987765432 233445677889999999999 Q ss_pred HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce--eecccceeeccccccc Q lcl|Aclame:pro 158 ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI--PHGDAYLYHPTAFIMA 235 (392) Q Consensus 158 ~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v--~~~~~~~~~~~a~~~a 235 (392) .|++++||++|+++++|+.+..|.++++|.+....++ ..+++|.+|++.||+|+..+.. ........|+.+.... T Consensus 174 ~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~ 250 (319) T protein:vir:97 174 ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP 250 (319) T ss_pred HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccc---cceeeeeceeecCeEEEEecccccccceEEEEcCCeeeee Confidence 9999999999999999999999999999988776654 4568999999999999875332 2233455566665544 Q ss_pred chhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) .+.......... ....+..+....-++...... ...+..+................ .-.... T Consensus 251 ~k~~~~~~~~p~----~~~~a~~v~gr~y~d~~V~~~-----k~~~Iy~~~~~~~~~~~~~~~~~---------~~~~~~ 312 (319) T protein:vir:97 251 IQADLAKTNSNI----PGMFGTLAEQLLYTGAFVPEH-----LQKYIFTIGGTEVATKRDGVDAH---------ADNVAK 312 (319) T ss_pred eeeeeeeccCCC----ccccceeeeeeeeeeeEEecc-----ccceEEEeecCCcccCCCccccc---------cccccC Confidence 443222211100 000011111000000000000 00011111111000000000000 000000 Q ss_pred ceeeeee Q lcl|Aclame:pro 316 ATITAAA 322 (392) Q Consensus 316 ~~~~~~~ 322 (392) .+..+.+ T Consensus 313 ~~~~~~~ 319 (319) T protein:vir:97 313 PSGSLEM 319 (319) T ss_pred CcccccC Confidence 0111111 No 47 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.89 E-value=3.8e-25 Score=154.46 Aligned_cols=264 Identities=13% Similarity=0.087 Sum_probs=178.4 Q ss_pred Ccccc----ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MANAF----SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man~~----~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) ||.+- ++||+|++++.+.+.+.++|.+++..| .++.+++|++|+||.+. .+.+.+... ++..+.++.++. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d--~~L~g~~G~ti~~P~~~--~igdae~~~--eg~~i~~~~lt~ 74 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTD--DTLVGQPGDTITRPKYA--YIGAAEDLQ--EGVAMDTTQMSM 74 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccc--cccCCCCCCEEEeeeec--CCCcccccc--CCCccchhhccc Confidence 99755 599999999999999999999999887 46788999999998865 344554433 355688899999 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHH Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGAR 156 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 156 (392) ++...+|.+ ..++|.++|++......|++.++.+|++.++|+++|+++++.++++.... +....++.|.+|. T Consensus 75 ~~~~a~i~~-~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~-------~~~~t~~~~~dA~ 146 (270) T protein:vir:95 75 TTTKVTVKE-TGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA-------TVSADATGILDAI 146 (270) T ss_pred chheeeeeh-hhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------ccccCHHHHHHHH Confidence 999999944 58999999999999999999999999999999999999999988764332 2334578999999 Q ss_pred HHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEec-ceeecccceeeccccccc Q lcl|Aclame:pro 157 RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVEST-LIPHGDAYLYHPTAFIMA 235 (392) Q Consensus 157 ~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~-~v~~~~~~~~~~~a~~~a 235 (392) ..|.++.- ...+++++|..++.|.++.. ....+.|. ..+++|.+|.+.|+.|+.+. ..+..+.+.++..++.+. T Consensus 147 ~~lgd~~~-~~~~i~vhs~~~~~Lrk~~~-~~~~~~~~---~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~~gAi~~~ 221 (270) T protein:vir:95 147 EVFNSEND-EDYVLYVNPKDYNKLVKSLF-KVGGNVQD---RAISKGDLVEIVGVSDIVKSKRVSENTAFLQRYGAMEIV 221 (270) T ss_pred HHhccccC-CCcEEEEcHHHHHHHHhhhc-cccccccc---chhcccccceecceeEEEeCCCCCceeEEEEeccceeee Confidence 99976542 45689999999999988763 33333333 35789999999999986654 445556666666665544 Q ss_pred chhhhc-cccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccc Q lcl|Aclame:pro 236 TRAPAP-PMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGA 314 (392) Q Consensus 236 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~ 314 (392) .+.... +..+ +.....+.......++.......... .++..+ . T Consensus 222 ~~~~~~vEtdR--------------------d~~~~~d~i~~~~~y~v~~~~~skvv------~~t~~~----------a 265 (270) T protein:vir:95 222 NKKKPEAYTDF--------------------DILKRTHLLSTNYHYSVNLKDETGVV------KVTFKP----------S 265 (270) T ss_pred ecCCceeeecc--------------------chhhcccEEEeeeEEEEEEEccceEE------EEEecC----------C Confidence 433211 1111 11111111111111222111111000 000000 0 Q ss_pred cceeeeee Q lcl|Aclame:pro 315 NATITAAA 322 (392) Q Consensus 315 ~~~~~~~~ 322 (392) . +..+ T Consensus 266 ~---~~~~ 270 (270) T protein:vir:95 266 G---SLEM 270 (270) T ss_pred C---CcCC Confidence 0 0000 No 48 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.89 E-value=6.9e-25 Score=153.06 Aligned_cols=273 Identities=14% Similarity=0.036 Sum_probs=170.4 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) ||.+++ ++|++++++.|++.+++..+.+++++. .+ |++|+||+.......||+...+ ....+++....+ T Consensus 1 Main~a--~~~~~~Ld~~~~~~~~t~~l~~~~~~~--~g--gktVkI~~i~~~gl~DY~R~~g-----~~~g~v~~~~et 69 (290) T protein:vir:78 1 MAINYV--DKYGKELDQKLVFGTYTNELETPNLLW--LD--AKTFKIQTITTTGLKAHTRNKG-----YNEGSASNTNKS 69 (290) T ss_pred CchhHH--HHHHHHHHHHHHhhheeeeccccceee--cc--CCEEEEeeeccCcccccccCCC-----cccCccccceee Confidence 998885 799999999999999999999988744 34 8999999999999999976332 334466778889 Q ss_pred EEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 81 VTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE-AAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 81 ~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~a~~ 157 (392) ++|+|.+++.|.|+ |.|++.....+.....+++.+.++.++|.+.++.+...... ......+.++.+.|+.|.++.. T Consensus 70 ~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~ 149 (290) T protein:vir:78 70 YTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIR 149 (290) T ss_pred EEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHH Confidence 99999999999999 88888777777777777888899999999988866543322 2233345678899999999999 Q ss_pred HhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce-eecccceeeccc--cc Q lcl|Aclame:pro 158 ALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI-PHGDAYLYHPTA--FI 233 (392) Q Consensus 158 ~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v-~~~~~~~~~~~a--~~ 233 (392) .|++ +| ++|+++++|+.+..|..+++|.+....+..... ..+|.++++.||++++...- --.+.+.+.... .+ T Consensus 150 ~lde--vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~-~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~ 226 (290) T protein:vir:78 150 KVKK--YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPS-SIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAA 226 (290) T ss_pred HHHh--cCCCCeEEEECHHHHHHHhhChhhhccccccccccc-cccceeeeecCcEEEEecccchhhhhhhhcccccccC Confidence 9987 67 689999999999999999999887666543333 34899999999999874321 000111111000 00 Q ss_pred ccc---------hhhhccccccccceeecccc-eeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeec Q lcl|Aclame:pro 234 MAT---------RAPAPPMGAVRSTAISGDQR-IAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKI 299 (392) Q Consensus 234 ~a~---------~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 299 (392) .+. .....+.............. .... +..... .+...++.. ....++...... T Consensus 227 ~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~--------~~~~~r---~y~d~~v~~-nk~~~i~~~~~~ 290 (290) T protein:vir:78 227 GAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDG--------WLYQYR---VYHDIFVLD-QQKDGVIASTEV 290 (290) T ss_pred CccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcce--------eeeeee---eeeeeeeec-cccCeeEEEeeC Confidence 000 00000000000000000000 0000 000000 001111111 111111110000 No 49 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.86 E-value=2.2e-23 Score=144.85 Aligned_cols=348 Identities=9% Similarity=-0.044 Sum_probs=183.7 Q ss_pred Ccc--cc-------------ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccC Q lcl|Aclame:pro 1 MAN--AF-------------SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGA 65 (392) Q Consensus 1 Man--~~-------------~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~ 65 (392) |++ .+ |=-|+++.|++..|.+..+|.+++.. .++. .|++++|++.+..++..+.+. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v---rti~--~GkS~qf~~iG~~~a~y~~~G---- 71 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV---QTVT--GTNTVSNKYLGETELQVLAPG---- 71 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee---eeec--ccceEEEEEEeeeEEeeeccc---- Confidence 653 11 11389999999999988888887754 3554 589999999999999877543 Q ss_pred CCccccccccCceEEEEEEeeeecceEeeHHHHhhhccC-hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--c----- Q lcl|Aclame:pro 66 ERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQILPRQVRGVADILEEGVRDMIVGAPYEA--A----- 137 (392) Q Consensus 66 ~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~-~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~--~----- 137 (392) ..+..+++...+..|+||+-+|..+.|.|.|+.+.++| ++.++.++++++||+..|+.++.+++.+.... . T Consensus 72 -~~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~ 150 (402) T protein:vir:97 72 -QSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) T ss_pred -cccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 33556678888999999999999999999999999999 89999999999999999999988775432100 0 Q ss_pred -------ccc-------ccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEee Q lcl|Aclame:pro 138 -------GAV-------HEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQE 202 (392) Q Consensus 138 -------~~~-------~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~ 202 (392) ... ..+.+...++.|.++...|+|.+|| .+|+++++|++|..|+++++|.+..+.... ...+.. T Consensus 151 ~~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~-~g~~~~ 229 (402) T protein:vir:97 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQ-SGATIN 229 (402) T ss_pred cccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhcccc-CCcccc Confidence 000 0133345678889999999999999 589999999999999999999877653222 234679 Q ss_pred eeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeec-c-c------ceeeeeeeccccceeeeec Q lcl|Aclame:pro 203 ARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISG-D-Q------RIAMRWLVDYDSTITSNRS 274 (392) Q Consensus 203 g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~-~------~~~~~~~~~~~~~~~~~~~ 274 (392) |.++.++|+.|++++++|.......+........+......+......... . . ........+++.....+.. T Consensus 230 G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~i 309 (402) T protein:vir:97 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYI 309 (402) T ss_pred ceeEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHH Confidence 999999999999999999643211111100000000000000000000000 0 0 0000011112221111111 Q ss_pred ccccceeeeEEEeeccccc--ee--eeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcC Q lcl|Aclame:pro 275 LIDTYFGLKVVEDPNGVGF--VR--ARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSA 350 (392) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~--~~--~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn 350 (392) .....+|............ .. ..........-..+-..-.+...++...++..+.....|..-.+ . T Consensus 310 d~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~ 379 (402) T protein:vir:97 310 DTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQA----------E 379 (402) T ss_pred HHHHHhCCcccCccceEEEEEecccccccCCccccchhhhhcccccceEEEeccccchhccccccccch----------H Confidence 1111111100000000000 00 00000000000000000111111111111111111111110000 0 Q ss_pred CceEEE----CCCceEEEEecce Q lcl|Aclame:pro 351 TDKATV----AAGGLVTGVAAGT 369 (392) Q Consensus 351 ~~VAtV----d~~G~VTa~~~Gt 369 (392) .-||-| -.+=+-|+.++-+ T Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~~ 402 (402) T protein:vir:97 380 DLVAAVRAVMANDIKPTAMKPTE 402 (402) T ss_pred HHHHHHHHHHhccccccccCCCC Confidence 000000 0111223444333 No 50 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.84 E-value=2.1e-23 Score=144.95 Aligned_cols=349 Identities=11% Similarity=0.010 Sum_probs=198.3 Q ss_pred Ccc----------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Man----------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |++ .++ -|+|..+++..|.+..+|..++.. ..+ +.|+++++|+.+..++..+.++ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~-Le~f~GeV~taF~~~si~~~~~~v---Rti--~~gkS~qf~~~G~s~~~~~~pG--- 71 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENIMSYFDV---QTV--TGTNTVSNKYLGETELQVLAPG--- 71 (401) T ss_pred CCCCccccccccccccchhHhH-HhHhcchHHHHHHHHhhhccccee---eee--cccceEEEEEeeeeEeeeecCC--- Confidence 663 223 389999999999888888777643 245 4589999999999999888653 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccC-hHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------c Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQILPRQVRGVADILEEGVRDMIVGAPYE-------A 136 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~-~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~-------~ 136 (392) ..+..+++...+..|+||.-++..+.|.|.|+.++++| ++.++.++++++||+..|+.++.+++.+... . T Consensus 72 --~~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~ 149 (401) T protein:vir:70 72 --QSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTN 149 (401) T ss_pred --CCcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 34566788888999999999999999999999999999 8999999999999999999998887533210 0 Q ss_pred c--c------------ccccccchhhHHHHHHHHHHhhhccCCCCCEEEE-chHHHHHhhcccceeeeeccccceeeeEe Q lcl|Aclame:pro 137 A--G------------AVHEVAPDEFFKGVNGARRALNELYIPQGRVLVV-GTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) Q Consensus 137 ~--~------------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv-~~~~~~~l~~~~~~~~~~~~G~~~~~a~~ 201 (392) . . ......+......|.+++..|+|.+||.+|++++ +|.+|..|+..+++.+.++..... ..+. T Consensus 150 p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~-g~~~ 228 (401) T protein:vir:70 150 PRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQS-GATI 228 (401) T ss_pred CCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccC-Cccc Confidence 0 0 0011233446778999999999999998787766 666676777767777766543332 3467 Q ss_pred eeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceee---cc-----cceeeeeeeccccceeeee Q lcl|Aclame:pro 202 EARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAIS---GD-----QRIAMRWLVDYDSTITSNR 273 (392) Q Consensus 202 ~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~~~~~~~~ 273 (392) +|.+..++||.|++++++|.......+........+....+.+........ .. .........+++.....+. T Consensus 229 ~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~ 308 (401) T protein:vir:70 229 QGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYY 308 (401) T ss_pred cceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHH Confidence 899999999999999999964322111111000000000000000000000 00 0000000111222211111 Q ss_pred cccccceeeeEEEeeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCce Q lcl|Aclame:pro 274 SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDK 353 (392) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~V 353 (392) ......+|....... ..+.. ........-.+.+...+.....+.-+... .+.+.+.. ..+.+.|++|.+.+ T Consensus 309 id~~~a~g~g~~RPe-aa~vv---~~k~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~~~~ 379 (401) T protein:vir:70 309 IDTFMAEGAIPDRWE-AVSVV---TTKRNTTTGAVEGTDGAQHTIVKNRAQRK--AVYVKNAA---PVAAAAASLSAEDL 379 (401) T ss_pred HHHHHHhCCcccchh-heEEE---eecCcccccccccCCcchhhhhhhhccce--eEEecccc---chhhhccccchHHH Confidence 111111111110000 00000 00000000012233334444333333332 23333333 24679999999875 Q ss_pred E-EE----CCCceEEEEecceE Q lcl|Aclame:pro 354 A-TV----AAGGLVTGVAAGTS 370 (392) Q Consensus 354 A-tV----d~~G~VTa~~~Gta 370 (392) . -| -.+=+-|++++-+- T Consensus 380 ~~~~~~~~~~~~~~~~~~~~~~ 401 (401) T protein:vir:70 380 VAAVRAVMANDIKPTALKPTEE 401 (401) T ss_pred HHHHHHHHhccccccccCcCCC Confidence 3 22 12234455555322 No 51 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.82 E-value=6.1e-22 Score=136.90 Aligned_cols=284 Identities=11% Similarity=-0.010 Sum_probs=156.4 Q ss_pred Ccccc---ccHHHHHHHHHHHHHHhhc-ccceeeecccccccCCCCCeEEEEeccceeeeccccccc-----cCCCcccc Q lcl|Aclame:pro 1 MANAF---SKPTAVVDTAIQMLQNELI-LTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGA-----GAERNLTV 71 (392) Q Consensus 1 Man~~---~~~~~~~~~~~~~l~~~l~-~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~-----~~~~~~~~ 71 (392) |+-++ |+ +.|++++...++++-. |-+.|... . ..+.+++++.+....+.. ++.... ....+..+ T Consensus 13 Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~--~--~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~d~~~dtp~ 85 (322) T protein:vir:10 13 IAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHK--N--ESSESHNWETLASMDPDA--VKRKRSRQQSADGTYPTPV 85 (322) T ss_pred eechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccc--c--ccccccceeecccccccc--cccccccccccCcccCCCc Confidence 77554 44 7788877666665443 55554311 2 234567777755433211 111000 01113334 Q ss_pred ccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc---------- Q lcl|Aclame:pro 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVH---------- 141 (392) Q Consensus 72 ~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~---------- 141 (392) .+...+...+.+.+ +|+++.|+|.|+.+...|+.+.+.++++++|+++.|+.++..+.+.......++. T Consensus 86 ~~~~~~~r~~~~~d-~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~ 164 (322) T protein:vir:10 86 NNKPFAKRRTNVDT-YDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIG 164 (322) T ss_pred cccccceEEEeecc-cccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccc Confidence 45566777776655 5889999999999999999999999999999999999998766544322211111 Q ss_pred cccchhhHHHHHHHHHHhhhccCCC-C-CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecce Q lcl|Aclame:pro 142 EVAPDEFFKGVNGARRALNELYIPQ-G-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI 219 (392) Q Consensus 142 ~~~~~~~~~~i~~a~~~l~~~~vp~-~-r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v 219 (392) ..+...+++.+++|++.|++++||+ + ||++++|+++..|+.+++|...++.|.. ...++|.+|+++||.|+.++++ T Consensus 165 ~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~--~l~~~G~ig~~lGf~~i~s~~l 242 (322) T protein:vir:10 165 DGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAM--DLQSKGIITNWMGYTWIVSTRL 242 (322) T ss_pred cCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccch--hhhhcCeeeeeeeEEEEEeccC Confidence 1122456899999999999999994 4 9999999999999999999999998754 2236799999999999999999 Q ss_pred eecccceeecccccccchhhhccccccccceeecccceee----eeeecccccee-eeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 220 PHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAM----RWLVDYDSTIT-SNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 220 ~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) |........ .+......... ....-+....-+... ..-.+...... ...+......|...........+ T Consensus 243 p~~~~t~~~-~~~~~~~~~~~----~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i- 316 (322) T protein:vir:10 243 DKFDPTQWG-MAAEDGPQGDE----IWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFKL- 316 (322) T ss_pred Ccccccccc-ccccCCCCccc----eeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEEE- Confidence 965432111 11000000000 000000000000000 00000101000 00011111111111111100000 Q ss_pred eeeeccceeeeeeeccccccccee Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATI 318 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~ 318 (392) .. ..++ T Consensus 317 -----------------~~-~e~~ 322 (322) T protein:vir:10 317 -----------------RL-KNSL 322 (322) T ss_pred -----------------EE-eccC Confidence 00 0000 No 52 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.81 E-value=6.5e-22 Score=136.74 Aligned_cols=227 Identities=14% Similarity=0.107 Sum_probs=148.9 Q ss_pred cccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHH Q lcl|Aclame:pro 34 IGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQ 113 (392) Q Consensus 34 ~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~ 113 (392) +.- -..||||+||.. ++|.+... ++..+.++.++.++.+.+|.+ .+++|+|+|++....++|++.+..+|+ T Consensus 1 ~~~--~~~Gdtit~P~~----iGda~~v~--eG~~i~~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~ 71 (231) T protein:vir:73 1 ENG--INLANLCEYPND----IGDAADVA--EGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQL 71 (231) T ss_pred Ccc--ccCCceEEeccc----ccchhhhc--CCCcCChhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHH Confidence 222 246999999854 45555444 456688899999999999944 689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccc Q lcl|Aclame:pro 114 VRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQG 193 (392) Q Consensus 114 ~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G 193 (392) +.+||+++|.+++..+..+.... .....++.|.+|...|++... ..++++++|..++.|+++..+......+ T Consensus 72 ~~~iA~kvD~di~~~~~~a~l~~-------~~~~t~d~i~~A~~~fgde~~-~~~vivv~p~~~~~Lrk~~~~~~~~~~~ 143 (231) T protein:vir:73 72 GLSLANKVDDDLLKAAKTTSQTV-------STKANVDGVQAALDIFNDEDA-QAYVLIVNPKDAAKIRKDANAKNIGSEV 143 (231) T ss_pred HHHHHHhhhHHHHHhhccccccc-------cccccHHHHHHHHHHhccccc-cceEEEEcchHHHhhhhccchhhhhhhh Confidence 99999999999999887654332 223568999999999987754 5689999999999999877665543222 Q ss_pred cceeeeEeeeeeeeEeeeEEEEecceeeccccee----ecccccccchhhhccccccccceeecccceeeeeeeccccce Q lcl|Aclame:pro 194 QSAVSALQEARLGRIYGYEIVESTLIPHGDAYLY----HPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI 269 (392) Q Consensus 194 ~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~----~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 269 (392) ....+++|.+|++.|+.|+.|+.+|.++.... .+.+..+..+.... .-...+... T Consensus 144 --g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~-------------------vEtdRd~~~ 202 (231) T protein:vir:73 144 --GANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQ-------------------VETDRDIVT 202 (231) T ss_pred --ccceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccce-------------------eeccccccc Confidence 22567999999999999999999997665432 22332222221110 001112222 Q ss_pred eeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 270 TSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) ..+.......++.......... .++..++ T Consensus 203 k~~~i~~~~~y~v~l~~~~~vv-------------~~t~~g~ 231 (231) T protein:vir:73 203 KTTVITADEHYAAYLYDLTKVV-------------NITFTGV 231 (231) T ss_pred cccEEEEeEEEEEEEEcCccEE-------------EEEeecC Confidence 2222222222222222111100 0011111 No 53 >protein:vir:118 Length: 449 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690641;swissprot:sw:q37888;genbank:gi:22855155;interpro:IPR003343;uniprot:Q37888;genbank:GeneID:955370 Probab=99.79 E-value=8.6e-20 Score=125.12 Aligned_cols=345 Identities=12% Similarity=0.064 Sum_probs=155.8 Q ss_pred Ccccccc----HHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MANAFSK----PTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man~~~~----~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |+.+.+. ...|-+-.|...+..|-.-+|. .|..+ .-..|+++.=..-......-|++... +-.+-...++.- T Consensus 52 ~~~~~~~n~~~~sl~~ri~~~~~~~~~~~NPL~--~F~~~-~~~~g~~i~~~~~d~~~~~~~~~~~~-e~~~f~~~~p~i 127 (449) T protein:vir:11 52 LVNQTVQNEFLTSLVDRIGLVIVKSISLRNPLA--KFKKG-ALPMGRTIEEIFTDITKEKLYDAEEA-EQKVFEREIPNV 127 (449) T ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhcChhH--HHhcC-Cccccceeeeheecccceeeechhhh-cccccccCCCce Confidence 5544322 3344444444444433222331 12111 12457777665555555555554222 333444444444 Q ss_pred ceEEEEEEeeeecceEeeHHHHh--hhccChHHHHHHHHHHHHHH--HHHHHHH-H-HH----hccccccccccccccch Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELT--FDLESFATQILPRQVRGVAD--ILEEGVR-D-MI----VGAPYEAAGAVHEVAPD 146 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~--~~~~~~~~~~~~~~~~ala~--~vd~~~~-~-~~----~~~~~~~~~~~~~~~~~ 146 (392) ...-.+.+++.++-+.|++..+. .....-.++++.+.+.+|.+ .+|++.. . ++ ...-.+......-.+.. T Consensus 128 ~a~~h~~~r~~~~~~ti~~~~~~~af~s~~~~~~~~~~~~~~~~~s~~~~ey~~~~~l~~~~~~~~~~~~~~i~d~~t~~ 207 (449) T protein:vir:11 128 KTLFHERNRQSFYHQTIQDDSLKTAFISWGNFESFIASIINAIYNSAEVDEYEYMKLIIDNYYSKGLFKVVKVDDPMTST 207 (449) T ss_pred eEEEeeccccceeeEeeeHHHHHhhhcChhHHHHHHHHHHHHHhccCchHHHHHHHHHHHHhhccCceEEeeCCccccch Confidence 55666777777788888876543 33333346777777777665 3444332 1 11 11111111111112333 Q ss_pred hhHHHHHHHHHH-hhhccCCC-----------------CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeE Q lcl|Aclame:pro 147 EFFKGVNGARRA-LNELYIPQ-----------------GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRI 208 (392) Q Consensus 147 ~~~~~i~~a~~~-l~~~~vp~-----------------~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~ 208 (392) ..++.+++..+. -.+...|. +.++++.|+.+..+-.+ -|.++.. .+.. ++ .+....+ T Consensus 208 ~~~~~~~k~~~~~~~~m~~P~~t~~~N~~~v~~~ad~~dl~~i~~~d~~~~ld~t-~ls~afN-~tav-Da--~~~~tvV 282 (449) T protein:vir:11 208 GALTNFIKKARATALKMTLPQGTRDYNAMAVRTRSDIRDVHLFIDADLNAELDVD-VLAKAFN-MDRT-TF--LGNVTVI 282 (449) T ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCCCceeeccccCccceEEEEccCcceecccc-cchhhhc-ccee-ee--eeeeeec Confidence 456666554332 23555562 33455555554433211 0111110 0000 00 0010011 Q ss_pred eeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeeccccccee--eeEEE Q lcl|Aclame:pro 209 YGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFG--LKVVE 286 (392) Q Consensus 209 ~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 286 (392) .+| +............. .... .............+.+.. ..... T Consensus 283 ddf--------Ast~~~a~~~sk~~-----------------------~~~~---d~~~~~~~~~~~~G~y~n~~~tvt~ 328 (449) T protein:vir:11 283 DGF--------ASTGLKAVMVDKDW-----------------------FMVY---DTLQKMETIRNPRGLYWNYYYHVWQ 328 (449) T ss_pred Ccc--------CCccceeeeeccce-----------------------eEEe---eeeeEEEEEEcCcceeeccceEEEE Confidence 110 00000000000000 0000 000000000000000000 00000 Q ss_pred eeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCc-eEEECCCceEEEE Q lcl|Aclame:pro 287 DPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATD-KATVAAGGLVTGV 365 (392) Q Consensus 287 ~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~-VAtVd~~G~VTa~ 365 (392) ................+..+.++++.+++...++..|...++++++ .|.++.++.|+|+||+++ +|+||++|+|||+ T Consensus 329 t~~~~~~~~~~a~~~~~~~~~VTsVsVtPss~tL~~G~T~qLTATV--~psnatnk~VTWSsSd~s~~ATVda~G~VTAv 406 (449) T protein:vir:11 329 VLSASRFANAVAFVTGDDVPAVTQVIVSPAIASVKQGKSQAFTAYV--RATDDKEHEVVWSVDGGSTGTSISSDGVLTVA 406 (449) T ss_pred EEecccccceeeeeeeeccceeeEEEeeccceeeecCceEEEEEEE--ecCCCCCceEEEEEeCCceEEEEcCCceEEEe Confidence 0001111111112222333456677777777777666665555555 455666789999988876 6999999999999 Q ss_pred ecceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 366 AAGTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 366 ~~GtatITat~~~~~g~~tat~~VtVv 392 (392) ++|+++|||++. +++.+++|.|+|. T Consensus 407 a~GTAtITAta~--~~s~TaT~tvtV~ 431 (449) T protein:vir:11 407 ANETNQLTVKAT--VDIGTADEPKPVV 431 (449) T ss_pred cCccEEEEEEEe--cCcEEEEEEeeec Confidence 999999999975 4567888888887 No 54 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.77 E-value=1.8e-21 Score=134.35 Aligned_cols=204 Identities=16% Similarity=0.114 Sum_probs=122.5 Q ss_pred EEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------------cccccccchhhH Q lcl|Aclame:pro 83 LTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA-------------GAVHEVAPDEFF 149 (392) Q Consensus 83 i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~ 149 (392) ||...+..+.|+|.|+.+.++|++.++.++++++||+.+|+.++.++..+..... ....+..+...| T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 8999999999999999999999999999999999999999999988875432111 112234455678 Q ss_pred HHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhc--ccceeeeeccccceeeeEeee-eeeeEeeeEEEEecceeecccc Q lcl|Aclame:pro 150 KGVNGARRALNELYIP-QGRVLVVGTAVTEQILN--DDRFIKYESQGQSAVSALQEA-RLGRIYGYEIVESTLIPHGDAY 225 (392) Q Consensus 150 ~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~--~~~~~~~~~~G~~~~~a~~~g-~ig~~~g~~v~~s~~v~~~~~~ 225 (392) +.|++++++|+|++|| .|||++++|++|..|++ ++.+.+.+..++... +++| .+++++||+|++|+++|...+. T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~--~~~g~~i~~v~G~~V~~SnnlP~~~gt 158 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGD--MNTGKGLYVNAGIRIYKSNVLASLYGT 158 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeeccccccc--ccccceeeeecCcEEEEeccCCccccc Confidence 9999999999999999 68999999998888886 466666666655432 5667 5999999999999999976555 Q ss_pred eeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeee Q lcl|Aclame:pro 226 LYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGS 305 (392) Q Consensus 226 ~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (392) ..+..+.................+. ....+..... +++ ++...++.. T Consensus 159 ~~~~~ag~~~~~~~~~~~yr~~fs~-------~~glv~~~~A--------vgt------------------vkl~~~~~~ 205 (221) T protein:vir:17 159 NLVTDPGDATTSGENNGSYRPAITD-------RAGLVFHKEA--------ADT------------------VEVLLPPSR 205 (221) T ss_pred ccccCCccccccccccccccccccc-------eEEEEEcchh--------eee------------------eeeecCCCC Confidence 4443332221111111111100000 0000000000 000 000000000 Q ss_pred eeecccccccceeeeeeccCe Q lcl|Aclame:pro 306 IEVAPEAGANATITAAAGEDH 326 (392) Q Consensus 306 v~v~~~~~~~~~~~~~~~~~~ 326 (392) . +..++-.++. -.... T Consensus 206 ~---~~~~~~~~~~--~~~~~ 221 (221) T protein:vir:17 206 P---PLVISMFSIR--RPDRR 221 (221) T ss_pred C---ceeeeeeecc--CCCCC Confidence 0 0000000000 00000 No 55 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.75 E-value=3.9e-20 Score=126.96 Aligned_cols=316 Identities=11% Similarity=0.045 Sum_probs=158.3 Q ss_pred CccccccHHHHHHHHHHHHHHhhcc-cceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccccCce Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELIL-TNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFTEDS 78 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~-~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) |+.++ .++|.+++.+.|...++. ..+.+....+++..--|++|+||+.. .....||+..... ....+++... T Consensus 1 Mainy--a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~----~~~g~v~~~~ 74 (346) T protein:vir:10 1 MTINY--AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTIT----TPVANYSNDW 74 (346) T ss_pred Ccchh--HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCc----ccccccccce Confidence 99888 469999999999887543 44443333333332237999999985 3457888643321 1124667788 Q ss_pred EEEEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cccccccccchhhHHHHH Q lcl|Aclame:pro 79 FPVTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---AAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 79 ~~~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~i~ 153 (392) .+++|++.+++.|.|+ |.+++.....+..-..+.+...++.++|.+.++.+...... ........++.+.|+.|. T Consensus 75 et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~ 154 (346) T protein:vir:10 75 DSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFD 154 (346) T ss_pred eEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHH Confidence 9999999999999999 77765433333333333444567889999987765432211 222334467888999999 Q ss_pred HHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEe--cceee--------- Q lcl|Aclame:pro 154 GARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVES--TLIPH--------- 221 (392) Q Consensus 154 ~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s--~~v~~--------- 221 (392) ++...|+++++| ++|+++++|+.+..|..+++|.+....++.. ..+|.++++.||+|+.. +.+.. T Consensus 155 ~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~---~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~ 231 (346) T protein:vir:10 155 NMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPN---NIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSK 231 (346) T ss_pred HHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheecccccccc---ccceeeeeecCeEEEEcchhhcccchhhccCcc Confidence 999999999999 6899999999999888899998877665432 35899999999999862 22210 Q ss_pred -cc------cceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccc-cceeeeEEEeeccccc Q lcl|Aclame:pro 222 -GD------AYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLID-TYFGLKVVEDPNGVGF 293 (392) Q Consensus 222 -~~------~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 293 (392) .+ -...++++.....+....-.. .......+.+......|. +..+.. ...+..+.......+. T Consensus 232 ~~t~ak~INfiiv~~~A~ia~~K~~~~~if----~P~~~~~g~~l~~~R~Y~-----D~fv~~nk~~~Iyv~~~~a~~~~ 302 (346) T protein:vir:10 232 IIDTAKQIEMFLIYNGVQIAPEKYSFVGFD----QPSAATSGNYLYYEQSYD-----DVLLLNTKTKGIQFVVSDKPKKD 302 (346) T ss_pred ccCCccceeEEEECCceeeeeeeeeeeEee----CCCCCcccceeeeeeeee-----eeeeeccccceEEEeeecccccC Confidence 00 011112111111110000000 000000110000000000 000000 0011111111100000 Q ss_pred eeeeeccc-eeeeeeecccc--cccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEEEC Q lcl|Aclame:pro 294 VRARKIHL-IPGSIEVAPEA--GANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKATVA 357 (392) Q Consensus 294 ~~~~~~~~-~~~~v~v~~~~--~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAtVd 357 (392) .....-.. +...-++..+. +....+-.+ +.|.+ ++-.|-|- T Consensus 303 ~~~~~~~~kpt~~~~~~~~~~~~~~~~~~~~---~~~~~--------------------~~~~~~~~ 346 (346) T protein:vir:10 303 QEQSGQDAKPTAESTLEEIKAYLDKNHIDYT---GKTKK--------------------DELLALVK 346 (346) T ss_pred ccCcccccCcccccchHHHHHHhcccccccc---cccch--------------------hhHHhhcC Confidence 00000000 00000000000 000000000 00000 00000000 No 56 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.75 E-value=2.3e-19 Score=122.79 Aligned_cols=290 Identities=11% Similarity=-0.025 Sum_probs=157.7 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) |||+|--.++|.+++.+.+...+.+..|-...-.=++.+ |++|+||+.......||+...... . ...+++....+ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~g--gktVkIp~i~~~gl~DY~R~~g~~-~--~~g~v~~~~et 75 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEG--GKEVKIGKLSTDGLGDYSRGSANA-Y--VGGDVKFEYET 75 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEec--CcEEEEEeeecccccccccccCCc-c--cccccccccee Confidence 999997789999999999999998877742221123544 789999999988889987533211 0 11256678889 Q ss_pred EEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----cccccccchhhHHHHH Q lcl|Aclame:pro 81 VTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA-----GAVHEVAPDEFFKGVN 153 (392) Q Consensus 81 ~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~i~ 153 (392) .+|++++++.|.|+ |.|++.....+..-..+.+...++.++|++.++.+........ ....+.+..+.|+.|. T Consensus 76 ~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~ 155 (312) T protein:vir:10 76 KTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIK 155 (312) T ss_pred EEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHH Confidence 99999999999999 8888765555555555566778899999998876654332211 1223457788999999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc-- Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA-- 231 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a-- 231 (392) ++...|+++++|.+|+++++|..+..|.++..+...... ......++.++.+.|+++++-..---.+.+.+.-.. T Consensus 156 ~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~---~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~ 232 (312) T protein:vir:10 156 TGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVT---FAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTS 232 (312) T ss_pred HHHHHHHHccCCCceEEEeChHHHHHHhhhhhceecccc---cccceeeeeeeeecccEEEEchhhhccceeeeccCccc Confidence 999999999999999999999998666554333322222 223345889999999999863211111111111000 Q ss_pred ------ccccchhhhccccccccceeecccceeeeeeeccccce-------eeeecccccceeeeEEEeeccccceeeee Q lcl|Aclame:pro 232 ------FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI-------TSNRSLIDTYFGLKVVEDPNGVGFVRARK 298 (392) Q Consensus 232 ------~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (392) +..+........-.............. .+....+.. ..+... +...++... ...++....+ T Consensus 233 ~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~--~~~if~P~~~~~~d~~~~~~R~---Y~D~fv~~n-k~~~Iyv~~k 306 (312) T protein:vir:10 233 NQTAGGYLKGTKALDTNFIIAPVDVPLAITKQD--KMRIFDPETNQTANAWSMDYRR---YHDLWVTDN-KANSVYANFK 306 (312) T ss_pred ccccCceeecCcccccceEEeCCceeeceeeee--eeeeeCCCCCCCcceeeeeeee---eeeeeeecc-ccCeEEEEee Confidence 000000000000000000000000000 000000000 000000 000001000 0000000000 Q ss_pred ccceeeeeeecccc Q lcl|Aclame:pro 299 IHLIPGSIEVAPEA 312 (392) Q Consensus 299 ~~~~~~~v~v~~~~ 312 (392) ... ++. T Consensus 307 ------~a~--~~~ 312 (312) T protein:vir:10 307 ------DAK--PVG 312 (312) T ss_pred ------ccc--CCC Confidence 000 000 No 57 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.71 E-value=8.4e-19 Score=119.69 Aligned_cols=270 Identities=11% Similarity=0.032 Sum_probs=154.2 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) ||+++ .++|.+.+.+.|...+....+.......++...-|.+|+||+.. .....||+...+ ....+++.... T Consensus 1 Main~--~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g-----~~~g~v~~~~e 73 (285) T protein:vir:79 1 MTVVL--DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQD-----NARKTISVGKE 73 (285) T ss_pred Ccchh--hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccC-----ccccccceeee Confidence 99998 57999999999999888776654432233332237899999986 356888865332 34456777889 Q ss_pred EEEEEeeeecceEee--HHHHhh--hccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLT--DEELTF--DLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGA 155 (392) Q Consensus 80 ~~~i~~~~~~~~~i~--d~~~~~--~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a 155 (392) +.+|++++++.|.|+ |.|+.. .+...+.++ +...++.++|++.++.+.... ......+.+..+.|+.|.++ T Consensus 74 t~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef---~~~~vvPEiDayrfskla~~a--~~~~~~~~T~~nv~~~i~~~ 148 (285) T protein:vir:79 74 TVKLTHEDWFGYDLDQFDMDENGAYTVENVVREH---NKMITIPHRDKVAVQKLFDSA--AKKATDSITKDNALDAYDTA 148 (285) T ss_pred EEEeeccccceecccccchhhhhhhhHHHHHHHH---HhhhhcchhhHHHHHHHHhhc--ccccccccCHHHHHHHHHHH Confidence 999999999999999 555432 222333332 344678899999888766443 23334456788899999999 Q ss_pred HHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEee-eEEEEe--cceeecc------cce Q lcl|Aclame:pro 156 RRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYG-YEIVES--TLIPHGD------AYL 226 (392) Q Consensus 156 ~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g-~~v~~s--~~v~~~~------~~~ 226 (392) ...|+++++|++|+++++|+.+..|.++++|.+......+....-.++.++.+.| +++... ..+...+ -.. T Consensus 149 ~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infii 228 (285) T protein:vir:79 149 EAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFIL 228 (285) T ss_pred HHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEE Confidence 9999999999999999999999999989888876654332212223556788887 666652 2222111 122 Q ss_pred eecccccccchhhhccccccccceeecccceeeeeeeccccceeeeeccccc-ceeeeEEEeecc Q lcl|Aclame:pro 227 YHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT-YFGLKVVEDPNG 290 (392) Q Consensus 227 ~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 290 (392) .++++.....+.... .-..+.....+. .|...+ ..-.+..+... ..+..+...+.. T Consensus 229 v~~~a~i~~~K~~~~----~~f~P~~~~~~d--~~~~~~--R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 229 TPLSAIAPIVKYDSV----SVIDPSTDRSGN--RWTIKG--LSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred ecCceeccceeeeee----EeECCCCCCCcc--eeeeee--eeeeeeeehhhccceeeeeecccC Confidence 222221111111000 000000000000 000000 00000000000 011111110000 No 58 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.65 E-value=1e-17 Score=113.76 Aligned_cols=329 Identities=10% Similarity=-0.011 Sum_probs=169.1 Q ss_pred Ccc----------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeecccccccc Q lcl|Aclame:pro 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) Q Consensus 1 Man----------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~ 64 (392) |++ .++ -|+|..|++..|.+..+|..++.. ..+. .|+++++++.+...+..+.+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~-Le~f~GeV~taF~~~si~~~~~~v---RtI~--~gkS~qf~~lG~s~a~y~~p---- 70 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENIMSYFDV---QTVT--GTNTVSNKYLGETELQVLAP---- 70 (400) T ss_pred CCCCccccccccccccchhhhH-HhHhcchHHHHHHHHhhhccccee---eeec--ccceEEEEEeeeeEEeeecC---- Confidence 653 233 389999999999888888777643 2454 58999999999999888765 Q ss_pred CCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccC-hHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Q lcl|Aclame:pro 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQILPRQVRGVADILEEGVRDMIVGAPYE-------- 135 (392) Q Consensus 65 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~-~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~-------- 135 (392) +..+..+++...+..|+||.-.+....|.|.|+.++++| ++.++.++++++||+..|+.++.++..+... T Consensus 71 -G~~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~ 149 (400) T protein:vir:10 71 -GQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTN 149 (400) T ss_pred -CCCcCCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 333556677888999999999999999999999999999 8999999999999999999998776443210 Q ss_pred --cc-----c------ccccccchhhHHHHHHHHHHhhhccCCCCCEEE-EchHHHHHhhcccceeeeeccccceeeeEe Q lcl|Aclame:pro 136 --AA-----G------AVHEVAPDEFFKGVNGARRALNELYIPQGRVLV-VGTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) Q Consensus 136 --~~-----~------~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~v-v~~~~~~~l~~~~~~~~~~~~G~~~~~a~~ 201 (392) +. . ....+.+......|.++...|+|.+||.+|+++ ++|++|..|+..+++.+.++..... .... T Consensus 150 ~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~-g~~~ 228 (400) T protein:vir:10 150 PRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQS-GATI 228 (400) T ss_pred CCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCC-Cccc Confidence 00 0 000012223455788899999999999766655 5666777777767777666543322 2356 Q ss_pred eeeeeeEeeeEEEEecceeecccc-eeeccccc-ccchhhhc--cccccccce---eec-ccceeeeeeeccccceeeee Q lcl|Aclame:pro 202 EARLGRIYGYEIVESTLIPHGDAY-LYHPTAFI-MATRAPAP--PMGAVRSTA---ISG-DQRIAMRWLVDYDSTITSNR 273 (392) Q Consensus 202 ~g~ig~~~g~~v~~s~~v~~~~~~-~~~~~a~~-~a~~~~~~--~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~ 273 (392) .|.+..++|+.|++++++|..... ..|..... .....+.. ........+ ..+ ..........+++.....+. T Consensus 229 ~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~ 308 (400) T protein:vir:10 229 QGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYY 308 (400) T ss_pred cceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHH Confidence 888999999999999999863211 11111000 00000000 000000000 000 00000111122222222211 Q ss_pred cccccceee--------eEEEeeccccceee--------eecc-ceeeeeeecccccccceeeeeeccCeeE-----EE- Q lcl|Aclame:pro 274 SLIDTYFGL--------KVVEDPNGVGFVRA--------RKIH-LIPGSIEVAPEAGANATITAAAGEDHTV-----QL- 330 (392) Q Consensus 274 ~~~~~~~~~--------~~~~~~~~~~~~~~--------~~~~-~~~~~v~v~~~~~~~~~~~~~~~~~~t~-----~~- 330 (392) ......+|. .+............ .... ..-+.+-+....+.-.....++....-+ .+ T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 388 (400) T protein:vir:10 309 IDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAAPAGAFAAASLSAEDLVAAVRAVMA 388 (400) T ss_pred HHHHHHhCCcccchhheEEEEecCCcccccccCcchhHHHHHhhcccceEEEecccccccccccccchHHHHHHHHHHHh Confidence 111111110 00000000000000 0000 0000000000000000000000000000 00 Q ss_pred -EEeecCcccccc Q lcl|Aclame:pro 331 -KVTDANGDDVTA 342 (392) Q Consensus 331 -t~~~~~~~~~~~ 342 (392) ...|.....+ . T Consensus 389 ~~~~~~~~~~~-~ 400 (400) T protein:vir:10 389 NDIKPTAMKPT-E 400 (400) T ss_pred ccccccccCCC-C Confidence 0000000000 0 No 59 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.54 E-value=1e-15 Score=102.74 Aligned_cols=277 Identities=13% Similarity=0.001 Sum_probs=152.4 Q ss_pred Cc---ccccc--HHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MA---NAFSK--PTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Ma---n~~~~--~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) |+ |++-. .++|.+++.+.|.+.+....+.+.++.- +.| |.+|+||+.......||+...+ -...+++ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~-~~G--ak~VkIp~i~~~gl~dY~R~~g-----~~~g~v~ 72 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDL-VNG--GRSFTLKTISTSGLKDHTRGKG-----FNSGTIS 72 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchhe-eec--CCEEEEEeeeeccccccccccC-----cccccee Confidence 43 44322 7899999999999999888888877532 233 7899999999999999986443 2235667 Q ss_pred CceEEEEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------cccc Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA-------------AGAV 140 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~-------------~~~~ 140 (392) ....+.+|++++++.|.|+ |.|+......+..-..+.+....+.++|+..++.+....... .... T Consensus 73 ~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~ 152 (311) T protein:vir:99 73 DEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTE 152 (311) T ss_pred eeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccc Confidence 7889999999999999999 777654444333333444445678899998887665332211 1222 Q ss_pred ccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEe-cc Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVES-TL 218 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s-~~ 218 (392) ...+.++.|+.|..+...|++ +| ++|+++++|+.+..|...+.|.+.....+..... .++.++.+.|++++.. .. T Consensus 153 ~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~-i~~~V~~lDgv~Ii~V~ps 229 (311) T protein:vir:99 153 ETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTA-LESRITSIDGVQLIEVYES 229 (311) T ss_pred cccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccc-cccccceecCeEEEEecCc Confidence 345667789999999999987 56 6899999999999887777887655444333233 4677899999987754 11 Q ss_pred eeecccceeecccc--cccchh---------hhccccccccceeecccc-eeeeeeeccccceeeeecccccceeeeEEE Q lcl|Aclame:pro 219 IPHGDAYLYHPTAF--IMATRA---------PAPPMGAVRSTAISGDQR-IAMRWLVDYDSTITSNRSLIDTYFGLKVVE 286 (392) Q Consensus 219 v~~~~~~~~~~~a~--~~a~~~---------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (392) ---.+.+.+..... ..+... ...+.............. .... +...... +...++.. T Consensus 230 ~r~~t~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~--------~l~~~R~---Y~D~fv~~ 298 (311) T protein:vir:99 230 NRFMTKYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDG--------YLYQNRL---YHDLFIKK 298 (311) T ss_pred hhhcchhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcce--------eeeeeee---eeeeeeec Confidence 10011111111000 000000 000000000000000000 0000 0000000 00000000 Q ss_pred eeccccceeeeecc Q lcl|Aclame:pro 287 DPNGVGFVRARKIH 300 (392) Q Consensus 287 ~~~~~~~~~~~~~~ 300 (392) . ...++....+.. T Consensus 299 n-k~~~Iyv~~k~A 311 (311) T protein:vir:99 299 H-KRDGIFVSVKKA 311 (311) T ss_pred c-ccCeEEEeeecC Confidence 0 000000000000 No 60 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.46 E-value=5.4e-15 Score=98.80 Aligned_cols=324 Identities=13% Similarity=0.013 Sum_probs=163.2 Q ss_pred Ccccc----ccHHHHHHHHHHHHHHhhcccc---eeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANAF----SKPTAVVDTAIQMLQNELILTN---LVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~~----~~~~~~~~~~~~~l~~~l~~~~---~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||.+. |+||++++.+.+.+.+.+.|.. ++.+...+.+...+|+++++|....+. .+.+ ...++..+.+.. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~-Gd~~--~~~~~~~i~~~k 77 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLT-GDPD--NWTDSDDIDVNN 77 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCC-Cccc--ccCCCcccchhe Confidence 99654 8999999999999988888733 343322222222489999999876542 1222 223456688889 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---------ccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA---------GAVHEVA 144 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~---------~~~~~~~ 144 (392) +..++...++ ++..++|.++|+....+..|++.++.+|.+...++++++.+++.++++-.... ....+.. T Consensus 78 itt~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~ 156 (351) T protein:vir:15 78 LTSGKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSE 156 (351) T ss_pred ecccceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccc Confidence 9888888888 55678999999999999999999999999999999999999998865311100 0011122 Q ss_pred chhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc- Q lcl|Aclame:pro 145 PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD- 223 (392) Q Consensus 145 ~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~- 223 (392) ....++.|.+|...|.+..-..-..++++|..+..|.++. +.......+ .+..++.+.|+.|+.+..+|... T Consensus 157 ~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~-li~~~~~s~------~~~~i~t~~G~~VivdD~~p~~~~ 229 (351) T protein:vir:15 157 PMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQG-LIETIQPQN------GATPFEAYNGLRIVLDDDIEIDLT 229 (351) T ss_pred cccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhh-hhhhccccc------cCcccceecceEEEEcCCCccccC Confidence 2345688999999986532222356788999999998764 322222211 13468999999999999988542 Q ss_pred --------cceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeecccccee Q lcl|Aclame:pro 224 --------AYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVR 295 (392) Q Consensus 224 --------~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (392) .+.+...++.+..+....+..+... ...+.......+..........+........+.. ........... T Consensus 230 ~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~-~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~s-Pt~~~L~~~~N 307 (351) T protein:vir:15 230 DKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPL-INGGQDVIVQKRVGTIHVAGTSIKASFSPSKASF-PTIDELAKSST 307 (351) T ss_pred CCCCceeEEEEEecceeeeecCCcCcceeeccc-CCCCceEEEEeeeeeeeeeeeeecccccccCcCC-cChHHhcCCcc Confidence 3333444444333222221111100 0001111111111111111111100000000000 00000000000 Q ss_pred eeecc-ceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccc-eEEEE Q lcl|Aclame:pro 296 ARKIH-LIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTA-LCDFE 347 (392) Q Consensus 296 ~~~~~-~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~-~vtw~ 347 (392) =.+.. ...+.+.+..... .+...+... ...+..++.+ +-+-+ T Consensus 308 W~~v~~~d~k~I~iv~~~~---~~~~~~~~~-------~~~~~~~~~~~~~~~~ 351 (351) T protein:vir:15 308 WEVVDGIDVRSIGVVAYTA---QLDPALTPG-------AQMPAADTSTDTGTTK 351 (351) T ss_pred cccccCCCccccceEEEEE---ecCcccccC-------CcCcCCCCccccCCCC Confidence 00000 0011111000000 000000000 0001101000 00000 No 61 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.45 E-value=1.9e-14 Score=95.77 Aligned_cols=279 Identities=11% Similarity=0.028 Sum_probs=147.1 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-----eeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-----SRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) |||+|==.++|.+++.+.|...+.+..|....-.-++.| |.+|+||.... ....||+...+.. ..+++ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~G--ak~vkIp~is~~~~~TsGl~dy~R~~g~~-----~g~v~ 73 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNG--GNTIKIADISFGSGTTGDLKAYNRSTGFT-----QGSVT 73 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEec--CcEEEEEEEEeeccccccccccccccCcc-----cccee Confidence 999885579999999999999998887743221124555 78999999863 3556776543222 23455 Q ss_pred CceEEEEEEeeeecceEee--HHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccccchhhH Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLT--DEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA----GAVHEVAPDEFF 149 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~--d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 149 (392) ....+.+|++++++.|.|+ |+++......+..-..+.+...++.++|+..++.+........ ......+..+.| T Consensus 74 ~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl 153 (302) T protein:vir:78 74 LAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALM 153 (302) T ss_pred eeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHH Confidence 5778899999999999999 7666544444434444445567889999998876654322211 112234567788 Q ss_pred HHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeec Q lcl|Aclame:pro 150 KGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHP 229 (392) Q Consensus 150 ~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~ 229 (392) +.|..+...|+++ ++|+++++|..+..|...+.|.+........ ....++.++.+.|++++....---.+.+.+.. T Consensus 154 ~~i~~~~~~~~e~---~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~-~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~ 229 (302) T protein:vir:78 154 GDIATAMELVDDS---NQLILVTSPTTLAGLLNTALIRESKNTQVLR-RGEVDTKITFIQDVEVLQVPSEYLYDKVAPKV 229 (302) T ss_pred HHHHHHHHHhhcc---CCeEEEEChHHHHHHhcchhhccceeccccc-cccccceeeeecccEEEEchhhhcccceeccC Confidence 9999999999985 5899999999999887777776544332211 12236778999999887532111111111110 Q ss_pred ccc-----------cccchhhhccccccccceeeccc-ceeeeeeeccccceeeeecccccceeeeEEEeeccccceeee Q lcl|Aclame:pro 230 TAF-----------IMATRAPAPPMGAVRSTAISGDQ-RIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRAR 297 (392) Q Consensus 230 ~a~-----------~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (392) ... .........+............. .....| ..+.. .+...++... ...++.... T Consensus 230 G~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~--------l~~~R---~Y~D~fV~~n-k~~gI~~~~ 297 (302) T protein:vir:78 230 GVPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAY--------KVDLR---LYHDLIVPKN-QRPGIIKAS 297 (302) T ss_pred CccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCccee--------eeeee---eEeeeeeecc-ccCeEEEee Confidence 000 00000000000000000000000 000000 00000 0000001100 000111000 Q ss_pred eccce Q lcl|Aclame:pro 298 KIHLI 302 (392) Q Consensus 298 ~~~~~ 302 (392) ...+. T Consensus 298 ~~~~~ 302 (302) T protein:vir:78 298 FGTIA 302 (302) T ss_pred ccccC Confidence 00000 No 62 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.44 E-value=2.1e-14 Score=95.54 Aligned_cols=298 Identities=12% Similarity=-0.006 Sum_probs=158.3 Q ss_pred Ccc----ccccHHHHHHHHHHHHHHhhcccc--eeeec--cccccc-CCCCCeEEEEeccceeeeccccccccCCCcccc Q lcl|Aclame:pro 1 MAN----AFSKPTAVVDTAIQMLQNELILTN--LVWLN--GIGDFA-HKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV 71 (392) Q Consensus 1 Man----~~~~~~~~~~~~~~~l~~~l~~~~--~v~~~--~~~~~~-~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~ 71 (392) ||. .+|+||++.+.+.+.+.+.+.|.. .+-++ ...-|. +.+|++|++|....+. .+.. .-.++..+.+ T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~-Gd~~--~v~~~~~i~~ 77 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD-GDSQ--VLNDTDDLVP 77 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC-Cccc--ccCCCcccch Confidence 995 448999999999999999988732 22222 112232 3579999999887652 1222 2234566888 Q ss_pred ccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc------ccccccc Q lcl|Aclame:pro 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAG------AVHEVAP 145 (392) Q Consensus 72 ~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~------~~~~~~~ 145 (392) +.+..++...++. ++.++|.++|+....+..|++.++.+|.+..++++.++++++.++++-..... ....... T Consensus 78 ~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~ 156 (324) T protein:vir:59 78 QKINAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADG 156 (324) T ss_pred hhcccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccc Confidence 8888888888884 68899999999999999999999999999999999999999988653211100 0111112 Q ss_pred hhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc-- Q lcl|Aclame:pro 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD-- 223 (392) Q Consensus 146 ~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~-- 223 (392) ...++.|.+|...|.++.- .-..++++|..+..|.++.- ....... -.++.++.+.|..|..+..+|... T Consensus 157 ~~s~~~l~~A~~~~GD~~~-~~~~ivmhS~v~~~L~~~~l-i~~~~~s------~~~~~i~~~~G~~VivdD~~p~~~~~ 228 (324) T protein:vir:59 157 IYSAETFVDASYKLGDHES-LLTAIGMHSATMASAVKQDL-IEFVKDS------QSGIRFPTYMNKRVIVDDSMPVETLE 228 (324) T ss_pred eecHHHHHHHHHHhCCccc-CcEEEEEchHHHHHHHHhhh-hhhcccc------ccCceeeeecccEEEEeCCCCccccC Confidence 2357889999999976532 33578999999999987642 2222111 114568899999999999887532 Q ss_pred -------cceeecccccccchhhhc--cccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 224 -------AYLYHPTAFIMATRAPAP--PMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 224 -------~~~~~~~a~~~a~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .+.+...++.+....... +..+.. ..+.............. .|..-.... ..+.. T Consensus 229 ~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~---~~g~~~l~~r~~~~~~p------------~G~s~~~~~-~~~~s 292 (324) T protein:vir:59 229 DGTKVFTSYLFGAGALGYAEGQPEVPTETARNA---LGSQDILINRKHFVLHP------------RGVKFTENA-MAGTT 292 (324) T ss_pred CCCceEEEEEEecCeEEEeecCCCcceecccCc---cccceEEEEeeEEEeEe------------eeEEecccc-cCCCC Confidence 233333333332211111 000000 00000000000000000 000000000 00000 Q ss_pred eeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecC Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDAN 336 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~ 336 (392) .............. ..+...+. -+.+.-.... T Consensus 293 Pt~~~L~~~~NW~~---v~~~k~i~-------i~~~~~~~~~ 324 (324) T protein:vir:59 293 PTDEELANGANWQR---VYDPKKIR-------IVQFKHRLQA 324 (324) T ss_pred CChhhhcCCccccc---ccCccccc-------eEEEEeeccC Confidence 00000000000000 00000000 0000000000 No 63 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.42 E-value=1.9e-14 Score=95.81 Aligned_cols=297 Identities=12% Similarity=0.058 Sum_probs=158.1 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccc---eeeec-ccccccCCCCCeEEEEeccceeeeccccccccCCCccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTN---LVWLN-GIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT 70 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~---~v~~~-~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~ 70 (392) |||+ +|+||++++.+.+.+.+.+.|.. ++.+. ....+.+ +|+++++|....+. .+.+...++ ...+. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~-~G~~i~~P~~~~l~-G~~~~~~dg-~~~i~ 77 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITS-GGLLVNMPFWNDLT-GDSEVLGNG-DKALE 77 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhc-CCCEEEecccccCC-CcccccCCC-ccccc Confidence 9973 48999999999999998877732 23222 2223333 79999999876542 222222121 23588 Q ss_pred cccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------------c Q lcl|Aclame:pro 71 VSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAA------------G 138 (392) Q Consensus 71 ~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~------------~ 138 (392) ++.+..++...++ ++..++|.++|+....+..|++.++.+|.+...+++.+..+++.+.+.-.... . T Consensus 78 ~~ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:10 78 TGKITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVS 156 (330) T ss_pred hhhcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhhee Confidence 8888888888887 44678999999999999999999999999999999999999988765432110 0 Q ss_pred ccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 139 AVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 139 ~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~ 218 (392) ...+......++.+.+|...|.++.- .-..++++|..+..|.++. +.......+ .++.++.+.|+.|..+.. T Consensus 157 ~~~~~~a~~s~~~l~~A~~~~GD~~~-~~~~ivmhS~v~~~L~~~~-li~~~~~s~------~~~~i~~~~G~~VivdD~ 228 (330) T protein:vir:10 157 DQSKASTGIDAGMVLDAKQLLGDSAD-QVTAIAMHSAVYTKLQKDN-LIQYIQPTT------ATINIPTYLGYRVIIDDG 228 (330) T ss_pred cccccccccCHHHHHHHHHHhccccc-cceEEEEcHHHHHHHHHhh-hhhhhcccc------cCcccccccceEEEEeCC Confidence 00111222356789999999976543 2357899999999998753 333322221 145789999999999999 Q ss_pred eeecc----cceeecccccccchhhh----ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEee-c Q lcl|Aclame:pro 219 IPHGD----AYLYHPTAFIMATRAPA----PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDP-N 289 (392) Q Consensus 219 v~~~~----~~~~~~~a~~~a~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 289 (392) +|... .+.+...++.+..+... .+..+... .+......... ..++ ..|..-.... . T Consensus 229 ~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~---~g~~~l~~r~~-----------~~~h-p~G~s~~~~~~~ 293 (330) T protein:vir:10 229 IAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAA---KGNDMIYTRRA-----------LVMH-PYGVKWTGAEVD 293 (330) T ss_pred CCCCCCceeEEEEecCceeeecccCCccccccccCCcc---ccceEEEEeeE-----------EEee-eeeeeecccccc Confidence 97543 23344444333221110 00010000 00000000000 0000 0000000000 0 Q ss_pred cccceeeeeccceeeeee--ecccccccceeeeeecc Q lcl|Aclame:pro 290 GVGFVRARKIHLIPGSIE--VAPEAGANATITAAAGE 324 (392) Q Consensus 290 ~~~~~~~~~~~~~~~~v~--v~~~~~~~~~~~~~~~~ 324 (392) ..+.......-....... ..+..+-...+.-.++. T Consensus 294 ~~~~sPt~~~L~~~~NW~~v~~~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 294 AGNITPSNADLAKFKNWKRVYEPKNIGIIALKHKIGK 330 (330) T ss_pred cCcCCcChHHhcCCcCcccccChhhcceEEEEEecCC Confidence 000000000000000000 00000000011111111 No 64 >protein:vir:5202 Length: 448 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040725;genbank:gi:9626396;genbank:GeneID:1260967 Probab=99.33 E-value=3.2e-13 Score=89.09 Aligned_cols=340 Identities=11% Similarity=0.076 Sum_probs=140.7 Q ss_pred CccccccHHHHHHHHHHHHHHhhccc---ceeeecccccccC---CCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILT---NLVWLNGIGDFAH---KFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~---~~v~~~~~~~~~~---~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.++-+.-|.+. .|-+.+.+. .+-+++--..|.. ..|+++.=..-....-.-|++.. ++-.+-...++ T Consensus 52 ~~~~~~~nef~~-----sLi~rIg~~~~~~~s~~NPL~~Fk~~~~~~g~~ieei~~d~~~~~~yd~~~-~e~~~F~~~~p 125 (448) T protein:vir:52 52 LINQTVQNDFIT-----SLVDRIGLVVIRQVSLNNPLKKFKKGQIPLGRTIEEIYTDITKEKQYDAEE-AEHKVFEREMP 125 (448) T ss_pred hhhHHHHHHHHH-----HHHHhhhhheeccccccchHHHHhhccccchhhhhhheeccccceeechhh-hcccccccCCC Confidence 554443333333 333222211 1111111122321 24666543333333333343322 22333334444 Q ss_pred cCceEEEEEEeeeecceEeeHHHH--hhhccChHHHHHHHHHHHHHH--HHHHHHH-HHH-hcc----cccccccccccc Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEEL--TFDLESFATQILPRQVRGVAD--ILEEGVR-DMI-VGA----PYEAAGAVHEVA 144 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~--~~~~~~~~~~~~~~~~~ala~--~vd~~~~-~~~-~~~----~~~~~~~~~~~~ 144 (392) .-...-.+.+++.+.-+.|.|.-+ +.-...-.++++.+...++.+ .+|++.. .++ ... -........-.+ T Consensus 126 ~vka~~h~~~r~~~y~~ti~~~~~~~aF~s~~~~d~~~~~i~~s~~~s~~~~ey~~~~~li~~~~~k~l~~~~~i~d~~t 205 (448) T protein:vir:52 126 NVKTLFHERNRQGFYHQTIQDDSLKTAFVSWGNFESFVSSIINAIYNSAEVDEYEYMKLLVDNYYSKGLFTTVKIDEPTS 205 (448) T ss_pred cceeeeeeccCcceeEEEEehhHHHHHHhhhcchHHHHHHHHHHHhcccchHHHHHHHHHHHHhhhccCeEEeeCCCccc Confidence 445566677777777778887443 333333346777777777665 3444332 111 111 001000001111 Q ss_pred chhhHHHHHHHHHHh-h------------hccCC-----CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeee Q lcl|Aclame:pro 145 PDEFFKGVNGARRAL-N------------ELYIP-----QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLG 206 (392) Q Consensus 145 ~~~~~~~i~~a~~~l-~------------~~~vp-----~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig 206 (392) .-..+..+++..+.. . ..+|+ ++.+++++++....|-- +.|..+-+.-.. ++ -+.+- T Consensus 206 ~~~~~~~~~k~~r~~~~~~~lp~~~~~~N~~~v~~~~~~~dl~li~~~~~~~~ldv-~~la~afn~~~~--~~--~~~~~ 280 (448) T protein:vir:52 206 STGALTEFVKKMRATARKLTLPQGSRDWNSMAVRTRSYMEDLHLIIDADLEAELDV-DVLAKAFNMNRT--DF--LGNVT 280 (448) T ss_pred chhHHHHHHHHHhhhhhheeCCCCCcccccccccccccceeeEEEECCCceEeecH-HHHHHHhccccc--cc--CcceE Confidence 222333333322221 2 12232 13356666666544311 111121111100 11 11222 Q ss_pred eEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceee--eecccccceeeeE Q lcl|Aclame:pro 207 RIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITS--NRSLIDTYFGLKV 284 (392) Q Consensus 207 ~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 284 (392) .+.||. +.+.. ++ .....|...++...+. ...-.+.+..... T Consensus 281 ~vd~F~---~~g~~---~i------------------------------~vskk~~~~~d~~~kg~t~~na~GL~~N~~~ 324 (448) T protein:vir:52 281 VIDGFA---STGLE---AV------------------------------LVDKDWFMVYDNLHKMETVRNPRGLYWNYYY 324 (448) T ss_pred EecCcc---ccCce---ee------------------------------eeeeeeeeeeeccceeeeeeccccceeeeee Confidence 222221 00000 00 0011111111111111 0111111111111 Q ss_pred EEeec--cccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCce-EEECCCce Q lcl|Aclame:pro 285 VEDPN--GVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDK-ATVAAGGL 361 (392) Q Consensus 285 ~~~~~--~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~V-AtVd~~G~ 361 (392) +..+. .......... +......+.++.+++.+.++..+..+++++++. +.++.++.|+|++||.++ +|||++|+ T Consensus 325 TItatss~~~~t~atA~-V~~t~paVtsVsVsPttasL~~G~TqqlTATVs--g~na~~~~VTWSvS~ns~~aTVsssG~ 401 (448) T protein:vir:52 325 HVWQTLSVSRSANAVAF-VSGDVPAVTQVIVSPNIAAVKQGGKQQFTAYVR--ATDGKDHKVVWSVEGGSTGTAITGDGL 401 (448) T ss_pred EEEEEEccCccccceEE-EEecccccceEEEcccceeecCCCeEEEEEEEe--cCCCCCCceEEEEcCCceeeEEeCCcc Confidence 11111 1111111111 111123456677777777777766666655555 555556889999987777 89999999 Q ss_pred EEEEecceEEEEEEEecCCCcEEEEEEEEeC Q lcl|Aclame:pro 362 VTGVAAGTSTVTATLVTPSGDREDTIVITVV 392 (392) Q Consensus 362 VTa~~~GtatITat~~~~~g~~tat~~VtVv 392 (392) +|+.+.|+++|||++... ..++.+.+.|+ T Consensus 402 vTv~a~gTatITVtATvd--ts~a~~~~~vv 430 (448) T protein:vir:52 402 LSVSGNEENQLTVKATVD--IGTEDKPNLVV 430 (448) T ss_pred EEeccCCcceEEEEEEec--CcccCCceeee Confidence 999999999999997432 22333333333 No 65 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.31 E-value=2.7e-14 Score=94.98 Aligned_cols=287 Identities=15% Similarity=0.170 Sum_probs=159.8 Q ss_pred Cc---c--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MA---N--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Ma---n--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) |- | .|+..|+|+++++..|++.| ++-.+.|+. .+|. -||+++||..+.+..+.. .+..+..+.++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~L-L~~~~~R~V-~DF~--~G~~L~I~tiGs~~~~~~-----~E~~~~~~~~i~ 71 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGL-LPETFYRNV-SDFG--SGETLHIKTIGSVTLQEA-----EEDTPLIYNPIE 71 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccc-cchhhhhhh-ccCC--CCCEEEecccCceeeecc-----ccCCCeeecccc Confidence 43 3 46889999999999999999 666677765 3464 499999999998888875 467789999999 Q ss_pred CceEEEEEEeeeecceEeeHHHH--hhhccChHHHHHHHHHHHHHHHHHHHHHHHHh--cc----cccccc-----cccc Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLTDEEL--TFDLESFATQILPRQVRGVADILEEGVRDMIV--GA----PYEAAG-----AVHE 142 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~d~~~--~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~--~~----~~~~~~-----~~~~ 142 (392) .+.+++.|..+++-+|.|+|+-+ ...+..++.+...+++++|.+..+.|++++-. .+ |+...+ ..++ T Consensus 72 TGEIt~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~ 151 (313) T protein:vir:95 72 TGEITFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAE 151 (313) T ss_pred cceEEEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEecc Confidence 99999999999999999998654 34677888999999999999999999985422 11 111111 1123 Q ss_pred ccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceee-eeccccceeeeEeeee------eeeEeeeEEE Q lcl|Aclame:pro 143 VAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIK-YESQGQSAVSALQEAR------LGRIYGYEIV 214 (392) Q Consensus 143 ~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~-~~~~G~~~~~a~~~g~------ig~~~g~~v~ 214 (392) +.....++.+..++-.++++++| +||+.+++|....-|-..-.+.. ....|. ..+..|. +.++||+++. T Consensus 152 T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k---~I~ESG~A~~~~Fi~~~YG~Di~ 228 (313) T protein:vir:95 152 TNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGK---MILESGMARGQRFIMNLYGWDIL 228 (313) T ss_pred CCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccc---eeeeccCCchhHHHHHHhhhhhh Confidence 33445688999999999999999 68999999998887754333322 222222 2223332 4578999999 Q ss_pred EecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 215 ESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 215 ~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .|+.+....-........... +-.-+...........+.|..--..-...+-......-.+..-+|...... T Consensus 229 ~SN~L~~AN~~D~~tT~~G~~-~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~------- 300 (313) T protein:vir:95 229 TSNRLHVANYNDGTTTGNGYV-GNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRL------- 300 (313) T ss_pred hhhhhhhccccccccccCcee-eeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceee------- Confidence 988775322111000000000 000000000000000000000000000000000000000000000000000 Q ss_pred eeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccc Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDV 340 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~ 340 (392) .++.+-+. ++++. T Consensus 301 -------------------------------~~L~~~~~--~A~~~ 313 (313) T protein:vir:95 301 -------------------------------DTLGLLAT--SATAY 313 (313) T ss_pred -------------------------------cceeEEEe--ccccC Confidence 00000000 00000 No 66 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.85 E-value=1.6e-10 Score=74.27 Aligned_cols=277 Identities=14% Similarity=0.107 Sum_probs=130.8 Q ss_pred Ccccccc-------HHHHHHHHHHHHHHhhc-ccc--eeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc Q lcl|Aclame:pro 1 MANAFSK-------PTAVVDTAIQMLQNELI-LTN--LVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT 70 (392) Q Consensus 1 Man~~~~-------~~~~~~~~~~~l~~~l~-~~~--~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~ 70 (392) ||-+.++ |+.+ ....+|.+.+. |.. -|.|- .....|+||++|+.. ...+..... ++..++ T Consensus 1 mAe~nlt~~~dL~~~~si--dfv~~f~~~i~~L~~~Lgi~r~----~p~a~G~tIt~pK~~--~tgda~dVa--EGe~Ip 70 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSI--DFVNKFSKNINDLLKLLGVTRR----ETLTNDLKIQTYKWE--VTLDQTDPG--EGETIP 70 (295) T ss_pred CCCcccccHhhccCceee--hhhHHhhhhHHHHHHHhccccc----cccccCCeEEeeeee--eeccccccc--CCcccc Confidence 8865432 3332 12233322221 111 12121 123459999997743 344544444 455676 Q ss_pred cccccCc---eEEEEEEeeeecceEeeHHHH-hhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccch Q lcl|Aclame:pro 71 VSDFTED---SFPVTLTDVAYHLGVLTDEEL-TFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPD 146 (392) Q Consensus 71 ~~~~~~~---~~~~~i~~~~~~~~~i~d~~~-~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~ 146 (392) +..+..+ +.+++++|+.. . ++||.. ....++...+-.+|..++|+++||+|++..++.++.... +..-- T Consensus 71 lskvt~~~~~t~t~kikK~rK-~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~t----g~~lq 143 (295) T protein:vir:99 71 LSKVTRTKDKDYTVKWFKKRR-A--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVK----GVGLQ 143 (295) T ss_pred hhhheeeeeeeeEEEeeeecc-c--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeee----hhhHH Confidence 7777654 67888866543 3 599996 689999999999999999999999999999876543321 11111 Q ss_pred hhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhccccee--eeeccccceeeeEeeeeeeeEeeeE-EEEecceeecc Q lcl|Aclame:pro 147 EFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFI--KYESQGQSAVSALQEARLGRIYGYE-IVESTLIPHGD 223 (392) Q Consensus 147 ~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~--~~~~~G~~~~~a~~~g~ig~~~g~~-v~~s~~v~~~~ 223 (392) ..++.+.++-..+.+.. ....+++++|...+.++++.... .+...|.+. +-++.|+. ++.+..+|.+. T Consensus 144 ~a~a~~~~al~~f~Ee~-~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~--------L~nfLG~q~II~S~kv~~G~ 214 (295) T protein:vir:99 144 KALSASWAKLATFNEFE-GSPLVSFVSPLDVANYLGDTKVGADASNVFGMTL--------LKNFLGMQNVIVMPSVPEGK 214 (295) T ss_pred HHHHHhhhhhhhccccc-CCceEEEEehHHHHHHHhccccccchhhhhhhhh--------hhhhhccceEEEcccCCCce Confidence 13333333333332221 12468999999999999886643 222244331 22588986 99999999888 Q ss_pred cceeecccccccchhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 224 AYLYHPTAFIMATRAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 224 ~~~~~~~a~~~a~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) .+......+.++..... .............++-..+.-.... .+.+.+..... .........-.+. T Consensus 215 ~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~-~~~t~et~~~~------------~~~lfpE~~dgiv 281 (295) T protein:vir:99 215 IYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQL-SNLTYESVFFG------------ANVLFAEIPEGVV 281 (295) T ss_pred EEEeeccceEEEEecCCchhhhhhhhhccCcccceEEEecccc-ceeeehhhhHh------------HHHhcccccceEE Confidence 76654444333222111 0000000000000000000000000 00000000000 0000000000000 Q ss_pred eeeeeecccccccceeeeeeccCe Q lcl|Aclame:pro 303 PGSIEVAPEAGANATITAAAGEDH 326 (392) Q Consensus 303 ~~~v~v~~~~~~~~~~~~~~~~~~ 326 (392) ...+. .....+.+. T Consensus 282 ~~tI~----------~~~~~~~~~ 295 (295) T protein:vir:99 282 EATIE----------AAAVPGIGG 295 (295) T ss_pred EEEEe----------cCcCCCCCC Confidence 00000 000000000 No 67 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.84 E-value=3e-10 Score=72.80 Aligned_cols=270 Identities=11% Similarity=0.077 Sum_probs=129.4 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccC---c Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTE---D 77 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 77 (392) |+..| -+.+++ -+..|.+.|. |.| +. .-..|+++++. |.+....+....++ +..++...+.. . T Consensus 22 ~siDf--~~~f~~-~i~~L~~~LG----v~r-~~---pla~GstIkt~-k~~~y~gda~dVaE--Ge~Iplskvt~~~~~ 87 (296) T protein:vir:98 22 ITIDV--TNKFQE-NISKLLEMLG----VTR-KI---SVSEGMTLKTY-AGYDVTLAEGNVPE--GEVIPLSKVERKIHS 87 (296) T ss_pred hhhhh--HHHHhh-hHHHHHHHhh----hcc-cc---cccCCCEEeec-cceeeeeccccccC--Ccccchhhheeeecc Confidence 22222 122322 2333334331 111 11 12349999764 44555666655554 55566667764 3 Q ss_pred eEEEEEEeeeecceEeeHHHH-hhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEEL-TFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGAR 156 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~-~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 156 (392) +.+++++|+. +. ++||.. ..+.++...+-.+|..++|+++||+|++..++.+........ ..-.......+.++. T Consensus 88 t~t~~ikK~r-K~--tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~t~-~~lQ~Ala~~~~~l~ 163 (296) T protein:vir:98 88 EKKIELKKYR-KA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALG-AGLQGALASAWGKLQ 163 (296) T ss_pred eEEEEeeccc-cc--cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeeech-hhHHHHHHHHhhhhh Confidence 5888887764 34 599996 789999999999999999999999999999876542211100 000111123344444 Q ss_pred HHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccc Q lcl|Aclame:pro 157 RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMAT 236 (392) Q Consensus 157 ~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~ 236 (392) ..|++.+- ...+++++|...+.++++.++......|-+ .+-++.|..++.|..+|.+..+......+.++. T Consensus 164 ~~feded~-~~~V~FVnP~D~a~ylg~a~it~qt~fG~t--------yl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay 234 (296) T protein:vir:98 164 VLFEDYGS-ERAIVFANSLDVAEYIAKAGITTQTAFGLT--------YLVDFTGTVIISTNDVTKGEIWATVPENIIFAY 234 (296) T ss_pred hhccccCC-CceEEEEehHHHHHHhcCCccchhheechh--------hhhhccccEEEEcCcCCCceEEEeeecceEEEe Confidence 55554431 346899999999999998876433333322 111477778999999998777665444433333 Q ss_pred hhhh-ccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 237 RAPA-PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 237 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) .... ...+.........++-..+.-.... .+...+..... .........-.+....+ T Consensus 235 ~~~~~~~l~~~f~~~~d~tglIGv~h~~~~-~~~t~eT~~~~------------~~~lfpE~~dgiv~~tI--------- 292 (296) T protein:vir:98 235 INPNNSELAKEFNLYGDPTGYIGMNHFQEN-TTLTIQTLLVS------------GMLMYPERIDGIVKVTL--------- 292 (296) T ss_pred ecccccchhhhhccccccccceEEEecccc-ceeeehhHhHh------------HHHhcccccceEEEEEe--------- Confidence 2211 0111111111111000000000000 00000000000 00000000000000000 Q ss_pred ceeeeeecc Q lcl|Aclame:pro 316 ATITAAAGE 324 (392) Q Consensus 316 ~~~~~~~~~ 324 (392) +.+. T Consensus 293 -----~~~~ 296 (296) T protein:vir:98 293 -----TPGV 296 (296) T ss_pred -----cCCC Confidence 0000 No 68 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.67 E-value=2.9e-09 Score=67.38 Aligned_cols=277 Identities=12% Similarity=0.090 Sum_probs=131.6 Q ss_pred Cc--cccccHHHHH----H-------HHHHHHHHhhcccceeeecccccccCCCCCeEEEEec-cceeeeccccccccCC Q lcl|Aclame:pro 1 MA--NAFSKPTAVV----D-------TAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVP-APSRGHTRKLRGAGAE 66 (392) Q Consensus 1 Ma--n~~~~~~~~~----~-------~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~-~~~~~~~~~~~~~~~~ 66 (392) |+ +++.+++-+. - .-+..|.+.|. |.| +.+ -..|.++++++. ......+..... ++ T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LG----v~r-~~p---la~Gt~iktyK~~~~~y~gda~dVa--EG 70 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALA----IQN-KIP---MNVGSALKQYRFKVEDSEKPNGDVA--EG 70 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhh----hhc-ccc---ccCCceeeeeeeeceeecccccccc--CC Confidence 76 3444444332 1 12333333331 111 111 124788887763 343444444444 45 Q ss_pred CccccccccC---ceEEEEEEeeeecceEeeHHHH-hhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Q lcl|Aclame:pro 67 RNLTVSDFTE---DSFPVTLTDVAYHLGVLTDEEL-TFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHE 142 (392) Q Consensus 67 ~~~~~~~~~~---~~~~~~i~~~~~~~~~i~d~~~-~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~ 142 (392) ..++...+.. .+.+++++|+.. . ++||.. ..+..+...+--+|..++|+++||++++..++.+.......... T Consensus 71 e~Iplskvt~~~~~t~~~~~kK~rK-~--tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t 147 (303) T protein:vir:10 71 DVIPLTKVTREQVDITELQFAKYRK-S--TSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKT 147 (303) T ss_pred cccchhhheeeecceEEEEeecccc-c--ccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccce Confidence 5676777774 467888877653 3 499996 78999999999999999999999999999988754322221111 Q ss_pred ccchhhHHHHHHHHH----Hh---hhccCCCCCEEEEchHHHHHhhcccceeee-eccccceeeeEeeeeeeeEeeeEEE Q lcl|Aclame:pro 143 VAPDEFFKGVNGARR----AL---NELYIPQGRVLVVGTAVTEQILNDDRFIKY-ESQGQSAVSALQEARLGRIYGYEIV 214 (392) Q Consensus 143 ~~~~~~~~~i~~a~~----~l---~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~-~~~G~~~~~a~~~g~ig~~~g~~v~ 214 (392) . ..++.|..|.. .| ++..+ .-+++++|...+.++++..+... ...|-+ . +-++.|+.++ T Consensus 148 ~---~s~~glq~Al~~~~~kl~~~~ed~~--~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n---~-----L~nfLG~~II 214 (303) T protein:vir:10 148 K---LSAENLQGALSKGRANLSVLLDDEI--TPIAFVNPNDTAEYLANGFINSTGAQFGVN---L-----LTPYVGVKIV 214 (303) T ss_pred e---ecHHHHHHHHHhhhhhccccccccc--cEEEEEchHHHHHHhhcCCcchhhhhhhhh---h-----hhhhhcceEE Confidence 1 11333333322 22 33222 23899999999999988765432 233432 2 2258899999 Q ss_pred EecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 215 ESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 215 ~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .|..+|.+..+......+.++..................++-..+.-.... .+...+...... .... T Consensus 215 ~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~h~~~~-~~~t~eT~~~~~------------~~lf 281 (303) T protein:vir:10 215 EFADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVLHDIQP-QRLTSDTIYASA------------ISMF 281 (303) T ss_pred EeccCCCceEEEeeccceEEEEecCchhhhhhhhhccccccceEEEecccc-ceeeehhHhHhH------------HHhc Confidence 999999888766554444333322211111111111111100000000000 000000000000 0000 Q ss_pred eeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCc Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANG 337 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~ 337 (392) ....-.+....+ +.-..+. + |+ T Consensus 282 pE~~dgiv~~ti--~~~e~~~--~-----------------~~ 303 (303) T protein:vir:10 282 PENIDAVIKVTI--KKDEAGE--L-----------------PS 303 (303) T ss_pred ccccceEEEEEE--eccccCC--C-----------------CC Confidence 000000000000 0000000 0 00 No 69 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.54 E-value=2.6e-08 Score=62.19 Aligned_cols=309 Identities=14% Similarity=0.032 Sum_probs=150.4 Q ss_pred Cc--c------ccccHHHHHHHHHHHHHHhhccc--ceeeeccccccc---CCCCCeEEEEeccceeeeccccccccCCC Q lcl|Aclame:pro 1 MA--N------AFSKPTAVVDTAIQMLQNELILT--NLVWLNGIGDFA---HKFNDTITVRVPAPSRGHTRKLRGAGAER 67 (392) Q Consensus 1 Ma--n------~~~~~~~~~~~~~~~l~~~l~~~--~~v~~~~~~~~~---~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~ 67 (392) || | .+|+||+|.+.+.+.-.+...|. ..+-+| .+|. ...|++|+||......-.+-......... T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d--~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~ 78 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASN--DFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNV 78 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecC--HHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcc Confidence 99 2 24999999999988887776663 234333 3443 25799999998877643221111111112 Q ss_pred ccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------- Q lcl|Aclame:pro 68 NLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA----------- 136 (392) Q Consensus 68 ~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~----------- 136 (392) .+.+..++.++..-.+ .+..++|..+|....++-.|+++.+..|-+.--.+...+.|++.+++.-... T Consensus 79 ~~t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~ 157 (367) T protein:vir:80 79 EAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) T ss_pred cccccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhh Confidence 3455555555544443 5567889999988888888999999999888888888888887776442110 Q ss_pred ----------------ccccc--cccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceee Q lcl|Aclame:pro 137 ----------------AGAVH--EVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVS 198 (392) Q Consensus 137 ----------------~~~~~--~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~ 198 (392) ..... ........+.+.+|+..|.++.- .=..+++++..+..|.+.. +.......+ T Consensus 158 ~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~-~l~~i~mHS~V~~~L~~~~-li~~i~~sd---- 231 (367) T protein:vir:80 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG-SIAAIAVHSMVYKRMTNND-EIEFIPDSK---- 231 (367) T ss_pred hccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccc-cccEEEEchHHHHHHHhcc-ccccccCCC---- Confidence 00000 01122357789999999976433 2257899999999988764 333322221 Q ss_pred eEeeeeeeeEeeeEEEEecceeecc--------cceeecccccccchhhhcccccccccee---ecccceeeeeeecccc Q lcl|Aclame:pro 199 ALQEARLGRIYGYEIVESTLIPHGD--------AYLYHPTAFIMATRAPAPPMGAVRSTAI---SGDQRIAMRWLVDYDS 267 (392) Q Consensus 199 a~~~g~ig~~~g~~v~~s~~v~~~~--------~~~~~~~a~~~a~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 267 (392) .+..++.+.|..|.++..+|... .+.+...++.+.......+......... .+......+......+ T Consensus 232 --~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP 309 (367) T protein:vir:80 232 --GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHP 309 (367) T ss_pred --CccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeeeEEeec Confidence 13458889999999999998643 2233333333322221111000000000 0000000000000000 Q ss_pred ceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeec-----ccccccceeeeeeccCeeEEEEEeecCcccccc Q lcl|Aclame:pro 268 TITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA-----PEAGANATITAAAGEDHTVQLKVTDANGDDVTA 342 (392) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~-----~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~ 342 (392) .|.. +.... ...+....+.. ...++...+ . .+.+. T Consensus 310 ------------~G~s---------~~~~~-v~~~~~~~~~~~~~~~~~sPt~~eL--a--------------~~~NW-- 349 (367) T protein:vir:80 310 ------------GGFN---------WLDAD-VTIPDNTGSPSGITSGPPAITLANL--A--------------NPDNW-- 349 (367) T ss_pred ------------ceee---------ecccc-cccccccccccccccccCCCChHHh--c--------------CCccc-- Confidence 0000 00000 00000000000 000000000 0 00000 Q ss_pred eEEEEEcCCceEEECCCc Q lcl|Aclame:pro 343 LCDFESSATDKATVAAGG 360 (392) Q Consensus 343 ~vtw~Ssn~~VAtVd~~G 360 (392) ...+.--+=.||.+=.+| T Consensus 350 ~~v~d~K~I~iv~~it~g 367 (367) T protein:vir:80 350 ERVTYRKNVPMAFLVTKG 367 (367) T ss_pred ccccchhhcceEEEEecC Confidence 011111122233333344 No 70 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.53 E-value=3.2e-08 Score=61.65 Aligned_cols=277 Identities=12% Similarity=0.043 Sum_probs=133.5 Q ss_pred Cccc-----------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcc Q lcl|Aclame:pro 1 MANA-----------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNL 69 (392) Q Consensus 1 Man~-----------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~ 69 (392) |..+ .+.|+.++.++++.+++..++..++.. +.- .|.+.++|+.....+... + ++... T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~-----~~~-~~~~~~~~~~~~~~a~~v---~--E~~~~ 69 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKA-----VPM-TKPEEEFTFMSGVGAFWV---D--EAERI 69 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhcee-----eec-CCCcEEEEEEcCCceeee---e--cCccc Confidence 3321 368999999999999999998777643 221 255678877654433333 2 33334 Q ss_pred ccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh-ccccc-----cccccccc Q lcl|Aclame:pro 70 TVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAPYE-----AAGAVHEV 143 (392) Q Consensus 70 ~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~-~~~~~-----~~~~~~~~ 143 (392) ...++.-+.+++...+ .+.-+.|+++-+..+..++.+.+.+..++++++++|+.++.--. ..+.. ........ T Consensus 70 ~~~~~~f~~v~l~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~ 148 (299) T protein:vir:41 70 QTSKPTFTKAKMRSKK-MGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVE 148 (299) T ss_pred cccccceeEEEEeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeec Confidence 4445555666666644 34556788888887888999999999999999999998873110 00000 00111112 Q ss_pred cchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc Q lcl|Aclame:pro 144 APDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD 223 (392) Q Consensus 144 ~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~ 223 (392) .....+++++++...|..++.+.. .++++|..+..|.+-. +..|.-.......+..+.+.|++|+.+..+|.+. T Consensus 149 ~~~~~~~~l~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~ 222 (299) T protein:vir:41 149 ETANKYDDLNEAIGLIEAEDLEPN-GIATIRKQRVKYRSTK-----DGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGD 222 (299) T ss_pred cccccHHHHHHHHHhhhcccCCcC-EEEEcHHHHHHHHHhh-----ccCCceeecCCcCCCCceecceeeEEecccCCCC Confidence 233568999999888887776433 5789999998887532 1222211111112234578999999999988543 Q ss_pred cc----eeecccccccchhhhccccccccceeecccceeeeeeeccccceee-eecccccceeeeEEEeeccccceeeee Q lcl|Aclame:pro 224 AY----LYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITS-NRSLIDTYFGLKVVEDPNGVGFVRARK 298 (392) Q Consensus 224 ~~----~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (392) .. ....+.............-....+... ..+..... +....+ ....-... ........... T Consensus 223 ~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~-----------~~~~~~~~~~~~~~~-~~~~r~~~-~~d~~v~~~~A 289 (299) T protein:vir:41 223 KDISELVGDWNQAYYGILRGVEYEILTEATLTT-----------VADETGKPLNLAERD-MAAIKATF-EVGFMVVKDEA 289 (299) T ss_pred CceEEEEEecccEEEEEecCcEEEEeecccccc-----------cccccccchhhhhcC-cEEEEEEE-EeccEEecccc Confidence 11 000000000000000000000000000 00000000 000000 00000000 00000000000 Q ss_pred ccceeeeeeeccccccc Q lcl|Aclame:pro 299 IHLIPGSIEVAPEAGAN 315 (392) Q Consensus 299 ~~~~~~~v~v~~~~~~~ 315 (392) +. .+.... ++ T Consensus 290 ~~------~l~~~a-a~ 299 (299) T protein:vir:41 290 FS------AVQPKA-GN 299 (299) T ss_pred eE------EEEecc-CC Confidence 00 000000 00 No 71 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.52 E-value=7.5e-09 Score=65.13 Aligned_cols=278 Identities=9% Similarity=0.034 Sum_probs=138.8 Q ss_pred Ccc----------------ccc-cHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeecccccc Q lcl|Aclame:pro 1 MAN----------------AFS-KPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRG 62 (392) Q Consensus 1 Man----------------~~~-~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~ 62 (392) |.| .++ .|+++..++.+.++++. +..++.|+.+ .+.+-.|....-. .+...|+.... T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~-iad~lf~~~~----a~~~~~v~f~~~~p~~~~~d~e~Va 75 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQF-ISESLFRNGG----ANPNGVVAYNEGNPSFLEDDVADVA 75 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccc-hhhhhhhccc----ccccceeEEEecccccccCcHhhcc Confidence 322 112 47888887777775555 4444444432 2335567765532 23355666666 Q ss_pred ccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccccc Q lcl|Aclame:pro 63 AGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE--AAGAV 140 (392) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~--~~~~~ 140 (392) ++.+.++ .....+...+-..+.....+.|+||.....-.+.+++.+++++.++++++|+.++..+..+... ..+.+ T Consensus 76 EggEiP~--~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~ 153 (318) T protein:vir:10 76 EFGEIPV--SAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTA 153 (318) T ss_pred Ccccccc--cCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcC Confidence 5555444 3455556666444455789999999999999999999999999999999999999887554211 11111 Q ss_pred ccccchhhHHHHHHHHHH-------hhhc-----cCCCC---CEEEEchHHHHHhhcccceeeeeccccceee-e-Eeee Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRA-------LNEL-----YIPQG---RVLVVGTAVTEQILNDDRFIKYESQGQSAVS-A-LQEA 203 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~-------l~~~-----~vp~~---r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~-a-~~~g 203 (392) +. ..+....++++|... +..+ +...| ..++++|..+..|++++.|.+.......... . -..| T Consensus 154 w~-~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg 232 (318) T protein:vir:10 154 WD-NGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTG 232 (318) T ss_pred CC-CcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccc Confidence 11 112222333333221 1110 11112 4799999999999999887654321111011 0 1133 Q ss_pred ee-eeEeeeEEEEecceeecccceeecccccccchhhhcccccccccee--ecccceeeeeeeccccceeeeecccccce Q lcl|Aclame:pro 204 RL-GRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAI--SGDQRIAMRWLVDYDSTITSNRSLIDTYF 280 (392) Q Consensus 204 ~i-g~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (392) .+ |++.|++|..+..+|.+.+.........+..-.-.. .....+.. ..+++-...|.. +........ + T Consensus 233 ~~~g~~lGl~vi~s~~~p~~~alvlq~g~vG~~~d~~pl--~~t~~~~egg~~~g~~~~s~~~--~~~~~~~~~-V---- 303 (318) T protein:vir:10 233 NFPGSVMGLNVIRSRTFPIDRVLIMERGTVGFYSDTRPL--QFTALYPEGNGPNGGPTESYRA--DASHKRALA-V---- 303 (318) T ss_pred cccceeeceEEeecCccCCCeeEEEecCCcceeeccccc--eeeecccCCCCCCCCcchhhhe--ehheeeeee-e---- Confidence 33 678999999999999888766554333221100000 00000000 000000000000 000000000 0 Q ss_pred eeeEEEeeccccceeeeeccceeeeeeecccccc Q lcl|Aclame:pro 281 GLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGA 314 (392) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~ 314 (392) .-+...+.++++..- T Consensus 304 -------------------~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 304 -------------------DQPKAALWLTGIVTP 318 (318) T ss_pred -------------------eCcceeEEEeeccCC Confidence 000000001110000 No 72 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.48 E-value=2.9e-08 Score=61.92 Aligned_cols=291 Identities=12% Similarity=0.027 Sum_probs=128.4 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||.. ++.|+.+++++++.|++..++..++.+- .- .+..++||+.. ...+.+. +++..+...+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i-----~~-~~~~~~ip~~~~~~~a~wv-----~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ-----PT-IFGPVKGAVFSGVPRAKIV-----GEGEVKPSAS 69 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee-----ec-CCCceEEEEEeCCcceEEe-----eCCccccccc Confidence 9953 4789999999999999999887776432 11 23457887743 2333332 2344444445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccC----hHHHHHHHHHHHHHHHHHHHHHHHHhc---ccc-----cccccc- Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLES----FATQILPRQVRGVADILEEGVRDMIVG---APY-----EAAGAV- 140 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~----~~~~~~~~~~~ala~~vd~~~~~~~~~---~~~-----~~~~~~- 140 (392) +.=+.+++...+. ..-+.|+++-+.++..+ +...+.+..+++|++++|..++.--.. .+. ...... T Consensus 70 ~~f~~v~l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~ 148 (315) T protein:vir:80 70 VDVSAFTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKN 148 (315) T ss_pred cceeeeEeeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccc Confidence 5445555554332 24456777766666555 456677788899999999877732110 000 000000 Q ss_pred ccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEeccee Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP 220 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~ 220 (392) .....+..+.+++++...+..+..-....++++|.....|.+..........|.-.......|..+++.|+.|+.++.+| T Consensus 149 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~ 228 (315) T protein:vir:80 149 IVDATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVS 228 (315) T ss_pred eeeccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCC Confidence 11122345778888877775554433345889999998886542211112222211122334556789999999999887 Q ss_pred ecccceeecccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeecccccce-e-eeEE-Eeeccccceee Q lcl|Aclame:pro 221 HGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYF-G-LKVV-EDPNGVGFVRA 296 (392) Q Consensus 221 ~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~-~~~~~~~~~~~ 296 (392) .............+ .+... ....+. ...........+. +......+. . ...- ........... T Consensus 229 ~~~~~~~~~~~~~~--------~GDfs-~~~~g~~~~~~i~i~~~~~~----~~~~~~~~~~~~v~~r~~~r~~~~v~~~ 295 (315) T protein:vir:80 229 GAPEMSPASGVKAI--------VGDFS-RVHWGFQRNFPIELIEYGDP----DQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) T ss_pred cccccccccccEEE--------Eeecc-cEEEEEecCeeEEEeccccc----cCcccchhhcCcEEEEEEEEecceeecc Confidence 54322110000000 00000 000000 0000000000000 000000000 0 0000 00000000000 Q ss_pred eeccceeeeeeecccccccc Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~~~~ 316 (392) ..+..........+..+... T Consensus 296 ~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 296 DSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred cceEEEeeccCCCCCCCCCC Confidence 00000000000000000000 No 73 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.39 E-value=4.6e-07 Score=55.31 Aligned_cols=353 Identities=14% Similarity=0.101 Sum_probs=149.0 Q ss_pred Cc-------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA-------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma-------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (392) |+ -.++.|++ ..++++.++++..+..++.+- .. .+.+++||+...... ....+ ++..+...+ T Consensus 10 ~~~~~t~~~~g~l~~~~-~~~ii~~l~~~s~i~~l~~~~---~~---~~~~~~ip~~~~~~~--a~wv~--Eg~~~~~s~ 78 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQ-AKDYFAEAEKTSIVQRVAQKI---PM---GATGIVIPHWTGDVS--AQWIG--EGDMKPITK 78 (397) T ss_pred HhhccCCCCccccchhH-HHHHHHHHHhccchhhhccee---ec---cCCceEEEEEcCCcc--eEEec--CCccccccc Confidence 33 23455654 567899999988887776432 11 245688877533221 12222 333444455 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccccccchhh Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY-----EAAGAVHEVAPDEF 148 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 148 (392) +.-..+++.+.+. ..-+.|+++-+.++..++...+.++..++|+.++|+.++.--..... .............. T Consensus 79 ~~f~~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:23 79 GNMTKRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAY 157 (397) T ss_pred cceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccch Confidence 5556666666443 35567888888888899999999999999999999988742111000 00011111223345 Q ss_pred HHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhccc----ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccc Q lcl|Aclame:pro 149 FKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDD----RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA 224 (392) Q Consensus 149 ~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~----~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~ 224 (392) ++.++++...|..+..+ .-.+++++..+..|.+-. ++.-..... ......+..+++.|++++.++.+|.+.. T Consensus 158 ~~~~~~~~~~l~~~~~~-~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~---~~~~~~~~~~tl~G~Pv~~s~~~~~g~~ 233 (397) T protein:vir:23 158 QGLGVSGLTKLVTDGKK-WTHTLLDDTVEPVLNGSVDANGRPLFVESTY---ESLTTPFREGRILGRPTILSDHVAEGDV 233 (397) T ss_pred hHHHHHHHHhhhhcccC-CCEEEEcHHHHHHHHHhhccCCceeeccccc---ccccccccCceeeeeeEEEeCCCCCCce Confidence 66777777777766553 346789999988886421 111100000 0111123346889999999999886543 Q ss_pred ceeecc--cccccchhhhccccccccceeecccce-eeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccc Q lcl|Aclame:pro 225 YLYHPT--AFIMATRAPAPPMGAVRSTAISGDQRI-AMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHL 301 (392) Q Consensus 225 ~~~~~~--a~~~a~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (392) ..+... ...+..............+...+.... .....+..+..... ........+..... + ..+.. T Consensus 234 ~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~r----a~~r~d~~v~~~~a---~---~~~~~ 303 (397) T protein:vir:23 234 VGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVR----VEAEYGLLINDVNA---F---VKLTF 303 (397) T ss_pred EEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEE----EEeeeccceecccc---e---EEEee Confidence 221111 110110000000000000000000000 00000000000000 00000000000000 0 00000 Q ss_pred eeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCceEE----EC---CCc--eEEEEecceEEE Q lcl|Aclame:pro 302 IPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATDKAT----VA---AGG--LVTGVAAGTSTV 372 (392) Q Consensus 302 ~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~VAt----Vd---~~G--~VTa~~~GtatI 372 (392) ... .. ..........+.+.++++..+.. ..+.|.-+...|.+ +| +.| .||+ ..|-.+| T Consensus 304 ~~~-------~~-~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 370 (397) T protein:vir:23 304 DPV-------LT-TYALDLDGASAGNFTLSLDGKTS----ANIAYNASTATVKSAIVAIDDGVSADDVTVTG-SAGDYTI 370 (397) T ss_pred ccc-------cc-eeeecccccCcceEEEEecCccc----cCcccccchhhhHHHhhhcccccccceeeeec-CCceeEE Confidence 000 00 00111111223344444433222 22333333322211 11 011 2333 2445555 Q ss_pred EEEEecC----CCcEEEEEEEEeC Q lcl|Aclame:pro 373 TATLVTP----SGDREDTIVITVV 392 (392) Q Consensus 373 Tat~~~~----~g~~tat~~VtVv 392 (392) |..-.-. .-.-.....|+|+ T Consensus 371 ~~~~~~~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:23 371 TVPGTLTADFSGLTDGEGASISVV 394 (397) T ss_pred EeccccccCccccccCccccceee Confidence 5531000 0000112345555 No 74 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.38 E-value=1.7e-07 Score=57.77 Aligned_cols=281 Identities=9% Similarity=-0.003 Sum_probs=127.2 Q ss_pred Ccc--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce-eeeccc---cccccCCCccccccc Q lcl|Aclame:pro 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS-RGHTRK---LRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~-~~~~~~---~~~~~~~~~~~~~~~ 74 (392) |.. .-+.|+.+..++++.+++..++..++.+- .. .+..++||+.... .+.... .....++.......+ T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~---~~---~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~ 93 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI---PI---SYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGT 93 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee---ec---cCCceEEEEEeCCceeEeecCccccccccccccccccc Confidence 110 11569999999999999999887776442 11 2345778775332 222111 000111111222222 Q ss_pred cCceEEEEEEeeee-cceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc-cc-------cc------cccc Q lcl|Aclame:pro 75 TEDSFPVTLTDVAY-HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-AP-------YE------AAGA 139 (392) Q Consensus 75 ~~~~~~~~i~~~~~-~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~-------~~------~~~~ 139 (392) +-.++++..+|. .-+.++++-+.++..++.+.+.++.++++++.+|..++.--.. .+ .. .... T Consensus 94 --~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~ 171 (333) T protein:vir:78 94 --AWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVD 171 (333) T ss_pred --ceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccccc Confidence 333445555554 4456777777788889999999999999999999988731110 00 00 0001 Q ss_pred cccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 140 VHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 140 ~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .........+++++++...+..+..-....++++|..+..|.+.....+. .|.-. ......+..+++.|+.|+.++. T Consensus 172 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~--~G~~i~~~~~~~~~~~~l~G~Pv~~~~~ 249 (333) T protein:vir:78 172 YLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDA--NGNVDPSRINLAAQTGDVLGLPAQFGRA 249 (333) T ss_pred ccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCC--CCceeecCccccCCCceeeceeeEEccc Confidence 11122334688888887766544332334688899998877654332221 12211 1123345668999999999999 Q ss_pred eeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccce----e-----eeecc--cccceeeeEEEe Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI----T-----SNRSL--IDTYFGLKVVED 287 (392) Q Consensus 219 v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~-----~~~~~--~~~~~~~~~~~~ 287 (392) +|.+...........+. +...........+............. . .+... .....++.+... T Consensus 250 i~~~~~~~~~~~~~~~~--------gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~ 321 (333) T protein:vir:78 250 VGGDLGAAVDSKTRIIG--------GDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDK 321 (333) T ss_pred cCCCccccCCCccEEEE--------EecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecc Confidence 98654221111100000 00000000000000000000000000 0 00000 000000000000 Q ss_pred eccccceeeeeccceeee Q lcl|Aclame:pro 288 PNGVGFVRARKIHLIPGS 305 (392) Q Consensus 288 ~~~~~~~~~~~~~~~~~~ 305 (392) ... ..+.....+ T Consensus 322 ~a~------~~l~~~~a~ 333 (333) T protein:vir:78 322 QAF------VKFVDDEQP 333 (333) T ss_pred cce------EEEeccCCC Confidence 000 000000000 No 75 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.37 E-value=1.4e-07 Score=58.13 Aligned_cols=266 Identities=8% Similarity=-0.008 Sum_probs=131.5 Q ss_pred Ccc--------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccC Q lcl|Aclame:pro 1 MAN--------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGA 65 (392) Q Consensus 1 Man--------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~ 65 (392) ||- ..+.|+.+..++++.+++..++..++.+- . . .+..++||+... ..+.. .+ + T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~ip~~~~~~~a~~---v~--E 69 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE--P-M---TAQKKKFTYLAKGVGAYW---VS--E 69 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee--e-c---cCCceEEEEEeCCcceEE---ee--c Confidence 442 34789999999999999999988876542 1 1 134578876532 22222 22 2 Q ss_pred CCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc---------- Q lcl|Aclame:pro 66 ERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-APY---------- 134 (392) Q Consensus 66 ~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~~---------- 134 (392) +....-.++.-..+++...+. +.-+.|+.+-+.++..++...+.++..+++++++|..++.--.. .+. T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~ 148 (304) T protein:vir:10 70 TERIQTSKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEG 148 (304) T ss_pred CcccccccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccc Confidence 333444455556666666443 34567888878888889999999999999999999988742110 000 Q ss_pred ccccccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEE Q lcl|Aclame:pro 135 EAAGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIV 214 (392) Q Consensus 135 ~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~ 214 (392) ..............|++|.++...|..++... ..++++|..+..|.+-. +..| ..+.....+++.|..|+ T Consensus 149 ~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~L~~lk-----d~~G----~~l~~~~~~~l~G~PV~ 218 (304) T protein:vir:10 149 AEEKGNVVTDTNNLYVDLSALMATIEDEELDP-NGVLTTRSFRSKMRNAL-----DAND----RPLFDANGNEIMGLPLS 218 (304) T ss_pred ccccccccccccchHHHHHHHHHHhhhccCCc-CEEEEcHHHHHHHHHhh-----ccCC----cEeecCCCccccceeeE Confidence 00011111223346899999888887666533 36789999999886421 1222 23344456789999999 Q ss_pred Eecceeecccce--e--ecccccccchhhhccccccccceeecccceeeeeeeccccce------eeeeccc--ccceee Q lcl|Aclame:pro 215 ESTLIPHGDAYL--Y--HPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI------TSNRSLI--DTYFGL 282 (392) Q Consensus 215 ~s~~v~~~~~~~--~--~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~--~~~~~~ 282 (392) .++.+|...... + ..+.......... ........ ........+... ..+.... -...+. T Consensus 219 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~--------~i~~~~e~-~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:10 219 YTGADVYDKKKSLALMGDWDYARYGILQGI--------EYAISEDA-TLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred EecccccCCCCcEEEEEehhhEEEEEecce--------EEEEeecc-eeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 998887533110 0 0000000000000 00000000 000000000000 0000000 000000 Q ss_pred eEEEeeccccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 283 KVVEDPNGVGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~ 312 (392) .+...... +.++... T Consensus 290 ~v~~~~a~---------------~~l~~a~ 304 (304) T protein:vir:10 290 MNVKPEAF---------------ATLKPTE 304 (304) T ss_pred Eeecccce---------------EEEEecC Confidence 00000000 0000000 No 76 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.37 E-value=1.4e-07 Score=58.13 Aligned_cols=266 Identities=8% Similarity=-0.008 Sum_probs=131.5 Q ss_pred Ccc--------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccC Q lcl|Aclame:pro 1 MAN--------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGA 65 (392) Q Consensus 1 Man--------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~ 65 (392) ||- ..+.|+.+..++++.+++..++..++.+- . . .+..++||+... ..+.. .+ + T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~ip~~~~~~~a~~---v~--E 69 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE--P-M---TAQKKKFTYLAKGVGAYW---VS--E 69 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee--e-c---cCCceEEEEEeCCcceEE---ee--c Confidence 442 34789999999999999999988876542 1 1 134578876532 22222 22 2 Q ss_pred CCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc---------- Q lcl|Aclame:pro 66 ERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-APY---------- 134 (392) Q Consensus 66 ~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~~---------- 134 (392) +....-.++.-..+++...+. +.-+.|+.+-+.++..++...+.++..+++++++|..++.--.. .+. T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~ 148 (304) T protein:vir:94 70 TERIQTSKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEG 148 (304) T ss_pred CcccccccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccc Confidence 333444455556666666443 34567888878888889999999999999999999988742110 000 Q ss_pred ccccccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEE Q lcl|Aclame:pro 135 EAAGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIV 214 (392) Q Consensus 135 ~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~ 214 (392) ..............|++|.++...|..++... ..++++|..+..|.+-. +..| ..+.....+++.|..|+ T Consensus 149 ~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~L~~lk-----d~~G----~~l~~~~~~~l~G~PV~ 218 (304) T protein:vir:94 149 AEEKGNVVTDTNNLYVDLSALMATIEDEELDP-NGVLTTRSFRSKMRNAL-----DAND----RPLFDANGNEIMGLPLS 218 (304) T ss_pred ccccccccccccchHHHHHHHHHHhhhccCCc-CEEEEcHHHHHHHHHhh-----ccCC----cEeecCCCccccceeeE Confidence 00011111223346899999888887666533 36789999999886421 1222 23344456789999999 Q ss_pred Eecceeecccce--e--ecccccccchhhhccccccccceeecccceeeeeeeccccce------eeeeccc--ccceee Q lcl|Aclame:pro 215 ESTLIPHGDAYL--Y--HPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI------TSNRSLI--DTYFGL 282 (392) Q Consensus 215 ~s~~v~~~~~~~--~--~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~--~~~~~~ 282 (392) .++.+|...... + ..+.......... ........ ........+... ..+.... -...+. T Consensus 219 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~--------~i~~~~e~-~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:94 219 YTGADVYDKKKSLALMGDWDYARYGILQGI--------EYAISEDA-TLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred EecccccCCCCcEEEEEehhhEEEEEecce--------EEEEeecc-eeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 998887533110 0 0000000000000 00000000 000000000000 0000000 000000 Q ss_pred eEEEeeccccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 283 KVVEDPNGVGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~ 312 (392) .+...... +.++... T Consensus 290 ~v~~~~a~---------------~~l~~a~ 304 (304) T protein:vir:94 290 MNVKPEAF---------------ATLKPTE 304 (304) T ss_pred Eeecccce---------------EEEEecC Confidence 00000000 0000000 No 77 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.36 E-value=1.6e-07 Score=57.78 Aligned_cols=262 Identities=13% Similarity=0.027 Sum_probs=125.3 Q ss_pred Ccc--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce--eeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS--RGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) ++- -.+.|..++.++++.+++...+.+++++.. . .|.++++|+.... .+.. . +++....-.++.- T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~---~---~~~~~~~~~~~~~~~~a~~---v--~E~~~~~~~~~~~ 185 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT---T---ESNSVEYVRETGFVNNAAP---V--SEGTQKPYSDLTF 185 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhcccee---c---CCCceEEEEEecCCCceee---e--cCCccccccccce Confidence 111 125677788899999999999988876541 1 2445777764221 2222 2 2233333344444 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh----------ccccccccccccccch Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV----------GAPYEAAGAVHEVAPD 146 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~----------~~~~~~~~~~~~~~~~ 146 (392) ..+++...+.. .-+.|+++-+ .+..++...+.+...++++..+|..++.--. ................ T Consensus 186 ~~i~~~~~k~~-~~~~is~ell-~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~ 263 (395) T protein:vir:43 186 ELENAPVRTIA-HLFKASRQIL-DDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAE 263 (395) T ss_pred eEEEEeeeeEE-EeehhhHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccc Confidence 55666654443 3456777644 4455666667778899999999998874210 0000001111122334 Q ss_pred hhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccce Q lcl|Aclame:pro 147 EFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL 226 (392) Q Consensus 147 ~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~ 226 (392) ..++++.++...|..+..+. -.++++|..+..|.+-. +..|.-.......+..+.+.|++|+.++.+|.+..+. T Consensus 264 ~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~l~~lk-----d~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~ 337 (395) T protein:vir:43 264 QRIDRIRLAILQAQLAEFPA-SGIVLNPIDWALIELNK-----DAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLT 337 (395) T ss_pred hhHHHHHHHHHhhccccCCC-cEEEEcHHHHHHHHHhh-----ccCCceeccccccCCCceecceeeEEcCCCCCCcEEE Confidence 46888888887777665533 36889999988875421 1122211111234556789999999999988665432 Q ss_pred eeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeecccee Q lcl|Aclame:pro 227 YHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) Q Consensus 227 ~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (392) ...+. .....+ .+...............+. .......++.+....... .+ T Consensus 338 gd~~~~~~~~~~-----------------~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~------~~---- 390 (395) T protein:vir:43 338 GAFSLGAQIFDR-----------------MDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFV------TG---- 390 (395) T ss_pred EeccceEEEEEe-----------------cceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE------EE---- Confidence 22111 100000 0000000000000000000 000000111111000000 00 Q ss_pred eeeeeccc Q lcl|Aclame:pro 304 GSIEVAPE 311 (392) Q Consensus 304 ~~v~v~~~ 311 (392) .++.. T Consensus 391 ---~~taa 395 (395) T protein:vir:43 391 ---SLTAS 395 (395) T ss_pred ---EeccC Confidence 00000 No 78 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.28 E-value=1.9e-07 Score=57.37 Aligned_cols=261 Identities=11% Similarity=0.025 Sum_probs=124.3 Q ss_pred Ccc-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccCce Q lcl|Aclame:pro 1 MAN-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTEDS 78 (392) Q Consensus 1 Man-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) -++ .++.|+++...+.+.+++...+..++++- .-..|..+.||+-.. ..+.+ . +++..+...++.-.. T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~-----~~~~~~~~~~p~~~~~~~a~w---v--~E~~~~~~~~~~f~~ 185 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTF-----TTSDANPLDFTVITGRSSASI---V--GETAEIPESYPATAQ 185 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceee-----ecCCCceeEEEEEcCCcceee---e--cccccccccccceee Confidence 112 34667777777777777776665555431 112345677876432 22222 2 233344444555566 Q ss_pred EEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHH-------HhccccccccccccccchhhHHH Q lcl|Aclame:pro 79 FPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM-------IVGAPYEAAGAVHEVAPDEFFKG 151 (392) Q Consensus 79 ~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~ 151 (392) +++...+. +.-+.|+++-+.++..++...+.+..+++|+.++|..++.- +...................+++ T Consensus 186 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~ 264 (390) T protein:vir:62 186 RSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDA 264 (390) T ss_pred eEeeeeeE-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHH Confidence 66665443 34556888888888889999999999999999999988731 11110000011111222346888 Q ss_pred HHHHHHHhhhccCCCCCEEEEchHHHHHhhc--ccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecceeecccceee Q lcl|Aclame:pro 152 VNGARRALNELYIPQGRVLVVGTAVTEQILN--DDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLIPHGDAYLYH 228 (392) Q Consensus 152 i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~--~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~ 228 (392) ++++...|+.... .+-.++++|..+..|.+ |.+ |.-. ...+..|..+.+.|++|+.++.+|........ T Consensus 265 l~~~~~~l~~~~~-~~a~~vmn~~~~~~L~~lkd~~-------g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd 336 (390) T protein:vir:62 265 LIDLFHEVPSAYR-ANAKYVVNDLRAAQMRKLKDAN-------GQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFAD 336 (390) T ss_pred HHHHHHhhhhhhh-cCCEEEEchHHHHHHHHhhccC-------CCeeecCCcCCCccceecccceEEecCCCCccEEEee Confidence 8888777765433 23467889999888743 322 2110 01223455568999999999888764432211 Q ss_pred cccccccchhhhccccccccceeecccceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) .+........ +....... +.....+.. ......++.+.... . + T Consensus 337 ~s~~~i~~~~-----------------~~~v~~~~--~~~~~~~~~~~~~~~r~d~~~~~~~---------A-------~ 381 (390) T protein:vir:62 337 LSKYRVRFAG-----------------SLRVDRSV--DAKFSTDQIVYRFLQRADGLLVDAR---------G-------A 381 (390) T ss_pred ccceeEEeec-----------------ceEEEeec--cccccCCcEEEEEEEEeCcEeechh---------h-------e Confidence 1111000000 00000000 000000000 00000000000000 0 0 Q ss_pred eeccccccc Q lcl|Aclame:pro 307 EVAPEAGAN 315 (392) Q Consensus 307 ~v~~~~~~~ 315 (392) .+..+.... T Consensus 382 ~~l~~~~~a 390 (390) T protein:vir:62 382 KVLTVTPGA 390 (390) T ss_pred EEEEeecCC Confidence 000000000 No 79 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.27 E-value=4.9e-07 Score=55.20 Aligned_cols=286 Identities=8% Similarity=0.000 Sum_probs=125.8 Q ss_pred Ccc--------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccC Q lcl|Aclame:pro 1 MAN--------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGA 65 (392) Q Consensus 1 Man--------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~ 65 (392) |+- .-+.|+.+.+++++.+++..++.+++.+- .- .+..+++|+... ..+.. . ++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~-----~~-~~~~~~~p~~~~~~~a~~---v--~E 69 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKV-----PM-GPTGISIPHWTGAVSASW---T--GE 69 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhccee-----ec-cCCceEEEEEcCCcceeE---e--cC Confidence 331 11344555678999999999887776432 11 244578877532 22222 2 23 Q ss_pred CCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHH----------Hhcccc- Q lcl|Aclame:pro 66 ERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM----------IVGAPY- 134 (392) Q Consensus 66 ~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~----------~~~~~~- 134 (392) +..+...++.-..+++...+ .+.-+.|+++-+.++..++.+.+.++.++++++++|+.++.- +..... T Consensus 70 g~~~~~~~~~f~~i~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~ 148 (330) T protein:vir:77 70 AERKPITKGSFGKQELEPVK-ITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKV 148 (330) T ss_pred CCccccccceeeEEEEeEEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccccc Confidence 44444455555556666533 234557888877777889999999999999999999988731 111000 Q ss_pred ----ccccccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcc--cc--ee--eeeccccceeeeEeeee Q lcl|Aclame:pro 135 ----EAAGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILND--DR--FI--KYESQGQSAVSALQEAR 204 (392) Q Consensus 135 ----~~~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~--~~--~~--~~~~~G~~~~~a~~~g~ 204 (392) ..............|+++.++...|..++.+.. .++++|..+..|.+- .. +. .....+ ...... T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~~~ 222 (330) T protein:vir:77 149 VSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWT-GTLLDNVTEPILNTAVDGNGRPLFVESTYTE-----QVGAIR 222 (330) T ss_pred ceeecccccccccccchhHHHHHHHHHhhhhcCCCcc-EEEEcHHHHHHHHHHhccCCceeecCccccc-----cccccC Confidence 000111112233457888888777777665433 578999999887642 11 11 111011 111223 Q ss_pred eeeEeeeEEEEecceeecccce------eecccccccchhhhccccccccceeecccceeeeeeeccccceeeeec--cc Q lcl|Aclame:pro 205 LGRIYGYEIVESTLIPHGDAYL------YHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LI 276 (392) Q Consensus 205 ig~~~g~~v~~s~~v~~~~~~~------~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 276 (392) -+++.|+.|+.++.+|...... ...+..................+...+............+. ...+.. -. T Consensus 223 ~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~-f~~~~~~~r~ 301 (330) T protein:vir:77 223 EGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISL-WQHNMVAVRC 301 (330) T ss_pred CceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccch-hhcCcEEEEE Confidence 4578999999999988643211 00011000000000000000000000000000000000000 000000 00 Q ss_pred ccceeeeEEEeeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCccc Q lcl|Aclame:pro 277 DTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDD 339 (392) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~ 339 (392) ....++.+..... + +.++..... ..|... T Consensus 302 ~~r~d~~v~~~~a---~------------~~i~~~~~~-------------------~~~~~~ 330 (330) T protein:vir:77 302 EAEFAFMVNDKDA---F------------VKLTDQVAG-------------------TDPEEE 330 (330) T ss_pred EEEeccEEecccc---e------------EEEEeccCC-------------------cCCCCC Confidence 0000000000000 0 000000000 000000 No 80 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.23 E-value=5.3e-07 Score=54.98 Aligned_cols=270 Identities=17% Similarity=0.114 Sum_probs=122.3 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEe-ccceeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRV-PAPSRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) +.+ ..+.|+.+...+++.+++..++..++.+- .. .|...++++ .....+.+.......+.. ........ T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~a~~v~e~~~~~~~-~~~~~~~~ 237 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEEL---PM---SSKILTMLVEPDAGKATWVAASTYGTDT-TTGEEVKG 237 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhccee---ec---CCcceEEEEecCCcceeecccccccccc-cccccccc Confidence 221 23789999999999999999887776542 11 133455543 222222222211111111 00111111 Q ss_pred ceEEEEEEeeeecc-eEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hccccc------------ccccccc Q lcl|Aclame:pro 77 DSFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYE------------AAGAVHE 142 (392) Q Consensus 77 ~~~~~~i~~~~~~~-~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~------------~~~~~~~ 142 (392) +-.++++.-++... +.|+++-+.++..++...+.+...++|+.++|..++.-- .+.|.+ ....... T Consensus 238 ~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 238 ALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred cceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccc Confidence 22334444454444 578888777777899999999999999999999887310 001100 0000111 Q ss_pred ccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccc-----eeeeEeeeeeeeEeeeEEEEec Q lcl|Aclame:pro 143 VAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQS-----AVSALQEARLGRIYGYEIVEST 217 (392) Q Consensus 143 ~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~-----~~~a~~~g~ig~~~g~~v~~s~ 217 (392) ......|++|+++...|..+... +-.++++|..+..|.+-.. ..|.- .......|..+.+.|++|+.+. T Consensus 318 ~~~~~~~~~i~~~~~~l~~~~~~-~~~~v~~~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~ 391 (458) T protein:vir:10 318 GSVLVTAKTISKLRRKLGRHGLK-LSKLVLIVSMDAYYDLLED-----EEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSE 391 (458) T ss_pred ccccccHHHHHHHHHhhhhhhcC-CCEEEEcHHHHHHHHhhcc-----cCCceeeccccccccccCcCceecceeeEEcc Confidence 12234688999988888766553 3357889999887754211 11110 0112334556689999999999 Q ss_pred ceeecccceeecccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeecccccceeeeEEEeeccccceee Q lcl|Aclame:pro 218 LIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 218 ~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (392) .+|....... +.+ +.....+..++ .+..+. ...+...... ......-.+..+.... ++.. T Consensus 392 ~~p~~~~~~~----~~~---------~~f~~~~~~~~~~~~~v~-~d~~~~~~~~-~~~~~~r~~~~v~~~~---a~v~- 452 (458) T protein:vir:10 392 YFPAKANSAE----FAV---------IVYKDNFVMPRQRAVTVE-RERQAGKQRD-AYYVTQRVNLQRYFAN---GVVS- 452 (458) T ss_pred ccccccCCcc----eEE---------EEecccEEEEEeeceEEE-eecccCCCce-EEEEEEEecceEeccc---ceEE- Confidence 9875421100 000 00000000000 000000 0000000000 0000000000000000 0000 Q ss_pred eeccceeeeeeecccccccc Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~~~~ 316 (392) . .+... T Consensus 453 -------~-------~~aa~ 458 (458) T protein:vir:10 453 -------G-------TYAAS 458 (458) T ss_pred -------E-------eeccC Confidence 0 00000 No 81 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.22 E-value=4.2e-07 Score=55.55 Aligned_cols=282 Identities=10% Similarity=-0.006 Sum_probs=132.4 Q ss_pred Cc-c--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MA-N--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Ma-n--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |+ . ..+.|+.|..++++.+++..++..++.+- .. .|.+++||+... ..+.. . +++..+...+++- T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~-----~~-~~~~~~ip~~~~~~~a~~---v--~Eg~~~~~~~~~f 98 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE-----PM-EGTEKKFTFWADKPGAYW---V--GEGQKIETSKATW 98 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee-----ec-cCCceEEEEEecCcceee---e--cCCccccccccce Confidence 22 1 23679999999999999999888776432 11 245688877532 22222 2 2344444455555 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc------ccccccccccchhhHH Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------EAAGAVHEVAPDEFFK 150 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 150 (392) ..++++..+. +.-+.|+++-+.++..++...+.++.++++++++|+.++.--..... ..............++ T Consensus 99 ~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) T protein:vir:93 99 VNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) T ss_pred eEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHH Confidence 6666666443 35567888888888889999999999999999999988632111100 0001111112234688 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccccee--e Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLY--H 228 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~--~ 228 (392) ++.++...|..++... ..++++|..+..|.+.. +..|.- .+..+..+++.|.+|+.+...+......+ . T Consensus 178 ~i~~~~~~l~~~~~~~-~~~v~n~~~~~~L~~l~-----d~~G~~---~~~~~~~~~l~G~PVv~~~~~~~~~~~i~~gd 248 (324) T protein:vir:93 178 NIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV-----DPETKE---RIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHHhhhhccCCC-CEEEEcHHHHHHHHHhh-----CCCCCe---eecCCCCCcccceeeEeecCCCCCcceEEEEe Confidence 9999888887766533 36889999998886421 122321 23345567888999887655433222111 1 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+.............-......... ............+.. ..-.....+..+....... .+......-+ T Consensus 249 fs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~----~~r~~~r~d~~v~~~~a~~------~l~~a~~~~~ 318 (324) T protein:vir:93 249 FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV----ALRATMHVALHIADDKAFA------KLVPADKRTD 318 (324) T ss_pred cceEEEEEecCcEEEEeecccccccccccccchhhhhcCcE----EEEEEEEeccEEecccceE------EEecccccCC Confidence 1111111110000000000000000 000000000000000 0000001111111110000 0000000000 Q ss_pred eccccc Q lcl|Aclame:pro 308 VAPEAG 313 (392) Q Consensus 308 v~~~~~ 313 (392) +++-.+ T Consensus 319 ~~~~~~ 324 (324) T protein:vir:93 319 SVPGEV 324 (324) T ss_pred CCCCCC Confidence 111111 No 82 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.19 E-value=5.4e-07 Score=54.95 Aligned_cols=283 Identities=10% Similarity=-0.009 Sum_probs=132.3 Q ss_pred Cc-c--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MA-N--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Ma-n--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+ . ..+.|+.|+.++++.+++..++..++.+- . . .|.+++||+..... ..... +++......++.-. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~--~-~---~~~~~~~p~~~~~~--~a~~v--~Eg~~~~~~~~~f~ 99 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE--P-M---EGTEKKFTFWADKP--GAYWV--GEGQKIETSKATWV 99 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee--e-c---cCCceEEEEEecCc--ceeee--cCCcccccccccee Confidence 33 2 22678999999999999999887776442 1 1 24568887753211 11222 23444444555556 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc------ccccccccccchhhHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------EAAGAVHEVAPDEFFKG 151 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 151 (392) .+++...+. ..-+.|+++-+.++..++...+.++..+++++++|+.++.--..... ..............+++ T Consensus 100 ~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:96 100 NATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHH Confidence 666666443 34567888888878889999999999999999999988732111100 01111111223346899 Q ss_pred HHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccce--eec Q lcl|Aclame:pro 152 VNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL--YHP 229 (392) Q Consensus 152 i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~--~~~ 229 (392) |.++...|..+.... ..++++|..+..|.+.. +..|. ..+..+..+.+.|++|+.+...+...... ... T Consensus 179 i~~~~~~i~~~~~~~-~~~i~n~~~~~~L~~lk-----d~~G~---~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~ 249 (324) T protein:vir:96 179 IIDLEALLEDDELEA-NAFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHHHhhhhccCCC-CEEEEcHHHHHHHHHhh-----CCCCC---eeecCCCCCcccceeeEeecCCCCCcceEEEEec Confidence 999888887665533 36889999988875421 12232 12334556678999988766544332211 111 Q ss_pred ccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) +....................... ............+... .-.....+..+....... .+......-++ T Consensus 250 s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~----~r~~~r~d~~v~~~~a~~------~l~~a~~~~~~ 319 (324) T protein:vir:96 250 DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA----LRATMHVALHIADDKAFA------KLVPADKRTDS 319 (324) T ss_pred ceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE----EEEEEEeccEEecccceE------EEecccccCCC Confidence 111111110000000000000000 0000000000000000 000000111111100000 00000000001 Q ss_pred ccccc Q lcl|Aclame:pro 309 APEAG 313 (392) Q Consensus 309 ~~~~~ 313 (392) ++-.+ T Consensus 320 ~~~~~ 324 (324) T protein:vir:96 320 VPGEV 324 (324) T ss_pred CCCCC Confidence 11111 No 83 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.19 E-value=5.3e-07 Score=54.99 Aligned_cols=282 Identities=10% Similarity=0.011 Sum_probs=131.6 Q ss_pred Cc--cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCce Q lcl|Aclame:pro 1 MA--NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDS 78 (392) Q Consensus 1 Ma--n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) ++ ...+.|+.|..++++.+++..++..++.+- .. .|.+++||+.... ......+ ++..+...++.-.. T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~-----~~-~~~~~~ip~~~~~--~~a~~v~--Eg~~~~~~~~~f~~ 100 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----PM-EGTEKKFTFWADK--PGAYWVG--EGQKIETSKATWVN 100 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhccee-----ec-cCCceEEEEEecC--cceeEec--cCccccccccceeE Confidence 22 234789999999999999999988876432 11 2456888775321 1122222 34445555555566 Q ss_pred EEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc------ccccccccccchhhHHHH Q lcl|Aclame:pro 79 FPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------EAAGAVHEVAPDEFFKGV 152 (392) Q Consensus 79 ~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~i 152 (392) ++++..+. ..-+.|+++-+.++..++...+.++.++++++++|+.++.--..... ..............+++| T Consensus 101 v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i 179 (324) T protein:vir:97 101 ATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) T ss_pred EEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHH Confidence 66665443 35557888777777789999999999999999999988742211110 001111112234468999 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceee--cc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYH--PT 230 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~--~~ 230 (392) .++...|..++.... .++++|..+..|.+-. +..|. ..+..+..+.+.|++|+.+...+......+. .+ T Consensus 180 ~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~lk-----d~~g~---~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~ 250 (324) T protein:vir:97 180 IDLEALLEDDELEAN-AFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFD 250 (324) T ss_pred HHHHHhhhhccCCCC-EEEEcHHHHHHHHHhh-----cCCCc---eeecCCCCccccceeeEeecCCCCCcceEEEEecc Confidence 999888877665433 6789999988876421 11222 1223344567899998877655433322111 11 Q ss_pred cccccchhhhccccccccceeec--ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISG--DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) ....................... ..+. ....+..+... .-.....+..+..... +. .+.......+. T Consensus 251 ~~~i~~~~~~~i~~~~~~~~~~~~~~~~~-~~~~f~~d~~~----~r~~~r~d~~v~~~~a---~~---~l~~~~~~~~~ 319 (324) T protein:vir:97 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGT-PVNLFEQDMVA----LRATMHVALHIADDKA---FA---KLVPADKKTDS 319 (324) T ss_pred cEEEEEecCcEEEEeeccccccccccccc-chhhhhcCcEE----EEEEEEeccEEecccc---eE---EEEeccCCCCC Confidence 11110000000000000000000 0000 00000000000 0000000000000000 00 00000000000 Q ss_pred ccccc Q lcl|Aclame:pro 309 APEAG 313 (392) Q Consensus 309 ~~~~~ 313 (392) ++-.+ T Consensus 320 ~~~~~ 324 (324) T protein:vir:97 320 VPGEV 324 (324) T ss_pred CCCCC Confidence 11111 No 84 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.18 E-value=4.2e-07 Score=55.55 Aligned_cols=278 Identities=13% Similarity=-0.007 Sum_probs=126.7 Q ss_pred Ccc----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MAN----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Man----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) |+. ..+.|+.+++++++.+++...+..++.+. .- .+.+++||+.. ...+.+. . ++..+...++. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~-----~~-~~~~~~ip~~~~~~~a~wv---~--E~~~~~~s~~~ 69 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQK-----PI-PFNGSKEFTFTLDSDIDVV---A--ENGKKTHGGLS 69 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhccee-----ec-CCCceEEEEEecCcceEEe---e--cCccccccccc Confidence 884 44789999999999999999888876442 11 23467887742 2233332 2 23333334444 Q ss_pred CceEEEEEEeeeecceEeeHHHHhh---hccChHHHHHHHHHHHHHHHHHHHHHHHHhccc------cc-------cccc Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLTDEELTF---DLESFATQILPRQVRGVADILEEGVRDMIVGAP------YE-------AAGA 139 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~d~~~~~---~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~------~~-------~~~~ 139 (392) -+.+++...+ .+.-+.++++-+.+ +..++.+.+.++.+++|++++|+.++.-..... .. .... T Consensus 70 f~~v~l~~~k-l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 148 (303) T protein:vir:97 70 LEPVTIVPIK-VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQV 148 (303) T ss_pred eeeEEeeeEE-EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccc Confidence 4555555422 23455677775533 345688889999999999999998874321000 00 0000 Q ss_pred cccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee--eeEeeeeeeeEeeeEEEEec Q lcl|Aclame:pro 140 VHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV--SALQEARLGRIYGYEIVEST 217 (392) Q Consensus 140 ~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~--~a~~~g~ig~~~g~~v~~s~ 217 (392) .........|++|.++...|..++... ..++++|..+..|.+-. +..|.-.. ..-..+..+++.|+.++.++ T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~L~~lk-----d~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~ 222 (303) T protein:vir:97 149 VKFTESEDADANIEAAVNLIQGAEGVV-TGLAMDTEFSTALAKVT-----NGEMGPKMYPELAWGANPDSINGLKSSVNT 222 (303) T ss_pred cccccccchHHHHHHHHHHHhhcCCCc-cEEEEcHHHHHHHHHhh-----ccCCCeEEecCccCCCCCceecceeeEEec Confidence 111123456888988887776655432 35888999998886421 11111100 00012345689999999999 Q ss_pred ceeecccceeecccccccchhhhccccccccceeec-ccceeeeeeecccccee-eeecccccceeeeEEEeecccccee Q lcl|Aclame:pro 218 LIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTIT-SNRSLIDTYFGLKVVEDPNGVGFVR 295 (392) Q Consensus 218 ~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (392) .+|............. .+........+ ..+.........+.... .+....+.. .. -......... T Consensus 223 ~v~~~~~~~~~~~~~~---------~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~-~~-r~~~r~~~~v-- 289 (303) T protein:vir:97 223 TVGAGADEAESKDLVI---------IGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQI-YL-RAEAYIGWGI-- 289 (303) T ss_pred ccCCccccCCCccEEE---------EeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcE-EE-EEEEEeccEe-- Confidence 9875332111100000 00000000000 00000000000000000 000000000 00 0000000000 Q ss_pred eeeccceeeeeeeccccc Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAG 313 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~ 313 (392) ..+..-+.+....+ T Consensus 290 ----~~p~af~~l~~~~~ 303 (303) T protein:vir:97 290 ----LDAKSFARVTKGEV 303 (303) T ss_pred ----ecccceEEeeCCCC Confidence 00000000111111 No 85 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.18 E-value=4.8e-07 Score=55.21 Aligned_cols=309 Identities=8% Similarity=-0.050 Sum_probs=145.4 Q ss_pred Cccc----cccHH--HHHHHHHHHHHHhhccc--ceeeeccccccc---CCCCCeEEEEeccceeee-ccccccccCCCc Q lcl|Aclame:pro 1 MANA----FSKPT--AVVDTAIQMLQNELILT--NLVWLNGIGDFA---HKFNDTITVRVPAPSRGH-TRKLRGAGAERN 68 (392) Q Consensus 1 Man~----~~~~~--~~~~~~~~~l~~~l~~~--~~v~~~~~~~~~---~~~Gdtv~i~~~~~~~~~-~~~~~~~~~~~~ 68 (392) ||-+ +++|| ++.+.+.+.-.+...|. ..+-+| .++. ...|+.+++|......-. +........... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d--~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTST--PYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceecc--HHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 9954 36777 79888887776666653 334344 3443 256999999987664321 211111122223 Q ss_pred cccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--------- Q lcl|Aclame:pro 69 LTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA--------- 139 (392) Q Consensus 69 ~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~--------- 139 (392) +.+..+...+..-.+ .++.++|..+|....++-.|+++++.++-+.--.+.-.+.+++.+++.-...... T Consensus 79 ~t~~kitt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~ 157 (349) T protein:vir:78 79 ATPRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQND 157 (349) T ss_pred cccccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhccc Confidence 455555555544444 5667888888888777778999999999998888888888888776542211000 Q ss_pred -cc--cccchhhHHHHHHHHHHhhhccC--CCC--CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeE Q lcl|Aclame:pro 140 -VH--EVAPDEFFKGVNGARRALNELYI--PQG--RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYE 212 (392) Q Consensus 140 -~~--~~~~~~~~~~i~~a~~~l~~~~v--p~~--r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~ 212 (392) +. ........+.+++|...|.++.. ..+ ..+++++..+..|.+...+.. ... .-+...++.+.|.. T Consensus 158 ~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~-i~~------s~~~~~i~ty~G~~ 230 (349) T protein:vir:78 158 MVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF-IRD------AENNTMFATYQGYR 230 (349) T ss_pred ceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhh-ccC------cccCcccceecCeE Confidence 00 11222457788888888877643 223 467899999999886544221 111 11244578889999 Q ss_pred EEEecceeeccc--------ceeecccccccchhhhcccc--ccccce-eecccceeeeeeeccccceeeeeccccccee Q lcl|Aclame:pro 213 IVESTLIPHGDA--------YLYHPTAFIMATRAPAPPMG--AVRSTA-ISGDQRIAMRWLVDYDSTITSNRSLIDTYFG 281 (392) Q Consensus 213 v~~s~~v~~~~~--------~~~~~~a~~~a~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (392) |.++..+|.... +.+...++.+.......+.. +..... ..+......+......+....+........+ T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~ 310 (349) T protein:vir:78 231 VIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNG 310 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccccccCCc Confidence 999999986532 23333333333222111100 000000 0000011111111111111110000000000 Q ss_pred eeEEE-eeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCc Q lcl|Aclame:pro 282 LKVVE-DPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATD 352 (392) Q Consensus 282 ~~~~~-~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~ 352 (392) ..... .+...-.. .-..-........+.+ +.+.+...+ T Consensus 311 ~~~~~~sPt~aeLa---~~~NW~~v~~~K~I~i------------------------------v~~~~~~~a 349 (349) T protein:vir:78 311 TETIARSASWQDLA---NATNWNRVVDRKHVPI------------------------------AFLVTGVGA 349 (349) T ss_pred cccccCCCChHHhc---CCcCcccccChhhcce------------------------------EEEEeccCC Confidence 00000 00000000 0000000000000000 111111111 No 86 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.18 E-value=5e-07 Score=55.13 Aligned_cols=265 Identities=13% Similarity=0.022 Sum_probs=125.5 Q ss_pred Cc-----cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA-----NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma-----n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. -..+.|+.++.++++.+++...+.+++..- . . .|.++++|+... ..+.. . +++......+ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~a~~---v--~E~~~~~~~~ 204 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG--Q-T---SSSSIEYTVETGFTNNAAA---V--AEGAQKPTSD 204 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee--e-c---cCCceeEEEEecCCCceee---e--ccCccccccc Confidence 11 123789999999999999999988877532 1 1 244577766422 12222 1 2333343344 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccc------cccccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYE------AAGAVHEVAP 145 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--~~~------~~~~~~~~~~ 145 (392) +.-..+.+...+.. .-+.|+++-+. +..++...+.+...++++.++|..++.--... +.+ .......... T Consensus 205 ~~f~~v~~~~~k~~-~~~~is~ell~-ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~ 282 (418) T protein:vir:10 205 LKFNLKNQPVRTIA-HLFKASRQILD-DAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLAN 282 (418) T ss_pred cceeeEEEeeeeEE-EeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccc Confidence 44455555554432 34567776554 44578777888889999999999887421100 111 0011111222 Q ss_pred hhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccc Q lcl|Aclame:pro 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAY 225 (392) Q Consensus 146 ~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~ 225 (392) ...+++++++...+...+.+.. .++++|..+..|.+-. +..|.-.......+..+.+.|++|+.+..+|.+... T Consensus 283 ~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lk-----d~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~ 356 (418) T protein:vir:10 283 ATPIDKIRLALLQAVLAEFPAT-GIVLNPIDWASIELTK-----DSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFL 356 (418) T ss_pred cccHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeccccccCCCceecceeeEEcCCCCCCcEE Confidence 3457888888777765555333 4788999988875421 112221111123455678999999999999876543 Q ss_pred eeeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 226 LYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 226 ~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) ....+. ....... +....+..........+... .....++.+... ..+... T Consensus 357 ~gd~s~~~~~~~~~-----------------~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~---------~a~~~~ 410 (418) T protein:vir:10 357 VGAFSMAAQIFDRM-----------------EIEVLLSTENVDDFEKNMVSIRAEERLALAVYRP---------ESFVTG 410 (418) T ss_pred EeeccceEEEEEec-----------------ceEEEEecccchhhhcCceEEEEEEeeccEEecc---------cceEEE Confidence 322221 1111000 00000000000000000000 000000000000 000000 Q ss_pred eeeeeecc Q lcl|Aclame:pro 303 PGSIEVAP 310 (392) Q Consensus 303 ~~~v~v~~ 310 (392) ....++.+ T Consensus 411 ~~~~~~~g 418 (418) T protein:vir:10 411 ALVEQAGG 418 (418) T ss_pred EeccCCCC Confidence 00000001 No 87 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.17 E-value=6.6e-07 Score=54.46 Aligned_cols=271 Identities=9% Similarity=0.009 Sum_probs=126.5 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. ..-+.|+.+.+++++.+++...+..++.+-. ..+ +..+.+|+.. ...+... + ++..+...+ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~---~~~--~~~~~~~~~~~~~~a~~v---~--Eg~~~~~~~ 78 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQE---MEG--EQEKTVYVQTDGISAYWV---N--ETEKIKTDK 78 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee---cCC--CccEEEEEEcCCceeEEe---e--cCccccccc Confidence 21 1226799999999999999999888876531 122 1234554432 2222222 2 233333334 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc-ccc----ccccccccccchhh Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-APY----EAAGAVHEVAPDEF 148 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~~----~~~~~~~~~~~~~~ 148 (392) +.-..+++...+. +.-+.|+++-+.++..++.+.+.++.++++++++|+.++.--.. .+. .............. T Consensus 79 ~~f~~v~l~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t 157 (297) T protein:vir:95 79 PEVVPVTLKAHKL-GIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPIN 157 (297) T ss_pred cceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccC Confidence 4445555555332 34567888777778889999999999999999999998731110 000 00111111222346 Q ss_pred HHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccce-- Q lcl|Aclame:pro 149 FKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL-- 226 (392) Q Consensus 149 ~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~-- 226 (392) |++++++...|..++.+.. .++++|..+..|.+-. +..| ..+..+..+.+.|.+++.+...+...... T Consensus 158 ~~~i~~~~~~l~~~~~~~~-~~v~~~~~~~~L~~l~-----d~~G----~~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~ 227 (297) T protein:vir:95 158 YDNILKLQDALYDADVEPN-AFVSKIQNRSALREAR-----DGNK----VSIYDKAANTIDGITTVDLKSARFEKGDLLA 227 (297) T ss_pred HHHHHHHHHHhhhccCCcC-EEEEcHHHHHHHHHhh-----ccCC----ceeecCCCCcccceeeEeecCCCCCCceEEE Confidence 8999999888877665433 5788999988876421 1122 23445666788898887665443222211 Q ss_pred eecccccccchhhhccccccccceeecccceeeeeeeccccceeeeeccc----ccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 227 YHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLI----DTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 227 ~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) ...+.............-....+.... ...............+ ....+..+... ..+... T Consensus 228 gd~s~~~~~~~~~~~i~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~---------~a~~~l 291 (297) T protein:vir:95 228 GDFDNLIYGVPYNITYKISEEGQISTI-------TNADGTPINLFEQEMIAIRATMDIAVMITKT---------DAFAKL 291 (297) T ss_pred EecccEEEEEecCeEEEEeeccccccc-------cccCccchhhhhcCcEEEEEEEEeccEeecc---------cceEEE Confidence 111111110000000000000000000 0000000000000000 00001111100 000000 Q ss_pred eeeeeeccc Q lcl|Aclame:pro 303 PGSIEVAPE 311 (392) Q Consensus 303 ~~~v~v~~~ 311 (392) . ..+++ T Consensus 292 ~---~at~~ 297 (297) T protein:vir:95 292 T---PAERV 297 (297) T ss_pred e---ecCCC Confidence 0 01111 No 88 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.17 E-value=8e-07 Score=54.02 Aligned_cols=282 Identities=9% Similarity=-0.016 Sum_probs=133.1 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |.. ..+.|+.|..++++.+++..++..++.+- . -.|.+++||+... ..+.. . +++..+...++.- T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a~~---v--~Eg~~~~~~~~~~ 98 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGAYW---V--GEGQKIETSKATW 98 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcceeE---e--cCCccccccccce Confidence 322 24789999999999999999988876542 2 1255688877532 22222 2 2344444455555 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cccccccccchhhHH Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE------AAGAVHEVAPDEFFK 150 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~------~~~~~~~~~~~~~~~ 150 (392) ..+++...+. ..-+.|+++-+.++..++...+.++.++++++++|..++.--...... .............++ T Consensus 99 ~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~ 177 (324) T protein:vir:78 99 VNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) T ss_pred eEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHH Confidence 6666665443 355678887777777899999999999999999999887322111100 001111122334689 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccc--ceee Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA--YLYH 228 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~--~~~~ 228 (392) +|.++...|..+.... ..++++|..+..|.+-. +..|. ..+..+..+.+.|.+|+.+...+.... +... T Consensus 178 ~i~~~~~~l~~~~~~~-~~~vmn~~~~~~L~~l~-----d~~G~---~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:78 178 NIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHHhhhhccCCC-CEEEEcHHHHHHHHHhh-----ccCCC---eeecCCCCCcccceeeEeeCCCCCCcceEEEEe Confidence 9999988887766533 36889999988876421 12232 123345667889999887655443222 1111 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+....................... +........+..+... .-.....+..+...... ..+.......+ T Consensus 249 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~----~r~~~r~d~~v~~~~A~------~~l~~a~~~~~ 318 (324) T protein:vir:78 249 FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA----LRATMHVALHIADDKAF------AKLVPADKRTD 318 (324) T ss_pred cceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE----EEEEEEEccEEecccce------EEEecccccCC Confidence 1111111110000000000000000 0000000000000000 00000011111110000 00110011111 Q ss_pred eccccc Q lcl|Aclame:pro 308 VAPEAG 313 (392) Q Consensus 308 v~~~~~ 313 (392) .++-.+ T Consensus 319 ~~~~~~ 324 (324) T protein:vir:78 319 SVPGEV 324 (324) T ss_pred CCCCCC Confidence 111111 No 89 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.17 E-value=8e-07 Score=54.02 Aligned_cols=282 Identities=9% Similarity=-0.016 Sum_probs=133.1 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |.. ..+.|+.|..++++.+++..++..++.+- . -.|.+++||+... ..+.. . +++..+...++.- T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~-----~-~~~~~~~~p~~~~~~~a~~---v--~Eg~~~~~~~~~~ 98 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----P-MEGTEKKFTFWADKPGAYW---V--GEGQKIETSKATW 98 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee-----e-ccCCceEEEEEecCcceeE---e--cCCccccccccce Confidence 322 24789999999999999999988876542 2 1255688877532 22222 2 2344444455555 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cccccccccchhhHH Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE------AAGAVHEVAPDEFFK 150 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~------~~~~~~~~~~~~~~~ 150 (392) ..+++...+. ..-+.|+++-+.++..++...+.++.++++++++|..++.--...... .............++ T Consensus 99 ~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~ 177 (324) T protein:vir:96 99 VNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) T ss_pred eEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHH Confidence 6666665443 355678887777777899999999999999999999887322111100 001111122334689 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccc--ceee Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA--YLYH 228 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~--~~~~ 228 (392) +|.++...|..+.... ..++++|..+..|.+-. +..|. ..+..+..+.+.|.+|+.+...+.... +... T Consensus 178 ~i~~~~~~l~~~~~~~-~~~vmn~~~~~~L~~l~-----d~~G~---~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd 248 (324) T protein:vir:96 178 NIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) T ss_pred HHHHHHHhhhhccCCC-CEEEEcHHHHHHHHHhh-----ccCCC---eeecCCCCCcccceeeEeeCCCCCCcceEEEEe Confidence 9999988887766533 36889999988876421 12232 123345667889999887655443222 1111 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+....................... +........+..+... .-.....+..+...... ..+.......+ T Consensus 249 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~----~r~~~r~d~~v~~~~A~------~~l~~a~~~~~ 318 (324) T protein:vir:96 249 FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA----LRATMHVALHIADDKAF------AKLVPADKRTD 318 (324) T ss_pred cceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE----EEEEEEEccEEecccce------EEEecccccCC Confidence 1111111110000000000000000 0000000000000000 00000011111110000 00110011111 Q ss_pred eccccc Q lcl|Aclame:pro 308 VAPEAG 313 (392) Q Consensus 308 v~~~~~ 313 (392) .++-.+ T Consensus 319 ~~~~~~ 324 (324) T protein:vir:96 319 SVPGEV 324 (324) T ss_pred CCCCCC Confidence 111111 No 90 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.16 E-value=7.4e-07 Score=54.19 Aligned_cols=280 Identities=11% Similarity=0.051 Sum_probs=129.4 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-cccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (392) -...+++|+.+..++++.+++..++..++..-. .. +...++++|............+ +.... ...+.-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~v~E--g~~~~~~~~~~~~~i 198 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---VT---NGSGKYPVVRQSEVAALEKVEE--LEENPELAVKPFFQL 198 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhhhhhcceee---cc---CCceeEEEEeecCCccceeccc--cccccccccccceee Confidence 123457899999999999999999888765431 11 1223444442222222222222 22222 112233455 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------cccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE-------AAGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~i 152 (392) ++.+.+. +.-+.|+++-+.++..++...+.++.+++++..+|..++......... .............|++| T Consensus 199 ~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:94 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeheee-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHH Confidence 5555343 244578888777778889899999999999999999887543321110 00111222334568899 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) .++...+...... +-.++++|..+..|.+-. +..|.-.. ....+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~-~~~~vmn~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:94 278 KDAINLNVKPNYE-HNVAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccC-CCEEEEcHHHHHHHHHhh-----ccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccE--- Confidence 9888777666654 335788999998885421 11222110 1234566678999999988877754321100 Q ss_pred ccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) +.++.- ..........+....+.. +.... ..........+.+. .+..-+.+.-. T Consensus 349 i~~gd~--------~~~~~~~~~~~~~v~~~~-~~~~~--~~~r~~~r~d~~~~---------------~~~a~~~~~~~ 402 (415) T protein:vir:94 349 LIIGNL--------KDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEYD 402 (415) T ss_pred EEEEeh--------hccEEEEeecceEEEEec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEEe Confidence 000000 000000000011111100 00000 00000000000000 00000000000 Q ss_pred ccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 312 AGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 312 ~~~~~~~~~~~~~~~t~~~t~ 332 (392) ........+++ .+ T Consensus 403 ~~~~~~~~~~~--------~~ 415 (415) T protein:vir:94 403 DSERGEGDLGL--------EA 415 (415) T ss_pred ccCCCCCcccc--------CC Confidence 01111111111 11 No 91 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.16 E-value=1.1e-06 Score=53.20 Aligned_cols=263 Identities=11% Similarity=0.003 Sum_probs=125.4 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN-----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man-----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~~ 73 (392) |.. ..+.|+.+...+++.+++...+..++.+- . . .|..+++|+-.. ..+... +++..+...+ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~a~~v-----~E~~~~~~~~ 173 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG--R-T---SSNALEYVREEVFTNNADVV-----AEKALKPESD 173 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee--c-c---cCcceEEEEEecCCcceeee-----ccCccccccc Confidence 221 12456667888999999999887776542 1 1 134577765321 122221 2333344445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY--------EAAGAVHEVAP 145 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~--------~~~~~~~~~~~ 145 (392) ++-..+++.+.+. +.-+.|+++-+. +..++...+.++.+++++..+|+.++.--..... ........... T Consensus 174 ~~~~~~~~~~~k~-~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~ 251 (385) T protein:vir:18 174 ITFSKQTANVKTI-AHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATG 251 (385) T ss_pred cceeEEEEeeeeE-EEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc Confidence 5555666666444 344568876444 4456777788888999999999988742111000 00111112233 Q ss_pred hhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccc Q lcl|Aclame:pro 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAY 225 (392) Q Consensus 146 ~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~ 225 (392) ...+++|.++...|........ .++++|..+..|.+.. +..|.-.......|..+.+.|.+|+.+..+|.+... T Consensus 252 ~~~~d~i~~~~~~l~~~~~~~~-~~~~~~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~ 325 (385) T protein:vir:18 252 DTRADIIAHAIYQVTESEFSAS-GIVLNPRDWHNIALLK-----DNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFT 325 (385) T ss_pred cchHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeccCcccCCCceecceeeEEcCcCCCCcEE Confidence 4568889998888876655433 6888999998876432 122221111123455678999999999999866544 Q ss_pred eeeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 226 LYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 226 ~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) ....+. .....+. +...............+.. ......++.+....... T Consensus 326 ~gd~~~~~~~~~~~-----------------~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~----------- 377 (385) T protein:vir:18 326 VGGFDMASQVWDRM-----------------DATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAII----------- 377 (385) T ss_pred EeecccEEEEEEec-----------------ceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE----------- Confidence 322211 1111000 0000000000000000000 00000000000000000 Q ss_pred eeeeeecccc Q lcl|Aclame:pro 303 PGSIEVAPEA 312 (392) Q Consensus 303 ~~~v~v~~~~ 312 (392) .+.+.... T Consensus 378 --~~~~~aa~ 385 (385) T protein:vir:18 378 --KGTFSSGS 385 (385) T ss_pred --EEEeccCC Confidence 00000000 No 92 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.16 E-value=1.1e-06 Score=53.20 Aligned_cols=263 Identities=11% Similarity=0.003 Sum_probs=125.4 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN-----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man-----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~~ 73 (392) |.. ..+.|+.+...+++.+++...+..++.+- . . .|..+++|+-.. ..+... +++..+...+ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~--~-~---~~~~~~~~~~~~~~~~a~~v-----~E~~~~~~~~ 173 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG--R-T---SSNALEYVREEVFTNNADVV-----AEKALKPESD 173 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee--c-c---cCcceEEEEEecCCcceeee-----ccCccccccc Confidence 221 12456667888999999999887776542 1 1 134577765321 122221 2333344445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY--------EAAGAVHEVAP 145 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~--------~~~~~~~~~~~ 145 (392) ++-..+++.+.+. +.-+.|+++-+. +..++...+.++.+++++..+|+.++.--..... ........... T Consensus 174 ~~~~~~~~~~~k~-~~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~ 251 (385) T protein:vir:19 174 ITFSKQTANVKTI-AHWVQASRQVMD-DAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATG 251 (385) T ss_pred cceeEEEEeeeeE-EEeehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc Confidence 5555666666444 344568876444 4456777788888999999999988742111000 00111112233 Q ss_pred hhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccc Q lcl|Aclame:pro 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAY 225 (392) Q Consensus 146 ~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~ 225 (392) ...+++|.++...|........ .++++|..+..|.+.. +..|.-.......|..+.+.|.+|+.+..+|.+... T Consensus 252 ~~~~d~i~~~~~~l~~~~~~~~-~~~~~~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~ 325 (385) T protein:vir:19 252 DTRADIIAHAIYQVTESEFSAS-GIVLNPRDWHNIALLK-----DNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFT 325 (385) T ss_pred cchHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeccCcccCCCceecceeeEEcCcCCCCcEE Confidence 4568889998888876655433 6888999998876432 122221111123455678999999999999866544 Q ss_pred eeeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 226 LYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 226 ~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) ....+. .....+. +...............+.. ......++.+....... T Consensus 326 ~gd~~~~~~~~~~~-----------------~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~----------- 377 (385) T protein:vir:19 326 VGGFDMASQVWDRM-----------------DATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAII----------- 377 (385) T ss_pred EeecccEEEEEEec-----------------ceEEEEeccccchhhcCcEEEEEEEeeccEEecccceE----------- Confidence 322211 1111000 0000000000000000000 00000000000000000 Q ss_pred eeeeeecccc Q lcl|Aclame:pro 303 PGSIEVAPEA 312 (392) Q Consensus 303 ~~~v~v~~~~ 312 (392) .+.+.... T Consensus 378 --~~~~~aa~ 385 (385) T protein:vir:19 378 --KGTFSSGS 385 (385) T ss_pred --EEEeccCC Confidence 00000000 No 93 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.14 E-value=6.6e-07 Score=54.48 Aligned_cols=265 Identities=12% Similarity=0.018 Sum_probs=120.1 Q ss_pred Cc---cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MA---NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Ma---n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) ++ -.++.|+++.+.+.+.+++..++..++.. +....|..+.+|+-.. ..+.+ . +++..+...++.- T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~~~~~~a~~---v--~E~~~~~~~~~~f 183 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGAST-----FTTSDANPMDFTVITGRATAGI---V--GETAEIPESYPAT 183 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhccee-----eecCCCceeEEEEEcCCcceee---e--cccccccccccce Confidence 22 12456777776666666666555444322 1112345577765332 22222 2 2333344444544 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHH-Hhccccc--------cccccccccchh Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM-IVGAPYE--------AAGAVHEVAPDE 147 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~-~~~~~~~--------~~~~~~~~~~~~ 147 (392) ..+.+...+. +.-+.|+++-+.++..++...+.+..+++|++.+|..++.- -.+.|.+ ............ T Consensus 184 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~ 262 (392) T protein:vir:13 184 TQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSK 262 (392) T ss_pred eeEEeeeeeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccc Confidence 5566665443 34456888878878888988899999999999999988731 0011110 001111112234 Q ss_pred hHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecceeecccce Q lcl|Aclame:pro 148 FFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLIPHGDAYL 226 (392) Q Consensus 148 ~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~ 226 (392) .|++++++...|....- .+-.++++|..+..|.+-. +..|.-. ......|..+.+.|++|+.++.+|.+.... T Consensus 263 ~~d~l~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lk-----d~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~ 336 (392) T protein:vir:13 263 VSDALIDLFHEVPSAYR-KNAKFVVNDLRAAQMRKLK-----DANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLF 336 (392) T ss_pred cHHHHHHHHHhhhhhhh-cCCEEEEcHHHHHHHHHhh-----ccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEE Confidence 58888887777754432 2335688999988775321 1122110 012234555689999999999988654332 Q ss_pred eecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 227 YHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 227 ~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ...+.+...... +........................++.+.... .+. T Consensus 337 Gdf~~~~i~~~~-----------------~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~---------A~~------ 384 (392) T protein:vir:13 337 ADLSKYRVRFAG-----------------SLRVDRSVDAKFSTDQIVYRFLQRADGLLVDAR---------GAK------ 384 (392) T ss_pred eeccceeEEeec-----------------ceEEEeeccccccCCcEEEEEEEEeccEEeccc---------ceE------ Confidence 221111111000 000000000000000000000000000000000 000 Q ss_pred eeccccccc Q lcl|Aclame:pro 307 EVAPEAGAN 315 (392) Q Consensus 307 ~v~~~~~~~ 315 (392) +..+.... T Consensus 385 -~~~~~~aa 392 (392) T protein:vir:13 385 -VLTVTPAA 392 (392) T ss_pred -EEEeeccC Confidence 00000000 No 94 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.13 E-value=1.1e-06 Score=53.26 Aligned_cols=279 Identities=11% Similarity=0.066 Sum_probs=129.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-ccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (392) -....++|+.|..++++.+++..++..+++... .. +...++++|............+ +..+.. ..+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~v~E--~~~~~~~~~~~~~~v 198 (415) T protein:vir:81 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---VT---NGSGKYPVVRQSEVAALEKVEE--LEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---cc---CCceeEEEEeecCCccceeecc--ccccCcccccceeeE Confidence 112347899999999999999998877765431 11 2233444443222222222222 222221 12233555 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccc-cc------cccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP-YE------AAGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~-~~------~~~~~~~~~~~~~~~~i 152 (392) ++.+.+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.-..... .. .............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:81 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 5555443 2445788887777778898999999999999999998875442211 10 01111222334568999 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +++...|........ .++++|..+..|.+-. +..|.-.. ....+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:81 278 KDAINLNVKPNYEHN-VAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccCCC-EEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE--- Confidence 998888876666433 5788999998885421 11122110 1234566678999999888877643321100 Q ss_pred ccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) ..+ +... .+......+....+.. +.... ..........+.+. .+..-+.+.. T Consensus 349 ~~~---------Gd~~~~~~~~~~~~~~v~~~~-~~~~~--~~~~~~~r~d~~v~---------------~~~a~~~~~~ 401 (415) T protein:vir:81 349 LII---------GNLKDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEY 401 (415) T ss_pred EEE---------EehhccEEEEeecceEEEEec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEE Confidence 000 0000 0000000011111100 00000 00000000000000 0000000101 Q ss_pred cccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 311 EAGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 311 ~~~~~~~~~~~~~~~~t~~~t~ 332 (392) .........+++ .+ T Consensus 402 ~~~~~~~~~~~~--------~~ 415 (415) T protein:vir:81 402 DDSERGEGDLGL--------EA 415 (415) T ss_pred eccCCCCCcccc--------CC Confidence 111111111111 11 No 95 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.13 E-value=1.1e-06 Score=53.26 Aligned_cols=279 Identities=11% Similarity=0.066 Sum_probs=129.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-ccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (392) -....++|+.|..++++.+++..++..+++... .. +...++++|............+ +..+.. ..+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~v~E--~~~~~~~~~~~~~~v 198 (415) T protein:vir:79 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---VT---NGSGKYPVVRQSEVAALEKVEE--LEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---cc---CCceeEEEEeecCCccceeecc--ccccCcccccceeeE Confidence 112347899999999999999998877765431 11 2233444443222222222222 222221 12233555 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccc-cc------cccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP-YE------AAGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~-~~------~~~~~~~~~~~~~~~~i 152 (392) ++.+.+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.-..... .. .............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:79 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 5555443 2445788887777778898999999999999999998875442211 10 01111222334568999 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +++...|........ .++++|..+..|.+-. +..|.-.. ....+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:79 278 KDAINLNVKPNYEHN-VAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccCCC-EEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE--- Confidence 998888876666433 5788999998885421 11122110 1234566678999999888877643321100 Q ss_pred ccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) ..+ +... .+......+....+.. +.... ..........+.+. .+..-+.+.. T Consensus 349 ~~~---------Gd~~~~~~~~~~~~~~v~~~~-~~~~~--~~~~~~~r~d~~v~---------------~~~a~~~~~~ 401 (415) T protein:vir:79 349 LII---------GNLKDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEY 401 (415) T ss_pred EEE---------EehhccEEEEeecceEEEEec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEE Confidence 000 0000 0000000011111100 00000 00000000000000 0000000101 Q ss_pred cccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 311 EAGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 311 ~~~~~~~~~~~~~~~~t~~~t~ 332 (392) .........+++ .+ T Consensus 402 ~~~~~~~~~~~~--------~~ 415 (415) T protein:vir:79 402 DDSERGEGDLGL--------EA 415 (415) T ss_pred eccCCCCCcccc--------CC Confidence 111111111111 11 No 96 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.13 E-value=1.1e-06 Score=53.26 Aligned_cols=279 Identities=11% Similarity=0.066 Sum_probs=129.6 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-ccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 79 (392) -....++|+.|..++++.+++..++..+++... .. +...++++|............+ +..+.. ..+.-..+ T Consensus 127 ~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~v~E--~~~~~~~~~~~~~~v 198 (415) T protein:vir:98 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR---VT---NGSGKYPVVRQSEVAALEKVEE--LEENPELAVKPFFQL 198 (415) T ss_pred cccccccchHHHHHHHHHHHhhhhhhhheeeee---cc---CCceeEEEEeecCCccceeecc--ccccCcccccceeeE Confidence 112347899999999999999998877765431 11 2233444443222222222222 222221 12233555 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccc-cc------cccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP-YE------AAGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~-~~------~~~~~~~~~~~~~~~~i 152 (392) ++.+.+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.-..... .. .............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:98 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHH Confidence 5555443 2445788887777778898999999999999999998875442211 10 01111222334568999 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +++...|........ .++++|..+..|.+-. +..|.-.. ....+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:98 278 KDAINLNVKPNYEHN-VAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccCCC-EEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCCceecceeeEEecccccCCCCccE--- Confidence 998888876666433 5788999998885421 11122110 1234566678999999888877643321100 Q ss_pred ccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) ..+ +... .+......+....+.. +.... ..........+.+. .+..-+.+.. T Consensus 349 ~~~---------Gd~~~~~~~~~~~~~~v~~~~-~~~~~--~~~~~~~r~d~~v~---------------~~~a~~~~~~ 401 (415) T protein:vir:98 349 LII---------GNLKDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEY 401 (415) T ss_pred EEE---------EehhccEEEEeecceEEEEec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEE Confidence 000 0000 0000000011111100 00000 00000000000000 0000000101 Q ss_pred cccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 311 EAGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 311 ~~~~~~~~~~~~~~~~t~~~t~ 332 (392) .........+++ .+ T Consensus 402 ~~~~~~~~~~~~--------~~ 415 (415) T protein:vir:98 402 DDSERGEGDLGL--------EA 415 (415) T ss_pred eccCCCCCcccc--------CC Confidence 111111111111 11 No 97 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.13 E-value=1.3e-06 Score=52.81 Aligned_cols=283 Identities=10% Similarity=0.002 Sum_probs=127.4 Q ss_pred Cc---------c------------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecccee-eecc Q lcl|Aclame:pro 1 MA---------N------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTR 58 (392) Q Consensus 1 Ma---------n------------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~-~~~~ 58 (392) || . .-+.|+.|+.++++.+++...+.+++.+- .. .+..++||+..... +... T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~---~~---~~~~~~ip~~~~~~~a~~v 74 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI---PI---SYGETIIPTTVKRPEVGQV 74 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhccee---ec---cCCceEEEEEecCccceee Confidence 11 1 11689999999999999999988877432 11 24568887743211 0000 Q ss_pred ---ccccccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--- Q lcl|Aclame:pro 59 ---KLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--- 132 (392) Q Consensus 59 ---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--- 132 (392) ...-.+++......++.-..+++...+. +.-+.|+++-+.++..++...+.++.++++++++|..++.--... T Consensus 75 ~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 153 (338) T protein:vir:78 75 GVGTSNEQREGGTKPLSGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGS 153 (338) T ss_pred cccccccccccccccccccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Confidence 0000122333433444445555555332 345568888777888899999999999999999999887421110 Q ss_pred -----ccc-cc-----cccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeE Q lcl|Aclame:pro 133 -----PYE-AA-----GAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSAL 200 (392) Q Consensus 133 -----~~~-~~-----~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~ 200 (392) ... .. ...........|+.+.++...+..+.--....++++|..+..|.+...+... .|.-. .... T Consensus 154 ~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~--~g~~l~~~~~ 231 (338) T protein:vir:78 154 ALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDA--NGNVDPTRIN 231 (338) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccC--CCceeecccc Confidence 000 00 0011112234577787776665433222334688999998877543322111 11110 0112 Q ss_pred eeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccccccccceeecc-cceeeeeeecccc----cee----- Q lcl|Aclame:pro 201 QEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDS----TIT----- 270 (392) Q Consensus 201 ~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~----~~~----- 270 (392) ..|..+.+.|++|+.++.+|.............+. +.... ...+. .+........... ... T Consensus 232 ~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~--------gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 302 (338) T protein:vir:78 232 LAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVG--------GDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVS 302 (338) T ss_pred cCCCCceeeeeeEEEccccCccccccCCcccEEEE--------Eecce-EEEEeecccEEEEeecccccccccccccchh Confidence 34556789999999999887543211100000000 00000 00000 0000000000000 000 Q ss_pred -eeeccc----ccceeeeEEEeeccccceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 271 -SNRSLI----DTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 271 -~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) .....+ ....+..+.......... ....+ .. T Consensus 303 ~~~~~~~~~r~~~r~d~~v~~~~a~~~l~---~~~~~-----------~~ 338 (338) T protein:vir:78 303 MWQTNQIAILIEVTFGWLLGDKQAFVKFV---DDEDP-----------DA 338 (338) T ss_pred hhhcCcEEEEEEEEeccEeecccceEEEe---cccCC-----------CC Confidence 000000 000000000000000000 00000 00 No 98 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.12 E-value=8.5e-07 Score=53.86 Aligned_cols=283 Identities=10% Similarity=0.007 Sum_probs=131.0 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+- ..+.|+.|+.++++.+++...+..++..- .- .|.+++||+.... ...... +++......++.-. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~-----~~-~~~~~~~p~~~~~--~~a~~v--~Eg~~~~~~~~~~~ 99 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYE-----PM-EGTEKKFTFWADK--PGAYWV--GEGQKIETSKATWV 99 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee-----ec-cCCceEEEEEecC--cceeEe--ccCcccccccccee Confidence 322 23679999999999999999887776432 11 2456888774321 112222 23444444555556 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cccccccccchhhHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE------AAGAVHEVAPDEFFKG 151 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 151 (392) .+++...+. +.-+.|+++-+.++..++.+.+.++.++++++++|+.++.--...+.. .............+++ T Consensus 100 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:99 100 NATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHH Confidence 666665443 355678888777778899999999999999999999887422111100 0111111223456899 Q ss_pred HHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccccee--ec Q lcl|Aclame:pro 152 VNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLY--HP 229 (392) Q Consensus 152 i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~--~~ 229 (392) ++++...|..+..... .++++|..+..|.+-. +..|. ..+..+..+.+.|.+|+.+...+......+ .. T Consensus 179 i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~-----d~~g~---~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~ 249 (324) T protein:vir:99 179 IIDLEALLEDDELEAN-AFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhh-----cCCCc---eeecCCCCccccceeEEeecCCCCCcceEEEEec Confidence 9999888877665333 5789999998876421 11222 112334456788999888766554332211 11 Q ss_pred ccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) +.............-......... ..+......+..+... .-.....+..+..... + ..+.......+. T Consensus 250 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~----~r~~~r~d~~v~~~~a---~---~~lt~a~~~~~~ 319 (324) T protein:vir:99 250 DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA----LRATMHVALHIADDKA---F---AKLVPADKKTDS 319 (324) T ss_pred ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE----EEEEEEEccEEecccc---e---EEEEeccCCCCC Confidence 111110000000000000000000 0000000000000000 0000000000000000 0 000000000000 Q ss_pred ccccc Q lcl|Aclame:pro 309 APEAG 313 (392) Q Consensus 309 ~~~~~ 313 (392) ++-.+ T Consensus 320 ~~~~~ 324 (324) T protein:vir:99 320 VPGEV 324 (324) T ss_pred CCCCC Confidence 00000 No 99 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.10 E-value=1.7e-06 Score=52.27 Aligned_cols=301 Identities=10% Similarity=0.024 Sum_probs=134.2 Q ss_pred Cc---cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MA---NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Ma---n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) +. -..++|+.+...+++.+++...+.+++..- .- .+.++++|++...........+ ++..+.-.++.-. T Consensus 117 ~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~--E~~~~~~s~~~f~ 188 (421) T protein:vir:13 117 MSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI-----PV-NRNAGKMPVRAGASVDKLANLA--KDTELVKAMLKTQ 188 (421) T ss_pred cccCCcceecchhhHHHHHHHHHhhhhhhhhceee-----ec-cCCceEEEEeecCCccceeecc--cccccccccccee Confidence 11 134789999999999999998887776532 11 1345777765443333232222 2333333344445 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) .+++.+.+. +.-+.|+++-+..+..++...+.+..+++++..+|..++....+.... .....|++|+++.. T Consensus 189 ~i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~--------~~~~~~d~i~~~~~ 259 (421) T protein:vir:13 189 PMAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLAE--------ETINDYAGLVKTIN 259 (421) T ss_pred EEEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcccc--------ccccchHHHHHHHH Confidence 566665443 345578888777777888888999999999999999888755443211 11234788888888 Q ss_pred HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccch Q lcl|Aclame:pro 158 ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATR 237 (392) Q Consensus 158 ~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~ 237 (392) .|..+..+.. .++++|..+..|.+-. +..|.-.......|..+.+.|+.|+.+..++....... .+.++ T Consensus 260 ~l~~~~~~~a-~~v~n~~~~~~l~~lk-----d~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~---~~~~g-- 328 (421) T protein:vir:13 260 SLVPNARKRA-IIVTNSDGRAYLDGLM-----DKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDET---KFIVS-- 328 (421) T ss_pred HhhhhhcCCC-EEEEcHHHHHHHHHhh-----cCCCceeecCcCCCCCceecceeeEEeccccccCCCce---EEEEE-- Confidence 8876655433 5788999988876421 11222111112245556899999998887764332100 00000 Q ss_pred hhhcccccccc-ceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccccccc Q lcl|Aclame:pro 238 APAPPMGAVRS-TAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 238 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~ 316 (392) .... +......+....+.............-.-...+............ ........+.......... T Consensus 329 -------d~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~----~~~~~~a~v~~~~~~~~~~ 397 (421) T protein:vir:13 329 -------DFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAE----KIRKFGVIVKLQEVLKSSP 397 (421) T ss_pred -------eccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhhee----eecccceeeccccccCCCC Confidence 0000 000000001110000000000000000000000000000000000 0000000000000000000 Q ss_pred eeeeeeccCeeEEEEEeecCcccccceEEEEEcCC Q lcl|Aclame:pro 317 TITAAAGEDHTVQLKVTDANGDDVTALCDFESSAT 351 (392) Q Consensus 317 ~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~ 351 (392) ..--+...+. -+++..+.... .|+ T Consensus 398 ~~~~~~~~~~-~~~~~~~~~~~----------~~~ 421 (421) T protein:vir:13 398 RSGKNKNESK-EEIKEEGEATQ----------QNE 421 (421) T ss_pred cCCCCccccc-hheeecccccc----------CCC Confidence 0000111111 12233222111 111 No 100 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.06 E-value=1.5e-06 Score=52.55 Aligned_cols=279 Identities=11% Similarity=0.064 Sum_probs=128.0 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-cccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (392) -....+.|+.|..++++.+++..++..++..-.- . +.+.++|++............ ++.... .+.+.-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~v~--Eg~~~~~~~~~~~~~v 198 (415) T protein:vir:46 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV---T---NGSGKYPVVRQSEVAALEKVE--ELEENPELAVKPFFQL 198 (415) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhhcceeec---c---CCceeEEEEEecCCcceeecc--cccccccccccceeeE Confidence 1223478999999999999999998777643211 1 123444444221111111222 222222 122233455 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc------ccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA-PYEA------AGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~-~~~~------~~~~~~~~~~~~~~~i 152 (392) ++...+. +.-+.|+.+-+.++..++...+.+..+++|++.+|..++.-.... +... ............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:46 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 5555333 345678887777778889999999999999999999887543221 1100 0111122333568889 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +++...+...... +-.++++|..+..|.+-. +..|.-.. ..+.+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~-~~~~v~n~~~~~~L~~lk-----d~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:46 278 KDAINLNVKPNYE-HNVAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccC-CCEEEEcHHHHHHHHHhh-----ccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE--- Confidence 9888777666553 336789999998885421 11222110 1234566678999999988777643321100 Q ss_pred ccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.++ ... ........+....+.. +.... .........++.+. ....-+.++. T Consensus 349 ~~~g---------d~~~~~~~~~~~~~~v~~~~-~~~~~--~~~~~~~r~d~~v~---------------~~~a~~~~~~ 401 (415) T protein:vir:46 349 LIIG---------NLKDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEY 401 (415) T ss_pred EEEE---------ehhccEEEEeecceEEEeec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEe Confidence 0000 000 0000000011111100 00000 00000000000000 0000000000 Q ss_pred cccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 311 EAGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 311 ~~~~~~~~~~~~~~~~t~~~t~ 332 (392) .........+++ .+ T Consensus 402 ~~~~~~~~~~~~--------~~ 415 (415) T protein:vir:46 402 DDSERGEGDLGL--------EA 415 (415) T ss_pred eccCCCCCCccC--------CC Confidence 000111111111 11 No 101 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.06 E-value=1.5e-06 Score=52.55 Aligned_cols=279 Identities=11% Similarity=0.064 Sum_probs=128.0 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-cccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (392) -....+.|+.|..++++.+++..++..++..-.- . +.+.++|++............ ++.... .+.+.-..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~---~---~~~~~~~~~~~~~~~~~~~v~--Eg~~~~~~~~~~~~~v 198 (415) T protein:vir:47 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRV---T---NGSGKYPVVRQSEVAALEKVE--ELEENPELAVKPFFQL 198 (415) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhhcceeec---c---CCceeEEEEEecCCcceeecc--cccccccccccceeeE Confidence 1223478999999999999999998777643211 1 123444444221111111222 222222 122233455 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccc------ccccccccchhhHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA-PYEA------AGAVHEVAPDEFFKGV 152 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~-~~~~------~~~~~~~~~~~~~~~i 152 (392) ++...+. +.-+.|+.+-+.++..++...+.+..+++|++.+|..++.-.... +... ............|++| T Consensus 199 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:47 199 AYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred Eeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHH Confidence 5555333 345678887777778889999999999999999999887543221 1100 0111122333568889 Q ss_pred HHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 153 NGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 153 ~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) +++...+...... +-.++++|..+..|.+-. +..|.-.. ..+.+|..+.+.|++|+.+..+|........ T Consensus 278 ~~~~~~~~~~~~~-~~~~v~n~~~~~~L~~lk-----d~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~--- 348 (415) T protein:vir:47 278 KDAINLNVKPNYE-HNVAIVSQTMFAKLDKMK-----DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT--- 348 (415) T ss_pred HHHHHhhhhhccC-CCEEEEcHHHHHHHHHhh-----ccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE--- Confidence 9888777666553 336789999998885421 11222110 1234566678999999988777643321100 Q ss_pred ccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.++ ... ........+....+.. +.... .........++.+. ....-+.++. T Consensus 349 ~~~g---------d~~~~~~~~~~~~~~v~~~~-~~~~~--~~~~~~~r~d~~v~---------------~~~a~~~~~~ 401 (415) T protein:vir:47 349 LIIG---------NLKDAIVLFDRSQYQASWTD-YMHFG--ECLMIAVRQDCRIL---------------DYKSAIVIEY 401 (415) T ss_pred EEEE---------ehhccEEEEeecceEEEeec-cccCc--eEEEEEEEeccEEe---------------ccccEEEEEe Confidence 0000 000 0000000011111100 00000 00000000000000 0000000000 Q ss_pred cccccceeeeeeccCeeEEEEE Q lcl|Aclame:pro 311 EAGANATITAAAGEDHTVQLKV 332 (392) Q Consensus 311 ~~~~~~~~~~~~~~~~t~~~t~ 332 (392) .........+++ .+ T Consensus 402 ~~~~~~~~~~~~--------~~ 415 (415) T protein:vir:47 402 DDSERGEGDLGL--------EA 415 (415) T ss_pred eccCCCCCCccC--------CC Confidence 000111111111 11 No 102 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.05 E-value=2.2e-06 Score=51.63 Aligned_cols=259 Identities=12% Similarity=-0.005 Sum_probs=125.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~ 72 (392) |. .-.+.|+.+...+++.+++...+.+++..- .- .+.++++|+... ..+.. . +++...... T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~-----~~-~~~~~~~~~~~~~~~~a~~---v--~Eg~~~~~~ 181 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG-----RT-DSALIEYVQETGFVNNAAI---V--AEGALKPES 181 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee-----ec-cCCceEEEEEecCCcceee---e--cCCcccccc Confidence 11 122566777788999999999887776432 21 244577766432 12222 2 233334444 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccc------ccccccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYE------AAGAVHEVA 144 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--~~~------~~~~~~~~~ 144 (392) ++.-..+++...+. +.-+.|+++-+.. ..++...+.++.+++++.++|+.++.--... +.+ ........+ T Consensus 182 ~~~~~~i~~~~~k~-~~~~~is~ell~d-s~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~ 259 (390) T protein:vir:97 182 SLKFAKKTDTTHVI-AHTMKATRQILSD-APQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIA 259 (390) T ss_pred ccceeEEEEeeeeE-EEeehhhHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccccccccc Confidence 55556666666543 3455688765544 4567777888899999999999887421000 000 001111122 Q ss_pred chhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccc Q lcl|Aclame:pro 145 PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA 224 (392) Q Consensus 145 ~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~ 224 (392) ....++++.++...|.....+.. .++++|..+..|.+-. +..|.-.......+..+++.|.+|+.+..+|.+.. T Consensus 260 ~~~~~d~~~~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~ 333 (390) T protein:vir:97 260 GATRVDQLRLAMLQASLAEYPAS-GIVINPIDWAAIELAK-----DANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEF 333 (390) T ss_pred ccchHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeecCccCCCCceecceeeEEcCCCCCCcE Confidence 34457788888888877777544 5788999988886422 11222111111234456899999999999886553 Q ss_pred ceeecc-cccccchhhhccccccccceeecccceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeeccc Q lcl|Aclame:pro 225 YLYHPT-AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHL 301 (392) Q Consensus 225 ~~~~~~-a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (392) .....+ +.....+ .+....+....+. ...+. .-.....+..+...... T Consensus 334 ~~gd~~~~~~~~~~-----------------~~~~i~~~~~~~~-f~~~~~~~r~~~r~d~~v~~~~a~----------- 384 (390) T protein:vir:97 334 LVGAFDLAAQIFDQ-----------------WDARVEIGYVNDD-FQRNMVTVLAEERLALVVYRPEAL----------- 384 (390) T ss_pred EEEeccceEEEEEe-----------------cceEEEEeecccc-cccCcEEEEEEEeeccEEeccccE----------- Confidence 322211 1111000 0111110000000 00000 00000001100100000 Q ss_pred eeeeeeecccccc Q lcl|Aclame:pro 302 IPGSIEVAPEAGA 314 (392) Q Consensus 302 ~~~~v~v~~~~~~ 314 (392) ..+.++ T Consensus 385 -------v~~~~a 390 (390) T protein:vir:97 385 -------ITGSFA 390 (390) T ss_pred -------EEEEeC Confidence 000001 No 103 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.02 E-value=1.6e-06 Score=52.32 Aligned_cols=275 Identities=9% Similarity=0.032 Sum_probs=122.7 Q ss_pred Ccc----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceee-eccccccccCCCccc-cccc Q lcl|Aclame:pro 1 MAN----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRG-HTRKLRGAGAERNLT-VSDF 74 (392) Q Consensus 1 Man----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~-~~~~~~~~~~~~~~~-~~~~ 74 (392) ++. -+++|+.|..++++.+++..++..++++.. .. +.+.+++.+..... .......+ +.... .+.+ T Consensus 111 ~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~~v~E--~~~~~~~~~~ 182 (397) T protein:vir:48 111 DASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVEN---VT---TLTGSRVYEKWADITGLAKLDDE--AGSIGTNDDP 182 (397) T ss_pred ccCCccccccccHHHHHHHHHHHHHHHHHHhhhceee---cc---CCcceEEEEeecCCCcceeeecc--cccccccccc Confidence 221 347899999999999999999888775532 22 22333333322111 11111222 22221 1223 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNG 154 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 154 (392) .-..+++...+. +.-+.|+++-+.++..++...+.++..++++..+|..++.-... .........+++|++ T Consensus 183 ~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~--------~~~~~~~~~~d~i~~ 253 (397) T protein:vir:48 183 KLYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAT--------LPTKPTLTKWDDIID 253 (397) T ss_pred ceeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccccHHHHHH Confidence 345555555443 34457888888888889999999999999999999988753211 111223356888999 Q ss_pred HHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecc--eeecccceeeccc Q lcl|Aclame:pro 155 ARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTL--IPHGDAYLYHPTA 231 (392) Q Consensus 155 a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~~~~~a 231 (392) +...|..+..+. -.++++|..+..|.+-. +..|.-.. ..+..|..+.+.|++|+.... ++...... .. T Consensus 254 ~~~~l~~~~~~~-a~~v~n~~~~~~L~~lk-----d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~--~~- 324 (397) T protein:vir:48 254 LQAKVDPAIKQT-SFFLTNTSGFTALKKVK-----NAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGA--MP- 324 (397) T ss_pred HHHHhhhhhcCC-CEEEECHHHHHHHHHhh-----cCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCc--eE- Confidence 888887665543 46788999998886431 11222110 113355667899999876532 22111100 00 Q ss_pred ccccchhhhccccccccceeecc-cceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) +.++ .......... .+...............+.. ......++.+....... ..........+..... T Consensus 325 ~~~g---------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~-~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:48 325 LYFG---------DLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFV-PASFKAIADQKGNLGS 394 (397) T ss_pred EEEE---------eccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceE-EEEecccccCCCCccc Confidence 0000 0000000000 00000000000000000000 00000000000000000 0000000000000000 Q ss_pred ccc Q lcl|Aclame:pro 309 APE 311 (392) Q Consensus 309 ~~~ 311 (392) +.+ T Consensus 395 ~~~ 397 (397) T protein:vir:48 395 TAV 397 (397) T ss_pred cCC Confidence 000 No 104 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.00 E-value=2.5e-06 Score=51.33 Aligned_cols=283 Identities=10% Similarity=-0.004 Sum_probs=130.7 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+- ..+.|+.|+.++++.+++...+..++..- . . .+.+++||+.... ...... +++......++.-. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~--~-~---~~~~~~~p~~~~~--~~a~~v--~Eg~~~~~~~~~~~ 99 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE--P-M---EGTEKKFTFWADK--PGAYWV--GEGQKIETSKATWV 99 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee--e-c---cCCceEEEEEeCC--cceeEe--ccCcccccccccee Confidence 322 23779999999999999999887776432 1 1 2456888775321 112222 23444444455556 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc------cccccccccchhhHHH Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE------AAGAVHEVAPDEFFKG 151 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 151 (392) .+++...+. +.-+.|+.+-+.++..++...+.++..+++++++|+.++.--...+.. .............+++ T Consensus 100 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:10 100 NATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred EEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHH Confidence 666655343 345678887777777899999999999999999999887422111100 0111111223456899 Q ss_pred HHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccccee--ec Q lcl|Aclame:pro 152 VNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLY--HP 229 (392) Q Consensus 152 i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~--~~ 229 (392) +.++...|..+..... .++++|..+..|.+-. +..|. ..+..+..+.+.|.+|+.+...+......+ .. T Consensus 179 i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~-----d~~g~---~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~ 249 (324) T protein:vir:10 179 IIDLEALLEDDELEAN-AFISKTQNRSLLRKIV-----DPETK---ERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDF 249 (324) T ss_pred HHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhh-----ccCCc---eeecCCCCccccceeEEeecCCCCCcceEEEEec Confidence 9999888876655333 5788999998876421 11222 122344456789999887765543332211 11 Q ss_pred ccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) +..................+.... +...........+.... -.....+..+....... .+.......+. T Consensus 250 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----r~~~r~d~~v~~~~A~~------~l~~a~~~~~~ 319 (324) T protein:vir:10 250 DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL----RATMHVALHIADDKAFA------KLVPADKKTDS 319 (324) T ss_pred ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEE----EEEEEEccEEecccceE------EEEeccCCCCC Confidence 111111100000000000000000 00000000000000000 00000111111100000 00000000011 Q ss_pred ccccc Q lcl|Aclame:pro 309 APEAG 313 (392) Q Consensus 309 ~~~~~ 313 (392) ++-.+ T Consensus 320 ~~~~~ 324 (324) T protein:vir:10 320 VPGEV 324 (324) T ss_pred CCCCC Confidence 11111 No 105 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.99 E-value=2.7e-06 Score=51.09 Aligned_cols=293 Identities=13% Similarity=0.064 Sum_probs=126.8 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) -+-.++.|+.+++++++.|+++.++..+..+-... +...+| .++||+-. ...+.+. +++......++.=..+ T Consensus 344 ~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~-~~~~~~-~~~ip~~t~~~~a~wv-----~Eg~~~~~s~~~f~~v 416 (645) T protein:vir:93 344 WAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPA-LRQVPF-NIRVHAQVSGGAAGWV-----GEGKTKPLTKFDFESI 416 (645) T ss_pred ccCCccCchhhHHHHHHhhhhhhhHHhhccccccc-cccccC-ceeeeeeecCcceEEe-----ccCccccccccceeEE Confidence 11346889999999999999998887664332211 122223 36666532 2222222 2333444444444555 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc-----ccccccc-cccccchhhHHHHH Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA-----PYEAAGA-VHEVAPDEFFKGVN 153 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~-----~~~~~~~-~~~~~~~~~~~~i~ 153 (392) ++...+ .+.-+.|+.+-+.++..++...+.+...++|+.++|..++.--... +...... .........+.++. T Consensus 417 ~l~~~k-la~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~ 495 (645) T protein:vir:93 417 TFSHAK-VSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAE 495 (645) T ss_pred EEeeEE-EEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHH Confidence 555432 2334457776667777788888889999999999999887421111 1111100 11112233456777 Q ss_pred HHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccc Q lcl|Aclame:pro 154 GARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAF 232 (392) Q Consensus 154 ~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~ 232 (392) .+...|..+++. .+-.++++|..+..|.+-.. ..|.-.... ....-+.+.|+.|+.++.+|.+.. ....+.. T Consensus 496 ~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd-----~~G~~~~~~-~~~~~~tL~G~PV~~s~~vp~~~~-~gd~s~~ 568 (645) T protein:vir:93 496 AAFGQFVAANLQPTGAVWLMSSTNALALSMRKN-----ALGQKEYPD-MTLLGGSFQGLPVIVSQYVGDQLV-LVNAPDI 568 (645) T ss_pred HHHHHHHhcCCCccccEEEEcHHHHHHHHhccc-----cCCceeecC-CCCCCceeeceeeEEeccCCccee-EeccccE Confidence 776777767664 45678899999888865321 122211000 011125799999999999875321 1111111 Q ss_pred cccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeE-EEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 233 IMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKV-VEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 233 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) .++.............+.. +......+.........++.+..-.+ ...... ..-.+..+...+.++.+ T Consensus 569 ~ig~~~~v~i~~s~~a~~~-------~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r----~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 569 YLADDGGVAVDMSREASLE-------MQSEPTGDSTTPSPVELVSMFQTGSVAIRAERW----INWRRRRTAAVAVITGV 637 (645) T ss_pred EEEEecceEEEeecceeEE-------EeecccccccccccccchhHhhcCceEEEEEEE----EcceeeCccceEEEecc Confidence 1110000000000000000 00000000000000000000000000 000000 00000011111111122 Q ss_pred ccccceee Q lcl|Aclame:pro 312 AGANATIT 319 (392) Q Consensus 312 ~~~~~~~~ 319 (392) ..-....- T Consensus 638 ~~g~~~~~ 645 (645) T protein:vir:93 638 NYGSASGG 645 (645) T ss_pred cCCcccCC Confidence 11111100 No 106 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=97.99 E-value=4.2e-06 Score=50.09 Aligned_cols=281 Identities=10% Similarity=0.000 Sum_probs=124.2 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |+.+ -+.|+.++.++++.+++...+..++.+- .- .+.+++||+... ..+.. .+ ++..+...+ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~-----~~-~~~~~~~p~~~~~~~a~~---v~--E~~~~~~~~ 82 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-----PM-GTTGQKIPHWIGDVSAQW---IG--EGDMKPITK 82 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhccee-----ec-cCCceEEEEEeCCcceEE---ec--CCccccccc Confidence 3332 1568888999999999999887776542 11 245688877532 22222 22 333444445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh-ccc-------cccccc-ccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAP-------YEAAGA-VHEVA 144 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~-~~~-------~~~~~~-~~~~~ 144 (392) ++-..+++...+. +.-+.|+++-+.++..++.+.+.++..+++++++|+.++.--. ..+ ...... ..... T Consensus 83 ~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 161 (320) T protein:vir:10 83 GNMTSQNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGAT 161 (320) T ss_pred cceeEEEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceeccccc Confidence 5555555555442 3556788888888888999999999999999999998873111 000 000000 00111 Q ss_pred ch--hhHH-HHHHHHHHhhhccCCCCCEEEEchHHHHHhhc--cc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEec Q lcl|Aclame:pro 145 PD--EFFK-GVNGARRALNELYIPQGRVLVVGTAVTEQILN--DD--RFIKYESQGQSAVSALQEARLGRIYGYEIVEST 217 (392) Q Consensus 145 ~~--~~~~-~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~--~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~ 217 (392) .+ ..++ .+.++...+..... ....++++|..+..|.+ |. ++.-.......... ...-+++.|++++.++ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~---~~~~~~i~g~pv~~~~ 237 (320) T protein:vir:10 162 ASDLTAYDAVAVNGLSLLVNAKK-KWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENS---PFRAGRIVSRPTILSD 237 (320) T ss_pred ccccccHHHHHHHHHhhhhcccC-CCcEEEEcHHHHHHHHHhhccCCceeeccccccCccc---cccCceeeeeeeEecC Confidence 11 1122 34444444444333 34478899999998854 21 11111111000001 1123478999999999 Q ss_pred ceeecccceee--cccccccchhhhccccccccceeecccc-eeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 218 LIPHGDAYLYH--PTAFIMATRAPAPPMGAVRSTAISGDQR-IAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 218 ~v~~~~~~~~~--~~a~~~a~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .+|.+....+. .+...+.........-....+...+... .........+... .-.....+..+.... .+. T Consensus 238 ~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~----~r~~~~~d~~v~~~~---a~~ 310 (320) T protein:vir:10 238 HVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVA----VRVEAEYAFHNNDKD---AFV 310 (320) T ss_pred CCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEE----EEEEEeeccEEeccc---ceE Confidence 88765532221 1111111111100000000000000000 0000000000000 000000111111100 000 Q ss_pred eeeeccceee Q lcl|Aclame:pro 295 RARKIHLIPG 304 (392) Q Consensus 295 ~~~~~~~~~~ 304 (392) .......++. T Consensus 311 ~l~~~~ap~~ 320 (320) T protein:vir:10 311 KLTNVVTPDA 320 (320) T ss_pred EEEeccCCCC Confidence 0000110000 No 107 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=97.98 E-value=3.7e-06 Score=50.35 Aligned_cols=261 Identities=11% Similarity=-0.021 Sum_probs=122.7 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |. ...+.|+.+...+++.+++...+..++.+- .. .+.++++|+..... ......+ ++......++ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~a~~v~--Eg~~~~~~~~ 183 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG-----RT-DSALIEYVQETGFV-NNAAIVA--EGALKPESSL 183 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhccee-----ec-cCCceEEEEEecCC-cceeeec--CCcccccccc Confidence 11 111344456677999999999888876532 11 24456776543211 1112222 2333444444 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccc----c--ccccccccch Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYE----A--AGAVHEVAPD 146 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--~~~----~--~~~~~~~~~~ 146 (392) .-..+++.+.+. +.-+.|+++-+... .++...+.++.++++++++|+.++.--... +.+ . .......... T Consensus 184 ~~~~i~~~~~k~-~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:81 184 KFAKKTDTTHVI-AHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred eeeEEEEeeeEE-EEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccc Confidence 445666666444 34556777655444 567777788899999999999887421100 000 0 0111122334 Q ss_pred hhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccce Q lcl|Aclame:pro 147 EFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL 226 (392) Q Consensus 147 ~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~ 226 (392) ..++++.++...|...+.+.. .++++|..+..|.+-. +..|.-.......+..+.+.|++|+.++.+|.+.... T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~ 335 (390) T protein:vir:81 262 TRVDQLRLAMLQASLAEYNPS-GIVINPIDWAAIELAK-----DANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLV 335 (390) T ss_pred hhHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeecCcccccCceecceeeEEcCCCCCCcEEE Confidence 567888888888877766544 5788999998876421 1112111011123334588999999999998665433 Q ss_pred eeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeecccee Q lcl|Aclame:pro 227 YHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) Q Consensus 227 ~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (392) ...+. .....+ .+....+...... ...+.. ......+..+....... T Consensus 336 gd~~~~~~~~~~-----------------~~~~v~~~~~~~~-~~~~~v~~r~~~r~d~~v~~~~a~v------------ 385 (390) T protein:vir:81 336 GAFDLAAQIFDQ-----------------WDARVEIGYVGED-FQRNMITVLAEERLALVVYRPEALI------------ 385 (390) T ss_pred EehhceEEEEEe-----------------cceEEEEecccch-hhcCcEEEEEEEeeccEEecccceE------------ Confidence 22211 111000 0011100000000 000000 00000000000000000 Q ss_pred eeeeecccccc Q lcl|Aclame:pro 304 GSIEVAPEAGA 314 (392) Q Consensus 304 ~~v~v~~~~~~ 314 (392) .++++ T Consensus 386 ------~~t~a 390 (390) T protein:vir:81 386 ------SGSFA 390 (390) T ss_pred ------EEEeC Confidence 00000 No 108 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.96 E-value=3.1e-06 Score=50.79 Aligned_cols=309 Identities=8% Similarity=-0.046 Sum_probs=143.3 Q ss_pred Cccc----cccHH--HHHHHHHHHHHHhhccc--ceeeeccccccc---CCCCCeEEEEeccceeee-ccccccccCCCc Q lcl|Aclame:pro 1 MANA----FSKPT--AVVDTAIQMLQNELILT--NLVWLNGIGDFA---HKFNDTITVRVPAPSRGH-TRKLRGAGAERN 68 (392) Q Consensus 1 Man~----~~~~~--~~~~~~~~~l~~~l~~~--~~v~~~~~~~~~---~~~Gdtv~i~~~~~~~~~-~~~~~~~~~~~~ 68 (392) ||-+ +++|| ++.+.+.+.-.+...|. ..+-+| .++. ...|+.+++|......-. +........... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d--~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~ 78 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPT--PYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceecc--HHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 9954 36777 78888877776665553 334444 3443 256999999887654321 111111111123 Q ss_pred cccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-------- Q lcl|Aclame:pro 69 LTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAV-------- 140 (392) Q Consensus 69 ~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~-------- 140 (392) +.+..+...+..-.+ .++.++|..+|....++-.|+++++.++-+.--.+.-.+.+++.+++.-....... T Consensus 79 ~t~~kit~~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~ 157 (349) T protein:vir:94 79 ATPRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQND 157 (349) T ss_pred cccccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCc Confidence 444555555444333 55677888888887777789999999999998888888888887775432110000 Q ss_pred --c--cccchhhHHHHHHHHHHhhhccC--CCC--CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeE Q lcl|Aclame:pro 141 --H--EVAPDEFFKGVNGARRALNELYI--PQG--RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYE 212 (392) Q Consensus 141 --~--~~~~~~~~~~i~~a~~~l~~~~v--p~~--r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~ 212 (392) . ........+.+++|...|.++.. ..+ ..+++++..+..|.+...+... .. .-+...++.+.|.. T Consensus 158 ~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i-~~------s~~~~~i~ty~G~~ 230 (349) T protein:vir:94 158 MVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFI-RD------AENNTMFATYQGYR 230 (349) T ss_pred eeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhc-cC------cccCcccceecCcE Confidence 0 11223456788888888876643 223 3578999999998776443221 11 11233578889999 Q ss_pred EEEecceeeccc--------ceeecccccccchhhhccccccccce---eecccceeeeeeeccccceeeeeccccccee Q lcl|Aclame:pro 213 IVESTLIPHGDA--------YLYHPTAFIMATRAPAPPMGAVRSTA---ISGDQRIAMRWLVDYDSTITSNRSLIDTYFG 281 (392) Q Consensus 213 v~~s~~v~~~~~--------~~~~~~a~~~a~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (392) |.++..+|.... +.+...++.+.......+........ ..+......+......+....+........+ T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~ 310 (349) T protein:vir:94 231 VIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNG 310 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCc Confidence 999999986432 23333333333332211100000000 0000001111111111111100000000000 Q ss_pred eeEEE-eeccccceeeeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEcCCc Q lcl|Aclame:pro 282 LKVVE-DPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESSATD 352 (392) Q Consensus 282 ~~~~~-~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ssn~~ 352 (392) ..... .+...-... -..-........+.+ +.+.+...+ T Consensus 311 ~~~~~~sPt~aeLa~---~~NW~~v~~~K~I~i------------------------------v~~~~~~~a 349 (349) T protein:vir:94 311 TETIARSASWQDLAN---AANWNRVVDRKHVPI------------------------------AFLVTGVGA 349 (349) T ss_pred cccccCCCChHHhcC---CcCcccccChhhcce------------------------------EEEEeccCC Confidence 00000 000000000 000000000000000 111111111 No 109 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.93 E-value=2e-06 Score=51.81 Aligned_cols=289 Identities=9% Similarity=0.031 Sum_probs=121.2 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccC---CCccccccccCc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGA---ERNLTVSDFTED 77 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 77 (392) -.-.++.|+.+..++++.|++..++..++.+-. +.+ .+..+.||+.......- ....++. .......++.-. T Consensus 163 ~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~---~~~-~~~~~~ip~~~~~~~~a-~~~~Eg~~~~~~~~~~s~~~f~ 237 (477) T protein:vir:84 163 TGGYAVPPLWMMNRFIELARAGRTYANLCPTEP---LPG-GTSSINIPKILTGTSTA-IQAADNAALTAPSAHEVDLTDG 237 (477) T ss_pred CcceeeccchhHHHHHHHhhhcchHHHhhceee---ecC-CcceeEEEEEecCccee-eeeccCccccccccccccccee Confidence 111245577788899999999888777664432 222 23457887642221111 1111111 111112223334 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh--cccccc----ccc--c---ccc--- Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV--GAPYEA----AGA--V---HEV--- 143 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~--~~~~~~----~~~--~---~~~--- 143 (392) .+++...+. +.-+.|+.+-+.++..++...+.+...++|+.++|..++.--. ..|.+. ... . ... T Consensus 238 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~ 316 (477) T protein:vir:84 238 FVQANVKTI-AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALE 316 (477) T ss_pred eEEEeeeeE-EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchh Confidence 444444332 2334577777777788999999999999999999998773110 011110 000 0 000 Q ss_pred cchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhc--ccc----eeeeeccccce---eeeEeeeeeeeEeeeEEE Q lcl|Aclame:pro 144 APDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILN--DDR----FIKYESQGQSA---VSALQEARLGRIYGYEIV 214 (392) Q Consensus 144 ~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~--~~~----~~~~~~~G~~~---~~a~~~g~ig~~~g~~v~ 214 (392) ..+..+..++++...++.+..-....++++|..+..|.+ |.+ |......+... ...+..+..|.+.|++|+ T Consensus 317 ~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv 396 (477) T protein:vir:84 317 KHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVV 396 (477) T ss_pred hHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceE Confidence 011245556665554443322223467889988887744 321 11111010000 011334556789999999 Q ss_pred EecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 215 ESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 215 ~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .++.+|.+.+.......+.++.-. .......+... ....+..... ..+ .+. + .+.. T Consensus 397 ~s~~~p~~~~~~~d~~~i~~gd~~----------~~~i~~~~~~~-~~~~~~~~~~---~~~-~~~---v------~~~~ 452 (477) T protein:vir:84 397 TDPTLPTTLGTGTDQDVIHVLRAS----------DLALFESSVRM-RALQETRAEN---LSV-LLQ---V------YGYL 452 (477) T ss_pred ecCcccccccccCCcceEEEEEec----------eEEEEeeceeE-Eecccccccc---cee-eee---e------hhhh Confidence 999998654322111111111000 00000000000 0000000000 000 000 0 0000 Q ss_pred eeeeccceeeeeeecccccccceee Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATIT 319 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~ 319 (392) .-.....+..-+.++.......+.. T Consensus 453 ~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 453 AFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred hhhhhccccceEEeecccccccccC Confidence 0000000111111222222111111 No 110 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.92 E-value=4.1e-06 Score=50.14 Aligned_cols=267 Identities=15% Similarity=0.072 Sum_probs=122.1 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeee------ccccccccCCCccccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGH------TRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~------~~~~~~~~~~~~~~~~~~ 74 (392) -+...+.|+.+...+....+..+.+..+++.- .. .+..+++++-...... ..... +++......++ T Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v--~Eg~~~~~~~~ 201 (419) T protein:vir:94 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ-----NA-DYNVLEYIRDTSGTAGAGSTWNKAAVV--PEGTAKPQSTL 201 (419) T ss_pred CCcccccchhhhHHHHHHHhhhhhhhhcceee-----ec-cCCceeeeeeccccccccccCccccee--cCCcccccccc Confidence 22334678888888888877777766665432 21 2445666552211111 11111 22333333444 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hccccc-----------ccccccc Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYE-----------AAGAVHE 142 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~-----------~~~~~~~ 142 (392) .-..+++.+.+.. .-+.|+.+-+. +..++...+.++.+++++.++|..++.-- .+.+.+ ....... T Consensus 202 ~~~~i~~~~~k~~-~~~~is~ell~-d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~ 279 (419) T protein:vir:94 202 SFDTITTTLKTVA-HWLPITRQAAD-DNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAP 279 (419) T ss_pred ceeeEEeeeeeEE-EeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccc Confidence 4455555554432 34567765544 45567777777899999999999887310 001110 0111112 Q ss_pred ccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccc--eeeeEeeeeeeeEeeeEEEEeccee Q lcl|Aclame:pro 143 VAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQS--AVSALQEARLGRIYGYEIVESTLIP 220 (392) Q Consensus 143 ~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~--~~~a~~~g~ig~~~g~~v~~s~~v~ 220 (392) .+....++++.++...+.....+. ..++++|..+..|.+... ..|.. .......+..+.+.|+.|+.+..+| T Consensus 280 ~t~~~~~~~l~~~~~~~~~~~~~~-~~~v~n~~~~~~l~~~k~-----~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~ 353 (419) T protein:vir:94 280 ATDEPPLVDIRRAKTVAEIAGFPP-DGVVVHPQDWESIELDQA-----PGSGVFRVIANVQGEATPRIWGLNVVSTVAIA 353 (419) T ss_pred cccchhHHHHHHHHHhhhhccCCC-CEEEEcHHHHHHHHHHhh-----cCCCceeecCCcccCCCccccceeeEEcCCCC Confidence 233456889999888887666543 368999999888754211 01110 0011234556789999999999988 Q ss_pred ecccceeeccc-ccccchhhhccccccccceeecccceeeeeeeccc--cceeeeecccccceeeeEEEeeccccceeee Q lcl|Aclame:pro 221 HGDAYLYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYD--STITSNRSLIDTYFGLKVVEDPNGVGFVRAR 297 (392) Q Consensus 221 ~~~~~~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (392) .+.......+. .....+ .+.......... ..............++.+.... T Consensus 354 ~~~~~~gd~~~~~~~~~~-----------------~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~--------- 407 (419) T protein:vir:94 354 QGTALVGGFRQGATLWSR-----------------QGITVLMTDSHADFFTANTLVILAEFRANLAVYQPK--------- 407 (419) T ss_pred CccEEEeeccceEEEEEe-----------------cceEEEEeccccchhhcCcEEEEEEEeeccEEeccc--------- Confidence 65533221111 100000 000000000000 0000000000001111111100 Q ss_pred eccceeeeeeec Q lcl|Aclame:pro 298 KIHLIPGSIEVA 309 (392) Q Consensus 298 ~~~~~~~~v~v~ 309 (392) .+........++ T Consensus 408 a~~~~~~~aa~~ 419 (419) T protein:vir:94 408 AFVRVTFAAATT 419 (419) T ss_pred cEEEEEeccCCC Confidence 000000000000 No 111 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.91 E-value=4.4e-06 Score=49.96 Aligned_cols=274 Identities=12% Similarity=-0.001 Sum_probs=125.6 Q ss_pred Ccccc-----ccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MANAF-----SKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man~~-----~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 74 (392) ||.+. +.|+.++.++++.+++..++..++.+- . -++..+++|+... ..+.. . +++......++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~-----~-~~~~~~~~p~~~~~~~a~w---v--~Eg~~~~~s~~ 69 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQK-----P-IPFNGQREFVFDFDSDIDI---V--AENGKKTHGGV 69 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhccee-----e-ccCCceEEEEEecCcceEE---e--eCCcccccccc Confidence 98644 678888999999999998876665321 1 1234577776322 22222 2 23333444444 Q ss_pred cCceEEEEEEeeeecceEeeHHHHh---hhccChHHHHHHHHHHHHHHHHHHHHHHHHh---ccccc---------cccc Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELT---FDLESFATQILPRQVRGVADILEEGVRDMIV---GAPYE---------AAGA 139 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~---~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~---~~~~~---------~~~~ 139 (392) .=+.++++..+ .+.-+.|+++-+. .+..++...+.++.++++++++|+.++.-.. +.+.. .... T Consensus 70 ~f~~v~l~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 148 (300) T protein:vir:95 70 SLDPVTIVPLK-VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQ 148 (300) T ss_pred cceeeEeeeEE-EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccce Confidence 44555555433 2345567777653 3456788889999999999999999884321 11100 0000 Q ss_pred cccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 140 VHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 140 ~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .........++++.++...+...+... ..++++|..+..|.+-. +..|.-. ......|..+++.|+.++.++. T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~L~~lk-----d~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~ 222 (300) T protein:vir:95 149 TVPFKDTNPDESMEDAVGMIDGSERDI-TGAILDPIFTTALSKMK-----NAEGGKLYPELAWGGVPDAINGLAVDKNRT 222 (300) T ss_pred eecccccchHHHHHHHHHHhhhcCCCc-cEEEECHHHHHHHHHhh-----ccCCCeeccCccccCCCceecceeeEEecC Confidence 011123345788888888887655532 35889999998886432 1222211 0112345667899999999998 Q ss_pred eeecccceeecccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccce-e-ee-EEEeeccccce Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYF-G-LK-VVEDPNGVGFV 294 (392) Q Consensus 219 v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~-~~~~~~~~~~~ 294 (392) +|........ ..+ .+........+ ..+.........+.. ......+. . .. -.......... T Consensus 223 v~~~~~~~~~-~~~----------~GDf~~~~~~~~~~~~~~~v~~~~~~d----~~~~~~f~~~~v~~r~~~r~d~~v~ 287 (300) T protein:vir:95 223 VSYSQTDPKN-TAI----------VGDFETMFKWGYAKEVPMEIIKYGDPD----NSGRDLKGYNQIYIRCEAYIGWGIM 287 (300) T ss_pred CCCCCCCCcc-EEE----------EeeccceEEEEEecccEEEEeeccCCC----CcchhhhhcCcEEEEEEEeecceee Confidence 8753311000 000 00000000000 000000000000000 00000000 0 00 00000000000 Q ss_pred eeeeccceeeeeeecccccccceeeeeec Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANATITAAAG 323 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~~~~~~~~ 323 (392) . +..-+.+..+ .+ T Consensus 288 ~------~~a~~~l~~~----------~g 300 (300) T protein:vir:95 288 D------AASFARIVKT----------GG 300 (300) T ss_pred c------ccceEEEecC----------CC Confidence 0 0000000000 00 No 112 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.90 E-value=8.6e-06 Score=48.37 Aligned_cols=282 Identities=9% Similarity=-0.042 Sum_probs=122.9 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |++. .+.|+.+..++++.+++..++.+++.+- .- .+.+++||+... ..+... +++..+...+ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~ip~~~~~~~a~~v-----~Eg~~~~~~~ 82 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-----PM-GTTGQKIPHWVGDVSAQWI-----GEGDMKPITK 82 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee-----ec-cCCceEEEEEeCCcceEEe-----cCCccccccc Confidence 4322 3578889999999999999988887542 11 244678876432 222222 2344444445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc-cccc-------cccccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG-APYE-------AAGAVHEVAP 145 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~~~-------~~~~~~~~~~ 145 (392) +.-..++++..+. ..-+.++++-+.++..++...+.+..++++++++|+.++.--.. .+.. .......... T Consensus 83 ~~f~~i~~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 161 (318) T protein:vir:24 83 GNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGAT 161 (318) T ss_pred cceeEEEEeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccccccccccccc Confidence 5445555555332 34557888877778889999999999999999999988742110 0000 0000011111 Q ss_pred hhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhc--ccc--eeeeeccccceeeeEeeeeeeeEeeeEEEEecceee Q lcl|Aclame:pro 146 DEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILN--DDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPH 221 (392) Q Consensus 146 ~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~--~~~--~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~ 221 (392) ......+.++...+..... ....++++|..+..|.+ |.. +.-...... ........+++.|++++.+..++. T Consensus 162 ~~~~~~~~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~---~~~~~~~~~~i~g~pv~~~~~~~~ 237 (318) T protein:vir:24 162 TVYDQVAVNGLSLLVNDGK-KWTHTLLDDITEPILNGAKDQNGRPLFIESTYG---EAASPFRSGRIVARPTILSDHVVE 237 (318) T ss_pred chHHHHHHHHHHhhccccC-CCCEEEEcHHHHHHHHHhhccCCceeecCcccc---CccccccCceEEEEeeEEeCCCCC Confidence 1223344555544444333 23467999999988864 211 110000000 001111225788999999888876 Q ss_pred cccceee--cccccccchhhhccccccccceeec--ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeee Q lcl|Aclame:pro 222 GDAYLYH--PTAFIMATRAPAPPMGAVRSTAISG--DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRAR 297 (392) Q Consensus 222 ~~~~~~~--~~a~~~a~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (392) +....+. .+...+..............+.... ..+.. ......+... .-.....+..+........ T Consensus 238 ~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~-~~~f~~~~~~----~r~~~r~d~~v~~~~a~~~----- 307 (318) T protein:vir:24 238 GTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNF-VSLWQHNLVA----VRVEAEYAFHCNDAEAFVA----- 307 (318) T ss_pred CccEEEEeecceEEEEEecCeEEEEeeccceeccccccccc-hhhhhcCcEE----EEEEEEEccEEecccceEE----- Confidence 5432221 1111111111100000000000000 00000 0000000000 0000000000000000000 Q ss_pred eccceeeeeeeccccccccee Q lcl|Aclame:pro 298 KIHLIPGSIEVAPEAGANATI 318 (392) Q Consensus 298 ~~~~~~~~v~v~~~~~~~~~~ 318 (392) ++.+....... T Consensus 308 ----------i~~~~a~~~~~ 318 (318) T protein:vir:24 308 ----------LTNVVSGGGEG 318 (318) T ss_pred ----------EEeeccCCCCC Confidence 00000000000 No 113 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.90 E-value=6.5e-06 Score=49.01 Aligned_cols=281 Identities=13% Similarity=0.063 Sum_probs=114.7 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) ||. ..++|+.+.+++++.+++..++..++.+- .. .+.+++||+... ..+.........++......+ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~-----~~-~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~ 74 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV-----NM-GTKTTHLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhccee-----ec-cCCcEEEEEEeCCcceEEeecccccccccccccc Confidence 885 34789999999999999999888877532 11 244678876432 222222111111111122223 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc---------cccc--ccccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG---------APYE--AAGAVHE 142 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~---------~~~~--~~~~~~~ 142 (392) +.-..+++...|. +.-+.|+++-+.++..++...+.+..++++++++|+.++.--.. .+.. ....... T Consensus 75 ~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) T protein:vir:25 75 VTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) T ss_pred cceeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccc Confidence 3334444444332 34567888877788889999999999999999999988731100 0000 0000011 Q ss_pred ccch----hhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 143 VAPD----EFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 143 ~~~~----~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .... ..++.+..+...+........ -++++|..+..|.+-. +..|. -.+.. +.+.|+.++.++. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~l~~lk-----d~~G~---~i~~~---~~l~G~Pv~~~~~ 221 (305) T protein:vir:25 154 VGGVANESDIVGATNRAAKAVASAGWAPD-TLLSSLALRYEVANIR-----DANGN---PVFRD---DSFAGFRTFFNRN 221 (305) T ss_pred cccchhhhHHHHHHHHHHHhhhhcccccc-eeEecHHHHHHHHHhh-----ccCCc---eeecC---CcccccceEEcCc Confidence 1111 123333333333333222222 2678999888875421 12222 11112 4688889988877 Q ss_pred eeecccceeecccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeecccccce-eeeEEEeeccccceee Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYF-GLKVVEDPNGVGFVRA 296 (392) Q Consensus 219 v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 296 (392) ++...... ...++ .... ...+. .+......... ...........+. .-.........+. T Consensus 222 ~~~~~~~~----~~~~g---------d~s~-~~i~~~~~~~i~~~~~~--~~~~~~~~~~~~~~~~~~~R~~~r~~~--- 282 (305) T protein:vir:25 222 GAWDADAA----IEVIA---------DSSR-VKIGVRQDITVKFLDQA--TLGTGENQINLAERDMVALRLKARFAY--- 282 (305) T ss_pred cCCCCCcc----EEEEE---------ecce-EEEEEecCeEEEEeeee--eeecCCceeeeeecCcEEEEEEEeecc--- Confidence 66432110 00000 0000 00000 00000000000 0000000000000 0000000000000 Q ss_pred eeccceeeeeeecccccccceeeeeeccCeeEEEEEeecC Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDAN 336 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~ 336 (392) .+..+...+.+..+.... ++|.. T Consensus 283 -~v~~p~a~v~~~~~~~~~----------------~~pa~ 305 (305) T protein:vir:25 283 -VLGVSATAQGANKTPVAV----------------VAPAA 305 (305) T ss_pred -eeeCcccEEEEccccccc----------------cCCCC Confidence 000000000011100000 00000 No 114 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.89 E-value=1.2e-05 Score=47.53 Aligned_cols=283 Identities=10% Similarity=-0.040 Sum_probs=121.9 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN-----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man-----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) ||. ..+.|+.+++++++.+++..++..++.+-. .. +..++||+.. ...+... + ++..+...++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~----~~--~~~~~~p~~~~~~~a~wv---~--Eg~~~~~~~~ 69 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKP----QR--FGNEDIITFNGRPKAEFV---G--EGQQKSSTTG 69 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceee----cc--CCceEEEEEeCCceeEEe---e--cCcccccccc Confidence 884 336799999999999999998877765421 11 2346777642 3333332 2 3333443444 Q ss_pred cCceEEEEEEeeeecceEeeHHHHh---hhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc-cc---c----ccccc--- Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELT---FDLESFATQILPRQVRGVADILEEGVRDMIVGA-PY---E----AAGAV--- 140 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~---~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~-~~---~----~~~~~--- 140 (392) +-..+++...|. +.-+.|+++-+. .+..++.+.+.++.+++|++++|+.++.-.... +. . ..... T Consensus 70 ~f~~v~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~ 148 (311) T protein:vir:99 70 EFDFVTSTPKKA-QVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRV 148 (311) T ss_pred eeeEEEEeeEEE-EEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccccee Confidence 445555554332 344567777553 345678888999999999999999888432110 00 0 00000 Q ss_pred --ccccchhhHHHHHHHHHHhhhccCC--CCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEE Q lcl|Aclame:pro 141 --HEVAPDEFFKGVNGARRALNELYIP--QGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVE 215 (392) Q Consensus 141 --~~~~~~~~~~~i~~a~~~l~~~~vp--~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~ 215 (392) ........+.++.++...+..++.. .+ .++++|..+..|.+-.. ..|.-.- .....+..+++.|+.++. T Consensus 149 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~~vmn~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~~l~G~Pv~~ 222 (311) T protein:vir:99 149 ELTADTIANPDLAIEAAVGLLVANGHPTPVN-GLALHPSIAWGLSTARY-----TDGRKKFPELGLGIGVSSFEGIDASV 222 (311) T ss_pred eccccccchhHHHHHHHHHHHhhhccCCCcc-EEEEcHHHHHHHHhhhc-----cCCCeeecCcccCCCCceecceeeEe Confidence 0111122344555555554444332 22 37889999888854211 1121110 111234456899999999 Q ss_pred ecceeecccceeecccccccchhhhccccccccce-eecccceeeeeeeccccceeeeeccccc--ceeeeEEEeecccc Q lcl|Aclame:pro 216 STLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTA-ISGDQRIAMRWLVDYDSTITSNRSLIDT--YFGLKVVEDPNGVG 292 (392) Q Consensus 216 s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 292 (392) ++.++..............+..... ..+...... ...............+.....+....+. +....-.. . . T Consensus 223 s~~i~~~~~~~~~~~~~~~~~~~~~-~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d---~-~ 297 (311) T protein:vir:99 223 SDTVNGGDEADPDDEDLDAARAVRG-IVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYG---W-Y 297 (311) T ss_pred ecccccccccccccchhhccCcceE-EEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeec---c-e Confidence 9988754433222111111111000 000000000 0000000000000000000000000000 00000000 0 0 Q ss_pred ceeeeeccceeeeeeeccccc Q lcl|Aclame:pro 293 FVRARKIHLIPGSIEVAPEAG 313 (392) Q Consensus 293 ~~~~~~~~~~~~~v~v~~~~~ 313 (392) ... +.-+.+....- T Consensus 298 v~~-------~~~v~~~~~~A 311 (311) T protein:vir:99 298 VFT-------DRFVVIENAVA 311 (311) T ss_pred ecC-------hhHeeeecccC Confidence 000 00000000000 No 115 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=97.87 E-value=5.6e-06 Score=49.38 Aligned_cols=268 Identities=9% Similarity=-0.011 Sum_probs=126.7 Q ss_pred Cccc--cccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MANA--FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man~--~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+-+ .+.|+.+..++++.++++.++..++.+- .- .+..++||+.. ...+... + ++......++.-. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~-----~~-~~~~~~~p~~~~~~~a~~v---~--Eg~~~~~~~~~f~ 69 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK-----PI-PFNGEKVFTFTMDSEIDVV---A--ESGKKTHGGVTLA 69 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhccee-----ec-cCCceEEEEEecCcceEEe---e--CCcccccccccee Confidence 9953 3788889999999999998887766432 11 12357787753 2233332 2 2333333344445 Q ss_pred eEEEEEEeeeecceEeeHHHHhh---hccChHHHHHHHHHHHHHHHHHHHHHHHHhcc---cccc------c-----ccc Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTF---DLESFATQILPRQVRGVADILEEGVRDMIVGA---PYEA------A-----GAV 140 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~---~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~---~~~~------~-----~~~ 140 (392) .+++...+. ..-+.++++-+.+ +..++...+.++.+++|++++|..++.-.... .... . ... T Consensus 70 ~v~l~~~k~-~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) T protein:vir:94 70 PQTMVPIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) T ss_pred EEEEeeeEE-EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccc Confidence 555554333 3455677776543 34567788889999999999999887432100 0000 0 000 Q ss_pred ccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecce Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLI 219 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v 219 (392) ........++++.++...|..++... ..++++|..+..|.+-. +..|.-. ......|..+++.|++|+.++.+ T Consensus 149 ~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v 222 (298) T protein:vir:94 149 APRGIADPNGAIENAVELLTGVDADV-TGIAINPSFRSALAKQK-----DLQGNALFPELKWGATPDTINGLPVDVNKTV 222 (298) T ss_pred cccccccHHHHHHHHHHhhhhcCCCc-cEEEEcHHHHHHHHHhh-----ccCCCeeecCcccCCCCceecceeeEEeccc Confidence 11122345778888888887766643 36899999998886521 1122211 11223455678999999999988 Q ss_pred eecccceeecccccccchhhhccccccccceeec-ccceeeeeeecccccee-e---eeccc----ccceeeeEEEeecc Q lcl|Aclame:pro 220 PHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTIT-S---NRSLI----DTYFGLKVVEDPNG 290 (392) Q Consensus 220 ~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~---~~~~~----~~~~~~~~~~~~~~ 290 (392) |........ .. ..+........+ ............+.... . ....+ ....+..+.... T Consensus 223 ~~~~~~~~~---~~--------~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~-- 289 (298) T protein:vir:94 223 SDMSLTQRD---RA--------IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT-- 289 (298) T ss_pred ccccCCCcc---EE--------EEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeeccc-- Confidence 743211000 00 000000000000 00001111000000000 0 00000 000000000000 Q ss_pred ccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 291 VGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 291 ~~~~~~~~~~~~~~~v~v~~~~ 312 (392) .-+.+...+ T Consensus 290 -------------a~~~l~~~t 298 (298) T protein:vir:94 290 -------------KFARVTEAN 298 (298) T ss_pred -------------ceEEEEecC Confidence 000000111 No 116 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.85 E-value=1.3e-05 Score=47.41 Aligned_cols=288 Identities=12% Similarity=0.017 Sum_probs=124.4 Q ss_pred Ccc----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCcccccccc Q lcl|Aclame:pro 1 MAN----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Man----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 75 (392) ||- .++.|+.+++++++.++++.++..++.+- . . .+..+++|+.. ...+.+. +++......+++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i--~-~---~~~~~~~p~~~~~~~a~wv-----~Eg~~~~~~~~~ 69 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE--P-Q---EFGEQQYMTLTAPPRGEVV-----GEGAQKSESTAT 69 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee--e-c---CCCceEEEEEeCCceeEEe-----ecCcccccccce Confidence 773 56899999999999999999887776542 1 1 12357887742 3333332 234434444554 Q ss_pred CceEEEEEEeeeecceEeeHHHHhh---hccChHHHHHHHHHHHHHHHHHHHHHHHHhcc-cc-------cc-----ccc Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLTDEELTF---DLESFATQILPRQVRGVADILEEGVRDMIVGA-PY-------EA-----AGA 139 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~d~~~~~---~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~-~~-------~~-----~~~ 139 (392) -..+++...+. +.-+.|+++-+.+ +..++.+.+.++.+++|++++|..++.--... .. .. ... T Consensus 70 f~~v~l~~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~ 148 (311) T protein:vir:81 70 FAPVTAIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) T ss_pred eeEEEEeeEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeee Confidence 45555555333 3445677775543 33457888899999999999999887432100 00 00 000 Q ss_pred cccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 140 VHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 140 ~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .........+..+.++...+...+.. ...++++|..+..|.+-. +..|.-. ......+..+.+.|++|+.++. T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~~vmn~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~ 222 (311) T protein:vir:81 149 LTTGTSATPDLAVEAAVGLVLGDNLS-PDGVALDNTFSFMLATQR-----DSQGRKLYPELGFGTDVASFAGLNAAVSDT 222 (311) T ss_pred ecccccchHHHHHHHHHHHhhhcCCC-ceEEEEcHHHHHHHHhhh-----ccCCCeeecCccccCCCceecceeEEeccc Confidence 11112223344555555555444432 235889999998886521 1122211 0112245567899999999988 Q ss_pred eeecccceeeccccc-ccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeee Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFI-MATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRAR 297 (392) Q Consensus 219 v~~~~~~~~~~~a~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (392) +|............. ..........+...........+.........+.....+....+.. ..- ............. T Consensus 223 i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~r-~~~r~d~~v~~~~ 300 (311) T protein:vir:81 223 VRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQI-AIR-AEVVYGIGIMSTD 300 (311) T ss_pred ccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcE-EEE-EEEEeccEeeccc Confidence 875442211100000 0000000000000000000000000000000000000000000000 000 0000000000000 Q ss_pred eccceeeeeeecccccc Q lcl|Aclame:pro 298 KIHLIPGSIEVAPEAGA 314 (392) Q Consensus 298 ~~~~~~~~v~v~~~~~~ 314 (392) .+ +.+...... T Consensus 301 a~------~~l~~a~~~ 311 (311) T protein:vir:81 301 AF------AVVRDADES 311 (311) T ss_pred ce------EEEEeeccC Confidence 00 000000000 No 117 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.80 E-value=8.7e-06 Score=48.32 Aligned_cols=262 Identities=12% Similarity=0.057 Sum_probs=123.6 Q ss_pred Ccc----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) |.- ..++|+.|...+++.+++...+..++.. +.. .+.++.+|+............ +++......++.- T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~-----~~~-~~~~~~~~~~~~~~~~~~~~v--~Eg~~~~~~~~~f 180 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGA-----VSI-SGGTYTFVRENGAGEGAIGAQ--VEGATKGQKDYDI 180 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhcee-----eec-cCCceEEEEeecCCCcccccc--cCCccccccccce Confidence 221 2257999999999999998888777643 222 245688876432211111112 2333333445555 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHH Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGAR 156 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~ 156 (392) ..+++.+.+... -+.|+++-+. +...+...+.+...++|+.++|..++.-..... .......+....++++.++. T Consensus 181 ~~i~~~~~k~~~-~~~iS~ell~-D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~---~~~~~~~~~~~~~d~i~~~~ 255 (379) T protein:vir:10 181 SMIDVNTDFIAG-FTRYSKKMAN-NLPFLTSFIPNALRRDYAKAENAAFNAVLAANA---TASTEIITNKNKVEMLINEI 255 (379) T ss_pred eeeEeeeeeEEe-eehhhHHHHh-hHHHHHHHHHHHHHHHHHHHHHHHHhccccccc---ccccccccCcccHHHHHHHH Confidence 666666655433 4467776544 444566667777889999999988776443221 11112233344577888877 Q ss_pred HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee---eeEeeeeeeeEeeeEEEEecceeecccceeeccccc Q lcl|Aclame:pro 157 RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV---SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFI 233 (392) Q Consensus 157 ~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~---~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~ 233 (392) ..+..++.+.+ .++++|..+..|.+-.. ..|.-.. -....|....+.|++|+.++.+|.+..+....+... T Consensus 256 ~~~~~~~~~~~-~~vmn~~~~~~l~~lkd-----~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~ 329 (379) T protein:vir:10 256 AKQENLDFPVT-AIVLRPTDYYDILVTQK-----SVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVT 329 (379) T ss_pred HhhhhccCCCC-EEEEcHHHHHHHHHhhc-----cCCceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEE Confidence 77776666544 47789999888754211 1111000 001133345788999999988875543221111110 Q ss_pred ccchhhhccccccccceeecccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 234 MATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 234 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) ...+ .+...............+... ...-.++.+... .. ...+.++.+ T Consensus 330 ~~~~-----------------~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p---------~a----~v~~~~~~~ 379 (379) T protein:vir:10 330 KVTT-----------------EGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQP---------AA----LIFGDFTAV 379 (379) T ss_pred EEEE-----------------eceEEEEeecccccccCCcEEEEEEEEeccEEecC---------cc----EEEEEecCC Confidence 0000 000000000000000000000 000000000000 00 000111111 No 118 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.79 E-value=1.7e-05 Score=46.75 Aligned_cols=260 Identities=12% Similarity=0.020 Sum_probs=120.5 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. .++.|+ +...+++.+++...+.+++..- .. .+.++++|+..... ......+ ++....-.++ T Consensus 114 ~~~~~~~~g~~~~~~-~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~a~~v~--Eg~~~~~~~~ 183 (390) T protein:vir:10 114 STDAAGSAGALTTPN-RLPGFITQPDARLTVRDLIGSG-----RT-DSALIEYVQETGFV-NNAAIVA--EGALKPESSL 183 (390) T ss_pred hcccccccccccchh-HHHHHHHHHHhhchhhhhccee-----ec-cCCceEEEEEecCC-cceeeec--CCcccccccc Confidence 111 234444 5567899999988887776432 21 23457776543211 1112222 2333333455 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--ccc------cccccccccch Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYE------AAGAVHEVAPD 146 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--~~~------~~~~~~~~~~~ 146 (392) +-..+++.+.+. +.-+.|+++-+. +..++...+.++.+++++.++|..++.--... +.+ ........... T Consensus 184 ~~~~i~~~~~k~-~~~~~is~ell~-d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:10 184 KFAKKTDTTHVI-AHTMKATRQILS-DAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred ceeEEEEeeEEE-EEeehhhHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccccccc Confidence 556666666544 345567776444 44577777888889999999999887421000 100 00111122233 Q ss_pred hhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccce Q lcl|Aclame:pro 147 EFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYL 226 (392) Q Consensus 147 ~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~ 226 (392) ..++.+.++...|.....+.. .++++|..+..|.+-. +..|.-.......+..+.+.|.+|+.+..+|.+.... T Consensus 262 ~~~~~~~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~lk-----d~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~ 335 (390) T protein:vir:10 262 TRVDQLRLAMLQASLAEYPAS-GIVINPIDWAAIELAK-----DANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLV 335 (390) T ss_pred chHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhh-----cCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEE Confidence 457788888888877766544 5788999988876421 1112111011112334578999999999998655432 Q ss_pred eeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceeeeecccee Q lcl|Aclame:pro 227 YHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) Q Consensus 227 ~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (392) ...+. .....+ .+....+....+. ...+... .....++.+...... T Consensus 336 gdf~~~~~~~~~-----------------~~~~i~~~~~~~~-~~~~~~~~r~~~r~d~~v~~~~a~------------- 384 (390) T protein:vir:10 336 GAFDLAAQIFDQ-----------------WDARVEIGYVNDD-FQRNMVTVLAEERLALVVYRPEAL------------- 384 (390) T ss_pred EeccceEEEEEe-----------------cceEEEEeecccc-cccCcEEEEEEEeeccEEeccccE------------- Confidence 22111 000000 0001100000000 0000000 000000000000000 Q ss_pred eeeeecccccc Q lcl|Aclame:pro 304 GSIEVAPEAGA 314 (392) Q Consensus 304 ~~v~v~~~~~~ 314 (392) ..+.++ T Consensus 385 -----~~~~~a 390 (390) T protein:vir:10 385 -----ISGSFA 390 (390) T ss_pred -----EEEEeC Confidence 000000 No 119 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.77 E-value=1.5e-05 Score=47.04 Aligned_cols=276 Identities=8% Similarity=0.042 Sum_probs=120.9 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |+ ..++.|+.+...+++.+++..++..++..... ....|. +.++...... ......++ +..... .. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~-~~~~~~~~~~-~~a~~v~E--~~~~~~~~~ 181 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV---TTLTGS-RVYEKWTDIT-GLANIDDE--AGKIADVDD 181 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec---ccCccc-eEEEeeccCC-cceeeecC--ccccccccc Confidence 32 23478999999999999999998887754321 111222 2332211111 11112222 222221 23 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++.+.+. +.-+.|+++-+.++..++...+.++.+++|++.+|..++.-.... ........++++. T Consensus 182 ~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~--------~~~~~~~~~d~i~ 252 (397) T protein:vir:49 182 PKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAAL--------PTKPTLTKWDDII 252 (397) T ss_pred cceeeEEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------ccccccccHHHHH Confidence 3445555655333 345578888777777889889999999999999999887532211 1112234578888 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEec--ceeecccceeecc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVEST--LIPHGDAYLYHPT 230 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~--~v~~~~~~~~~~~ 230 (392) ++...|..+..+. -.++++|..+..|.+-.. ..|.-.. .....|..+.+.|++|+... .+|....... T Consensus 253 ~~~~~l~~~~~~~-a~~vmn~~~~~~l~~lkd-----~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~--- 323 (397) T protein:vir:49 253 DLEAKVDPAIKQT-SFFLTNTSGFTALKKVKN-----ALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAM--- 323 (397) T ss_pred HHHHhhhhhhcCC-CEEEEcHHHHHHHHHhhc-----CCCceeeccCcCCCCCceecceeeEEecccccccccCCce--- Confidence 8888887665543 468889999988864311 1222111 11234555789999887643 2332221000 Q ss_pred cccccchhhhccccccccceeecc-cceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+.++ .......... .+...............+. .......++......... ..........+.... T Consensus 324 ~i~~g---------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~-~~~~~~~~~~~~~~~ 393 (397) T protein:vir:49 324 PLYFG---------DLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFV-PASFKAIADQKGNLG 393 (397) T ss_pred eEEEe---------eccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceE-EEEeecccCCCCCcc Confidence 00000 0000000000 0000000000000000000 000000000000000000 000000000000000 Q ss_pred eccc Q lcl|Aclame:pro 308 VAPE 311 (392) Q Consensus 308 v~~~ 311 (392) .+.+ T Consensus 394 ~~~~ 397 (397) T protein:vir:49 394 STAV 397 (397) T ss_pred cccC Confidence 0000 No 120 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.76 E-value=3.8e-06 Score=50.31 Aligned_cols=276 Identities=12% Similarity=0.058 Sum_probs=123.6 Q ss_pred Ccc-------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MAN-------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man-------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~ 72 (392) ++. -+++|+.|..++++.+++...+..++++- .- +..+++|+... ..+..... .+++...... T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~-----~~--~~~~~~p~~~~~~~a~~~~~--~~e~~~~~~~ 211 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGV-----KT--KENIKYPVLVKKAEAQGHKN--ERTNNEMPET 211 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhccee-----cc--CCceEEEEEecCCcccceec--cccccccccc Confidence 111 24789999999999999999887776542 11 22467766422 11221111 1222233333 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh-cccc----ccccccccccchh Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAPY----EAAGAVHEVAPDE 147 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~-~~~~----~~~~~~~~~~~~~ 147 (392) ++.-..+++...+. +.-+.|+++-+.++..++...+.+..+++|+.++|..++.--. ..+. ............. T Consensus 212 ~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~ 290 (434) T protein:vir:62 212 DIEFDEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKN 290 (434) T ss_pred ccceeeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccc Confidence 33334455554333 2344677777777778998899999999999999998873110 0000 0011111223335 Q ss_pred hHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee---eeEeeeeeeeEeeeEEEEecceeeccc Q lcl|Aclame:pro 148 FFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV---SALQEARLGRIYGYEIVESTLIPHGDA 224 (392) Q Consensus 148 ~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~---~a~~~g~ig~~~g~~v~~s~~v~~~~~ 224 (392) .+++|+++...|+.+..+.. .++++|..+..|.+-. +..|.-.- .....|....+.|++|+.++.+|...+ T Consensus 291 ~~d~l~~l~~~l~~~~~~~a-~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~ 364 (434) T protein:vir:62 291 LYDALVKMKNTPVKEVRKKA-RWVLNTAALTKIETMK-----TDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDS 364 (434) T ss_pred hhhHHHHHHhhcchhhhcCC-EEEEcHHHHHHHHHhh-----ccCCCEeeccCCCccCCCCceecceeeEEecCccCccC Confidence 68899988877765544333 5688999998874421 11121100 011234445799999999888764332 Q ss_pred ceeecccccccchhhhccccccccceeeccc--ceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 225 YLYHPTAFIMATRAPAPPMGAVRSTAISGDQ--RIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 225 ~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) .... .+. .+....+. .... ....... ..........+.-...-..+..........+. T Consensus 365 ~~~~--~i~---------~Gdfs~~~-i~~~~g~~~i~~~--------~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~ 424 (434) T protein:vir:62 365 PDTP--VFY---------FGDFSKFY-IQDVIGSLEVQKL--------VELFSRTNRVGFRIWNLLDAQLIHSPFEVPVY 424 (434) T ss_pred CCce--EEE---------EeeccceE-EEEeeceeEEEee--------hhhhcccCceEEEEEeeecceeecCcccceEE Confidence 1100 000 00000000 0000 0000000 00000000000000000000000000000000 Q ss_pred eeeeeecccccccc Q lcl|Aclame:pro 303 PGSIEVAPEAGANA 316 (392) Q Consensus 303 ~~~v~v~~~~~~~~ 316 (392) .+.....+.. T Consensus 425 ----~~~~~~~~~~ 434 (434) T protein:vir:62 425 ----KYVLKAPTGA 434 (434) T ss_pred ----EEEeccCCCC Confidence 0000001110 No 121 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.73 E-value=1.6e-05 Score=46.90 Aligned_cols=273 Identities=10% Similarity=0.060 Sum_probs=121.7 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.- -+++|+.|..++++.+++...+.++++.- .-..|..+.++..... ....... +++......++ T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~~~-~~~~~~v--~E~~~~~~~~~ 188 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQIL-----TTSDGRTMEWATADGT-SEVGVLL--GENEEAGEEDT 188 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceee-----ecCCCceEEEEeeccC-ccccccc--ccccccccccc Confidence 321 24789999999999999998887766432 1112334555443211 1111111 22333333344 Q ss_pred cCceEEEEEEeeee--cceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH--------hccccccc-cccccc Q lcl|Aclame:pro 75 TEDSFPVTLTDVAY--HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI--------VGAPYEAA-GAVHEV 143 (392) Q Consensus 75 ~~~~~~~~i~~~~~--~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~--------~~~~~~~~-~~~~~~ 143 (392) .-.. +++..++. .-+.|+++-+.++..++...+..+..++++.++|..++.-- .+.-+... ...... T Consensus 189 ~f~~--~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~ 266 (409) T protein:vir:45 189 DFGM--GSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAA 266 (409) T ss_pred ccce--eeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccccccccc Confidence 3333 44444443 23468888877777889899999999999999999887311 11100100 111112 Q ss_pred cchhhHHHHHHHHHHhhhccCCCCCE-EEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecceee Q lcl|Aclame:pro 144 APDEFFKGVNGARRALNELYIPQGRV-LVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLIPH 221 (392) Q Consensus 144 ~~~~~~~~i~~a~~~l~~~~vp~~r~-~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v~~ 221 (392) .....+++++++...|..+..-...| +++++..+..|.+-. +..|.-. ......|..+.+.|++|+.++.+|. T Consensus 267 ~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lk-----d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~ 341 (409) T protein:vir:45 267 ANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEME-----DGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDD 341 (409) T ss_pred ccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhh-----cCCCceeeccCcCCCCCceecceeeEEecCcCC Confidence 23346788988888776554433455 467898887764321 1112110 0112345556899999999998874 Q ss_pred cccceeecccccccchhhhccccccccceeecccceeeeeee-ccccceeeeecccccceeeeEEEeeccccceeeeecc Q lcl|Aclame:pro 222 GDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLV-DYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIH 300 (392) Q Consensus 222 ~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (392) ...... .+.++ ....+......+....+.. .+...... ........++...... .+. T Consensus 342 ~~~~~~---~i~~G---------d~~~~~i~~~~~~~~~~~~d~~~~~~~~-~~~~~~r~d~~~~~~~---------A~~ 399 (409) T protein:vir:45 342 IGAGKK---FMFCG---------DFDRFIIRRVRYMILKRLVERYAEYDQT-GFLAFHRFDCILEDTS---------AIK 399 (409) T ss_pred ccCCcc---EEEEe---------ehhhhheeeccceEEEEeecccccCCcE-EEEEEEEeccEeechh---------heE Confidence 321100 00000 0000000000000000000 00000000 0000000000000000 000 Q ss_pred ceeeeeeecccccccceeeeeecc Q lcl|Aclame:pro 301 LIPGSIEVAPEAGANATITAAAGE 324 (392) Q Consensus 301 ~~~~~v~v~~~~~~~~~~~~~~~~ 324 (392) ...+....+. T Consensus 400 --------------~l~~k~s~~~ 409 (409) T protein:vir:45 400 --------------ALVGKGSVGG 409 (409) T ss_pred --------------EEEeccCCCC Confidence 0000000000 No 122 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=97.73 E-value=2.3e-05 Score=46.05 Aligned_cols=268 Identities=9% Similarity=0.001 Sum_probs=125.5 Q ss_pred Ccc--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) ||- -.+.|+.+..++++.++++..+..++.+- .- .+..++||+.. ...+... +++..+...++.-. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~-----~~-~~~~~~ip~~~~~~~a~~v-----~E~~~~~~~~~~f~ 69 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK-----PI-PFNGEKVFTFTMDSEIDVV-----AESGKKTHGGVTLA 69 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee-----ec-cCCceEEEEEecCcceEEe-----cCCcccccccccee Confidence 994 33677778889999999998887776432 11 12347777643 2233332 23333443444444 Q ss_pred eEEEEEEeeeecceEeeHHHHhh---hccChHHHHHHHHHHHHHHHHHHHHHHHHh---cccccc------c-----ccc Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTF---DLESFATQILPRQVRGVADILEEGVRDMIV---GAPYEA------A-----GAV 140 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~---~~~~~~~~~~~~~~~ala~~vd~~~~~~~~---~~~~~~------~-----~~~ 140 (392) .+++...+. ..-+.|+++-+.+ +..++.+.+.++.++++++++|+.++.-.. +.+... . ... T Consensus 70 ~v~l~~~k~-a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) T protein:vir:16 70 PQTMVPIKV-EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) T ss_pred EEEEeeeeE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccc Confidence 555554332 2345677777643 345688889999999999999999875321 111000 0 000 Q ss_pred ccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecce Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLI 219 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v 219 (392) ........+.++.++...|..++.+.. .++++|..+..|.+-. +..|.-. ......|..+++.|.+|+.++.+ T Consensus 149 ~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lk-----d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v 222 (298) T protein:vir:16 149 APRGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQK-----DLQDNALFPELKWGATPDTINGLPVDVNKTV 222 (298) T ss_pred cccccccHHHHHHHHHHHhhhcCCCcc-EEEEcHHHHHHHHHhh-----ccCCCeeecCcccCCCCceecceeeEEeccc Confidence 111122346788888877877666433 5788999998886532 1122211 11223555678999999999988 Q ss_pred eecccceeecccccccchhhhccccccccceeec-ccceeeeeeecccccee-eee---ccc----ccceeeeEEEeecc Q lcl|Aclame:pro 220 PHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTIT-SNR---SLI----DTYFGLKVVEDPNG 290 (392) Q Consensus 220 ~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~---~~~----~~~~~~~~~~~~~~ 290 (392) |........ ....+........+ ............+.... .+. ..+ ....+..+.... T Consensus 223 ~~~~~~~~~-----------~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~-- 289 (298) T protein:vir:16 223 SDMSLTQRD-----------RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDAT-- 289 (298) T ss_pred ccccCCCcc-----------EEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeeccc-- Confidence 753211000 00000000000000 00000111100000000 000 000 000000000000 Q ss_pred ccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 291 VGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 291 ~~~~~~~~~~~~~~~v~v~~~~ 312 (392) .-+.+...+ T Consensus 290 -------------a~~~l~~at 298 (298) T protein:vir:16 290 -------------KFARVTEAN 298 (298) T ss_pred -------------ceEEEeecC Confidence 000000101 No 123 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.71 E-value=1.2e-05 Score=47.50 Aligned_cols=306 Identities=15% Similarity=0.145 Sum_probs=125.5 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcc----------- Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNL----------- 69 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~----------- 69 (392) |+.++-+ --|.+.+|...++.++|..+...- .+....|.||.++++.++... ..+...+-..+. T Consensus 19 ~~~~~~t-~y~~~k~L~~Aa~~lv~~~fA~~~---piPkn~GkTIk~r~y~pl~~~-~~pl~eGv~a~G~~~~~g~~y~~ 93 (401) T protein:vir:95 19 NSDQMQT-FFWLKKAIITARKEQYFMPLASVT---NMPKHYGKTIKVYEYVPLLDD-RNINDQGIDASGATIVNGNLYGS 93 (401) T ss_pred ccceeee-hhhHHHHHhhhhhhhhhhhccccc---ccccccCCeEEEEeccccccc-ccchhcCCCcccccccCcccccc Confidence 4444321 256777888888888888876432 344567999999997665432 122222111111 Q ss_pred ---------------------ccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHH-----HHHHH Q lcl|Aclame:pro 70 ---------------------TVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVA-----DILEE 123 (392) Q Consensus 70 ---------------------~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala-----~~vd~ 123 (392) +-+.+.-..+..+|.|+ ++=.+|+|+........-+.+++..-+..-+ +.+-+ T Consensus 94 ~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qy-G~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~ 172 (401) T protein:vir:95 94 SKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKF-GFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQK 172 (401) T ss_pred ccccceeecccccccccccccccccceeeeeeeeeeec-cCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHH Confidence 11111112344455443 3445788877666666555554332222222 22333 Q ss_pred HHHHHHhccccccc-------cccccccchhhHHHHHHHHHHhhhccCCC------------------CCEEEEch---- Q lcl|Aclame:pro 124 GVRDMIVGAPYEAA-------GAVHEVAPDEFFKGVNGARRALNELYIPQ------------------GRVLVVGT---- 174 (392) Q Consensus 124 ~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~i~~a~~~l~~~~vp~------------------~r~~vv~~---- 174 (392) +++......-+... ...........++++.++.+.|+++.+|. -|++++.| T Consensus 173 dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~ 252 (401) T protein:vir:95 173 DLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVP 252 (401) T ss_pred HHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchh Confidence 44422210111111 11122233446889999999999866654 25688888 Q ss_pred --HHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhhhccc---cccccc Q lcl|Aclame:pro 175 --AVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPM---GAVRST 249 (392) Q Consensus 175 --~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~---~~~~~~ 249 (392) ....++..++.|+...+.+.. ..+.+|++|.+.++.+..+.....-........+...+.+...... ...... T Consensus 253 di~a~~D~~~~~~fi~v~kYa~~--~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~ 330 (401) T protein:vir:95 253 ELKAMKDLFGNKAFIETQHYADA--GTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPM 330 (401) T ss_pred HHHHHHHhcCCCCceehhhcCCc--cccccccccccCceeEEecccceeecCCcccccccccccccccccCCCcceeeee Confidence 334666788999999999875 5688999999999988765543211100000000000000000000 000000 Q ss_pred eeecccceeeeeeeccccceeeeecccccceee------eEEEeeccccceeeeeccceeeeeeecccccccceeeeeec Q lcl|Aclame:pro 250 AISGDQRIAMRWLVDYDSTITSNRSLIDTYFGL------KVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAG 323 (392) Q Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~ 323 (392) ..-+................+.. ..+. -.|. ....-....++.-.....+ +. T Consensus 331 lV~G~dAf~~~~l~g~g~~~~~~-~ivk-~pG~~~ad~~DPlgQ~g~vgwK~~~a~~v--------------------L~ 388 (401) T protein:vir:95 331 LVVGDDSFTSIGFQTDGKSLKFT-VMTK-MPGKETADRNDPYGETGFSSIKWYYGILV--------------------KR 388 (401) T ss_pred eEEccccceecccccCCccccce-eEee-cCCcCCCCCCCcccceehhhhhhhhhhhe--------------------ec Confidence 00000000000000000000000 0000 0000 0000000000000000000 00 Q ss_pred cCeeEEEEEeecCc Q lcl|Aclame:pro 324 EDHTVQLKVTDANG 337 (392) Q Consensus 324 ~~~t~~~t~~~~~~ 337 (392) ...-+.+. +..|- T Consensus 389 ~e~m~~ie-s~a~~ 401 (401) T protein:vir:95 389 PERLALIK-TVAPL 401 (401) T ss_pred cceeEEEE-eecCC Confidence 00000000 01110 No 124 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.70 E-value=2.6e-05 Score=45.76 Aligned_cols=281 Identities=9% Similarity=0.076 Sum_probs=118.5 Q ss_pred Cc---cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MA---NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Ma---n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) +. --+++|+.|..++++.|++..++.++..|- +....| .+++|+... ..+.. .++ +....-.++.- T Consensus 135 ~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~----v~~~~~-~~~~p~~~~~~~a~~---v~E--~~~~~~~~~~f 204 (435) T protein:vir:80 135 LSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART----LPLSNG-NITIPRLKGGAIVGY---IGA--DTDIPTTQQQF 204 (435) T ss_pred cCCCCCccccchhHHHHHHHHHhhhchhhhcccee----eecCCC-ceEEEEEeCCcceee---ecc--Cccccccccce Confidence 11 123789999999999999888776652221 222223 367765422 22222 222 23333334444 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhcc--ChHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccc---------ccccccc Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLE--SFATQILPRQVRGVADILEEGVRDMIVG--APYEA---------AGAVHEV 143 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~--~~~~~~~~~~~~ala~~vd~~~~~~~~~--~~~~~---------~~~~~~~ 143 (392) ..+++...+. +.-+.|+++-+.++.. ++...+.++.+++|+.++|..++.--.. .+.+. ....... T Consensus 205 ~~i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 283 (435) T protein:vir:80 205 DDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGS 283 (435) T ss_pred eeEEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeeccccc Confidence 4555554333 2455677776665533 5777788889999999999988742110 01100 0111111 Q ss_pred cchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeec Q lcl|Aclame:pro 144 APDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHG 222 (392) Q Consensus 144 ~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~ 222 (392) .....+.++.++...|..+... .+-.++++|..+..|.+-. +..|.-. +....-+.+.|++|+.++.+|.. T Consensus 284 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-----d~~G~~l---~~~~~~~~l~G~pv~~~~~~p~~ 355 (435) T protein:vir:80 284 TLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR-----DGNGNKV---YPELANGMLKGYPVGKTTQVPIN 355 (435) T ss_pred chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh-----ccCCcee---ccCCCCCeEeeeeeEEecccccc Confidence 2223345666666666555443 3456789999988775421 1222211 11112357899999999998864 Q ss_pred ccceeecccccccchhhhccccccccceeeccc-ceeeeeeeccccceeeeecccccc-eeeeEEEeeccccceeeeecc Q lcl|Aclame:pro 223 DAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQ-RIAMRWLVDYDSTITSNRSLIDTY-FGLKVVEDPNGVGFVRARKIH 300 (392) Q Consensus 223 ~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 300 (392) .........+.++. .. ....+.+ +......... .........+..+ .+........... ..+. T Consensus 356 ~~~~~~~~~i~~gd---------~s-~~~i~~~~~~~i~~~~~~-~~~~~~~~~~~~f~~n~~~~r~~~r~d----~~~~ 420 (435) T protein:vir:80 356 LGEAGKESEIYFTD---------FG-DVFIGEEETLEIDYSKEA-TYKDADGHMVSAFQRDQTLIRVIAKND----FGPR 420 (435) T ss_pred ccCCCCcceEEEEE---------cc-cEEEEeecceEEEEeccc-cccccccchhhhhhcCcceeeeeeeeC----cEee Confidence 32211111111000 00 0000000 0000000000 0000000000000 0000000000000 0000 Q ss_pred ceeeeeeeccccccc Q lcl|Aclame:pro 301 LIPGSIEVAPEAGAN 315 (392) Q Consensus 301 ~~~~~v~v~~~~~~~ 315 (392) .+..-+.++++.... T Consensus 421 ~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 421 HVESIAVLSGVAWGA 435 (435) T ss_pred cccceEEEeccCCCC Confidence 111111111111111 No 125 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.65 E-value=6.5e-06 Score=49.00 Aligned_cols=276 Identities=11% Similarity=0.056 Sum_probs=115.0 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEec-cceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVP-APSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~-~~~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. -.+++|+.|..++++.+++..++..++.... +....| .+.+++- ....+.+ ..++...+..... T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~---~~~~~g-~~~~~~~~~~~~~~~---v~e~~~~~~~~~~ 182 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP---VFTRSG-SRTYEKRSKQKPMKP---LSENQQIPTNGDN 182 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee---ccCCcc-ceEEEEecCCcceee---ccccccccccccc Confidence 32 1236799999999999999998877764431 111112 3444432 2222222 2222221111112 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccc------cccccccccccchh Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP------YEAAGAVHEVAPDE 147 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~------~~~~~~~~~~~~~~ 147 (392) +.-..++++..+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.--.... .............. T Consensus 183 ~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~ 261 (404) T protein:vir:10 183 GKLERFNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSP 261 (404) T ss_pred cceeeeEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccc Confidence 2234444444333 2445688877777777899999999999999999998874211100 01111112223344 Q ss_pred hHHHHHHHHH-HhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEec-ceeeccc Q lcl|Aclame:pro 148 FFKGVNGARR-ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVEST-LIPHGDA 224 (392) Q Consensus 148 ~~~~i~~a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~-~v~~~~~ 224 (392) .++++.++.. .|... ...+-.++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|++|+... ..+..+. T Consensus 262 ~~~~~~~~~~~~l~~~-~~~~~~~v~n~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~ 335 (404) T protein:vir:10 262 ALKDFKKCKNVELLNV-FKATSSWIVNQDGFNYLDSLED-----KTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTE 335 (404) T ss_pred cHHHHHHHHHhhhhcc-ccCCCEEEEcHHHHHHHHHhhc-----cCCceeeccCcCCCCCccccceeeEEecccccCCCC Confidence 5777766543 34332 2233467899999988765311 1121111 11234555678899887533 2322211 Q ss_pred ceeecccccccchhhhccccccccceeec-ccceeeeeeeccccceeee--ecccccceeeeEEEeeccccceeeeeccc Q lcl|Aclame:pro 225 YLYHPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSN--RSLIDTYFGLKVVEDPNGVGFVRARKIHL 301 (392) Q Consensus 225 ~~~~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 301 (392) ... .+.+ +......... ..+...............+ ........+..+.... .+.. T Consensus 336 ~~~---~~~~---------gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~---------a~~~ 394 (404) T protein:vir:10 336 SAI---PVLL---------GDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSE---------ALLI 394 (404) T ss_pred Ccc---EEEE---------EeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---------ceEE Confidence 000 0000 0000000000 0000000000000000000 0000000111111000 0000 Q ss_pred eeeeeeeccc Q lcl|Aclame:pro 302 IPGSIEVAPE 311 (392) Q Consensus 302 ~~~~v~v~~~ 311 (392) ........+- T Consensus 395 ~~~~~aa~~~ 404 (404) T protein:vir:10 395 AEIPVESVQA 404 (404) T ss_pred EEeecccCCC Confidence 0000000000 No 126 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.65 E-value=2.5e-05 Score=45.81 Aligned_cols=272 Identities=15% Similarity=0.030 Sum_probs=113.6 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeee--ccccccccCCCccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGH--TRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~--~~~~~~~~~~~~~~~~ 72 (392) ++ -..+.|+.|..++++.+++...+..++..- .. .|.++++|+....... .....++ +....-. T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~a~~v~E--g~~~~~~ 189 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL---TM---TNTTIKYLMEKANRVVEGGFKTVAE--GGKKPYM 189 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee---ec---cCCceeEEEeccccccccccceecC--ccccccc Confidence 11 123568999999999999999887776432 11 2445666654332221 1222222 2222111 Q ss_pred cc-cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh------cccccccccc-cccc Q lcl|Aclame:pro 73 DF-TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV------GAPYEAAGAV-HEVA 144 (392) Q Consensus 73 ~~-~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~------~~~~~~~~~~-~~~~ 144 (392) +. .=..+++...+.. .-+.|+++-+.++ ..+...+.+..+++++..+|..++.--. +..+...... .... T Consensus 190 ~~~~f~~i~~~~~k~~-~~~~iS~ell~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~ 267 (413) T protein:vir:81 190 RFADFDIVTESLSKIA-GLTKITDEMIEDY-DFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSN 267 (413) T ss_pred CcccceeeEeeeeeEE-EeehhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccc Confidence 21 1234445443332 3457887755544 4466666677899999999998774110 0000000001 1112 Q ss_pred chhhHHHHHHHHHHhhhcc-CCCCCEEEEchHHHHHhhc--ccc--eeeeeccccceeeeEeeeeeeeEeeeEEEEecce Q lcl|Aclame:pro 145 PDEFFKGVNGARRALNELY-IPQGRVLVVGTAVTEQILN--DDR--FIKYESQGQSAVSALQEARLGRIYGYEIVESTLI 219 (392) Q Consensus 145 ~~~~~~~i~~a~~~l~~~~-vp~~r~~vv~~~~~~~l~~--~~~--~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v 219 (392) ....++.+.++...+..+. .+.+ .++++|..+..|.+ |.+ +............ -..+..+++.|++|+.+..+ T Consensus 268 ~~~~~~~i~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~~~~~~~~l~G~pv~~s~~~ 345 (413) T protein:vir:81 268 KDELADSIYKAMTNISLATPFQAD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGS-GGIMLDPAPWGLRTVQSQVV 345 (413) T ss_pred cchhHHHHHHHHHHhhhhccCCCc-EEEEcHHHHHHHHHhhccCCceeccccccccccc-cccccCceecceeeEEcCCC Confidence 2345666666655544332 2223 37889999888754 211 1111000000000 01122357889999999988 Q ss_pred eecccceeeccc-ccccchhhhccccccccceeecccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceee Q lcl|Aclame:pro 220 PHGDAYLYHPTA-FIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 220 ~~~~~~~~~~~a-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 296 (392) |.+.......+. .....+ .+....+..........+... ......+.+... T Consensus 346 ~~~~~~~gd~~~~~~~~~~-----------------~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~--------- 399 (413) T protein:vir:81 346 PVGKPVVGAFRSAASVLRK-----------------GGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFP--------- 399 (413) T ss_pred CcccEEEEecccEEEEEEe-----------------cceEEEEeccccchhhcCcEEEEEEEeeccEEecc--------- Confidence 865433221111 111000 001111100000000000000 000001111100 Q ss_pred eeccceeeeeeecc Q lcl|Aclame:pro 297 RKIHLIPGSIEVAP 310 (392) Q Consensus 297 ~~~~~~~~~v~v~~ 310 (392) ..+........+.| T Consensus 400 ~a~~~l~~~~~~~p 413 (413) T protein:vir:81 400 EAIVQLDVAEVVTP 413 (413) T ss_pred cceEEEEecCCCCC Confidence 00000000001111 No 127 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.64 E-value=8e-06 Score=48.51 Aligned_cols=269 Identities=13% Similarity=0.117 Sum_probs=120.7 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc-cCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF-TEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 79 (392) =...+++|+.+..++++.|++...+.+++..- . . .|+ ++||+.... .......+ +..+...+. .-+. T Consensus 144 ~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~--~-~---~g~-~~ip~~~~~--~~a~~v~E--~~~~~~~~~~~f~~- 211 (425) T protein:vir:95 144 AGGELTIPEVVVNRIMDIMGDYTTLYPLVDKI--R-V---KGT-TRILVDTDT--SPATWIEQ--SGALPTGDVGTIAS- 211 (425) T ss_pred ccCceeccHHHHHHHHHHHHhhhhHHHhhcee--e-c---Cce-eEEEEecCC--cccccccc--ccccccccccccce- Confidence 01234789999999999999999887776431 1 1 243 577664322 22222222 222322221 2234 Q ss_pred EEEEEeeee-cceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh---cccccc------ccccccccchhhH Q lcl|Aclame:pro 80 PVTLTDVAY-HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV---GAPYEA------AGAVHEVAPDEFF 149 (392) Q Consensus 80 ~~~i~~~~~-~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~---~~~~~~------~~~~~~~~~~~~~ 149 (392) ++++.++. .-+.|+++-+.++..++...+..+.+++|+.++|..++.-=. ..|.+. ............+ T Consensus 212 -i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~ 290 (425) T protein:vir:95 212 -IDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLL 290 (425) T ss_pred -eeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchH Confidence 44444444 445788887888888999999999999999999998875211 011110 0111122234468 Q ss_pred HHHHHHHHHhhhccCCC-CCEEEEchHHHH-HhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeeccccee Q lcl|Aclame:pro 150 KGVNGARRALNELYIPQ-GRVLVVGTAVTE-QILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLY 227 (392) Q Consensus 150 ~~i~~a~~~l~~~~vp~-~r~~vv~~~~~~-~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~ 227 (392) +++.++...+..+..+. +-.+++++..+. .+..-... .+..|. .....-.+..+.+.|.+|+.++.+|....... T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~--kd~~g~-~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~G 367 (425) T protein:vir:95 291 KNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQ--VDSNGN-VVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFG 367 (425) T ss_pred HHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhh--cCCCCc-eeeccCCCCCccccceeeEEcCcCCCccEEEE Confidence 88888776666554433 334566766543 22211100 011121 01111234456788999999999986543221 Q ss_pred ecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 228 HPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 228 ~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) ..+......+ .+....... +... ....+ .+... ............+.......+ T Consensus 368 d~~~~~~~~~-----------------~~~~i~~~~--~~~f--~~~~~-~~~~~----~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 368 EFEQYTLVER-----------------ENITIDSST--HVKF--TEDQT-AFRGK----GRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred ecccEEEEee-----------------cceEEEeec--cccc--ccCce-EEEEE----EeeCcEeecccceEEEEecCc Confidence 1111000000 000000000 0000 00000 00000 000000000000111011111 Q ss_pred eccc Q lcl|Aclame:pro 308 VAPE 311 (392) Q Consensus 308 v~~~ 311 (392) +.+. T Consensus 422 ~~g~ 425 (425) T protein:vir:95 422 VQGA 425 (425) T ss_pred CCCC Confidence 1111 No 128 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.64 E-value=2.1e-05 Score=46.24 Aligned_cols=276 Identities=9% Similarity=0.051 Sum_probs=119.6 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccc- Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSD- 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~- 73 (392) |+ -.+++|+.+...+++.+++..++..++.... +....| ++.++..... .......++ +....... T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~-~~~~~~~~~~-~~~a~~v~E--~~~~~~~~~ 181 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVEN---VTTLTG-SRVYEKWADI-TGLAKLDDE--GGQIGQNDD 181 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceee---ccCCcc-eEEEEeeccC-Ccceeeecc--ccccccccc Confidence 32 1357899999999999999998877765432 111111 2333332211 111112222 22222111 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+.+-+.++..++...+.+...++|+..+|..++.-... .........|+++. T Consensus 182 ~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~--------~~~~~~~~~~d~i~ 252 (397) T protein:vir:49 182 PKLSLIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGT--------LPNKPTLAKWDDII 252 (397) T ss_pred cceeeeEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------ccccccccCHHHHH Confidence 2234555555443 24456787777777788999999999999999999988743211 11112234588899 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecc--eeecccceeecc Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTL--IPHGDAYLYHPT 230 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~~~~~ 230 (392) ++...|+.+..+. -.++++|..+..|.+-. +..|.-.. ..+..|.-+.+.|++|+.... +|....... T Consensus 253 ~~~~~l~~~~~~~-a~~v~n~~~~~~l~~lk-----d~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~--- 323 (397) T protein:vir:49 253 DLQAKVDPAIKQT-SLFLTNTSGFTALKKVK-----NAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAM--- 323 (397) T ss_pred HHHHhhhhhhcCC-CEEEEcHHHHHHHHHhh-----ccCCceeecccccCCCCceecceeeEEecccccccccCCce--- Confidence 8888887666543 46889999998885421 11222110 112345556899998876443 332221100 Q ss_pred cccccchhhhccccccccceeecc-cceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+.++ ........+. .+...............+. .......++.+.... .+.. +. T Consensus 324 ~~~~g---------d~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~---------a~~~----~~ 381 (397) T protein:vir:49 324 PLYFG---------DLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTE---------AFVP----AS 381 (397) T ss_pred eEEEe---------eccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEeccc---------ceEE----EE Confidence 00000 0000000000 0000000000000000000 000000000000000 0000 00 Q ss_pred ecccccccceeeeeecc Q lcl|Aclame:pro 308 VAPEAGANATITAAAGE 324 (392) Q Consensus 308 v~~~~~~~~~~~~~~~~ 324 (392) +.... +....+...+. T Consensus 382 ~~~~~-~~~~~~~~~~~ 397 (397) T protein:vir:49 382 FKAIA-DQKAKLSTAGA 397 (397) T ss_pred ecccc-cccCcccccCC Confidence 00000 00000000000 No 129 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.63 E-value=3.5e-05 Score=45.01 Aligned_cols=271 Identities=8% Similarity=0.042 Sum_probs=122.1 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |+- ..+.|+.|+.++++.+++...+..+++.-. ..... .+..|+...... ......++ +..+.- .. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~---~~~~~-g~~~~~~~~~~~-~~a~~v~E--g~~~~~~~~ 77 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVEN---VTTLT-GSRVYEKWTDIT-GLANIDDE--AGKIADIDD 77 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeee---ccCCc-ceEEEEeecCCC-cceeeecC--Ccccccccc Confidence 442 347899999999999999999877764321 11111 234444322111 11222222 222221 22 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+++-+.++..++...+.++.+++++...|+.++.-.... ........|++|+ T Consensus 78 ~~~~~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~--------~~~~~~~~~d~i~ 148 (293) T protein:vir:48 78 PKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKL--------PTKPTLTKWDDII 148 (293) T ss_pred cceeEEEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccc--------cccccccCHHHHH Confidence 3345555555443 245678888888888899999999999999999999887543321 1122334688999 Q ss_pred HHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecc--eeecccce---- Q lcl|Aclame:pro 154 GARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTL--IPHGDAYL---- 226 (392) Q Consensus 154 ~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~---- 226 (392) ++...|..+.-+. -.++++|..+..|.+-.. ..|.-. ...+.+|..+++.|++|+.... ++...... T Consensus 149 ~~~~~l~~~~~~~-a~~vmn~~~~~~L~~lkd-----~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~ 222 (293) T protein:vir:48 149 DLEAKVDPAIKQT-SFFLTNTSGFTALKKVKN-----ALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLY 222 (293) T ss_pred HHHHhhhhhhcCC-CEEEEcHHHHHHHHHhhc-----cCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEE Confidence 9888887554433 367889999988754211 122111 0113355667899999876443 22211100 Q ss_pred e-ecc-cccccchhhhccccccccceeecccceeeeeeecc--ccceeeeecccccceeeeEEEeeccccceeeeeccce Q lcl|Aclame:pro 227 Y-HPT-AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDY--DSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) Q Consensus 227 ~-~~~-a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (392) + ..+ ......+ .+......... ...............++........ . .+ T Consensus 223 ~gd~~~~~~~~~~-----------------~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~---~---~l--- 276 (293) T protein:vir:48 223 FGDLKQAVTLFDR-----------------QQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAF---V---PA--- 276 (293) T ss_pred EEeccceEEEEEe-----------------cceEEEEecccchhhhcCeEEEEEEEeeCcEEecccce---E---EE--- Confidence 0 000 0000000 00000000000 0000000000000000000000000 0 00 Q ss_pred eeeeeecccccccceeeeeeccCeeEEEEEe Q lcl|Aclame:pro 303 PGSIEVAPEAGANATITAAAGEDHTVQLKVT 333 (392) Q Consensus 303 ~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~ 333 (392) .+ +......-++..+.. T Consensus 277 ----~~----------~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 277 ----SF----------KAIADQKGNIGSTAV 293 (293) T ss_pred ----Ee----------eccccCCccccccCC Confidence 00 000000000000000 No 130 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.61 E-value=2.5e-05 Score=45.77 Aligned_cols=271 Identities=9% Similarity=0.023 Sum_probs=120.0 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-ccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (392) |. -.++.|+.+..++++.+++..++..++.... .. +.+.+++.|...........++ +.... ... T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~---~~---~~~~~~~~~~~~~~~~a~~v~E--g~~~~~~~~ 162 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEP---VT---TLSGSRVFKKRSQQTGFVEVAE--GAAIGEKAT 162 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeee---cc---CCceeEEEEeecCCcceeeecc--ccccccccc Confidence 32 1247899999999999999998877765331 11 2233443332222122222222 22221 122 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-+.++++..+. +.-+.|+++-+..+..++...+.+..+++++..+|..++...... .......++++. T Consensus 163 ~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~---------~~~~~~~~~~i~ 232 (371) T protein:vir:81 163 PQFTLLQYQVKKY-AGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTK---------AKTAIADLDGLK 232 (371) T ss_pred cceeeEEeeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccccccHHHHH Confidence 3335555555443 234578888777777889899999999999999998887632211 111223456666 Q ss_pred HHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeeccc Q lcl|Aclame:pro 154 GAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA 231 (392) Q Consensus 154 ~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a 231 (392) .+. ..|....- .+-.++++|..+..|.+-.. ..|.-.. .....|..+.+.|++|+.+..+|.+........+ T Consensus 233 ~~~~~~l~~~~~-~~a~~vmn~~~~~~L~~lkd-----~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 306 (371) T protein:vir:81 233 QIINVQLDPVFR-STSSVIVNQDAFNWLDTLKD-----QNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGA 306 (371) T ss_pred HHHHhhcchhhh-cCCEEEEcHHHHHHHHHhhc-----cCCCeeeecccCCCCCceecceeEEEecccccCccccccccC Confidence 543 23433222 23468899999988864211 1121110 1123466678999999999888754332211000 Q ss_pred ccccchhhhccccccccceeecc-cceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 232 FIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 232 ~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) .......+.......... .+.............. ...+ .+.... ...........+... .+.. T Consensus 307 -----~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~--~~~v-~~~~~~----r~d~~~~~~~a~~~~--~~~~-- 370 (371) T protein:vir:81 307 -----QFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFE--TDAT-LWRAIE----RMDVKMRDDEAFVFG--EVQL-- 370 (371) T ss_pred -----CcceEEEEehhceEEEEeecceEEEEeccccchhh--cCce-EEEEEE----eeccEEecccceEEE--EEec-- Confidence 000000000000000000 0000000000000000 0000 000000 000000000000000 0000 Q ss_pred cccc Q lcl|Aclame:pro 311 EAGA 314 (392) Q Consensus 311 ~~~~ 314 (392) + T Consensus 371 ---A 371 (371) T protein:vir:81 371 ---A 371 (371) T ss_pred ---C Confidence 0 No 131 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.60 E-value=1.5e-05 Score=47.10 Aligned_cols=260 Identities=11% Similarity=0.096 Sum_probs=122.8 Q ss_pred Ccc----------------ccccH-HHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecc-ceeeecccccc Q lcl|Aclame:pro 1 MAN----------------AFSKP-TAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRG 62 (392) Q Consensus 1 Man----------------~~~~~-~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~ 62 (392) |+. ..++| +++++++++.|++..++..+-.|- +....| .++||+-. ...+... T Consensus 347 ~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~----~~~~~g-~~~ip~~~~~~~a~wv---- 417 (632) T protein:vir:96 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM----LPGLVG-DVDIPKKTSGANFYWI---- 417 (632) T ss_pred hhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceE----eecCCc-ceEEEEEeCCceeEee---- Confidence 110 12444 566889999999988776652222 222223 37776632 2222222 Q ss_pred ccCCCccccccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh--ccccccc--- Q lcl|Aclame:pro 63 AGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV--GAPYEAA--- 137 (392) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~--~~~~~~~--- 137 (392) +++......++.-..+++...+. +.-+.|+.+-+.++..++...+.....++|+.++|..++.--. ..+.+.. T Consensus 418 -~E~~~~~~s~~~f~~i~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~ 495 (632) T protein:vir:96 418 -GEDEDVQDSDFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMT 495 (632) T ss_pred -cCCccccccccceeeEEeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecc Confidence 23333444455555666655333 2345677776777778888888899999999999998874211 1111110 Q ss_pred -cc-cccccchhhHHHHHHHHHHhhhccCCCC-CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEE Q lcl|Aclame:pro 138 -GA-VHEVAPDEFFKGVNGARRALNELYIPQG-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIV 214 (392) Q Consensus 138 -~~-~~~~~~~~~~~~i~~a~~~l~~~~vp~~-r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~ 214 (392) .. .........|..+.++...+...++..+ -..+++|.....+.... + .+..|. .+.. -+.+.|+.++ T Consensus 496 ~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~-l--~d~~G~----~i~~--~~~l~G~pv~ 566 (632) T protein:vir:96 496 GVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-V--FDNTGE----RIWQ--NNEVNGYRAE 566 (632) T ss_pred cccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHh-c--cCCCCc----eeec--CCeecccceE Confidence 00 0111233468889998888877776543 45678887766654321 1 112222 1222 2468899999 Q ss_pred EecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 215 ESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 215 ~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .++.+|.+.......+.+................. ..............+ .... T Consensus 567 ~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~~--~~~~~v~~~~~~~~d------------------------~~v~ 620 (632) T protein:vir:96 567 ASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYTK--AASDGLVLRVFQDVD------------------------AGVR 620 (632) T ss_pred eccccccCcEEEeecceEEEEEecceEEEEccccc--cccCceEEEEEeecC------------------------ceee Confidence 99999866543322221111111000000000000 000000000000000 0000 Q ss_pred eeeeccceeeee Q lcl|Aclame:pro 295 RARKIHLIPGSI 306 (392) Q Consensus 295 ~~~~~~~~~~~v 306 (392) ....+....... T Consensus 621 ~~~af~~~k~~A 632 (632) T protein:vir:96 621 RKEAFCIAKKGA 632 (632) T ss_pred chhhhhheeecC Confidence 000010000000 No 132 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.60 E-value=3.1e-05 Score=45.29 Aligned_cols=279 Identities=9% Similarity=-0.012 Sum_probs=120.1 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) +.. .-+.|+.+++++++.+++...+..++.+- .- .+.+.++|+-.. ..+.. . +++..+...++.- T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~-----~~-~~~~~~~p~~~~~~~a~~---v--~Eg~~~~~~~~~f 90 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI-----PM-GTTGQKIPHWTGDVSASW---I--GEGDMKPITKGNM 90 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhccee-----ec-cCCceEEEEEeCCcceEE---e--cCCccccccccce Confidence 111 12568888899999999999887776542 11 134577766332 22222 2 2344444455555 Q ss_pred ceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh-cccc----------ccccccccccc Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV-GAPY----------EAAGAVHEVAP 145 (392) Q Consensus 77 ~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~-~~~~----------~~~~~~~~~~~ 145 (392) ..+++...+. +.-+.++++-+.++..++...+.++..+++++++|+.++.--. +.+. ........... T Consensus 91 ~~i~~~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~ 169 (326) T protein:vir:42 91 TSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNA 169 (326) T ss_pred eEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccc Confidence 6666666443 4566788888888888999999999999999999998873110 0000 00001111111 Q ss_pred hhhHHHH--HHHHHHhhhccCCCCCEEEEchHHHHHhhc--ccc----eeeeeccccceeeeEeeeeeeeEeeeEEEEec Q lcl|Aclame:pro 146 DEFFKGV--NGARRALNELYIPQGRVLVVGTAVTEQILN--DDR----FIKYESQGQSAVSALQEARLGRIYGYEIVEST 217 (392) Q Consensus 146 ~~~~~~i--~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~--~~~----~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~ 217 (392) +..+.++ ..+...+.... ..+-.++++|..+..|.+ |.. |......|. ......+.+.|++++.++ T Consensus 170 ~~~~~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-----~~~~~~~~l~G~pv~~~~ 243 (326) T protein:vir:42 170 DLTVYDAVAVNALSLLVNAG-KKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEE-----NSPFRLGRIVARPTILSD 243 (326) T ss_pred cchhHHHHHHHHHhhhhhhc-cCccEEEEeHHHHHHHHHhhccCCceeeccccccCc-----cccccCceeeeeeEEEcC Confidence 1222222 22222222211 134467899999988854 211 111111111 111234579999999999 Q ss_pred ceeecccceeeccc--ccccchhhhccccccccceeecccceeee-eeeccccceeeeecccccceeeeEEEeeccccce Q lcl|Aclame:pro 218 LIPHGDAYLYHPTA--FIMATRAPAPPMGAVRSTAISGDQRIAMR-WLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFV 294 (392) Q Consensus 218 ~v~~~~~~~~~~~a--~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (392) .+|.+....+.... ..+.........-....+...+....... .....+.. ..-.....++.+...... T Consensus 244 ~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~----~~r~~~~~d~~v~~~~a~---- 315 (326) T protein:vir:42 244 HVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLV----AVRVEAEYAFHCNDKDAF---- 315 (326) T ss_pred CCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcE----EEEEEEEeccEEecccce---- Confidence 88865543321111 00000000000000000000000000000 00000000 000000011111100000 Q ss_pred eeeeccceeeeeeecccccccc Q lcl|Aclame:pro 295 RARKIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 295 ~~~~~~~~~~~v~v~~~~~~~~ 316 (392) +.+..+..... T Consensus 316 -----------~~l~~~~~~~~ 326 (326) T protein:vir:42 316 -----------VKLTNVDATEA 326 (326) T ss_pred -----------EEEeeccccCC Confidence 00001000100 No 133 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.59 E-value=3.4e-05 Score=45.10 Aligned_cols=278 Identities=9% Similarity=0.055 Sum_probs=116.8 Q ss_pred Cc--c----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeec-cccccccCCCccc-cc Q lcl|Aclame:pro 1 MA--N----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHT-RKLRGAGAERNLT-VS 72 (392) Q Consensus 1 Ma--n----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~-~~~~~~~~~~~~~-~~ 72 (392) |. . .++.|+.|..++++.+++...+..++....-. +.+.+++.+....... ..... ++.... .. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~v~--E~~~~~~~~ 187 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS------TSSGSRVYEKWTDVTPLKAMDE--EDGKIPDLD 187 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeecc------CCcceEEEEeecCCcccccccc--ccccccccc Confidence 21 1 23679999999999999999888776543111 2223333332211110 11111 122221 12 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHH Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGV 152 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 152 (392) .+.-..++++..+. +.-+.|+++-+..+..++...+.++..++|+.++|..++.--.. .........++++ T Consensus 188 ~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~--------~~~~~~~~~~~~i 258 (408) T protein:vir:74 188 NPRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGT--------VPKKPTIANFDDV 258 (408) T ss_pred ccceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccccHHHH Confidence 23345555555443 34567888888878889999999999999999999987742111 1111222346777 Q ss_pred HHHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecc--eeecccceee Q lcl|Aclame:pro 153 NGAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTL--IPHGDAYLYH 228 (392) Q Consensus 153 ~~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~~~ 228 (392) +++. ..|+.+.. .+-.++++|..+..|.+-. +..|.-.. .....|..+.+.|++|+.+.. +|....... T Consensus 259 ~~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~- 331 (408) T protein:vir:74 259 ITMINTSVDPAII-ATSSLLTNQSGLNKLALVK-----TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVY- 331 (408) T ss_pred HHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhh-----cCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcc- Confidence 7764 35554333 2346788999998886421 11222111 012345556899999887543 332211000 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceeeeeccceeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRARKIHLIPGS 305 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (392) .+.+ +......... ..+....+..........+... .....++.+....... . T Consensus 332 --~i~~---------gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~-------------~ 387 (408) T protein:vir:74 332 --PLYY---------GDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV-------------A 387 (408) T ss_pred --eEEE---------EehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceE-------------E Confidence 0000 0000000000 0000000000000000000000 0000000000000000 0 Q ss_pred eeecccccccceeeeeeccCeeEEEEEeecCcccccceE Q lcl|Aclame:pro 306 IEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALC 344 (392) Q Consensus 306 v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~v 344 (392) +.+..+... .+...+-+++. + T Consensus 388 ~~~~~~~~~-------~~~~~~~~~~~-----------~ 408 (408) T protein:vir:74 388 GSFTAIADQ-------VGNFKTTTSTA-----------V 408 (408) T ss_pred EEeecccCC-------CCCCCCCcccc-----------C Confidence 000000000 00000000000 0 No 134 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.55 E-value=4.7e-05 Score=44.33 Aligned_cols=281 Identities=10% Similarity=0.079 Sum_probs=116.1 Q ss_pred Cc-c-------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccc Q lcl|Aclame:pro 1 MA-N-------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTV 71 (392) Q Consensus 1 Ma-n-------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~ 71 (392) ++ + -++.|+.|..++++.+++...+..+..|. +....| .+++|+... ..+.. .+ ++....- T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~----~~~~~~-~~~~p~~~~~~~a~~---v~--E~~~~~~ 199 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART----LPLSNG-NITIPRLKGGAIVGY---IG--ADTDIPT 199 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhccee----eecCCC-ceEEEEEeCCcceee---ec--cCccccc Confidence 11 1 13689999999999999988776653322 222223 367766422 22222 22 2333333 Q ss_pred ccccCceEEEEEEeeeecceEeeHHHHhhhccC--hHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccc---------cc Q lcl|Aclame:pro 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES--FATQILPRQVRGVADILEEGVRDMIVG--APYEA---------AG 138 (392) Q Consensus 72 ~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~--~~~~~~~~~~~ala~~vd~~~~~~~~~--~~~~~---------~~ 138 (392) .++.-..+++...+. +.-+.|+++-+.++..+ +...+.++..++|++++|+.++.--.. .+.+. .. T Consensus 200 ~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~ 278 (435) T protein:vir:14 200 TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVIT 278 (435) T ss_pred cccceeEEEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceec Confidence 344334455544333 24456777766665444 667778888999999999988732100 01110 01 Q ss_pred ccccccchhhHHHHHHHHHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEec Q lcl|Aclame:pro 139 AVHEVAPDEFFKGVNGARRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVEST 217 (392) Q Consensus 139 ~~~~~~~~~~~~~i~~a~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~ 217 (392) .....+....+.++.++...|..+..- .+..++++|..+..|.+-. +..|.-. +....-|.+.|++|+.++ T Consensus 279 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-----d~~G~~l---~~~~~~g~l~G~Pv~~~~ 350 (435) T protein:vir:14 279 ASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLR-----DGNGNKV---YPELANGMLKGYPVGKTT 350 (435) T ss_pred cccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhh-----ccCCcee---ccCCCCCeeecceeEeec Confidence 111122223345566655555544331 3456789999998775421 1222211 111123578999999999 Q ss_pred ceeecccceeecccccccchhhhccccccccceeeccc-ceeeeeeeccccceeeeecccccc-eeeeEEEeecccccee Q lcl|Aclame:pro 218 LIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQ-RIAMRWLVDYDSTITSNRSLIDTY-FGLKVVEDPNGVGFVR 295 (392) Q Consensus 218 ~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 295 (392) .+|...........+.++ ... ....+.+ +...... .+.............+ ............. T Consensus 351 ~~p~~~~~~~~~~~i~~g---------d~s-~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d--- 416 (435) T protein:vir:14 351 QVPINLGETGKESEIYFT---------DFG-DVFIGEEETLEIDYS-KEATYKDADGHMVSAFQRDQTLIRVIAKND--- 416 (435) T ss_pred cccccccCCCccceEEEe---------ecc-cEEEEEecccEEEEe-ccccccccccchhhhhhcChhheeeeeeeC--- Confidence 987653221111100000 000 0000000 0000000 0000000000000000 0000000000000 Q ss_pred eeeccceeeeeeeccccccc Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~~ 315 (392) ..+..+..-+.++++..-. T Consensus 417 -~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 417 -FGPRHVESIAVLAGVAWGA 435 (435) T ss_pred -ceeecccceEEEecCCCCC Confidence 0000000011111111111 No 135 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.53 E-value=2.7e-05 Score=45.61 Aligned_cols=271 Identities=10% Similarity=0.080 Sum_probs=119.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |. -.+++|+.|..++++.+++..++..+++.- .. .+.+.++|.+.... .......++... ...+.+ T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~~~~~~E~~~~-~~~~~~ 182 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT-----PV-TTPKGTYPILKRAT-DRFSSVAELAEN-PALAEP 182 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee-----ec-cCCceEEEEEecCC-Cccccccccccc-cccccc Confidence 11 125789999999999999999888776532 11 23456776654321 112222222111 112334 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNG 154 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 154 (392) .-..+++.+.+.. .-+.|+++-+.++..++...+.+..+++++..+|..++....... .........++++.+ T Consensus 183 ~~~~v~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~------~~~~~~~~~~d~l~~ 255 (394) T protein:vir:10 183 EFEQVDWSVSTYR-GAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT------AKATTTDTLVDSLKH 255 (394) T ss_pred cceeEEeeeeeeE-eeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------cccccccccHHHHHH Confidence 4456666664443 345788888888888999999999999999999998876543221 122233345667766 Q ss_pred HHH-HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccc-----eeeeEeeeeeeeEeeeEEEEecceeecccceee Q lcl|Aclame:pro 155 ARR-ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQS-----AVSALQEARLGRIYGYEIVESTLIPHGDAYLYH 228 (392) Q Consensus 155 a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~-----~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~ 228 (392) +.. .++.+. +-.++++|..+..|.+-.. ..|.- .......|..+++.|++|+.............. T Consensus 256 ~~~~~~~~~~---~a~~vmn~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~ 327 (394) T protein:vir:10 256 ILNVDLDPAY---SRALVVTQSLFNTLDTLKD-----KNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQ 327 (394) T ss_pred HHHhhhhhhc---cCEEEecHHHHHHHHHhhc-----cCCCeeeeccccccccCCcccccccceeEEecccccCCCCCce Confidence 543 233221 3468899999888864211 11110 001112244467899999875432111110000 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+.++ ......... ..+....+... .. . ...........+.+.. ...+ ..++ T Consensus 328 --~i~~g---------d~s~~~~~~~~~~~~v~~~~~-~~-~-~~~~~~~~r~d~~~~~---------~~ai----~~~~ 380 (394) T protein:vir:10 328 --KAFVG---------DLKRGVLFADRQQVTLAWEDS-KI-Y-GRYLGAAFRFGVKQAD---------SNAG----YFVT 380 (394) T ss_pred --EEEEe---------eccccEEEEeecceEEEEecc-cc-c-ceeEEEEEEeccEEec---------cccE----EEEE Confidence 00000 000000000 00011111000 00 0 0000000000000000 0000 0000 Q ss_pred ecccccccceeeeeeccCe Q lcl|Aclame:pro 308 VAPEAGANATITAAAGEDH 326 (392) Q Consensus 308 v~~~~~~~~~~~~~~~~~~ 326 (392) +... ... ...+.+. T Consensus 381 ~~~~---~~~--~~~~~~~ 394 (394) T protein:vir:10 381 NTDA---ASG--STSGTGK 394 (394) T ss_pred eecc---cCC--CCCCCCC Confidence 0000 000 0011111 No 136 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.49 E-value=4.6e-05 Score=44.36 Aligned_cols=278 Identities=8% Similarity=0.041 Sum_probs=117.1 Q ss_pred Ccc-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceee-eccccccccCCCccc-cccccCc Q lcl|Aclame:pro 1 MAN-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRG-HTRKLRGAGAERNLT-VSDFTED 77 (392) Q Consensus 1 Man-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~ 77 (392) .++ .++.|+.|+.++++.+++...+..++..-. .. +...+++.|..... ......+ ++.... ...+.-. T Consensus 121 ~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~---~~---~~~~~~~~~~~~~~~~~a~~v~--E~~~~~~~~~~~~~ 192 (408) T protein:vir:10 121 DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---VS---TSNGSRVYEKWTDVTPLTVMDA--EDGKIPDLDNPQLT 192 (408) T ss_pred ccCCceeccHhHHHHHHHHHHhhchhhhhcceee---cc---CCcceEEEeeccccccceeeec--CccccccccCccee Confidence 221 256799999999999999998877765321 11 12234443322111 1111122 222222 1122334 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHH- Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGAR- 156 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~- 156 (392) .+++...+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.-..... .......+++++++. T Consensus 193 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~--------~~~~~~~~~~l~~~~~ 263 (408) T protein:vir:10 193 IIKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP--------KKPTIAKFDDVITMIN 263 (408) T ss_pred eEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------cccccccHHHHHHHHH Confidence 555555333 2445688877777788998999999999999999998875432211 111223467777754 Q ss_pred HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecc--eeecccceeeccccc Q lcl|Aclame:pro 157 RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTL--IPHGDAYLYHPTAFI 233 (392) Q Consensus 157 ~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~~~~~a~~ 233 (392) ..|+...- .+-.++++|..+..|.+-.. ..|.-.. ....+|..+.+.|++|+.... +|..+.... .+. T Consensus 264 ~~~~~~~~-~~a~~v~n~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~---~i~ 334 (408) T protein:vir:10 264 TAVDPAII-ATSSLLTNQSGLNKLALVKT-----AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVY---PLY 334 (408) T ss_pred Hhhhhhhc-cCCEEEEcHHHHHHHHHhhc-----cCCceEeccCcCCCCCceecceeeEEecccccCccCCCce---EEE Confidence 34543222 34468899999998865321 1222111 112345566899999887543 332221000 000 Q ss_pred ccchhhhccccccccceeec-ccceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 234 MATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 234 ~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) ++ ......... ..+....+..........+. .-.....++.+....... .+..... .+..+ T Consensus 335 ~g---------d~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~------~~~~~~~-~~~~~ 398 (408) T protein:vir:10 335 YG---------DMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV------AGSFSAI-ADQVG 398 (408) T ss_pred EE---------ehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEE------EEEeecc-ccCCC Confidence 00 000000000 00000000000000000000 000000111111000000 0000000 00000 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) .........+ T Consensus 399 ~~~~~~~~~~ 408 (408) T protein:vir:10 399 NFKTTTSTAV 408 (408) T ss_pred CCCCCCcccC Confidence 0000000000 No 137 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.48 E-value=5.9e-05 Score=43.78 Aligned_cols=273 Identities=11% Similarity=0.052 Sum_probs=120.4 Q ss_pred Ccc-------ccccHHHHHHHHH-HHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MAN-------AFSKPTAVVDTAI-QMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man-------~~~~~~~~~~~~~-~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) ++. -.++|+-+...++ ..+++...+..++.. +.. .|+ +.+|+-.. .......+ ++..+... T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~-----~~~-~g~-~~~~~~~~--~~~a~~v~--Eg~~~~~~ 317 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQ-----VVA-TGD-VWHGVSSA--AVQWSWDA--EFEEVSDD 317 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhccc-----ccC-Ccc-eEEEEecC--Ccceeecc--cCcccccc Confidence 111 2367777766655 556665666665432 121 243 55554221 12222222 33444445 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH------hcc----cccccccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI------VGA----PYEAAGAVHE 142 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~------~~~----~~~~~~~~~~ 142 (392) .+.-..+++...+. +.-+.|+.+-+.. ..++...+.+...++++..+|..++.-- .+. .......... T Consensus 318 ~~~~~~i~~~~~k~-~~~~~is~ell~d-~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~ 395 (543) T protein:vir:81 318 SPEFGQPEIPVKKA-QGFVPISIEALQD-EANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPV 395 (543) T ss_pred ccccceeeeeeeee-EeeehhhHHHHhc-cHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccccc Confidence 55556666666444 3456788765554 4689899999999999999999886310 110 0001111122 Q ss_pred ccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeec Q lcl|Aclame:pro 143 VAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHG 222 (392) Q Consensus 143 ~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~ 222 (392) ......+++++++...|..+..+ +-.++++|..+..|.+-.. ..|.-....+..|..+.+.|+.|+.+..+|.. T Consensus 396 ~~~~~~~~~~~~~~~~l~~~~~~-~~~~v~n~~~~~~l~~lkd-----~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~ 469 (543) T protein:vir:81 396 TAETFALADVYAVYEQLAARHRR-QGAWLANNLIYNKIRQFDT-----QGGAGLWTTIGNGEPSQLLGRPVGEAEAMDAN 469 (543) T ss_pred ccccccHHHHHHHHHhhhccccC-CcEEEEcHHHHHHHHHhhc-----CCCceeccCcCCCCCccccceeeEEecccccc Confidence 33345688888887777654432 3367899999988864211 11211111233455568999999999998865 Q ss_pred ccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeeccc----ccceeeeEEEeeccccceeeee Q lcl|Aclame:pro 223 DAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLI----DTYFGLKVVEDPNGVGFVRARK 298 (392) Q Consensus 223 ~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~ 298 (392) .......+..... .+....+......+..+..............+.+ ....++.+.... . T Consensus 470 ~~~~~~~~~~~i~-------~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~---------A 533 (543) T protein:vir:81 470 WNTSASADNFVLL-------YGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPN---------A 533 (543) T ss_pred ccccccCCcceEE-------EeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeeccc---------c Confidence 4322111110000 0000000000000000100000000000000000 000000000000 0 Q ss_pred ccceeeeeeeccccccc Q lcl|Aclame:pro 299 IHLIPGSIEVAPEAGAN 315 (392) Q Consensus 299 ~~~~~~~v~v~~~~~~~ 315 (392) +... .+.. +. T Consensus 534 ~~~l--~~~~-----~a 543 (543) T protein:vir:81 534 FRLL--NVET-----AS 543 (543) T ss_pred eEEE--Eecc-----cC Confidence 0000 0000 00 No 138 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.45 E-value=6.5e-05 Score=43.53 Aligned_cols=280 Identities=11% Similarity=0.067 Sum_probs=117.9 Q ss_pred Ccc-------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN-------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man-------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (392) ++. -+++|+-|..++++.+++..++..+..|- +....|+ +++|+..... ..... +++......+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~----~~~~~g~-~~~p~~~~~~--~a~~v--~Eg~~~~~~~ 195 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARS----IPLPNGN-MSLPRLAGGA--TASYT--GENQDAKVSE 195 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhccee----eecCCcc-eEEEEEeCCc--ceeee--ccCccccccc Confidence 111 24789999999999999998876663222 2222233 6777643211 11122 2333444445 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccc----c----cccccc Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG--APYEA----A----GAVHEV 143 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~--~~~~~----~----~~~~~~ 143 (392) +.-..+++...+. +.-+.|+.+-+.++..++...+.+..+++|+.++|..++.--.. .|.+. . ...... T Consensus 196 ~~f~~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~ 274 (428) T protein:vir:10 196 ARFDDVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAA 274 (428) T ss_pred cceeeEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccc Confidence 5545566665443 34567888877777888988889999999999999988731100 01000 0 000001 Q ss_pred cchhhHH---HHHHHHHHhhhcc--CCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 144 APDEFFK---GVNGARRALNELY--IPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 144 ~~~~~~~---~i~~a~~~l~~~~--vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .....++ ..+++...+.... ...+-.++++|..+..|.+-. +..|.-. +....-|.+.|++|+.++. T Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-----d~~G~~i---~~~~~~g~l~G~pv~~~~~ 346 (428) T protein:vir:10 275 DAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR-----DGNGNKV---YPEMAQGMLKGYPIQRTSA 346 (428) T ss_pred cccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh-----ccCCcee---ccCCCCCeeeceeeEEecc Confidence 1111222 2222222221111 112346788999988775421 1222211 1122235789999999999 Q ss_pred eeecccceeecccccccchhhhccccccccceeeccc-ceeeeeeeccccceeeeecccccc-eeeeEEEeeccccceee Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQ-RIAMRWLVDYDSTITSNRSLIDTY-FGLKVVEDPNGVGFVRA 296 (392) Q Consensus 219 v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 296 (392) +|.+.........+.++ .. .....+.+ +...... ...............+ ........... .- T Consensus 347 ~p~~~~~~~~~~~i~~g---------d~-s~~~i~~~~~i~i~~~-~~~~~~~~~~~~~~~f~~~~~~~R~~~r----~d 411 (428) T protein:vir:10 347 IPANLGEGGKESEIYFA---------DF-NDVVIGEDGNMKVDFS-KEASYIDTDGKLVSAFSRNQSLIRVVTE----HD 411 (428) T ss_pred ccccccCCCccceEEEE---------ec-ceEEEEEecceEEEee-cccccccccccccchhhcchhheeeeee----eC Confidence 88654322111111100 00 00000000 0000000 0000000000000000 00000000000 00 Q ss_pred eeccceeeeeeeccccc Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAG 313 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~ 313 (392) ..+..+..-+.++.+.. T Consensus 412 ~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 412 IGFRHPEGLVLGTGVLF 428 (428) T ss_pred ceeeccceEEEEeccCC Confidence 00111111111122222 No 139 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.37 E-value=7.3e-05 Score=43.26 Aligned_cols=262 Identities=10% Similarity=0.016 Sum_probs=115.1 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEec-cceeeeccccccccCCCcccc-c Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVP-APSRGHTRKLRGAGAERNLTV-S 72 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~-~~~~~~~~~~~~~~~~~~~~~-~ 72 (392) |+- .+++|+.|..++++.+++..++..+++.-.-. ...| .+.+++- ....+.. ..+ +..... . T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~-~~~~~~~~~~~~a~~---v~E--g~~~~~~~ 193 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVT---TRSG-TRLLEKNADMVPFSP---VEE--LGNLPEID 193 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeecc---CCce-eEEEEEecCCcceee---ecc--cccccccc Confidence 321 24789999999999999999887776543111 1112 2444331 2222222 222 222211 1 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHH Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGV 152 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 152 (392) .+.-..+++...+. +.-+.|+++-+..+..++...+.+..+++|++++|..++.-.... .......++++ T Consensus 194 ~~~~~~v~~~~~k~-~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~---------~~~g~~~~~~i 263 (397) T protein:vir:12 194 QPRFTKVSYSIIDY-GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASL---------KKVDIDGLDGI 263 (397) T ss_pred cccceeEEeeheee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cccccccHHHH Confidence 23334455554333 244567887777777889899999999999999999887532111 11122346777 Q ss_pred HHHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 153 NGAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 153 ~~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) .++. ..|+...- .+-.++++|..+..|.+-. +..|.-.. ..+.+|..+.+.|++|+.++.......... . T Consensus 264 ~~~~~~~l~~~~~-~~a~~~~n~~~~~~L~~lk-----d~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~--~ 335 (397) T protein:vir:12 264 KKALNVTLDPMVA-PGSIVLTNQDGYDWLDTLK-----DGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGK--A 335 (397) T ss_pred HHHHhhccchhhh-CCCEEEEcHHHHHHHHHhh-----ccCCceeecccccCCCCccccceeeEEecccccccCCCc--c Confidence 7654 34432222 3446889999998885421 11122110 123356667899999987654322211000 0 Q ss_pred cccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecc--cccceeeeEEEeeccccceeeeeccceeeeee Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSL--IDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 307 (392) .+ ..+.....+... ..+....+..........+... ........+.. ...+.. ..++ T Consensus 336 ~~---------~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~---------~~a~~~--~~~t 395 (397) T protein:vir:12 336 PL---------IIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWD---------EDAVVF--GQIT 395 (397) T ss_pred EE---------EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEec---------ccceEE--EEEe Confidence 00 000000000000 0001111100000000000000 00000000000 000000 0000 Q ss_pred ec Q lcl|Aclame:pro 308 VA 309 (392) Q Consensus 308 v~ 309 (392) +. T Consensus 396 ~~ 397 (397) T protein:vir:12 396 VE 397 (397) T ss_pred eC Confidence 00 No 140 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.37 E-value=8.3e-05 Score=42.94 Aligned_cols=274 Identities=8% Similarity=0.034 Sum_probs=116.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceee-eccccccccCCCccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRG-HTRKLRGAGAERNLT-VS 72 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~-~~~~~~~~~~~~~~~-~~ 72 (392) |. -.++.|+.++.++++.+++...+..++..-. ... ...+++.+..... ......++ +.... .+ T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~---~~~~~~~~~~~~~~~~a~~v~E--g~~~~~~~ 187 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---VST---SNGSRVYEKWTDVTPLTVMDAE--DGKIPDLD 187 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceee---ccC---CcceEEEEeecCCccceeeecC--cccccccc Confidence 21 1246799999999999999998887764321 122 2233333221110 11111222 22221 12 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHH Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGV 152 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 152 (392) .+.-..+++.+.+. +.-+.|+++-+.++..++...+.++..++++.++|..++.-.... ........++++ T Consensus 188 ~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~--------~~~~~~~~~~~i 258 (404) T protein:vir:39 188 NPRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV--------PKKPTIAKFDDV 258 (404) T ss_pred ccceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------ccccccccHHHH Confidence 33345566666444 345578888777778889899999999999999999887532111 111122346677 Q ss_pred HHHHH-HhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecc--eeecccceee Q lcl|Aclame:pro 153 NGARR-ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTL--IPHGDAYLYH 228 (392) Q Consensus 153 ~~a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~--v~~~~~~~~~ 228 (392) .++.. .++.... .+-.++++|..+..|.+-. +..|.-.. .....+..+.+.|++|+.... +|..+.... T Consensus 259 ~~~~~~~~~~~~~-~~a~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~- 331 (404) T protein:vir:39 259 ITMINTSVDPAII-ATSSLLTNQSGLNKLALVK-----TAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVY- 331 (404) T ss_pred HHHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCcceecceeEEEecccccCccCCCcc- Confidence 76643 3333222 3346899999998886421 11222110 112345556899999887543 222111000 Q ss_pred cccccccchhhhccccccccceeec-ccceeeeeeeccccceeeee--cccccceeeeEEEeeccccceeeeeccceeee Q lcl|Aclame:pro 229 PTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNR--SLIDTYFGLKVVEDPNGVGFVRARKIHLIPGS 305 (392) Q Consensus 229 ~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (392) . + ..+......... ..+....+..........+. .......++.+.... .- T Consensus 332 -~-~---------~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~---------------a~ 385 (404) T protein:vir:39 332 -P-L---------YYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSE---------------AL 385 (404) T ss_pred -E-E---------EEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEeccc---------------ce Confidence 0 0 000000000000 00000000000000000000 000000000000000 00 Q ss_pred eeecccccccceeeeeecc Q lcl|Aclame:pro 306 IEVAPEAGANATITAAAGE 324 (392) Q Consensus 306 v~v~~~~~~~~~~~~~~~~ 324 (392) +.+.........-+.+.|. T Consensus 386 ~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 386 VAGSFTAIADQVGNFTAGK 404 (404) T ss_pred EEEEeeccccCCCCCCCCC Confidence 0000000011111111111 No 141 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=97.27 E-value=0.00011 Score=42.34 Aligned_cols=272 Identities=11% Similarity=0.049 Sum_probs=122.2 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. --+++|+.|..++++.+++..++..++.. +... +..+.+|+-.. ..+.+ .++ +....... T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~-----~~~~-~~~~~~~~~~~~~~a~~---v~E--~~~~~~~~ 174 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATV-----ITLG-GSDYKKLVNLGGTTSGW---VGE--TDARPETA 174 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhcee-----eecC-CCceEEEEecCCcceee---ecc--cccccccc Confidence 22 12478999999999999999888776643 1211 33566654221 22222 222 22222111 Q ss_pred -ccCceEEEEEEeeeec-ceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc----c--------- Q lcl|Aclame:pro 74 -FTEDSFPVTLTDVAYH-LGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA----A--------- 137 (392) Q Consensus 74 -~~~~~~~~~i~~~~~~-~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~----~--------- 137 (392) ..-..+++.+ ++.. -+.|+++-+.++..++...+.++.+++++..+|..++.-= .+.|.+. . T Consensus 175 ~~~f~~i~~~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~ 252 (407) T protein:vir:48 175 TSKLGLIEPFM--GEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRA 252 (407) T ss_pred cccceeEEeee--eeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccc Confidence 2234444544 3433 3468888778788899999999999999999999876310 0000000 0 Q ss_pred -----cccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeee Q lcl|Aclame:pro 138 -----GAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGY 211 (392) Q Consensus 138 -----~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~ 211 (392) ...........+++++++...|.....+.. .+++++..+..|.+-. +..|.-. ...+..|..+.+.|. T Consensus 253 ~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a-~~v~n~~~~~~L~~lk-----D~~Gr~l~~~~~~~g~~~~l~G~ 326 (407) T protein:vir:48 253 FGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGA-KFMMNNSSLFAIRLLK-----DNDGNYLWRPGIELGQPSSLAGY 326 (407) T ss_pred cccccccccccccccChHHHHHHHHhhchhhhcCC-EEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCCceecce Confidence 001111223358889888887866544333 5688999988775421 1112111 112335666789999 Q ss_pred EEEEecceeecccceeecccccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeecc Q lcl|Aclame:pro 212 EIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNG 290 (392) Q Consensus 212 ~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (392) +|+.++.+|....... .+.+ +... .+......+.... ...+...... ........++.+.... T Consensus 327 PV~~~~~~p~~~~~~~---~i~~---------Gd~~~~~~i~~~~~~~i~-~d~~~~~~~~-~~~~~~r~d~~v~~~~-- 390 (407) T protein:vir:48 327 GIVENEQMPDIAADAK---AIAF---------GNFKRGYTIVDRIGTRIL-RDPYTNKPFV-GFYTTKRTGGMLVDSQ-- 390 (407) T ss_pred eeEEecCcCCccCCcc---EEEE---------EeccccEEEEEeeceEEE-eeccccCCcE-EEEEEEEeccEEeccc-- Confidence 9999998875321100 0000 0000 0000000000000 0000000000 0000000000000000 Q ss_pred ccceeeeeccceeeeeeeccccccccee Q lcl|Aclame:pro 291 VGFVRARKIHLIPGSIEVAPEAGANATI 318 (392) Q Consensus 291 ~~~~~~~~~~~~~~~v~v~~~~~~~~~~ 318 (392) .+. .+.+.....+...- T Consensus 391 -------a~~----~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 391 -------AIK----LMKIGAATRQKAAA 407 (407) T ss_pred -------ceE----EEEeeccCCCCCCC Confidence 000 00000000000000 No 142 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=279 Identities=9% Similarity=0.040 Sum_probs=115.4 Q ss_pred Ccc-------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MAN-------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man-------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~ 72 (392) |+- -.++|+.+..++++.|++..++..+-.|- +....| .+++|+-.. ..+.. .. ++...... T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~----v~~~~g-~~~~p~~t~~~~a~w---v~--E~~~~~~s 133 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARS----IPLPNG-NLSMPRLSGGATAGY---VG--EGKDVVAT 133 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceee----eecCCC-ceEEEEEeCCcceee---ec--cCcccccc Confidence 221 23679999999999999988876652222 222234 377776422 22222 22 33334444 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccc----cccc-----c Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVG--APYEA----AGAV-----H 141 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~--~~~~~----~~~~-----~ 141 (392) +++-..+++...+. +.-+.|+++-+.++..++...+.++..+++++++|+.++.--.. .+.+. .... . T Consensus 134 ~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~ 212 (366) T protein:vir:57 134 GATFDDVKLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWT 212 (366) T ss_pred ccceeEEEEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeecc Confidence 55555566655333 34557887777777788988899999999999999988742110 11110 0000 0 Q ss_pred cccc-hhhHHHHHHHHHH-hhhccC-CCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecc Q lcl|Aclame:pro 142 EVAP-DEFFKGVNGARRA-LNELYI-PQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) Q Consensus 142 ~~~~-~~~~~~i~~a~~~-l~~~~v-p~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~ 218 (392) .... ...++..++.... +...+. ..+-.++++|..+..|.+-. +..|.- .+....-|.+.|+.|+.++. T Consensus 213 ~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk-----d~~G~~---l~~~~~~g~l~G~Pvv~s~~ 284 (366) T protein:vir:57 213 GTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR-----DGNGNK---VYPEMSQGILKGYPIQRTSA 284 (366) T ss_pred ccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh-----ccCCce---eccCCCCCeecceeeEEccc Confidence 0000 0112222222111 111111 12345789999988876421 122221 11122335789999999999 Q ss_pred eeecccceeecccccccchhhhccccccccceeecccc-eeeeeeeccccceeeeecccccce-eeeEEEeeccccceee Q lcl|Aclame:pro 219 IPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQR-IAMRWLVDYDSTITSNRSLIDTYF-GLKVVEDPNGVGFVRA 296 (392) Q Consensus 219 v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 296 (392) +|...........+.++.-.. ...+.++ ......... .....+......+. ........... . T Consensus 285 ip~~~~~~~~~~~i~~gdfs~----------~~i~~~~~i~i~~~~ea-~~~~~~g~~~~~f~~~~~~iR~~~~~----d 349 (366) T protein:vir:57 285 IPANLGDDGNESEIYFCDFND----------VVIGEDGMMKVDFSTEA-TYKDADGQLVSAFARNQSLIRVVTEH----D 349 (366) T ss_pred cccccccCCCccEEEEEecce----------EEEEEecceEEEEeecc-ccccccccchhhhhcCceeEEeeeee----C Confidence 986532211111111100000 0000000 000000000 00000000000000 00000000000 0 Q ss_pred eeccceeeeeeeccccc Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAG 313 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~ 313 (392) ..+.....-+.++.+.- T Consensus 350 ~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 350 IGFRHPEGLVLGTGVIW 366 (366) T ss_pred cEeeccccEEEEecccC Confidence 00000001111111111 No 143 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.19 E-value=0.00014 Score=41.80 Aligned_cols=277 Identities=8% Similarity=0.052 Sum_probs=113.7 Q ss_pred Cc--------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-c Q lcl|Aclame:pro 1 MA--------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-V 71 (392) Q Consensus 1 Ma--------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~ 71 (392) |. ...+.|+.|+.++++.+++..++..+++.- ......|. +.++.-.... .......+ +.... - T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~~~-~~~~~~~~~~-~~a~~v~E--~~~~~~~ 177 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVE---NVTTSHGS-RVYEKLADIT-PLKDLDDE--SALIGDN 177 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhccee---eccCCcce-EEEEeeccCC-cccccccc--ccccccc Confidence 11 124679999999999999999887776432 11111122 2232211110 11111111 22221 1 Q ss_pred ccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHH Q lcl|Aclame:pro 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKG 151 (392) Q Consensus 72 ~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (392) ..+.-..++++..+. +.-+.|+++-+..+..++...+.++.+++++..+|..++.-.... ........+++ T Consensus 178 ~~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~--------~~~~~~~~~~~ 248 (395) T protein:vir:38 178 DDPELTVVKYLIHRY-AGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKA--------PKKPTISQFDN 248 (395) T ss_pred cccceeeEEeeeeee-EeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccccHHH Confidence 112223444444333 233467777777777889899999999999999999887532211 11112234666 Q ss_pred HHHHHH-HhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEEEEecceeecccceeec Q lcl|Aclame:pro 152 VNGARR-ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEIVESTLIPHGDAYLYHP 229 (392) Q Consensus 152 i~~a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~ 229 (392) +.++.. .|....- .+-.++++|..+..|.+-. +..|.-. ...+..|..+.+.|++|+.+..++....... T Consensus 249 i~~~~~~~l~~~~~-~~a~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~-- 320 (395) T protein:vir:38 249 IKDLENNTLDPAIE-STSSFITNQSGYNILSKVK-----DADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGS-- 320 (395) T ss_pred HHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhh-----ccCCceeeccCcCCCCcceeccceeEEecccccCcCCCc-- Confidence 666542 3333222 2346889999998886421 1122211 0113345567899999988765443221100 Q ss_pred ccccccchhhhcccccccc-ceeecccceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRS-TAISGDQRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ..+.+ +.... .......+....+..........+.. ......+..+....... .+ T Consensus 321 ~~i~~---------gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~------~~------- 378 (395) T protein:vir:38 321 HPLYF---------GDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFA------AA------- 378 (395) T ss_pred ceEEE---------EeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE------EE------- Confidence 00000 00000 00000001111110000000000000 00000011011000000 00 Q ss_pred eecccccccceeeeeeccCe Q lcl|Aclame:pro 307 EVAPEAGANATITAAAGEDH 326 (392) Q Consensus 307 ~v~~~~~~~~~~~~~~~~~~ 326 (392) .+...... .. .+...+. T Consensus 379 ~~~~~~~~-~~--~~~~~~~ 395 (395) T protein:vir:38 379 SFKTVANQ-AQ--GTAGTGK 395 (395) T ss_pred EeecccCC-CC--CccCCCC Confidence 00000000 00 0000111 No 144 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=97.16 E-value=0.0001 Score=42.42 Aligned_cols=267 Identities=13% Similarity=0.038 Sum_probs=121.0 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. -+++|+.|..++++.+++..++..++.+- .- .|...++|+-.. .......+++...+. .... T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~--~~~a~wv~E~~~~~~-~~~~ 177 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI-----TV-GGSDYKKLVNLG--GTASGWVGETDTRSQ-TATS 177 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee-----ec-CCCceEEEEecC--CccceeeccccccCc-cccc Confidence 332 25789999999999999999887776442 11 133455544211 111112222221110 1112 Q ss_pred cCceEEEEEEeeeec-ceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc---------------- Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYH-LGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA---------------- 136 (392) Q Consensus 75 ~~~~~~~~i~~~~~~-~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~---------------- 136 (392) .-.. ++++-++.. -+.|+.+-+..+..++...+.+..+++++..+|..++.-= .+.|.+. T Consensus 178 ~~~~--v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~ 255 (401) T protein:vir:44 178 RLGL--IEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFG 255 (401) T ss_pred ccee--eeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccc Confidence 2233 444444443 4467777777778899999999999999999999887310 0001000 Q ss_pred --ccccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeEE Q lcl|Aclame:pro 137 --AGAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYEI 213 (392) Q Consensus 137 --~~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~v 213 (392) ............|++++++...|..... .+-.+++++..+..|.+-. +..|.-. ...+..|..+.+.|++| T Consensus 256 ~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~-~~a~~v~n~~~~~~L~~lk-----d~~G~~l~~~~~~~g~~~~l~G~PV 329 (401) T protein:vir:44 256 KLQHIVSGEATAVTADAIIKLIYTLRKAHR-TGAKFMMNNNSLFAIRLLK-----DTEGNYLWRPGLELGQPSSLAGYGI 329 (401) T ss_pred cccccccccccccCHHHHHHHHHhcchhhh-cCCEEEEcHHHHHHHHHhh-----ccCCceeecCCcCCCCCceecceee Confidence 0000111223458889888877764433 2336789999988875421 1112111 11233566678999999 Q ss_pred EEecceeecccceeecccccccchhhhcccccc-ccceeecccceeeeeeeccccceeeeecccccceeeeEEEeecccc Q lcl|Aclame:pro 214 VESTLIPHGDAYLYHPTAFIMATRAPAPPMGAV-RSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVG 292 (392) Q Consensus 214 ~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (392) +.++.+|....... ...+ +.. ..+......+.... ...+...... ........++.+........ T Consensus 330 v~~~~~p~~~~~~~---~i~~---------Gd~~~~~~i~~~~~~~~~-~~~~~~~~~v-~~~a~~r~d~~~~~~~a~~~ 395 (401) T protein:vir:44 330 AENEQMPDIAADAK---AIAF---------GNFKRGYTIVDRIGTRIL-RDPYTNKPFV-GFYTTKRTGGMLVDSQAIKL 395 (401) T ss_pred EEecCcCCccCCcc---EEEE---------eehhccEEEEEecceEEe-eeccccCCcE-EEEEEEEeccEEecccceEE Confidence 99998875321110 0000 000 00000000000000 0000000000 00000001111111000000 Q ss_pred ceeeeeccceeeeeeecccccc Q lcl|Aclame:pro 293 FVRARKIHLIPGSIEVAPEAGA 314 (392) Q Consensus 293 ~~~~~~~~~~~~~v~v~~~~~~ 314 (392) + .+..+ T Consensus 396 ------l----------~~~aa 401 (401) T protein:vir:44 396 ------L----------KIAAA 401 (401) T ss_pred ------E----------EeecC Confidence 0 00000 No 145 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=97.15 E-value=0.00015 Score=41.56 Aligned_cols=267 Identities=11% Similarity=0.071 Sum_probs=121.4 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccccc- Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSD- 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~- 73 (392) |.. -+++|+.|..++++.+++..++..+++.- .. .+..+++|+..... .....++ +......+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~-----~~-~~~~~~~~~~~~~~--~a~wv~E--~~~~~~~~~ 199 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQ-----PV-SKAGFSKLFNMGGT--TSGWVGE--ASQRPQTNA 199 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceee-----ec-cCCceEEEEEcCCc--ceeeecc--ccccccccc Confidence 321 23789999999999999999888876432 11 12346666532211 1122222 22221111 Q ss_pred ccCceEEEEEEeeee-cceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHH---------Hhccccccc------ Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAY-HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM---------IVGAPYEAA------ 137 (392) Q Consensus 74 ~~~~~~~~~i~~~~~-~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~---------~~~~~~~~~------ 137 (392) ..-.. +++.-+++ .-+.|+.+-+.++..++...+.++.+++++.++|..++.- +........ T Consensus 200 ~~f~~--v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 277 (425) T protein:vir:10 200 ATFQP--LSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPF 277 (425) T ss_pred cccce--eeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccc Confidence 12233 44444444 3446777777777789999999999999999999988731 110000000 Q ss_pred ----cccccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccce-eeeEeeeeeeeEeeeE Q lcl|Aclame:pro 138 ----GAVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSA-VSALQEARLGRIYGYE 212 (392) Q Consensus 138 ----~~~~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~-~~a~~~g~ig~~~g~~ 212 (392) ...........+++++++...|..... .+-.++++|..+..|.+-. +..|.-. ...+..|..+.+.|.+ T Consensus 278 ~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~-~~a~~vmn~~~~~~L~~lk-----D~~G~~l~~~~~~~g~~~~l~G~P 351 (425) T protein:vir:10 278 GAIEVVNSGAAADITSDGIIDLVYDLPSAFT-GNARFAMNRNTQRQVRKLK-----DGQGNYLWQPSYVAGQPATLAGYP 351 (425) T ss_pred cccccccccccccccHHHHHHHHhhhhhhhc-cCCEEEEchHHHHHHHHhh-----cCCCceeeccCccCCCCceeccee Confidence 000112233467888887776654433 2336789999998875421 1222210 1123456667899999 Q ss_pred EEEecceeecccceeecccccccchhhhccccccc-cceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccc Q lcl|Aclame:pro 213 IVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVR-STAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGV 291 (392) Q Consensus 213 v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 291 (392) |+.++.+|....... .+.+ +... .+......+.... ...+.. .. ..... ....... T Consensus 352 V~~~~~~p~~~~~~~---~i~~---------Gd~~~~~~i~~~~~~~v~-~d~~~~-----~~----~~~~~-~~~r~d~ 408 (425) T protein:vir:10 352 VTEVPDMPDVAANST---PILF---------GDFQQTYLIIDRIGVRVL-RDPYTA-----KP----YVLFY-TTKRVGG 408 (425) T ss_pred eEEecCcCCccCCcc---EEEE---------EehhccEEEEEecceEEE-eccccc-----CC----cEEEE-EEEEecc Confidence 999988874321100 0000 0000 0000000000000 000000 00 00000 0000000 Q ss_pred cceeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 292 GFVRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 292 ~~~~~~~~~~~~~~v~v~~~~~~~ 315 (392) ...... .+.+..+..+. T Consensus 409 ~v~~~~-------A~~~l~~~as~ 425 (425) T protein:vir:10 409 GLLNPE-------PMRAMKVAASE 425 (425) T ss_pred Eeeccc-------ceEEEEeeccC Confidence 000000 00000000000 No 146 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=96.78 E-value=0.00034 Score=39.59 Aligned_cols=257 Identities=12% Similarity=0.086 Sum_probs=116.9 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-cccccCceE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSDFTEDSF 79 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 79 (392) -...+++|+.|..++++.+++...+.+++..- .. .+.++++|++.... .......+ +.... .+++.-..+ T Consensus 140 ~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~~~~~~E--~~~~~~~~~~~f~~i 210 (400) T protein:vir:38 140 ADAASTIPETISNTPQRELQTVVDLKPFTNVF-----QA-STQKGTYPTVANAT-TKMVTVAE--LEKNPAMAKPEFKPV 210 (400) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhcceeE-----ec-cCcceEEEEEecCC-Cccccccc--cccccccccccceee Confidence 11245789999999999999988877765432 11 13456776653221 11111122 22111 123333445 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHH-H Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR-A 158 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~-~ 158 (392) ++...+. +.-+.|+++-+.++..++...+.+..+++++..+|..++...... .......++++.++.. . T Consensus 211 ~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~~~~~~~~~ 280 (400) T protein:vir:38 211 NWSVETY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF---------TAKTISSVDDLKHINNVD 280 (400) T ss_pred Eeehhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc---------cccccccHHHHHHHHHhh Confidence 5554333 345567777777777888888899999999999998777433221 1122234566665533 2 Q ss_pred hhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeecccccccch Q lcl|Aclame:pro 159 LNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATR 237 (392) Q Consensus 159 l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~ 237 (392) ++.. .+..++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|++|+.+...|........ + T Consensus 281 ~~~~---~~a~~v~~~~~~~~l~~lkd-----~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~---~----- 344 (400) T protein:vir:38 281 LDPA---YSRVIIASQSFYNFLDTVKD-----GNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAH---A----- 344 (400) T ss_pred hhhh---hCcEEEEcHHHHHHHHHhhc-----cCCCeeeecCcCCCCccccccceeEEecccccCCCCceE---E----- Confidence 2211 24578899999988764211 1121110 1233455678999999988877643321100 0 Q ss_pred hhhcccccccc-ceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecccc Q lcl|Aclame:pro 238 APAPPMGAVRS-TAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEA 312 (392) Q Consensus 238 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~ 312 (392) ..+.... .......+....++.. +.. ..........++.+... ..+ ..+.+++.- T Consensus 345 ----~~gd~s~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~r~d~~~~~~---------~a~----~~l~~~~~a 400 (400) T protein:vir:38 345 ----FLGDIKRAILFANRADFMVRWVDD-QIY--GQFLQAGMRFGVSVADE---------KAG----YFLTYTPKA 400 (400) T ss_pred ----EEEeccccEEEEeecceEEEEecc-ccc--ceeEEEEEEeccEEecc---------cce----EEEEeecCC Confidence 0000000 0000000111111100 000 00000000000000000 000 000010100 No 147 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=96.73 E-value=0.00028 Score=40.10 Aligned_cols=275 Identities=14% Similarity=0.081 Sum_probs=116.5 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) ...-+..|.-+.+++++.+.++-.|..+++.-.-.+ ..|.-..+-..+. +..... .+ .......++.-+.++ T Consensus 24 ~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~---~~~~i~~~~~~~~--~~~~~~--e~-~~~~~~~~~~~~~~~ 95 (321) T protein:vir:31 24 LDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGA---KKTRIPTLNIGER--HRRPQD--EG-EWNENESDVSTGTID 95 (321) T ss_pred cCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccC---cceeeeeeccCCc--cccccc--cc-ccccccccceeeeee Confidence 223345566667778888888888877776542222 2232222211111 111100 01 111222233445556 Q ss_pred EEEEeeeecceEeeHHHHhhhc--cChHHHHHHHHHHHHHHHHHHHHHHHHhc-cc------cc------cccc-ccccc Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDL--ESFATQILPRQVRGVADILEEGVRDMIVG-AP------YE------AAGA-VHEVA 144 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~--~~~~~~~~~~~~~ala~~vd~~~~~~~~~-~~------~~------~~~~-~~~~~ 144 (392) +...+. ...+.|+.+-+.... .||...+....+++++..++...+.--.. .+ .+ .... ..... T Consensus 96 ~~~~k~-~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~ 174 (321) T protein:vir:31 96 ISTEKA-TVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAAD 174 (321) T ss_pred eeeEEE-EeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccccccc Confidence 665443 356678877666543 48888888888889999888866521100 00 00 0000 01112 Q ss_pred chhhHHHHHHHHHHhhhccCCC-CCEEEEchHHHHHhhc---ccceeeeeccccceeeeEeeeeeeeEeeeEEEEeccee Q lcl|Aclame:pro 145 PDEFFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILN---DDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP 220 (392) Q Consensus 145 ~~~~~~~i~~a~~~l~~~~vp~-~r~~vv~~~~~~~l~~---~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~ 220 (392) ....++.+.++...|+...--. +-.++++++....+.. +.+ ...+. ..+..+...++.|+.++.+..+| T Consensus 175 ~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~----~~~~~---~~l~~~~~~tl~G~pvv~~~~mP 247 (321) T protein:vir:31 175 DILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRD----TPLGD---NVIMGEADVNPFSFPIIGSGLWP 247 (321) T ss_pred cccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCC----Ccccc---chhhccccccccceeEEEcCCCC Confidence 2234667777776665443212 2256788887655432 211 11222 23445566678999999999998 Q ss_pred ecccceeecccccccchhhhccccccccceeecccceeeeeeeccccce-eeeecc--cccceeeeEEEeeccccceeee Q lcl|Aclame:pro 221 HGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTI-TSNRSL--IDTYFGLKVVEDPNGVGFVRAR 297 (392) Q Consensus 221 ~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 297 (392) .+.........+.+...... ........+... ...... .....+ ++.......... . T Consensus 248 ~~~il~t~~~nl~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ve~~~a~a~~--~ 307 (321) T protein:vir:31 248 DDKAMFTDPQNLIYALYRDL-----------------EIDVLTESDKVSERDLHARYFMRGDDD-FAIENTEAVVLA--E 307 (321) T ss_pred CCcEEEeccccEEEEEeecc-----------------EEEEeecCccccccceeeEeeeeeecc-eeEeccccEEEE--e Confidence 76544333222221111000 000000000000 000000 000000 011111000000 0 Q ss_pred eccceeeeeeecccccccc Q lcl|Aclame:pro 298 KIHLIPGSIEVAPEAGANA 316 (392) Q Consensus 298 ~~~~~~~~v~v~~~~~~~~ 316 (392) .+.. +...+..... T Consensus 308 ~i~~-----~~~~~~~~~~ 321 (321) T protein:vir:31 308 GLGD-----PLEHLEEETS 321 (321) T ss_pred cCCc-----chhcccCCCC Confidence 0111 0111111111 No 148 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=96.67 E-value=0.00042 Score=39.08 Aligned_cols=267 Identities=10% Similarity=0.067 Sum_probs=117.1 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-ccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSD 73 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (392) |+- .+++|+.|..++++.+++...+..+++.- .- .+.+.++|...... ....... ++.... ... T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~~~~~~--E~~~~~~~~~ 179 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT-----PV-TTPKGTYPILKRAT-DRFSSVA--ELAENPKLAE 179 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee-----ec-cCCeeEEEEEecCC-Ccccccc--cccccccccc Confidence 331 25789999999999999999887766432 11 12345555442211 1111111 122121 223 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++.+.+. +.-+.++++-+.++..++...+.+..+++++...|..++....... ..+......++++. T Consensus 180 ~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~------~~~~~~~~~~d~l~ 252 (389) T protein:vir:10 180 PEFNKVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT------AKKTTTDTLVDSLK 252 (389) T ss_pred ccceeeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc------cccccccccHHHHH Confidence 3335555555333 3455688877777778898889999999999999998876554321 12233345577777 Q ss_pred HHHH-HhhhccCCCCCEEEEchHHHHHhhccc----ceeeeeccccceeeeEeeeeeeeEeeeEEEEecce-eeccccee Q lcl|Aclame:pro 154 GARR-ALNELYIPQGRVLVVGTAVTEQILNDD----RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI-PHGDAYLY 227 (392) Q Consensus 154 ~a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~----~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v-~~~~~~~~ 227 (392) ++.. .++.. .+..++++|..+..|.+-. ++.-.... ......|..+++.|++|+..... +....... T Consensus 253 ~~~~~~~~~~---~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 325 (389) T protein:vir:10 253 HILNVDLDPA---YSRALVVTQSLFNTLDTLKDKNGRYLLHDAS----DSITDGTAKGTILGVPVYVVGDTLLGSLAGDQ 325 (389) T ss_pred HHHHhhhhhh---hCcEEEecHHHHHHHHHhhccCCCeeeecCc----ccccccccccccccceeEEecccccCCCCCce Confidence 6543 33321 1346889999998886421 11110000 01112244467999998765432 22211100 Q ss_pred ecccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 228 HPTAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 228 ~~~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) .+.+ +.......... .+....+... .... .........++.+.... .+. .+ T Consensus 326 ---~~~~---------gd~~~~~~~~~~~~~~i~~~~~--~~~~-~~~~~~~r~d~~~~~~~---------a~~----~~ 377 (389) T protein:vir:10 326 ---KAFV---------GDLKRGVLFTDRQQVTLAWEDS--KIYG-KYLGAAFRFGVQKADSK---------AGY----FV 377 (389) T ss_pred ---EEEE---------eeccccEEEEeecceEEEeecc--cccc-ceEEEEEEeccEEeccc---------ceE----EE Confidence 0000 00000000000 0111111000 0000 00000000000000000 000 00 Q ss_pred eecccccccceeeeeecc Q lcl|Aclame:pro 307 EVAPEAGANATITAAAGE 324 (392) Q Consensus 307 ~v~~~~~~~~~~~~~~~~ 324 (392) .++...... .+. T Consensus 378 ~~~~~~~~~------~~~ 389 (389) T protein:vir:10 378 TNTDVPGSA------LGK 389 (389) T ss_pred EeeccCCCC------CCC Confidence 000000000 000 No 149 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=96.60 E-value=0.00017 Score=41.31 Aligned_cols=282 Identities=9% Similarity=0.008 Sum_probs=125.4 Q ss_pred Cc---cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MA---NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Ma---n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) |+ -++++|.+++.-+++.-+.-..-.+++ .+...+.|....++..+... .|+....++=.....+..++. T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~-----qk~~L~~Grsm~F~~~g~~R--a~~IgEGgE~~~~sld~~T~d 146 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKML-----QKIRLKSGQSMIFPSIGIMR--AYDVAEGQEIPEDSIDWQTHE 146 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHH-----HHHhhhcCcceeccchheee--eccccccccccccchhhhcCC Confidence 65 477899999987776333322222332 23444556677775555443 443322222222333334456 Q ss_pred eEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---c-----cc------c Q lcl|Aclame:pro 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGA---V-----HE------V 143 (392) Q Consensus 78 ~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~---~-----~~------~ 143 (392) .+++...|. ...++++|+-...+--|++...+.+++++|+++.+..++...+.....+--+ + .+ - T Consensus 147 sv~~~~gK~-G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~q 225 (393) T protein:vir:79 147 SPEIRVGKS-GIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQ 225 (393) T ss_pred ceeEEechh-hhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccc Confidence 666655332 4778899999999999999999999999999999999998887654421100 0 00 0 Q ss_pred cchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccce--eeeeccccceeeeEeeeeeeeEe-----------e Q lcl|Aclame:pro 144 APDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRF--IKYESQGQSAVSALQEARLGRIY-----------G 210 (392) Q Consensus 144 ~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~--~~~~~~G~~~~~a~~~g~ig~~~-----------g 210 (392) ......++++|+.-..-.+.. .+..++++|-.+..+-+.... ..++..|.-... .....+.. . T Consensus 226 NGTlSleDllDm~~av~~~hy-t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~---~~~ts~algp~~i~~~~~~n 301 (393) T protein:vir:79 226 NDTFSAEDFLDLIIAVMANEY-TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAK---GAPSSMALGPDSIQGRLPFN 301 (393) T ss_pred cccccHHHHHHHHHHHhcccC-CcceEEEcCchhhhhhhhhhhcceeeccccccCcc---ccchhhhhchhhhccccccc Confidence 112356788876543322222 345688888887776655322 122222211000 00011112 2 Q ss_pred eEEEEecceeecccceeecccccccchhhhccccccccceeecccceee--------eeeeccccceeeeecccccceee Q lcl|Aclame:pro 211 YEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAM--------RWLVDYDSTITSNRSLIDTYFGL 282 (392) Q Consensus 211 ~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~ 282 (392) +.|..+.-+|....... ......+..... ....+.+.....+.....--+|. T Consensus 302 lnv~~sPfvp~d~k~~r--------------------Fd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~ 361 (393) T protein:vir:79 302 FNVNLSPFIPLDKKSRR--------------------FDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGI 361 (393) T ss_pred eeEEEecccccccccce--------------------eeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeece Confidence 45555555554432100 000111100000 00001111111111111111222 Q ss_pred eEEEeeccccceeeeeccceeeeeeecccccccceeeeeecc Q lcl|Aclame:pro 283 KVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGE 324 (392) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~ 324 (392) .+.............+ +...-..+ .+...++. T Consensus 362 gvLn~gkaiavakNI~---------~~k~y~~P-~~~~~~~~ 393 (393) T protein:vir:79 362 GILNEGKAIAVAKNIS---------MDKSYAEP-MLIKNVGN 393 (393) T ss_pred eeeeCCceEEEEecce---------eecccccc-hhhhccCC Confidence 1221111111111110 00000000 00001111 No 150 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=300 Identities=12% Similarity=-0.007 Sum_probs=122.0 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeec---ccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLN---GIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~---~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |+- ..|+|+++.. .++.+++++.+-+..-+. ...++ -.||-+++|..........+..+......+.+..+ T Consensus 1 m~lsD~~vfN~~~~~a-~~e~~~q~~~~fn~as~gai~l~~~~--~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~ki 77 (325) T protein:vir:95 1 MALSDLAVYSEYAYSA-FSETLRQQVDLFNTATGGAIMLQSAA--HQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVL 77 (325) T ss_pred Cchhhhhhhhhhhhhh-hhhhhhhhHhhhhhcccceeEecccc--ccCceeeccccccccccccccccCCCCceecccee Confidence 873 3478888876 466677665432221100 11121 24899999877654333222222223334555554 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH----hccccccc----ccccccc-- Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI----VGAPYEAA----GAVHEVA-- 144 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~----~~~~~~~~----~~~~~~~-- 144 (392) ..... +.+.-++.+++.-.|++......+.+.+++++.+..+++...++.+..+ ..+-.... ....... T Consensus 78 tt~~~-~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~ 156 (325) T protein:vir:95 78 KHLVD-TSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAA 156 (325) T ss_pred ccccc-eeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCcc Confidence 43332 2222344566666777777777778888888777777777666654333 22111100 0011111 Q ss_pred -chhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecc Q lcl|Aclame:pro 145 -PDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD 223 (392) Q Consensus 145 -~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~ 223 (392) .-...+.+.+|..+|.+++- .=..+++++..+..|.++.- ....+..+. ... -.+....|..|.++..+|... T Consensus 157 ~~~~s~~~l~~A~~klGD~~~-~l~~~~MHS~v~~~L~~~~L-~~~~~~~~~--~g~--~~i~t~~G~~VIVdD~~p~~~ 230 (325) T protein:vir:95 157 DKLPTWNNLNNGQAKFGDQSS-QIAAWIMHSTPMHKLYGSNL-TNGERLFTY--GTV--NVVRDPFGKLLVMTDSPNLFA 230 (325) T ss_pred cccccHHHHHHHHHHhccccc-ceeEEEEchHHHHHHHHhhc-ccccccccc--CCc--ccccccCCcEEEEeCCCCCCC Confidence 11256889999999865422 11357899999999987533 222111110 000 124566788899998887654 Q ss_pred cc--------eeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeecccccee Q lcl|Aclame:pro 224 AY--------LYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVR 295 (392) Q Consensus 224 ~~--------~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (392) .. .+...++............ ....+........-..+ ...++- .|..- ... .. T Consensus 231 ~g~~~~ytty~lg~GAi~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-------tf~lhp-~G~sw-~~s-~~---- 292 (325) T protein:vir:95 231 AGTPNVYHILGLVPGGVLIGQNNDFDANE----ETKNGDENIIRTYQAEW-------SYNIGV-KGFAW-DKA-NG---- 292 (325) T ss_pred ccCceeEEEEEEecCeEEecCCCCccccc----cccCcccceeeeeeeee-------eEEeec-ceeee-ecc-cc---- Confidence 21 1122222221111100000 00000000000000000 000000 00000 000 00 Q ss_pred eeeccceeeeeeecccccccceeeeeeccCeeEEEEEeecCcccccceEEEEEc Q lcl|Aclame:pro 296 ARKIHLIPGSIEVAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALCDFESS 349 (392) Q Consensus 296 ~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~~~~~~~~~~vtw~Ss 349 (392) ..+++...+. .+.. =...|. +..+ +..|.=+++ T Consensus 293 --------------g~sPt~aeL~--~~~N--W~rv~~--~~K~-tagv~~~~~ 325 (325) T protein:vir:95 293 --------------GKSPTDAALF--TSTN--WDKYAT--SHKD-LAGVVVKTN 325 (325) T ss_pred --------------cCCcChHhhc--CCcC--cceecC--CCcc-ccceeEeeC Confidence 0000000000 0000 000000 0000 011111111 No 151 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=96.26 E-value=0.00082 Score=37.51 Aligned_cols=260 Identities=10% Similarity=0.066 Sum_probs=113.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |. .-+++|+-+..++++.+++...+-.+++.- .. +. .++|... ....+....++ +....-.++ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~---~~----~~-~~~p~~~-~~~~~a~~v~E--~~~~~~~~~ 151 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT---NI----KG-LEIPRVS-YTLDDDDFITD--VETAKELKL 151 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeE---ec----CC-ceEEEEe-cCCCccccccc--ccccccccc Confidence 22 245889999999999999988876666431 11 11 2343321 11112222222 333333344 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccc--ccc--ccccccccchhhHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP--YEA--AGAVHEVAPDEFFK 150 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~ 150 (392) .-+.+++...+.. .-+.|+.+-+.++..++...+.+..+++++...+..++..-.+.. ... ......++....|+ T Consensus 152 ~f~~v~~~~~k~~-~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d 230 (352) T protein:vir:78 152 KGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYD 230 (352) T ss_pred cceeeeecceeEE-eechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHH Confidence 4455555554432 335788887777788998889999999998765554553211111 000 01112233344588 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) +|+++...|...... +-.+++++..+..+++--. ..| ..+..|.-..+.|.+|+.+...+..- + . T Consensus 231 ~i~~~~~~l~~~~~~-~a~~~mn~~t~~~l~~~~~-----~~~----~~~~~~~~~~llG~PV~~~~~~~~~~---~--G 295 (352) T protein:vir:78 231 AIINALADLHEDYRD-NATIYMRYADYVKIISVLS-----NGT----TNFFDTPAEKVFGKPVVFTDAAVKPI---V--G 295 (352) T ss_pred HHHHHHhccChhhhc-CCEEEEehHHHHHHHHHHh-----ccC----CcccccCCccccccceEEecCCCcee---E--e Confidence 898887777655443 3356777777666643210 111 12334445578899998776443210 0 0 Q ss_pred cccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) .+... +............ +...............+...... .+... . T Consensus 296 df~~~--------------~~~~~~~~~~~~~---~~~~g~~~f~~~~r~Dg~~~~~e---------A~~~l-------~ 342 (352) T protein:vir:78 296 DFNYF--------------GINYDGTTYDTDK---DVKKGEYLFVLTAWYDQQRTLDS---------AFRIA-------K 342 (352) T ss_pred ehhhh--------------hhhhhhheeeeec---cccCCeeEEEEEeeeCceeechh---------heEEE-------E Confidence 00000 0000000000000 00000000000000111111100 00000 0 Q ss_pred cccccceeee Q lcl|Aclame:pro 311 EAGANATITA 320 (392) Q Consensus 311 ~~~~~~~~~~ 320 (392) +..+...++. T Consensus 343 ~~a~~~~~~~ 352 (352) T protein:vir:78 343 AKESTGSLPS 352 (352) T ss_pred eecccCCCCC Confidence 0000000000 No 152 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=96.09 E-value=0.001 Score=36.98 Aligned_cols=281 Identities=15% Similarity=0.029 Sum_probs=111.5 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLT-VS 72 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~-~~ 72 (392) |+ .-.+.|+.+..++++.+++...+..+++.- .- .+...+||+... ..+.+. . ++..+. .. T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~-----~~-~~~~~~i~~~~~~~~a~~~---~--E~~~~~~~~ 152 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFV-----NT-TATTEWIISVGDVATAWWG---P--LCAEIKEVL 152 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceee-----ec-CCceeEEEEEcCCcceeee---c--cccccCccc Confidence 11 234789999999999999998887776542 11 134466665322 222222 1 112221 12 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hccccc-------c-c-ccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYE-------A-A-GAVHE 142 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~-------~-~-~~~~~ 142 (392) ++.=+.+++...+. +.-+.|+.+-+.++..++...+.+..+++++.++|+.++.-- .+.|.+ . . ..... T Consensus 153 ~~~f~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~ 231 (390) T protein:vir:40 153 DNGFDKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVK 231 (390) T ss_pred cccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccc Confidence 33335556655443 344678888888888899899999999999999999887410 001100 0 0 00000 Q ss_pred ccchhhHHHHHH----HHHHhhhccCC--CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEe Q lcl|Aclame:pro 143 VAPDEFFKGVNG----ARRALNELYIP--QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVES 216 (392) Q Consensus 143 ~~~~~~~~~i~~----a~~~l~~~~vp--~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s 216 (392) ......+.++.+ ....|...... .+-.++++|..+..++..-.. ..+..|.- +.. ....|..|+.+ T Consensus 232 ~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~-~~d~~G~~----v~~---~~~~g~pvv~~ 303 (390) T protein:vir:40 232 TATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATS-YMTPQGVW----VTG---ILPVPLEIVQS 303 (390) T ss_pred cccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhh-ccCCCCcc----ccc---cCCCceeEEEc Confidence 111111222222 22233322221 234578888765443321110 11122211 111 12357888888 Q ss_pred cceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceee Q lcl|Aclame:pro 217 TLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 217 ~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (392) +.+|.+.......+.+....+. +.........................+.+....... . T Consensus 304 ~~~p~~~i~~Gd~s~~~i~~~~-----------------~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~----~ 362 (390) T protein:vir:40 304 VAVPVGKAVAGRAKDYFMGIGS-----------------EQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFL----V 362 (390) T ss_pred CCCCCCcEEEEeeceEEEEeec-----------------ceEEEecchhhhhcCcEEEEEEEEeCCEEecccceE----E Confidence 8887654322211111111000 000000000000000000000000001111100000 0 Q ss_pred eeccc--eeeeeeecccccccceeeeeecc Q lcl|Aclame:pro 297 RKIHL--IPGSIEVAPEAGANATITAAAGE 324 (392) Q Consensus 297 ~~~~~--~~~~v~v~~~~~~~~~~~~~~~~ 324 (392) ..+.. ....+++..+++...+-+ .++ T Consensus 363 l~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 390 (390) T protein:vir:40 363 FDITGLEGSPAIDVNVVNNATPSET--PAE 390 (390) T ss_pred EEeeccCCCCCCCcceeeCCCCCCC--CCC Confidence 00000 000011111100000000 000 No 153 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=96.00 E-value=0.0011 Score=36.70 Aligned_cols=259 Identities=8% Similarity=0.037 Sum_probs=114.9 Q ss_pred Ccc-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccc-cccccCce Q lcl|Aclame:pro 1 MAN-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSDFTEDS 78 (392) Q Consensus 1 Man-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 78 (392) -++ .++.|+.|...+++.+++..++.+++..- .. .+.+.++|.+.... .......+ +.... ...+.-.. T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~-----~~-~~~~~~~~~~~~~~-~~~~~v~E--~~~~~~~~~~~~~~ 203 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY-----QA-KKASGKYPVLQRAT-TKMVTVAE--LEKNPALAKPDFKD 203 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhhhhhceee-----ec-cCcceEEEEEecCC-Cccceecc--ccccccccccccee Confidence 111 24789999999999999988887776432 11 12345666542211 11111222 22221 12333455 Q ss_pred EEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHH Q lcl|Aclame:pro 79 FPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRA 158 (392) Q Consensus 79 ~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~ 158 (392) +++...+. +.-+.|+.+-+.++..++...+.+..+++|+..+|..++...... .......++++.++... T Consensus 204 v~l~~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~---------~~~~~~~~~~~~~~~~~ 273 (394) T protein:vir:97 204 VAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF---------TTKTVKNLDEIKALLNG 273 (394) T ss_pred EEeehhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccccccHHHHHHHHHh Confidence 55555333 345578887777777788888999999999999998877533221 11122346666655432 Q ss_pred hhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeecccccccch Q lcl|Aclame:pro 159 LNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATR 237 (392) Q Consensus 159 l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~ 237 (392) +-... .+-.++++|..+..|.+-. +..|.-.. ..+..|..+.+.|++|+.+.....+....+...- . T Consensus 274 ~~~~~--~~a~~v~n~~~~~~l~~lk-----d~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~-----~ 341 (394) T protein:vir:97 274 GFDPA--YNVSLIVSQSFYQTLDTLK-----DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDF-----K 341 (394) T ss_pred hhhhh--hCCEEEEcHHHHHHHHHhh-----ccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeec-----c Confidence 21111 2346889999988875421 11122111 1123455568999998875433222211110000 0 Q ss_pred hhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 238 APAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) ..+......+....+.. .+. ...........++.+.... ....+...+... |. T Consensus 342 ---------~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~r~d~~v~~~~------a~~~~~~~~~~~---p~ 394 (394) T protein:vir:97 342 ---------RGVLFADRKDLGLRWAD-NEI--YGQYLQAVLRFGVSKVDDK------AGYYVTFTPEPL---PL 394 (394) T ss_pred ---------ccEEEEEecceEEEEec-ccc--cceeEEEEEEEccEEeccc------ceEEEEeccccc---CC Confidence 00000000011111100 000 0000000001111111100 000111111111 11 No 154 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=95.51 E-value=0.0019 Score=35.47 Aligned_cols=292 Identities=11% Similarity=0.025 Sum_probs=131.8 Q ss_pred Cccccc---cHH---HHHHHHHHHHHHhhcccc-eeeecc------cccccCCCCCeEEEEeccceeeeccccccccCCC Q lcl|Aclame:pro 1 MANAFS---KPT---AVVDTAIQMLQNELILTN-LVWLNG------IGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAER 67 (392) Q Consensus 1 Man~~~---~~~---~~~~~~~~~l~~~l~~~~-~v~~~~------~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~ 67 (392) ||.+.+ .|+ +|+..+...-.+...|.+ ++-+.- -.|+.-..||+|+++........... +...- T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~---Gd~~l 77 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTY---GDARV 77 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcc---cCcee Confidence 997774 344 777655555545544544 332211 13555567999999887655422211 11111 Q ss_pred ccccccccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ Q lcl|Aclame:pro 68 NLTVSDFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------------ 134 (392) Q Consensus 68 ~~~~~~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~------------ 134 (392) ....+.+.-.+.+|.||+..+ ++.... ....-...||+.+..+.+..=+++..|+.++-.+.++.. T Consensus 78 eGnee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~ 156 (364) T protein:vir:93 78 EGKEESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFT 156 (364) T ss_pred eccccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcc Confidence 122345555778899988754 555433 233346678888888888888888888877755543210 Q ss_pred --c---ccccc---------------ccccchhhHHHHHHHHHHhhhccC---------C---C--C-CEEEEchHHHHH Q lcl|Aclame:pro 135 --E---AAGAV---------------HEVAPDEFFKGVNGARRALNELYI---------P---Q--G-RVLVVGTAVTEQ 179 (392) Q Consensus 135 --~---~~~~~---------------~~~~~~~~~~~i~~a~~~l~~~~v---------p---~--~-r~~vv~~~~~~~ 179 (392) . ...++ -+.+....++.|-.+...+...+. | . + .+++++|.++.. T Consensus 157 ~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~ 236 (364) T protein:vir:93 157 GYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATD 236 (364) T ss_pred cccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhh Confidence 0 00000 001122356677777776654432 2 1 1 267899999999 Q ss_pred hhc--ccceeeeec---cccceeeeEeeeeeeeEeeeEEEEecceeecccceeecc-----cccccchhhhccccccccc Q lcl|Aclame:pro 180 ILN--DDRFIKYES---QGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT-----AFIMATRAPAPPMGAVRST 249 (392) Q Consensus 180 l~~--~~~~~~~~~---~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~-----a~~~a~~~~~~~~~~~~~~ 249 (392) |.. +++|....+ .+......+..|.+|.+.|+-+++...+........... ++-+..+...... T Consensus 237 Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~------ 310 (364) T protein:vir:93 237 MRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAY------ 310 (364) T ss_pred hhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEe------ Confidence 974 344433322 233334568889999999998877666542211110000 0000000000000 Q ss_pred eeecccceeeeeeec-cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 250 AISGDQRIAMRWLVD-YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 250 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) .. ..+....|.-. .|..... .+.++...|.....=. .. +- .+.+.+.-+..-+ T Consensus 311 -g~-~~g~~~~w~Ee~~D~gn~~-~i~~~~i~G~kK~rF~-~~---Df-Gvi~idtaa~~~~ 364 (364) T protein:vir:93 311 -GT-ANGLRFDWEETVKDYGNEP-AIAAGFIAGMKKARFN-NK---DF-GVISIDTAAKKHS 364 (364) T ss_pred -ec-CCCCCceeeecccCCCCch-hhhhhhHhhhhhcccC-Cc---cc-eEEEecccccccC Confidence 00 01111111111 1111100 1111111111100000 00 00 0000000000000 No 155 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=282 Identities=14% Similarity=0.098 Sum_probs=109.8 Q ss_pred Ccc-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceE Q lcl|Aclame:pro 1 MAN-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) Q Consensus 1 Man-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) -++ .++.|+.+..++.+.|++...+.+++... . .. ..+++++...... ..... ++..+.-.++.=+.+ T Consensus 154 ~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~--~-~~----g~~~~~~~~~~~~--a~wv~--E~~~~~~~~~~f~~i 222 (466) T protein:vir:80 154 VSGAELTIPDVMLELLRDNMHRYSKLISKVRLR--P-LK----GTARQNIAGAIPE--GVWTE--AVANLNELSLSFSQI 222 (466) T ss_pred hccccccccHHHHHHHHHhhhhhhhhhhheeee--e-cC----ceeEeeeecCCcc--eeecc--cccccccccccccce Confidence 111 25789999999999999888777766432 1 11 2345544322111 11111 222232223333445 Q ss_pred EEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc--------cccccc-------- Q lcl|Aclame:pro 80 PVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA--------AGAVHE-------- 142 (392) Q Consensus 80 ~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~--------~~~~~~-------- 142 (392) ++.+.+. +.-+.|+++-+.++..++...+.+..+++++..+|..++.-- .+.|.+. ...... T Consensus 223 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~ 301 (466) T protein:vir:80 223 EVDGYKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTN 301 (466) T ss_pred eecceee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccc Confidence 5544333 234568888787788889999999999999999999876310 0001000 000000 Q ss_pred ccchh----------hHHHHHHHHHHhh--hccCCCC-CEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEe Q lcl|Aclame:pro 143 VAPDE----------FFKGVNGARRALN--ELYIPQG-RVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIY 209 (392) Q Consensus 143 ~~~~~----------~~~~i~~a~~~l~--~~~vp~~-r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~ 209 (392) ..... .+..+.+....+. +.+...+ .++++++..+..+.+-.-. .+..|.- .. ..+.-..+. T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~--~~~~g~~-~~--~~~~~~~i~ 376 (466) T protein:vir:80 302 LSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAIT--FNSAGAL-VA--SLNNTMPIV 376 (466) T ss_pred cchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhccccc--ccCCccc-cc--cCCCccccc Confidence 00000 0011111111111 2222233 3467788777666443211 1111111 00 111112477 Q ss_pred eeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeec Q lcl|Aclame:pro 210 GYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPN 289 (392) Q Consensus 210 g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (392) |.+|+.++.+|.+..+....+.+.+..+............... .........-.+. .... ..+....... T Consensus 377 G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~--d~~~~r~~~r~dg------~~~~-~~afv~~~~~- 446 (466) T protein:vir:80 377 GGDIVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIE--DQTVFKGTARYDG------KPVF-GEGFVAVNIA- 446 (466) T ss_pred ccceeecCccCccceeeeccccEEEEeecceEEEechhhhhhc--CcEEEEEEEEEcc------EEec-cCceEEEEec- Confidence 8899999988876544333332222211111000000000000 0000000000000 0000 0000000000 Q ss_pred cccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 290 GVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 290 ~~~~~~~~~~~~~~~~v~v~~~ 311 (392) .....+.....+.+..+..+ T Consensus 447 --~~~~~~~~~~~~~~~~~~~~ 466 (466) T protein:vir:80 447 --NANPTTSITFAPDEANVPEV 466 (466) T ss_pred --CCCcccceeeecCcCcCCCC Confidence 00000000000000000000 No 156 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=95.19 E-value=0.0017 Score=35.81 Aligned_cols=265 Identities=12% Similarity=0.084 Sum_probs=125.2 Q ss_pred Ccc--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEec-cceeeeccc--cccccCCCcccccccc Q lcl|Aclame:pro 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVP-APSRGHTRK--LRGAGAERNLTVSDFT 75 (392) Q Consensus 1 Man--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~-~~~~~~~~~--~~~~~~~~~~~~~~~~ 75 (392) =++ ..+.|+ |-.-.++.+++.-.+.+++. .+.. +|.|+.-|+- ...++..+. .+...++..+.+..+. T Consensus 136 Tgd~~~~i~~~-~v~d~i~li~q~r~i~slf~-----tLP~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~ 208 (410) T protein:vir:83 136 TGDLQGVIPDP-IVGPVIDFIDSARPLVSTLG-----TLPL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMV 208 (410) T ss_pred ccccccccchh-HhhhHHHHHhhccchhhhhh-----hCCC-CCCeeEEeeeccccccccccccccccccccccccccee Confidence 122 234555 76678888888777766653 2444 4777776543 233333332 1223467778888888 Q ss_pred CceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHH Q lcl|Aclame:pro 76 EDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGA 155 (392) Q Consensus 76 ~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a 155 (392) -++.+..|+.+-+.. .++=.....+.-...+-.++-...+-|+..++.+-..+...... .....-.+.+.....|.++ T Consensus 209 ~~t~tA~ikTyGGyt-~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-~~a~~~~Tad~~~~~i~da 286 (410) T protein:vir:83 209 IDRLTVNAKTLGGYV-NVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-AVGYGNATADNVASAIWQA 286 (410) T ss_pred eeeccceeehhcCcc-cccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhccHHHHHHHHHHH Confidence 888888887765432 23322222233333333333333333444444443333221111 1122233455555667777 Q ss_pred HHHhhhccCC-CCCEEEEchHHHHHhhcccceeeeeccccce----eeeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 156 RRALNELYIP-QGRVLVVGTAVTEQILNDDRFIKYESQGQSA----VSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 156 ~~~l~~~~vp-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~----~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) ....+.+.-- .-+++.|+|+....+. +.|....--|... ...+-+|..|++.+.+|.+....+.+++..+.+. T Consensus 287 ~~~v~da~~~~~~~~i~vS~DVl~~~~--~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~~~~ 364 (410) T protein:vir:83 287 AGAVYTAVKGMGRLVIAIAPDVLGDFG--PLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLFSTA 364 (410) T ss_pred HHHHhhhhccceeeeEEechhhhhhcc--ceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEeccc Confidence 7777765222 2367899999854443 2343322222211 1223367778999999999999998888777655 Q ss_pred cccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) ++..-..... .....+..+++......... ..........+++.+ T Consensus 365 Ai~~~eS~~g--------------------------p~qL~d~~i~nLt~~ySgY~---------a~a~~~~~gliPv~g 409 (410) T protein:vir:83 365 AIECFEQRVG--------------------------TLQVVEPSVFGLQVAYAGYF---------STLVVNEDAIVPLVG 409 (410) T ss_pred eeeeeecCCc--------------------------eeEeeCCchhhhhhhheeee---------eeccccccceeeecc Confidence 5432211100 00000111110000000000 000000001111111 Q ss_pred c Q lcl|Aclame:pro 311 E 311 (392) Q Consensus 311 ~ 311 (392) . T Consensus 410 ~ 410 (410) T protein:vir:83 410 S 410 (410) T ss_pred C Confidence 1 No 157 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=95.11 E-value=0.0027 Score=34.63 Aligned_cols=260 Identities=10% Similarity=0.035 Sum_probs=113.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. -+++|+-+..++++.+++...+-.+++.- . .+ ..++|.... ...+..... ++....-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~--~~-~~~~p~~~~-~~~~a~~v~--Eg~~~~~~~~ 186 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----N--IK-GLEIPRVSY-TLDDDDFIT--DVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----e--cC-Cceeeeeec-cCCcccccc--cccccccccc Confidence 221 34789999999999999988776655431 1 11 134443221 112222222 2333333344 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cc--ccccccccchhhHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY--EA--AGAVHEVAPDEFFK 150 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~--~~--~~~~~~~~~~~~~~ 150 (392) .-..+++...+. +.-+.|+.+-+.++..++...+.++.+++++...++.++..-.+... .. ......++....++ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:26 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 445555554333 23356887777777889988899999999998777666543221111 10 01122234445688 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) +|+++...|+.+..+... +++++..+..++.-- + +. | ..+..|.-..+.|.+|+.+...+..-... T Consensus 266 ~i~~~~~~l~~~y~~na~-~imn~~t~~~~~~~~---~-~~-~----~~~~~~~~~~llG~PV~~~~~~~~~~~GD---- 331 (387) T protein:vir:26 266 AIINALADLHEDYRDNAT-IYMRYADYVKIISVL---S-NG-T----TNFFDTPAEKVFGKPVVFTDAAVKPIVGD---- 331 (387) T ss_pred HHHHHHhccChhhhcCCE-EEEechHHHHHHHHH---h-cC-C----CcccccCCccccccceEEecCCCceeeec---- Confidence 999887777665444444 456666655543210 0 01 1 12334555678899998876543210000 Q ss_pred cccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.... ..........+. ................+.+.. ...+...... .... T Consensus 332 -f~~~~--------------~~~~~~~~~~~~---~~~~~~~~~~~~~r~Dg~v~~---------~~A~~~l~~k-a~~~ 383 (387) T protein:vir:26 332 -FNYFG--------------INYDGTTYDTDK---DVKKGEYLFVLTAWYDQQRTL---------DSAFRIAKAK-ENTG 383 (387) T ss_pred -hhhhh--------------hhhhhhhheecc---cccCCceEEEEEEEeCcEeec---------hhheEEEEee-cCCC Confidence 00000 000000000000 000000000000000111110 0000000000 0000 Q ss_pred cccc Q lcl|Aclame:pro 311 EAGA 314 (392) Q Consensus 311 ~~~~ 314 (392) ..++ T Consensus 384 ~~~~ 387 (387) T protein:vir:26 384 PLPS 387 (387) T ss_pred CCCC Confidence 0011 No 158 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=95.11 E-value=0.0027 Score=34.63 Aligned_cols=260 Identities=10% Similarity=0.035 Sum_probs=113.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. -+++|+-+..++++.+++...+-.+++.- . .+ ..++|.... ...+..... ++....-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~--~~-~~~~p~~~~-~~~~a~~v~--Eg~~~~~~~~ 186 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----N--IK-GLEIPRVSY-TLDDDDFIT--DVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----e--cC-Cceeeeeec-cCCcccccc--cccccccccc Confidence 221 34789999999999999988776655431 1 11 134443221 112222222 2333333344 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cc--ccccccccchhhHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY--EA--AGAVHEVAPDEFFK 150 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~--~~--~~~~~~~~~~~~~~ 150 (392) .-..+++...+. +.-+.|+.+-+.++..++...+.++.+++++...++.++..-.+... .. ......++....++ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:94 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 445555554333 23356887777777889988899999999998777666543221111 10 01122234445688 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) +|+++...|+.+..+... +++++..+..++.-- + +. | ..+..|.-..+.|.+|+.+...+..-... T Consensus 266 ~i~~~~~~l~~~y~~na~-~imn~~t~~~~~~~~---~-~~-~----~~~~~~~~~~llG~PV~~~~~~~~~~~GD---- 331 (387) T protein:vir:94 266 AIINALADLHEDYRDNAT-IYMRYADYVKIISVL---S-NG-T----TNFFDTPAEKVFGKPVVFTDAAVKPIVGD---- 331 (387) T ss_pred HHHHHHhccChhhhcCCE-EEEechHHHHHHHHH---h-cC-C----CcccccCCccccccceEEecCCCceeeec---- Confidence 999887777665444444 456666655543210 0 01 1 12334555678899998876543210000 Q ss_pred cccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.... ..........+. ................+.+.. ...+...... .... T Consensus 332 -f~~~~--------------~~~~~~~~~~~~---~~~~~~~~~~~~~r~Dg~v~~---------~~A~~~l~~k-a~~~ 383 (387) T protein:vir:94 332 -FNYFG--------------INYDGTTYDTDK---DVKKGEYLFVLTAWYDQQRTL---------DSAFRIAKAK-ENTG 383 (387) T ss_pred -hhhhh--------------hhhhhhhheecc---cccCCceEEEEEEEeCcEeec---------hhheEEEEee-cCCC Confidence 00000 000000000000 000000000000000111110 0000000000 0000 Q ss_pred cccc Q lcl|Aclame:pro 311 EAGA 314 (392) Q Consensus 311 ~~~~ 314 (392) ..++ T Consensus 384 ~~~~ 387 (387) T protein:vir:94 384 PLPS 387 (387) T ss_pred CCCC Confidence 0011 No 159 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=95.11 E-value=0.0027 Score=34.63 Aligned_cols=260 Identities=10% Similarity=0.035 Sum_probs=113.8 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. -+++|+-+..++++.+++...+-.+++.- . .+ ..++|.... ...+..... ++....-.++ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~--~~-~~~~p~~~~-~~~~a~~v~--Eg~~~~~~~~ 186 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----N--IK-GLEIPRVSY-TLDDDDFIT--DVETAKELKA 186 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----e--cC-Cceeeeeec-cCCcccccc--cccccccccc Confidence 221 34789999999999999988776655431 1 11 134443221 112222222 2333333344 Q ss_pred cCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cc--ccccccccchhhHH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY--EA--AGAVHEVAPDEFFK 150 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~--~~--~~~~~~~~~~~~~~ 150 (392) .-..+++...+. +.-+.|+.+-+.++..++...+.++.+++++...++.++..-.+... .. ......++....++ T Consensus 187 ~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d 265 (387) T protein:vir:96 187 KGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred ccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHH Confidence 445555554333 23356887777777889988899999999998777666543221111 10 01122234445688 Q ss_pred HHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecc Q lcl|Aclame:pro 151 GVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPT 230 (392) Q Consensus 151 ~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~ 230 (392) +|+++...|+.+..+... +++++..+..++.-- + +. | ..+..|.-..+.|.+|+.+...+..-... T Consensus 266 ~i~~~~~~l~~~y~~na~-~imn~~t~~~~~~~~---~-~~-~----~~~~~~~~~~llG~PV~~~~~~~~~~~GD---- 331 (387) T protein:vir:96 266 AIINALADLHEDYRDNAT-IYMRYADYVKIISVL---S-NG-T----TNFFDTPAEKVFGKPVVFTDAAVKPIVGD---- 331 (387) T ss_pred HHHHHHhccChhhhcCCE-EEEechHHHHHHHHH---h-cC-C----CcccccCCccccccceEEecCCCceeeec---- Confidence 999887777665444444 456666655543210 0 01 1 12334555678899998876543210000 Q ss_pred cccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeecc Q lcl|Aclame:pro 231 AFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAP 310 (392) Q Consensus 231 a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~ 310 (392) +.... ..........+. ................+.+.. ...+...... .... T Consensus 332 -f~~~~--------------~~~~~~~~~~~~---~~~~~~~~~~~~~r~Dg~v~~---------~~A~~~l~~k-a~~~ 383 (387) T protein:vir:96 332 -FNYFG--------------INYDGTTYDTDK---DVKKGEYLFVLTAWYDQQRTL---------DSAFRIAKAK-ENTG 383 (387) T ss_pred -hhhhh--------------hhhhhhhheecc---cccCCceEEEEEEEeCcEeec---------hhheEEEEee-cCCC Confidence 00000 000000000000 000000000000000111110 0000000000 0000 Q ss_pred cccc Q lcl|Aclame:pro 311 EAGA 314 (392) Q Consensus 311 ~~~~ 314 (392) ..++ T Consensus 384 ~~~~ 387 (387) T protein:vir:96 384 PLPS 387 (387) T ss_pred CCCC Confidence 0011 No 160 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=95.10 E-value=0.0028 Score=34.61 Aligned_cols=273 Identities=7% Similarity=-0.042 Sum_probs=113.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |. -.++.|+.+..++++.+++..++..++..- .+. +.+.+.+.|............++ ....- .. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~~---~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVR---TRSGSRVLEKNSDMIPFAEITEM--GEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ecc---CCceeEEEEeecCCccceeeccc--cccccccc Confidence 32 134789999999999999999887766432 111 22233333322211122222222 22211 12 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+++-+..+..++...+.+..+++|+..+|..++...... .......+++++ T Consensus 178 ~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~d~i~ 247 (392) T protein:vir:10 178 PKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSLDDIK 247 (392) T ss_pred ccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCHHHHH Confidence 3334555554333 355578887777777889999999999999999998887533221 112234577777 Q ss_pred HHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEe--cceeecccceeec Q lcl|Aclame:pro 154 GAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVES--TLIPHGDAYLYHP 229 (392) Q Consensus 154 ~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s--~~v~~~~~~~~~~ 229 (392) ++. ..|..... .+-.++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|+.+... +..+......... T Consensus 248 ~~~~~~l~~~~~-~~a~~vm~~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~ 321 (392) T protein:vir:10 248 DVLNVKLDPAIS-PNAILLTNQDGFNYLDKLKD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) T ss_pred HHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhhc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc Confidence 764 34544433 23468899999998854211 1121100 1122344556777654432 1222111110000 Q ss_pred ccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ..+ ..+........+. .+....+..........+.. ......+..+.... + ...+......+ T Consensus 322 ~~~---------~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---a---~~~l~~~~~a~ 386 (392) T protein:vir:10 322 APL---------IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE---A---AVYGEIDLSAP 386 (392) T ss_pred eEE---------EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---c---eEEEEeccccc Confidence 000 0000000000000 00011110000000000000 00000111111000 0 00111111111 Q ss_pred eecccc Q lcl|Aclame:pro 307 EVAPEA 312 (392) Q Consensus 307 ~v~~~~ 312 (392) ..++.. T Consensus 387 ~~~~~~ 392 (392) T protein:vir:10 387 VEQPQG 392 (392) T ss_pred ccCCCC Confidence 111111 No 161 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=95.10 E-value=0.0028 Score=34.61 Aligned_cols=273 Identities=7% Similarity=-0.042 Sum_probs=113.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |. -.++.|+.+..++++.+++..++..++..- .+. +.+.+.+.|............++ ....- .. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~~---~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVR---TRSGSRVLEKNSDMIPFAEITEM--GEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ecc---CCceeEEEEeecCCccceeeccc--cccccccc Confidence 32 134789999999999999999887766432 111 22233333322211122222222 22211 12 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+++-+..+..++...+.+..+++|+..+|..++...... .......+++++ T Consensus 178 ~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~d~i~ 247 (392) T protein:vir:10 178 PKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSLDDIK 247 (392) T ss_pred ccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCHHHHH Confidence 3334555554333 355578887777777889999999999999999998887533221 112234577777 Q ss_pred HHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEe--cceeecccceeec Q lcl|Aclame:pro 154 GAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVES--TLIPHGDAYLYHP 229 (392) Q Consensus 154 ~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s--~~v~~~~~~~~~~ 229 (392) ++. ..|..... .+-.++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|+.+... +..+......... T Consensus 248 ~~~~~~l~~~~~-~~a~~vm~~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~ 321 (392) T protein:vir:10 248 DVLNVKLDPAIS-PNAILLTNQDGFNYLDKLKD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) T ss_pred HHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhhc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc Confidence 764 34544433 23468899999998854211 1121100 1122344556777654432 1222111110000 Q ss_pred ccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ..+ ..+........+. .+....+..........+.. ......+..+.... + ...+......+ T Consensus 322 ~~~---------~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---a---~~~l~~~~~a~ 386 (392) T protein:vir:10 322 APL---------IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE---A---AVYGEIDLSAP 386 (392) T ss_pred eEE---------EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---c---eEEEEeccccc Confidence 000 0000000000000 00011110000000000000 00000111111000 0 00111111111 Q ss_pred eecccc Q lcl|Aclame:pro 307 EVAPEA 312 (392) Q Consensus 307 ~v~~~~ 312 (392) ..++.. T Consensus 387 ~~~~~~ 392 (392) T protein:vir:10 387 VEQPQG 392 (392) T ss_pred ccCCCC Confidence 111111 No 162 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=95.10 E-value=0.0028 Score=34.61 Aligned_cols=273 Identities=7% Similarity=-0.042 Sum_probs=113.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |. -.++.|+.+..++++.+++..++..++..- .+. +.+.+.+.|............++ ....- .. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~~---~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVR---TRSGSRVLEKNSDMIPFAEITEM--GEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ecc---CCceeEEEEeecCCccceeeccc--cccccccc Confidence 32 134789999999999999999887766432 111 22233333322211122222222 22211 12 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+++-+..+..++...+.+..+++|+..+|..++...... .......+++++ T Consensus 178 ~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~d~i~ 247 (392) T protein:vir:10 178 PKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSLDDIK 247 (392) T ss_pred ccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCHHHHH Confidence 3334555554333 355578887777777889999999999999999998887533221 112234577777 Q ss_pred HHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEe--cceeecccceeec Q lcl|Aclame:pro 154 GAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVES--TLIPHGDAYLYHP 229 (392) Q Consensus 154 ~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s--~~v~~~~~~~~~~ 229 (392) ++. ..|..... .+-.++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|+.+... +..+......... T Consensus 248 ~~~~~~l~~~~~-~~a~~vm~~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~ 321 (392) T protein:vir:10 248 DVLNVKLDPAIS-PNAILLTNQDGFNYLDKLKD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) T ss_pred HHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhhc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc Confidence 764 34544433 23468899999998854211 1121100 1122344556777654432 1222111110000 Q ss_pred ccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ..+ ..+........+. .+....+..........+.. ......+..+.... + ...+......+ T Consensus 322 ~~~---------~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---a---~~~l~~~~~a~ 386 (392) T protein:vir:10 322 APL---------IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE---A---AVYGEIDLSAP 386 (392) T ss_pred eEE---------EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---c---eEEEEeccccc Confidence 000 0000000000000 00011110000000000000 00000111111000 0 00111111111 Q ss_pred eecccc Q lcl|Aclame:pro 307 EVAPEA 312 (392) Q Consensus 307 ~v~~~~ 312 (392) ..++.. T Consensus 387 ~~~~~~ 392 (392) T protein:vir:10 387 VEQPQG 392 (392) T ss_pred ccCCCC Confidence 111111 No 163 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=95.10 E-value=0.0028 Score=34.61 Aligned_cols=273 Identities=7% Similarity=-0.042 Sum_probs=113.8 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCcccc-cc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTV-SD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~-~~ 73 (392) |. -.++.|+.+..++++.+++..++..++..- .+. +.+.+.+.|............++ ....- .. T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~---~~~---~~~~~~~~~~~~~~~~a~~v~E~--~~~~~~~~ 177 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE---PVR---TRSGSRVLEKNSDMIPFAEITEM--GEIPETDN 177 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee---ecc---CCceeEEEEeecCCccceeeccc--cccccccc Confidence 32 134789999999999999999887766432 111 22233333322211122222222 22211 12 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHH Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVN 153 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 153 (392) +.-..+++...+. +.-+.|+++-+..+..++...+.+..+++|+..+|..++...... .......+++++ T Consensus 178 ~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~d~i~ 247 (392) T protein:vir:10 178 PKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSLDDIK 247 (392) T ss_pred ccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCHHHHH Confidence 3334555554333 355578887777777889999999999999999998887533221 112234577777 Q ss_pred HHH-HHhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEe--cceeecccceeec Q lcl|Aclame:pro 154 GAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVES--TLIPHGDAYLYHP 229 (392) Q Consensus 154 ~a~-~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s--~~v~~~~~~~~~~ 229 (392) ++. ..|..... .+-.++++|..+..|.+-.. ..|.-.. ..+..|..+.+.|+.+... +..+......... T Consensus 248 ~~~~~~l~~~~~-~~a~~vm~~~~~~~L~~lkd-----~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~ 321 (392) T protein:vir:10 248 DVLNVKLDPAIS-PNAILLTNQDGFNYLDKLKD-----KDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKK 321 (392) T ss_pred HHHHHhhhhhhc-cCCEEEEcHHHHHHHHHhhc-----cCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCc Confidence 764 34544433 23468899999998854211 1121100 1122344556777654432 1222111110000 Q ss_pred ccccccchhhhccccccccceeecc-cceeeeeeeccccceeeeec--ccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRS--LIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) ..+ ..+........+. .+....+..........+.. ......+..+.... + ...+......+ T Consensus 322 ~~~---------~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---a---~~~l~~~~~a~ 386 (392) T protein:vir:10 322 APL---------IIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNE---A---AVYGEIDLSAP 386 (392) T ss_pred eEE---------EEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---c---eEEEEeccccc Confidence 000 0000000000000 00011110000000000000 00000111111000 0 00111111111 Q ss_pred eecccc Q lcl|Aclame:pro 307 EVAPEA 312 (392) Q Consensus 307 ~v~~~~ 312 (392) ..++.. T Consensus 387 ~~~~~~ 392 (392) T protein:vir:10 387 VEQPQG 392 (392) T ss_pred ccCCCC Confidence 111111 No 164 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=95.10 E-value=0.0028 Score=34.60 Aligned_cols=259 Identities=10% Similarity=0.052 Sum_probs=111.6 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.- -+++|+-+..++++.+++...+-.++..- . .| ...+|.... ...+....+ ++......++ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~-----~--~~-~~~~p~~~~-~~~~a~~v~--E~~~~~~~~~ 186 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----N--IK-GLEIPRVSY-TLDDDDFIT--DVETAKELKL 186 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeee-----e--cC-CceEEEEee-cCCcccccc--Cccccccccc Confidence 221 24789999999999999988776655431 1 11 133443211 111122222 2333333444 Q ss_pred cCceEEEEEEeeeecc-eEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----ccccccccchhhH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA----AGAVHEVAPDEFF 149 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~-~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~----~~~~~~~~~~~~~ 149 (392) .-+.+++.. +++.. +.|+.+-+..+..++...+.+..+++++...++.++..-.+..... ......++....| T Consensus 187 ~f~~v~~~~--~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~ 264 (387) T protein:vir:93 187 KGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMY 264 (387) T ss_pred ccceeeeeh--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchH Confidence 445555544 44434 5688777777778898888889999999887776653222111100 0111223444568 Q ss_pred HHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeec Q lcl|Aclame:pro 150 KGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHP 229 (392) Q Consensus 150 ~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~ 229 (392) ++|+++...|+.+...... +++++..+..++.- + + +..| .+..|.-.++.|.+|+.+...+..-... T Consensus 265 d~i~~~~~~l~~~~~~~a~-~~mn~~t~~~~~~~--~-~-d~~~-----~~~~~~~~~llG~PV~~~~~~~~~~~GD--- 331 (387) T protein:vir:93 265 DAIINALADLHEDYRDNAT-IYMRYADYVKIISV--L-S-NGTT-----NFFDTPAEKVFGKPVVFTDAAVKPIVGD--- 331 (387) T ss_pred HHHHHHHhccChhhhcCCE-EEEechHHHHHHHH--H-h-cCCC-----cccccCCccccccceEEecCCCceeeee--- Confidence 8898887777665544444 46676655444321 0 0 0111 1223444578899998876443210000 Q ss_pred ccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeec Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA 309 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 309 (392) +... +.. ..+...... . ...............++.+....... ...+.. .. T Consensus 332 --f~~~--------------~~~-~~~~~~~~~-~-~~~~~~~~~~~~~r~d~~v~~~eA~~----~l~~k~------~~ 382 (387) T protein:vir:93 332 --FNYF--------------GIN-YDGTTYDTD-K-DVKKGEYLFVLTAWYDQQRTLDSAFR----IAKAKE------NT 382 (387) T ss_pred --hhhh--------------hee-hhhheeeec-c-cccCCceeEEEEeeeCceeechhheE----EEEeec------CC Confidence 0000 000 000000000 0 00000000000000111111000000 000000 00 Q ss_pred ccccc Q lcl|Aclame:pro 310 PEAGA 314 (392) Q Consensus 310 ~~~~~ 314 (392) .-.++ T Consensus 383 ~~~~~ 387 (387) T protein:vir:93 383 GSLPS 387 (387) T ss_pred CCCCC Confidence 00001 No 165 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=95.09 E-value=0.0028 Score=34.60 Aligned_cols=256 Identities=11% Similarity=0.066 Sum_probs=106.4 Q ss_pred Ccc---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCccccccccC Q lcl|Aclame:pro 1 MAN---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTE 76 (392) Q Consensus 1 Man---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) .+. -.+.|+.+.+++++.|++.-.+-+++++. . .. ..++||.... ..+.+. .+ ...+.. .... T Consensus 82 ~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~--~-~~----~~~~i~~~~~~~~a~wv---~e--~~~~~~-~~~~ 148 (377) T protein:vir:96 82 VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK--N-TS----LRLKALTAETSGTAVWG---DI--FGEIKG-QLKQ 148 (377) T ss_pred CCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeE--e-cC----CceEEEEecCCcceeEe---ec--cccccc-ccCc Confidence 111 34789999999999999988887877642 2 22 2366766433 222222 11 111111 1112 Q ss_pred ceEEEEEEeeeecce-EeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHH---------HHhccccc-----cccc-- Q lcl|Aclame:pro 77 DSFPVTLTDVAYHLG-VLTDEELTFDLESFATQILPRQVRGVADILEEGVRD---------MIVGAPYE-----AAGA-- 139 (392) Q Consensus 77 ~~~~~~i~~~~~~~~-~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~---------~~~~~~~~-----~~~~-- 139 (392) .--.++|..++...+ .|+.+-+.++..++...+.+..+++++..+|..++. .+...... .... T Consensus 149 ~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~ 228 (377) T protein:vir:96 149 AFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDIT 228 (377) T ss_pred cceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccc Confidence 223445555555444 577777777888998999999999999999988763 11100000 0000 Q ss_pred -----------cccccchhhHHHHHHHHHHhhhcc--CC----CCCEEEEchHHHHHhhcccceeeeeccccceeeeEee Q lcl|Aclame:pro 140 -----------VHEVAPDEFFKGVNGARRALNELY--IP----QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQE 202 (392) Q Consensus 140 -----------~~~~~~~~~~~~i~~a~~~l~~~~--vp----~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~ 202 (392) .....++..++-+..+.+.+..++ -| .+-+++++|..+..+... +......| T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~--~~~~~~~G--------- 297 (377) T protein:vir:96 229 TYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK--FTSRNQFG--------- 297 (377) T ss_pred ceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccc--ccccCCCC--------- Confidence 001122223333334444443321 12 123577888877655322 11112222 Q ss_pred eeeeeEee--eEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccce Q lcl|Aclame:pro 203 ARLGRIYG--YEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYF 280 (392) Q Consensus 203 g~ig~~~g--~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (392) ....+.| ..+..+..+|.+.......+......+............... .-...... ... T Consensus 298 -~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~--d~~~f~~~---------------~r~ 359 (377) T protein:vir:96 298 -EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME--DLQLYLTK---------------NYF 359 (377) T ss_pred -CceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhhhhhc--CCeEEEEE---------------EEE Confidence 2223333 345556666644332222111111111100000000000000 00000000 000 Q ss_pred eeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 281 GLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) .+........ ..+.++.. T Consensus 360 dG~~~d~~a~-------------~vl~l~~~ 377 (377) T protein:vir:96 360 YGKAKDNHTA-------------ALLTLAGG 377 (377) T ss_pred cCEEecCCcE-------------EEEEEecC Confidence 0000000000 00000000 No 166 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=95.09 E-value=0.0028 Score=34.59 Aligned_cols=272 Identities=13% Similarity=0.099 Sum_probs=106.0 Q ss_pred Cc-cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecccee-eeccccccccCCCccc-cccccCc Q lcl|Aclame:pro 1 MA-NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTRKLRGAGAERNLT-VSDFTED 77 (392) Q Consensus 1 Ma-n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~ 77 (392) -+ --.++|+.+.+++++.|++..++.+++++- ... | .++||+..... +.+.. + ...+. -.++.=. T Consensus 91 ~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~---~~~---~-~~~i~~~~~~~~a~w~~---e--~~~~~~~~~~~f~ 158 (395) T protein:vir:95 91 GYTDEKILPETVVERVFDDLQKDHPLLSKINFQ---NAG---I-KTRVIKADPAGQAVWGK---V--FGEIKGQLDAAFR 158 (395) T ss_pred CCCCceeccHHHHHHHHHHHHhhhhhhhhceeE---ecC---C-ceEEEEecCCcceEEee---c--ccccCccccccce Confidence 11 123689999999999999999888887642 122 3 45776643322 22211 1 11111 1122223 Q ss_pred eEEEEEEeeeec-ceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHh---cccccccc-------c-ccc-cc Q lcl|Aclame:pro 78 SFPVTLTDVAYH-LGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIV---GAPYEAAG-------A-VHE-VA 144 (392) Q Consensus 78 ~~~~~i~~~~~~-~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~---~~~~~~~~-------~-~~~-~~ 144 (392) . +++.-++.. -+.|+.+-+.++..++...+.+..+++++.++|+.++.--. ..|.+.-. . ... .. T Consensus 159 ~--i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~ 236 (395) T protein:vir:95 159 E--ENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASS 236 (395) T ss_pred e--eeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeeccccccccccccccc Confidence 4 444445444 44677777777888998999999999999999997763110 01111000 0 000 00 Q ss_pred ----c---hhhHHHHHHHHHHhh------hccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeE--e Q lcl|Aclame:pro 145 ----P---DEFFKGVNGARRALN------ELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRI--Y 209 (392) Q Consensus 145 ----~---~~~~~~i~~a~~~l~------~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~--~ 209 (392) . ...+..+.++...|. .........+++++..+..+....-+.. . .|...++ + T Consensus 237 ~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~--~----------~G~~~~~lg~ 304 (395) T protein:vir:95 237 GTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLT--A----------NGGFVTVLPY 304 (395) T ss_pred chhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceecc--C----------CCcceeccCC Confidence 0 112223333322221 0011123356788876655433222211 1 2333333 3 Q ss_pred eeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeec Q lcl|Aclame:pro 210 GYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPN 289 (392) Q Consensus 210 g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (392) |..++.+..+|.+.......+.+....+. +.........-..............++.+..... T Consensus 305 g~~v~~~~~~p~~~i~fgdfs~y~i~~r~-----------------~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A 367 (395) T protein:vir:95 305 NVTIITSEFVPEGKLVAFVTDRYNAVRGG-----------------GLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKA 367 (395) T ss_pred cceEEEcCCCCCCcEEEEecccEEEEEec-----------------ceEEEeccchhhhCCcEEEEEEEEECCEEecccc Confidence 55677788777544222111111111000 0000000000000000000000000111111000 Q ss_pred cccceeeeeccceeeeeee--cccccccceeeeeeccCeeEEE Q lcl|Aclame:pro 290 GVGFVRARKIHLIPGSIEV--APEAGANATITAAAGEDHTVQL 330 (392) Q Consensus 290 ~~~~~~~~~~~~~~~~v~v--~~~~~~~~~~~~~~~~~~t~~~ 330 (392) .. ...++....++.. .+-..+.. +.+ T Consensus 368 ~~----~l~i~~~~~~~~~~~~~~~~~~~-----------~~~ 395 (395) T protein:vir:95 368 SA----VYDLKVASAPRRQTSAGGTTDGI-----------AEA 395 (395) T ss_pred EE----EEEeeccCCCCCCCCCCCCCCcc-----------ccC Confidence 00 0001100000000 00000000 000 No 167 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=94.74 E-value=0.0036 Score=33.96 Aligned_cols=272 Identities=13% Similarity=0.043 Sum_probs=117.2 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce--eeeccccccccCCCccccccccCce Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS--RGHTRKLRGAGAERNLTVSDFTEDS 78 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (392) ..--.+.|+... +.++.+++.-.|.+++++.- ... -.+.+|+..+.. ........ ++.....-.+++-++ T Consensus 19 ~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~--t~~---s~~~~i~~i~~g~~~~~~~~~~--~~~~~~~~~~~tf~~ 90 (314) T protein:vir:41 19 LGKGILAVQRFG-EFVREVRENSAIIKDARVLN--ALK---SYEVDISRISLGVELEPGRNTS--GTKVAPTADEVTVST 90 (314) T ss_pred CCCceeChHHHH-HHHHHHHhccchhhheeeec--ccC---ccceeecccccCcccccccccc--cCCccCCcccccccc Confidence 222347899974 68899999999988876531 111 123566554321 11111111 111222334455567 Q ss_pred EEEEEEeeeecceEeeHHHHhhhcc--ChHHHHHHHHHHHHHHHHHHHHHHHHhc---------cccc----cccc---c Q lcl|Aclame:pro 79 FPVTLTDVAYHLGVLTDEELTFDLE--SFATQILPRQVRGVADILEEGVRDMIVG---------APYE----AAGA---V 140 (392) Q Consensus 79 ~~~~i~~~~~~~~~i~d~~~~~~~~--~~~~~~~~~~~~ala~~vd~~~~~~~~~---------~~~~----~~~~---~ 140 (392) +++...+.. ..+.|+++.+..... +|...+....+++++.+.+...+.-=.. .+.+ .... . T Consensus 91 ~~l~~~kl~-~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~ 169 (314) T protein:vir:41 91 NTLEMKELV-TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDA 169 (314) T ss_pred eeeeeEEEE-EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeec Confidence 777765554 467899988888775 8888888888999999887766531100 0000 0000 1 Q ss_pred ccccchhhHHHHHHHHHHhhhcc--CCCCCEEEEchHHHHHhhc--ccceeeeeccccceeeeEeeeeeeeEeeeEEEEe Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELY--IPQGRVLVVGTAVTEQILN--DDRFIKYESQGQSAVSALQEARLGRIYGYEIVES 216 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~--vp~~r~~vv~~~~~~~l~~--~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s 216 (392) ...+.....+.+.++...|...- -+.+-.++++++....+.+ +.+ ....|. ..+..|....+.|++|+.. T Consensus 170 ~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~---~~~l~~---~~~~~~~~~~l~G~PV~~~ 243 (314) T protein:vir:41 170 EPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVR---ETGLGD---SALIGATGLQYDGIPIQYV 243 (314) T ss_pred CccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhcc---CCcccc---hhhhCCCCceecceeeEec Confidence 11112233444555555554321 1122246679888777653 111 111222 2344566667889999988 Q ss_pred cceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceee Q lcl|Aclame:pro 217 TLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRA 296 (392) Q Consensus 217 ~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (392) +.++.-... ...+.+..-.+... +. ..........+...............+... ..... T Consensus 244 ~~~~~~~~~---~~~i~fgd~~nlv~----------~~-~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~-~~~aa----- 303 (314) T protein:vir:41 244 PALDALGDD---KARALLTVPTNLVY----------GF-WRNIRIEPKRDAAMRRTEYIASLRADCNYE-DENAA----- 303 (314) T ss_pred ccccccCCC---CceEEEechhheEE----------Ee-eceeEEeecccCcCCeEEEEEEEEeceEEE-EcCcE----- Confidence 877532210 00011110000000 00 000000000000000000000000000000 00000 Q ss_pred eeccceeeeeeecccccccce Q lcl|Aclame:pro 297 RKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 297 ~~~~~~~~~v~v~~~~~~~~~ 317 (392) . .. -+.-+... T Consensus 304 ~-----~~-----~~~~~~~~ 314 (314) T protein:vir:41 304 V-----AA-----VIDMSSGG 314 (314) T ss_pred E-----EE-----EeeccCCC Confidence 0 00 00000000 No 168 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=94.37 E-value=0.0046 Score=33.40 Aligned_cols=270 Identities=15% Similarity=0.122 Sum_probs=108.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~ 72 (392) |.- -.++|+.|...+++.+++...+..++++- .-. +..++||+... ..+.+ . +++...... T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~-----~~~-~~~~~~~~~~~~~~~a~w---v--~E~~~~~~s 219 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR-----PVT-SPNLSYLTESAAHNNAAA---V--AEAGTYPFS 219 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc-----ccC-CCceEEEEEcCCCCccee---e--ccCcccccc Confidence 221 23678889999999999999887776542 211 34578776422 12222 2 233334444 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc----c----cccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA----A----GAVHEV 143 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~----~----~~~~~~ 143 (392) ++.-..+++...+.. .-+.|+.+-+. +..++...+.+..+++|+..+|..++.-- .+.+.+. . ...... T Consensus 220 ~~~f~~i~~~~~k~a-~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 297 (497) T protein:vir:78 220 SEEFARVYEQVGKVA-NALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSL 297 (497) T ss_pred cccceeeEeeeeeeE-eecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccc Confidence 554455666554433 23456666554 44567777888889999999998876310 0000000 0 000000 Q ss_pred ----------------------------------------------------cchhhHHHHHHHHHHhhhccCCCCCEEE Q lcl|Aclame:pro 144 ----------------------------------------------------APDEFFKGVNGARRALNELYIPQGRVLV 171 (392) Q Consensus 144 ----------------------------------------------------~~~~~~~~i~~a~~~l~~~~vp~~r~~v 171 (392) ........+..+...+.....-..-.++ T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 377 (497) T protein:vir:78 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVV 377 (497) T ss_pred hhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEE Confidence 0000111111111111111110112578 Q ss_pred EchHHHHHhhc--cc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc--ccccchhhhccccc Q lcl|Aclame:pro 172 VGTAVTEQILN--DD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA--FIMATRAPAPPMGA 245 (392) Q Consensus 172 v~~~~~~~l~~--~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a--~~~a~~~~~~~~~~ 245 (392) ++|..+..|.+ |. ++.-....+...... .+...++.|..|+.++.+|.+......-+. .....+ T Consensus 378 mn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~--~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-------- 447 (497) T protein:vir:78 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNP--VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-------- 447 (497) T ss_pred EchHHHHHHHHhhcCCCceeccCccccccccc--ccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-------- Confidence 89988887753 32 122111111111111 122347889999999998865432111110 000000 Q ss_pred cccceeecccceeeeeeeccccceeeeeccc--ccceeeeEEEeeccccceeeeeccceeeeeeecccccccce Q lcl|Aclame:pro 246 VRSTAISGDQRIAMRWLVDYDSTITSNRSLI--DTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~ 317 (392) .+....+..........+...+ -.-.+..+... ..+ +.+.-......+ T Consensus 448 ---------~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p---------~A~------~~l~~~~~~~~~ 497 (497) T protein:vir:78 448 ---------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP---------SAF------QLIQLKKGATGS 497 (497) T ss_pred ---------cccEEEeecccchhhhcCcEEEEEEEeecceeecc---------ccE------EEEEecCCccCC Confidence 0000000000000000000000 00000000000 000 000000000000 No 169 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=94.37 E-value=0.0046 Score=33.40 Aligned_cols=270 Identities=15% Similarity=0.122 Sum_probs=108.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~~~ 72 (392) |.- -.++|+.|...+++.+++...+..++++- .-. +..++||+... ..+.+ . +++...... T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~-----~~~-~~~~~~~~~~~~~~~a~w---v--~E~~~~~~s 219 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR-----PVT-SPNLSYLTESAAHNNAAA---V--AEAGTYPFS 219 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc-----ccC-CCceEEEEEcCCCCccee---e--ccCcccccc Confidence 221 23678889999999999999887776542 211 34578776422 12222 2 233334444 Q ss_pred cccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc----c----cccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA----A----GAVHEV 143 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~----~----~~~~~~ 143 (392) ++.-..+++...+.. .-+.|+.+-+. +..++...+.+..+++|+..+|..++.-- .+.+.+. . ...... T Consensus 220 ~~~f~~i~~~~~k~a-~~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~ 297 (497) T protein:vir:10 220 SEEFARVYEQVGKVA-NALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSL 297 (497) T ss_pred cccceeeEeeeeeeE-eecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccc Confidence 554455666554433 23456666554 44567777888889999999998876310 0000000 0 000000 Q ss_pred ----------------------------------------------------cchhhHHHHHHHHHHhhhccCCCCCEEE Q lcl|Aclame:pro 144 ----------------------------------------------------APDEFFKGVNGARRALNELYIPQGRVLV 171 (392) Q Consensus 144 ----------------------------------------------------~~~~~~~~i~~a~~~l~~~~vp~~r~~v 171 (392) ........+..+...+.....-..-.++ T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 377 (497) T protein:vir:10 298 FGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVV 377 (497) T ss_pred hhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEE Confidence 0000111111111111111110112578 Q ss_pred EchHHHHHhhc--cc--ceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeeccc--ccccchhhhccccc Q lcl|Aclame:pro 172 VGTAVTEQILN--DD--RFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTA--FIMATRAPAPPMGA 245 (392) Q Consensus 172 v~~~~~~~l~~--~~--~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a--~~~a~~~~~~~~~~ 245 (392) ++|..+..|.+ |. ++.-....+...... .+...++.|..|+.++.+|.+......-+. .....+ T Consensus 378 mn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~--~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-------- 447 (497) T protein:vir:10 378 MNPRDWELLRLTKDANGQYMGGNFFGNAYGNP--VNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-------- 447 (497) T ss_pred EchHHHHHHHHhhcCCCceeccCccccccccc--ccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-------- Confidence 89988887753 32 122111111111111 122347889999999998865432111110 000000 Q ss_pred cccceeecccceeeeeeeccccceeeeeccc--ccceeeeEEEeeccccceeeeeccceeeeeeecccccccce Q lcl|Aclame:pro 246 VRSTAISGDQRIAMRWLVDYDSTITSNRSLI--DTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~ 317 (392) .+....+..........+...+ -.-.+..+... ..+ +.+.-......+ T Consensus 448 ---------~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p---------~A~------~~l~~~~~~~~~ 497 (497) T protein:vir:10 448 ---------EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP---------SAF------QLIQLKKGATGS 497 (497) T ss_pred ---------cccEEEeecccchhhhcCcEEEEEEEeecceeecc---------ccE------EEEEecCCccCC Confidence 0000000000000000000000 00000000000 000 000000000000 No 170 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=94.28 E-value=0.0048 Score=33.27 Aligned_cols=259 Identities=10% Similarity=0.070 Sum_probs=112.1 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccc Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDF 74 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (392) |.. -+++|+-+..++++.+++...+-.+++.- . .. | .++|..... ..+.....+ +....-.++ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~--~-~~---~--~~~p~~~~~-~~~a~~v~E--g~~~~~~~~ 201 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--N-IK---G--LEIPRVSYT-LDDDDFITD--VETAKELKA 201 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee--e-cC---C--ceeeeeecc-CCccccccc--ccccccccc Confidence 221 24789999999999999988776665431 1 11 1 344432211 111222222 322333344 Q ss_pred cCceEEEEEEeeeecc-eEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcc--cccc--ccccccccchhhH Q lcl|Aclame:pro 75 TEDSFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGA--PYEA--AGAVHEVAPDEFF 149 (392) Q Consensus 75 ~~~~~~~~i~~~~~~~-~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~--~~~~--~~~~~~~~~~~~~ 149 (392) .-..+++.. +++.. +.|+.+-+..+..++...+.++.+++++...++.++..-.+. +... ......++....+ T Consensus 202 ~f~~i~~~~--~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~ 279 (402) T protein:vir:93 202 KGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMY 279 (402) T ss_pred ccceeeecc--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchH Confidence 444455544 44433 567877777778899888999999999987766665322211 1110 0111223344568 Q ss_pred HHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEecceeecccceeec Q lcl|Aclame:pro 150 KGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHP 229 (392) Q Consensus 150 ~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~ 229 (392) ++|+++...|+........ +++++..+..++.- + + +. | ..+..|.-..+.|.+|+.+...+..- + T Consensus 280 d~l~~~~~~l~~~y~~na~-~imn~~t~~~~~~~--~-~-d~-~----~~~~~~~~~~llG~PV~~t~~~~~i~---~-- 344 (402) T protein:vir:93 280 DAIINALADLHEDYRDNAT-IYMRYADYVKIISV--L-S-NG-T----TNFFDTPAEKVFGKPVVFTDAAVKPI---V-- 344 (402) T ss_pred HHHHHHHhccChhhhcCCE-EEEechHHHHHHHH--H-h-cC-C----CcccccCCccccccceEEecCCCcee---e-- Confidence 8899887777665444344 46666655544321 0 0 11 1 12234555678899998876543210 0 Q ss_pred ccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeec Q lcl|Aclame:pro 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVA 309 (392) Q Consensus 230 ~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~ 309 (392) ..+..... ... +..... ..+...............+.+... .+.++...... . T Consensus 345 GDf~~~~~--------------~~~-~~~~~~--~~~~~~~~~~~~~~~r~Dg~v~~~-------~A~~~l~ik~~---~ 397 (402) T protein:vir:93 345 GDFNYFGI--------------NYD-GTTYDT--DKDVKKGEYLFVLTAWYDQQRTLD-------SAFRIAKAKEN---T 397 (402) T ss_pred echhhhhh--------------hhh-hhhhhh--hhcccCCceEEEEEEEeCcEEech-------hheEEEEeecC---C Confidence 00000000 000 000000 000000000000000001111100 00000000000 0 Q ss_pred ccccc Q lcl|Aclame:pro 310 PEAGA 314 (392) Q Consensus 310 ~~~~~ 314 (392) ...++ T Consensus 398 ~~~~~ 402 (402) T protein:vir:93 398 GPLPS 402 (402) T ss_pred CCCCC Confidence 00111 No 171 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=94.10 E-value=0.0054 Score=33.02 Aligned_cols=228 Identities=11% Similarity=0.005 Sum_probs=114.9 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccc--------eeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTN--------LVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~--------~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) +.|+- .-.+|+..+-..-.++.-+.. -|.|- .|+....||+|++.........-.. +...-....+ T Consensus 22 ~~~~~-~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~--~dL~K~~GD~Vtf~L~~~L~g~gv~---Gd~~lEGnee 95 (318) T protein:vir:27 22 NRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTM---GDERVEGRGE 95 (318) T ss_pred hcCCh-HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEe--ccCCCCCccEEEEeEeeccccCccc---cCceeecccc Confidence 33332 123566543333333322222 23333 4565567999999886554322111 1111112223 Q ss_pred cccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------------- Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY----------------- 134 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~----------------- 134 (392) .+.-.+.+|.||+..+ ++...+ .+..-...||+....+.+..-+++..|+-.+-.+.++.. T Consensus 96 ~L~~~~d~l~IDq~r~-~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~ 174 (318) T protein:vir:27 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (318) T ss_pred ceEEEeeEEEEeeecc-ccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccc Confidence 4555778888987754 544333 233335678888878888888888888877655432221 Q ss_pred ------cccccc---------c----cc--cchhhHHHHHHHHHHhhhccC---C---CC---------CEEEEchHHHH Q lcl|Aclame:pro 135 ------EAAGAV---------H----EV--APDEFFKGVNGARRALNELYI---P---QG---------RVLVVGTAVTE 178 (392) Q Consensus 135 ------~~~~~~---------~----~~--~~~~~~~~i~~a~~~l~~~~v---p---~~---------r~~vv~~~~~~ 178 (392) ....++ . .+ +....++-|-.++..+++..- | .+ ++++++|.++. T Consensus 175 ~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~ 254 (318) T protein:vir:27 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (318) T ss_pred hhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCCcceEEEEechHHHH Confidence 000000 0 00 111234445456666655222 2 12 46789999999 Q ss_pred Hhhcccc---eeee----eccccceeeeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhh Q lcl|Aclame:pro 179 QILNDDR---FIKY----ESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAP 239 (392) Q Consensus 179 ~l~~~~~---~~~~----~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~ 239 (392) .|..|.. |... ...+......+..|..|.+.|+-+.+...+|.-= +.++.. ...+.. T Consensus 255 ~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf---~~G~~v-~~~~~~ 318 (318) T protein:vir:27 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF---YQGQRF-WYQRIT 318 (318) T ss_pred HHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEE---cCCCee-eeeecC Confidence 9988753 4332 2222222356889999999998887776655311 000000 000000 No 172 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=93.65 E-value=0.0068 Score=32.46 Aligned_cols=256 Identities=12% Similarity=0.048 Sum_probs=102.4 Q ss_pred Cc-----c-ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA-----N-AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma-----n-~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. + -.+.|+.+..++++.|++.-.+-+++++. . .. |+ ++||+... ..+.+.. + ..... .. T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~--~-~~---~~-~~~~~~~~~~~a~w~~---e--~~~~~-~~ 145 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK--N-TS---LR-LKALTAETSGTAVWGD---I--FGEIK-GQ 145 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE--e-cC---cc-eEEEEecCCcceeEee---c--ccccC-cc Confidence 22 2 34789999999999999988887777542 2 12 33 57765332 2222221 1 11111 11 Q ss_pred ccCceEEEEEEeeeecce-EeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc--------c---c-- Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLG-VLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA--------A---G-- 138 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~-~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~--------~---~-- 138 (392) ...+-..++|..++..++ .|+.+-+.++..++...+.+..+++++..+|..++.-- ...|.+. . . T Consensus 146 ~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~ 225 (377) T protein:vir:98 146 LKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGR 225 (377) T ss_pred cCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccc Confidence 222334556666665444 57776677778899888999999999999998876310 0111100 0 0 Q ss_pred ccccccch-hhH---------------HHHHHH--HHHhhhccCCCCCE-EEEchHHHHHhhcccceeeeeccccceeee Q lcl|Aclame:pro 139 AVHEVAPD-EFF---------------KGVNGA--RRALNELYIPQGRV-LVVGTAVTEQILNDDRFIKYESQGQSAVSA 199 (392) Q Consensus 139 ~~~~~~~~-~~~---------------~~i~~a--~~~l~~~~vp~~r~-~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a 199 (392) ...+..++ ..+ .++... ...+.+.+-.+||+ ++++|..+..+. +...... T Consensus 226 ~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~--p~~~~~~--------- 294 (377) T protein:vir:98 226 DITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALE--AQFTSRN--------- 294 (377) T ss_pred ccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhcc--ccccccC--------- Confidence 00000000 000 011110 11123333456664 456776554432 1111111 Q ss_pred EeeeeeeeEeee--EEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccc Q lcl|Aclame:pro 200 LQEARLGRIYGY--EIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLID 277 (392) Q Consensus 200 ~~~g~ig~~~g~--~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (392) .+|....+.|+ .+..+..+|.+.......+......+............... ........ T Consensus 295 -~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~--d~~~f~~~--------------- 356 (377) T protein:vir:98 295 -QFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME--DLQLYLTK--------------- 356 (377) T ss_pred -CCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhc--CceEEEEE--------------- Confidence 12333344443 34556666654433222222111111100000000000000 00000000 Q ss_pred cceeeeEEEeeccccceeeeeccceeeeeeeccc Q lcl|Aclame:pro 278 TYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 311 (392) ...++.+..... ...+.++.. T Consensus 357 ~r~dg~~~~~~a-------------~~vl~i~~~ 377 (377) T protein:vir:98 357 NYFYGKAKDNHT-------------AALLTLAGG 377 (377) T ss_pred EEEcCEEeccCc-------------EEEEEEecC Confidence 000000000000 000000000 No 173 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=93.28 E-value=0.0081 Score=32.05 Aligned_cols=269 Identities=10% Similarity=0.028 Sum_probs=110.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. --.+.|+.+.+++++.|++.-.+.+++++. . .. |. .+|++... ..+.+.. + ...+. .. T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~--~-~~---~~-~~i~~~~~~~~a~w~~---e--~~~~~-~~ 142 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N-AG---LR-LKFLKSETSGVAVWGK---I--YGEIK-GQ 142 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE--e-cC---cc-eEEEEecCCcceeeec---c--ccccc-cc Confidence 11 135789999999999999999888877643 2 22 33 56665432 2222321 1 11111 11 Q ss_pred ccCceEEEEEEeeeecce-EeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc----------cc--- Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLG-VLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA----------AG--- 138 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~-~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~----------~~--- 138 (392) ...+-.++++..++...+ .|+.+-+.++..++...+.+..+++++..+|..++.-- ...|.+. .. T Consensus 143 ~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:10 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred ccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 112223455555555444 57777677778899888999999999999998775310 0111100 00 Q ss_pred ----cc---ccccchhhHHHHHHHHHHhhhc-----cCCC-CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeee Q lcl|Aclame:pro 139 ----AV---HEVAPDEFFKGVNGARRALNEL-----YIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARL 205 (392) Q Consensus 139 ----~~---~~~~~~~~~~~i~~a~~~l~~~-----~vp~-~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~i 205 (392) .. ........++.+.+....|... ..+. +-+++++|..+..+.....+. +..| .. T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~--~~~G----------~~ 290 (381) T protein:vir:10 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--NANG----------VY 290 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC--CCCC----------ce Confidence 00 0011222345555554444322 1223 346788998877765432211 1112 11 Q ss_pred eeE--eeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeee Q lcl|Aclame:pro 206 GRI--YGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLK 283 (392) Q Consensus 206 g~~--~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (392) -.. +|..++.++.+|.+.......+.+....+... ........-..........-....+. T Consensus 291 v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~-----------------~i~~~~~~~~~~d~~~f~a~~r~dg~ 353 (381) T protein:vir:10 291 VTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGI-----------------NVQKFKETLALDDMDLYTAKQFAYGK 353 (381) T ss_pred eecCCCCceEEecCCCCcCcEEEEecccEEEEEeccc-----------------EEEeechhHhhcCCeEEEEEEEEcCE Confidence 111 34456677777654432222211111111100 00000000000000000000000000 Q ss_pred EEEeeccccceeeeeccceeeeeeecccccccceeeeeeccCeeE Q lcl|Aclame:pro 284 VVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTV 328 (392) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~ 328 (392) .... ....+. .+.+....... .....|+ T Consensus 354 ~~~~---------~A~~v~--~l~~~~~~~~~------~~~~~~~ 381 (381) T protein:vir:10 354 AKDN---------KVAAVW--KLDLKGHKPAL------EGTEETL 381 (381) T ss_pred EecC---------ceEEEE--EEEecCCCcCc------ccccccC Confidence 0000 000000 00110000000 0001111 No 174 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=93.28 E-value=0.0081 Score=32.05 Aligned_cols=269 Identities=10% Similarity=0.028 Sum_probs=110.3 Q ss_pred Cc------cccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc-eeeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MA------NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Ma------n~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (392) |. --.+.|+.+.+++++.|++.-.+.+++++. . .. |. .+|++... ..+.+.. + ...+. .. T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~--~-~~---~~-~~i~~~~~~~~a~w~~---e--~~~~~-~~ 142 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N-AG---LR-LKFLKSETSGVAVWGK---I--YGEIK-GQ 142 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE--e-cC---cc-eEEEEecCCcceeeec---c--ccccc-cc Confidence 11 135789999999999999999888877643 2 22 33 56665432 2222321 1 11111 11 Q ss_pred ccCceEEEEEEeeeecce-EeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc----------cc--- Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLG-VLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA----------AG--- 138 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~-~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~----------~~--- 138 (392) ...+-.++++..++...+ .|+.+-+.++..++...+.+..+++++..+|..++.-- ...|.+. .. T Consensus 143 ~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~ 222 (381) T protein:vir:95 143 LDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAY 222 (381) T ss_pred ccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccc Confidence 112223455555555444 57777677778899888999999999999998775310 0111100 00 Q ss_pred ----cc---ccccchhhHHHHHHHHHHhhhc-----cCCC-CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeee Q lcl|Aclame:pro 139 ----AV---HEVAPDEFFKGVNGARRALNEL-----YIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARL 205 (392) Q Consensus 139 ----~~---~~~~~~~~~~~i~~a~~~l~~~-----~vp~-~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~i 205 (392) .. ........++.+.+....|... ..+. +-+++++|..+..+.....+. +..| .. T Consensus 223 ~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~--~~~G----------~~ 290 (381) T protein:vir:95 223 PEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--NANG----------VY 290 (381) T ss_pred cccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC--CCCC----------ce Confidence 00 0011222345555554444322 1223 346788998877765432211 1112 11 Q ss_pred eeE--eeeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeee Q lcl|Aclame:pro 206 GRI--YGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLK 283 (392) Q Consensus 206 g~~--~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (392) -.. +|..++.++.+|.+.......+.+....+... ........-..........-....+. T Consensus 291 v~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~-----------------~i~~~~~~~~~~d~~~f~a~~r~dg~ 353 (381) T protein:vir:95 291 VTALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGI-----------------NVQKFKETLALDDMDLYTAKQFAYGK 353 (381) T ss_pred eecCCCCceEEecCCCCcCcEEEEecccEEEEEeccc-----------------EEEeechhHhhcCCeEEEEEEEEcCE Confidence 111 34456677777654432222211111111100 00000000000000000000000000 Q ss_pred EEEeeccccceeeeeccceeeeeeecccccccceeeeeeccCeeE Q lcl|Aclame:pro 284 VVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTV 328 (392) Q Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~ 328 (392) .... ....+. .+.+....... .....|+ T Consensus 354 ~~~~---------~A~~v~--~l~~~~~~~~~------~~~~~~~ 381 (381) T protein:vir:95 354 AKDN---------KVAAVW--KLDLKGHKPAL------EGTEETL 381 (381) T ss_pred EecC---------ceEEEE--EEEecCCCcCc------ccccccC Confidence 0000 000000 00110000000 0001111 No 175 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=93.11 E-value=0.0087 Score=31.87 Aligned_cols=258 Identities=9% Similarity=0.022 Sum_probs=105.4 Q ss_pred CccccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccccccCceEE Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) +....+.|+.+...+.+ +++...+..++.. +.. .+....+|.+..... ......++... ....++.-..++ T Consensus 138 ~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~-----~~~-~~~~~~~~~~~~~~~-~~~~~~E~~~~-~~~~~~~~~~i~ 208 (397) T protein:vir:96 138 VEGGALIPQELLQPQLE-PKDIVDLSKYVRS-----VPV-NSASGKFPVISKSGS-KMATVQQLEKN-PQLANPKMVEID 208 (397) T ss_pred cccccchhHHHHHHHHH-hhhhhhHHHhhhh-----ccc-cccceeEEEEeccCC-ccccccccccc-ccccccccccee Confidence 44455778888877766 3444334333321 111 122345554432111 11111111111 111233345555 Q ss_pred EEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHHHHHHHHHhh Q lcl|Aclame:pro 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) Q Consensus 81 ~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~l~ 160 (392) +.+.+. +.-+.++.+-+.++..++...+.+..+++++..++..++.-.... .......++++.++..... T Consensus 209 ~~~~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~---------~~~~~~~~d~~~~~~~~~~ 278 (397) T protein:vir:96 209 YSVATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA---------TAKSVVGVDGLKDLINKEI 278 (397) T ss_pred ecHhHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------ccccccchHHHHHHHHHhh Confidence 555332 344567777666677788888888889999999998877533211 1122234667766543322 Q ss_pred hccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecceeecccceeecccccccchhh Q lcl|Aclame:pro 161 ELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAP 239 (392) Q Consensus 161 ~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~ 239 (392) .. . .+-.++++|..+..|.+-. +..|.-.. ..+..|..+.+.|++|+.++........... .+. T Consensus 279 ~~-~-~~a~~v~n~~~~~~l~~lk-----d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~--~~~------ 343 (397) T protein:vir:96 279 KK-V-YDVKLFISASMYSELDKLK-----DKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNV--VGF------ 343 (397) T ss_pred hh-h-cCcEEEEcHHHHHHHHHhh-----ccCCCeEeccCccCCCcccccccceEEecccccCCCCCce--EEE------ Confidence 11 1 2346899999998886521 11222111 1233455568999999876543322110000 000 Q ss_pred hccccccccceeecc-cceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeeeccccccccee Q lcl|Aclame:pro 240 APPMGAVRSTAISGD-QRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATI 318 (392) Q Consensus 240 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~ 318 (392) .+.......... .+....+.. ..... . .+.+.. ...........+ ....+ T Consensus 344 ---~gd~~~~~~~~~~~~~~~~~~~-~~~~~--~-----~~~~~~----r~d~~~~~~~a~--------------~~~~~ 394 (397) T protein:vir:96 344 ---IGDAKAFASFFDRKQVSVSWVD-NNIYG--Q-----LLAGII----RYDVKATDKKAG--------------FYVTF 394 (397) T ss_pred ---EeehhcceEeEeecceEEEEec-ccccc--e-----eEEEEE----EEccEEecccce--------------EEEEe Confidence 000000000000 011111000 00000 0 000000 000000000000 00000 Q ss_pred eee Q lcl|Aclame:pro 319 TAA 321 (392) Q Consensus 319 ~~~ 321 (392) +.. T Consensus 395 ~~a 397 (397) T protein:vir:96 395 TIG 397 (397) T ss_pred ecC Confidence 000 No 176 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=295 Identities=13% Similarity=0.055 Sum_probs=110.9 Q ss_pred Cccc------cccHHHHHHHHHHHHHHhhc-ccc-------eeeecccccccCCCCCeEEEEecc-ceeeeccccccccC Q lcl|Aclame:pro 1 MANA------FSKPTAVVDTAIQMLQNELI-LTN-------LVWLNGIGDFAHKFNDTITVRVPA-PSRGHTRKLRGAGA 65 (392) Q Consensus 1 Man~------~~~~~~~~~~~~~~l~~~l~-~~~-------~v~~~~~~~~~~~~Gdtv~i~~~~-~~~~~~~~~~~~~~ 65 (392) ||-+ +|++.+... .++.+++.+. |-. |.+.-++ ||=+..+... ...+.+.+. .. T Consensus 1 ~~~t~~sdl~vfn~~~~~a-~~e~~~~~~~~Fnaas~Gai~l~~~~~~-------GDf~~~~ff~i~~~~~~rnv---~~ 69 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTA-YLERNMDNLAVFNENSRAAIGLNSELIE-------GDLKLRSFYKVGGAIADRDV---NS 69 (315) T ss_pred Cceeeecceeeehhhhhhh-HHhhhHHHHHHhhhhcCCcccccccccc-------cccccccccccccchhhccc---CC Confidence 8853 367777765 5677776654 311 2222233 3333322221 001111111 12 Q ss_pred CCccccccccCc-eEEEEEEeeeecceEeeHHHHhhhccChH---HHHHHHHHHHHHHHHHHHHHHHHhccccc-ccccc Q lcl|Aclame:pro 66 ERNLTVSDFTED-SFPVTLTDVAYHLGVLTDEELTFDLESFA---TQILPRQVRGVADILEEGVRDMIVGAPYE-AAGAV 140 (392) Q Consensus 66 ~~~~~~~~~~~~-~~~~~i~~~~~~~~~i~d~~~~~~~~~~~---~~~~~~~~~ala~~vd~~~~~~~~~~~~~-~~~~~ 140 (392) +.++....+... .+-+++ .+.+-++.++..++...-.|++ ..+.++...++.+.+-...+..+..+-.. ..... T Consensus 70 ~~~~t~~kit~~~dvaVk~-~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~ 148 (315) T protein:vir:96 70 TATVAGTKIAADEMVSVKV-PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNV 148 (315) T ss_pred CccccceecccccceeEEE-eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccc Confidence 333444444332 233333 4456677788777765444443 22333333333333333333222221111 11111 Q ss_pred ccccchhhHHHHHHHHHHhhhccCCCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEEEEeccee Q lcl|Aclame:pro 141 HEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP 220 (392) Q Consensus 141 ~~~~~~~~~~~i~~a~~~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v~~s~~v~ 220 (392) .+.......+.+.+|.++|.++.- .=-.+++++..+..|.+ ..+....+. ......+.+..+.. |..|.++..+| T Consensus 149 ~~~~a~~~~~~l~dA~~klGD~~~-~l~~~vMHS~v~~~L~~-q~L~~~~~~--~~~~~~~~~~~~~l-GkrViVdD~~P 223 (315) T protein:vir:96 149 SGELATEGKKVLTKGLRTMGDKAS-SIAIWVMDSTSYFDIVD-EAIDNKLYE--EAGVVVYGGTPGTL-GKPVLVTDQCP 223 (315) T ss_pred cccccccCHHHHHHHHHHhccccc-CeeEEEEchHHHHHHHH-hhhhhhccc--ccceeEecCcCccc-ccEEEEECCCC Confidence 112223457788999998854422 11246889999999988 333332221 11112333334434 88899999999 Q ss_pred ecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeecc Q lcl|Aclame:pro 221 HGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIH 300 (392) Q Consensus 221 ~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (392) ....+.+...++.+......... .....+......+.-..+. ...+. .|. ....... .+.+. T Consensus 224 ~~~~~gl~~GAi~~~~~~~~~~~----~~~~~g~e~l~~~~r~e~t-------f~l~p-~G~-sw~~~~~---~sPt~-- 285 (315) T protein:vir:96 224 ATKIFGLVAGAVMITESQAPGMR----SYQIDDQENLAIGFRAEGT-------ANVEV-LGY-KWKTKTN---VNPAS-- 285 (315) T ss_pred cceeeeeecceeeecCCCccccc----cccCCCcceeEEEEeeeeE-------eeeee-eeE-EeecCCC---cCCCh-- Confidence 76655554555443332211000 0000011111111000000 00000 000 0000000 00000 Q ss_pred ceeeeeeecccccccceeeeeeccCeeEEEEEee Q lcl|Aclame:pro 301 LIPGSIEVAPEAGANATITAAAGEDHTVQLKVTD 334 (392) Q Consensus 301 ~~~~~v~v~~~~~~~~~~~~~~~~~~t~~~t~~~ 334 (392) ..+ -++..=...........+.-++|+-+| T Consensus 286 ---aeL-at~~NWekV~~~~K~tagv~~~~~~~~ 315 (315) T protein:vir:96 286 ---ATL-ATTTNWEKYATDDKATAGFIITLTTTP 315 (315) T ss_pred ---HHh-cCCcCcccccCCCcccceEEEEecCCC Confidence 000 000000000000011111112222122 No 177 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=89.56 E-value=0.026 Score=29.30 Aligned_cols=268 Identities=9% Similarity=0.014 Sum_probs=106.3 Q ss_pred Ccc------ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccc--eeeeccccccccCCCccc-c Q lcl|Aclame:pro 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLT-V 71 (392) Q Consensus 1 Man------~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~--~~~~~~~~~~~~~~~~~~-~ 71 (392) |.. -++.|+.+...+ ..+++...+..+++.- ... ....++|++.. ...... .+ +.... . T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i-~~~~~~~~l~~~~~~~-----~~~-~~~~~~~~~~~~~~~~~~~---~e--~~~~~e~ 223 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPE-KEVHQFPRLGSLVRTE-----SVT-TTTGKLPIFNNSTDLLTAH---TE--YGQTTKN 223 (437) T ss_pred hhhcccccccccchHHHHHHH-HHhhhhhhhhhcceeE-----eec-cCceeeEEeeccccccccc---cc--ccccccc Confidence 111 236788887654 4455544444444321 111 12355554422 112211 11 11111 1 Q ss_pred ccccCceEEEEEEeeeecceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccchhhHHH Q lcl|Aclame:pro 72 SDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKG 151 (392) Q Consensus 72 ~~~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 151 (392) .+..-..+++...+. +.-+.|+.+-+.++..++...+.+..+++|+..+|..++.-.... .........+++ T Consensus 224 ~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~-------~~~~~~~~~~~~ 295 (437) T protein:vir:10 224 ATPVITPILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDG-------IKKTTSTYLLGD 295 (437) T ss_pred ccccceeeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-------ccccccccchhh Confidence 222223444443232 344578887777778889888999999999999999887643221 111222233455 Q ss_pred HHHHHH-HhhhccCCCCCEEEEchHHHHHhhcccceeeeecccccee-eeEeeeeeeeEeeeEEEEecce--eeccccee Q lcl|Aclame:pro 152 VNGARR-ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAV-SALQEARLGRIYGYEIVESTLI--PHGDAYLY 227 (392) Q Consensus 152 i~~a~~-~l~~~~vp~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~-~a~~~g~ig~~~g~~v~~s~~v--~~~~~~~~ 227 (392) +.++.. .|+.... .+-.++++|..+..|.+-. +..|.-.. ..+..|..+.+.|++|+.+... |....... T Consensus 296 ~~~~~~~~l~~~~~-~~~~~~~~~~~~~~l~~lk-----d~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~ 369 (437) T protein:vir:10 296 LKKVLNVTLKPQDS-AAASIVMSQSAYNLFDMAT-----DAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDV 369 (437) T ss_pred HHHHHHhhhhhhhh-cCCEEEEcHHHHHHHHHhh-----ccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCce Confidence 555432 3443322 2346799999988875421 11221110 1233455668999999886543 32221100 Q ss_pred ecccccccchhhhccccccccceeec-ccceeeeeeeccccceeeeecccccceeeeEEEeeccccceeeeeccceeeee Q lcl|Aclame:pro 228 HPTAFIMATRAPAPPMGAVRSTAISG-DQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSI 306 (392) Q Consensus 228 ~~~a~~~a~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 306 (392) . + ..+......... ..+....+...++..... ..+..-..+.+....... .++.....+ T Consensus 370 --~-~---------~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~--~~~~~r~d~~~~~~~a~~------~l~~~~~~~ 429 (437) T protein:vir:10 370 --N-I---------VVAPLKKAVINFKLTEITGQFQDTYDIWYKQ--LGIFLRQNVVQASKDLIV------NLTGKLKAV 429 (437) T ss_pred --E-E---------EEeeccccEEEEeeeceEEEEecccccccce--eeEEEEEccEEecccceE------EEEeecccc Confidence 0 0 000000000000 001111111111110000 000000111111100000 000000000 Q ss_pred eecccccc Q lcl|Aclame:pro 307 EVAPEAGA 314 (392) Q Consensus 307 ~v~~~~~~ 314 (392) ++...... T Consensus 430 ~~~~~~~~ 437 (437) T protein:vir:10 430 TVVQSTAV 437 (437) T ss_pred ccCCCCCC Confidence 00000000 No 178 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=89.11 E-value=0.028 Score=29.08 Aligned_cols=304 Identities=13% Similarity=0.110 Sum_probs=123.1 Q ss_pred Ccc--------ccccHHHHHHHHHHHH-HHhhcccc------------------------eeeecccccccCCCCCeEEE Q lcl|Aclame:pro 1 MAN--------AFSKPTAVVDTAIQML-QNELILTN------------------------LVWLNGIGDFAHKFNDTITV 47 (392) Q Consensus 1 Man--------~~~~~~~~~~~~~~~l-~~~l~~~~------------------------~v~~~~~~~~~~~~Gdtv~i 47 (392) |.- +-....+|+..+-..- +++.-+.. -|.|- .|+....||+|++ T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~--~dL~K~~GD~Vtf 78 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQA--QDLGRNKGDEVRF 78 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEe--ccCCCCCccEEEE Confidence 542 1223457775442222 21111111 14443 4565567999999 Q ss_pred Eecccee----eeccccccccCCCccccccccCceEEEEEEeeeecceEeeHH-HHhhhccChHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 48 RVPAPSR----GHTRKLRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDE-ELTFDLESFATQILPRQVRGVADILE 122 (392) Q Consensus 48 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~d~-~~~~~~~~~~~~~~~~~~~ala~~vd 122 (392) .....+. ..+... ++.+ +.+.-.+..|+||+..+ ++.+.+. ...-...||+.+..+.+..=+++..| T Consensus 79 ~L~~~L~g~gv~Gd~~l--EGne-----e~L~~~~d~l~IDq~R~-~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~D 150 (430) T protein:vir:10 79 HFVQPANAFPIMGSEYA--EGKG-----TGLKIGSDQLRVNQARF-PVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLD 150 (430) T ss_pred eEeeccccCceecCcee--eccc-----cceEEEeeEEEEeeecc-ccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHH Confidence 8865542 222211 1222 34445778899988764 5655532 23335667777777777776777777 Q ss_pred HHHHHHHhcc-----------------------------ccc------ccccc-----------ccccchhhHHHHHHHH Q lcl|Aclame:pro 123 EGVRDMIVGA-----------------------------PYE------AAGAV-----------HEVAPDEFFKGVNGAR 156 (392) Q Consensus 123 ~~~~~~~~~~-----------------------------~~~------~~~~~-----------~~~~~~~~~~~i~~a~ 156 (392) +-.+-.+.++ |.. .+..+ .+.+....++.|-.++ T Consensus 151 q~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~ 230 (430) T protein:vir:10 151 QSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIA 230 (430) T ss_pred HHHHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHH Confidence 6655433221 000 00000 0111123466666777 Q ss_pred HHhhhccCC-C-------C-------CEEEEchHHHHHhhcccceee-----eeccccceeeeEeeeeeeeEeeeEEEEe Q lcl|Aclame:pro 157 RALNELYIP-Q-------G-------RVLVVGTAVTEQILNDDRFIK-----YESQGQSAVSALQEARLGRIYGYEIVES 216 (392) Q Consensus 157 ~~l~~~~vp-~-------~-------r~~vv~~~~~~~l~~~~~~~~-----~~~~G~~~~~a~~~g~ig~~~g~~v~~s 216 (392) ..++....| . . ++++++|.++..|..|+.|.. ....+......+..|..|.+.|+-+++. T Consensus 231 ~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~ 310 (430) T protein:vir:10 231 TYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKM 310 (430) T ss_pred HHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecC Confidence 777665432 1 1 567899999999999988642 1122222246788999999999877764 Q ss_pred cce-e--ecccceeec----ccc-------cccchhh---hcccccc---ccceeecccceeeeeee-ccccceeeeecc Q lcl|Aclame:pro 217 TLI-P--HGDAYLYHP----TAF-------IMATRAP---APPMGAV---RSTAISGDQRIAMRWLV-DYDSTITSNRSL 275 (392) Q Consensus 217 ~~v-~--~~~~~~~~~----~a~-------~~a~~~~---~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 275 (392) ..+ . .+....+.. .+. ..+.... ....+.. .........+....|.- ..|.... .... T Consensus 311 ~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~-~~i~ 389 (430) T protein:vir:10 311 PKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDK-LELL 389 (430) T ss_pred CceeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCch-hhhh Confidence 322 0 000000000 000 0000000 0000000 00000000111111221 1111111 1112 Q ss_pred cccceeeeEEEeeccccc-eeeeeccceeeeeeeccccccc Q lcl|Aclame:pro 276 IDTYFGLKVVEDPNGVGF-VRARKIHLIPGSIEVAPEAGAN 315 (392) Q Consensus 276 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~v~~~~~~~ 315 (392) ++...|.....-....+. .......+......+.-..+-. T Consensus 390 ~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa~~~~~~~ 430 (430) T protein:vir:10 390 IGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAVKIIGPRK 430 (430) T ss_pred hhHHhccceeeecCCCCCCceeeeeEEEEhhhhhhhhcCCC Confidence 222222211111100000 0000000000000000000000 No 179 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=88.42 E-value=0.032 Score=28.75 Aligned_cols=300 Identities=12% Similarity=0.011 Sum_probs=121.8 Q ss_pred CccccccHHHHHHHHHHHHHHhhcc--------cceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELIL--------TNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~--------~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) +.|+-+ -.+|+..+...-..+.-+ ..-|.|- .|+....||+|++.....+.-.... +...-....+ T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~---Gd~~lEGnee 95 (404) T protein:vir:10 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTM---GDERVEGRGE 95 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCcc---cCceeecccc Confidence 444332 233433221111111111 1123332 4555567999999887655422211 1111122334 Q ss_pred cccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------- 135 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~---------------- 135 (392) .++-.+.+|.||+..+ ++.... ....-...||+++..+.+..-+++..|+.++-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:10 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 5566788899988754 444433 2333467788888888888888889998887555433210 Q ss_pred -------ccccc-------cc--------ccchhhHHHHHHHHHHhhhccCC-C-----C---------CEEEEchHHHH Q lcl|Aclame:pro 136 -------AAGAV-------HE--------VAPDEFFKGVNGARRALNELYIP-Q-----G---------RVLVVGTAVTE 178 (392) Q Consensus 136 -------~~~~~-------~~--------~~~~~~~~~i~~a~~~l~~~~vp-~-----~---------r~~vv~~~~~~ 178 (392) ...++ .+ .+....++-|-.++..+++..-| . + ++++++|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:10 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 00000 00 01112344455666666553333 1 2 56789999999 Q ss_pred Hhhcccc---eeeeec---cc-cceeeeEeeeeeeeEeeeEEEEecceeecc--cce--eecccc-----cccchhh--- Q lcl|Aclame:pro 179 QILNDDR---FIKYES---QG-QSAVSALQEARLGRIYGYEIVESTLIPHGD--AYL--YHPTAF-----IMATRAP--- 239 (392) Q Consensus 179 ~l~~~~~---~~~~~~---~G-~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~--~~~--~~~~a~-----~~a~~~~--- 239 (392) .|..|+. |....+ .+ ......+..|..|.+.|+-+.+....|..- ... ...+.. ..+.... T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:10 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999863 333222 11 112367888999999998887655443211 000 000000 0000000 Q ss_pred hccccccccceeec-ccceeeeeeec-cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 240 APPMGAVRSTAISG-DQRIAMRWLVD-YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 240 ~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) ....+........+ ..+....|... .|.... -...++...|.....=....+...--.+.+.+.-+.+ T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 00000000000000 00000111100 011000 0111111111111100000000000000000000000 No 180 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=88.42 E-value=0.032 Score=28.75 Aligned_cols=300 Identities=12% Similarity=0.011 Sum_probs=121.8 Q ss_pred CccccccHHHHHHHHHHHHHHhhcc--------cceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELIL--------TNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~--------~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) +.|+-+ -.+|+..+...-..+.-+ ..-|.|- .|+....||+|++.....+.-.... +...-....+ T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~---Gd~~lEGnee 95 (404) T protein:vir:10 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTM---GDERVEGRGE 95 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCcc---cCceeecccc Confidence 444332 233433221111111111 1123332 4555567999999887655422211 1111122334 Q ss_pred cccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------- 135 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~---------------- 135 (392) .++-.+.+|.||+..+ ++.... ....-...||+++..+.+..-+++..|+.++-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:10 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 5566788899988754 444433 2333467788888888888888889998887555433210 Q ss_pred -------ccccc-------cc--------ccchhhHHHHHHHHHHhhhccCC-C-----C---------CEEEEchHHHH Q lcl|Aclame:pro 136 -------AAGAV-------HE--------VAPDEFFKGVNGARRALNELYIP-Q-----G---------RVLVVGTAVTE 178 (392) Q Consensus 136 -------~~~~~-------~~--------~~~~~~~~~i~~a~~~l~~~~vp-~-----~---------r~~vv~~~~~~ 178 (392) ...++ .+ .+....++-|-.++..+++..-| . + ++++++|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:10 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 00000 00 01112344455666666553333 1 2 56789999999 Q ss_pred Hhhcccc---eeeeec---cc-cceeeeEeeeeeeeEeeeEEEEecceeecc--cce--eecccc-----cccchhh--- Q lcl|Aclame:pro 179 QILNDDR---FIKYES---QG-QSAVSALQEARLGRIYGYEIVESTLIPHGD--AYL--YHPTAF-----IMATRAP--- 239 (392) Q Consensus 179 ~l~~~~~---~~~~~~---~G-~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~--~~~--~~~~a~-----~~a~~~~--- 239 (392) .|..|+. |....+ .+ ......+..|..|.+.|+-+.+....|..- ... ...+.. ..+.... T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:10 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999863 333222 11 112367888999999998887655443211 000 000000 0000000 Q ss_pred hccccccccceeec-ccceeeeeeec-cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 240 APPMGAVRSTAISG-DQRIAMRWLVD-YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 240 ~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) ....+........+ ..+....|... .|.... -...++...|.....=....+...--.+.+.+.-+.+ T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:10 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 00000000000000 00000111100 011000 0111111111111100000000000000000000000 No 181 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=88.42 E-value=0.032 Score=28.75 Aligned_cols=300 Identities=12% Similarity=0.011 Sum_probs=121.8 Q ss_pred CccccccHHHHHHHHHHHHHHhhcc--------cceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELIL--------TNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~--------~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) +.|+-+ -.+|+..+...-..+.-+ ..-|.|- .|+....||+|++.....+.-.... +...-....+ T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~---Gd~~lEGnee 95 (404) T protein:vir:81 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTM---GDERVEGRGE 95 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCcc---cCceeecccc Confidence 444332 233433221111111111 1123332 4555567999999887655422211 1111122334 Q ss_pred cccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------- 135 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~---------------- 135 (392) .++-.+.+|.||+..+ ++.... ....-...||+++..+.+..-+++..|+.++-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:81 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 5566788899988754 444433 2333467788888888888888889998887555433210 Q ss_pred -------ccccc-------cc--------ccchhhHHHHHHHHHHhhhccCC-C-----C---------CEEEEchHHHH Q lcl|Aclame:pro 136 -------AAGAV-------HE--------VAPDEFFKGVNGARRALNELYIP-Q-----G---------RVLVVGTAVTE 178 (392) Q Consensus 136 -------~~~~~-------~~--------~~~~~~~~~i~~a~~~l~~~~vp-~-----~---------r~~vv~~~~~~ 178 (392) ...++ .+ .+....++-|-.++..+++..-| . + ++++++|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:81 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 00000 00 01112344455666666553333 1 2 56789999999 Q ss_pred Hhhcccc---eeeeec---cc-cceeeeEeeeeeeeEeeeEEEEecceeecc--cce--eecccc-----cccchhh--- Q lcl|Aclame:pro 179 QILNDDR---FIKYES---QG-QSAVSALQEARLGRIYGYEIVESTLIPHGD--AYL--YHPTAF-----IMATRAP--- 239 (392) Q Consensus 179 ~l~~~~~---~~~~~~---~G-~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~--~~~--~~~~a~-----~~a~~~~--- 239 (392) .|..|+. |....+ .+ ......+..|..|.+.|+-+.+....|..- ... ...+.. ..+.... T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:81 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999863 333222 11 112367888999999998887655443211 000 000000 0000000 Q ss_pred hccccccccceeec-ccceeeeeeec-cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 240 APPMGAVRSTAISG-DQRIAMRWLVD-YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 240 ~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) ....+........+ ..+....|... .|.... -...++...|.....=....+...--.+.+.+.-+.+ T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:81 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 00000000000000 00000111100 011000 0111111111111100000000000000000000000 No 182 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=88.42 E-value=0.032 Score=28.75 Aligned_cols=300 Identities=12% Similarity=0.011 Sum_probs=121.8 Q ss_pred CccccccHHHHHHHHHHHHHHhhcc--------cceeeecccccccCCCCCeEEEEeccceeeeccccccccCCCccccc Q lcl|Aclame:pro 1 MANAFSKPTAVVDTAIQMLQNELIL--------TNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVS 72 (392) Q Consensus 1 Man~~~~~~~~~~~~~~~l~~~l~~--------~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (392) +.|+-+ -.+|+..+...-..+.-+ ..-|.|- .|+....||+|++.....+.-.... +...-....+ T Consensus 22 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~--~dL~K~aGd~vtf~L~~~L~g~gv~---Gd~~lEGnee 95 (404) T protein:vir:32 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRI--TDLNKQAGDEVTFSIMHKLSKRPTM---GDERVEGRGE 95 (404) T ss_pred hcCChh-HhhhhhhhhhhhhhccchhhccCCCCCccEEEe--ecCCCCCCcEEEEeEeeecccCCcc---cCceeecccc Confidence 444332 233433221111111111 1123332 4555567999999887655422211 1111122334 Q ss_pred cccCceEEEEEEeeeecceEeeH-HHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------------- Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAYHLGVLTD-EELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYE---------------- 135 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~~~~~i~d-~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~~~~~~~---------------- 135 (392) .++-.+.+|.||+..+ ++.... ....-...||+++..+.+..-+++..|+.++-.+.++... T Consensus 96 ~L~~~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~ 174 (404) T protein:vir:32 96 DLSHADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEF 174 (404) T ss_pred ceeEEeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccc Confidence 5566788899988754 444433 2333467788888888888888889998887555433210 Q ss_pred -------ccccc-------cc--------ccchhhHHHHHHHHHHhhhccCC-C-----C---------CEEEEchHHHH Q lcl|Aclame:pro 136 -------AAGAV-------HE--------VAPDEFFKGVNGARRALNELYIP-Q-----G---------RVLVVGTAVTE 178 (392) Q Consensus 136 -------~~~~~-------~~--------~~~~~~~~~i~~a~~~l~~~~vp-~-----~---------r~~vv~~~~~~ 178 (392) ...++ .+ .+....++-|-.++..+++..-| . + ++++++|.++. T Consensus 175 ~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~ 254 (404) T protein:vir:32 175 KKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWN 254 (404) T ss_pred cceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHH Confidence 00000 00 01112344455666666553333 1 2 56789999999 Q ss_pred Hhhcccc---eeeeec---cc-cceeeeEeeeeeeeEeeeEEEEecceeecc--cce--eecccc-----cccchhh--- Q lcl|Aclame:pro 179 QILNDDR---FIKYES---QG-QSAVSALQEARLGRIYGYEIVESTLIPHGD--AYL--YHPTAF-----IMATRAP--- 239 (392) Q Consensus 179 ~l~~~~~---~~~~~~---~G-~~~~~a~~~g~ig~~~g~~v~~s~~v~~~~--~~~--~~~~a~-----~~a~~~~--- 239 (392) .|..|+. |....+ .+ ......+..|..|.+.|+-+.+....|..- ... ...+.. ..+.... T Consensus 255 ~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~R 334 (404) T protein:vir:32 255 DWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDR 334 (404) T ss_pred HHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchh Confidence 9999863 333222 11 112367888999999998887655443211 000 000000 0000000 Q ss_pred hccccccccceeec-ccceeeeeeec-cccceeeeecccccceeeeEEEeeccccceeeeeccceeeeeee Q lcl|Aclame:pro 240 APPMGAVRSTAISG-DQRIAMRWLVD-YDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEV 308 (392) Q Consensus 240 ~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v 308 (392) ....+........+ ..+....|... .|.... -...++...|.....=....+...--.+.+.+.-+.+ T Consensus 335 allLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~~ 404 (404) T protein:vir:32 335 AMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVKL 404 (404) T ss_pred heeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceeeEEEEEecccccC Confidence 00000000000000 00000111100 011000 0111111111111100000000000000000000000 No 183 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=88.21 E-value=0.034 Score=28.65 Aligned_cols=263 Identities=13% Similarity=0.050 Sum_probs=100.7 Q ss_pred Cc---c---ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecccee-eeccccccccCCCccc-cc Q lcl|Aclame:pro 1 MA---N---AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTRKLRGAGAERNLT-VS 72 (392) Q Consensus 1 Ma---n---~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~-~~~~~~~~~~~~~~~~-~~ 72 (392) |. . -.+.|+.+.+++++.|++.-.+-+++++. . .. |+ .+||+..... +.+.. . ...+. -. T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~--~-~~---~~-~~i~~~~~~~~a~w~~---e--~~~~~~~~ 150 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMR--T-TG---LR-TKFLKSETSGVAVWGK---I--FGEIKGQL 150 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeE--e-cC---Cc-eEEEEEcCCcceEEee---c--cccccccc Confidence 22 1 24789999999999999998888877542 2 22 33 5777654322 22211 1 11111 11 Q ss_pred cccCceEEEEEEeeee-cceEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHHH-hcccccc-------ccccccc Q lcl|Aclame:pro 73 DFTEDSFPVTLTDVAY-HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMI-VGAPYEA-------AGAVHEV 143 (392) Q Consensus 73 ~~~~~~~~~~i~~~~~-~~~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~~-~~~~~~~-------~~~~~~~ 143 (392) +..=..+++. -++. .-+.++.+-+.++..++...+.+..+++++..+|+.++.-- ...|.+. .....+. T Consensus 151 ~~~f~~i~l~--~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~ 228 (383) T protein:vir:78 151 DATFSDEESI--QNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGV 228 (383) T ss_pred CcceeeEeec--ceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccc Confidence 2222444444 4444 34467777777778889888999999999999999876210 1111110 0000000 Q ss_pred ------cchhhHHHHHHHHHH---hhh------ccCC--C--CCEEEEchHHHHHhhcccceeeeeccccceeeeEeeee Q lcl|Aclame:pro 144 ------APDEFFKGVNGARRA---LNE------LYIP--Q--GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEAR 204 (392) Q Consensus 144 ------~~~~~~~~i~~a~~~---l~~------~~vp--~--~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ 204 (392) .....+.++...... +.+ ++.+ . .-.++++|..+..+.. .+...+. +|. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~--~~~~~~~----------~G~ 296 (383) T protein:vir:78 229 YAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKK--QYTSLNA----------NGV 296 (383) T ss_pred cccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhcc--chhccCC----------CCc Confidence 000111122111111 111 1111 1 1235667755444421 1111111 222 Q ss_pred eeeEe--eeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceee Q lcl|Aclame:pro 205 LGRIY--GYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGL 282 (392) Q Consensus 205 ig~~~--g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (392) ..... |..++.+..+|.+.......+.+....+.. .........-...............+ T Consensus 297 ~~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~-----------------~~i~~~~~~~f~~d~~~f~~~~r~dG 359 (383) T protein:vir:78 297 YVTALPFNLNIIESLFVPEKKAISYVAERYDALIGGP-----------------LDIGTYDQTLAIEDLNLYAAKQFAYG 359 (383) T ss_pred eeeecCCCceEEecCCCCcccEEEeeccceEEEeccc-----------------ceEEecchhhhhcCceEEEEEEEEcC Confidence 22333 334566666665443222211111111100 00000000000000000000000001 Q ss_pred eEEEeeccccceeeeeccceeeeeeecccccccce Q lcl|Aclame:pro 283 KVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANAT 317 (392) Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~ 317 (392) ........ ....+.....+ ..+.. T Consensus 360 ~~~~~~A~----~vl~~~~~~~~-------~~~~~ 383 (383) T protein:vir:78 360 KAKDDKAA----AVWTLNINPAE-------QTPEG 383 (383) T ss_pred EEecCCeE----EEEEEEecCCC-------CCCCC Confidence 01100000 00001111111 01000 No 184 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=82.62 E-value=0.075 Score=26.74 Aligned_cols=271 Identities=11% Similarity=0.031 Sum_probs=107.0 Q ss_pred Ccc--ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEecccee-eeccccccccCCCccccccccCc Q lcl|Aclame:pro 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSR-GHTRKLRGAGAERNLTVSDFTED 77 (392) Q Consensus 1 Man--~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) +.- -.+.|+.+.+++++.|++.-.+-+++++. . .. | ..+|++..... +.+.. . ..... .....+ T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~--~-~~---~-~~~i~~~~~~~~a~W~~---e--~~~~~-~~~~~~ 146 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N-AG---L-RLKFLKSETSGVAVWGK---I--YGEIK-GQLDAA 146 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeE--e-cC---c-ceEEEeecCCcceEEee---c--ccccc-cccCcc Confidence 221 25789999999999999998887777542 2 11 2 35666654322 22211 1 11111 111112 Q ss_pred eEEEEEEeeeecc-eEeeHHHHhhhccChHHHHHHHHHHHHHHHHHHHHHHH-Hhccccccc-------cccccc----- Q lcl|Aclame:pro 78 SFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDM-IVGAPYEAA-------GAVHEV----- 143 (392) Q Consensus 78 ~~~~~i~~~~~~~-~~i~d~~~~~~~~~~~~~~~~~~~~ala~~vd~~~~~~-~~~~~~~~~-------~~~~~~----- 143 (392) --+++|..++... +.++.+-+.++..++...+....+++++..+|+.++.- =...|.+.. ....+. T Consensus 147 f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~ 226 (381) T protein:vir:10 147 FSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKE 226 (381) T ss_pred ceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccc Confidence 2345555555544 46777777777788888889999999999999876521 011111100 000000 Q ss_pred --------cchhhHHHHHHHHHHhhhc----cC-C-CCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEe Q lcl|Aclame:pro 144 --------APDEFFKGVNGARRALNEL----YI-P-QGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIY 209 (392) Q Consensus 144 --------~~~~~~~~i~~a~~~l~~~----~v-p-~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~ 209 (392) .....++.+.+....+... .. + .+.+++++|..+..+.....+. +..|. .+.. --+ T Consensus 227 ~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~--~~~G~-~v~~-------lp~ 296 (381) T protein:vir:10 227 EQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL--NANGV-YVTA-------LPF 296 (381) T ss_pred ccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccC--CCCCc-eeec-------CCC Confidence 0111222222222222111 11 2 2457788998887775432221 11221 1110 113 Q ss_pred eeEEEEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeec Q lcl|Aclame:pro 210 GYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPN 289 (392) Q Consensus 210 g~~v~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (392) |..++.++.+|.+.......+.+....+.. .........-...............+...... T Consensus 297 g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~-----------------~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~- 358 (381) T protein:vir:10 297 NLNVIESTVQEAGKVLTYVKGLYDGYLAGG-----------------INVQKFKETLALDDMDLYTAKQFAYGKAKDNK- 358 (381) T ss_pred CceeEEcCCCCcCcEEEEEcccEEEEEecc-----------------cEEEeechhhhhcCceEEEEEEEEcCEEecCC- Confidence 556777777765432222111111111100 00000000000000000000000000000000 Q ss_pred cccceeeeeccceeeeeeecccccccceeeeeeccCeeE Q lcl|Aclame:pro 290 GVGFVRARKIHLIPGSIEVAPEAGANATITAAAGEDHTV 328 (392) Q Consensus 290 ~~~~~~~~~~~~~~~~v~v~~~~~~~~~~~~~~~~~~t~ 328 (392) ...+ -.+.+.+..+.... ...++ T Consensus 359 --------A~~v--~~l~~~~~~~~~~~------~~~~~ 381 (381) T protein:vir:10 359 --------VAAV--WKLDLKGHKPALED------TEETL 381 (381) T ss_pred --------cEEE--EEEeecCCcccccc------ccccC Confidence 0000 00011110110000 00111 No 185 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=80.15 E-value=0.098 Score=26.12 Aligned_cols=270 Identities=11% Similarity=0.045 Sum_probs=109.4 Q ss_pred Ccc-----ccccHHHHHHHHHHHHHHhhcccceeeecccccccCCCCCeEEEEeccce--eeeccccccccCCCcccccc Q lcl|Aclame:pro 1 MAN-----AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPS--RGHTRKLRGAGAERNLTVSD 73 (392) Q Consensus 1 Man-----~~~~~~~~~~~~~~~l~~~l~~~~~v~~~~~~~~~~~~Gdtv~i~~~~~~--~~~~~~~~~~~~~~~~~~~~ 73 (392) |.. -.+.|+... ++++.+.+...|..+++.- .... +.+..|+..+.. ........ ++.....-.. T Consensus 19 ~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi--~~~~---~~~~~i~~~g~~~~~~~g~~~~--~~~~~~~~~~ 90 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARID--NALK---SYEKDISRLSLVLDVGPGRDET--GQKLAPPEST 90 (315) T ss_pred cCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceee--eccc---cccccccccccCcccccccccc--cCcCCCCCCc Confidence 322 236798875 5788898888887776532 0000 122223222110 01111111 1111112223 Q ss_pred ccCceEEEEEEeeeecceEeeHHHHhhhcc--ChHHHHHHHHHHHHHHHHHHHHHHHHhc-------cccc----ccc-c Q lcl|Aclame:pro 74 FTEDSFPVTLTDVAYHLGVLTDEELTFDLE--SFATQILPRQVRGVADILEEGVRDMIVG-------APYE----AAG-A 139 (392) Q Consensus 74 ~~~~~~~~~i~~~~~~~~~i~d~~~~~~~~--~~~~~~~~~~~~ala~~vd~~~~~~~~~-------~~~~----~~~-~ 139 (392) +.-+..++...+. +..+.++++.+..... +|...+....+++++++.+..++.-=.. .+.+ ... . T Consensus 91 ~~f~~~~l~~~~l-~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~ 169 (315) T protein:vir:41 91 AEVKTNTLYMREM-VTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKL 169 (315) T ss_pred cccceeeeceeee-eeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccc Confidence 3345555555433 3446788888877764 8988899999999999988877632110 0000 000 0 Q ss_pred ----cccccchhhHHHHHHHHHHhhhccC--CCCCEEEEchHHHHHhhcccceeeeeccccceeeeEeeeeeeeEeeeEE Q lcl|Aclame:pro 140 ----VHEVAPDEFFKGVNGARRALNELYI--PQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEI 213 (392) Q Consensus 140 ----~~~~~~~~~~~~i~~a~~~l~~~~v--p~~r~~vv~~~~~~~l~~~~~~~~~~~~G~~~~~a~~~g~ig~~~g~~v 213 (392) ......+...+.+.++...|...-- .++-.++++.+....+.+-.. .+....+. ..+..|....+.|+.| T Consensus 170 ~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~-~~g~~lw~---~~~~~g~~~tl~G~PV 245 (315) T protein:vir:41 170 TESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALK-GRETGLGD---QALTGANSILYDGRPV 245 (315) T ss_pred cccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhc-cCCCcccc---chhhcCCCceecccce Confidence 0001111234455555555543211 123357889888877654211 01111222 2344566668899999 Q ss_pred EEecceeecccceeecccccccchhhhccccccccceeecccceeeeeeeccccceeeeecccccceeeeEEEeeccccc Q lcl|Aclame:pro 214 VESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGF 293 (392) Q Consensus 214 ~~s~~v~~~~~~~~~~~a~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (392) +....+|....... ...+..-.+. . .+.. ..+......+..............++... T Consensus 246 ~~~~~m~~~~~~~~---~ilf~d~~nl---------~-~~~~-~~i~i~~~~~a~~~~~~~~~~~r~d~~~~-------- 303 (315) T protein:vir:41 246 QYVPALEALNDGKS---RALFVVPTQL---------V-YGFW-RNIKVVPDYDAEMRLTKYVASLRTDNHYE-------- 303 (315) T ss_pred EecccccccCCCCc---cEEEecccce---------E-EEec-cccEEEeeecCCCCceEEEEEEEeceeEE-------- Confidence 88777764321100 0000000000 0 0000 00000000010000000000000000000 Q ss_pred eeeeeccceeeeeee Q lcl|Aclame:pro 294 VRARKIHLIPGSIEV 308 (392) Q Consensus 294 ~~~~~~~~~~~~v~v 308 (392) . ........+.| T Consensus 304 -~--~~~~a~~~~~v 315 (315) T protein:vir:41 304 -D--EEGAVSATITV 315 (315) T ss_pred -e--ccceeEeeeeC Confidence 0 00000001111 Done!